
Closed
Open
Created May 18, 2020 by Mykhailo Dalchenko@mdalchen

Unpacked data storage

Summary

Currently the unpacker just creates a pandas.DataFrame in memory. This data has to be compressed and stored on disk. Possible options are:

  • ROOT with lz compression
  • HDF5
  • Parquet
  • other...

Although we initially considered storing the events in ROOT format, I don't think it makes much sense, as the data would have to be converted back to a pandas.DataFrame for subsequent analysis.
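To make the comparison concrete, here is a minimal sketch of how the pandas-native options could be tried side by side. It assumes a stand-in DataFrame (the column names `event`, `vfat` and `channel` are hypothetical, not the actual unpacker schema), and it assumes the optional backends are available: `pyarrow` or `fastparquet` for Parquet, `tables` (PyTables) for HDF5. Formats whose backend is missing are skipped. The compression settings are arbitrary illustrations, not tuned recommendations.

```python
import os
import tempfile

import numpy as np
import pandas as pd


def make_example_frame(n_events=1000, seed=0):
    """Build a stand-in for the unpacker output (column names are hypothetical)."""
    rng = np.random.default_rng(seed)
    return pd.DataFrame({
        "event": np.arange(n_events, dtype=np.int64),
        "vfat": rng.integers(0, 24, size=n_events).astype(np.int64),
        "channel": rng.integers(0, 128, size=n_events).astype(np.int64),
    })


def roundtrip_sizes(df, directory):
    """Write df in each available format, read it back, and report the result.

    Returns {format_name: (file_size_in_bytes, restored_equals_original)}.
    Formats whose optional dependency is not installed are left out.
    """
    formats = {
        "parquet": (
            lambda path: df.to_parquet(path, compression="gzip"),
            lambda path: pd.read_parquet(path),
        ),
        "hdf5": (
            lambda path: df.to_hdf(path, key="events", mode="w",
                                   complevel=9, complib="zlib"),
            lambda path: pd.read_hdf(path, key="events"),
        ),
    }
    results = {}
    for name, (write, read) in formats.items():
        path = os.path.join(directory, "events." + name)
        try:
            write(path)
        except ImportError:
            # Optional backend (pyarrow/fastparquet or PyTables) is missing.
            continue
        restored = read(path)
        results[name] = (os.path.getsize(path), restored.equals(df))
    return results


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        for name, (size, same) in roundtrip_sizes(make_example_frame(), tmp).items():
            print(name, size, "bytes, round-trip identical:", same)
```

Both formats read straight back into a pandas.DataFrame, which is the property that matters for the subsequent analysis. File sizes from a toy frame like this one say little about real unpacked events, so any decision should be based on a run over actual data.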

An interesting overview, together with performance tests, can be found here

Please provide your opinion and/or present other options.

Edited May 18, 2020 by Mykhailo Dalchenko