@@ -6,7 +6,11 @@ Mainly used as dependance of `pyjapcscout`, but it can be used for other purpose
## Purpose of this project
The idea is to provide a few sweet functions to go from a nested `dict` of `numpy` arrays to `parquet` (and to `pickle`, and `json`) and come back **preserving** the data types (but for `json`, for which no coming back is impelmented here!). The aspect related to data types preservation is important for the roud-trip of meachine parameter reading, saving and settings.
This package is meant to be simple enought and with very little dependencies to allow for *home* data analysis without the needs of *CERN TN Network* or *Java* libraries.
This package is meant to be simple enough and with very little dependencies to allow for *home* data analysis without the needs of *CERN TN Network* or *Java* libraries.
The basic data unit (or dataset) is assumed to be a (nested) **dictionary** of **numpy values** and **numpy arrays**.
**Lists** are in principle not allowed (at least not supported) inside a dataset.
On the other hand, **lists** might be used to define a list of datasets (e.g. a list of consecutive acquisitions of a accelerator data).
@@ -9,6 +9,10 @@ The main purpose is to use it as dependance of `pyjapcscout` in the control room
acquired form the control system as parquet files, and then on the user's "GPN" computer for data
analysis without the need of JAVA or other dependances needed to interact with the control system.
The basic data unit (or dataset) is assumed to be a (nested) **dictionary** of **numpy values** and **numpy arrays**.
**Lists** are in principle not allowed (at least not supported) inside a dataset.
On the other hand, **lists** might be used to define a list of datasets (e.g. a list of consecutive acquisitions of a accelerator data).
This package provides the following (main) functions. Note that many of those functions are simple wrappers of external functions (from `pandas`, `pyarrow`, `awkward`), but sometimes with some twiks to make sure data type/shape is somewhat always preserved.
- `dict_to_pandas(input_dict)`: Creates a `pandas` dataframe from a (list of) `dict`.