Data File Formats
Data storage takes many forms. At smaller scales, we mostly work with individual data files.
Efficiency and Compression
Parquet
Parquet is fast, but with one caveat:
- Don’t use JSON or lists of JSON as columns. If such data is really needed, convert it to strings or binary objects first.
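The conversion mentioned above can be sketched as follows. This is a minimal illustration, not a prescribed method: the helper names `stringify_json_columns` and `write_parquet`, the column names, and the sample records are all hypothetical, and the write step assumes pandas with a Parquet engine such as pyarrow is installed.

```python
import json


def stringify_json_columns(records, json_columns):
    """Return copies of the records with the given columns serialized to JSON strings."""
    converted = []
    for record in records:
        row = dict(record)  # shallow copy so the input records are not modified
        for column in json_columns:
            if column in row and row[column] is not None:
                # sort_keys keeps the serialized text stable across runs
                row[column] = json.dumps(row[column], sort_keys=True)
        converted.append(row)
    return converted


def write_parquet(records, path):
    """Write flat records to Parquet (assumes pandas plus a Parquet engine)."""
    import pandas as pd  # imported lazily so the helper above needs only the stdlib

    pd.DataFrame(records).to_parquet(path, index=False)


# Hypothetical data: "payload" holds a JSON object, "tags" a list of JSON objects.
records = [
    {"id": 1, "payload": {"b": 2, "a": 1}, "tags": [{"name": "x"}]},
    {"id": 2, "payload": None, "tags": []},
]
flat = stringify_json_columns(records, ["payload", "tags"])
```

After this step every column holds a scalar or a string, so `write_parquet(flat, "data.parquet")` stores plain string columns, and readers can recover the nested values with `json.loads` when they need them.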
Planted:
by L Ma;
LM (2021). 'Data File Formats', Datumorphism, 02 April. Available at: https://datumorphism.leima.is/cards/machine-learning/datatypes/data-file-formats/.