Effacer les filtres
Effacer les filtres

Import large .par file

12 vues (au cours des 30 derniers jours)
Carlo Zarattini
Carlo Zarattini le 24 Mai 2023
Réponse apportée : Saffan le 29 Mai 2023
I need to import a .par file with 2.5gb of dimension. The .par file is a numerical table with 10 columns and few millions of rows.
I tried to used the parquetread function but i got an error message ("Unable to read Parquet"). A reduced version of the parquet file, 500mb, is uploaded successfully. My conclusion is that the original .par file is too large.
Any suggestion on how to import large .par file?

Réponses (1)

Saffan
Saffan le 29 Mai 2023
Hi Carlo,
One way to solve this issue could be by splitting the original parquet file into multiple smaller files and create “parquetDatastore” of them. The data can then be obtained from the datastore using the “read” function. A datastore is a repository of files where each individual file can fit in the memory but the entire collection of files may not fit. They are used to work with the data incrementally and hence reduces the memory usage.
You can refer to the following link for more information on datastores: https://www.mathworks.com/help/releases/R2022b/matlab/ref/datastore.html

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by