I have two CSV files, one of 2.5 GB and one of 450 MB. I am using datastore to load these files and need something like an "outer join" to merge them. Any ideas?
Answers (1)
Pratyush Swain
on 28 Mar 2025
Hi Rathies,
You can use the 'datastore' and 'outerjoin' functions together with tall arrays to achieve this. Here is an example implementation:
% Create datastore objects for both CSV files
ds1 = datastore('file1.csv');
ds2 = datastore('file2.csv');
% Convert to tall tables
t1 = tall(ds1);
t2 = tall(ds2);
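% Perform outer join. With no 'Keys' specified, the variables that have
% the same name in both tables are used as the key variables.
mergedTable = outerjoin(t1, t2, 'MergeKeys', true);
% mergedTable is an unevaluated tall table; call gather or write to run the join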
Datastores enable you to work with large data sets in small blocks that individually fit in memory, instead of loading the entire data set into memory at once. Tall arrays extend this capability to enable you to work with out-of-memory data using common functions.
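As a rough end-to-end sketch of the same idea, the snippet below builds two tiny CSV files in a temporary folder, joins them through tall tables, and then triggers evaluation with 'gather'. The file contents and the variable names 'id', 'a', and 'b' are made up for illustration and are not taken from the original files; it also assumes a MATLAB release in which 'outerjoin' accepts tall tables.

```matlab
% Build two small CSV files in a temporary folder.
% These are hypothetical stand-ins for the real 2.5 GB and 450 MB files.
folder = tempname;
mkdir(folder);
left  = table([1; 2; 3], [10; 20; 30],    'VariableNames', {'id', 'a'});
right = table([2; 3; 4], [200; 300; 400], 'VariableNames', {'id', 'b'});
writetable(left,  fullfile(folder, 'file1.csv'));
writetable(right, fullfile(folder, 'file2.csv'));

% Create datastores and wrap them in tall tables; no data is read yet.
t1 = tall(datastore(fullfile(folder, 'file1.csv')));
t2 = tall(datastore(fullfile(folder, 'file2.csv')));

% Outer join on an explicit key ('id' is an assumed column name), so
% rows without a match on either side are kept and padded with NaN.
joined = outerjoin(t1, t2, 'Keys', 'id', 'MergeKeys', true);

% Tall arrays are evaluated lazily. gather runs the computation and
% brings the result into memory, so use it only if the result fits.
result = gather(joined);
disp(result)
```

For the real files the joined result may not fit in memory. In that case it is probably better to save the tall table to disk with 'write' instead of calling 'gather'.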
For more information, please refer to the MATLAB documentation on datastores, tall arrays, and 'outerjoin'.