I have a 2 csv files one with 2.5gb and one with 450mb. I am using datastore to upload these function and need something like "outer join" to merge them. Any ideas?

11 vues (au cours des 30 derniers jours)
I have a 2 csv files one with 2.5gb and one with 450mb. I am using datastore to upload these function and need something like "outer join" to merge them. Any ideas?

Réponses (1)

Pratyush Swain
Pratyush Swain le 28 Mar 2025
Hi Rathies,
You can utilise the 'datastore' and 'outerjoin' functions to achieve this. Please refer to this example implementation:
% Create datastore objects for both CSV files
ds1 = datastore('file1.csv');
ds2 = datastore('file2.csv');
% Convert to tall tables
t1 = tall(ds1);
t2 = tall(ds2);
% Perform outer join
mergedTable = outerjoin(t1, t2, 'MergeKeys', true);
Datastores enable you to work with large data sets in small blocks that individually fit in memory, instead of loading the entire data set into memory at once. Tall arrays extend this capability to enable you to work with out-of-memory data using common functions.
For more information, please refer to following documentation links:

Catégories

En savoir plus sur Big Data Processing dans Help Center et File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by