How to read\open a csv file with millions of rows and hundreds of coloumns to compare/delete and save

1 vue (au cours des 30 derniers jours)
Hi
i have csv files with millions of rows and hundreds of coloumns that i want to open\read in order to compare the files, remove duplicates and save the new file as csv also, and many other modifications..
when i used csvreader the PC stuck! so any help here
  9 commentaires
Shayma
Shayma le 21 Sep 2016
Modifié(e) : Shayma le 21 Sep 2016
o.k it have been a long time now, but i got the R2014b,i tried to use datastore and it works, at least it opens the first chunk thank you :)
Shayma
Shayma le 22 Sep 2016
How long it suppose to take reading 100000 lines each chunk from 13.5 GB files ?

Connectez-vous pour commenter.

Réponses (1)

George
George le 12 Août 2016
You can do this with textscan, but your formatSpec is going to be pretty gnarly.
fid = fopen('data.csv');
% your formatSpec will be very long because of the number of fields
formatSpec = '%s %s %f %d'; % reads a string, a string, a float, an integer
A = textscan(fid, formatSpec, 'HeaderLines', 1, 'Delimiter', ',');
fclose(fid);
csvread

Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by