How to parse a file with value pairs

4 vues (au cours des 30 derniers jours)
Ruben Ruiloba
Ruben Ruiloba le 25 Août 2020
Modifié(e) : Stephen23 le 25 Août 2020
Hi was looking for ideas on how to best parse a text file which lines are in the following format
!FIX.BODY=xxxxx|2125=aaaaaa|1067=bbbbb|150=ccccc|329=ddddd|
the line is delimited by a '|' and it represents a value pair with the column name followed by an equal sign = followed by the value.
So in the example above I would like to end up with a table like this
FIX.BODY 2125 1067 150 329
___________________________________________
xxxxx aaaaaa bbbbb ccccc ddddd
  3 commentaires
Ruben Ruiloba
Ruben Ruiloba le 25 Août 2020
Hi thanks for the reply. Yes the file has multiple lines and the lines don't all have the same number of variable names.
The lines can differ. I can't upload the file as it has confidential data but it would look something like this.
!FIX.BODY=xxxxxx|2125=aaaaaa|1067=bbbbb|150=ccccc|329=dddddd|
!FIX.BODY=xxxxxx|2125=akcklsd|150=cchscc|329=ddddsd|
!FIX.BODY=xxxxxx|2125=ajfalkjfa|4555=ndlsuel|908=akncld|123=hdeudnc|
Thanks
Stephen23
Stephen23 le 25 Août 2020
Modifié(e) : Stephen23 le 25 Août 2020
"I can't upload the file as it has confidential data but it would look something like this."
A sample file does not have to contain confidential data in it, because you can write it with random, invented, made up data. However it should include all of the salient features of the actual data files, such as EOL character/s, file encoding (especially if this file is being generated from some other application), representative character strings, number encodings, etc.
Providing a file gives us an agreed reference with which we can test our code.
Not providing a file slows down you getting the solution you want.

Connectez-vous pour commenter.

Réponse acceptée

Stephen23
Stephen23 le 25 Août 2020
Modifié(e) : Stephen23 le 25 Août 2020
This works with the attached file (which I had to create myself):
T = table();
[fid,msg] = fopen('temp0.txt','rt');
assert(fid>=3,msg)
while ~feof(fid)
str = fgetl(fid);
spl = regexp(str,'[^!|=]+','match');
vnm = genvarname(spl(1:2:end)); % GENVARNAME is not required for R2019b or later
T{end+1,vnm} = spl(2:2:end); %#ok<SAGROW>
end
fclose(fid);
Giving:
>> T
T =
FIX0x2EBODY x2125 x1067 x150 x329 x4555 x908 x123
___________ ___________ _______ ________ ________ _________ ________ _________
'xxxxxx' 'aaaaaa' 'bbbbb' 'ccccc' 'dddddd' [] [] []
'xxxxxx' 'akcklsd' [] 'cchscc' 'ddddsd' [] [] []
'xxxxxx' 'ajfalkjfa' [] [] [] 'ndlsuel' 'akncld' 'hdeudnc'
  1 commentaire
Ruben Ruiloba
Ruben Ruiloba le 25 Août 2020
Thanks this has given me ideas and exactly what I was looking for. Thanks again.

Connectez-vous pour commenter.

Plus de réponses (0)

Catégories

En savoir plus sur Data Type Conversion dans Help Center et File Exchange

Produits


Version

R2019b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by