Import text file with blank lines. Matlab not replacing them with NaN

Question

0 votes

MATLAB does not replace the blank lines in my txt file with NaN; it just joins all the data together. Unfortunately I need the data in the exact order it appears, because each line corresponds to a unique timestamp, but the timestamps themselves are not in the txt file.

Any ideas? I tried importdata and textscan with no luck. Using R2014b.


Answer 1

per isakson on 29 Jan 2015

Edited: per isakson on 30 Jan 2015

1 vote

That leaves (at least) two possibilities:

a loop over fgetl
read the file as one string, replace empty lines by 'nan nan ... ' and parse with textscan
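
The first possibility, a loop over fgetl, could look roughly like the sketch below. The function name read_with_blank_rows and the ncols argument are made up for illustration, and the sketch assumes that every non-blank line holds exactly ncols numbers.

```matlab
function M = read_with_blank_rows(filename, ncols)
% Hypothetical helper (not from the original answer): read a numeric text
% file line by line and return one row per line, with an all-NaN row for
% every blank line. Assumes each non-blank line has exactly ncols numbers.
    fid = fopen(filename, 'r');
    if fid == -1
        error('Could not open %s', filename);
    end
    closer = onCleanup(@() fclose(fid));    % close the file even on error

    rows  = {};
    tline = fgetl(fid);
    while ischar(tline)                     % fgetl returns -1 at end of file
        if isempty(strtrim(tline))
            rows{end+1} = nan(1, ncols);    % blank line -> row of NaN
        else
            rows{end+1} = sscanf(tline, '%f').';   % numbers on this line
        end
        tline = fgetl(fid);
    end
    M = vertcat(rows{:});                   % one matrix row per file line
end
```

Called as M = read_with_blank_rows('cssm.txt', 4), this keeps the row order of the file, so row k of M still corresponds to line k and hence to the k-th timestamp.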

Example (R2013a)

    >> cac = cssm;
    >> cac{:}
    ans =
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1

where

    function cac = cssm
        str = fileread('cssm.txt');
        str = regexprep( str, '(?<=\r?\n)[ ]*(?=\r?\n)', 'nan nan nan nan');
        cac = textscan( str, '%f%f%f%f', 'CollectOutput', true );
    end

and where cssm.txt contains the following, with a blank third line

    16     2     3    13
     5    11    10     8

     9     7     6    12
     4    14    15     1

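Replace

    str = regexprep( str, '(?<=\r?\n)[ ]*(?=\r?\n)', 'nan nan nan nan');

by

    str = regexprep( str, '(?<=\r?\n)[ ]*(?=\r?\n)' ...
                  , 'nan nan nan nan', 'emptymatch' );

to handle completely empty lines as well. By default regexprep ignores zero-length matches, so without 'emptymatch' only blank lines that contain at least one space are replaced.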
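
The effect of 'emptymatch' can be checked on a small string that stands in for the file contents. This is only an illustrative sketch: the test string and the shortened replacement 'nan nan' are made up, and the pattern is reduced to plain \n line endings.

```matlab
% Two data lines separated by a completely empty line (no spaces).
str = sprintf('1 2\n\n3 4\n');
pat = '(?<=\n)[ ]*(?=\n)';

% Default ('noemptymatch'): the zero-length match is ignored,
% so the string comes back unchanged.
a = regexprep(str, pat, 'nan nan');

% With 'emptymatch' the zero-length match between the two newlines
% is replaced, which fills in the empty line.
b = regexprep(str, pat, 'nan nan', 'emptymatch');

disp(isequal(a, str))
disp(isequal(b, sprintf('1 2\nnan nan\n3 4\n')))
```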

per isakson on 29 Jan 2015

Edited: per isakson on 29 Jan 2015

Hi Cedric,

To illustrate what I mean, I have created three text files, cssm_0.txt, cssm_1.txt and cssm_2.txt, with zero, one and two empty lines at the end, respectively. The images are clips of the files in Notepad++.

With the expression

    '(?<=\r?\n|^)\s*?(?=\r?\n)'

I get the results below

    >> clear all,cac = cssm('cssm_0.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
    >> clear all,cac = cssm('cssm_1.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
       NaN   NaN   NaN   NaN
    >> clear all,cac = cssm('cssm_2.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
    >>

per isakson on 29 Jan 2015

\s* or \s*?

I can reproduce your example

    >> regexprep( 'ab', '(?<=a)b*(?=b)', 'z', 'emptymatch' )
    ans =
    azb

and the lazy ? doesn't hurt

    >> regexprep( 'ab', '(?<=a)b*?(?=b)', 'z', 'emptymatch' )
    ans =
    azb

However it doesn't work with the string from the text file. With the expression

    '(?<=\r?\n|^)\s*(?=\r?\n)'

I get

    >> clear all,cac = cssm('cssm_1.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13

and with the expression

    '(?<=\r\n|^)\s*(?=\r\n)'

I get

    >> clear all, clear classes,cac = cssm('cssm_1.txt');cac{:}
    ans =
       NaN   NaN   NaN   NaN
        16     2     3    13
         5    11    10     8
       NaN   NaN   NaN   NaN
       NaN   NaN   NaN   NaN
         9     7     6    12
         4    14    15     1
       NaN   NaN   NaN   NaN

The problem is with the "?" in \r?\n, I think. In this context the greedy "\s*" consumes the "\r" and the look-ahead is satisfied by the "\n" alone. With the lazy "\s*?" the "\r" is left to the look-ahead.

I used "\r?\n" in the first place to match both the Unix (LF) and the DOS/Windows (CRLF) style of new-line.
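
Since the trouble comes from the optional "\r", one way around it is to normalise the line endings before matching, so that the pattern only has to deal with "\n". This is a different technique from the expressions above, sketched under assumptions: the function name cssm_normalized is made up, the file is assumed to have four numeric columns, and blank lines are assumed to contain only spaces or tabs.

```matlab
function cac = cssm_normalized(filename)
% Hypothetical variant of cssm (not from the original thread): convert
% CRLF to LF first, then replace blank lines using a pattern without \r.
    str = fileread(filename);
    str = strrep(str, sprintf('\r\n'), sprintf('\n'));    % CRLF -> LF

    % A blank line is a run of spaces/tabs (possibly empty) that sits
    % between two newlines, or between the start of the text and a newline.
    str = regexprep(str, '(?<=\n|^)[ \t]*(?=\n)', ...
                    'nan nan nan nan', 'emptymatch');

    cac = textscan(str, '%f%f%f%f', 'CollectOutput', true);
end
```

Because "[ \t]*" cannot match "\r" or "\n", the greedy and the lazy quantifier should behave the same here.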

mashtine on 30 Jan 2015

Thanks a lot guys!!

per isakson on 30 Jan 2015

Hi Cedric,

You are right, "\r" is not needed in the "look behind". And possibly, it saves on execution time to exclude it.
