Unable to read the last line by using regexp

1 vue (au cours des 30 derniers jours)
Rui Zhang
Rui Zhang le 1 Août 2019
Commenté : Walter Roberson le 2 Août 2019
I used the following code to read text line by line. But the result excludes the last line. For example, if my text is:
void myfunc1()
{
// body;
}
by using the following code:
matchedText = regexp(myText,'(?<=\n.*?(\r)\n','match');
The matchedText is:
matchedText = {'void myfunc1()','{','','//body',''};
How can I get the last line of the text? -- '}' is missing.
Thanks.
  7 commentaires
Guillaume
Guillaume le 2 Août 2019
Again, the regexp that you've written in the question is invalid.
>> myText = sprintf('\nvoid myfunc1()\r\n{\r\n\t// body\r\n}')
myText =
'
void myfunc1()
{
// body
}'
>> matchedText = regexp(myText,'(?<=\n.*?(\r)\n','match') %exact copy/paste of what you wrote in the question
matchedText =
0×0 empty cell array
The regexp is missing a closing bracket. Maybe you meant (notice the ) after the first \n)
>> matchedText = regexp(myText,'(?<=\n).*?(\r)\n','match')
matchedText =
1×3 cell array
{'void myfunc1()←↵'} {'{←↵'} {'→// body←↵'}
You'll notice that:
  • I had to start the text with a \n because of the look-behind
  • you capture all the line return\line feed
Again, there are much simpler ways to split text into lines.
Rui Zhang
Rui Zhang le 2 Août 2019
Oh, I missed a ')' in my post. You are right.
My actual code in my file is:
%% read a line
searchPattern = '(?<=\n).*?(\r)\n';
matchedText = regexp(mytext,searchPattern,'match');
Thank you very much for your help!

Connectez-vous pour commenter.

Réponse acceptée

Guillaume
Guillaume le 2 Août 2019
As I said, your regular expression is invalid, and even corrected doesn't work very well. I also suspect that the (\r) was meant to be (\r)?
Anyway, if the goal is just to split lines:
>> myText = sprintf('void myfunc1()\r\n{\r\n\t// body\r\n}') %demo data
myText =
'void myfunc1()
{
// body
}'
>> matchedText = regexp(myText, '\r?\n', 'split')
matchedText =
1×4 cell array
{'void myfunc1()'} {'{'} {'→// body'} {'}'}
  1 commentaire
Walter Roberson
Walter Roberson le 2 Août 2019
Note that when you use split to split into lines that it is common to end up with an empty character vector as the last element. That shows up when the file does end in newline, because it splits at the newline and then there does not happen to be anything more in the string.

Connectez-vous pour commenter.

Plus de réponses (0)

Catégories

En savoir plus sur Characters and Strings dans Help Center et File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by