Info

Cette question est clôturée. Rouvrir pour modifier ou répondre.

Regular expressions, words in a pattern

2 vues (au cours des 30 derniers jours)
bububu
bububu le 20 Nov 2013
Clôturé : MATLAB Answer Bot le 20 Août 2021
I am trying to generate a pattern including the following items:
do not run
does not run
has not run
have not run
and here comes the problem...
have not recently run
or
do not go out or run
so, what I am using is the following: '((do(es)|ha(s|ve)|did|are)\s?no(t)\s?(\w+)\s?run*)' trying to capture with '(\w+)\s' the fact that other words could be in between, but it does not work with words in between. The text is a string... Thanks in advance

Réponses (3)

Simon
Simon le 20 Nov 2013
Hi!
"\w" does not match spaces!
  3 commentaires
Simon
Simon le 20 Nov 2013
It does in my test:
str = 'have not recently run';
pattern = '((do(es)|ha(s|ve)|did|are)\s?no(t)\s?(\w+)\s?run*)';
[a,b] = regexp(str, pattern, 'start', 'tokens');
Do you have more than one space in front of or behind "recently"? Then "\s?" doesn't match.
You may also remove the outer parentheses around your pattern to get all matches like:
str = 'have not recently run';
pattern = '(do(es)|ha(s|ve)|did|are)\s?no(t)\s?(\w+)\s?run*';
[a,b] = regexp(str, pattern, 'start', 'tokens');
b{:}
bububu
bububu le 20 Nov 2013
THANKS A LOT! IT WAS USEFUL!

Image Analyst
Image Analyst le 20 Nov 2013
I'm very confused by what you want to do - I have no idea if you want to generate patterns from combinations of those phrases, or if you want to extract words or phrases in that list from larger sentences. Anyway (whatever you want to do), I think that either ismember () or allwords may be useful to you.

bububu
bububu le 20 Nov 2013
I am looking for a certain pattern within a text. The pattern includes: i) do not run, ii) does not run, iii) has not run, iv) have not run or v) have not recently run. However, when I have more than one word between NOT and RUN, as in vi) do not go out or run, my pattern: pattern =' ((do(es)|ha(s|ve)|did|are)\s?no(t)\s?((\w+)\s|\s)?run*)'; does not find vi) do not go out or run with regexp. I would like to have not only characters between NOT and RUN, but also words... such that it can also find "do NOT go out or RUN"

Cette question est clôturée.

Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by