17

If I have an HTML string such as:

<div><p>£20<span class="abc" /><span class="def">56</span></p></div>

And I want the text:

20<span class="abc" /><span class="def">56

How do I define a regular expression to match the target sections multiple times. So far I have:

str.match(/\d*<[^>]*>\d*/)

But this will only return the first number section 20<span class="abc" />

I need this to be flexible to match multiple tag / numeric sections while trimming anything leading or trailing the first / last digit in the string.

0

3 Answers 3

10

Adding /g isn't enough if you wish to match multiple occurrences of a substring. If that's the case, reluctant quantifiers may be used as described herein.

Given the string:

<div><p>£20<span class="abc" /><span class="def">56</span></p></div>

You will arrive at the text you wanted using:

\d+.*>\d+

But given the same string repeated two times:

<div><p>£20<span class="abc" /><span class="def">56</span></p></div><div><p>£20<span class="abc" /><span class="def">56</span></p></div>

You will not find the target selection multiple times. You'll only find it once due to the greedy nature of .*. To make .* non-greedy, or reluctant, simply add a ? after the * and you will arrive at:

\d+.*?>\d+

Which will find both occurrences of the substring you asked for as shown here.

8

To match multiple times use to need use the global option

str.match(/your_expression_here/g)
                                ^
0
1

Just allow the group to be repeated: (?:...)+ means "Match ... 1 or more times:

str.match(/\d+(?:<[^>]*>)+\d+/)

As per Alan Moore's suggestion, I've also changed the \d* into \d+, making the numbers required instead of optional.

1
  • While you're at it, change each \d* to \d+; that's got to be a mistake.
    – Alan Moore
    Commented Aug 13, 2011 at 2:38

Not the answer you're looking for? Browse other questions tagged or ask your own question.