Regexp help needed

Roy_Pardee · 27 April 2007 21:50

Hey All,

I need to parse lines that look like this:

<lines>
1 'Not qualified' 2 'Overquota' 3 'Qualified'/
1 'SSI' 2 'Mall Facility'/
1 'Real Interview' 2 'Practice Interview'/
</lines>

So I've got N sets of <<digits>><<space>><<quote-delimited-label>> on
each line. I want to grab each of the digits & labels, but I'm having
trouble w/the repetition stuff. Below is a simple script that doesn't
work--it grabs the first set, but seems to ignore the rest.

str = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"
rgx = /((\d+)\s+(\'[^']+?\'))+/i
m = rgx.match(str)
unless m.nil?
m.captures.each {|c| puts(c)}
end

Can anybody throw me a regex clue here?

Thanks!

- Roy

Aaron_Patterson1 · 27 April 2007 22:09

Hey Roy,

···

On Sat, Apr 28, 2007 at 06:50:10AM +0900, rpardee@gmail.com wrote:

Hey All,

I need to parse lines that look like this:

<lines>
1 'Not qualified' 2 'Overquota' 3 'Qualified'/
1 'SSI' 2 'Mall Facility'/
1 'Real Interview' 2 'Practice Interview'/
</lines>

So I've got N sets of <<digits>><<space>><<quote-delimited-label>> on
each line. I want to grab each of the digits & labels, but I'm having
trouble w/the repetition stuff. Below is a simple script that doesn't
work--it grabs the first set, but seems to ignore the rest.

str = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"
rgx = /((\d+)\s+(\'[^']+?\'))+/i
m = rgx.match(str)
unless m.nil?
m.captures.each {|c| puts(c)}
end

Can anybody throw me a regex clue here?

You might want to try the scan method:

  str = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"

  str.scan(/(\d+)\s+('[^']+')+/).each do |match|
    p match
  end

Hope that helps!

--
Aaron Patterson
http://tenderlovemaking.com/

Duane_Johnson1 · 27 April 2007 22:11

Perhaps a little something like this:

str = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"
rgx = /((\d+)\s+(\'[^']+?\'))+/i
m = str.scan(rgx)
m.each do |match|
puts match.first
end

Duane Johnson
(canadaduane)

···

On 4/27/07, rpardee@gmail.com <rpardee@gmail.com> wrote:

Hey All,

I need to parse lines that look like this:

<lines>
1 'Not qualified' 2 'Overquota' 3 'Qualified'/
1 'SSI' 2 'Mall Facility'/
1 'Real Interview' 2 'Practice Interview'/
</lines>

So I've got N sets of <<digits>><<space>><<quote-delimited-label>> on
each line. I want to grab each of the digits & labels, but I'm having
trouble w/the repetition stuff. Below is a simple script that doesn't
work--it grabs the first set, but seems to ignore the rest.

str = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"
rgx = /((\d+)\s+(\'[^']+?\'))+/i
m = rgx.match(str)
unless m.nil?
m.captures.each {|c| puts(c)}
end

Can anybody throw me a regex clue here?

Thanks!

- Roy

--
Duane Johnson
(canadaduane)

Tim_Pease · 27 April 2007 22:15

r = %r/(\d+)\s+('[^']*')/
s = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"

s.scan(r) #=> [["1", "'Not qualified'"], ["2",
"'Overquota'"], ["3", "'Qualified'"]]

Blessings,
TwP

···

On 4/27/07, rpardee@gmail.com <rpardee@gmail.com> wrote:

Hey All,

I need to parse lines that look like this:

<lines>
1 'Not qualified' 2 'Overquota' 3 'Qualified'/
1 'SSI' 2 'Mall Facility'/
1 'Real Interview' 2 'Practice Interview'/
</lines>

So I've got N sets of <<digits>><<space>><<quote-delimited-label>> on
each line. I want to grab each of the digits & labels, but I'm having
trouble w/the repetition stuff. Below is a simple script that doesn't
work--it grabs the first set, but seems to ignore the rest.

str = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"
rgx = /((\d+)\s+(\'[^']+?\'))+/i
m = rgx.match(str)
unless m.nil?
m.captures.each {|c| puts(c)}
end

Roy_Pardee · 27 April 2007 22:25

Ah--that's the magic! Thanks guys!

-Roy

···

On Apr 27, 3:15 pm, "Tim Pease" <tim.pe...@gmail.com> wrote:

On 4/27/07, rpar...@gmail.com <rpar...@gmail.com> wrote:

> Hey All,

> I need to parse lines that look like this:

> <lines>
> 1 'Not qualified' 2 'Overquota' 3 'Qualified'/
> 1 'SSI' 2 'Mall Facility'/
> 1 'Real Interview' 2 'Practice Interview'/
> </lines>

> So I've got N sets of <<digits>><<space>><<quote-delimited-label>> on
> each line. I want to grab each of the digits & labels, but I'm having
> trouble w/the repetition stuff. Below is a simple script that doesn't
> work--it grabs the first set, but seems to ignore the rest.

> str = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"
> rgx = /((\d+)\s+(\'[^']+?\'))+/i
> m = rgx.match(str)
> unless m.nil?
> m.captures.each {|c| puts(c)}
> end

r = %r/(\d+)\s+('[^']*')/
s = "1 'Not qualified' 2 'Overquota' 3 'Qualified'"

s.scan(r) #=> [["1", "'Not qualified'"], ["2",
"'Overquota'"], ["3", "'Qualified'"]]

Blessings,
TwP

Topic		Replies	Views
Can't find appropriate regexp ruby-talk	16	83	24 June 2003
RegExp problem ruby-talk	2	90	17 May 2007
Regexp issue on parsing from file ruby-talk	10	135	15 August 2009
Regexp help needed ruby-talk	3	101	2 May 2008
Regexp Parsing -- What's the right way? ruby-talk	6	128	12 August 2006

Regexp help needed

Related topics