Ruby regex lookarounds?

7stud1 · 14 March 2007 07:54

In Programming Ruby(1st Ed), it says:

···

--------
The match operators return the character position at which the match
occurred. They also have the side effect of setting a whole load of Ruby
variables. $& receives the part of the string that was matched by the
pattern, $` receives the part of the string that preceded the match, and
$' receives the string after the match...

The match also sets the thread-global variables $~ and $1 through $9.
The variable $~ is a MatchData object (described beginning on page 336)
that holds everything you might want to know about the match. $1 and so
on hold the values of parts of the match.
-------
Here is an example:

irb(main):009:0> "abc" =~ /(a)(b)/
=> 0
irb(main):010:0> $&
=> "ab"
irb(main):011:0> $1
=> "a"
irb(main):012:0> $2
=> "b"

Now and example with a lookbehind:

irb(main):013:0> "abc" =~ /(?:ab)(c)/
=> 0
irb(main):014:0> $&
=> "abc"
irb(main):015:0> $1
=> "c"

Isn't that output inconsistent? If Ruby wants to say that the regex
matches the whole string, then shouldn't $1 be 'ab'. I know that
lookarounds shouldn't be considered part of the match--they are just
assertions, but Ruby's $& variable doesn't seem to respect that.

--
Posted via http://www.ruby-forum.com/.

7stud1 · 14 March 2007 08:35

Isn't that output inconsistent? If Ruby wants to say that the regex
matches the whole string, then shouldn't $1 be 'ab'. I know that
lookarounds shouldn't be considered part of the match--they are just
assertions, but Ruby's $& variable doesn't seem to respect that.

I see what's going on. (?:ab) is not a grouping: (?: and ) do not form a
grouping that gets a $ variable. If I write it as (?:(ab)), then ab is
a grouping:

irb(main):021:0> "abc" =~ /(?:(ab))(c)/
=> 0
irb(main):022:0> $&
=> "abc"
irb(main):023:0> $1
=> "ab"
irb(main):024:0> $2
=> "c"
irb(main):025:0>

It still seems a little strange that $& contains the lookaround.

···

--
Posted via http://www.ruby-forum.com/\.

come · 14 March 2007 12:30

Hi,

The syntax (?:re) isn't a lookbehind. It is a grouping form like (re)
but without capture. So (?:(re)) is the same as (re). Off course, you
could write something like (?:a(bc)), and you will get "bc" in $1, and
not "abc".

···

On 14 mar, 09:35, 7stud 7stud <dol...@excite.com> wrote:

I see what's going on. (?:ab) is not a grouping: (?: and ) do not form a
grouping that gets a $ variable. If I write it as (?:(ab)), then ab is
a grouping:

irb(main):021:0> "abc" =~ /(?:(ab))(c)/
=> 0
irb(main):022:0> $&
=> "abc"
irb(main):023:0> $1
=> "ab"
irb(main):024:0> $2
=> "c"
irb(main):025:0>

It still seems a little strange that $& contains the lookaround.

--
Posted viahttp://www.ruby-forum.com/.

James_Edward_Gray_II · 14 March 2007 12:52

Exactly. And just to make this description complete, Ruby 1.8 does not support lookbehind.

James Edward Gray II

···

On Mar 14, 2007, at 7:30 AM, come wrote:

The syntax (?:re) isn't a lookbehind.

7stud1 · 18 March 2007 10:42

come wrote:

Hi,

The syntax (?:re) isn't a lookbehind. It is a grouping form like (re)
but without capture.

I don't understand the distinction.

···

--
Posted via http://www.ruby-forum.com/\.

Robert_K1 · 18 March 2007 11:30

Lookahead and -behind do not consume characters while non capturing groups do.

robert

···

On 18.03.2007 11:42, 7stud 7stud wrote:

come wrote:

The syntax (?:re) isn't a lookbehind. It is a grouping form like (re)
but without capture.

I don't understand the distinction.

Michael_Bevilacqua-L · 19 March 2007 14:14

Non-capturing grouping is exactly what it sounds like, a group that doesn't
capture whatever it matches, but still consumes. So..

#non capturing group
irb(main):009:0> "1234" =~ /(?:1234)/
=> 0
irb(main):010:0> $1
=> nil

#capturing group
irb(main):011:0> "1234" =~ /(1234)/
=> 0
irb(main):012:0> $1
=> "1234"

Lookahead and lookbehind peek ahead or behind without consuming what they
match.

MBL

···

On 3/18/07, 7stud 7stud < dolgun@excite.com> wrote:

come wrote:
> Hi,
>
> The syntax (?:re) isn't a lookbehind. It is a grouping form like (re)
> but without capture.

I don't understand the distinction.

--
Posted via http://www.ruby-forum.com/\.

Topic		Replies	Views
Working with regular expressions and "$"? ruby-talk	3	132	23 November 2007
Match data variables ruby-talk	5	94	11 June 2009
Regular expression: Is this a bug or feature? ruby-talk	1	121	23 January 2007
Problem modifying captured regexp results ruby-talk	1	106	20 October 2006
Scope of regex match $variables ruby-talk	5	108	1 December 2002

Ruby regex lookarounds?

Related topics