Very ineficient regular expression match

Mauricio · 7 August 2002 00:11

Hi,

The following code is supposed to print multiple regular expression

matches on a given string:

class Myregexp < Regexp
def each (str)
pos = 0
while ((pos<str.size) && (m = match(str[pos…str.size])))
yield m
pos += m.end(0)
end
end
end

Myregexp.new(/pattern/).each(“string”){|m| $stdout << m[0]}

It works. However, it takes a lot of time and memory when "string" is

big. In my case, with a 10Mb string, it takes 40Mb of RAM. The problem comes
from the str[pos…str.size] (I solved the problem for my situation with a
hack where I changed it for str[pos,300], and everything runs fast and with
low memory). Since Ruby uses garbage collection and strings are, I believe,
copy on write, why does this occur?
I’m using Pragmatic Programmers Ruby 1.66 with Windows XP.

Thanks,
Maurício

Joel_VanderWerf1 · 7 August 2002 04:35

Maurício wrote:

Hi,

The following code is supposed to print multiple regular expression

matches on a given string:

Why not use scan?

“foo bar baz”.scan(/\w\w\w/) { |str| puts str }

   # ==> foo
   #     bar
   #     baz

Mauricio_Antunes · 7 August 2002 11:10

You’re right, I didn’t know it.
Anyway, just for educational purposes, I’m still curious about why my
class didn’t work well.

Thanks,
Maurício

···

----- Original Message -----
From: “Joel VanderWerf” vjoel@PATH.Berkeley.EDU
Newsgroups: gmane.comp.lang.ruby.general
Sent: Wednesday, August 07, 2002 1:35 AM
Subject: Re: Very ineficient regular expression match

Maurício wrote:
Hi,
The following code is supposed to print multiple regular expression
matches on a given string:
Why not use scan?

“foo bar baz”.scan(/\w\w\w/) { |str| puts str }
   # ==> foo
   #     bar
   #     baz

Nobuyoshi_Nakada · 7 August 2002 11:34

Hi,

···

At Wed, 7 Aug 2002 20:10:14 +0900, Maurício Antunes wrote:

Anyway, just for educational purposes, I'm still curious about why my

class didn’t work well.

See the thread from [ruby-talk:42596].

–
Nobu Nakada

Topic		Replies	Views
Printing regex results ruby-talk	9	114	5 March 2011
Problem with getting RegEx to work ruby-talk	3	154	17 September 2008
Do You Understand Regular Expressions? ruby-talk	19	112	22 June 2007
Simple regexp question ruby-talk	0	64	26 October 2005
Stumped: How do I iterate over regular expression matches? ruby-talk	5	119	27 April 2006

Very ineficient regular expression match

Related topics