Scanning a String

Hello,
How do I scan a string for a pattern and process it when found? For
example:

for all occurrences of two vowel letters in succession, insert the
apostrophe between the vowels.

Thus:

breakground > bre'akgro'und

thank you for your help!
basi

some_string.gsub(/([aeiou])([aeiou])/) {|match| $1 + "'" + $2}

···

On 4/27/05, basi <basiibarra@yahoo.com> wrote:

Hello,
How do I scan a string for a pattern and process it when found? For
example:

for all occurrences of two vowel letters in succession, insert the
apostrophe between the vowels.

Hi --

Hello,
How do I scan a string for a pattern and process it when found? For
example:

for all occurrences of two vowel letters in succession, insert the
apostrophe between the vowels.

some_string.gsub(/([aeiou])([aeiou])/) {|match| $1 + "'" + $2}

(You don't really need the block arg (match) :slight_smile:

Just to add a possible variant: depending how you want to handle three
or more vowels in a row, you might want to do:

   pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'#{$2}" }

which will result in: pi'o'us

(I thought there was a [[:vowel:]] POSIX character class but
apparently not.)

David

···

On Thu, 28 Apr 2005, Nicholas Seckar wrote:

On 4/27/05, basi <basiibarra@yahoo.com> wrote:

--
David A. Black
dblack@wobblini.net

David,
Yes, three vowels in a row should be handled as you indicated.
Thank you, and to Nicholas Seckar as well, for the quick reply.

I think I should finally bite the bullet and learn regex.

Basi

(You don't really need the block arg (match) :slight_smile:

Good point :slight_smile:

   pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'#{$2}" }

We could also do away with the block altogether and use
"pious".gsub(/([aeiou])(?=[aeiou])/, '\1\'\2')

Although I don't know if one approach is preferred to the other.

···

On 4/27/05, David A. Black <dblack@wobblini.net> wrote:

I managed to omit the " before pious... and also, I didn't actually
need #{$2} in there. In fact, $2 is nil (because ?= isn't captured).
So....

    "pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'" }

David

···

On Thu, 28 Apr 2005, David A. Black wrote:

Just to add a possible variant: depending how you want to handle three
or more vowels in a row, you might want to do:

pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'#{$2}" }

which will result in: pi'o'us

--
David A. Black
dblack@wobblini.net

"Nicholas Seckar" <nseckar@gmail.com> schrieb im Newsbeitrag
news:6bfb9c8a050427193967475977@mail.gmail.com...

···

On 4/27/05, David A. Black <dblack@wobblini.net> wrote:
> (You don't really need the block arg (match) :slight_smile:

Good point :slight_smile:

> pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'#{$2}" }

We could also do away with the block altogether and use
"pious".gsub(/([aeiou])(?=[aeiou])/, '\1\'\2')

Although I don't know if one approach is preferred to the other.

Yes, it's preferred because it's faster. Although your variant works, I
usually prefer to put the correct number of backslashes in there:

"pious".gsub(/([aeiou])(?=[aeiou])/, '\\1\'\\2')

Kind regards

    robert

Hi --

···

On Thu, 28 Apr 2005, Robert Klemme wrote:

"Nicholas Seckar" <nseckar@gmail.com> schrieb im Newsbeitrag
news:6bfb9c8a050427193967475977@mail.gmail.com...

On 4/27/05, David A. Black <dblack@wobblini.net> wrote:

(You don't really need the block arg (match) :slight_smile:

Good point :slight_smile:

   pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'#{$2}" }

We could also do away with the block altogether and use
"pious".gsub(/([aeiou])(?=[aeiou])/, '\1\'\2')

Although I don't know if one approach is preferred to the other.

Yes, it's preferred because it's faster. Although your variant works, I
usually prefer to put the correct number of backslashes in there:

"pious".gsub(/([aeiou])(?=[aeiou])/, '\\1\'\\2')

Why do you consider that more correct?

David

--
David A. Black
dblack@wobblini.net

"Nicholas Seckar" <nseckar@gmail.com> schrieb im Newsbeitrag
news:6bfb9c8a050427193967475977@mail.gmail.com...

> (You don't really need the block arg (match) :slight_smile:

Good point :slight_smile:

> pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'#{$2}" }

We could also do away with the block altogether and use
"pious".gsub(/([aeiou])(?=[aeiou])/, '\1\'\2')

Although I don't know if one approach is preferred to the other.

Yes, it's preferred because it's faster. Although your variant works, I
usually prefer to put the correct number of backslashes in there:

"pious".gsub(/([aeiou])(?=[aeiou])/, '\\1\'\\2')

Single quotes, herr Klemme! '\1\'\2' works fine.

Kind regards

   robert

E

···

Le 28/4/2005, "Robert Klemme" <bob.news@gmx.net> a écrit:

On 4/27/05, David A. Black <dblack@wobblini.net> wrote:

--
template<typename duck>
void quack(duck& d) { d.quack(); }

"David A. Black" <dblack@wobblini.net> schrieb im Newsbeitrag
news:Pine.LNX.4.61.0504280320190.13336@wobblini...

Hi --

>
> "Nicholas Seckar" <nseckar@gmail.com> schrieb im Newsbeitrag
> news:6bfb9c8a050427193967475977@mail.gmail.com...
>>> (You don't really need the block arg (match) :slight_smile:
>>
>> Good point :slight_smile:
>>
>>> pious".gsub(/([aeiou])(?=[aeiou])/) { "#{$1}'#{$2}" }
>>
>> We could also do away with the block altogether and use
>> "pious".gsub(/([aeiou])(?=[aeiou])/, '\1\'\2')
>>
>> Although I don't know if one approach is preferred to the other.
>
> Yes, it's preferred because it's faster. Although your variant works,

I

> usually prefer to put the correct number of backslashes in there:
>
> "pious".gsub(/([aeiou])(?=[aeiou])/, '\\1\'\\2')

Why do you consider that more correct?

Although both produce the same:

p '\1'

"\\1"
=> nil

p '\\1'

"\\1"
=> nil

because Ruby is so kind to take "\1" literally (i.e not using the
backslash as escaping char because "\1" is not an escaping sequence). I
prefer to explicitely escape the backslash and put the "1" in there
literally. IMHO it's more fail safe when changes are done (especially
when changing single quotes to double quotes, see below).

Consider also

p "\n"

"\n"
=> nil

p "\\n"

"\\n"
=> nil

p "\1"

"\001"
=> nil

p "\\1"

"\\1"
=> nil

Might be overly cautious but this is the kind of error that bites you and
if you're not lucky it can take quite some time to figure what's going on
here.

Kind regards

    robert

···

On Thu, 28 Apr 2005, Robert Klemme wrote:
>> On 4/27/05, David A. Black <dblack@wobblini.net> wrote:

Hi --

···

On Thu, 28 Apr 2005, Robert Klemme wrote:

"David A. Black" <dblack@wobblini.net> schrieb im Newsbeitrag
news:Pine.LNX.4.61.0504280320190.13336@wobblini...

Hi --

On Thu, 28 Apr 2005, Robert Klemme wrote:

Yes, it's preferred because it's faster. Although your variant works,

I

usually prefer to put the correct number of backslashes in there:

"pious".gsub(/([aeiou])(?=[aeiou])/, '\\1\'\\2')

Why do you consider that more correct?

Although both produce the same:

p '\1'

"\\1"
=> nil

p '\\1'

"\\1"
=> nil

because Ruby is so kind to take "\1" literally (i.e not using the
backslash as escaping char because "\1" is not an escaping sequence). I
prefer to explicitely escape the backslash and put the "1" in there
literally. IMHO it's more fail safe when changes are done (especially
when changing single quotes to double quotes, see below).

If you're talking about double quotes then clearly it has to be "\\1".
I meant specifically in the case of single quotes. Anyway, it's not a
big deal -- I was just curious.

David

--
David A. Black
dblack@wobblini.net

"David A. Black" <dblack@wobblini.net> schrieb im Newsbeitrag
news:Pine.LNX.4.61.0504280509220.30582@wobblini...

Hi --

>
> "David A. Black" <dblack@wobblini.net> schrieb im Newsbeitrag
> news:Pine.LNX.4.61.0504280320190.13336@wobblini...
>> Hi --
>>
>>> Yes, it's preferred because it's faster. Although your variant

works,

> I
>>> usually prefer to put the correct number of backslashes in there:
>>>
>>> "pious".gsub(/([aeiou])(?=[aeiou])/, '\\1\'\\2')
>>
>> Why do you consider that more correct?
>
> Although both produce the same:
>
>>> p '\1'
> "\\1"
> => nil
>>> p '\\1'
> "\\1"
> => nil
>
> because Ruby is so kind to take "\1" literally (i.e not using the
> backslash as escaping char because "\1" is not an escaping sequence).

I

> prefer to explicitely escape the backslash and put the "1" in there
> literally. IMHO it's more fail safe when changes are done (especially
> when changing single quotes to double quotes, see below).

If you're talking about double quotes then clearly it has to be "\\1".
I meant specifically in the case of single quotes.

Well, yes. But quotes don't necessarily stay single through the course of
a software's lifecycle - with all this dating and matching around... :slight_smile:
Sorry, drifting off-topic.

Anyway, it's not a
big deal -- I was just curious.

Thought so. I hope I could satisfy your curiosity. :slight_smile:

Kind regards

    robert

···

On Thu, 28 Apr 2005, Robert Klemme wrote:
>> On Thu, 28 Apr 2005, Robert Klemme wrote: