Premature end of regular expression with non-ascii chara

Dear Nick,

I'm glad things finally worked out for you.
Actually, I things like '\352' do not to split any further (at least on my
system (cygwin 6.7.0.0-6) on Windows XP),
this seems to be some encoding for "ê" etc. that is Windows-specific.
In particular, it is not equivalent to a string composed of '\', '3','5','2'
(you would have to double the '\' in the regexp if you were searching
for a string '\352' in a text).

But the line

splitted_text=text.split(/(?=.)/)

should produce an Array with the individual letters in the string.
That involves a concept about regexps called zero-width positive
lookahead (see _http://www.regular-expressions.info/lookaround.html_
(http://www.regular-expressions.info/lookaround.html) ).

Please let us all know if you still encounter problems.

Best regards,

Axel

Zero width positive lookahead is a little overkill, split(//) works just as well, AFAIK

···

On Jan 31, 2006, at 1:56 PM, Nuralanur@aol.com wrote:

Dear Nick,

I'm glad things finally worked out for you.
Actually, I things like '\352' do not to split any further (at least on my
system (cygwin 6.7.0.0-6) on Windows XP),
this seems to be some encoding for "ê" etc. that is Windows-specific.
In particular, it is not equivalent to a string composed of '\', '3','5','2'
(you would have to double the '\' in the regexp if you were searching
for a string '\352' in a text).

But the line

splitted_text=text.split(/(?=.)/)

should produce an Array with the individual letters in the string.
That involves a concept about regexps called zero-width positive
lookahead (see _http://www.regular-expressions.info/lookaround.html_
(http://www.regular-expressions.info/lookaround.html\) ).

Please let us all know if you still encounter problems.

Best regards,

Axel