Premature end of regular expression with non-ascii chara

Nuralanur · 31 January 2006 18:56

Dear Nick,

I'm glad things finally worked out for you.
Actually, I think things like '\352' do not split any further (at least on my
system, cygwin 6.7.0.0-6 on Windows XP);
this seems to be a Windows-specific encoding for "ê" etc.
In particular, it is not equivalent to a string composed of '\', '3','5','2'
(you would have to double the '\' in the regexp if you were searching
for a string '\352' in a text).
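To make the distinction concrete, here is a minimal sketch. One caveat: in Ruby source the quoting matters. A double-quoted "\352" is an octal escape for a single byte (0xEA), while a single-quoted '\352' really is the four characters backslash, 3, 5, 2. The constant names are illustrative only, and the last line assumes the byte is Latin-1 text, which is a guess about the original data rather than something the thread confirms.

```ruby
# Illustrative constant names; none of them come from the thread.

# Double-quoted: "\352" is an octal escape for one byte, 0xEA.
# .b tags the string as binary so Ruby does not treat it as (invalid) UTF-8.
ONE_BYTE = "\352".b

# Single-quoted: no octal escape is processed, so this is the four
# characters backslash, 3, 5, 2.
FOUR_CHARS = '\352'

# The backslash is doubled so the regexp matches a literal backslash.
LITERAL_RE = /\\352/

puts ONE_BYTE.bytesize               # 1
p FOUR_CHARS.chars                   # ["\\", "3", "5", "2"]
puts LITERAL_RE.match?(FOUR_CHARS)   # true
puts LITERAL_RE.match?(ONE_BYTE)     # false

# Assumption: if the byte is Latin-1 text, it decodes to "ê" (U+00EA).
puts ONE_BYTE.dup.force_encoding("ISO-8859-1").encode("UTF-8") == "\u00EA"
```

So the doubled-backslash regexp only finds the four-character text, never the single byte.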

But the line

splitted_text=text.split(/(?=.)/)

should produce an Array with the individual letters in the string.
That uses a regexp feature called zero-width positive
lookahead (see http://www.regular-expressions.info/lookaround.html).
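As a small illustration of the lookahead split, the sketch below wraps the same regexp in a helper. The method name split_by_lookahead and the sample input are made up for this example.

```ruby
# split_by_lookahead is a hypothetical helper name used only for this sketch.
def split_by_lookahead(text)
  # (?=.) matches the empty position in front of any character other than
  # a newline, so split cuts there without consuming anything.
  text.split(/(?=.)/)
end

p split_by_lookahead("hello")   # ["h", "e", "l", "l", "o"]
```

Because the lookahead consumes nothing, no characters are lost at the split points.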

Please let us all know if you still encounter problems.

Best regards,

Axel

Logan_Capaldo · 31 January 2006 20:06

Zero-width positive lookahead is a little overkill; split(//) works just as well, AFAIK.
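A quick way to check that claim is to compare both forms on the same input. This is only a sketch: the sample string is an assumption (a byte string containing the \352 byte, tagged as binary so each byte counts as one character), and the constant name is illustrative.

```ruby
# Illustrative sample: "f", the byte 0xEA, "t", "e", tagged as binary data.
SAMPLE_BYTES = "f\352te".b

empty_split     = SAMPLE_BYTES.split(//)        # empty regexp: cut between every character
lookahead_split = SAMPLE_BYTES.split(/(?=.)/)   # cut in front of every non-newline character

puts empty_split.length                  # 4
puts empty_split == lookahead_split      # true
puts empty_split == SAMPLE_BYTES.chars   # true
```

One difference worth keeping in mind: "." does not match a newline by default, so the two forms can diverge on multi-line input.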
