Character encoding in 1.9

I have some code in 1.8 that would strip certain special characters out
of a string (escape sequences in Telnet specifically, such as
\377\373\001).

I've moved to 1.9 and now that code fails with "invalid multibyte
escape".

I've looked at some of the tutorials regarding encodings in 1.9 and to
be honest I find them very intimidating. I'm hoping someone can suggest
a quick fix?

Here's the code:

[3,4,5,6,7].each do |n|
    line.gsub!(/#{('\377\37' + n.to_s + '\001')}/,"")
end

Many thanks in advance!

- Andrew

···

--
Posted via http://www.ruby-forum.com/.

Check out the Regexp documentation, specifically the 'new'/'compile'
method: Class: Regexp (Ruby 1.9.3). I
think you'll need to use Regexp.compile to create an "encoding-less"
regular expression.

I would guess something like this -

[3,4,5,6,7].each do |n|
    line.gsub!(Regexp.compile('\377\37' + n.to_s + '\001', nil, 'n'),"")
end

You'll have to fiddle with it, as I didn't have a chance to test it out to
see if this actually works. I'd have to know the encoding the 'line'
variable to accurately test it anyway.

···

On Mon, Dec 24, 2012 at 2:34 PM, Andrew S. <lists@ruby-forum.com> wrote:

I have some code in 1.8 that would strip certain special characters out
of a string (escape sequences in Telnet specifically, such as
\377\373\001).

I've moved to 1.9 and now that code fails with "invalid multibyte
escape".

I've looked at some of the tutorials regarding encodings in 1.9 and to
be honest I find them very intimidating. I'm hoping someone can suggest
a quick fix?

Here's the code:

[3,4,5,6,7].each do |n|
    line.gsub!(/#{('\377\37' + n.to_s + '\001')}/,"")
end

Many thanks in advance!

- Andrew

--
Posted via http://www.ruby-forum.com/\.

line = "hello\377\373\001 \377\374\001 \377\375\001world"

(3..7).each do |n|
  pattern = "\377" << (0370 + n).chr << "\001"
  line.gsub! pattern, " "
  puts line
end

--output:--
hello ?? ??world
hello ??world
hello world
hello world
hello world

···

--
Posted via http://www.ruby-forum.com/.

line = "hello\377\373\001 \377\374\001 \377\375\001world"
p line

regex = /\377[\373-\377]\001/

line.gsub! regex, ''
p line

--output:--
"hello\xFF\xFB\x01 \xFF\xFC\x01 \xFF\xFD\x01world"
"hello world"

···

--
Posted via http://www.ruby-forum.com/.