Wanted: Script to convert to/from UTF-8/UTF-16/UTF-32

Does someone have a little ruby script that will read a file in
UTF-8/UTF-16/UTF-32 (my choice) and search for all the characters between
0x7f-0xffffff and convert them to an ASCII digit string that begins with
"&#" and ends with ";" and output the whole thing? If not, could someone
tell me how to write one?

How about a script to do the inverse?

Thanks!
siegfried

you could just convert everything to it's html entity form

http://htmlentities.rubyforge.org/
gem install -r htmlentities

+David

···

--- On Sat, 8/30/08, Siegfried Heintze <siegfried@heintze.com> wrote:

From: Siegfried Heintze <siegfried@heintze.com>
Subject: Wanted: Script to convert to/from UTF-8/UTF-16/UTF-32
To: "ruby-talk ML" <ruby-talk@ruby-lang.org>
Date: Saturday, August 30, 2008, 9:00 PM
Does someone have a little ruby script that will read a file
in
UTF-8/UTF-16/UTF-32 (my choice) and search for all the
characters between
0x7f-0xffffff and convert them to an ASCII digit string
that begins with
"&#" and ends with ";" and output
the whole thing? If not, could someone
tell me how to write one?

How about a script to do the inverse?

Thanks!
siegfried

Siegfried,

There is a wrapper library for the standard iconv that takes care of
this stuff:

http://www.ruby-doc.org/stdlib/libdoc/iconv/rdoc/index.html

Take a look at:

http://www.websideattractions.com/2007/10/24/ruby-script-to-convert-files-character-encoding/

Best regards,

···

--
Aníbal Rojas

http://anibal@rojas.com.ve

On Aug 30, 8:22 pm, "Siegfried Heintze" <siegfr...@heintze.com> wrote:

Does someone have a little ruby script that will read a file in
UTF-8/UTF-16/UTF-32 (my choice) and search for all the characters between
0x7f-0xffffff and convert them to an ASCII digit string that begins with
"&#" and ends with ";" and output the whole thing? If not, could someone
tell me how to write one?

How about a script to do the inverse?

Thanks!
siegfried