New in this release:
···
====================
* Updated to Unicode 6.2.0.
* UnicodeUtils.debug accepts single Integer code point
* New method UnicodeUtils.white_space_char?
Usage
Ruby 1.9.1 or higher is required.
$ gem install unicode_utils
require "unicode_utils/display_width"
UnicodeUtils.display_width("にっき") # => 6
$ irb -r unicode_utils/u
irb(main):001:0> U.debug 0x20ba
Char | Ordinal | Sid | General Category | UTF-8
------+---------+-------------------+------------------+----------
"₺" | 20BA | TURKISH LIRA SIGN | Currency_Symbol | E2 82 BA
irb(main):003:0> U.casefold("Straße") == U.casefold("STRASSE")
=> true
irb(main):004:0> U.titlecase "willkommen österreich"
=> "Willkommen Österreich"
irb(main):005:0> U.nfkc "finland"
=> "finland"
Documentation & Source
http://unicode-utils.rubyforge.org
http://github.com/lang/unicode_utils
Issues
It should work on all Ruby 1.9.1 implementations or higher
independently of operating system. If not, please report
it on http://github.com/lang/unicode_utils/issues
All tests pass with jruby-1.7.0.RC1. Not all tests pass with
MRI 1.9.3p194 due to unexptected behaviour of String#<< with
UTF-16 strings. As long as you use only UTF-8, there's no problem.
(http://bugs.ruby-lang.org/issues/7090).
--
Stefan Lang