[ANN] trenni-sanitize - a fast markup sanitizer

Hi - I've been processing a lot of HTML content with the intent to extract
either text summaries or sanitise it for further use. Frustrated primarily
by the existing sanitize gem due to it's performance and nokogiri's
handling of namespaces, I have released trenni-sanitize - a gem which uses
a native parser to sanitize general markup style input and runs about 50x
faster :slight_smile:

Kind regards,
Samuel

Sounds great!

···

On Mar 15, 2018 4:09 AM, "Samuel Williams" <space.ship.traveller@gmail.com> wrote:

Hi - I've been processing a lot of HTML content with the intent to extract
either text summaries or sanitise it for further use. Frustrated primarily
by the existing sanitize gem due to it's performance and nokogiri's
handling of namespaces, I have released trenni-sanitize - a gem which uses
a native parser to sanitize general markup style input and runs about 50x
faster :slight_smile:

GitHub - ioquatix/trenni-sanitize: Sanitize markup by adding, changing or removing tags.

Kind regards,
Samuel

Unsubscribe: <mailto:ruby-talk-request@ruby-lang.org?subject=unsubscribe>
<http://lists.ruby-lang.org/cgi-bin/mailman/options/ruby-talk&gt;