Html 2 pdf

Hi

IIs there a gem for converting html to pdf?

I need to build a routine to step through a website and create a pdf from the
downloaded pages.

Thanks in advance

david Southwell

David Southwell wrote:

Hi

IIs there a gem for converting html to pdf?

I need to build a routine to step through a website and create a pdf from the
downloaded pages.

Thanks in advance

david Southwell

If not, you could probably just shell out to the open source htmldoc
Easy Software Products. There's a Perl interface you can learn
from, HTML-HTMLDoc
(http://search.cpan.org/~mkfrankl/HTML-HTMLDoc-0.10/\). Or you could
build something out of "hpricot" and "pdf-writer".

I have one that only works on OSX, but it works well. Lemme know if you want it and I'll try to dig it up.

···

On Jul 18, 2007, at 07:56 , David Southwell wrote:

IIs there a gem for converting html to pdf?

I need to build a routine to step through a website and create a pdf from the
downloaded pages.

If not, you could probably just shell out to the open source htmldoc
http://www.easysw.com/htmldoc/\. There's a Perl interface you can learn
from, HTML-HTMLDoc

This would probably work pretty good. I've done it before. A ruby
wrapper for this would be most appreciated, if anyone is looking for a
project!

(http://search.cpan.org/~mkfrankl/HTML-HTMLDoc-0.10/\). Or you could
build something out of "hpricot" and "pdf-writer".

If you do this, I suggest using Ruport instead of PDF::Writer for your
PDF stuff. Of course, Ruport just wraps PDF::Writer, but it fixes
some bugs (thanks to Austin Ziegler) and adds some higher level
features and helps wrap the formatting code.

Some relevant docs if that's feasible for you:

http://rubyreports.org/cheatsheets/formatting.html
http://ruportbook.com/printable_documents.html
http://ruportbook.com/integration.html#pdf_writer_proxy
http://ruportbook.com/renderer_logic.html

···

On 7/18/07, M. Edward (Ed) Borasky <znmeb@cesmail.net> wrote:

No Good.
It doesn't even fully support HTML 4 !
And no support for stylesheets...?
There must be other alternatives.
OS X has built-in print to PDF (has had for many years) so there must be GNU tool to do it.

···

On Jul 18, 2007, at 10:17 AM, M. Edward (Ed) Borasky wrote:

David Southwell wrote:

Hi

IIs there a gem for converting html to pdf?

I need to build a routine to step through a website and create a pdf from the
downloaded pages.

Thanks in advance

david Southwell

If not, you could probably just shell out to the open source htmldoc
http://www.easysw.com/htmldoc/\. There's a Perl interface you can learn
from, HTML-HTMLDoc
(http://search.cpan.org/~mkfrankl/HTML-HTMLDoc-0.10/\). Or you could
build something out of "hpricot" and "pdf-writer".

I want it:)

···

-----Original Message-----
From: Ryan Davis [mailto:ryand-ruby@zenspider.com]
Sent: Friday, July 20, 2007 2:00 PM
To: ruby-talk ML
Subject: Re: html 2 pdf

On Jul 18, 2007, at 07:56 , David Southwell wrote:

IIs there a gem for converting html to pdf?

I need to build a routine to step through a website and create a
pdf from the
downloaded pages.

I have one that only works on OSX, but it works well. Lemme know if
you want it and I'll try to dig it up.

Sure, but nothing I'm aware of that doesn't require a full browser renderer
to work. The print to PDF feature as I understand it uses a modified print
"driver" that converts the post script output into a PDF doc instead of
sending it to a printer. This is a far cry from just taking an aribtrary
html file and making it a PDF.

···

On 7/20/07, John Joyce <dangerwillrobinsondanger@gmail.com> wrote:

On Jul 18, 2007, at 10:17 AM, M. Edward (Ed) Borasky wrote:

> David Southwell wrote:
>> Hi
>>
>> IIs there a gem for converting html to pdf?
>>
>> I need to build a routine to step through a website and create a
>> pdf from the
>> downloaded pages.
>>
>> Thanks in advance
>>
>> david Southwell
>>
>
> If not, you could probably just shell out to the open source htmldoc
> Easy Software Products. There's a Perl interface you can learn
> from, HTML-HTMLDoc
> (http://search.cpan.org/~mkfrankl/HTML-HTMLDoc-0.10/\). Or you could
> build something out of "hpricot" and "pdf-writer".
>
No Good.
It doesn't even fully support HTML 4 !
And no support for stylesheets...?
There must be other alternatives.
OS X has built-in print to PDF (has had for many years) so there must
be GNU tool to do it.

--
===Tanner Burson===
tanner.burson@gmail.com
http://www.tannerburson.com