[ARTICLE] HTML Scraping Using WWW::Mechanize

Recent discussion here on HTML parsing and using Michael Neumann's outstanding Mechanize library prompted me to straighten up some code and write up an explanation of how I parse CafePress pages to build the rubystuff.com Web site.

I've put the draft up at:

http://neurogami.com/cafe-fetcher/

Not sure if that will be the long-term home, though.

Comments welcome.

James Britt

--

http://www.ruby-doc.org - Ruby Help & Documentation
http://www.artima.com/rubycs/ - Ruby Code & Style: Writers wanted
http://www.rubystuff.com - The Ruby Store for Ruby Stuff
http://www.jamesbritt.com - Playing with Better Toys
http://www.30secondrule.com - Building Better Tools

Super! Thanks James.

Dude. You rule. Thank you!

--Steve

On Dec 3, 2005, at 10:59 PM, James Britt wrote:

Recent discussion here on HTML parsing and using Michael Neumann's outstanding Mechanize library prompted me to straighten up some code and write up an explanation of how I parse CafePress pages to build the rubystuff.com Web site.

James-

Thank you! Very nice article, with good info that's a bit hard to find elsewhere.

Thanks-
-Ezra Zygmuntowicz
WebMaster
Yakima Herald-Republic Newspaper
ezra@yakima-herald.com
509-577-7732
