[ARTICLE] HTML Scraping Using WWW::Mechanize

James_Britt4 · 4 December 2005 06:59

Recent discussion here on HTML parsing and using Michael Neumann's outstanding Mechanize library prompted me to straighten up some code and write up an explanation of how I parse CafePress pages to build the rubystuff.com Web site.

I've put the draft up at:

http://neurogami.com/cafe-fetcher/

Not sure if that will be the long-term home, though.

Comments welcome.

James Britt

···

--

http://www.ruby-doc.org - Ruby Help & Documentation
http://www.artima.com/rubycs/ - Ruby Code & Style: Writers wanted
http://www.rubystuff.com - The Ruby Store for Ruby Stuff
http://www.jamesbritt.com - Playing with Better Toys
http://www.30secondrule.com - Building Better Tools

Its_Me · 4 December 2005 14:22

Super! Thanks James.

"James Britt" <james_b@neurogami.com> wrote in message
news:4392941F.8090504@neurogami.com...

···

Recent discussion here on HTML parsing and using Michael Neumann's
outstanding Mechanize library prompted me to straighten up some code and
write up an explanation of how I parse CafePress pages to build the
rubystuff.com Web site.

I've put the draft up at:

How I Did It: Building RubyStuff.com

Not sure if that will be the long-term home, though.

Comments welcome.

James Britt

--

http://www.ruby-doc.org - Ruby Help & Documentation
Ruby Code & Style - Ruby Code & Style: Writers wanted
http://www.rubystuff.com - The Ruby Store for Ruby Stuff
http://www.jamesbritt.com - Playing with Better Toys
http://www.30secondrule.com - Building Better Tools

Stephen_Waits1 · 4 December 2005 17:01

Dude. You rule. Thank you!

--Steve

···

On Dec 3, 2005, at 10:59 PM, James Britt wrote:

Recent discussion here on HTML parsing and using Michael Neumann's outstanding Mechanize library prompted me to straighten up some code and write up an explanation of how I parse CafePress pages to build the rubystuff.com Web site.

Ezra_Zygmuntowicz3 · 4 December 2005 19:39

James-

Thank you! Very nice article with good info thats a bit hard to find elsewhere.

Thanks-
-Ezra Zygmuntowicz
WebMaster
Yakima Herald-Republic Newspaper
ezra@yakima-herald.com
509-577-7732

···

On Dec 3, 2005, at 10:59 PM, James Britt wrote:

Recent discussion here on HTML parsing and using Michael Neumann's outstanding Mechanize library prompted me to straighten up some code and write up an explanation of how I parse CafePress pages to build the rubystuff.com Web site.

I've put the draft up at:

How I Did It: Building RubyStuff.com

Not sure if that will be the long-term home, though.

Comments welcome.

James Britt

Topic		Replies	Views
Fun with WWW::Mechanize ruby-talk	7	70	23 January 2005
Ruby HTML Tools - ruby-htmltools Examples ruby-talk	2	132	24 March 2006
Website screen scraping with Mechanize or Rubyful Soup ruby-talk	9	120	13 September 2005
How to search a website ruby-talk	9	167	7 September 2008
Waiter, there's a noob in my soup! ruby-talk	14	151	29 March 2006

[ARTICLE] HTML Scraping Using WWW::Mechanize

Related topics