Hi, Young:
I think you might check out the Universal Feed Parser at feedparser.org,
a Python package created mainly by Mark Pilgrim. I'm guessing Google
Reader uses it in the backend.
As far as I know, there's no equivalent Ruby package for that.
Regards,
···
On 7/13/09, Young Gyu Park <ygpark2@gmail.com> wrote:
They added an unnecessary HTML tag, which made the RSS malformed.
Glancing at the output of their feed I see no malformed RSS. I do see
them "exercising some options" that most feeds don't, such as embedding
CDATA in the link tags.
Not all feeds are this straightforward or well constructed. That's where
using a pre-built parsing library comes in handy, but I haven't found
one yet that handles everything out there correctly. Even Google Reader
gets it wrong on some malformed feeds.
Aaron Patterson (AKA tenderlove) has done a great job with Nokogiri.
I've tested a lot of feeds and seen occasions where the built-in RSS
reader and other libraries puked or spun off and never returned. I've
run into feeds that caused Hpricot to be unable to strip broken HTML
embedded inside the descriptions, but Nokogiri was able to handle it.
So, if you can't get a library to do what you want, jump in with
Nokogiri and give it a try.
In <587ca64f0907130826q6eebbb0ar8c7b3404a277f522@mail.gmail.com>
"Re: rss parsing error." on Tue, 14 Jul 2009 00:26:37 +0900,
Hi, Young:
I think you might check out the Universal Feed Parser at feedparser.org,
a Python package created mainly by Mark Pilgrim. I'm guessing Google
Reader uses it in the backend.
As far as I know, there's no equivalent Ruby package for that.
The RSS can be parsed with the bundled RSS Parser.
We don't need to use the Universal Feed Parser.
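For reference, a minimal sketch of parsing with the bundled parser might look like the following. The sample feed, the SAMPLE_FEED constant, and the item_links helper are invented for illustration (the <link> values are wrapped in CDATA, like the feed discussed earlier in the thread); RSS::Parser.parse is the library's real entry point. On recent Ruby versions the rss library ships as a bundled gem rather than in the core standard library.

```ruby
require "rss"

# Hypothetical sample feed with CDATA-wrapped <link> values.
SAMPLE_FEED = <<~XML
  <?xml version="1.0" encoding="UTF-8"?>
  <rss version="2.0">
    <channel>
      <title>Example Feed</title>
      <link><![CDATA[http://example.com/]]></link>
      <description>Sample channel</description>
      <item>
        <title>First post</title>
        <link><![CDATA[http://example.com/posts/1]]></link>
        <description>Hello</description>
      </item>
    </channel>
  </rss>
XML

# Hypothetical helper: returns [title, link] pairs for every item.
def item_links(xml)
  # The second argument turns validation off, so feeds that bend the
  # spec are less likely to be rejected outright.
  feed = RSS::Parser.parse(xml, false)
  feed.items.map { |item| [item.title, item.link] }
end

item_links(SAMPLE_FEED).each do |title, link|
  puts "#{title}: #{link}"
end
```

Leaving validation on (the default) makes the parser raise on feeds that violate the spec, which is useful for checking whether a feed really is malformed.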
···
Juvenn Woo <machese@gmail.com> wrote:
On 7/13/09, Young Gyu Park <ygpark2@gmail.com> wrote: