SAX2Parser, anyone?

Hi guys.

I'm trying to mine some info from wikipedia's xml dump. The code im
using is as follows:

require 'rexml/parsers/sax2parser'
xml = REXML::Parsers:SAX2Parser.new(File.new('enwiki-20070402-pages-articles-1.xml')
)

i = 0
parser.listen(:characters, %w{title text}){|text| puts text}
parser.parse

Which is generating the error:
xmlreap.rb:2: undefined method `new' for :SAX2Parser:Symbol (NoMethodError)

Anyone have any experience with SAX2? Or ruby stream parsing in
general? What I'm trying to do is pretty simple: grab the title and
text tag element contents and do something with them.

Thanks,
Krister Johnson

Krister Johnson wrote:

Hi guys.

I'm trying to mine some info from wikipedia's xml dump. The code im
using is as follows:

require 'rexml/parsers/sax2parser'
xml = REXML::Parsers:SAX2Parser.new(File.new('enwiki-20070402-pages-articles-1.xml')

)

i = 0
parser.listen(:characters, %w{title text}){|text| puts text}
parser.parse

Which is generating the error:
xmlreap.rb:2: undefined method `new' for :SAX2Parser:Symbol (NoMethodError)

Anyone have any experience with SAX2? Or ruby stream parsing in
general? What I'm trying to do is pretty simple: grab the title and
text tag element contents and do something with them.

I may have something useful here:

http://www.jamesbritt.com/content

···

--
James Britt

This is a syntax error. You have written
  REXML::Parsers:SAX2Parser.new()
which, I guess, is interpreted as
  REXML::Parsers(:SAX2Parser.new())

but :SAX2Parser is a symbol, and there is no new method on it ...
You wanted (two ':')
  REXML::Parsers::SAX2Parser.new()

···

On Sunday 06 May 2007, Krister Johnson wrote:

Hi guys.

I'm trying to mine some info from wikipedia's xml dump. The code im
using is as follows:

require 'rexml/parsers/sax2parser'
xml =
REXML::Parsers:SAX2Parser.new(File.new('enwiki-20070402-pages-articles-1
.xml') )

i = 0
parser.listen(:characters, %w{title text}){|text| puts text}
parser.parse

Which is generating the error:
xmlreap.rb:2: undefined method `new' for :SAX2Parser:Symbol
(NoMethodError)

--
Sylvain Joyeux

oh no. that really was it. (slaps forehead)

thanks.

krister

···

On 5/6/07, Sylvain Joyeux <sylvain.joyeux@polytechnique.org> wrote:

On Sunday 06 May 2007, Krister Johnson wrote:
> Hi guys.
>
> I'm trying to mine some info from wikipedia's xml dump. The code im
> using is as follows:
>
> require 'rexml/parsers/sax2parser'
> xml =
> REXML::Parsers:SAX2Parser.new(File.new('enwiki-20070402-pages-articles-1
>.xml') )
>
> i = 0
> parser.listen(:characters, %w{title text}){|text| puts text}
> parser.parse
>
> Which is generating the error:
> xmlreap.rb:2: undefined method `new' for :SAX2Parser:Symbol
> (NoMethodError)

This is a syntax error. You have written
  REXML::Parsers:SAX2Parser.new()
which, I guess, is interpreted as
  REXML::Parsers(:SAX2Parser.new())

but :SAX2Parser is a symbol, and there is no new method on it ...
You wanted (two ':')
  REXML::Parsers::SAX2Parser.new()

--
Sylvain Joyeux