How to process a long string ( get some parts ? )

RaymondLHW · 20 March 2003 16:57

if I get the content of a HTML file and put in a string
how can I get all the links inside and store them in array ( or any kind of lists ) ?

I use regular expression , but if I use .match function, the result is “Match” type , I can’t use “gsub” or “sub” to process it

please give me some advice

thank you

Robert · 20 March 2003 18:18

Administrator@BOND ~/bin/test
$ cat href.rb

def links(fileName)
text = File.open(fileName) do |f|
f.readlines.join
end

text.scan(/<a\s+href=“([^”]*)"/i) do |m|
puts m[0]
end
end

links(“href.rb”)

END

Administrator@BOND ~/bin/test $ ruby href.rb http://foo.bar http://foo.baz

Administrator@BOND ~/bin/test
$

“RaymondLHW” raymondlhw@sinaman.com schrieb im Newsbeitrag
news:b5cri9$27p391$1@ID-157330.news.dfncis.de…

if I get the content of a HTML file and put in a string
how can I get all the links inside and store them in array ( or any kind
of lists ) ?

I use regular expression , but if I use .match function, the result is
“Match” type , I can’t use “gsub” or “sub” to process it

···

please give me some advice

thank you

RaymondLHW · 21 March 2003 01:00

text.scan … let me try
thank you : )

“Robert Klemme” bob.news@gmx.net wrote in message news:b5cv74$28okso$2@ID-52924.news.dfncis.de…

···

Administrator@BOND ~/bin/test
$ cat href.rb

def links(fileName)
text = File.open(fileName) do |f|
f.readlines.join
end

text.scan(/<a\s+href=“([^”]*)"/i) do |m|
puts m[0]
end
end

links(“href.rb”)

END
Administrator@BOND ~/bin/test $ ruby href.rb http://foo.bar http://foo.baz
Administrator@BOND ~/bin/test
$

“RaymondLHW” raymondlhw@sinaman.com schrieb im Newsbeitrag
news:b5cri9$27p391$1@ID-157330.news.dfncis.de…

if I get the content of a HTML file and put in a string
how can I get all the links inside and store them in array ( or any kind
of lists ) ?

I use regular expression , but if I use .match function, the result is
“Match” type , I can’t use “gsub” or “sub” to process it

please give me some advice

thank you

Topic		Replies	Views
Using regular expressions ruby-talk	1	113	11 November 2008
Accessing all matches using regex ruby-talk	4	135	6 August 2007
Search on text ruby-talk	3	110	7 August 2010
Getting a list of results from one regular expression ruby-talk	6	146	23 June 2005
Is there link extractor or similar html processing libs for Ruby ruby-talk	16	172	10 March 2006

How to process a long string ( get some parts ? )

Related topics