Read html then output one line of html if found?

Mmcolli00_Mom · 12 March 2009 20:36

Hi
I have many rspecs showing the duration of several test examples. These
rspec fils are in an HTML format. I want to be able to grab the
durations and htmls and write to a textfile. However I can not figure
out how to pull the 1 line from the html. When I use puts it shows the
whole html. Will you help me grab one line out of the html?

Here is my snippet.

require "fileutils"

Dir["C:/Respecs/*.html"].each do |htmlfile|
    readhtml = File.read(htmlfile)
    if readhtml.include?("seconds") == true
     htmlbase = File.basename(htmlfile)
     puts htmlbase #<--shows full html file not just the line that
"seconds" is located on.

end

···

--
Posted via http://www.ruby-forum.com/.

F_Senault · 13 March 2009 08:18

Here is my snippet.

require "fileutils"

Dir["C:/Respecs/*.html"].each do |htmlfile|
    readhtml = File.read(htmlfile)
    if readhtml.include?("seconds") == true
     htmlbase = File.basename(htmlfile)
     puts htmlbase
end

For the easy way, try readlines and grep :

h = File.readlines('f1.txt')

=> ["<h1>hhhhhh</h1>\n", "<h2>20 seconds</h2>\n", "<p>Blah.</p>\n",
"\n"]

h.grep(/seconds/)

=> ["<h2>20 seconds</h2>\n"]

For a more sophisticated (and time-consuming) approach, try an HTML
parser like Hpricot :

require "hpricot"

=> true

doc = Hpricot(File.read('f1.txt'))

=> #<Hpricot::Doc {elem <h1> "hhhhhh" </h1>} "\n" {elem <h2> "20
seconds" </h2>} "\n" {elem <p> "Blah." </p>} "\n\n">

doc.children.select { |e| e.inner_html =~ /seconds/ }

=> [{elem h2 "20 seconds" h2}]

HTH.

Fred

···

Le 12 mars à 21:36, Mmcolli00 Mom a écrit :
--
Everyone is bad in their own way. Finding out each person's unique way
of being bad is most of the fun of getting to know them.
(Lee Wilson)

Mmcolli00_Mom · 13 March 2009 12:59

Thanks Fred - this is very helpful!

···

--
Posted via http://www.ruby-forum.com/.

Topic		Replies	Views
Parsing of Html/Text files ruby-talk	3	138	4 January 2010
Search string in HTML file ruby-talk	8	125	29 September 2006
Extract a number from a line of HTML file ruby-talk	4	120	13 August 2007
Return first line of parsing ruby-talk	4	111	17 August 2007
Need help parsing HTML with Hpricot ruby-talk	3	131	25 October 2007

Read html then output one line of html if found?

Related topics