I'm trying to parse through some html code and count the number of times
a match happens. The file is a large table with a ton of <tr> and <tr
'something'>. There are no spaces in the file. I'm trying to count and
print each <tr> and <tr 'something'>.
I haven't even gotten to counting my matches. I'm still working on
matching with <tr> or <tr 'anything'>
I've done:
op_file = HTML_CODE
if op_file =~ /(<tr(.*?)>)+/
but it catches everything on from the first <tr to the end of the line.
Any ideas?
I'm trying to parse through some html code and count the number of times
a match happens. The file is a large table with a ton of <tr> and <tr
'something'>. There are no spaces in the file. I'm trying to count and
print each <tr> and <tr 'something'>.
I haven't even gotten to counting my matches. I'm still working on
matching with <tr> or <tr 'anything'>
I've done:
op_file = HTML_CODE
if op_file =~ /(<tr(.*?)>)+/
You want if op_file =~ /<tr.*?>/
But see below.
but it catches everything on from the first <tr to the end of the line.
I'm trying to parse through some html code and count the number of times a match happens. The file is a large table with a ton of <tr> and <tr 'something'>. There are no spaces in the file. I'm trying to count and print each <tr> and <tr 'something'>.
I haven't even gotten to counting my matches. I'm still working on matching with <tr> or <tr 'anything'>
I've done:
op_file = HTML_CODE
if op_file =~ /(<tr(.*?)>)+/
You are parsing always one line only.
Perhaps you mean a Regular Expression like
/(<tr([^>]*?>)+/m
Anyway I am not sure if the if... is the right
construct. Don't you want to get the return value
of the match, which delivers you a MatchData
object from which you can get the results as
an array or so.
I'm trying to parse through some html code and count the number of times a match happens. The file is a large table with a ton of <tr> and <tr 'something'>. There are no spaces in the file. I'm trying to count and print each <tr> and <tr 'something'>.
I haven't even gotten to counting my matches. I'm still working on matching with <tr> or <tr 'anything'>
I've done:
op_file = HTML_CODE
if op_file =~ /(<tr(.*?)>)+/
You are parsing always one line only.
Perhaps you mean a Regular Expression like
/(<tr([^>]*?>)+/m
The /m doesn't make any difference there, because you're not using the
wildcard dot. /m just adds \n to the dot class.