Hello,
I have a variable which holds the following html code:
<object width="425" height="344"><param name="movie"
value="http://www.youtube.com/v/6fEogbKnNcQ&hl=en&fs=1&"></param><param
name="allowFullScreen" value="true"></param><param
name="allowscriptaccess" value="always"></param><embed
src="http://www.youtube.com/v/6fEogbKnNcQ&hl=en&fs=1&"
type="application/x-shockwave-flash" allowscriptaccess="always"
allowfullscreen="true" width="425" height="344"></embed></object>
<span>some text<a href="..">link</a> .. </span>
Now I'd like to process it so that everything is stripped except the
object/param/embed tags (the span tag and its content should be removed).
What is the best way to accomplish that?
I've installed the sanitize gem, but it won't let me remove the content
inside HTML tags as well.
I thought of using a regex, but I'm unsure about this.
Many thanks for your help!
--
Posted via http://www.ruby-forum.com/ .
Tim_Pease
(Tim Pease)
22 October 2009 21:15
2
Have a look at nokogiri. It will parse the HTML for you and you can then tinker with it as you please. After you're done, just write it back out as a string.
Blessings,
TwP
On Oct 22, 2009, at 12:03 PM, Aljaz Fajmut wrote:
Hello,
I have a variable which holds the following html code:
<object width="425" height="344"><param name="movie"
value="http://www.youtube.com/v/6fEogbKnNcQ&hl=en&fs=1&"></param><param
name="allowFullScreen" value="true"></param><param
name="allowscriptaccess" value="always"></param><embed
src="http://www.youtube.com/v/6fEogbKnNcQ&hl=en&fs=1&"
type="application/x-shockwave-flash" allowscriptaccess="always"
allowfullscreen="true" width="425" height="344"></embed></object>
<span>some text<a href="..">link</a> .. </span>
Now I'd like to process it so that everything is stripped except the
object/param/embed tags (the span tag and its content should be removed).
What is the best way to accomplish that?
I've installed the sanitize gem, but it won't let me remove the content
inside HTML tags as well.
I thought of using a regex, but I'm unsure about this.
Many thanks for your help!
Please don't use regexes. Think of the children!
Instead, try Loofah, which is based on Nokogiri and has some nice
sanitization helpers:
http://loofah.rubyforge.org/
Good luck!