Regexp strip html

i have a regexp able to strip html :

/<[^>]*>/

however, between <script and </script> all the "text is preserved, tjen
i've tried :

  def stripHTML
# self.gsub(/<\S[^><]*>/, '')
    self.gsub(/\A.*<body [^>]*>(.*)<\/body>\s*\Z/, '\1').gsub(/<[^>]*>/,
'')
  end

without success : the various javascript functions are kept ?

what's my error here ?

···

--
une bévue

i have a regexp able to strip html :

/<[^>]*>/

however, between <script and </script> all the "text is preserved, tjen

...

what's my error here ?

Look at it this way: you have '<script>Javascript</script>'. You
remove everything between angle brackets. You still have 'Javascript',
because that's not actually inside <...>.

The simplest solution is probably to do something like this before
stripping out the remaining tags:

gsub(/<script.*?</script>/im, '')

Paul.

···

On 26/03/06, Une bévue <pere.noel@laponie.com.invalid> wrote:

The simplest solution is probably to do something like this
before

stripping out the remaining tags:

gsub(/<script.*?</script>/im,

'')

yes, sounds clever ))

···

Paul Battley <pbattley@gmail.com> wrote:

--
une bévue