How to extract something in between a pattern

Cheyne_Li · 22 September 2008 16:58

Hi experts,

I'm not very familiar with ruby's library. I wonder if there a method
can extract something in a pattern? For example,

I have a string: a=aabbccccddee

I wanna get the anything between and , which is ccddee

Thanks in advance.

···

--
Posted via http://www.ruby-forum.com/.

Guest2 · 22 September 2008 17:44

Cheyne Li wrote:

Hi experts,

I'm not very familiar with ruby's library. I wonder if there a method
can extract something in a pattern? For example,

I have a string: a=aabbccccddee

I wanna get the anything between and , which is ccddee

Thanks in advance.

C:\Users\Alex>irb
irb(main):001:0> require 'hpricot'
=> true
irb(main):002:0> a="aabbccccddee"
=> "aabbccccddee"
irb(main):003:0>
irb(main):004:0* doc=Hpricot(a)
=> #<Hpricot::Doc "aabbcc" {elem "ccddee" }>
irb(main):005:0> p doc.at('p').inner_text
"ccddee"
=> nil
irb(main):006:0>

Li

···

--
Posted via http://www.ruby-forum.com/\.

Suroot57 · 25 September 2008 00:24

If you want to use regexp, a quick and dirty way would be :

(a.split %r{</?p>})[1]

···

On Sep 22, 12:58 pm, Cheyne Li <happy.go.lucky....@gmail.com> wrote:

Hi experts,

I'm not very familiar with ruby's library. I wonder if there a method
can extract something in a pattern? For example,

I have a string: a=aabbccccddee

I wanna get the anything between and , which is ccddee

Thanks in advance.
--
Posted viahttp://www.ruby-forum.com/.

Tod_Beardsley · 22 September 2008 18:11

Ya, in this case (HTML/XML), Hpricot is your best bet.

Otherwise, standard regex stuff would apply, imo.

···

On Mon, Sep 22, 2008 at 12:44 PM, Li Chen <chen_li3@yahoo.com> wrote:

Cheyne Li wrote:

Hi experts,

I'm not very familiar with ruby's library. I wonder if there a method
can extract something in a pattern? For example,

I have a string: a=aabbccccddee

I wanna get the anything between and , which is ccddee

Thanks in advance.

C:\Users\Alex>irb
irb(main):001:0> require 'hpricot'
=> true
irb(main):002:0> a="aabbccccddee"
=> "aabbccccddee"
irb(main):003:0>
irb(main):004:0* doc=Hpricot(a)
=> #<Hpricot::Doc "aabbcc" {elem "ccddee" }>
irb(main):005:0> p doc.at('p').inner_text
"ccddee"
=> nil
irb(main):006:0>

Li
--
Posted via http://www.ruby-forum.com/\.

--
todb@planb-security.net | ICQ: 335082155 | Note: Due to Google's
privacy policy <http://tinyurl.com/5xbtl> and the United States'
policy on electronic surveillance <http://tinyurl.com/muuyl>,
please do not IM/e-mail me anything you wish to remain secret.

Ittay_Dror · 25 September 2008 05:57

suroot57@gmail.com wrote:

Hi experts,

I'm not very familiar with ruby's library. I wonder if there a method
can extract something in a pattern? For example,

I have a string: a=aabbccccddee

I wanna get the anything between and , which is ccddee

Thanks in advance.
--
Posted viahttp://www.ruby-forum.com/.

If you want to use regexp, a quick and dirty way would be :

(a.split %r{</?p>})[1]

or:
irb(main):001:0> a = 'aabbccccddee'
=> "aabbccccddee"
irb(main):002:0> a[%r{(.*)}, 1]
=> "ccddee"

···

On Sep 22, 12:58 pm, Cheyne Li <happy.go.lucky....@gmail.com> wrote:

--
Ittay Dror <ittayd@tikalk.com>
Tikal <http://www.tikalk.com>
Tikal Project <http://tikal.sourceforge.net>

--
--
Ittay Dror <ittay.dror@gmail.com>

Patrick_He · 28 September 2008 19:42

Or:

irb(main):004:0> a = 'aabbccccddeeccceee'
=> "aabbccccddeeccceee"
irb(main):005:0> a.scan(%r{([^<]*)})
=> [["ccddee"], ["eee"]]

I perfer to specify what character(s) not to match explicitly.

Ittay Dror wrote:

···

or:
irb(main):001:0> a = 'aabbccccddee'
=> "aabbccccddee"
irb(main):002:0> a[%r{(.*)}, 1]
=> "ccddee"

Topic		Replies	Views
Regular expression ruby-talk	7	118	23 March 2009
How do I extract the ID name of a Div and its content? ruby-talk	4	153	12 December 2008
Extracting text ruby-talk	0	83	11 July 2003
Regular expressions - Again ruby-talk	13	90	8 March 2007
Scan HTML ruby-talk	15	81	3 March 2008

How to extract something in between a pattern

Related topics