Duplicating characters in a string

Adam_Akhtar · 6 March 2008 16:12

If i have a string "abc" and want to make it like this "aabbcc", how do
i go about it?

I thought convert it into an array and then use string.map{|x| x << x}

though i havnt tested that yet. Is there a better way or a string method
that saves me from having to convert it into an array?

···

--
Posted via http://www.ruby-forum.com/.

Tim_Hunter1 · 6 March 2008 16:18

Adam Akhtar wrote:

If i have a string "abc" and want to make it like this "aabbcc", how do
i go about it?

I thought convert it into an array and then use string.map{|x| x << x}

though i havnt tested that yet. Is there a better way or a string method
that saves me from having to convert it into an array?

I strongly doubt that this is the only way, but I'll offer

irb(main):004:0> "abc".gsub(/./) { |c| c + c}
=> "aabbcc"

Now let's see how many other ways people can think of...

···

--
Posted via http://www.ruby-forum.com/\.

Gary_Wright · 6 March 2008 16:27

If i have a string "abc" and want to make it like this "aabbcc", how do
i go about it?

I thought convert it into an array and then use string.map{|x| x << x}

though i havnt tested that yet. Is there a better way or a string method
that saves me from having to convert it into an array?

Ask 10 Ruby programmers this question and I'm sure you'll get 11
different answers. If your definition of 'best' is 'fastest' you'll
simply have to code and benchmark a couple of solutions. Here is one
way to do it:

>> a = "abc"
=> "abc"
>> (b = a.split(//)).zip(b).join
=> "aabbcc"

Gary Wright

···

On Mar 6, 2008, at 11:12 AM, Adam Akhtar wrote:

W_James · 8 March 2008 19:49

s = "aabbcc"
    ==>"aabbcc"
s = s.squeeze
    ==>"abc"
s = s.unsqueeze
    ==>"aabbcc"

Yes, that should have been saved till April 1.

···

On Mar 6, 10:12 am, Adam Akhtar <adamtempor...@gmail.com> wrote:

If i have a string "abc" and want to make it like this "aabbcc", how do
i go about it?

I thought convert it into an array and then use string.map{|x| x << x}

though i havnt tested that yet. Is there a better way or a string method
that saves me from having to convert it into an array?
--
Posted viahttp://www.ruby-forum.com/.

Drew_Olson · 11 March 2008 21:53

Adam Akhtar wrote:

If i have a string "abc" and want to make it like this "aabbcc", how do
i go about it?

I thought convert it into an array and then use string.map{|x| x << x}

though i havnt tested that yet. Is there a better way or a string method
that saves me from having to convert it into an array?

Another approach:

"abc".split(//).map{|c| c*2}.join

···

--
Posted via http://www.ruby-forum.com/\.

Sebastian_Hungereck1 · 6 March 2008 16:40

Tim Hunter wrote:

"abc".gsub(/./) { |c| c + c}
=> "aabbcc"

"abc".gsub(/(.)/,'\1\1')
=> "aabbcc"

···

--
NP: Placebo - Every You Every Me
Jabber: sepp2k@jabber.org
ICQ: 205544826

Adam_Akhtar · 6 March 2008 16:53

Ask 10 Ruby programmers this question and I'm sure you'll get 11
different answers. If your definition of 'best' is 'fastest' you'll
simply have to code and benchmark a couple of solutions.

Well actually im new to ruby so although fastest/best solutions would be
appreciated I'd actually prefer ones that were not to advanced or
cryptic to read but at the same time not to innefficent.

Thanks so far for your suggestions!!

···

--
Posted via http://www.ruby-forum.com/\.

7stud · 6 March 2008 16:57

Adam Akhtar wrote:

Ask 10 Ruby programmers this question and I'm sure you'll get 11
different answers. If your definition of 'best' is 'fastest' you'll
simply have to code and benchmark a couple of solutions.

Well actually im new to ruby so although fastest/best solutions would be
appreciated I'd actually prefer ones that were not to advanced or
cryptic to read but at the same time not to innefficent.

Thanks so far for your suggestions!!

str = 'abc'
repeat = 2

new_str = ""
0.upto(str.length) do |i|
new_str << str[i, 1] * repeat
end

puts new_str

···

--
Posted via http://www.ruby-forum.com/\.

7stud · 6 March 2008 17:03

Adam Akhtar wrote:

Ask 10 Ruby programmers this question and I'm sure you'll get 11
different answers. If your definition of 'best' is 'fastest' you'll
simply have to code and benchmark a couple of solutions.

Well actually im new to ruby so although fastest/best solutions would be
appreciated I'd actually prefer ones that were not to advanced or
cryptic to read but at the same time not to innefficent.

Thanks so far for your suggestions!!

str = 'abc'
duplicate = 2

new_str = ""
str.each_byte do |byte|
  duplicate.times do
    new_str << byte
  end
end

puts new_str

···

--
Posted via http://www.ruby-forum.com/\.

Robert_K1 · 6 March 2008 17:59

You do not need groups:

irb(main):003:0> "abc".gsub /./, '\\&\\&'
=> "aabbcc"

Cheers

robert

···

On 06.03.2008 17:40, Sebastian Hungerecker wrote:

Tim Hunter wrote:

"abc".gsub(/./) { |c| c + c}
=> "aabbcc"

"abc".gsub(/(.)/,'\1\1')
=> "aabbcc"

Bermejo_Rodrigo · 6 March 2008 17:14

7stud -- wrote:

Adam Akhtar wrote:

Ask 10 Ruby programmers this question and I'm sure you'll get 11
different answers. If your definition of 'best' is 'fastest' you'll
simply have to code and benchmark a couple of solutions.

#answers.succ

"abc".gsub(/./) { |x| x * 2 }

···

--
Posted via http://www.ruby-forum.com/\.

Sebastian_Hungereck1 · 6 March 2008 18:06

Robert Klemme wrote:

You do not need groups:

irb(main):003:0> "abc".gsub /./, '\\&\\&'
=> "aabbcc"

You do not need double backslashes either:

"abc".gsub /./, '\&\&'

=> "aabbcc"

···

--
NP: Katatonia - Quiet World
Jabber: sepp2k@jabber.org
ICQ: 205544826

Joel_VanderWerf1 · 6 March 2008 22:02

Rodrigo Bermejo wrote:

"abc".gsub(/./) { |x| x * 2 }

That's the most elegant way to refer to captures, but if speed matters:

"abc".gsub(/(.)/, "\\1\\1")

The "\\1" is always easy to mess up, which is why I generally prefer the block form.

It's only about a factor of two faster, according to the following, but that matters sometimes.

require 'benchmark'

Benchmark.bmbm do |b|
s = "abc" * 1_000_000
re = /(.)/

   b.report("s.gsub(re){...}") do
     s.gsub(re) { |x| x * 2 }
   end

   b.report("s.gsub(re,...)") do
     s.gsub(re, "\\1\\1")
   end
end

__END__

Rehearsal ---------------------------------------------------
s.gsub(re){...} 5.970000 0.000000 5.970000 ( 6.009089)
s.gsub(re,...) 2.900000 0.060000 2.960000 ( 3.148006)
------------------------------------------ total: 8.930000sec

user system total real
s.gsub(re){...} 5.760000 0.010000 5.770000 ( 5.925049)
s.gsub(re,...) 2.800000 0.060000 2.860000 ( 2.914432)

···

--
vjoel : Joel VanderWerf : path berkeley edu : 510 665 3407

Robert_K1 · 7 March 2008 14:49

I know but I prefer to have them in because it's clearer what happens.
Sometimes a single backslash works and sometimes not. IMHO some of
the recurring discussions about the number of backslashes needed for
replacement strings would not happen or be easier and shorter if there
was a clear rule that \x always results in a non escaped string or
error and in order to have a backslash in a string there must be two
in the source. In other words, I would forbid '\1' and instead
require '\\1'.

Kind regards

robert

···

2008/3/6, Sebastian Hungerecker <sepp2k@googlemail.com>:

Robert Klemme wrote:
> You do not need groups:
>
> irb(main):003:0> "abc".gsub /./, '\\&\\&'
> => "aabbcc"

You do not need double backslashes either:

>> "abc".gsub /./, '\&\&'
=> "aabbcc"

--
use.inject do |as, often| as.you_can - without end

Robert_K1 · 7 March 2008 15:02

What version did you test with? I get much more dramatic differences:

15:56:11 ~
$ ruby /c/Temp/gs.rb
Rehearsal -----------------------------------------------------------
s.gsub(re){|x| x * 2} 25.313000 0.031000 25.344000 ( 25.354000)
s.gsub(re){|x| x << x} 22.812000 0.000000 22.812000 ( 22.845000)
s.gsub(re, '\1\1') 6.516000 0.015000 6.531000 ( 6.539000)
s.gsub(re, '\&\&') 6.578000 0.016000 6.594000 ( 6.595000)
s.gsub(/./){|x| x * 2} 25.172000 0.016000 25.188000 ( 25.182000)
s.gsub(/./){|x| x << x} 22.843000 0.000000 22.843000 ( 22.857000)
s.gsub(/(.)/, '\1\1') 6.344000 0.015000 6.359000 ( 6.355000)
s.gsub(/./, '\&\&') 6.188000 0.032000 6.220000 ( 6.217000)
------------------------------------------------ total: 121.891000sec

user system total real
s.gsub(re){|x| x * 2} 25.484000 0.015000 25.499000 ( 25.502000)
s.gsub(re){|x| x << x} 22.813000 0.031000 22.844000 ( 22.856000)
s.gsub(re, '\1\1') 6.312000 0.000000 6.312000 ( 6.337000)
s.gsub(re, '\&\&') 6.359000 0.000000 6.359000 ( 6.359000)
s.gsub(/./){|x| x * 2} 25.922000 0.000000 25.922000 ( 25.994000)
s.gsub(/./){|x| x << x} 22.672000 0.015000 22.687000 ( 22.707000)
s.gsub(/(.)/, '\1\1') 6.375000 0.016000 6.391000 ( 6.389000)
s.gsub(/./, '\&\&') 6.235000 0.000000 6.235000 ( 6.239000)
16:00:22 ~
$ ruby -v
ruby 1.8.6 (2007-03-13 patchlevel 0) [i386-cygwin]
16:00:26 ~
$ cat /c/Temp/gs.rb
require 'benchmark'

s = ("abc" * 1_000_000).freeze
re = /(.)/
re2 = /./

Benchmark.bmbm do |b|
  b.report("s.gsub(re){|x| x * 2}") do
    s.gsub(re) { |x| x * 2 }
  end

  b.report("s.gsub(re){|x| x << x}") do
    s.gsub(re) { |x| x << x }
  end

  b.report("s.gsub(re, '\\1\\1')") do
    s.gsub(re, "\\1\\1")
  end

  b.report("s.gsub(re, '\\&\\&')") do
    s.gsub(re, "\\&\\&")
  end

  b.report("s.gsub(/./){|x| x * 2}") do
    s.gsub(/./) { |x| x * 2 }
  end

  b.report("s.gsub(/./){|x| x << x}") do
    s.gsub(/./) { |x| x << x }
  end

  b.report("s.gsub(/(.)/, '\\1\\1')") do
    s.gsub(/(.)/, "\\1\\1")
  end

  b.report("s.gsub(/./, '\\&\\&')") do
    s.gsub(/./, "\\&\\&")
  end
end
16:00:31 ~
$

Kind regards

robert

···

2008/3/6, Joel VanderWerf <vjoel@path.berkeley.edu>:

Rodrigo Bermejo wrote:
> "abc".gsub(/./) { |x| x * 2 }

That's the most elegant way to refer to captures, but if speed matters:

"abc".gsub(/(.)/, "\\1\\1")

The "\\1" is always easy to mess up, which is why I generally prefer the
block form.

It's only about a factor of two faster, according to the following, but
that matters sometimes.

Rehearsal ---------------------------------------------------
s.gsub(re){...} 5.970000 0.000000 5.970000 ( 6.009089)
s.gsub(re,...) 2.900000 0.060000 2.960000 ( 3.148006)
------------------------------------------ total: 8.930000sec

user system total real
s.gsub(re){...} 5.760000 0.010000 5.770000 ( 5.925049)
s.gsub(re,...) 2.800000 0.060000 2.860000 ( 2.914432)

--
use.inject do |as, often| as.you_can - without end

Sebastian_Hungereck1 · 8 March 2008 15:20

Robert Klemme wrote:

In other words, I would forbid '\1' and instead
require '\\1'.

Then why have single quotes at all?
Personally I'd rather have \ always being a literal backslash inside single
quotes and just live with fact that you can't have single quotes inside
single quotes.

···

--
Jabber: sepp2k@jabber.org
ICQ: 205544826

Joel_VanderWerf1 · 7 March 2008 18:41

Robert Klemme wrote:

What version did you test with? I get much more dramatic differences:

$ ruby -v
ruby 1.8.6 (2007-09-24 patchlevel 111) [i686-linux]

Surprising. The slower cases in your results are all the block cases. Does cygwin ruby perform poorly with blocks in general? Or is it because my ruby is compiled for i686 instead of i386?

···

--
vjoel : Joel VanderWerf : path berkeley edu : 510 665 3407

Todd_Benson · 7 March 2008 20:28

On Fri, Mar 7, 2008 at 9:02 AM, Robert Klemme

What version did you test with? I get much more dramatic differences:

FreeBSD was way faster (probably due to optimized compiling) but the
differences were about the same. This is on the same machine...

FreeBSD:
Rehearsal -----------------------------------------------------------
s.gsub(re){|x| x * 2} 9.523438 0.015625 9.539062 ( 9.875376)
s.gsub(re){|x| x << x} 8.453125 0.031250 8.484375 ( 9.576175)
s.gsub(re, '\1\1') 3.320312 0.039062 3.359375 ( 3.482295)
s.gsub(re, '\&\&') 3.429688 0.000000 3.429688 ( 3.545766)
s.gsub(/./){|x| x * 2} 9.523438 0.015625 9.539062 ( 9.885323)
s.gsub(/./){|x| x << x} 8.460938 0.015625 8.476562 ( 9.092159)
s.gsub(/(.)/, '\1\1') 3.312500 0.007812 3.320312 ( 3.436467)
s.gsub(/./, '\&\&') 3.117188 0.007812 3.125000 ( 3.251097)
------------------------------------------------- total: 49.273438sec

user system total real
s.gsub(re){|x| x * 2} 9.539062 0.007812 9.546875 ( 9.913748)
s.gsub(re){|x| x << x} 8.312500 0.023438 8.335938 ( 8.757943)
s.gsub(re, '\1\1') 3.335938 0.000000 3.335938 ( 3.456838)
s.gsub(re, '\&\&') 3.335938 0.007812 3.343750 ( 3.463104)
s.gsub(/./){|x| x * 2} 9.351562 0.031250 9.382812 ( 9.720232)
s.gsub(/./){|x| x << x} 8.343750 0.023438 8.367188 ( 8.666177)
s.gsub(/(.)/, '\1\1') 3.304688 0.031250 3.335938 ( 3.447005)
s.gsub(/./, '\&\&') 3.093750 0.039062 3.132812 ( 3.404753)

WinXP
Rehearsal -----------------------------------------------------------
s.gsub(re){|x| x * 2} 25.844000 0.235000 26.079000 ( 26.062000)
s.gsub(re){|x| x << x} 22.906000 0.312000 23.218000 ( 23.250000)
s.gsub(re, '\1\1') 8.016000 0.391000 8.407000 ( 8.406000)
s.gsub(re, '\&\&') 8.000000 0.484000 8.484000 ( 8.485000)
s.gsub(/./){|x| x * 2} 25.437000 0.438000 25.875000 ( 25.875000)
s.gsub(/./){|x| x << x} 22.766000 0.437000 23.203000 ( 23.218000)
s.gsub(/(.)/, '\1\1') 7.875000 0.656000 8.531000 ( 8.532000)
s.gsub(/./, '\&\&') 7.609000 0.500000 8.109000 ( 8.109000)
------------------------------------------------ total: 131.906000sec

user system total real
s.gsub(re){|x| x * 2} 25.750000 0.609000 26.359000 ( 26.359000)
s.gsub(re){|x| x << x} 23.469000 0.266000 23.735000 ( 23.734000)
s.gsub(re, '\1\1') 8.500000 0.609000 9.109000 ( 9.109000)
s.gsub(re, '\&\&') 8.296000 0.672000 8.968000 ( 8.969000)
s.gsub(/./){|x| x * 2} 26.032000 0.391000 26.423000 ( 26.422000)
s.gsub(/./){|x| x << x} 23.234000 0.390000 23.624000 ( 23.625000)
s.gsub(/(.)/, '\1\1') 8.734000 0.547000 9.281000 ( 9.282000)
s.gsub(/./, '\&\&') 8.235000 0.656000 8.891000 ( 8.890000)

My personal favorite, though, is still s.gsub(/./) {|x| x * 2}.

Todd

7stud · 8 March 2008 00:56

Robert Klemme wrote:

It's only about a factor of two faster, according to the following, but
that matters sometimes.

Rehearsal ---------------------------------------------------
s.gsub(re){...} 5.970000 0.000000 5.970000 ( 6.009089)
s.gsub(re,...) 2.900000 0.060000 2.960000 ( 3.148006)
------------------------------------------ total: 8.930000sec

                       user system total real
s.gsub(re){...} 5.760000 0.010000 5.770000 ( 5.925049)
s.gsub(re,...) 2.800000 0.060000 2.860000 ( 2.914432)

What version did you test with? I get much more dramatic differences:

15:56:11 ~
$ ruby /c/Temp/gs.rb
Rehearsal -----------------------------------------------------------
s.gsub(re){|x| x * 2} 25.313000 0.031000 25.344000 ( 25.354000)
s.gsub(re){|x| x << x} 22.812000 0.000000 22.812000 ( 22.845000)
s.gsub(re, '\1\1') 6.516000 0.015000 6.531000 ( 6.539000)
s.gsub(re, '\&\&') 6.578000 0.016000 6.594000 ( 6.595000)
s.gsub(/./){|x| x * 2} 25.172000 0.016000 25.188000 ( 25.182000)
s.gsub(/./){|x| x << x} 22.843000 0.000000 22.843000 ( 22.857000)
s.gsub(/(.)/, '\1\1') 6.344000 0.015000 6.359000 ( 6.355000)
s.gsub(/./, '\&\&') 6.188000 0.032000 6.220000 ( 6.217000)
------------------------------------------------ total: 121.891000sec

                              user system total real
s.gsub(re){|x| x * 2} 25.484000 0.015000 25.499000 ( 25.502000)
s.gsub(re){|x| x << x} 22.813000 0.031000 22.844000 ( 22.856000)
s.gsub(re, '\1\1') 6.312000 0.000000 6.312000 ( 6.337000)
s.gsub(re, '\&\&') 6.359000 0.000000 6.359000 ( 6.359000)
s.gsub(/./){|x| x * 2} 25.922000 0.000000 25.922000 ( 25.994000)
s.gsub(/./){|x| x << x} 22.672000 0.015000 22.687000 ( 22.707000)
s.gsub(/(.)/, '\1\1') 6.375000 0.016000 6.391000 ( 6.389000)
s.gsub(/./, '\&\&') 6.235000 0.000000 6.235000 ( 6.239000)
16:00:22 ~
$ ruby -v
ruby 1.8.6 (2007-03-13 patchlevel 0) [i386-cygwin]
16:00:26 ~
$ cat /c/Temp/gs.rb
require 'benchmark'

s = ("abc" * 1_000_000).freeze
re = /(.)/
re2 = /./

Benchmark.bmbm do |b|
  b.report("s.gsub(re){|x| x * 2}") do
    s.gsub(re) { |x| x * 2 }
  end

  b.report("s.gsub(re){|x| x << x}") do
    s.gsub(re) { |x| x << x }
  end

  b.report("s.gsub(re, '\\1\\1')") do
    s.gsub(re, "\\1\\1")
  end

  b.report("s.gsub(re, '\\&\\&')") do
    s.gsub(re, "\\&\\&")
  end

  b.report("s.gsub(/./){|x| x * 2}") do
    s.gsub(/./) { |x| x * 2 }
  end

  b.report("s.gsub(/./){|x| x << x}") do
    s.gsub(/./) { |x| x << x }
  end

  b.report("s.gsub(/(.)/, '\\1\\1')") do
    s.gsub(/(.)/, "\\1\\1")
  end

  b.report("s.gsub(/./, '\\&\\&')") do
    s.gsub(/./, "\\&\\&")
  end
end
16:00:31 ~
$

And taking your non-perl champion:

s.gsub(/(.)/, '\1\1')

(I refuse to consider any code that uses perl syntax), and pitting it
against:

s = 'abc'
new_str = ""
s.each_byte do |byte|
  2.times do
    new_str << byte
  end
end

I get:

gsub:
t1 exec time(1,000,000 loops): 5.625847 total

each_byte:
t2 exec time(1,000,000 loops): 5.325978 total

···

2008/3/6, Joel VanderWerf <vjoel@path.berkeley.edu>:

--
Posted via http://www.ruby-forum.com/\.

Joel_VanderWerf1 · 8 March 2008 04:45

7stud -- wrote:

And taking your non-perl champion:

s.gsub(/(.)/, '\1\1')

(I refuse to consider any code that uses perl syntax), and pitting it against:

s = 'abc'
new_str = ""
s.each_byte do |byte|
  2.times do
    new_str << byte
  end
end

I get:

gsub:
t1 exec time(1,000,000 loops): 5.625847 total

each_byte:
t2 exec time(1,000,000 loops): 5.325978 total

Hm, I don't see an improvement, unless I replace 2.times... with an explicit unrolling of that inner loop (which is heading downhill in the elegance department):

require 'benchmark'

Benchmark.bmbm do |b|
s = "abc" * 1_000_000
re = /(.)/

   b.report("s.gsub(re) {|x| x*2}") do
     s.gsub(re) {|x| x*2}
   end

   b.report("s.gsub(re, '\1\1')") do
     s.gsub(re, '\1\1')
   end

   b.report("7stud1") do
     new_str = ""
     s.each_byte do |byte|
       2.times do
         new_str << byte
       end
     end
   end

   b.report("7stud2") do
     new_str = ""
     s.each_byte do |byte|
       new_str << byte << byte
     end
   end
end

__END__

Rehearsal ---------------------------------------------------
s.gsub(re){...} 6.220000 0.010000 6.230000 ( 6.325230)
s.gsub(re,...) 3.020000 0.060000 3.080000 ( 3.115286)
7stud 4.110000 0.000000 4.110000 ( 4.142912)
7stud 2.050000 0.000000 2.050000 ( 2.072398)
----------------------------------------- total: 15.470000sec

user system total real
s.gsub(re){...} 5.910000 0.020000 5.930000 ( 5.993456)
s.gsub(re,...) 3.000000 0.000000 3.000000 ( 3.017119)
7stud 4.100000 0.020000 4.120000 ( 4.300772)
7stud 2.030000 0.010000 2.040000 ( 2.068190)

···

--
vjoel : Joel VanderWerf : path berkeley edu : 510 665 3407

Topic		Replies	Views
Convert ascii codes in a string to a string of characters ruby-talk	4	156	8 September 2008
Check string format ruby-talk	4	118	22 November 2012
Array question ruby-talk	4	99	27 October 2009
Turning a string into array of ASCII bytes ruby-talk	16	111	17 January 2004
"aaabbccccdadd" => [ [a, 3], [b, 2], [c, 4], [d, 1], [a, 1], [d, 2] ] ruby-talk	3	134	13 December 2011

Duplicating characters in a string

Related topics