[ANN] mp3scrape

a11 · 1 August 2008 04:10

NAME
mp3scrape

SYNOPSIS
mp3scrape uri [options]+

DESCRIPTION
   mp3scrape will scour any url for its mp3 content - the script mirrors,
   never downloading the same file twice. it does not, however, crawl a
   website for links; it simply scrapes all the songs from a single page.
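
To make the scrape-and-mirror behaviour described above concrete, here is a minimal Ruby sketch. It is not the gem's actual source. It assumes the default --pattern listed under PARAMETERS, and it assumes that "never downloading the same file twice" means skipping files that already exist in the destination directory. The helper names `mp3_urls` and `pending_downloads` are hypothetical.

```ruby
require 'uri'

# Default pattern copied from the --pattern parameter. The dot before
# "mp3" is unescaped there, so it matches any character; kept as-is.
MP3_PATTERN = %r{['"](http://[^\s]+[^/\s]+.mp3)["']}

# Hypothetical helper: unique mp3 URLs found in an HTML string,
# in order of first appearance.
def mp3_urls(html, pattern = MP3_PATTERN)
  html.scan(pattern).flatten.uniq
end

# Hypothetical helper: keep only the URLs whose file name is not
# already present in dir (assumed meaning of "mirrors").
def pending_downloads(urls, dir)
  urls.reject do |url|
    File.exist?(File.join(dir, File.basename(URI(url).path)))
  end
end
```

Only the URLs returned by `pending_downloads` would then need to be fetched.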

PARAMETERS
   uri (1 -> uri(uri))
       the uri to scrape
   --pattern=pattern, -p (0 ~> pattern=['"](http://[^\s]+[^/\s]+.mp3)["'])
       specify the mp3 pattern
   --basedir=basedir, -b (0 ~> basedir=/Users/ahoward/mp3)
       specify the base download dir - default(/Users/ahoward/mp3)
   --destination=destination, -d (0 ~> destination)
       specify the absolute download dir -
       default(/Users/ahoward/mp3/auto-based-on-uri)
   --list
       only list the mp3s that would be scraped
   --threads=threads, -t (0 ~> integer(threads=8))
       specify the number of threads to download with in parallel -
       default(8)
   --noop, -n
       show the downloads that would be performed
   --help, -h
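
The --threads option above implies a fixed number of workers sharing one list of downloads. A minimal Ruby sketch of that idea follows; it is an assumption about the general technique, not the gem's implementation, and `each_in_parallel` is a hypothetical name. The block passed in stands in for whatever performs a single download.

```ruby
require 'thread'

# Hypothetical helper: call the block once per item, using `count`
# worker threads that pull from a shared queue (default 8, as in --threads).
def each_in_parallel(items, count = 8, &block)
  queue = Queue.new
  items.each { |item| queue << item }

  workers = Array.new(count) do
    Thread.new do
      loop do
        item = begin
          queue.pop(true) # non-blocking pop; raises ThreadError when empty
        rescue ThreadError
          break           # queue drained, this worker is done
        end
        block.call(item)
      end
    end
  end

  workers.each(&:join)
end
```

A call such as `each_in_parallel(urls, 8) { |url| download(url) }` would then spread the work over eight threads, where `download` is a placeholder for the real fetch.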

INSTALL
sudo gem install mp3scrape

URI
http://codeforpeople.com

EXAMPLES
1) get a bunch of xmas tunes
mp3scrape http://fuelfriends.blogspot.com/2007/12/christmas-mixery.html

2) get a bunch of tunes
mp3scrape http://troubledsoulsunite.blogspot.com/

a @ http://codeforpeople.com/

--
we can deny everything, except that we have the possibility of being better. simply reflect on that.
h.h. the 14th dalai lama

Michael_Fellinger1 · 4 August 2008 01:18

Nice, I've been doing similar stuff with a script of mine:
http://p.ramaze.net/1818

I would use it like this:
curl -s http://fuelfriends.blogspot.com/2007/12/christmas-mixery.html |
  uris | egrep '\.mp3$' | xargs wget

But thanks, I'll check this one out
^ manveru
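
For readers without access to the pasted script, the filtering steps of that pipeline can be sketched in Ruby. This is a guess at what a `uris`-style filter does (pull every http or https URI out of text), not the contents of the script linked above; `extract_uris` is a hypothetical name.

```ruby
# Hypothetical stand-in for a `uris` filter: return every http(s) URI
# found in the text, stopping at whitespace, quotes, or angle brackets.
def extract_uris(text)
  text.scan(%r{https?://[^\s"'<>]+}).uniq
end

# The egrep '\.mp3$' step corresponds to a grep on the result, e.g.:
#   extract_uris($stdin.read).grep(/\.mp3$/)
```

The resulting list is what would be handed to wget in the shell version.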

On Fri, Aug 1, 2008 at 1:10 PM, ara howard <ara.t.howard@gmail.com> wrote:

Joel_VanderWerf1 · 7 August 2008 00:08

ara howard wrote:

  mp3scrape will scour any url for its mp3 content - the script mirrors,
  never downloading the same file twice. it does not, however, crawl a
  website for links; it simply scrapes all the songs from a single page.

Might be nice as a firefox plugin, too.

--
vjoel : Joel VanderWerf : path berkeley edu : 510 665 3407

Jos_Backus1 · 9 September 2008 17:05

Thanks for this gem, Ara.

Attached is a patch which changes mp3scrape to use Hpricot. It also prepends
the supplied URL to relative URLs (that do not match with /^http:/). This
allows mp3scrape to be used on pages such as

http://staff.telkomsa.net/packages/SRPMS/openldap/

to download RPMs instead. Please let me know what you think.

mp3scrape.patch (1.24 KB)
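
The attached patch is not reproduced in this thread, so the following is only a sketch of the relative-URL handling described above. It uses the standard library's `URI.join` rather than plain string prepending, and it leaves Hpricot out entirely; `absolutize` is a hypothetical name.

```ruby
require 'uri'

# Hypothetical helper: leave links that already match /^http:/ alone
# and resolve everything else against the page URL.
def absolutize(base, hrefs)
  hrefs.map do |href|
    href =~ /^http:/ ? href : URI.join(base, href).to_s
  end
end
```

With a base that ends in a slash, such as the directory listing above, `URI.join` gives the same result as simple prepending for plain file names, and it also copes with links like `../other/file.rpm`.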

--
Jos Backus
jos at catnook.com

a11 · 4 August 2008 16:32

cool. my next plan is to make it crawl - but limiting is a big issue... rfc

a @ http://codeforpeople.com/

On Aug 3, 2008, at 7:18 PM, Michael Fellinger wrote:

Nice, I've been doing similar stuff with a script of mine:
http://p.ramaze.net/1818

I would use it like this:
curl -s http://fuelfriends.blogspot.com/2007/12/christmas-mixery.html |
  uris | egrep '\.mp3$' | xargs wget

But thanks, I'll check this one out
^ manveru


Michael_Fellinger1 · 7 August 2008 12:42

See http://www.downthemall.net/

^ manveru

On Thu, Aug 7, 2008 at 9:08 AM, Joel VanderWerf <vjoel@path.berkeley.edu> wrote:

Might be nice as a firefox plugin, too.
