DESCRIPTION
mp3scrape will scour any url for it's mp3 content - the script mirrors,
never downloading the same file twice. it does not, however, crawl a
website for links, it simple scapes all the songs from a single page.
PARAMETERS
uri (1 -> uri(uri))
the uri to scrape
--pattern=pattern, -p (0 ~> pattern=['"](http://[^\s]+[^/\s]+.mp3)["'])
specifiy the mp3 pattern
--basedir=basedir, -b (0 ~> basedir=/Users/ahoward/mp3)
specifiy the base download dir - default(/Users/ahoward/mp3)
--destination=destination, -d (0 ~> destination)
specifiy the absolute download dir -
default(/Users/ahoward/mp3/auto-based-on-uri)
--list
only list the mp3s that would be scraped
--threads=threads, -t (0 ~> integer(threads=8))
specify the number of threads to download with in parallel -
default(8)
--noop, -n
show the downloads that would be performed
--help, -h
DESCRIPTION
mp3scrape will scour any url for it's mp3 content - the script mirrors,
never downloading the same file twice. it does not, however, crawl a
website for links, it simple scapes all the songs from a single page.
PARAMETERS
uri (1 -> uri(uri))
the uri to scrape
--pattern=pattern, -p (0 ~> pattern=['"](http://[^\s]+[^/\s]+.mp3)["'])
specifiy the mp3 pattern
--basedir=basedir, -b (0 ~> basedir=/Users/ahoward/mp3)
specifiy the base download dir - default(/Users/ahoward/mp3)
--destination=destination, -d (0 ~> destination)
specifiy the absolute download dir -
default(/Users/ahoward/mp3/auto-based-on-uri)
--list
only list the mp3s that would be scraped
--threads=threads, -t (0 ~> integer(threads=8))
specify the number of threads to download with in parallel -
default(8)
--noop, -n
show the downloads that would be performed
--help, -h
mp3scrape will scour any url for it's mp3 content - the script mirrors,
never downloading the same file twice. it does not, however, crawl a
website for links, it simple scapes all the songs from a single page.
Might be nice as a firefox plugin, too.
···
--
vjoel : Joel VanderWerf : path berkeley edu : 510 665 3407
Attached is a patch which changes mp3scrape to use Hpricot. It also prepends
the supplied URL to relative URLs (that do not match with /^http:/). This
allows mp3scrape to be used on pages such as
On Thu, Aug 7, 2008 at 9:08 AM, Joel VanderWerf <vjoel@path.berkeley.edu> wrote:
ara howard wrote:
mp3scrape will scour any url for it's mp3 content - the script mirrors,
never downloading the same file twice. it does not, however, crawl a
website for links, it simple scapes all the songs from a single page.