176985602337900

Find freelancers. Lose those costly delays.

With 1.7 million freelancers, we'll match you with the perfect talent.

or, Register with Elance »

Retrieve English Epub Titles from Gutenberg.org
Sign in to Add to Watch List

IT & Programming > Other IT & Programming

View Change History

The client has made the following changes to the job.

Description
Date

Job awarded.

May 7, 2012
Close

Job Description

  |  Change History >>

Updated

Close
  • Posted: Tue, May 01, 2012
  • Time Left: Closed
  • Location: Anywhere
  • Client prefers freelancers from:
    Anywhere

    You are still able to submit a proposal for this job.

    The client prefers freelancers from
    a different location.

    You're still able to submit a proposal for this job, regardless of your location.
  • Start: Immediately
  • Budget: $500 - $1,000
  • Fixed Price Job
  • Elance Escrow Protection
  • W9 Not Required
Sign in to view client's details

Hi there,

I'm looking for a skill coder to scrape all available english titles from Project Gutenberg. The contents of Gutenberg.org is available via a very large RDF file. The coder should be able to parse the 200+ mb file and extract the needed data. If the data is not available via this file, then using the reference below, the coder should be able to whip up a scraper to grab it.

  [obscured]  /wiki/Gutenberg:Feeds

After looking at the RDF files, it appears we can use the catalog.rdf file to get the english books, then get the individual RDF files for the remaining data - available here -   [obscured]  /ebooks/12345.rdf (Replace 12345 with the ebook no. you are interested in.)

The required data will be placed into a CSV file with the following columns:

- Book no. (numeric value gutenberg assigns each book)
- Book title
- Author full name
- Author first name (some parsing required from full name)
- Author last name (some parsing required from full name)
- ePub URL (location ...

Sign in or Register to see more

Job ID: 30249164
Proposals
Avg $ | High $ | Low $ — Show Pricing
  • Submit Date (Latest)

 Canada  |  
I am a software engineer with 10+ years of experience designing and building everything from websites, query languages, and associative database...
1
  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 30324974  |  Submitted: May 4, 2012 16:35 ET 
Proposal SEALED

Openzki Unit - Aruhat Technol...      
One or more team members at  are verified.  Learn More
 India  |  
Introduction Openzki, a unit of Aruhat Technologies Pvt. Ltd., been formed to serve global customers with a basket of multiple business purpose...
1
  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 30290764  |  Submitted: May 3, 2012 06:46 ET 
Proposal SEALED

Shenzhen XingShang is located in the Futian District E-Commerce Industrial Park in Shenzhen, Guangdong. It currently has 13 engineers that are...
1
  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 30285899  |  Submitted: May 3, 2012 02:11 ET 
Proposal SEALED

 India  |  
We have Good experience in designing of ePUB, Fixed Layout ePUB, ePUB 3, Kindle & Kindle Fire. We expertise and experience in all phases of project...
1
  |  
 0.0   |  $0 Earnings   |  0 Jobs
Bid ID: 30270739  |  Submitted: May 2, 2012 11:30 ET 
Proposal SEALED

 Russia  |  
Wordpress / Woocommerce / API expert. Bespoke complex backends / integrations.
10
  |  
 4.9   |  Private   |  71 Jobs
Bid ID: 30268841  |  Submitted: May 2, 2012 10:08 ET 
Proposal SEALED

 Kenya  |  
Hi, I'm an IT Security Consultant, skilled in UNIX and Linux based operating systems, TCP/IP based network services with software development...
1
  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 30267622  |  Submitted: May 2, 2012 09:08 ET 
Proposal SEALED

 Egypt  |  
- Over 12 years of software development professional experience. - Excellent Python development skills. - Excellent knowledge of development under...
4
  |  
 5.0   |  Private   |  2 Jobs
Bid ID: 30266590  |  Submitted: May 2, 2012 08:19 ET 
Proposal SEALED

 Romania  |  
GeniiWeb provides advanced server-side programming, safe web applications and top quality solutions for your business. Specific technology and...
11
  |  
 5.0   |  $34,780 Earnings   |  8 Jobs   |  4 verified credential(s)
Bid ID: 30266313  |  Submitted: May 2, 2012 08:03 ET 
Proposal SEALED

 Philippines
Expertise in web scraping, web crawling, data mining, data extraction, and mailing list development using php, wget, curl, perl, and bash in linux...
1
  |  
 4.4   |  Private   |  2 Jobs
Bid ID: 30264712  |  Submitted: May 2, 2012 06:41 ET 
Proposal SEALED

 Croatia (Hrvatska)  |  
Experienced developer and consultant specialized in Internet business and technologies, provides safe outsourcing services, secure processes and...
2
  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 30263482  |  Submitted: May 2, 2012 05:27 ET 
Proposal SEALED

 Pakistan
Winner
Do you want an expert in web scraping,data mining,automation who delivers superior-quality results, on-time, and without busting your budget? Try...
1
  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 30259251  |  Submitted: May 2, 2012 01:26 ET 
Delivery
Within 1 week
$500.00

 India  |  
INVITED
*** One of the top 5 "Web Extraction/Scraping/Crawling" experts on Elance. I provide the output with fast turnaround and within the budget. I am...
6
  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 30250192  |  Submitted: May 1, 2012 15:11 ET 
Proposal SEALED

 United States  |  
Five years experience coding in the .Net framework and JAVA. Skilled in both desktop and web application development, database modeling and design....
2
  |  
 0.0   |  $0 Earnings   |  0 Jobs
Bid ID: 30249739  |  Submitted: May 1, 2012 14:52 ET 
Proposal SEALED
Sign in to Elance and start working on jobs today.
Sign in to view more of the job details and submit a proposal. Once registered, you'll have access to thousands of jobs online or through email.
Are you ready to post a job like this one?
Post a Similar Job »