The client has made the following changes to the job.
Client prefers freelancers from:
You are still able to submit a proposal for this job.
The client prefers freelancers from
a different location.
I need to scrape recipes from the internet--a lot of them.The total number is probably in the neighborhood of >1 million recipes. I am not interested in the recipes but the ingredients they contain. So you would probably need to scrape the pages as whole blocks of text and then parse out the individual ingredients. This parse script would remove things like quantities, measures (teaspoon, grams, etc.), prep (chilled, diced, sliced).
Since I originally looked into this project a number of recipe search engines have propagated the web. Rather than write countless scripts unique to each site would it be possible to index these search engines?
Sign in or Register to see more