Find freelancers. Lose those costly delays.

With 1.7 million freelancers, we'll match you with the perfect talent.

or, Register with Elance »

Crifan Li | Elance
 
176985602337900
Last Sign-in: Feb 12, 2014

Crifan Li

Website Crawl/Scrape,Data Mining via Python/C#/PHP
   China
  |   SuZhou, JiangSu
  |  6:57 pm Local Time

Overview

Minimum Hourly Rate $33

=== For General Client ===
Concentrate on Website Crawl/Scrape, Data Mining, using many different languages and technologies

=== For Technical Client ===
Related and skilled technologies:
[Python]
1.Scrapping libs
(1)fetch html: urllib/urllib2
(2)parse html: BeautifulSoup/lxml
(3)output data: mysql/json/csv/excel
2. Scrapping Framework
(1) Scrapy

[C#]
1.Scrapping libs
(1)fetch html: HttpWebResponse/HttpWebRequest
(2)parse html: HtmlAgilityPack/SgmlReader
(3)output data: mysql/json/csv/excel

[PHP]
1.Scrapping libs
(1)fetch html: curl...

Read More »
May 20, 2013|Other IT & Programming|Private|Completed
May 13, 2013|Other IT & Programming|Private|Completed
|
5.0
May 8, 2013|Database Development|Private|Completed
Mar 24, 2013|Software Application|Private|Completed

View All »

Portfolio

Write Python crawler to extract specific info from website, then save extracted data...
http://www.chaosgroup.com/en/2/purchase.html?g=0&pID=1
using python script, extract specific info from given html file, output in prettified...
use python script, emulate human input info in some website to do search, then...
http://www.gcgis.org/default.html
written by myself, support up to 10 type main stream Chinese Blog, extract all...
http://code.google.com/p/blogs-to-wordpress/
for website fishersci.com, emulate search, then from the search result to extract...
http://www.fishersci.com
for picture share website,yupoo.com, emulate search by tag, found matched pictures...
http://www.yupoo.com/
write a C# winform application, to scrape the fiverr.com search result, then store...
http://fiverr.com
C# winform application, to emulate Google Alert search function, then support export...
http://www.google.com/alerts
write a C# winform scraper to scrape/search the amazon hot new releases data, can...
http://www.amazon.com/gp/new-releases/
C# winform executable, emulate Google Search, also support for each search item, to...
http://www.google.com
python crawler download files(books) from Qisuu website internally including: scraping...
http://www.qisuu.com/
C# winform executable to scrape songtaste.com, to find out the music file real address...
http://www.crifan.com/crifan_released_all/website/dotnet/downloadsongtastemusic/
Python Scraper/Crawler to scrape search result for http://autoexplosion.com/cars/...
http://autoexplosion.com
a C# winform exe file, emulate Amazon search, then check each product is valid or not,...
http://www.amazon.com
for http://www.wheelbynet.com, total four type: auto, rv, moto, boat, for each type, do...
http://www.wheelbynet.com
C# winform application emulate http://chasethefootprint.com/ to add footprint then do...
http://chasethefootprint.com/

Skills (11)

Tested
Crawl
Crawler
Scrape
Scraper
Scrapping
Data mining
Data mine
Python
C#
PHP5
JSON

Service Description

Wide skill experience, but now focus on crawl/scape/content extract/data minning using Python/C#/PHP

Related skills / projects / evidence:
1. Written a book about regular expression:
http://www.crifan.com/files/doc/docbook/regular_expression/release/html/regular_expression.html
2. several projects, whose internal used skills is crawler related:
http://code.google.com/p/blogs-to-wordpress/
http://code.google.com/p/downloadsongtastemusic/
http://code.google.com/p/recsongtastemusic/
3. Especially, I have summary all the website crawler related functions and other common functions into my personal library:
http://code.google.com/p/crifanlib/

Just for tip:
Other software skills
1. More than 3 year Embedded Linux experiences:
have written many books:
http://www.crifan.com/files/doc/docbook/arm_vs_mips/release/html/arm_vs_mips.html
http://www.crifan.com/files/doc/docbook/char_encoding/release/html/char_encoding.html
http://www.crifan.com/files/doc/docbook/dma_pl08x_analysis/release/html/dma_pl08x_analysis.html
http://www.crifan.com/files/doc/docbook/fieldbus_intro/release/html/fieldbus_intro.html
http://www.crifan.com/files/doc/docbook/firmware_download/release/html/firmware_download.html
http://www.crifan.com/files/doc/docbook/hardware_basic/release/html/hardware_basic.html
http://www.crifan.com/files/doc/docbook/interrupt_related/release/html/interrupt_related.html
http://www.crifan.com/files/doc/docbook/linux_nand_driver/release/html/linux_nand_driver.html...

Read More »

Keywords

Crawl
Scrape
Data Mining
Python
C#
PHP
My Snapshot
IT & Programming
4
Elance Level
Level represents activity and experience on Elance. Freelancers start at Level 1 and achieve higher levels through their work. A higher "Level" indicates greater earnings, ratings and other achievements on Elance. Learn More »
  • 12 months
  • Lifetime
Jobs
3
4
0
Total
Milestones
Hours
Reviews
5.0
Recommend
Clients
Total
Repeat
Earnings
Private
Private
Total
Per Client
Identity
Username
crifan
Type
Individual
Member Since
October 2012
Elance URL
Verifications
0
Crifan Li | Elance

Crifan Li