Save this Search
     
Sort by:
  • Posted Date
Posted: Within 7 days
Fixed Price: $120 - $150   |  Posted: 21h, 44m ago  |  Ends: 2d, 2h  |   0 Proposals
SCRAPER AND PARSER FOR AKOMANTOSO SAYIT VERSION SENADO AND CAMARA COLOMBIA Example of scrape and parser in python:   [obscured]  /johnfelipe/actas-consejo-medellin/blob/master/scrape.py Example of final sayit akomantoso xml file format:   [obscured]  /acta-205.an   [obscured]  /acta-210.an Php, Python, ROR or any way of scrape and parser for Senado or Camara Hansard, same file or two one for senado and one for camara de representantes. May be used Pupa.rb [4] to scrape the transcripts as well as the speakers (Pupa.rb is based on the Python version of Pupa from OpenCivicData [5]). Formats allowed: RTF, PDF, DOC, DOCX or other Suggestions: pdf2txt like pdftotext 21147.pdf 21147.txt workflow: 1. Download from url (pdf, rtf, doc, docx) 2. Change readeable format may be txt 3. take speakers, take speeches / speaker, take other data in hansard and create akomantoso xml format 4. Upload that xml format with ./manage.py load_akom...
Category: Data Science       
Skills: PHP, Python, Ruby on Rails, XML       
Preferred Location: Central & South America

p****015
 [?]
Sign in to view client's details.
| p****015
|    Colombia
Symbol Key
Payment method not yet verified
Payment verified
Purchased $1-$500
Purchased $500-$5,000
Purchased more than $5,000
You have already submitted a
proposal to this job