Fixed Price: Not Sure
| Posted: Apr 22, 2015 | Ends: 20h, 36m |
I have some large files that need to be cleaned or transformed. For example, the Freebase download has a number of issues with date times in its large Gzipped dump file. Some dates and times are inserted as "T00:00" but most parsers fail with errors like : "T00:00" is not a valid value for datatype [obscured] /2001/XMLSchema#dateTime You would need to create a tool to correct all of these issues in the data Other errors are things that are simple like: IRI includes string escapes: '\92' I also need a tool to convert many TSV (tab separated files) that contains Triples into N-Triples, Turtle or RDF XML format.
Category: Other IT & Programming
Hourly Rate: Not Sure
| Duration: 1-3 months
| Posted: Apr 08, 2015 | Ends: 1d, 23h |
This job is focused on advancement of the experience that thousands of users get navigating, browsing, searching and comparing the content offered through our proprietary technology platform. The end-result will be creation of intuitive and comprehensive multi-level navigation structures (hierarchical taxonomies, facets) for browsing and searching the content offered to our clients. Key tasks: - Creation of several core multi-level hierarchies (tens of thousands of elements) structuring and comprehensively describing the content in our system - Definition, structuring and optimization of hierarchical data sets, definition and maintenance of hierarchical relationships of particular terms (facets) - Conducting research (independent, as well as guided by management team) on publicly available data related to the content of the platform from public (international standards, patent databases, public and government databases, various organizational, available XML datasets, etc.), as well as...