Tuned Trie in C or Python
IT & Programming > Other IT & Programming

Job Description

  • Posted: Mon, Sep 10, 2012
  • Time Left: Closed
  • Location: Anywhere
  • Client prefers freelancers from: Anywhere
  • Start: Immediately
  • Budget: Less than $500
  • Fixed Price Job
  • Elance Escrow Protection
  • W9 Not Required
I have a large amount of data in a flat file, which is just strings separated by newlines like this:

12314323413231\n
431234214323\n
4243123121122123\n
etc..

All I need to ask of my data is "is string xyz in the dataset?", also known as candidacy checking. The dataset is never added to, nor subtracted from, once it is created.
The problem is that I need this candidacy checking to happen very fast, many times per second, and the size of my data is several gigabytes. When I took a small sample of my data (2.5 GB) and put it into an indexed SQLite database, the resulting database was 10 GB and was too slow. The slowness was almost entirely due to the fact that at 10 GB the dataset had to be stored on disk rather than in memory, and caching cannot speed up my query times because the strings being queried are essentially random.

The plan, therefore, is to put my data into a trie data structure, which will hopefully be far more compact and able to fit into memory. To make the...
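
For illustration only, here is a minimal Python sketch of the general idea: build a trie from newline-separated strings, then answer "is string xyz in the dataset" queries. The rest of the specification is cut off above, so this is not the client's tuned design. The names `DigitTrie` and `load_trie` are hypothetical, and the sample data is the three example lines from the description. A trie built from nested Python dicts is unlikely to be compact enough for several gigabytes of data; a tuned version would presumably need a packed layout, for example in C.

```python
import io


class DigitTrie:
    """Minimal dict-based trie for exact membership tests (illustrative only)."""

    # Sentinel key marking "a stored string ends here". Iterating a string
    # yields one-character keys, so the empty string can never collide.
    _END = ""

    def __init__(self):
        self.root = {}

    def add(self, s):
        # Walk down the trie, creating child nodes as needed.
        node = self.root
        for ch in s:
            node = node.setdefault(ch, {})
        node[self._END] = True

    def __contains__(self, s):
        # Follow one edge per character; fail as soon as an edge is missing.
        node = self.root
        for ch in s:
            node = node.get(ch)
            if node is None:
                return False
        # A prefix of a stored string is not itself a member.
        return self._END in node


def load_trie(lines):
    """Build a trie from an iterable of newline-terminated strings."""
    trie = DigitTrie()
    for line in lines:
        s = line.rstrip("\n")
        if s:  # skip blank lines
            trie.add(s)
    return trie


# Sample data taken from the job description; a real run would pass an
# open file object instead of this in-memory stand-in.
sample = io.StringIO("12314323413231\n431234214323\n4243123121122123\n")
trie = load_trie(sample)

print("431234214323" in trie)
print("4312" in trie)
```

Because the dataset never changes after it is created, the structure can be built once and then treated as read-only, which leaves room for a more compact static encoding than the mutable dicts used here.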

Desired Skills
C, Data Structures, Python, C#, C++
Job ID: 33421265
Proposals
Winner  |  United States
RedHat Certified Engineer (RHCE), 15+ years of experience in Linux/UNIX/Mac OS X and Web services administration. Scripting guru and Internet...
1  |  0.0  |  Private  |  0 Jobs
Bid ID: 33421703  |  Submitted: Sep 10, 2012 15:14 ET
Delivery: Within 3 days  |  $200.00