
data pre-processing

IT & Programming > Search Engine Optimization


The client has made the following changes to the job.


Job awarded.

Nov 21, 2012

Job awarded.

Nov 17, 2012

Job Description



  • Posted: Fri, Nov 16, 2012
  • Time Left: Closed
  • Location: Anywhere
  • Start: Immediately
  • Budget: Not Sure
  • Fixed Price Job
  • Elance Escrow Protection
  • W9 Not Required

Hello, please have a look at the attached file. I need you to implement the first stage only: data pre-processing.

Data normalization: continuous features need to be converted to discrete ones. Make sure that all values are in numerical format.
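As a rough illustration of the normalization step, here is a minimal Python sketch. It assumes equal-width binning for continuous columns and integer codes for symbolic (text) columns; the posting does not say which discretization method is wanted, and the function names and the `n_bins` parameter are hypothetical. The class label column should be kept out of `rows`.

```python
from typing import List, Sequence


def encode_symbolic(column: Sequence[str]) -> List[int]:
    """Map each distinct string to an integer code, in order of first appearance."""
    codes = {}
    return [codes.setdefault(value, len(codes)) for value in column]


def discretize_equal_width(column: Sequence[float], n_bins: int = 10) -> List[int]:
    """Assign each continuous value to one of n_bins equal-width bins (0 .. n_bins - 1)."""
    lo, hi = min(column), max(column)
    if hi == lo:
        # A constant column carries no information; put everything in bin 0.
        return [0] * len(column)
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it into the last bin.
    return [min(int((x - lo) / width), n_bins - 1) for x in column]


def preprocess(rows: Sequence[Sequence[str]], n_bins: int = 10) -> List[List[int]]:
    """Turn raw string records into all-integer records, column by column."""
    processed_columns = []
    for column in zip(*rows):
        try:
            numeric = [float(value) for value in column]
        except ValueError:
            # At least one value is not a number: treat the column as symbolic.
            processed_columns.append(encode_symbolic(column))
        else:
            processed_columns.append(discretize_equal_width(numeric, n_bins))
    return [list(record) for record in zip(*processed_columns)]
```

Equal-width binning is only one option; equal-frequency or entropy-based binning would slot into the same place if the attached specification calls for it.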

Use an entropy-based feature selection method to select the attributes and remove the redundant ones.
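One common reading of "entropy-based feature selection" is ranking features by information gain with respect to the class label. The sketch below assumes that reading, and it assumes that "redundant" means a gain below a small threshold; neither is confirmed by the posting. The names `select_features` and `min_gain` are hypothetical, and the features are expected to be discrete already.

```python
import math
from collections import Counter, defaultdict
from typing import Hashable, List, Sequence


def entropy(values: Sequence[Hashable]) -> float:
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def information_gain(feature: Sequence[Hashable], labels: Sequence[Hashable]) -> float:
    """Reduction in label entropy obtained by splitting on the feature's values."""
    groups = defaultdict(list)
    for value, label in zip(feature, labels):
        groups[value].append(label)
    n = len(labels)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_features(rows: Sequence[Sequence[Hashable]],
                    labels: Sequence[Hashable],
                    min_gain: float = 0.01) -> List[int]:
    """Return the column indices whose information gain is at least min_gain."""
    gains = [information_gain(column, labels) for column in zip(*rows)]
    return [index for index, gain in enumerate(gains) if gain >= min_gain]
```

This filter only drops features that say little about the label. Detecting features that are redundant with each other would need an extra pairwise measure, which is left out here.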

Apply the k-means clustering algorithm to the given dataset to split the data records into a normal cluster and anomalous clusters. Set the number of clusters to five. The anomalous clusters are U2R, R2L, PROBE, and DoS. Label each record with its cluster index. Then divide the dataset into two parts: one for training and the other for evaluation.
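The clustering and splitting steps could look roughly like the following sketch: a plain Lloyd-style k-means with k = 5, followed by a shuffled split. The random initialization, the 70/30 split ratio, and all function names are assumptions made for illustration. Note that k-means assigns arbitrary indices 0 to 4; mapping an index to "normal", U2R, R2L, PROBE, or DoS is a separate step that this sketch does not attempt.

```python
import random
from typing import List, Sequence, Tuple


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points: Sequence[Sequence[float]], k: int = 5,
           max_iter: int = 100, seed: int = 0) -> Tuple[List[int], List[List[float]]]:
    """Cluster points into k groups; return (cluster index per point, centroids)."""
    rng = random.Random(seed)
    # Assumption: initialise centroids from k randomly chosen records.
    centroids = [list(p) for p in rng.sample(list(points), k)]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the algorithm has converged
        labels = new_labels
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels, centroids


def train_eval_split(records: Sequence, train_fraction: float = 0.7,
                     seed: int = 0) -> Tuple[list, list]:
    """Shuffle the records and cut them into a training part and an evaluation part."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]
```

A typical use would pair each record with its cluster index, for example `labeled = list(zip(points, labels))`, and then pass `labeled` to `train_eval_split`.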

Download the data below from: [obscured] /databases/kddcup99/kddcup99.html

training data:
kddcup.data_10_percent.gz A 10% subset. (2.1M; 75M Uncompressed)
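For reference, a gzip-compressed file like this can be read directly in Python without unpacking it first. The sketch below assumes the file is comma-separated with the class label in the last field and a trailing period on the label (for example `normal.`); that layout is an assumption to verify against the downloaded data. The function name `load_kdd` and the `limit` parameter are hypothetical.

```python
import csv
import gzip
from typing import List, Optional, Tuple


def load_kdd(path: str, limit: Optional[int] = None) -> Tuple[List[List[str]], List[str]]:
    """Read a gzip-compressed CSV file into (feature rows, labels)."""
    features, labels = [], []
    with gzip.open(path, "rt", newline="") as handle:
        for index, row in enumerate(csv.reader(handle)):
            if limit is not None and index >= limit:
                break
            if not row:
                continue  # skip blank lines
            features.append(row[:-1])
            # Assumption: the last field is the label and may end with a period.
            labels.append(row[-1].rstrip("."))
    return features, labels
```

Passing a small `limit` is a convenient way to try the pre-processing steps on a sample before running them on all 75M of uncompressed data.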



Job ID: 35269570

 Israel  |  
I am an experienced and highly educated algorithm developer. My main field is statistical analysis, especially classification, of neural signals and...
 0.0   |  $0 Earnings   |  0 Jobs
Bid ID: 35316436  |  Submitted: Nov 19, 2012 06:26 ET 
Within 1 week

 Spain  |  
 0.0   |  Private   |  0 Jobs
Bid ID: 35299034  |  Submitted: Nov 18, 2012 13:23 ET 
Proposal SEALED

TAS has a solid team of programmers that covers a wide spectrum of languages and technologies. We believe in hard work and put a lot of value in...
 0.0   |  Private   |  0 Jobs
Bid ID: 35270631  |  Submitted: Nov 17, 2012 00:32 ET 
Proposal SEALED