Skip to content

Latest commit

 

History

History
18 lines (15 loc) · 967 Bytes

README.md

File metadata and controls

18 lines (15 loc) · 967 Bytes

BigDataUtils

Build Status

A few prototypes of various utilities to deal with big datasets. Specially throught online algorithms.

##Utilities

  • MassMean - Utility to compute the mean, variance or std.Dev of a continuous stream of values, in an efficient way. While also being capable of removing some values from the calculation on a later date.
  • NeuralNetwork - Utility for classifiying.
  • KMeans - Utility for clustering.
  • InsertionSort - Utility for keeping a list always sorted.
  • TimedList - Utility to keep a list of items for a given time.
  • BuyOrRent - Utility to determine weather to 'buy' or to 'rent' - ski rental problem.
  • WeightedMajority - Utility to choose the best prediction from a pool of predictions.
  • Winnow - Utility for classifiying.

##TODOS

  • Improve performance of NeuralNetwork class.