Principal Investigator: Ronitt Rubinfeld
Project Website: http://bigdata.csail.mit.edu/?q=node/44
The goal of this project is to develop powerful algorithmic sampling techniques that allow one to estimate parameters of the data by viewing only a minuscule portion of it. Such parameters may be combinatorial, such as whether a large network has the "six degrees of separation" property; algebraic, such as whether the data is well approximated by a linear function; or distributional, such as whether the data comes from a distribution over a large number of distinct elements.
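
To make the algebraic case concrete, here is a minimal sketch of one classic sampling-based check, the Blum-Luby-Rubinfeld (BLR) linearity test over bit vectors. It is offered as an illustration only and is not necessarily the technique this project uses. The function names (`blr_linearity_test`, `parity_on`, `majority_of_low3`), the encoding of inputs as integers whose bits form a vector over GF(2), and the default number of trials are all assumptions made for the example.

```python
import random


def blr_linearity_test(f, n_bits, trials=100, rng=None):
    """Sampling-based check of whether f: {0,1}^n -> {0,1} looks linear.

    Inputs are encoded as integers; XOR plays the role of vector addition
    over GF(2). Each trial reads f at only three points, so the total
    number of evaluations is 3 * trials, independent of the 2**n_bits
    possible inputs.
    """
    rng = rng or random.Random()
    for _ in range(trials):
        x = rng.getrandbits(n_bits)
        y = rng.getrandbits(n_bits)
        # A linear function must satisfy f(x) + f(y) = f(x + y) over GF(2).
        if f(x) ^ f(y) != f(x ^ y):
            return False  # found a violated triple: f is not linear
    return True  # no violation seen: f is linear or close to linear


def parity_on(mask):
    """Hypothetical linear example: parity of the bits selected by mask."""
    return lambda x: bin(x & mask).count("1") % 2


def majority_of_low3(x):
    """Hypothetical non-linear example: majority vote of the three lowest bits."""
    return 1 if bin(x & 0b111).count("1") >= 2 else 0


if __name__ == "__main__":
    rng = random.Random(0)
    print(blr_linearity_test(parity_on(0b1011), n_bits=20, trials=200, rng=rng))
    print(blr_linearity_test(majority_of_low3, n_bits=20, trials=200, rng=rng))
```

A truly linear function passes every trial, so a rejection is always conclusive. An acceptance only indicates that the function agrees with some linear function on most inputs, and the confidence in that conclusion grows with the number of trials.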