Entry Date:
August 29, 2012

Social: TwitInfo

Principal Investigator David Karger

Co-investigators Robert Miller , Samuel Madden


TwitInfo extracts a series of tweets that match a keyword from Twitter and arranges them on a timeline, provide a quick summary of a collection of Tweets on topic in a simple visualization. The key idea is to identify “peaks” in the frequency of tweets that represent interesting occurrences in time (e.g., points scored in a sporting event, or a major speech by a politician), and then assign labels to peaks using information retrieval techniques. A related system, called TweeQL is used to implement TwitInfo; TweeQL provides a SQL-like streaming language for running queries over the Twitter stream in real time.