Data Mining in the Streaming Model; Approximating Massive Matrices

Seminar: 
Applied Mathematics
Event time: 
Thursday, October 25, 2012 - 10:00am to 11:00am
Location: 
AKW 200
Speaker: 
Edo Liberty
Speaker affiliation: 
Yahoo
Event description: 

This talk will briefly introduce the streaming computational model in the context of data mining. We will focus on working with matrices that are revealed over time and are too large to store. Working with such massive matrices requires creating a concise yet accurate approximation for them. These are called matrix sketches. This talk will shortly survey new results for matrix sketching in two streaming models.

In the first, the matrix is presented to the algorithm one entry at a time. Examples include recommender systems whose input is of the form user i rated item j 3 stars. In the second, the matrix is revealed row by row, for example, document i contains terms j_1,…,j_k. This is the case in web crawling where the crawler cannot store all the documents it visits.

Special note: 
*SEMINAR BEGINS AT 2:00 P.M.*