Events & News
Colloquium
Category | Lecture |
Date | Tuesday, December 10, 2013 |
Time | 11:00 am |
Concludes | 12:15 pm |
Location | Gould-Simpson 906 |
Details | Faculty Host: John Hartman Coffee and light refreshments to be served in the atrium at 10:45am. |
Speaker | Carlos Scheidegger |
Affiliation | AT&T Labs-Research |
How do you look at a billion data points? - An Exploratory Visualization for Big Data
Consider exploration of large multidimensional spatiotemporal datasets with billions of entries. Are certain attributes correlated spatially or temporally? How do we even look at data of this size? In this talk, I will present the techniques and algorithms to compute and query a nanocube, a data structure that enables interactive visualizations of data sources in the range of billions of elements.
Data cubes are widely used for exploratory data analysis. Although they are sometimes assumed to take a prohibitively large amount of space (and to consequently require disk storage), nanocubes fit in a modern laptop's main memory, even for hundreds of millions of entries. I will present live demos of the technique on a variety of real-world datasets, together with comparisons to the previous state of the art with respect to memory, timing, and network bandwidth measurements.
Nanocubes merge database technology and visualization, and increase by two orders of magnitude the scale of data that can be quickly visualized. Time permitting, I will touch upon other work in the same spirit; a novel approximate numerical linear algebra algorithm that has applications in graph visualization, and a novel trajectory clustering and visualization algorithm.
Biography
Carlos Scheidegger is a member of the technical staff at AT&T Labs–Research, in the Information Visualization group. He works in data visualization, data analysis, computer graphics, and geometry processing. Mr. Scheidegger completed his PhD in Computing at the University of Utah in 2009.