Leveraging the Data Sciences
Thursday, November 14, 2013
3:00 p.m. - 4:15 p.m.
University Center, Carnegie Mellon University
We live in a world awash in data. There are close to 7 billion mobile phones in use in the world today. Many of these phones come equipped with a variety of sensors such as cameras and devices like GPS and produce vast amounts of data. These data are shared via social media sites such as Facebook (just over a billion users) and Twitter (.5 billion users), which generate over half a petabyte of structured, unstructured and image data every 24 hours.
Search engines such as Bing and Google index upwards of 4 billion pages on the web. From the cyber world to the physical world, the data and analytics technologies that permit their large scale analysis are transforming decision making in every sector of the economy. They are reshaping the design of systems that deliver key services such as transportation and public safety and enable the creation of smart infrastructure such as roads, bridges and buildings that we rely on every day of our lives.
Similar vignettes of transformation can be sketched in other domains with significant societal impact, such as in healthcare, where large databases of economic, biomedical and genomic data have become available. In science, continually improving data recording and storage facilities are enabling a new style of investigation of complex systems using novel machine learning algorithms and statistical methods. The Sloan Digital Sky Survey, for example, has transformed astronomy.
These transformations are being made possible by Data Science. Data Science involves the design and collection of large-scale data, as well as instrumentation, protection and analysis of the data. As such, it draws on a number of disciplines and sciences — the social sciences, to understand data about people, their behaviors, and the decisions they make; operations research and the statistical sciences, including machine learning to conduct large-scale analyses and draw inferences from them; and the computing sciences, to provide the principles that guide data management and design of distributed systems.
This panel made data science come to life. It provided a glimpse into exciting research and development activities at leading technology firms, into research initiatives in this area from both a US and an international perspective, and into the unique Carnegie Mellon multi-disciplinary approach to research and education as it pertains to data science.
Subra Suresh, President, Carnegie Mellon University
Ramayya Krishnan, Dean of the H. John Heinz III College and William W. and Ruth F. Cooper Professor of Management Science and Information Systems, Carnegie Mellon University
Yuichiro Anzai, President, Japan Society for the Promotion of Science
Anuj Dhanda, Sr. Vice President and Chief Information Officer, Giant Eagle, Inc.
Stephen Fienberg, Maurice Falk University Professor of Statistics and Social Science, Carnegie Mellon University
Andrew Moore, Vice President of Engineering, Google
Jeannette Wing, Corporate Vice President, Microsoft Research