Carnegie Mellon University

The Artificial Intelligence and Big Data group at Pittsburgh Supercomputing Center converges Artificial Intelligence and high-performance computing capabilities, empowering research to grow beyond prevailing constraints.

bridges_closeup_220x150.jpg

Bringing the Power of HPC to AI

Harness the power of Bridges: 700+ nodes (128GB - 12TB RAM), the world's most powerful GPUs, rich set of software, frameworks, and environments.
big-data_220x150.jpg

Developing Best Practices

Work with the AI & Big Data group to apply best practices to overcome the constraints faced by your research.
hands-ai_220x150.jpg

Connecting to Experts

Learn from experts in the field who will consult to help solve your problems.

Core Initiatives

Compass

The AI & Big Data group supports PSC's Compass, which consists of several parts:

Generic placeholder image
Compass Lab

Leveraging PSC's relationships with vendors, Compass Lab exposes new technology to faculty, students and partners.

Open Compass

Working with the research community, Open Compass undertakes deep exploration of research projects. Benchmarks and best practices in AI will be developed and adopted.

AI Compass Consortium

Partnering with the private sector, the AI Compass Consortium applies best practices to understand and overcome the challenges in adopting and scaling AI.

Hardware & Software Resources

PSC brings powerful hardware and software to bear on your research


BRIDGES is a uniquely capable resource for empowering research by bringing together HPC, AI and Big Data. It is designed to support familiar, convenient software and environments for both traditional and non-traditional HPC users. Its richly connected set of interacting systems offers exceptional flexibility for data analytics, simulation, workflows and gateways, leveraging interactivity, parallel computing, Spark and Hadoop.

Bridges holds a rich set of software, frameworks and environments to engage AI and Big Data research.

CPU Resources

Over 700 compute nodes with 128GB to 12TB of hardware-supported shared memory support research where partitioning data is impractical: genomics, ML, and graph analytics among others.

GPU Nodes

The most advanced GPU nodes available, to accelerate applications as diverse as machine learning, image processing and materials science

Data Network

Data transfer nodes with 10 GigE connections to enable data movement between Bridges and XSEDE, campuses, instruments and other advanced cyberinfrastructure

Software Environments

We provide custom-built environments for AI that run on Bridges' GPUs. Or you can build your own with Anaconda or virtualenv.

AI & Big Data Software

All modern AI software is installed on Bridges, including containers providing a complete environment for many popular packages.

Community Datasets

A number of datasets relevant to the AI/BD community are hosted on Bridges, including ImageNet, NLTK and MNIST.

AI & Big Data Team

paola_625x625.jpg

Paola Buitrago

Artificial Intelligence and Big Data Group Leader, Pittsburgh Supercomputing Center
rajanie_625x625.jpg

Rajanie Prabha

Machine Learning Research Scientist, Pittsburgh Supercomputing Center
julian_625x625.jpg

Julian Uran

Machine Learning Research Engineer, Pittsburgh Supercomputing Center

Current Interns

Interns in the AI & Big Data group collaborate with group members and experts in data and computational science on projects which apply AI to real-world challenges.

Wanting Huang is a master student at the CMU Information Networking Institute, and has an interest in full-stack software development. Her project focuses on designing and implementing REST APIs for querying bigfile-format data from the BlueTides 3, a simulation of the Universe generated by the McWilliams Center for Cosmology. Additionally, Wanting developed a web portal that works as an entry point for the COSMO REST API and explains how to use the data.

Nianyi Chen is a second year PhD student, is a member of Associate Professor of Physics Hy Trac’s lab, and is working to simulate the Epoch of Reionization—the time in the early life of the universe when the first stars and galaxies formed and started to ionize neutral hydrogen—; She is  using her domain-expertise skills at the HPC AI and Big Data Group to process the BlueTides 3 simulation data, so it can be served using the COSMO REST API, a project between the Pittsburgh Supercomputing Center and the McWilliams Center for Cosmology.

Former Interns

Pankaj Bhojwani has an interest in machine learning and is double majoring in Physics and Computer Science at Carnegie Mellon, expecting to graduate in May 2020. His project focuses on using machine learning to analyze medical waveform signals.
Matthew Bialecki will complete a dual BS/MS program in Computer Science at the University of Pittsburgh in May 2020, and is interested in data and AI. His work with the AI&BD group includes creating a dashboard using ElasticSearch, Kibana, and ZomboDB to help visualize job and grant data for users, Bridges metrics, and log reporting; creating web pages for database data using Ruby on Rails; and creating an interactive page to distinguish grants in traditional HPC fields from those in areas that have not traditionally used HPC.
Tina Chang is a student in the Masters in Information Science Management program, Business and Data Analytics path, at Carnegie Mellon, with an interest in predictive modeling and exploratory data analysis. She expects to graduate in December 2018. Her project involves RNA small molecule binding.
Alice Lee is a student in the Masters in Information Science Management program at Carnegie Mellon. She expects to graduate in December 2018. Her interests include data analytics and visualization. She is working on a dashboard displaying real-time visualization of Bridges data for the AI&BD group.
Anand Sakhare is a student in the Masters in Information Science Management program, Business and Data Analytics path, at Carnegie Mellon. He expects to graduate in December 2018. His interests lie in deep learning, machine learning, and AI.