Carnegie Mellon University

Computational Biology

Empowering diverse young minds to apply computation to solve research-inspired questions in modern biology.

Program Overview

The Pre-College Program in Computational Biology offers comprehensive training in both state-of-the-art laboratory techniques for generating biological data and the computational methods necessary to analyze it.

Computer science has transformed biology and medicine. The next generation of life scientists must be well-versed not only in laboratory techniques for data generation but also in the computational skills needed to analyze and model this data. Pre-College Computational Biology provides high school students with an exceptional opportunity to explore this interdisciplinary relationship in a university setting.

Our program addresses key biological questions, including the study of microbes in Pittsburgh’s three rivers and the ongoing COVID-19 pandemic. After collecting water samples from one of these rivers, students will use modern laboratory techniques to isolate bacterial DNA, fragment it into millions of tiny strands, and sequence them. The challenge then becomes interpreting this vast amount of information - a task perfectly suited for computational biology.

The program is designed to highlight the synergy between experimentation and computational analysis in modern biology. Each day, students will spend half their time in a hackathon-style setting, working in small groups to develop code that solves computational problems, with guidance from instructors and teaching assistants. The other half of the day will be dedicated to laboratory work, where students will generate large datasets to be analyzed using their own code.

Carnegie Mellon University, a leader in automated science, offers students the unique opportunity to work with automation in an educational setting. Here, they will use robots (such as the Opentrons OT-2 liquid handling robot and custom hardware) to conduct experiments while learning how machine learning enhances experiment design and execution.

Curriculum

Preparatory materials will be provided from Professor Compeau’s Programming for Lovers open education project to admitted students in advance of the program to provide fundamental programming skills.

IMPORTANT: Admitted students will be required to complete some assignments taken from this project before starting the program.

Experimental
  1. How do we design an experiment to learn about microbes in the environment?
  2. How do we collect samples from environmental water sources?
  3. How do we generate DNA sequence data from water samples?
Hackathon
  1. How can we extract DNA from samples with a variety of organic material with different structures (viruses, plants, bacteria, other microorganisms)?
  2. How can we use our knowledge of evolution and molecular biology to focus our experiments on studying bacteria?
  3. How can we use sequence data to determine the diversity of microbes in the rivers?
  4. How can we compare bacterial populations of two water samples?
  5. How can we determine what drives microbial diversity in river water?
Experimental
  1. How can we isolate and identify individual bacteria in the laboratory?
  2. From bacteria, how do we isolate DNA?
  3. How can we match a DNA sequence to a database of known bacteria?
Hackathon
  1. How can we quantitatively determine the difference between two DNA strands containing only A’s, C’s, T’s, and G’s?
Experimental
  1. How can we break genomes into many small pieces for sequencing?
  2. How can we sequence very long strands of DNA?
Hackathon
  1. How do we assemble short strands of DNA to reconstruct genomes of organisms?
  2. With an assembled genome, how can identify genes and determine the function of those genes?
Hackathon
  1. What are the evolutionary relationships among bacteria in Pittsburgh’s rivers?
  2. Can we use evolutionary trees of viruses sampled from patients to determine the origin of SARS-CoV-2 in the U.S.?
  3. How can we visually compare multiple sequences to one another?
  4. How can we quickly determine where mutations in the coronavirus occurred and use this to identify variants?
  1. How can we use liquid handling robots to improve the efficiency and precision of experimentation?
  2. How can we use robotics to help identify and isolate extremophile bacteria?
  3. How can we use machine learning to build a model to predict growth conditions for bacteria?
  4. How can we use artificial intelligence to direct experimentation to answer scientific questions?
CompBio student in lab coat looking at laptop

Application Details

Program Length

Jun. 21 to
Jul. 19, 2025

 (4 weeks)

Early Decision & International Applications Due

february-1-final-final.png

Scholarship¹ & Regular Decision Applications Due

march-1-final-final.png

Housing Options

Resident 
or 
Commuter²

¹ Scholarship decision notifications released on Friday, April 4, 2025. All others are rolling admission.
² To be a commuter, the student and parent/guardian must have a permanent residence within approximately 30 miles of our Pittsburgh campus or within Allegheny County.

Eligibility and Application Requirements

To be eligible for Pre-College Computational Biology, students must: 

  • Be at least 16 years old by the program start date.
  • Be a current sophomore or junior in high school at the time of application submission. Please note: Talented sophomores are encouraged to apply, however, most of our admitted students will be juniors.
  • Have an academic average of B (3.0/4.0) or better.

The complete application for Pre-College Computational Biology consists of the following:

  • Completed online application
  • Unofficial transcript
  • Standardized test scores (optional)
Standardized tests are not required. We assess applicants holistically and take into consideration many factors, including quantitative background and skill. One way in which this skill can be demonstrated is through optional submission of PSAT, SAT, ACT, or SAT Subject Test scores and/or by mathematics coursework.
  • One letter of recommendation
  • Responses to essay prompts

Essays are required for the following prompts (300-500 words each):

  • What do you hope to gain from participating in Carnegie Mellon’s Pre-College Programs?
  • Why are you interested in studying Computational Biology?

What is computational biology?

Great question! The short answer is the application of high-powered computational approaches to analyze biological or medical datasets. For a lengthier explanation, check out the first 20 minutes of this video recorded by Professor Compeau.

Is this the right fit for me? 

If you are good at math and love science, we think you’re in the right place.

Do I need to bring my own computer? What other supplies do I need?

Because our program is heavily dependent on coding, each student in our program will need to bring a laptop. We will provide all other resources needed.

Will I earn college credit from this program?

No, Pre-College Computational Biology students do not earn college credit. 

Are international students allowed to participate?

Yes, our program is open to international students as long as they are able to enter the United States and come to Pittsburgh. They are not, however, eligible for scholarship consideration.