Carnegie Mellon University

Pre-College Computational Biology provides extensive training in both cutting-edge laboratory experiments to generate biological data and the computational analysis of the data that these experiments generate.


Biological and medical research have become fully fledged computational disciplines. Tomorrow's life scientists will need deep knowledge of not only the laboratory techniques for generating experimental data but also the rigorous computational techniques necessary to analyze and model these data. The Pre-College Computational Biology program offers an unparalleled experience for high school students to explore this relationship in a university setting.

After sampling water from one of Pittsburgh’s three rivers, students will isolate the bacterial DNA from the water and break the DNA strands into millions of tiny fragments that are then sequenced. Students will then build algorithms to identify the species of microorganisms present in the water samples and construct an evolutionary tree determining how they've evolved.

Picture of Pre-College Computational Biology students talking and laughing after working in the Biology Lab.


The curriculum changes from year to year and is subject to change as we continue to hone our program to find fun activities that we can cover with students. For an updated curriculum, please consult our program homepage.

Overall Structure

We will begin the program by setting sail on Pittsburgh’s three rivers for a full day of collecting water samples and engaging in fun interactive educational activities.

The rest of the program allows students to appreciate the inherent interplay between experimentation and computational analysis. Our students will spend half of each day in a laboratory working together to perform biological experiments, many of which center on answering biological questions about the microbes living in the three rivers.  The other half of the day will follow a hackathon model, in which students will work in small groups to write programs solving computational problems, with hands-on guidance from the instructor and teaching assistants. The code that students write in the hackathon will serve as the foundation for analyzing the biological data that they generate as a result of laboratory experiments.

Students will explore the following topics:


  • Bacterial colonization and genome sequencing
  • DNA extraction
  • Genome assembly
  • Polymerase chain reaction
  • 16S ribosomal RNA sequencing


  • Genome assembly
  • Downstream genome analysis, such as gene finding
  • Sequence alignment and its applications to species identification, genome annotation, and gene comparison
  • Evolutionary tree construction
  • Metagenomics analysis

Carnegie Mellon University is a leader in automated science, therefore students will also have the opportunity to work in our automation lab. They will use robots to run experiments while learning how machine learning can be used in the design and execution of experiments.

Final presentations at the close of the program will allow students to present their work to parents, guardians, and other guests.

Programming boot camp

Preparatory materials will be provided to admitted students in advance of the program to provide fundamental programming skills.  No prior programming experience is required to be successful.


We are looking for students who love biology, have demonstrated that they are proficient in mathematics, and who are looking for a program that will teach them how computational approaches are fundamental to a complete understanding of modern biology.

To be eligible for Pre-College Computational Biology students must: 

  • Be at least 16 years old by the program start date (to participate in the residential program).
  • Be a current sophomore or junior in high school.
    • Talented sophomores are encouraged to apply, however most of our admitted students will be juniors.

  • Have an academic average of B (3.0/4.0) or better.

Programming experience is not required. 

We do not require that students have experience in programming, as we provide preparatory materials that provide our students with the foundation in programming that they will need to be successful.

Application Requirements

The complete application for the Pre-College Computational Biology consists of the following:

  • Completed Online Application
  • Unofficial Transcript
  • Standardized Test Scores (optional)
    • Standardized tests are not required. We assess applicants holistically and take into consideration many factors, including quantitative background and skill. One way in which this skill can be demonstrated is through optional submission of PSAT, SAT, ACT, or SAT Subject Test scores and/or by mathematics coursework.
  • One Letter of Recommendation
  • Responses to the following essay prompts (300-500 words):
    • What do you hope to gain from participating in the Carnegie Mellon Pre-College program?
    • Why are you interested in studying Computational Biology?

Frequently Asked Questions

Great question! The short answer is the application of high-powered computational approaches to analyze biological or medical datasets. For a lengthier explanation, check out our post from Dr. Bob Murphy.

We will provide all supplies needed for the pre-college program. The computational components of the program will be completed in University computer clusters, and so students will not need to bring their own computer.