Carnegie Mellon University

Pre-College Computational Biology provides extensive training in both cutting-edge laboratory experiments to generate biological data and the computational analysis of the data that these experiments generate.

Biological and medical research have become fully fledged computational disciplines. Tomorrow's life scientists will need deep knowledge of not only the laboratory techniques for generating experimental data but also the rigorous computational techniques necessary to analyze and model these data. The Pre-College Computational Biology program offers an unparalleled experience for high school students to explore this relationship in a university setting.

After sampling water from one of Pittsburgh’s three rivers, students will isolate the bacterial DNA from the water and break the DNA strands into millions of tiny fragments that are then sequenced. Students will then build algorithms to identify the species of microorganisms present in the water samples and construct an evolutionary tree determining how they've evolved.

Picture of Pre-College Computational Biology students talking and laughing after working in the Biology Lab.


Students in the Pre-College Program in Computational Biology conduct cutting-edge experiments in molecular biology and build computational approaches from the ground up to analyze the datasets that they obtain from these experiments.  The synergy between experimentation and computation is a hallmark of modern biological research, and for the first time, there is a program explicitly for high school students allowing them to explore this relationship while solving a real research problem.

Students will complete laboratory and coding bootcamps, as well as collect their own biological samples; in 2019, we sampled river water from Pittsburgh's three rivers to unlock the hidden bacterial ecosystems inhabiting these rivers throughout the year.  Students will then explore the following topics:


  • Bacterial colonization and genome sequencing
  • DNA extraction
  • Genome assembly
  • Polymerase chain reaction
  • 16S ribosomal RNA sequencing


  • Genome assembly
  • Downstream genome analysis, such as gene finding
  • Sequence alignment and its applications to species identification, genome annotation, and gene comparison
  • Evolutionary tree construction
  • Metagenomics analysis


To be eligible for the Pre-College Computational Biology students must: 

  • Be at least 16 years old by the program start date (to participate in the residential program).
  • Be a current sophomore or junior in high school.
    • Program is typically for juniors, however some sophomores will be accepted.
  • Have an academic average of B (3.0/4.0) or better.
  • Have a strong interests in math, biology, and computer science.
    •  Previous exposure to computational biology isn't required.
  • Have programming experience. No specific language requirements. 

Application Requirements

The complete application for the Pre-College Computational Biology consists of the following:

  • Completed Online Application
  • Unofficial Transcript
  • Standardized Test Scores (required)
  • One Letter of Recommendation
  • Responses to the following essay prompts (300-500 words):
    • What do you hope to gain from participating in the Carnegie Mellon Pre-College program?
    • Why are you interested in studying Computational Biology?

Frequently Asked Questions

Great question! The short answer is the application of high-powered computational approaches to analyze biological or medical datasets. For a lengthier explanation, check out our post from Dr. Bob Murphy.

We will provide all supplies needed for the pre-college program. The computational components of the program will be completed in University computer clusters, and so students will not need to bring their own computer.