Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

University of California, San Diego

Bioinformatics Capstone: Big Data in Biology

University of California, San Diego via Coursera


In this course, you will learn how to use the BaseSpace cloud platform developed by Illumina (our industry partner) to apply several standard bioinformatics software approaches to real biological data.

In particular, in a series of Application Challenges will see how genome assembly can be used to track the source of a food poisoning outbreak, how RNA-Sequencing can help us analyze gene expression data on the tissue level, and compare the pros and cons of whole genome vs. whole exome sequencing for finding potentially harmful mutations in a human sample.

Plus, hacker track students will have the option to build their own genome assembler and apply it to real data!


  • Week 1: Identifying the Culprit in a Food Poisoning Outbreak
    • This week, we will apply genome sequencing algorithms to identify the bacterium causing a deadly food poisoning outbreak.
  • Week 2: Comparing Gene Expression in Tissue Samples with RNA-Seq
    • In this week's Application Challenge, we will learn how RNA-Sequencing can be applied to perform tissue-level gene expression analysis. In particular, which is more similar on a gene expression level: different tissues from the same organism, or analogous tissues from related species? This question was the subject of recent debate and a Twitter controversy -- read more to find out!
  • Week 3: Weighing the Pros and Cons of Whole Genome and Whole Exome Sequencing on a Human Sample
    • Comparing the differences between sequencing an entire human genome and sequencing only the exome, or the DNA that is eventually translated into protein. Can we obtain a complete picture of someone's genetic disease predispositions from only the exome, or is there information lurking in introns that can provide doctors with vital information? How do we weigh the trade-offs when considering genome and exome sequencing?

Taught by

Phillip Compeau and Pavel Pevzner


Related Courses


1.0 rating, based on 1 reviews

Start your review of Bioinformatics Capstone: Big Data in Biology

  • Paolo Binetti

    Paolo Binetti is taking this course right now and found the course difficulty to be medium.

    This course, as others in Coursera catalog, suffer from Coursera's new learning policies, which deteriorate the learnang experience, significantly slowing progress towards completion. In particular I refer to their decision NOT to give you full access...

Never Stop Learning!

Get personalized course recommendations, track subjects and courses with reminders, and more.

Sign up for free