lecture-8 - Practical Bioinformatics for Life Scientists...

Info iconThis preview shows pages 1–5. Sign up to view the full content.

View Full Document Right Arrow Icon
Practical Bioinformatics for Life Scientists Week 4, Lecture 8 István Albert Bioinformatics Consulting Center Penn State
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Reminder Before any serious work re-check the documentation for small but essential details. Example: bwa needs to be indexed differently for small and large genomes bwa has to be invoked with different alignment modes for short reads (200 < ) and long reads (200 >)
Background image of page 2
Sequencing Coverage (Depth) Lander/Waterman model: 1. random reads 2. ability to detect overlap does not change coverage C = N * L / G N = number of reads, L = length of reads, G = size of genome Probability of a base not being sequenced P = exp(-C) To get the percent of genome not covered (multiply by 100) N=35 million, L=35, G=250 million C = 5 0.6% genome not sequenced 15 million bases not covered
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Neither of the models assumptions are correct multiply required coverage at least 10 fold What part of the genome is coverable to begin with? Also known as
Background image of page 4
Image of page 5
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 15

lecture-8 - Practical Bioinformatics for Life Scientists...

This preview shows document pages 1 - 5. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online