This preview shows pages 1–2. Sign up to view the full content.
This preview has intentionally blurred sections. Sign up to view the full version.View Full Document
Unformatted text preview: 7.03 Recitation -- Section Eleven Main concepts from Lectures #24 -27: 1) We can characterize gene expression and protein-DNA interactions with sequencing technology. To characterize gene expression we turn expressed RNA into DNA, and sequence the resulting DNA. To characterize where a protein binds to DNA, we purify protein-DNA complexes using an antibody, and then release and sequence the DNA to discover where the protein binds in the genome. 2) We can test how likely it is that data from two genes occurred assuming the genes are independent by using a chi-square test with one degree of freedom. If the genes are independent we expect: G2 off G2 on G1 on P(G1)(1-P(G2))total P(G1)P(G2)total G1 off (1-P(G1))(1-P(G2))total (1-P(G1))P(G2)total 3) If we do many comparisons to look for dependent pairs of genes, we need to adjust our uncorrected p-value to get a corrected p-value as follows: corrected p-value = uncorrected p-value x number of comparisons 4) One can characterize single base nucleotide polymorphisms ( SNPs ) genome wide. Current technology gives you both alleles at each locus, but does not tell you which chromosome the alleles are on. A haplotype assigns alleles to their chromosome of origin (from Mom or Dad in human). 5) One can identify a gene that, when mutated, causes a human disorder by finding one marker (out of a collection of markers that are spread out over the genome) that appears be closely linked to the gene associated with the disorder. A LOD score tells us the “log of the odds” that a marker and the gene are linked.of the odds” that a marker and the gene are linked....
View Full Document
This note was uploaded on 02/06/2009 for the course 7 7.03 taught by Professor Fink during the Spring '09 term at MIT.
- Spring '09