26 Pages

LecturesPart11

Course: BIO 03510, Fall 2009
School: Carnegie Mellon
Rating:
 
 
 
 
 

Word Count: 669

Document Preview

Biology, Computational Part 11 Genefinding Robert F. Murphy Copyright 1997, 2001, 2003-2007. All rights reserved. Clues to locations of genes (Prokaryotic Signals) s for Transcription x Promoters x Transcription factor binding sites s for Translation x Ribosome binding sites x Start/stop codons Prokaryotic vs. Eukaryotic Genefinding s Simple to create programs to look for "grammatical"...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> Carnegie Mellon >> BIO 03510

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Biology, Computational Part 11 Genefinding Robert F. Murphy Copyright 1997, 2001, 2003-2007. All rights reserved. Clues to locations of genes (Prokaryotic Signals) s for Transcription x Promoters x Transcription factor binding sites s for Translation x Ribosome binding sites x Start/stop codons Prokaryotic vs. Eukaryotic Genefinding s Simple to create programs to look for "grammatical" combination of prokaryotic signals s Much more complicated for eukaryotes due to the presence of introns and additional regulatory elements Clues to locations of genes (Eukarytoic Signals) s for Transcription x x x x x Promoters Transcription terminators Topoisomerase II binding sites Topoisomerase I cleavage sites Transcription factor binding sites Donor and acceptor sites Branch points s for Splicing x x Clues to locations of genes (Eukaryotic Signals) s for mRNA Processing x Polyadenylation sites s for Translation x Ribosome binding sites x Start/stop codons DNA->RNA->protein Signal Sensors: Consensus Sequences s Simple x TATA s PROSITE expression x Y-x-G-A-[FL]-[KRHNQ]-C-L-x(3,4)-G- [DENQ]-V-[GA]-[FYW] x (iron binding site in transferrin) Signal Sensors: Networks s Profile, PSSM (equivalent to perceptron) s Neural Network (multi-layer) Content Sensors: Coding Regions s GeneMark: 3 fifth-order Markov models x one for each reading frame s GRAIL: uses neural net with inputs from x coding potential measures x base composition x signal sensor output for flanking splice sites Integrated Systems s Use dynamic programming to find best combination of signal/content sensors s Apply "linguistic" rules to say what parts are required and in what order Gene model B = gene start S = translation start D = donor A = accceptor T = translation stop E = gene end Basic Implementation s Use an HMM to model what state Q each nucleotide from X is in (given parameters ) s Train HMM with known genes to estimate s For unknown sequence, find Q to maximize P(Q | X, s Used by GENSCAN, HMMgene GENSCAN HMM s Handles genes on both forward and reverse strands Adding Homology s Can try to include information databases from of known proteins to help decide whether an exon is coding s For each candidate exon, increase the score if there is homology with a known protein s This approach used by Genie, GeneID+, GeneParser3, Grail Adding ESTs s Can try to include information from EST databases s EST (Expressed Sequence Tag) databases show sequences that are "known" to be present in mRNA (cDNA) s For each candidate exon, increase the score if it matches to an EST s Used by AAT, Grail Drawbacks s Using homology or ESTs may bias results toward genes similar to known genes (homology) or highly expressed genes (ESTs) Assessing Performance s 1995: Burset and Guigo used benchmark set of 575 vertebrate genes to compare s 2001: Rogic et al. redid comparison with better test set of 195 genes (Genome Research 11:817-832, 2001) x http://www.cs.ubc.ca/~rogic/evaluation/ Machine Learning 101 s s Two types of learning methods Supervised x x have examples of things in different "classes" that you want program to recognize or "classify" goal is for program to learn "rules" or "boundaries" to distinguish classes have program figure out what classes exist s Unsupervised x Machine Learning 101 s s s s Assume that have a group of...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Carnegie Mellon - BIO - 03510
Computational Biology, Part 12 Predicting from Protein SequenceRobert F. Murphy Copyright 1996, 1999-2007 All rights reserved.1Starting PointsBroad Goal: To determine or predict as much as we can from a "new" protein sequence s Have covered h
Carnegie Mellon - BIO - 03510
Computational Biology, Part 13 Retrieving and Displaying Macromolecular StructuresRobert F. Murphy Copyright 1996, 1999, 2001-2007. All rights reserved.1Retrieving 3D structuressProtein Data Bank (PDB)x home page = http:/www.rcsb.org/pdb/s
Carnegie Mellon - BIO - 03510
Computational Biology Part 22 Biological Imaging IE. Glory, G. Steven Vanni Meel Velliste, Robert F. Murphy Copyright 1998, 2000-2007. All rights reserved.Biological imagingsSignificant advances in the fields of optics and electronics in the p
ASU - ECN - 470
TOPIC I INTRODUCTION AND SET THEORY[1] Introduction Economics Vs. Mathematical Economics. "Income positively affects consumption. Consumption level can never be negative. The marginal propensity consume is less than one." C = a + bY, where a >
ASU - ECN - 470
TOPIC II EQUILIBRIUM MODELS[1] Concept of Equilibrium Equilibrium: A state without tendency to change.EX 1:EX 2:II-1 Types of Equilibrium Stable Equilibrium: Once out of equilibrium, there is a tendency to go back. Unstable Equilibrium:
ASU - ECN - 470
TOPIC III LINEAR ALGEBRA[1] (1) 1) Linear Equations Case of Two Endogenous Variables Linear vs. Nonlinear Equations Linear equation: Nonlinear equation: ax + by = c, where a, b and c are constants. ax2 + by = c. Can express a linear equation by
ASU - ECN - 470
[6]VectorDefinition: Vector is a n1 or 1n matrix.Note: Vector is usually expressed by the form of (1,1) or (1,1) row vector;1 . 11 column vector. 1 When people talk about vectors, they are usually column vectors. Vector arithmetics f
ASU - ECN - 470
TOPIC IV CALCULUS[1] Limit Suppose that we have a function y = f(x). What would happen to y as x xo?Definition: Let y = f(x). Then, the limit value of y as x a is denoted by limxaf(x). EX 1: y = 1 + 2x. y = 1/x, x 0.As x 0, y 1. limx0y =
ASU - ECN - 470
TOPIC V COMPARATIVE STATICS[1] MotivationP SD(Y1) D(Yo) Q Here, D(Yo) denotes the demand curve when Y = Y o. Similarly, D(Y1) denotes the demand curve when Y = Y1. Observe that as Y changes, P and Q change. In economic models, equilibrium
ASU - ECN - 470
TOPIC VII UNCONSTRAINED OPTIMIZATION II[1] Relative Maximum and Minimum(1)Review for y = f(x): f(x) = 0 get x*. f(x*) > 0 x* is relative min. point. f(x*) < 0 x* is relative max. point. FOC: SOC:(2)For y = f(x1,x2,.,xn): FOC:f f f
ASU - ECN - 470
1. Matrix Algebra and Linear Economic ModelsReferences Ch. 1 3 (Turkington); Ch. 4 5.2 (Klein). [1] Motivation One market equilibrium Model Assume perfectly competitive market: Both buyers and sellers are price-takers. Demand: Qd = a + bP , a >
Texas A&M - GEOL - 648
Stable Isotope Geology GEOL 648Fall, 2007 E. GrossmanANALYSIS OF CARBONATES ON DELTAPLUSXP AND GAS BENCH II LAB 2The purpose of this lab is to perform carbon and oxygen isotopic analyses of carbonate minerals using the Gas Bench II system and D
ASU - ECN - 470
2. Further Topics in Matrix Algebra(1) Quadratic Form Consider: a11 a12 . a1n x1 a x a22 . a2 n 21 ; xn1 = 2 , = [aij ] = : : : : an1 an 2 . ann xn Annwhere A is symmetric ( aij = a ji ). The following form is called a qua
ASU - ECN - 470
S. C. AHNASSIGNMENT 1 Due February 16 (Monday)1998, SPRING1. (10 pts.) Let S1 = {3,6,9} and S2 = {a,b}. Is {(3,a),(6a),(9,a)} a function from S1 to S2. Why or why not? Explain. 2. (10 pts.) Suppose that the domain of the function y = x 2 - x +
ASU - ECN - 470
ECN 485ASSIGNMENT 2 Due March 25 (Wednesday)Spring 19981. (30 pts, 6 pts. on each.) Let 2 u 0 1 3 (a) (b) (c) (d) (e) ; v 5 4 7 1 ;w 6 2 0 9Find 7u + 3w Find 2u - (v + w) Find uv Can u, v and w span 4? Why or Why not? Explain it. Check whethe
ASU - ECN - 470
Spring Semester 1998Mathematical EconomicsDr. Seung Chan (Min) Ahn !"#$% &%' ()*"# +,- ()*&+$%' !+./ !" #$ %&# '%$() *#$ +,-.#/# 0 $)1#*1& # # # !# .2&# #"# %)+# #!# 3*)# #"#4+"(4)&
Cincinnati - P - 508
%!PS-Adobe-2.0 %Creator: dvips(k) 5.86 Copyright 1999 Radical Eye Software %Title: W05MPart1.dvi %Pages: 2 %PageOrder: Ascend %BoundingBox: 0 0 612 792 %DocumentFonts: CMBX12 CMR12 CMMI8 CMMI12 CMEX10 CMR8 CMSY8 CMTI12 %EndComments %DVIPSWebPage: (ww
Washington University in St. Louis - MEXMRS - 0513
PDS_VERSION_ID = PDS3 RECORD_TYPE = STREAM OBJECT = TEXT PUBLICATION_DATE = 2006-12-06 NOTE
Cincinnati - P - 508
Mid-term Examination Part 1 Introduction to Quantum Mechanics (Physics 508) February 2005This is the first part of the mid-term exam. It is due at the end of the in-class exam period on Monday, February 7, along with the second part which will be gi
Cincinnati - P - 508
%!PS-Adobe-2.0 %Creator: dvips(k) 5.86 Copyright 1999 Radical Eye Software %Title: MPart1A.dvi %Pages: 2 %PageOrder: Ascend %BoundingBox: 0 0 612 792 %EndComments %DVIPSWebPage: (www.radicaleye.com) %DVIPSCommandLine: dvips MPart1A -o %DVIPSParameter
Cincinnati - P - 508
%!PS-Adobe-2.0 %Creator: dvips(k) 5.86 Copyright 1999 Radical Eye Software %Title: MPart2.dvi %Pages: 2 %PageOrder: Ascend %BoundingBox: 0 0 612 792 %EndComments %DVIPSWebPage: (www.radicaleye.com) %DVIPSCommandLine: dvips MPart2 -o %DVIPSParameters:
Cincinnati - P - 508
%!PS-Adobe-2.0 %Creator: dvips(k) 5.92b Copyright 2002 Radical Eye Software %Title: W07MPart1.dvi %Pages: 2 %PageOrder: Ascend %BoundingBox: 0 0 596 842 %DocumentFonts: CMBX12 CMR12 CMSY10 CMMI12 CMR8 CMMI8 CMSY8 %EndComments %DVIPSWebPage: (www.radi
Cincinnati - P - 508
%!PS-Adobe-2.0 %Creator: dvips(k) 5.86 Copyright 1999 Radical Eye Software %Title: jan05.dvi %Pages: 5 %PageOrder: Ascend %Orientation: Landscape %BoundingBox: 0 0 612 792 %DocumentFonts: CMBX12 CMMIB10 CMBSY10 CMR17 CMEX10 CMBXTI10 %EndComments %DVI
Oregon State - ENGR - 202
Final Specification Presentation ScheduleMonday - Jun-01 8 - 8:30 8:30 - 9 9 - 9:30 9:30 - 10 10 - 10:30 10:30 - 11 11 - 11:30 11:30 - 12 12 - 12:30 12:30 - 1 1 - 1:30 1:30 - 2 2 - 2:30 2:30 - 3 3 - 3:30 3:30 - 4 4 - 4:30 4:30 - 5 5 - 5:30 5:30 - 6
Arizona - SCHEDULE - 081
Crs. #(units) Call # Sec #Course/Section Title Time DaysInstructor Bldg. Room #Crs. #(units) Call # Sec #Course/Section Title Time DaysInstructor Bldg. Room #NAVAL SCIENCE (N S )Thomas C. Abel, Professor of Naval Science, Dept. Head,
Oregon State - ENGR - 202
Preliminary Specification Presentation ScheduleMonday - Apr-27 8 - 8:30 8:30 - 9 9 - 9:30 9:30 - 10 10 - 10:30 10:30 - 11 11 - 11:30 11:30 - 12 12 - 12:30 12:30 - 1 1 - 1:30 1:30 - 2 2 - 2:30 2:30 - 3 3 - 3:30 3:30 - 4 4 - 4:30 4:30 - 5 5 - 5:30 5:3
Oregon State - ENGR - 202
ENGR202 Presentation 1 RubricTopic Understanding Score Criteria 3 Student demonstrates all of the items below. 2 Student demonstrates two of the item below. 1 Student demonstrates only one of the items below. 0 Student demonstrates none of the i
Oregon State - ENGR - 202
ENGR202 Presentation 2 RubricTopic Understanding Score Criteria 3 Student demonstrates all of the items below. 2 Student demonstrates two of the item below. 1 Student demonstrates only one of the items below. 0 Student demonstrates none of the i
Oregon State - ENGR - 202
ENGR202 Prototype RubricTopic Completeness of Prototype Score Criteria 10 Student demonstrates a generator that outputs specified energy level, or has clearly define explanation of why generator does not output desired energy. 6 Mechanical or elec
Oregon State - ENGR - 202
ECE322 Lab Grading ScheduleWeek of: 1/7/2007 1/14/2007 Section Title Section 1 - Tools Section 2 - AC Rectifier To Turn in Nothing Prelab for Section 2 Study Questions from Section 1 Report From Section 1 Nothing Prelab Questions Part 1, 2, 3, and 4
Oregon State - ENGR - 202
ENGR202 Final Report Rubric Description Technical Knowledge6 Supports points through correct use of relevant factual knowledge and theories. Some theories and factual knowledge may be drawn from outside of the class. Technical information provided