17 Pages

FLecture4

Course: CS 482, Fall 2009
School: W. Alabama
Rating:
 
 
 
 
 

Word Count: 1112

Document Preview

4 Topics Lecture for today: Some philosophical/scientific background Goals of the course This really should have been lecture 1, but wasnt. CS 482/682, Fall 2004 Lecture 4 1 Why is bioinformatics? Note the question word. Where did this science come from, and why is there not cheminformatics or physinformatics or geoinformatics? Well, for one, there sort of is: Computational chemistry Computational physics...

Register Now

Unformatted Document Excerpt

Coursehero >> Alabama >> W. Alabama >> CS 482

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
4 Topics Lecture for today: Some philosophical/scientific background Goals of the course This really should have been lecture 1, but wasnt. CS 482/682, Fall 2004 Lecture 4 1 Why is bioinformatics? Note the question word. Where did this science come from, and why is there not cheminformatics or physinformatics or geoinformatics? Well, for one, there sort of is: Computational chemistry Computational physics Chaos and complexity But, with the possible exception of the third of these, theyre entirely contained within their own disciplines. Why is biology different? CS 482/682, Fall 2004 Lecture 4 2 Why bioinformatics? People often say that biology requires computer scientists because the data are huge. Thats not it at all. Astronomy data sets are routinely 10s of terabytes, yet they dont work with CS people outside of scientific computation Particle accelerators also produce enormous amounts of data (Terabytes per second, and Petabytes overall!) By contrast, the human genome fits on 2 CDs. All of GenBank is 42 GBases of sequence, or 10.5 GBytes of data. (as of January 2004) CS 482/682, Fall 2004 Lecture 4 3 Why bioinformatics? Rather, there are a few other reasons why bioinformatics exists as the interdisciplinary science it is. Practical reasons CS reasons Political reasons, etc, which I wont go into. CS 482/682, Fall 2004 Lecture 4 4 Practical reasons Biology, unlike physics, has undergone a complete change in the last 10 years. While biologists spend lots of time in lab, they suddenly have computers in them High-throughput methods, long common in other fields, are now found in biology Much experimentation is also in silico now Much of this makes the science quite different from what its practitioners studied. CS 482/682, Fall 2004 Lecture 4 5 A little overblown Actually, what I just said is a little overblown Biologists certainly have been very strong information scientists They invented modern statistics! Genetics has always been a very mathematical science Mathematical ecology is a rich and very old discipline And, of course, other fields certainly havent stopped: Physics: solid state, quantum experiments, etc. Geology: nonlinear dynamics, etc. But biology has been especially abrupt. CS 482/682, Fall 2004 Lecture 4 6 New biology problems These new techniques require completely new analysis techniques. Example: How much of a given protein is being produced at a given time? Ways to answer it: Northern blot (~1980): Isolate RNA from the organism, then bind to radioactively-labeled RNA for that gene. Intensity of radioactivity = amount of transcription Until relatively recently, that was the only answer. CS 482/682, Fall 2004 Lecture 4 7 Nowadays Well, first, is that really the question I wanted? I asked: how much of this protein is being produced at this time? Did I really want: Which proteins are being produced at a particular time in an organisms life cycle? Is the amount of a particular protein higher or lower at that time than usual? What else is higher or lower? Those experiments werent possible to have happen all at once before, but they are, now. CS 482/682, Fall 2004 Lecture 4 8 Example, contd Current techniques: Mass-spec based proteomics: Pick all of the protein being produced at that time. Break it into pieces Use a mass spectrometer to find out how much the pieces weigh Identify what they are based on matches to a database of fragment weights Not quantitative; cant tell how much is produced Can identify everything produced CS being 482/682, Fall 2004 Lecture 4 9 Example, contd Another technique: DNA microarrays Trap all of the RNA being produced at the time Attach it to a chip with DNA probes for each gene, tagged with a color Intensity of color = level of RNA production (more or less) (There are other techniques, but this will do for now) Both of these techniques suddenly switch to questions that feel more like computer science CS 482/682, Fall 2004 Lecture 4 10 Before / After In the case of the Northern blot, the major questions were: 1. Is the experiment reproducible? 2. How accurate is the measure of RNA production? 3. Is a difference between one condition and another significant? Now, these questions matter, but so do: 1. How accurate is the database lookup on the fragment weights? 2. Is there a pattern that characterizes the genes that are being turned on at the same time? 3. Whats the probability of error? CS 482/682, Fall 2004 Lecture 4 11 End of example Suddenly, we have computer science problems in the midst of our biology! However, this case is really a biology problem, with a CS interpretation. This is true for many cases of highthroughput biology: the problem isnt coming from CS, but rather it just feels like CS once its brought up to date. Ironically, we wont talk about this any more in this class CS 482/682, Fall 2004 Lecture 4 12 CS reasons Some problems having to do with biology really just are CS problems (And, of course, thats what well spend the whole term on) The primary reason is that some of the fundamental data in biology are discrete elements. This, of course, is also true of quantum physics, but thats confusing With biology, an awful lot of the data are expressable as strings, directed graphs, vectors, or trees. And, of course, those are some of the central objects of computer science CS 482/682, Fall 2004 Lecture 4 13 Examples The obvious one: DNA. Well talk more about this shortly, but DNA sequences are long strings of letters over the finite alphabet ACGT. Note that this isnt sufficient, in all cases, to model DNA. DNA is, of course, a molec...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

W. Alabama - CS - 482
Lecture 33Topics for today: End of homology-based gene finding Wrap up of use of HMMs.CS 482/682, Fall 2004 Lecture 331Recall from beforeHMM-based gene finders are great, but their only input is the sequence. Cant we start with more input t
W. Alabama - CS - 482
Lecture 24Topics for today: Likelihood-based approaches to phylogeny This is the most complicated phylogenetic reconstruction method well see. Its also possibly the most often used. Were almost done with trees. Well finish them on Wednesday.CS 48
W. Alabama - REC - 415
REC 415 Week Two!Understanding academic journals Recreation specialization Ego involvementUnderstanding refereed journals and their role in advancing knowledgeThe knowledge presented in most textbooks is at least five years old The knowledge in
W. Alabama - CS - 446
The SuperTel IP Phone Hardware Interface Description1 IntroductionThis document describes the hardware interface for SuperTels new IP phone. A layered perspective on the software for the IP phone is given below in Figure 1. In particular, the cont
W. Alabama - CS - 370
CS 370 Winter 2009: Assignment 2Instructor: G. Labahn Lectures: MWF 9:30 MC2038 Oce: DC3629 e-mail: glabahn@uwaterloo.ca Oce Hours: Tues 2:30-3:30 PM Oce Hours: Thurs 9:30-10:30 AMWeb Site: www.cs.uwaterloo.ca/ glabahn/cs370/ Due Friday Feb 13, 5:
W. Alabama - REC - 415
Cross-Cultural Research Issues International Travel Role PreferencesPart Two Some discussion notesMo, Howard, & Havitz (1993)Cohens (1972) tourist typology was widely cited, but not tested for 15 years Snepenger (1987) made the first attempt Beh
W. Alabama - ECE - 750
Meta-programmingKrzysztof Czarnecki, University of Waterloo czarnecki@acm.org2OutlineWhat is meta-programming Taxonomy of meta-programming Examples of languages with support for metaprogrammingLisp Scheme MetaOCaml Template Haskell 2003 by C
W. Alabama - ECE - 750
Re-Engineering the Java Buffer Library with OpenJavaMichael Jarrett December 11, 2003AbstractThe Java New I/O APIs includes a powerful library for creating a variety of memory buffers optimized for input and output. Due to internal platform-depen
W. Alabama - ECE - 750
ECE 750 SSF & GP Final Exam PreparationKrzysztof Czarnecki University of WaterlooMaterial for the exam preparation Required material: Lecture slides Supplementary material: Assigned reading2Topic overview Basic concepts (GP & SSF) Domain
W. Alabama - STAT - 231
# 4.4University of WaterlooSTAT 231 - W. H. CherryFigure 4.4 .ANAL YSIS STAGE: Scatter DiagramsThe information given below is repr inted from the Journal of the American Statistical Association 84 (#408), pages 945-957, December, 1989 [DC Libr
W. Alabama - STAT - 231
# 4.2University of WaterlooSTAT 231 - W. H. CherryFigure 4.2. ANAL YSIS STAGE: Histograms 1In the Sanitary District of Chicago in early 1966, there were 15 job openings for operating engineers. On March 12, 223 male applicants for the jobs wro
W. Alabama - CS - 349
Universal Accessibility CS 349Professor Michael Terry July 21, 2006Today's Agenda Announcements Assignment 4 notesCS349 / 2Announcements Assignment 4 due Monday, July 23rd before class Assignment 3 appeals by today Demo volunteers for n
W. Alabama - CS - 349
Input Devices Events CS 349Professor Terry May 15, 2006Todays Agenda Announcements / updates Demos Input technologies Fitts Law EventsCS349 / 2Announcements Timer/sleep issues XFlush / animation issues Questions, comments, concerns?
W. Alabama - CS - 445
Reading for this topic:User Interface SpecicationDaniel M. BerryRequired:Nothing!Recommended, Optional, Supplemental:Nothing! 2003 Daniel M. BerryRequirements Engineering User Interface Specication Pg. 1 2003 Daniel M. BerryRequireme
W. Alabama - ECE - 318
University of Waterloo Department of Electrical and Computer EngineeringECE 318 Communication Systems Midterm ExaminationFeb. 12, Thursday, 5:30-7:00 P.M. DWE 2527/3516/3518/3522 __ Instructor: Pin-Han Ho Time allowed: 1:30 hours. Grading: Each q
W. Alabama - ECE - 428
CryptologyCryptographyCryptanalysisCryptographyThe science of "secret" writinga cipher is a function which transforms a plaintext message into a ciphertext(cryptogram) by the process of encipherment plaintext is recovered from the ciphertext
W. Alabama - ECE - 318
(6.1.1) The instantaneous frequency is the time-derivative of the instantaneous phase. (a) The instantaneous phase is (100t + /3). Hence, the instantaneous frequency is d(100t+/3)/dt = 100. (b) The instantaneous phase is (200t + 10sint). Hence, the i
W. Alabama - CS - 350
Virtual Memory1Virtual MemoryGoals: Allow virtual address spaces that are larger than the physical address space. Allow greater multiprogramming levels by using less of the available (primary) memory for each process. Method: Allow pages (or
W. Alabama - CS - 350
Virtual Memory1Virtual MemoryGoals: Allow virtual address spaces that are larger than the physical address space. Allow greater multiprogramming levels by using less of the available (primary) memory for each process. Method: Allow pages (or
W. Alabama - CS - 350
Intro11What is an Operating System? Three views of an operating systemApplication View: what services does it provide? System View: what problems does it solve? Implementation View: how is it built?An operat
W. Alabama - MATH - 239
MATH 239 Winter 2009 Assignment 5 Solutions[23 marks] 1. Prove that the following two graphs are not isomorphic.Solution. [3 marks] First step: note that each graph has only one vertex of degree 3. (In the first graph, it is B, and in the second,
W. Alabama - AMATH - 231
AMATH231 Midterm PreparationDavid Se Jun Lee Wednesday, October 4, 200611.1Curves & Vector FieldsScalar & Vector FunctionsRemark 1.1 | Function A function is a mapping from a given input (domain) and yields output (range). It is necessary th
W. Alabama - MATH - 239
Problem 1.Prove that the graph G = n-cube is non-planar for all n 4.Problem 2. Prove that every planar graph having girth at least 6 has a vertex of degree at most 2. Problem 3. Suppose that graph G satisfies p = 2n and deg(v) n for every verte
W. Alabama - CS - 882
Lecture 7Topics for today: Suffix trees: building them I am following the description of Weiners algorithm in Chapter 6 of Gusfield, but I think my representations may be a bit simpler. Gusfield argues that Ukkonens algorithm is simpler, but I disa
W. Alabama - PSCI - 264
THE EXECUTIVEOctober 31, 2002Controls Budget, Approves Nominations and Treaties, Override Veto, Impeach VETOLegislation, Appropriation, OversightPresidential Directives, Executive AppointmentsPolitical Control of the Bureaucracythe Presiden
W. Alabama - PSYCH - 207
Reading and LanguageMarch 24Outline Stroop demonstration Components Sounds (phonology and morphology) Syntax Semantics DyslexiaThe Stroop Effect Stroop (1935) Used cards with 100 stimuli Timed participants/card Demonstration1The S
W. Alabama - PSCI - 264
AN ELECTIONS PRIMERNovember 5th, 2002 Mid-Term ElectionsHouse of Representatives elections s Senate elections s gubernatorial electionssNovember 5th, 2002 Mid-Term ElectionssHouse of Representatives 435 seats contested basis of represent
W. Alabama - ECE - 309
IDEALGASWEB.TEXECE 309 M.M. YovanovichIdeal Gas RelationsVariable Speci c Heatsu2 , u1 = h2 , h1 = s2 , s1 =R21cv dT cP dTR21v2 cv dT + R ln v1 T R P s2 , s1 = 12 cP dT + R ln P2 T 1R21Incompressible Fluid or Solid Relationsv = co
W. Alabama - ECE - 309
DEPARTMENT OF ELECTRICAL ENGINEERING ECE 309 Thermodynamics and Heat TransferUNIVERSITY OF WATERLOOFinal Examination M.M. Yovanovich NOTE:Spring 1997 August 5, 1997 9:00 A.M.-12:00 Noon1. Open book examination. You are permitted to use your t
W. Alabama - CHEM - 254
Final Examination CHEM 254 Question #1: Phase Diagram (25%) 1.11April 11th, 2003Sketch the phase diagram of acetic acid. Special attention will be paid to indicate the axes labels, the position of the triple point, the critical point, each phas
W. Alabama - ECE - 427
1 Price-Performance-Power Equations1.1 Performancef =1 clockperiod 1 (Amdahl0s Law) Speed U p =(1F ractionenhanced)+ Speedup enhanced CPU time for a program = (# P clock cycles f or program) clock cycle time of = ( i IiCP Ii) clock cycle tim
W. Alabama - ECE - 427
1 Price-Performance-Power Equations1.1 Performancef =1 clockperiod (1F ractionenhanced)+ Speedup enhanced 1F raction enhanced (Amdahl0s Law) Speed U p = CPU time for a program = (# P clock cycles f or program) clock cycle time of = ( i IiCP I
W. Alabama - ECE - 427
1 Digital Systems Design IntroductionDIGITAL vs ANALOG : Digital- has most advanced CAD tools.- Design complexities and performance increasing rapidly. Analog- 30% of ASICs by year 2000 will have analog technologies support analog and digital hybrid
W. Alabama - ECE - 427
1 Solution:1.1 Price_1 (single chip system) Dies per wafer = Die Yield = :95 1 + ( & # ) 320:p (82:510) 5 = 68: 338 2320 " 320 4 # # = : 18177 4 Therefore good die per wafer = 68: 18177 = 12: 361.2 Price_2(two chip system) Dies per wa
W. Alabama - CE - 265
CE 265 Materials2004Pandey/StraubeLab Report : Tensile Testing of PolymersTo better understand the properties of polymers, tensile testing will be carried out in the laboratory. Four different specimens will be tested: low-density polyethylene
W. Alabama - CIVIL - 205
Tutorials 1 & 2 - CivE. 205 _Exercise 1:10 KN 3 KN / m 10 KN 2 KNExercise 2:10 KN 1 KN / m2 KN.ma 2m 2m 4m 2m2m c b2m10.5 10.5a+0.5 0.54m 1m4m1.5 m11.51131 22 21 17 22.04313N.F.D.3 001 1019 13 2 10 2T
W. Alabama - GE - 121
Sheet1 39.62 45.17 40.44 41.33 39.33 44.53 38.19 38.65 40.46 39.06 43.10 38.97 42.90 41.49 40.31 40.47 41.45 41.86 38.02 44.16 42.41 40.82 45.26 43.50 42.63 41.51 42.66 42.55 43.71 41.49 44.28 42.66 42.72 43.73 41.79 44.44 40.09 42.96 39.38 42.83 41.
St. Francis IL - PHYS - 322
Physics 322 Course OutlineSt. Francis Xavier University, 2008OverviewPhysics 322 is a third year course on static electric and magnetic fields. It will cover chapters 1-5 of Griffiths' book on Electrodynamics. On the physics side this includes th
W. Alabama - STAT - 231
STAT 231Page 2 of 61. All par ts (a) to (k) of Question 1, except par t (h), are to be answered with reference to the article Stressful jobs raise men's blood pressure, reprinted on page 6 of the quiz. Problem stage: MARKS (a) What is a target un
W. Alabama - STAT - 231
Fats (Ch. 11) University of Waterloo EM9337: Kitchener-Waterloo Record, December 1, 1993, page D8 STAT 231 - W. H. Cher ryFats raise risk of lung cancer in non -smokersWASHINGTON (AP) - Add lung can cer to the growing list of diseases that seem to
W. Alabama - STAT - 231
STAT 231Page 2 of 51. With reference to the article Stressful jobs raise men's blood pressure, reprinted on page 5 of the quiz, suppose the question of interest is whether people in high-stress low-freedom jobs have higher average blood pressure
W. Alabama - STAT - 231
Maple Die-back Maple syrup is an important cash crop for many farmers in Eastern Canada and especially in the Region of Waterloo. The syrup is produced by collecting the sap in the early spring from the sugar maple, a tree that grows commonly in loca
W. Alabama - ECE - 316
ECE 316 Probability Theory and Random ProcessesChapter 8 Solutions Xinxin FanProblems2. From past experience a professor knows that the test score of a student taking her nal examination is a random variable with mean 75. (a) Given an upper boun
W. Alabama - ECE - 204
University of Waterloo Final ExamName: _ Student ID: _ E&CE 104: Numerical Methods for Electrical Engineers Spring 2004 August 11th, 2004 Duration: 3 Hours Instructor: Douglas Wilhelm Harder Calculators allowed. No other additional materials or aide
W. Alabama - ECE - 204
ECE 104 Final ExaminationE&CE 104 Numerical Methods for Electrical Engineers FINAL EXAMINATION2005/08/15 09:00 11:30 (2.5 hours)Instructor: Douglas Wilhelm Harder Aides: None (specifically, no calculators) You may ask two questions during the ex
W. Alabama - ECE - 204
ECE 204 Numerical Methods for Computer EngineersECE 204 Numerical Methods for Computer Engineers MIDTERM EXAMINATION2006-10-24/5:30-7:00 The examination is out of 75 marks. Instructions: No aides. Write your name and student ID number on each book
W. Alabama - MATH - 440
PMATH 440/640Assignment 21. Let Re(a), Re(b) 0. Prove Ramanujan's identityn=1a (n)b (n) (s)(s - a)(s - b)(s - a - b) , = ns (2s - a - b)Re(s) > 1 + Re(a + b).Note, the condition Re(a), Re(b) 0 is not crucial, it just simplifies the dis
W. Alabama - CS - 798
CS798G Assignment #1Introduction to Public Sequence Databases and other ResourcesComplete the following tasks. Your fellow students are your most valuable resource so please work together. For each task include some documentation indicating you hav
W. Alabama - CS - 744
CS 744 - Advanced Compiler Design Assignment 1Due: Tuesday, June 10th, 2008Consider the following code in a Jimple-like intermediate representation: 1 int a, b, c; 2 a = 0; 3 c = 1; 4 L1: 5 b = a+1; 6 c = c+b; 7 a = b*2; 8 if(a < 10) goto L1; 9 re
W. Alabama - PLG - 246
A SSIGNMENT #1 A BRISK WALK WITH C+CS246, W INTER 2002CS246 Software Abstraction and SpecicationAssignment #1 A brisk walk with C+Due date: 8:00 am, Monday January 28, 2002This assignment is to be done individually. Its purpose is to give
W. Alabama - PLG - 246
Classes, Objects, References, and Pointers: C+ and JavaOverview Classes, files, and interfaces Constructors, destructors, and initializers Heap-based vs. stack-based objects Objects, references, and pointers C LASSES O BJECTS R EFERENCES A
W. Alabama - PLG - 246
Advanced Object-Oriented ProgrammingOverview Operator overloading operator= and copy constructors Static vs. dynamic types virtual functions, static vs. dynamic dispatch Downcasting, RTTI, and reection Object equality and equivalence A DVANCE
W. Alabama - PLG - 246
February 28, 2002Genericity, Templates, and the STLOverview Genericity and type-safety Genericity in Java A lite overview of C+ templates The Adapter design pattern The C+ Standard Template Library: containers, iterators, and algorithms [See
W. Alabama - PLG - 246
March 8, 2002Errors, Exception Handling, and Smart PointersOverview Kinds of errors, fault vs. failure, common run-time errors Pre/post/invariants and robustness Exceptions and exception handling in C+ and Java Standard exception classes Hints on
W. Alabama - PLG - 246
March 8, 2002March 8, 2002Errors, Exception Handling, and Smart PointersOverview Kinds of errors, fault vs. failure, common run-time errors Pre/post/invariants and robustness Exceptions and exception handling in C+ and Java Standard exception cl
W. Alabama - CS - 744
Kinds of AllocationStaticallocate before execution (compile-time) never deallocate size xed at compile-timeStackallocate at method entry deallocate at method exit size xed at method entryHeapallocate at any time deallocate at any time size xe
W. Alabama - CS - 246
School of Computer ScienceCourse Notes CS 246Software Abstraction and Specicationhttp: /www.student.cs.uwaterloo.ca / cs246 Winter 20091 Administration 1.1 What CS 246 is about C/C+ UNIX tools software design2CS 24631.2 Individual C
W. Alabama - CS - 745
CS 745 (Fall 2004) Linear Temporal Logic Exercise13 Oct 20041. Every request must eventually be acknowledged.2. Whenever the request signal is true, it must remain true until it is acknowledged.3. It is always the case that the request signal
W. Alabama - CS - 745
Todays Agenda CS 745 / ECE 725 Computer Aided VericationLecture 4: Introduction to Model CheckingJo Atlee DC 2337, jmatlee@uwaterloo.ca Ofce Hours: Mon 1:00-2:00, Wed 1:00-2:00http:/www.student.cs.uwaterloo.ca/cs745Model checking overview Linear
Virgin Islands - ECON - 420
Chapter 13: LaborKey Issues: Labor markets and labor categories: casual labor and permanent labor Poverty, nutrition, and labor markets Voluntary and involuntary unemployment Wage determination of casual and permanent labor Ef ciency wageCasual a
Virgin Islands - ECON - 371
Topic 7.2 Unemployment Causes Professor H.J. Schuetze Economics 371Why is there longer term unemployment?We have shown that a certain amount of frictional unemployment will exist We have not offered any explanations for longer-term unemployment Le