3 Pages

GCB535Lab8

Course: CIS 535, Fall 2009
School: UPenn
Rating:
 
 
 
 
 

Word Count: 893

Document Preview

535 GCB/CIS Fall 2004 Lab Exercise 8: Analysis of MS/MS data (12/03/04) Tools: 1. PeptideSearch tools from the Bioanalytical Research Group at EMBL: http://www.narrador.embl-heidelberg.de/GroupPages/Homepage.html (Contains the tools we need to use in the homework for peptide and protein identification.) 2. ExPASy Proteomics Server: http://us.expasy.org/ (Comprehensive collection of tools useful in proteomics.)...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> UPenn >> CIS 535

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
535 GCB/CIS Fall 2004 Lab Exercise 8: Analysis of MS/MS data (12/03/04) Tools: 1. PeptideSearch tools from the Bioanalytical Research Group at EMBL: http://www.narrador.embl-heidelberg.de/GroupPages/Homepage.html (Contains the tools we need to use in the homework for peptide and protein identification.) 2. ExPASy Proteomics Server: http://us.expasy.org/ (Comprehensive collection of tools useful in proteomics.) Questions: 1. Swiss-2DPAGE database: As we learned in the lecture, 2-D PAGE is a technique that are often used for separating mixture of proteins based on their pI and MW. ExPASy has a link to the Swiss-2DPAGE database (http://us.expasy.org/ch2d/ ), which contains information on proteins identified on 2-D PAGE reference maps. Now, lets try to get access of these data. Click on by clicking on a spot under Access to SWISS-2DPAGE. Here you will see a collection of reference maps for different tissues and species. Select the map for Saccharomyces cerevisiae. Here we have a reference 2-D PAGE for budding yeast. Note that some of the points on the gel are marked with red cross. Those are the identified proteins. Click on one of the marked spots that have a pI ~6.0 and Mw slightly less than 50 Kd. You will get a list of data on this protein. In the section 2D PAGE maps for identified protein, there are a list of map locations corresponding to this protein. Its often the case that you would get more than one map location for the same protein. Click on one of the locations listed. You will see the locations of this protein marked on the reference map together with a rectangle box around the theoretical pI and Mw values for that protein. The box indicates the region where one would expect to see the protein on the gel. Now, suppose you are interested in the yeast protein Glyceraldehyde-3phosphate dehydrogenase 3 (TDH3p) and want to locate it on the yeast reference 2-D PAGE. Retrieve the protein sequence from NCBI. Use the tool from ExPASy (http://us.expasy.org/tools/pi_tool.html ) to calculate the pI and Mw value for this protein. Can you find the spot(s) corresponding to this protein on the reference map? 2. Identification of proteins from mass spectrum data: a) By amino acid sequence of a peptide: As we can see, identifying proteins just based on their location on the 2DPAGE is not easy. But suppose we know the amino acid sequence of a fragment of it, say a small peptide produced by Trypsin digestion of the protein. It would be much more straightforward to identify the protein. Suppose we learnt the amino acid sequence of a peptide from DeNovosequencing of MS/MS data (or some other sequencing method). The sequence contains 12 amino acids: RHPEYAVSVLLR. to Try use the amino acid sequence option in the PeptideSearch homepage. Search for proteins with mass between 0 and 300 kDa that has the above peptide in unspecified organisms. How many results did you get from this search? Retrieve the protein sequence from one of the results and use the PeptideCutter at ExPASy (http://us.expasy.org/tools/peptidecutter/ ) to calculate theoretical Trypsin digestion products of this protein. Set the options such that only the Trypsin digestion is predicted and only sites with cleavage probability >= 80% are displayed. Can you find the peptide with the above sequence in the predicted product? b) By sequence tag method: DeNovo-sequencing of MS/MS data requires sophisticated software. However, oftentimes, you need not to know the entire sequence of the peptide in order to identify the peptide and the protein. Suppose that we learned from Full Scan and Zoom Scan of a protein that the average Mw value of a peptide is ~1440. And from the MS/MS spectrum of that peptide, we identified three neighboring amino acids, YAV, which locates between two peaks with m/z value of 520.6 and 941.0. We could then use the information available and the sequence tag method to identify the peptide and protein. Click on the sequence tag option of the PeptideSearch homepage. Enter 1440 for peptide mass (Average mass). Enter (520.6)YAV(941.0) for peptide sequence tag. Set the Match regions for 1 and 2 and 3. Allow for only one missed cleavage sites per peptide. Try to search first by Y-type sequence ions, then by B-type sequence ions. How many results you got in each case? Which of t...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

UPenn - CIS - 535
1GCB 535 / CIS 535 Fall 2004Midterm 2 ReviewAs before, you should also go over all the homework problems and make sure you understand the concepts behind them. Generally, you should not need to worry about equations, numbers, etc. To encourage y
UPenn - CIS - 535
GCB/CIS 535 Microarray TopicsJohn Tobias November 3rd, 2004OverviewExperimental Design Technology Replicates Experimental Execution Data ProcessingOverviewExperimental DesignIdentify Critical Conditions Identify Critical Comparisons Minimiz
UPenn - CIS - 535
Markov ModelHidden Markov ModelsSequence of statesE.g., exon, intron, Sequence of observationsE.g., AATCGGCGT Called emissionsProbability of transitionThe Markov matrix Mij = p(Sj | Si)Probability of emissionP(Ok|Sj)Lyle Ungar, Univer
UPenn - CIS - 535
GCB/CIS 535 Fall 2004Lab Exercise 3: Motif finding IBiological background:(10/01/04)The transcription activities of the genes of the yeast high-affinity copper and iron transport system are controlled by at least three transcription factors: M
UPenn - CIS - 535
GCB/CIS 535 Fall 2004Lab Exercise 4: Promoter PredictionTools:(10/08/04)1. Gene/Protein sequence search: NCBI Entrez: http:/www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein 2. Promoter Prediction: http:/www.cbs.dtu.dk/services/Promoter/ (Prom
UPenn - CIS - 535
GCB/CIS 535 Microarray TopicsJohn Tobias November 8th, 2004Overview Normalization Sample Quality Control Finding Differentially Expressed Genes Gene List AnnotationOverview Normalization Sample Quality Control Finding Differentially Expre
UPenn - CIS - 535
PROTEOMICSDe novo sequence prediction for: nsi78_11.1803.1806.2.dta Sequence Absolute Relative Probability Probability 3.9% 36.3% CRGSVNFP[PL]FK 2.3% 24.7% CRGSVN[DE][PL]FK 6.1% 17.2% CRGSVPFN[PN]FK 3.1% 6.5% CRGSV[SR]D[PL]FK 2.7% CRGSVPFNWGDK <0.1%
UPenn - CIS - 535
GCB/CI S 535 M icr oar r ay TopicsJohn Tobias November 3r d, 2004Over view Exper iment al Design Technology Replicat es Exper iment al Execut ion Dat a Pr ocessingOver view Exper iment al Design Technology Replicat es Exper iment al Exe
UPenn - CIS - 535
GCB/CI S 535 M icr oar r ay TopicsJohn Tobias November 15t h , 2004Over view Gene L ist Annot at ion Significance of Over St at ist icalion Repr esent atSimilar it y / Clust er ing Applicat ions Public Dat a For mat s and Dat abasesOver vi
UPenn - CIS - 535
Introduction to BioInformaticsGCB/CIS535Lyle Ungar, University of PennsylvaniaCourse OverviewSequence alignment Dynamic programming Blast and its variants statistical significance Motif and promoter prediction Homology and HMMs Experime
UPenn - CIS - 535
Hidden Markov ModelsLyle Ungar, University of PennsylvaniaMarkov Model Sequenceof states of observations of transition of emissionE.g., exon, intron, E.g., AATCGGCGT Called emissions The Markov matrix Mij = p(Sj | Si) P(Ok|Sj) Sequence
UPenn - CIS - 535
GCB535/CIS535 Homework 71. (10 pts) Experimental design: The following is an example of results from a spotted microarray done on mouse liver tissue:Gene A B C D Clock knockout Cy3 Cy5 2300 8028 201 100 8000 3082 2020 1200 Ratio 0.286497 2.01 2.595
UPenn - CIS - 535
1GCB 535 / CIS 535: Introduction to BioinformaticsMidterm Examination Wednesday, 12 October 2005 This is a closed-book exam. Write your answers on the exam paper, in the spaces provided. If you need more space, use the back side of the page, c
UPenn - CIS - 535
1GCB 535 / CIS 535: Introduction to BioinformaticsMidterm Examination Friday, 10 December 2004 This midterm examination consists of 11 pages (including this one), 5 questions, and 70 points. Please check to make sure you have all the pages. T
UPenn - CIS - 535
GCB 535 /CIS 535 Homework 5 Logistic regressionCircadian rhythmicity of biologic processes is a fundamental property of all eukaryotic and some prokaryotic organisms. These rhythms are driven by an internal time-keeping system. Changes in the extern
UPenn - CIS - 535
1.a) NGNA & NGNC 1 A 0.25 T 0.25 C 0.25 G 0.25 Score = 0.25 * 1 * 0.25 * 0 = 0 b) CACAF & NNNNG 2 0 0 0 1 3 0.25 0.25 0.25 0.25 4 0.5 0 0.5 0A T C G1 2 3 4 0.125 0.625 0.125 0.625 0.125 0.125 0.125 0.125 0.625 0.125 0.625 0.125 0.125 0.125 0.12
UPenn - CIS - 535
1GCB 535 / CIS 535: Introduction to BioinformaticsMidterm Examination Wednesday, 12 October 2005 This is a closed-book exam. Write your answers on the exam paper, in the spaces provided. If you need more space, use the back side of the page, c
UPenn - CIS - 535
CIS/GCB535 Homework - conservation papers. Please hand in on Monday 1) Three questions about the papers which would be interesting for class discussion. You need not answer these questions; just pose them. 2) Brief answers to the following questions:
UPenn - CIS - 535
YORFNAMEGWEIGHTCell-cycle Alpha-Factor 1Cell-cycle Alpha-Factor 2Cell-cycle Alpha-Factor 3Cell-cycle Alpha-Factor 4Cell-cycle Alpha-Factor 5Cell-cycle Alpha-Factor 6Cell-cycle Alpha-Factor 7Cell-cycle Alpha-Factor 8Cell-cycle Alpha-Factor
UPenn - CIS - 535
Introduction Squence analysis
UPenn - CIS - 535
YORFNAMEGWEIGHTGORDER01020304050607080100110120130140150160EWEIGHT1111111111111111YOR348CYOR348C PUT4 transport proline and gamma-aminobutyrate permease S0005875110.65116280.
UPenn - CIS - 535
probe id92257_at93619_at93694_at97724_at100122_at94420_f_atgene symbolclock per1per2cry2Gnb5cry1labelCT18-14.333.2615.776511.83380.90333.62050CT 22-13.772.8694.644710.0353-0.22653.77890CT 2-23.52.
UPenn - CIS - 535
Gene IDM_AM_BM_CM_DM_E_Re-IVTM_FM_GM_HM_IM_J_NewPoolM_KM_LM_MM_NM_OM_PM_QM_RM_TOM_AM_TOM_BM_TOM_CM_TOM_DM_TOM_EM_TOM_FM_TOM_GM_TOM_HM_TOM_IM_TOM_J_Re-IVTM_TOM_K_Re-IVTMGU74Av2.CELM_TOM_L_Re-fragM_TOM_MM_TOM_NM_TOM_O_R
UPenn - CIS - 535
Genomics: the big pictureLyle Ungar, University of PennsylvaniaHow do genes control phenotype? FindgenesAnnotate them (e.g. function) Identifyintrons, exons, regulatory elements Determine genetic regulatory networkE.g. Transcription
UPenn - CIS - 535
Gap penalties Gap penalties are chosen appropriately for each scoring matrix based on empirical studies involving a set of similar sequences. Bayesian adaptive sequence alignment (Zhu, Liu and Lawrence) This approach samples from all optimal alignme
UPenn - CIS - 535
Sequence alignmentInformally, aligning a pair of sequences means matching up the letters across the sequences preserving the order of letters (no cross-overs) which makes the most sense in a given context. Example: Align THISISREALLYSTRANGE and THIS
UPenn - CIS - 535
Mayo Clin Proc, November 2002, Vol 77Primer on Medical Genomics Part IV1185Medical GenomicsPrimer on Medical Genomics Part IV: Expression ProteomicsANIMESH PARDANANI, MD, PHD; ERIC D. WIEBEN, MD; THOMAS C. SPELSBERG, PHD; AND AYALEW TEFFERI,
UPenn - CIS - 535
Comparative Genomics"Know then thyself, presume not God to scan; The proper study of mankind is man." Alexander Pope, 1733"Nothing in biology makes sense except in the light of evolution." Theodosius Dobzhansky, 1932Comparative Genomics1. 2.
UPenn - CIS - 581
CIS581, Computer Vision Project 4, Automatic Face Morphing Due December 16, 1:00 pmOverviewThis project focuses automatic face morphing. You need to collect 12 or more face images. You are free to pick anyone, but it should include 6 celebrities (
UPenn - TCOM - 503
TCOM 503/EE 509 Problem Set 2 Due: 09/28/05Read: Chapters 1, 2 and 3 of the textbook by Palais (using 5th edition). Review: Overheads from the Volume I of bulk pack. Review: Complex numbers and vectors. Dwights Office Hours: 4:35 5:45 on Wednesday,
UPenn - TCOM - 503
TCOM 503/EE 509 Problem Set 5 Due: 10/21/05 @ 12 Noon in 203 MooreReview: Overheads from the Volume II of bulk pack. Review: Fourier Transforms use your own text of one of the texts in the bulk pack of readings on Fourier Transforms (the first one
UPenn - DRAGON - 2
XML is widely accepted as the standard for data exchange between businesses on the Internet. However, most corporations publish only selected portions of their proprietary business data as XML documents, and even then only virtually, that is by e
UPenn - CSE - 140
The best way to approach a Bayes problem is 1) Determine what the data and hypotheses are. Try to be precise. Data are things that you observe. Hypotheses are things you want to know, but don't observe.2) What are the prior and the likelih
UPenn - SAS - 540
FILE h123.eh 104th CONGRESS 2d Session AN ACT To amend title 4, United States Code, to declare English as the official language of the Government of the United States.
UPenn - STAT - 550
Take Home Final, Statistics 550, Fall 2008This is a take home final exam and is due Wednesday, December 17th by 5 pm (put in Prof. Smalls mailbox in the statistics department or e-mail him). You can consult any references but cannot speak with anyon
UPenn - STAT - 510
Statistics 510: Notes 21Reading: Sections 7.3-7.4 I. Moments of the Number of Events That Occur (Chapter 7.3) Example 1: Consider n independent trials, with each trial being a success with probability p. Let X be the number of successes in the n tri
UPenn - STAT - 510
Statistics 510: Notes 16Reading: Section 5.6.2-5.6.4, 5.7 I. Other Continuous Distributions (5.6.2-5.6.4) 1. Weibull Distribution (Section 5.6.2) The Weibull distribution is widely used in engineering as a model for the lifetime of objects. A random
UPenn - STAT - 510
Statistics 510: Notes 13Reading: Sections 5.1-5.3 Note: Room and Time for Question and Answer Review Session for midterm. Monday, October, 16th, 6:30 pm, Huntsman Hall 265. I. Wrap up on cumulative distribution functions (Section 4.9) The cumulative
UPenn - STAT - 510
Statistics 510: Notes 22Reading: Section 7.6-7.7 I. Conditional Expectation and Prediction (Chapter 7.6) Sometimes a situation arises where the value of a random variable X is observed and then, based on the observed value, an attempt is made to pre
UPenn - STAT - 550
Statistics 550 Notes 6Reading: Section 1.5 I. Sufficiency: Review and Factorization Theorem Motivation: The motivation for looking for sufficient statistics is that it is useful to condense the data X to a statistic T ( X ) that contains all the inf
UPenn - STAT - 550
Statistics 550 Notes 13Reading: Section 2.3. Schedule: 1. Take home midterm due Wed. Oct. 25th 2. No class next Tuesday due to fall break. We will have class on Thursday. 3. The next homework will be assigned next week and due Friday, Nov. 3rd. I. A
UPenn - STAT - 510
Statistics 510: Notes 20Reading: Sections 7.1-7.3 In Chapter 7, we study more properties of expected values. I. Expectations of Sums of Random Variables (Section 7.2) Recall Proposition 4.1 of Chapter 4: E[ g ( X )] = g ( x) P( X = x)possible valu
UPenn - STAT - 510
Practice Problems for the Midterm Exam, Statistics 510, Fall 20061. In a certain community, 36% of the families own a dog, 30% of the families own a cat, and 22% of the families that own a dog also own a cat. What is (a) the probability that a rando
UPenn - LDC - 2009
Publication Title: Butler University Audio-Visual Database of Spoken Language, version 1Authors: Carolyn Richie, Sarah Warburton, Megan CarterData type: Audio-VideoData sources: Video data were recorded on August 10th, 13th, and 24th of 2007 fo
UPenn - LDC - 2008
PROPBANK ANNOTATION GUIDELINESOlga Babko-Malaya October 20061. PropBank Annotation Goals . 2 2. Task 1: Argument Labeling .. 2 2.1 Frame Files. 2 2.2 Choosing Arg0 versus Arg1. 4 2.3 Annotation of null elements. 6 2.3.1 Passive sentences . 6 2.3.2
UPenn - LDC - 2003
PENN ARABIC TREEBANK GUIDELINES*DRAFT, January 28, 2003*Ann Bies and Mohamed Maamouri Linguistic Data Consortium University of Pennsylvania 3600 Market Street, Suite 810 Philadelphia, PA 19104 bies@ldc.upenn.edu, maamouri@ldc.upenn.edu1Table o
UPenn - LDC - 2004
Lexicon for the Transliteration of Spontaneous Speech Verbmobil IISubset for the transcription of the ISL Meeting CorpusSusanne Burger Interactive Systems Laboratories, Carnegie Mellon University, Pittsburgh, PA, USA sburger@cs.cmu.eduAbstract..
UPenn - LDC - 97
Sheet1 User information for the KIDS database DATABASE CONTENTS This database is comprised of sentences read aloud by children. It was originally designed in order to create a training set of children's speech for the SPHINX II automatic speech recog
UPenn - LDC - 94
'bout 'cause 'cept 'em 'ment 'pletely 'tache 'til 'wards -sh-shirt -shirt's -shirts @ [\aircraft] [\dog_barking] [\engine] [\laughter] [\noise] [\phone_ringing] [\ringing] [\static] [\tone] [\whistling] [aircraft] [breathing] [cough] [dog_barking] [e
UPenn - LDC - 2009
ADDENDUM TO THE SWITCHBOARD TREEBANK GUIDELINESAnn Bies, Justin Mott, Colin WarnerAugust 2005This file contains a list of Part-of-Speech-tagging and parsing decisions that were made in the course of annotation. As such, it is primarily a refere
UPenn - LDC - 98
Description of the Switchboard-2 Phase I telephone speech corpus_May, 1998Project Leader:David GraffProgramming:George ZipperlinZhibiao WuPersonnel:Alexandra CanavanRecruiters:Elisa Munoz-FrancoLiz O'Connor
UPenn - LDC - 93
train/dr6/mabc0/sa1she had your dark suit in greasy wash water all year train/dr6/mabc0/sa2don't ask me to carry an oily rag like that train/dr6/mabc0/sx331the big dog loved to chew on the old rag doll train/dr6/mabc0/sx61chocolate and roses n
UPenn - LDC - 95
TRAINS Dialog Corpus This CD-ROM contains a corpus of task-oriented dialogs. These dialogs were collected as part of the TRAINS project, a project to develop a conversationally proficient planning assistant, which helps a user construct a plan to ach
Skidmore - ROY - 2
BioModel '02Introduction to Computer Modeling of Biological SystemsBio 370 (2002) Course Introduction/ GuidelinesBeing partly a question and answer session between student and teacher about the course.a) "Say, what is this course about?!" Well.
Skidmore - ROY - 2
Biomodel'02Due Wednesday 11thBiomodeling - Model Set #1Constructing A New Model From Scratch I. Setting up to work A) A new Master Model to work on Always make a new MasterModel folder each time before beginning to work on a new model. To do so
Skidmore - ROY - 2
Mammalian Physiology - Final Paper & Talk Guidelinesrevised 11/21/02The purpose of the final review paper is to have you examine some of the current research literature in respiratory, renal or neural physiology. The purpose of the final talk is t
Skidmore - ROY - 2
BioMod 02Final Project and Talk Evaluation Many have asked about how to approach the final project and how it is evaluated. I thought the material below may be of help. I. Final Project Unlike all other course model evaluations, evaluation of the f
Skidmore - ROY - 2
BioMod02Final*Note : Models done in class are not availbale as projectsProject Model ListSpain (available for perusal in 316 (45132), many have their equivalents in Keen) 8.5 Deer and hunters- age class model age class 8.6 North Sea fisheries
Skidmore - ROY - 2
Systems Neurophysiology- Functional Neuroanatomy F'08 The final stretch schedule version Monday Dec. 1Read Present Present figs PresenterBefore Tuesday Nov. 18 review or learn key figures 1-7,1-8,1-9,3-1,6-1,6-2 Nov. 18 Nov. 19 Nov. 20 Nov. 25 Nov
Skidmore - ROY - 2009
Sample "questions"- Part 1 The items below are meant to get you thinking in a mode similar to that of some of the questions you are likely to see on the exams. They are NOT meant to be exhaustive of the subject in any way; they merely are sample repr
Skidmore - ROY - 2
Systems Neurophysiology- Functional Neuroanatomy F'08 The final stretch schedule version Thursday Nov. 13Read Present Present figs PresenterBefore Tuesday Nov. 18 review or learn key figures 1-7,1-8,1-9,3-1 Nov. 18 Nov. 19 Nov. 20 Nov. 25 Nov. 26