28 Pages

BinderAbstracts2

Course: CS 585, Fall 2008
School: Kentucky
Rating:
 
 
 
 
 

Word Count: 3951

Document Preview

2007 RECOMB - LNBI 4453 CD-ROM RB-Finder: An Improved Distance-Based Sliding Window Method to Detect Recombination Breakpoints Wah-Heng Lee1,2 and Wing-Kin Sung1,2 1Genome 2National Institute of Singapore University of Singapore leewhc@gis.a-star.edu.sg ksung@comp.nus.edu.sg Abstract. Recombination detection is important before inferring phylogenetic relationships. This will eventually lead to a better...

Register Now

Unformatted Document Excerpt

Coursehero >> Kentucky >> Kentucky >> CS 585

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
2007 RECOMB - LNBI 4453 CD-ROM RB-Finder: An Improved Distance-Based Sliding Window Method to Detect Recombination Breakpoints Wah-Heng Lee1,2 and Wing-Kin Sung1,2 1Genome 2National Institute of Singapore University of Singapore leewhc@gis.a-star.edu.sg ksung@comp.nus.edu.sg Abstract. Recombination detection is important before inferring phylogenetic relationships. This will eventually lead to a better understanding of pathogen evolution, more accurate genotyping and advancements in vaccine development. In this paper, we introduce RB-Finder, a fast and accurate distance-based window method to detect recombination in a multiple sequence alignment. Our method introduces a more informative distance measure and a novel weighting strategy to reduce the window size sensitivity problem and hence improve the accuracy of breakpoint detection. Furthermore, our method is faster than existing phylogenybased methods since we do not need to construct and compare complex phylogenetic trees. When compared with the current best method Pruned-PDM, we are about a few hundred times more efficient. Experimental evaluation of RB-Finder using synthetic and biological datasets showed that our method is more accurate than existing phylogeny-based methods. We also show how our method has potential use in other related applications such as genotyping. Keywords: Recombination detection, sliding window, phylogeny, genotyping. LNBI 4453, p. 518 ff. Full article in PDF | BibTeX file:///S|/bibs/4453/44530518.htm (1 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530518.htm (2 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM Multivariate Segmentation in the Analysis of Transcription Tiling Array Data Antonio Piccolboni Affymetrix, Inc., 6650 Vallejo St. Suite 100, Emeryville, CA 94608 antonio_piccolboni@affymetrix.com Abstract. Tiling DNA microarrays extend current microarray technology by probing the nonrepeat portion of a genome at regular intervals in an unbiased fashion. A fundamental problem in the analysis of these data is the detection of genomic regions that are differentially transcribed across multiple conditions. We propose a linear time algorithm based on segmentation techniques and linear modeling that can work at a user-selected false discovery rate. It also attains a four-fold sensitivity gain over the only competing algorithm when applied to a whole genome transcription data set spanning the embryonic development of Drosophila melanogaster. LNBI 4453, p. 311 ff. Full article in PDF | BibTeX lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530311.htm5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM A Bayesian Model That Links Microarray mRNA Measurements to Mass Spectrometry Protein Measurements Anitha Kannan1, Andrew Emili2, and Brendan J. Frey3 Research, Cambridge, UK ankannan@microsoft.com http://www.research.microsoft.com/~ankannan 2Banting 1Microsoft & Best Department of Medical Research, University of Toronto, Canada andrew.emili@utoronto.ca http://emililab.med.utoronto.ca 3Electrical & Computer Engineering, University of Toronto, Canada frey@psi.toronto.edu http://www.psi.toronto.edu Abstract. An important problem in biology is to understand correspondences between mRNA microarray levels and mass spectrometry peptide counts. Recently, a compendium of mRNA expression levels and protein abundances were released for the entire genome of the laboratory mouse, Mus musculus. The availability of these two data sets facilitate using machine learning methods to automatically infer plausible correspondences between the gene products. Knowing these correspondences can be helpful either for predicting protein abundances from microarray data or as an independent source of information that can be used for learning richer models such as regulatory networks. We propose a probabilistic model that relates protein abundances to mRNA expression levels. Using cross-mapped data from the above-mentioned studies, we learn the model and then score the genes for their strength of relationship by performing probabilistic inference in the learned model. While we gave a simplified outline of our technique in a publication aimed at biologists (Cell 2006), in this paper, we give a complete file:///S|/bibs/4453/44530325.htm (1 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM description of the Bayesian model and the computational technique used to perform inference. In addition, we demonstrate that the Bayesian technique achieves mappings with higher statistical significance, compared to standard linear regression and a maximum likelihood version of the proposed model. LNBI 4453, p. 325 ff. Full article in PDF | BibTeX lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530325.htm (2 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM Rearrangements in Genomes with Centromeres Part I: Translocations* Michal Ozery-Flato and Ron Shamir School of Computer Science, Tel-Aviv University, Tel Aviv 69978, Israel ozery@post.tau.ac.il rshamir@post.tau.ac.il Abstract. A centromere is a special region in the chromosome that plays a vital role during cell division. Every new chromosome created by a genome rearrangement event must have a centromere in order to survive. This constraint has been ignored in the computational modeling and analysis of genome rearrangements to date. Unlike genes, the different centromeres are indistinguishable, they have no orientation, and only their location is known. A prevalent rearrangement event in the evolution of multi-chromosomal species is translocation, i.e., the exchange of tails between two chromosomes. A translocation may create a chromosome with no centromere in it. In this paper we study for the first time centromeres-aware genome rearrangements. We present a polynomial time algorithm for computing a shortest sequence of translocations transforming one genome into the other, where all of the intermediate chromosomes must contain centromeres. We view this as a first step towards analysis of more general genome rearrangement models that take centromeres into consideration. *This study was supported in part by the Israeli Science Foundation (grant 309/02). LNBI 4453, p. 339 ff. Full article in PDF | BibTeX lncs@springer.com file:///S|/bibs/4453/44530339.htm (1 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530339.htm (2 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM Identification of Deletion Polymorphisms from Haplotypes Erik Corona1, Benjamin Raphael2, and Eleazar Eskin3 1Dept. of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92092 ecorona@ucsd.edu 2Dept. of Computer Science & Center for Computational Molecular Biology, Brown University, Providence, RI 02912 braphael@brown.edu 3Dept. of Computer Science, Dept. of Human Genetics, University of California, Los Angeles, CA 90095 eeskin@cs.ucla.edu Abstract. Numerous efforts are underway to catalog genetic variation in human populations. While the majority of studies of genetic variation have focused on single base pair differences between individuals, i.e. single nucleotide polymorphisms (SNPs), several recent studies have demonstrated that larger scale structural variation including copy number polymorphisms and inversion polymorphisms are also common. However, direct techniques for detection and validation of structural variants are generally much more expensive than detection and validation of SNPs. For some types of structural variation, in particular deletions, the polymorphism produces a distinct signature in the SNP data. In this paper, we describe a new probabilistic method for detecting deletion polymorphisms from SNP data. The key idea in our method is that we estimate the frequency of the haplotypes in a region of the genome both with and without the possibility of a deletion in the region and apply a generalized likelihood ratio test to assess the significance of a deletion. Application of our method to the HapMap Phase I data revealed 319 candidate deletions, 142 of these overlap with variants identified in earlier studies, while 177 are novel. Using Phase II HapMap data we predict 6730 deletions. file:///S|/bibs/4453/44530354.htm (1 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM LNBI 4453, p. 354 ff. Full article in PDF | BibTeX lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530354.htm (2 of 2)5/26/2007 12:12:23 PM RECOMB 2007 - LNBI 4453 CD-ROM Minimizing and Learning Energy Functions for Side-Chain Prediction Chen Yanover1, Ora Schueler-Furman2, and Yair Weiss1 1School of Computer Science and Engineering, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel cheny@cs.huji.ac.il yweiss@cs.huji.ac.il 2Department of Molecular Genetics and Biotechnology, Hadassah Medical School, The Hebrew University of Jerusalem, 91120 Jerusalem, Israel oraf@ekmd.huji.ac.il Abstract. Side-chain prediction is an important subproblem of the general protein folding problem. Despite much progress in side-chain prediction, performance is far from satisfactory. As an example, the ROSETTA program that uses simulated annealing to select the minimum energy conformations, correctly predicts the first two side-chain angles for approximately 72% of the buried residues in a standard data set. Is further improvement more likely to come from better search methods, or from better energy functions? Given that exact minimization of the energy is NP hard, it is difficult to get a systematic answer to this question. In this paper, we present a novel search method and a novel method for learning energy functions from training data that are both based on Tree Reweighted Belief Propagation (TRBP). We find that TRBP can find the global optimum of the ROSETTA energy function in a few minutes of computation for approximately 85% of the proteins in a standard benchmark set. TRBP can also effectively bound the partition function which enables using the Conditional Random Fields (CRF) framework for learning. Interestingly, finding the global minimum does not significantly improve side-chain prediction for an energy function based on ROSETTA's default energy terms (less than 0.1%), while file:///S|/bibs/4453/44530381.htm (1 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM learning new weights gives a significant boost from 72% to 78%. Using a recently modified ROSETTA energy function with a softer Lennard-Jones repulsive term, the global optimum does improve prediction accuracy from 77% to 78%. Here again, learning new weights improves side-chain modeling even further to 80%. Finally, the highest accuracy (82.6%) is obtained using an extended rotamer library and CRF learned weights. Our results suggest that combining machine learning with approximate inference can improve the state-of-the-art in side-chain prediction. LNBI 4453, p. 381 ff. Full article in PDF | BibTeX lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530381.htm (2 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Protein Conformational Flexibility Analysis with Noisy Data Anshul Nigham1 and David Hsu2 1SingaporeMIT Alliance, Singapore 117576, Singapore anshulni@comp.nus.edu.sg 2National University of Singapore, Singapore 117543, Singapore dyhsu@comp.nus.edu.sg Abstract. Protein conformational changes play a critical role in biological functions such as ligand-protein and protein-protein interactions. Due to the noise in structural data, determining salient conformational changes reliably and efficiently is a challenging problem. This paper presents an efficient algorithm for analyzing protein conformational changes, using noisy data. It applies a statistical flexibility test to all contiguous fragments of a protein and combines the information from these tests to compute a consensus flexibility measure for each residue of the protein. We tested the algorithm, using data from the Protein Data Bank and the Macromolecular Movements Database. The results show that our algorithm can reliably detect different types of salient conformational changes, including well-known examples such as hinge and shear, as well as the flap motion of HIV-1 protease. The software implementing our algorithm is available at http://motion.comp.nus.edu.sg/projects/ proflexana/proflexana.html. LNBI 4453, p. 396 ff. Full article in PDF | BibTeX file:///S|/bibs/4453/44530396.htm (1 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530396.htm (2 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Deterministic Pharmacophore Detection Via Multiple Flexible Alignment of Drug-Like Molecules Yuval Inbar1, Dina Schneidman-Duhovny1, Oranit Dror1, Ruth Nussinov2,3, and Haim J. Wolfson1 of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel, Fax: +972-3-640 6476 inbaryuv@tau.ac.il 2Sackler 1School Inst. of Molecular Medicine, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel Research Program, SAIC-Frederick, Inc. Center for Cancer Research, Nanobiology Program NCI-Frederick, Frederick, MD 21702, USA Abstract. We present a novel highly efficient method for the detection of a pharmacophore from a set of ligands/drugs that interact with a target receptor. A pharmacophore is a spatial arrangement of physico-chemical features in a ligand that is responsible for the interaction with a specific receptor. In the absence of a known 3D receptor structure, a pharmacophore can be identified from a multiple structural alignment of the ligand molecules. The key advantages of the presented algorithm are: (a) its ability to multiply align flexible ligands in a deterministic manner, (b) its ability to focus on subsets of the input ligands, which may share a large common substructure, resulting in the detection of both outlier molecules and alternative binding modes, and (c) its computational efficiency, allows which to detect pharmacophores shared by a large number of molecules on a standard PC. The algorithm was extensively tested on a dataset of almost 80 ligands acting on 12 different receptors. The results, which were achieved using a standard default parameter set, were consistent with reference pharmacophores that were derived from the bound ligand-receptor complexes. The pharmacophores detected by the algorithm are expected to be a key component in the discovery of new leads by screening large drug-like molecule databases. 3Basic file:///S|/bibs/4453/44530412.htm (1 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Supplementary Material: http://bioinfo3d.cs.tau.ac.il/pharma/supp.html Keywords: Computer-Aided Drug Design (CADD), Rational Drug Discovery, 3D Molecular Similarity, 3D Molecular Superposition. LNBI 4453, p. 412 ff. Full article in PDF | BibTeX lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530412.htm (2 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Design of Compact, Universal DNA Microarrays for Protein Binding Microarray Experiments Anthony A. Philippakis1,3,4, Aaron M. Qureshi1,5, Michael F. Berger1,4, and Martha L. Bulyk1,2,3,4 1Division of Genetics, Department of Medicine and 2Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115 3Harvard/MIT Division of Health Sciences and Technology (HST), Cambridge, MA 02138 mlbulyk@receptor.med.harvard.edu 4Harvard University Graduate Biophysics Program, Cambridge, MA 02138 of Mathematics, University of Maryland, College Park, MD 20742 5Department Abstract. Our group has recently developed a compact, universal protein binding microarray (PBM) that can be used to determine the binding preferences of transcription factors (TFs) [1]. This design represents all possible sequence variants of a given length k (i.e., all k-mers) on a single array, allowing a complete characterization of the binding specificities of a given TF. Here, we present the mathematical foundations of this design based on de Bruijn sequences generated by linear feedback shift registers. We show that these sequences represent the maximum number of variants for any given set of array dimensions (i.e., number of spots and spot lengths), while also exhibiting desirable pseudo-randomness properties. Moreover, de Bruijn sequences can be selected that represent gapped sequence patterns, further increasing the coverage of the array. This design yields a powerful experimental platform that allows the binding preferences of TFs to be determined with unprecedented resolution. file:///S|/bibs/4453/44530430.htm (1 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Keywords: de Bruijn sequences, linear feedback shift registers, protein binding microarrays, motif, transcription factor. LNBI 4453, p. 430 ff. Full article in PDF | BibTeX lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530430.htm (2 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Improved Ranking Functions for Protein and ModificationSite Identifications Marshall Bern and David Goldberg Palo Alto Research Center, 3333 Coyote Hill Rd., Palo Alto, CA 94304, USA bern@parc.com goldberg@parc.com Abstract. There are a number of computational tools for assigning identifications to peptide tandem mass spectra, but many fewer tools for the crucial next step of integrating spectral identifications into higher-level identifications, such as proteins or modification sites. Here we describe a new program called ComByne for scoring and ranking higher-level identifications. We compare ComByne to existing algorithms on several complex biological samples, including a sample of mouse blood plasma spiked with known concentrations of human proteins. A Web interface to our software is at http://bio.parc.xerox.com. LNBI 4453, p. 444 ff. Full article in PDF | BibTeX lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530444.htm5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Peptide Retention Time Prediction Yields Improved Tandem Mass Spectrum Identification for Diverse Chromatography Conditions Aaron A. Klammer, Xianhua Yi, Michael J. MacCoss, and William Stafford Noble Genome Sciences Department, University of Washington, 1705 NE Pacific Street, Seattle, WA 98195-5065 aklammer@gs.washington.edu xhyi@gs.washington.edu maccoss@gs.washington.edu noble@gs.washington.edu Abstract. Most tandem mass spectrum identification algorithms use information only from the final spectrum, ignoring precursor information such as peptide retention time (RT). Efforts to exploit peptide RT for peptide identification can be frustrated by its variability across liquid chromatography analyses. We show that peptide RT can be reliably predicted by training a support vector regressor on a single chromatography run. This dynamically trained model outperforms a published statically trained model of peptide RT across diverse chromatography conditions. In addition, the model can be used to filter peptide identifications that produce large discrepancies between observed and predicted RT. After filtering, estimated true positive peptide identifications increase by as much as 50% at a false discovery rate of 3%, with the largest increase for non-specific cleavage with elastase. Keywords: Mass spectrometry, proteomics, peptide identification, retention time, chromatography, machine learning, support vector regression. LNBI 4453, p. 459 ff. Full article in PDF | BibTeX file:///S|/bibs/4453/44530459.htm (1 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530459.htm (2 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM A Fast and Accurate Algorithm for the Quantification of Peptides from Mass Spectrometry Data Ole Schulz-Trieglaff1,2, Rene Hussong3, Clemens Grpl2, Andreas Hildebrandt3, and Knut Reinert2 1Max Planck Research School, Berlin, Germany trieglaf@inf.fu-berlin.de 2Department 3Center of Computer Science and Mathematics, Free University Berlin for Bioinformatics, Saarland University Abstract. Liquid chromatography combined with mass spectrometry (LC-MS) has become the prevalent technology in high-throughput proteomics research. One of the aims of this discipline is to obtain accurate quantitative information about all proteins and peptides in a biological sample. Due to size and complexity of the data generated in these experiments, this problem remains a challenging task requiring sophisticated and efficient computational tools. We propose an algorithm that can quantify even low abundance peptides from LC-MS data. Our approach is flexible and can be applied to preprocessed and raw instrument data. It is based on a combination of the sweep line paradigm with a novel wavelet function tailored to detect isotopic patterns. We evaluate our technique on several data sets of varying complexity and show that we are able to rapidly quantify peptides with high accuracy in a sound algorithmic framework. LNBI 4453, p. 473 ff. Full article in PDF | BibTeX file:///S|/bibs/4453/44530473.htm (1 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM lncs@springer.com Springer-Verlag Berlin Heidelberg 2007 file:///S|/bibs/4453/44530473.htm (2 of 2)5/26/2007 12:12:24 PM RECOMB 2007 - LNBI 4453 CD-ROM Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient Algorithms Yufeng Wu Department of Computer Science, University of California, Davis, Davis, CA 95616, U.S.A. wuyu@cs.ucdavis.edu Abstract. Association, or LD (linkage disequilibrium), mapping is an intensely-studied approach to gene mapping (genome-wide or in candidate regions) that is widely hoped to be able to efficiently locate genes influencing both complex and Mendelian traits. The logic underlying association mapping implies that the best possible mapping results would be obtained if the genealogical history of the sampled individuals were explicitly known. Such a history would be in the form of an "ancestral recombination graph (ARG)". But despite the conceptual importance of genealogical histories to association mapping, few practical association mapping methods have explicitly used derived genealogical aspects of ARGs. Two notable exceptions are [35] and [23]. In this paper we develop an association mapping method that explicitly constructs and samples minARGs (ARGs that minimize the ...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Kentucky - CS - 585
Basic BLAST Infohttp:/www.sp.uconn.edu/~mcb232vc/blast.htmlNLM Tutorialshttp:/www.ncbi.nlm.nih.gov/Education/blasttutorial.htmlhttp:/www.ncbi.nlm.nih.gov/Education/blasttutorial.htmlWU SLUhttp:/blast.wustl.edu/blast/~
Kentucky - CS - 585
CS-585 - Special Topics: Algorithms and Software for Bioinformatics Jerzy W. Jaromczyk Brief description for Summer 2007 Location: 265 FPAT and B-35 Young Library Time: MTWRF 8:00 AM to 10:00 AM Textbook: We will use tools, manuals, papers and resour
Kentucky - CS - 585
Class,this Memorial Day weekend is the second to the last weekend ofour semester.Judging by the results of the review that we held last Friday,the first part of the semester was successful. Although certainlynot all of you were equal in the nu
Kentucky - CS - 585
CS-585 Special Topics: "Algorithms and Software for Bioinformatics"4-weeks session, Dr. JaromczykMay 8 - June 5, 20078:00 AM to 10:00 AMCan be used by CS graduate and undergradute students. Backgroundin programming and algorithms is expected.D
Washington - P - 209
In-class activity 4 Sanocki, T. (2001). Interaction of scale and time during object identification. Journal of Experimental Psychology: Human Perception and Performance, 27(2), 290-302. In parentheses In text st 1 time (Sanocki, 2001) Sanocki (2001)
Kentucky - CS - 585
CS-585 Summer 2007 JWJ CS UK Assignment # 3 Due: In Class Learning objectives: Algorithms for sequencing and for global sequence alignment. Random properties of sequences. Specifications: 3-1 Consider the following SBH method. The DNA chip is designe
Kentucky - CS - 585
CS-585 Summer 2007 JWJ CS UK Longest Subsequence Learning objectives: Dynamic Programming Problem and its solution: Let S = (a1 , ., an ) be a sequence of n numbers. Provide a recursive formula for l(i), the length of the longest monotonically increa
Kentucky - CS - 585
Matt Ryavec <Matt.Ryavec@gmail.com>: Pairwise Global Alignment of Protein Interaction Networks by Matching Neighborh ood Topology by Rohit Singh, Jinbo Xu, and Bonnie Berger http:/www.cs.uky.edu/~jurek/cs585/cs585su07/RECOMB/READINGS-RECOMB/445300
Kentucky - CS - 585
CS-585 Summer 2007 JWJ CS UK Phylogenetic Trees - NJ algorithm1Algorithm: the Neighbor-Joining MethodInstance: a set L of n point and a matrix dij (symmetric) of additive distances. Question: What is the Phylogenetic tree based on the Neighbour
Kentucky - CS - 585
MVVSGAPPAL GGGCLGTFTS LLLLASTAIL NAARIPVPPA CGKPQQLNRV VGGEDSTDSEWPWIVSIQKN GTHHCAGSLL TSRWVITAAH CFKDNLNKPY LFSVLLGAWQ LGNPGSRSQKVGVAWVEPHP VYSWKEGACA DIALVRLERS IQFSERVLPI CLPDASIHLP PNTHCWISGWGSIQDGVPLP HPQTLQKLKV PIIDSEVCSH LYWRGAGQGP ITEDMLC
Kentucky - CS - 585
http:/swift.cmbi.ru.nl/gvteach/SBC/Run Clustal for this sequences.which residues in the alignment are best/worst conserved?Use the alignment scores to compute distances between sequences using the formula (100 - a[i,j]) / 100Find the parsimo
Kentucky - CS - 585
http:/swift.cmbi.ru.nl/gvteach/SBC/Use this similarity matrix to construct the phylogenetictree:100 33 33 46 6 20 2033 100 20 86 6 20 2633 20 100 33 13 6 646 86 33 100 0 6 26 6 6 13 0 100 46 4020 20 6 6 46 100 3320 26 6 26 40 33
Kentucky - CS - 585
CS-585 Summer 2007 JWJ CS UK Review for Sequence Alignments and Phylogenetic trees Answer the following questions. Write your answer in a file (MSoft document, for example) and then turn it in. A: These questions pertain to pairwise alignments and BL
Kentucky - CS - 585
Using the PAM250 matrix compute te score for the following alignment.The score is the sum of the pairwise scores.Score of an alignment is the sum of the scores of all pairs of residues in the alignmentsequence 1: TCCPSIVARSNsequence 2: SCCPSISA
Kentucky - CS - 585
http:/www.ebi.ac.uk/clustalw/>TLN itgtstvgvgrgvlgdqkninttystyyylqdntrgngiftydakyrttlpgslwadadnqffasydapavdahyyagvtydyyknvhnrlsydgnnaairssvhysqgynnafwngsqmvygdgdgqtfiplsggidvvahelthavtdytagliyqnesgaineaisdifgtlvefyanknpdweigedvytpgisgdslrsmsdpak
Kentucky - CS - 585
>crab_anapl ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN). MDITIHNPLIRRPLFSWLAPSRIFDQIFGEHLQESELLPASPSLSPFLMRSPIFRMPSWLETGLSEMRLEKDKFSVNLDVKHFSPEELKVKVLGDMVEIHGKHEERQDEHGFIAREFNRKYRIPADVDPLTITSSLSLDGVLTVSAPRKQSDVPERSIPITREEKPAIAGAQRK
Berkeley - I - 271
It took too much time to find I lost a lot of time trying to find I didn't like the way they organized the information I would organize in another way I would prefer to use a search tool instead of browsing their categories I frustrated, I did not fi
Kentucky - CS - 585
Network Legos: Building Blocks of Cellular Wiring DiagramsT.M. Murali and Corban G. Rivera660 McBryde Hall, Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg VA 24061Abstract. Publicly-available data
Kentucky - CS - 585
A Feature-Based Approach to Modeling Protein-DNA InteractionsEilon Sharon and Eran SegalDepartment of Computer Science, Weizmann Institute of Science, Rehovot, 76100, Israel {eilon.sharon,eran.segal}@weizmann.ac.il http:/genie.weizmann.ac.ilAbstr
Kentucky - CS - 585
Network Motif Discovery Using Subgraph Enumeration and Symmetry-BreakingJoshua A. Grochow and Manolis KellisComputer Science and AI Laboratory, M.I.T. Broad Institute of M.I.T. and Harvard joshuag@cs.uchicago.edu, manoli@mit.eduAbstract. The stud
Kentucky - CS - 585
Framework for Identifying Common Aberrations in DNA Copy Number DataAmir Ben-Dor1, , Doron Lipson2 , Anya Tsalenko1 , Mark Reimers3 , Lars O. Baumbusch4 , Michael T. Barrett1,5 , John N. Weinstein3 , Anne-Lise Brresen-Dale4, and Zohar Yakhini1,2Agi
Kentucky - CS - 585
Estimating Genome-Wide Copy Number Using Allele Specific Mixture ModelsWenyi Wang1 , Benilton Carvalho2 , Nate Miller3 , Jonathan Pevsner4, Aravinda Chakravarti5, and Rafael A. Irizarry6,Department of Biostatistics, Johns Hopkins Bloomberg School o
Kentucky - CS - 585
GIMscan: A New Statistical Method for Analyzing Whole-Genome Array CGH DataYanxin Shi1 , Fan Guo1 , Wei Wu2 , and Eric P. Xing1,1School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213 epxing@cs.cmu.edu 2 Division of Pulmona
Kentucky - CS - 585
Production-Passage-Time Approximation: A New Approximation Method to Accelerate the Simulation Process of Enzymatic ReactionsHiroyuki Kuwahara1 and Chris Myers2School of Computing, University of Utah Salt Lake City, UT 84112, U.S.A. kuwahara@cs.uta
Kentucky - CS - 585
Shift-Invariant Adaptive Double Threading: Learning MHC II - Peptide BindingNoah Zaitlen1 , Manuel Reyes-Gomez2 , David Heckerman2 , and Nebojsa Jojic2,1University of California San Diego, La Jolla CA 92093, USA 2 Microsoft Research, Redmond WA 9
Kentucky - CS - 585
Reconstructing the Phylogeny of Mobile ElementsSean O'Rourke1, Noah Zaitlen2 , Nebojsa Jojic3 , and Eleazar Eskin41Dept. of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92092 sorourke@ucsd.edu 2 Dept. of Bio
Kentucky - CS - 585
Beyond Galled Trees - Decomposition and Computation of Galled NetworksDaniel H. Huson and Tobias H. Klpper oCenter for Bioinformatics (ZBIT), Tbingen University, u Sand 14, 72076 Tbingen, Germany uAbstract. Reticulate networks are a type of phylo
Kentucky - CS - 585
Variational Upper Bounds for Probabilistic Phylogenetic ModelsYdo Wexler and Dan GeigerDept. of Computer Science, Technion - Israel Institute of Technology, Haifa 32000, Israel {ywex,dang}@cs.technion.ac.ilAbstract. Probabilistic phylogenetic mod
Kentucky - CS - 585
Heuristics for the Gene-Duplication Problem: A (n) Speed-Up for the Local SearchMukul S. Bansal1 , J. Gordon Burleigh2 , Oliver Eulenstein1 , and Andr Wehe3 eDepartment of Computer Science, Iowa State University, Ames, IA, USA {bansal,oeulenst}@cs.
Kentucky - CS - 585
Support Vector Training of Protein Alignment ModelsChun-Nam John Yu1 , Thorsten Joachims1, Ron Elber1 , and Jaroslaw Pillardy21Dept. of Computer Science, Cornell University, Ithaca NY 14853, USA {cnyu,tj,ron}@cs.cornell.edu 2 Cornell Theory Cente
Kentucky - CS - 585
Tools for Simulating and Analyzing RNA Folding KineticsXinyu Tang, Shawna Thomas, Lydia Tapia, and Nancy M. AmatoParasol Lab, Dept. of Comp. Sci., Texas A&M University, College Station, TX 77843Abstract. It has recently been found that some RNA f
Kentucky - CS - 585
Multiple Sequence Alignment Based on Profile Alignment of Intermediate SequencesYue Lu1 and Sing-Hoi Sze1,2Department of Biochemistry & Biophysics 2 Department of Computer Science, Texas A&M University, College Station, TX 77843, USA1Abstract. D
Kentucky - CS - 585
Multivariate Segmentation in the Analysis of Transcription Tiling Array DataAntonio PiccolboniAffymetrix, Inc., 6650 Vallejo St. Suite 100, Emeryville, CA 94608 antonio piccolboni@affymetrix.comAbstract. Tiling DNA microarrays extend current micr
Kentucky - CS - 585
Rearrangements in Genomes with Centromeres Part I: TranslocationsMichal Ozery-Flato and Ron ShamirSchool of Computer Science, Tel-Aviv University, Tel Aviv 69978, Israel {ozery,rshamir}@post.tau.ac.ilAbstract. A centromere is a special region in
Kentucky - CS - 585
Identification of Deletion Polymorphisms from HaplotypesErik Corona1, Benjamin Raphael2 , and Eleazar Eskin3Dept. of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92092 ecorona@ucsd.edu 2 Dept. of Computer Scie
Kentucky - CS - 585
Free Energy Estimates of All-Atom Protein Structures Using Generalized Belief PropagationHetunandan Kamisetty, Eric P. Xing, and Christopher J. LangmeadComputer Science Department, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 1521
Kentucky - CS - 585
Minimizing and Learning Energy Functions for Side-Chain PredictionChen Yanover1, Ora Schueler-Furman2, and Yair Weiss1School of Computer Science and Engineering, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel {cheny,yweiss}@cs.huji.ac.
Kentucky - CS - 585
Protein Conformational Flexibility Analysis with Noisy DataAnshul Nigham1 and David Hsu2SingaporeMIT Alliance, Singapore 117576, Singapore anshulni@comp.nus.edu.sg National University of Singapore, Singapore 117543, Singapore dyhsu@comp.nus.edu.sg
Kentucky - CS - 585
Deterministic Pharmacophore Detection Via Multiple Flexible Alignment of Drug-Like MoleculesYuval Inbar1 , Dina Schneidman-Duhovny1 , Oranit Dror1 , Ruth Nussinov2,3 , and Haim J. Wolfson11School of Computer Science, Raymond and Beverly Sackler F
Kentucky - CS - 585
Design of Compact, Universal DNA Microarrays for Protein Binding Microarray ExperimentsAnthony A. Philippakis1,3,4,*, Aaron M. Qureshi1,5,*, Michael F. Berger1,4, and Martha L. Bulyk1,2,3,4,*1 Division of Genetics, Department of Medicine and Depart
Kentucky - CS - 585
Improved Ranking Functions for Protein and Modification-Site IdentificationsMarshall Bern and David GoldbergPalo Alto Research Center, 3333 Coyote Hill Rd., Palo Alto, CA 94304, USA bern@parc.com, goldberg@parc.comAbstract. There are a number of
Kentucky - CS - 585
Peptide Retention Time Prediction Yields Improved Tandem Mass Spectrum Identification for Diverse Chromatography ConditionsAaron A. Klammer, Xianhua Yi, Michael J. MacCoss, and William Stafford NobleGenome Sciences Department University of Washingt
Kentucky - CS - 585
A Fast and Accurate Algorithm for the Quantification of Peptides from Mass Spectrometry DataOle Schulz-Trieglaff1,2, Rene Hussong3 , Clemens Grpl2 , o Andreas Hildebrandt3 , and Knut Reinert2Max Planck Research School, Berlin, Germany trieglaf@inf.
Kentucky - CS - 585
Association Mapping of Complex Diseases with Ancestral Recombination Graphs: Models and Efficient AlgorithmsYufeng WuDepartment of Computer Science University of California, Davis Davis, CA 95616, U.S.A. wuyu@cs.ucdavis.eduAbstract. Association,
Kentucky - CS - 585
An Efficient and Accurate Graph-Based Approach to Detect Population SubstructureSrinath Sridhar1 , Satish Rao2 , and Eran Halperin3Computer Science Dept, Carnegie Mellon University srinath@cs.cmu.edu Computer Science Dept, University of California,
Kentucky - CS - 585
RB-Finder: An Improved Distance-Based Sliding Window Method to Detect Recombination BreakpointsWah-Heng Lee1,2 and Wing-Kin Sung1,2Genome Institute of Singapore National University of Singapore leewhc@gis.a-star.edu.sg, ksung@comp.nus.edu.sg2 1A
Kentucky - CS - 585
Comparative Analysis of Spatial Patterns of Gene Expression in Drosophila melanogaster Imaginal DiscsCyrus L. Harmon1 , Parvez Ahammad2 , Ann Hammonds1 , Richard Weiszmann3 , Susan E. Celniker3 , S. Shankar Sastry2 , and Gerald M. Rubin11 Departmen
Kentucky - CS - 585
RECOMB 2007 - LNBI 4453 CD-ROMPairwise Global Alignment of Protein Interaction Networks by Matching Neighborhood TopologyRohit Singh1, Jinbo Xu2, and Bonnie Berger*11ComputerScience and AI Lab., Massachusetts Institute of Technology rsingh@mit.
Kentucky - CS - 585
RECOMB 2007 - LNBI 4453 CD-ROMRB-Finder: An Improved Distance-Based Sliding Window Method to Detect Recombination BreakpointsWah-Heng Lee1,2 and Wing-Kin Sung1,21Genome 2NationalInstitute of SingaporeUniversity of Singapore leewhc@gis.a-star.
Kentucky - CS - 585
CS 585 Designing GamesSummer 2008JWJ-CS-UKFor CS 585 students only. Please do not distribute.=
Elon - CSC - 176
Security and Privacy Paranoia How much computer communication can be considered private? How likely are we to put important information on the internet? In email? We can't continue to use the computer for business transactions or person
Elon - CSC - 176
Program TranslationModule 6, Analytical EngineReview We have learned. about local applications, global applications, and the logical problem solving that programmers use to build themToday: How the Computer Works Binary Representation of D
Elon - CSC - 331
Exam 2 Review1. A sort whose main operation is finding whether or not 2 elements are <, >, or = is called.2. A sort in which equal elements stay in their original order is called.3. Give an example of an in-place sort.4. What is the runni
Elon - CSC - 331
CSC 331: Algorithm AnalysisCryptographyThe Rivest-Shamir-Adleman (RSA) cryptosystem, uses all the ideas we have talked about so far.Polynomial-time computability: modular exponentiation, greatest common divisor, primality testingExponential-t
Arizona - CE - 467
CE 467 Highway Safety and OperationsGroup exercise Effect of driver factors on safety Thu Sep 2, 2004Bill Hanson Ian Penn Bob SmithFactor Eyesight of driver Nearsightedness Farsightedness Focus time Peripheral vision Glare Recovery Light sensit
Arizona - CE - 467
Environmental Factors that Cause AccidentsFactorHeat Ice Wind Raingenerated by Tom, Judd and LisaExpected EffectRaises risk while driving Icy roads Low visibility from dust Low visibility, hydroplane, slick roads, flooding Low visibility, Less
Arizona - CE - 467
OPERATIONAL FACTORS AND SAFETYFactor Pedestrians walking in the roadway Expected Effect Pedestrian accidents Explanation Pedestrians are particularly vulnerable when sharing the road environment with motor vehicles. When mixing with traffic it is li
Arizona - CE - 467
Roadway and Roadsides Roadside SafetyFactor Expected Effect Proper Signalization Lower accident rate Lower speeds Speed Higher speed causes more accedients Lane Width Shoulder Width Access Tighter Lanes Smaller shoulder widths Explanation Proper pha
Arizona - CE - 467
DOT HS 809 481U.S. Department of Transportation National Highway Traffic Safety AdministrationTraffic Safety Facts 2001State Alcohol EstimatesA Public Information Fact Sheet on Motor Vehicle and Traffic Safety Published by the National Highway
Arizona - CE - 467
Vehicle FactorsFactor Expected Effect ExplanationConservation of momentum and energy There is a greater moment arm with a higher center of gravity A tall vehicle in a crash will traverse over a shorter vehicle roadFundamental Vehicle Characterist
Elon - CSC - 331
Greedy AlgorithmsCSC 331: Algorithm AnalysisGreedy AlgorithmsHuffman EncodingIn the MP3 audio compression scheme, a sound signal is encoded in three steps. Its is digitized by sampling at regular intervals, yielding a sequence of real numbers.