Biological Data Analysis (CSE 182): Assignment 3

Logistics

Submit a hard copy containing the code and results. Create a compressed file containing the code and output as separate files, and email it to Julio Ng.

Sequence Alignment and Gap penalties

1. Build an automaton for a dictionary containing 3 words. Show all failure links and transition links. Submit a sheet of paper with the automaton hand-drawn. The words are: CAMPERS, AMPERE, and AMINO (18 pts.).

2. You are given the following: a database D (represented as a single sequence), a family F of 20 sequences, and a scoring matrix M (40 pts.).

   (a) Design an appropriate algorithm to find homologs of F in D. Submit a written (pseudo-code) description of the algorithm, and your reasoning on why it is appropriate.

   (b) Implement the algorithm, and apply it to finding novel homologs of F in the database D. Report the homologs you found in the output file.

   (c) Compute an empirical P-value for the homologs, by first computing a distribution of scores on a random database.
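
As an illustration of part 2(c), the following is a minimal sketch of one common way to estimate an empirical P-value. It is not a prescribed solution. It assumes the random database is made by shuffling the residues of D (which preserves composition), and it assumes a caller-supplied function `best_score` that returns the top homolog score for a given database. The names `shuffled_database`, `null_score_distribution`, `empirical_p_value`, and `best_score` are hypothetical and do not come from the assignment.

```python
import random


def shuffled_database(database, rng):
    """Return a random database with the same residue composition as `database`.

    Assumption: shuffling is used as the null model; other models
    (e.g. sampling residues independently by frequency) are equally possible.
    """
    residues = list(database)
    rng.shuffle(residues)
    return "".join(residues)


def null_score_distribution(database, best_score, trials=100, seed=0):
    """Score `trials` shuffled copies of the database.

    `best_score` is a hypothetical callable: database string -> top score
    of the search algorithm from part (b).
    """
    rng = random.Random(seed)
    return [best_score(shuffled_database(database, rng)) for _ in range(trials)]


def empirical_p_value(observed_score, null_scores):
    """Estimate P(null score >= observed score).

    The +1 in numerator and denominator keeps the estimate above zero
    when no null score reaches the observed one.
    """
    exceed = sum(1 for s in null_scores if s >= observed_score)
    return (exceed + 1) / (len(null_scores) + 1)
```

With this estimator the smallest reportable P-value is 1 / (trials + 1), so the number of shuffled databases bounds the resolution of the result.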