Use of Short Amino Acid Motifs in the Computational Analysis of Protein Diversity and Function
Loading...
Authors
Dussaq, Alex M.
Issue Date
2010
Type
Thesis
Language
en_US
Keywords
Alternative Title
Abstract
The explosion of whole genome sequence and environmental sequence data afford us the opportunity to explore protein diversity and protein function. This is particularly exciting given the nascent field of synthetic biology. A comprehensive computational analysis of extant proteins is needed in order to define the limitations on protein structure and diversity from a bioengineering perspective. This paper focuses on defining an upper limit for protein diversity using computational approaches derived from linguistic analyses. These methods are used to make a prediction on the upper limit of unique proteins and number of highly conserved motifs. Motifs deemed highly conserved will, more than likely represent important structural components of basic proteins. Results were gathered from two large data sets: all of the currently available microbial genome sequences available from NCBI and the Global Ocean Survey data set. There were 6.6 million unique proteins at 95% amino acid identity. The majority of unique motifs in these data sets were only found once. The motifs deemed highly conserved in lifestyle groupings of organisms and individual organisms were analyzed for function based on a conserved domain search. The importance between pathogenicity and cell motility and secretion related genes and proteins was observed. These motifs represent potential new drug targets or areas of future experimentation.
Description
The University of Nevada, Reno Libraries will promptly respond to removal requests related to content that violates intellectual property laws, data protections, or has been uploaded without creator consent. Takedown notices should be directed to our ScholarWolf team (scholarwolf@library.unr.edu) with information about the object, including its full URL and the nature of your complaint.
Citation
Publisher
License
In Copyright(All Rights Reserved)