Predicting phenotypic traits of prokaryotes from protein domain frequencies

2010 | journal article. A publication with affiliation to the University of Göttingen.

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​Predicting phenotypic traits of prokaryotes from protein domain frequencies​
Lingner, T.; Muehlhausen, S. ; Gabaldon, T.; Notredame, C. & Meinicke, P. ​ (2010) 
BMC Bioinformatics11 art. 481​.​ DOI: https://doi.org/10.1186/1471-2105-11-481 

Documents & Media

Lingner.pdf352.78 kBAdobe PDF

License

Published Version

Special user license Goescholar License

Details

Authors
Lingner, Thomas; Muehlhausen, Stefanie ; Gabaldon, Toni; Notredame, Cedric; Meinicke, Peter 
Abstract
Background: Establishing the relationship between an organism's genome sequence and its phenotype is a fundamental challenge that remains largely unsolved. Accurately predicting microbial phenotypes solely based on genomic features will allow us to infer relevant phenotypic characteristics when the availability of a genome sequence precedes experimental characterization, a scenario that is favored by the advent of novel high-throughput and single cell sequencing techniques. Results: We present a novel approach to predict the phenotype of prokaryotes directly from their protein domain frequencies. Our discriminative machine learning approach provides high prediction accuracy of relevant phenotypes such as motility, oxygen requirement or spore formation. Moreover, the set of discriminative domains provides biological insight into the underlying phenotype-genotype relationship and enables deriving hypotheses on the possible functions of uncharacterized domains. Conclusions: Fast and accurate prediction of microbial phenotypes based on genomic protein domain content is feasible and has the potential to provide novel biological insights. First results of a systematic check for annotation errors indicate that our approach may also be applied to semi-automatic correction and completion of the existing phenotype annotation.
Issue Date
2010
Status
published
Publisher
Biomed Central Ltd
Journal
BMC Bioinformatics 
ISSN
1471-2105
Sponsor
German Academic Exchange Service (DAAD)

Reference

Citations


Social Media