Stability of multiple alignments and phylogenetic trees: an analysis of ABC-transporter proteins family

2008 | journal article. A publication with affiliation to the University of Göttingen.


Cite this publication

Stability of multiple alignments and phylogenetic trees: an analysis of ABC-transporter proteins family
Wagner, H.; Morgenstern, B. & Dress, A. (2008)
Algorithms for Molecular Biology 3, art. 15. DOI: https://doi.org/10.1186/1748-7188-3-15

Documents & Media

Wagner_Dress.pdf (352.48 kB, Adobe PDF)

License

Published Version

Special user license Goescholar License

Details

Authors
Wagner, Holger; Morgenstern, Burkhard; Dress, Andreas
Abstract
Background: Sequence-based phylogeny reconstruction is a fundamental task in Bioinformatics. Practically all methods for phylogeny reconstruction are based on multiple alignments. The quality and stability of the underlying alignments are therefore crucial for phylogenetic analysis. Results: In this short report, we investigate alignments and alignment-based phylogenies constructed for a set of 22 ABC transporters using CLUSTAL W and DIALIGN. Comparing the 22 "one-out phylogenies" that can be obtained for this sequence set (each built after leaving out one of the 22 sequences), some intrinsic phylogenetic instability is observed, even if attention is restricted to branches with high bootstrapping frequencies, the so-called safe branches. We show that this instability arises because both CLUSTAL W and DIALIGN apparently get "confused" by sequence repeats in some of the ABC transporters. To deal with such problems, two new DIALIGN options are introduced that prove helpful in our context: the "exclude-fragment" (or "xfr") option and the "self-comparison" (or "sc") option. Conclusion: "One-out strategies", known to be a useful tool for testing the stability of all sorts of data-analysis procedures, can also be used successfully to test alignment stability. If instabilities are observed, the sequences under consideration should be carefully checked for putative causes. If sequence repeats are suspected to be the cause, the new "sc" option can be used to detect such repeats, and the "xfr" option can help to resolve the resulting problems.
Issue Date
2008
Status
published
Publisher
BioMed Central Ltd
Journal
Algorithms for Molecular Biology 
ISSN
1748-7188
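
Illustration
The "one-out" comparison described in the abstract can be sketched roughly as follows. This is a minimal sketch, not the authors' implementation: it assumes each one-out tree has already been built elsewhere (for example from a CLUSTAL W or DIALIGN alignment) and reduced to its safe branches, with each branch represented as a split of the sequence names into two sets. All function names and the data layout are hypothetical. Two one-out trees are flagged as inconsistent when a safe branch of one is incompatible with a safe branch of the other on the sequences they share.

```python
from itertools import combinations


def one_out_subsets(names):
    """Yield (left_out, remaining) for every sequence name in the list."""
    for i, name in enumerate(names):
        yield name, names[:i] + names[i + 1:]


def restrict(split, taxa):
    """Restrict a split (a pair of frozensets) to the given set of taxa."""
    a, b = split
    return frozenset(a & taxa), frozenset(b & taxa)


def compatible(s1, s2):
    """Splits of one taxon set are compatible iff at least one of the
    four pairwise intersections of their sides is empty."""
    (a, b), (c, d) = s1, s2
    return any(not (x & y) for x in (a, b) for y in (c, d))


def conflicts(trees):
    """Find pairs of one-out trees whose safe branches contradict each other.

    trees maps the left-out name to (taxa, safe_splits); this layout is an
    assumption made for the sketch. Returns a list of (name1, name2) pairs.
    """
    found = []
    for (n1, (t1, s1)), (n2, (t2, s2)) in combinations(trees.items(), 2):
        shared = t1 & t2  # only taxa present in both trees can be compared
        if any(
            not compatible(restrict(p, shared), restrict(q, shared))
            for p in s1
            for q in s2
        ):
            found.append((n1, n2))
    return found
```

With 22 sequences, `one_out_subsets` yields 22 reduced sequence sets; an empty result from `conflicts` would indicate that the safe branches of all one-out trees agree on their shared sequences.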
