Stability of multiple alignments and phylogenetic trees: an analysis of ABC-transporter proteins family

2008 | Zeitschriftenartikel. Eine Publikation mit Affiliation zur Georg-August-Universität Göttingen.

Spring zu: Zitieren & Links | Dokumente & Medien | Details | Versionsgeschichte

Zitiervorschlag

​Stability of multiple alignments and phylogenetic trees: an analysis of ABC-transporter proteins family​
Wagner, H.; Morgenstern, B.   & Dress, A.​ (2008) 
Algorithms for Molecular Biology3 art. 15​.​ DOI: https://doi.org/10.1186/1748-7188-3-15 

Dokumente & Medien

Wagner_Dress.pdf352.48 kBAdobe PDF

Lizenz

Published Version

Special user license Goescholar License

Details

Autor(en)
Wagner, Holger; Morgenstern, Burkhard ; Dress, Andreas
Zusammenfassung
Background: Sequence-based phylogeny reconstruction is a fundamental task in Bioinformatics. Practically all methods for phylogeny reconstruction are based on multiple alignments. The quality and stability of the underlying alignments is therefore crucial for phylogenetic analysis. Results: In this short report, we investigate alignments and alignment-based phylogenies constructed for a set of 22 ABC transporters using CLUSTAL W and DIALIGN. Comparing the 22 "one-out phylogenies" one can obtain for this sequence set, some intrinsic phylogenetic instability is observed - even if attention is restricted to branches with high bootstrapping frequencies, the so-called safe branches. We show that this instability is caused by the fact that both, CLUSTAL W as well as DIALIGN, apparently get "confused" by sequence repeats in some of the ABC-transporter. To deal with such problems, two new DIALIGN options are introduced that prove helpful in our context, the "exclude-fragment" (or "xfr") and the "self-comparison" (or "sc") option. Conclusion: "One-out strategies", known to be a useful tool for testing the stability of all sorts of data-analysis procedures, can successfully be used also in testing alignment stability. In case instabilities are observed, the sequences under consideration should be carefully checked for putative causes. In case one suspects sequence repeats to be the cause, the new "sc" option can be used to detect such repeats, and the "xfr" option can help to resolve the resulting problems.
Erscheinungsdatum
2008
Status
published
Herausgeber
Biomed Central Ltd
Zeitschrift
Algorithms for Molecular Biology 
ISSN
1748-7188

Export Metadaten

Referenzen

Zitationen


Social Media