Protein function prediction in genomes: Critical assessment of coiled-coil predictions based on protein structure data

2019 | Preprint. Eine Publikation mit Affiliation zur Georg-August-Universität Göttingen.

Spring zu: Zitieren & Links | Dokumente & Medien | Details | Versionsgeschichte

Zitiervorschlag

​Simm, D., Hatje, K., Waack, S. & Kollmar, M. (2019). Protein function prediction in genomes: Critical assessment of coiled-coil predictions based on protein structure data.​ Unpublished manuscript. ​doi: https://doi.org/10.1101/675025 

Dokumente & Medien

Lizenz

GRO License GRO License

Details

Autor(en)
Simm, Dominic ; Hatje, Klas; Waack, Stephan ; Kollmar, Martin 
Zusammenfassung
Coiled-coil regions were among the first protein motifs described structurally and theoretically. The beauty and simplicity of the motif gives hope to detecting coiled-coil regions with reasonable accuracy and precision in any protein sequence. Here, we re-evaluated the most commonly used coiled-coil prediction tools with respect to the most comprehensive reference data set available, the entire Protein Data Base (PDB), down to each amino acid and its secondary structure. Apart from the thirtyfold difference in number of predicted coiled-coils the tools strongly vary in their predictions, across structures and within structures. The evaluation of the false discovery rate and Matthews correlation coefficient, a widely used performance metric for imbalanced data sets, suggests that the tested tools have only limited applicability for large data sets. Coiled-coil predictions strongly impact the functional characterization of proteins, are used for functional genome annotation, and should therefore be supported and validated by additional information.
Erscheinungsdatum
2019
Sprache
Englisch

Export Metadaten

Referenzen

Zitationen


Social Media