Data Entry: Please note that the research database will be replaced by UNIverse by the end of October 2023. Please enter your data into the system https://universe-intern.unibas.ch. Thanks

Login for users with Unibas email account...

Login for registered users without Unibas email account...

 
Uncovering new families and folds in the natural protein universe
JournalArticle (Originalarbeit in einer wissenschaftlichen Zeitschrift)
 
ID 4697907
Author(s) Durairaj, Janani; Waterhouse, Andrew M.; Mets, Toomas; Brodiazhenko, Tetiana; Abdullah, Minhal; Studer, Gabriel; Tauriello, Gerardo; Akdel, Mehmet; Andreeva, Antonina; Bateman, Alex; Tenson, Tanel; Hauryliuk, Vasili; Schwede, Torsten; Pereira, Joana
Author(s) at UniBasel Schwede, Torsten
Soares Pereira, Joana Maria
Waterhouse, Andrew
Studer, Gabriel
Durairaj, Janani
Year 2023
Title Uncovering new families and folds in the natural protein universe
Journal Nature
Volume 622
Number 7983
Pages / Article-Number 646-653
Mesh terms Protein Structure, Tertiary; Databases, Protein; Proteins, chemistry; Amino Acid Sequence; Computational Biology
Abstract We are now entering a new era in protein sequence and structure annotation, with hundreds of millions of predicted protein structures made available through the AlphaFold database; 1; . These models cover nearly all proteins that are known, including those challenging to annotate for function or putative biological role using standard homology-based approaches. In this study, we examine the extent to which the AlphaFold database has structurally illuminated this "dark matter" of the natural protein universe at high predicted accuracy. We further describe the protein diversity that these models cover as an annotated interactive sequence similarity network, accessible at https://uniprot3d.org/atlas/AFDB90v4 . By searching for novelties from sequence, structure, and semantic perspectives, we uncovered the β-flower fold, added multiple protein families to Pfam database; 2; , and experimentally demonstrate that one of these belongs to a new superfamily of translation-targeting toxin-antitoxin systems, TumE-TumA. This work underscores the value of large-scale efforts in identifying, annotating, and prioritising novel protein families. By leveraging the recent deep learning revolution in protein bioinformatics, we can now shed light into uncharted areas of the protein universe at an unprecedented scale, paving the way to innovations in life sciences and biotechnology.
Publisher Macmillan
ISSN/ISBN 0028-0836 ; 1476-4687
URL https://rdcu.be/dl34l
edoc-URL https://edoc.unibas.ch/95694/
Full Text on edoc Restricted
Digital Object Identifier DOI 10.1038/s41586-023-06622-3
PubMed ID http://www.ncbi.nlm.nih.gov/pubmed/37704037
ISI-Number MEDLINE:37704037
Document type (ISI) Journal Article
Top-publication of... Schwede, Torsten
 
   

MCSS v5.8 PRO. 0.347 sec, queries - 0.000 sec ©Universität Basel  |  Impressum   |    
01/05/2024