Mining folded proteomes in the era of accurate structure prediction |
| |
Authors: | Charles Bayly-Jones James C. Whisstock |
| |
Affiliation: | 1. Department of Biochemistry and Molecular Biology, Monash University, Clayton, Australia;2. Biomedicine Discovery Institute, Faculty of Medicine, Nursing and Health Sciences, Monash University, Clayton, Australia; Hebrew University of Jerusalem, ISRAEL |
| |
Abstract: | Protein structure fundamentally underpins the function and processes of numerous biological systems. Fold recognition algorithms offer a sensitive and robust tool to detect structural, and thereby functional, similarities between distantly related homologs. In the era of accurate structure prediction owing to advances in machine learning techniques and a wealth of experimentally determined structures, previously curated sequence databases have become a rich source of biological information. Here, we use bioinformatic fold recognition algorithms to scan the entire AlphaFold structure database to identify novel protein family members, infer function and group predicted protein structures. As an example of the utility of this approach, we identify novel, previously unknown members of various pore-forming protein families, including MACPFs, GSDMs and aerolysin-like proteins. |
| |
Keywords: | |
|
|