首页 | 本学科首页   官方微博 | 高级检索  
   检索      


LambdaPP: Fast and accessible protein-specific phenotype predictions
Authors:Tobias Olenyi  Céline Marquet  Michael Heinzinger  Benjamin Kröger  Tiha Nikolova  Michael Bernhofer  Philip Sändig  Konstantin Schütze  Maria Littmann  Milot Mirdita  Martin Steinegger  Christian Dallago  Burkhard Rost
Institution:1. TUM (Technical University of Munich) Department of Informatics, Bioinformatics- & Computational Biology—i12, Garching, Germany;2. TUM (Technical University of Munich) Department of Informatics, Bioinformatics- & Computational Biology—i12, Garching, Germany

Contribution: Software (equal);3. TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Garching, Germany;4. School of Biological Sciences, Seoul National University, Seoul, South Korea;5. School of Biological Sciences, Seoul National University, Seoul, South Korea

Korea Artificial Intelligence Institute, Seoul National University, Seoul, South Korea

Korea Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea

Contribution: Resources (equal), Software (supporting)

Abstract:The availability of accurate and fast artificial intelligence (AI) solutions predicting aspects of proteins are revolutionizing experimental and computational molecular biology. The webserver LambdaPP aspires to supersede PredictProtein, the first internet server making AI protein predictions available in 1992. Given a protein sequence as input, LambdaPP provides easily accessible visualizations of protein 3D structure, along with predictions at the protein level (GeneOntology, subcellular location), and the residue level (binding to metal ions, small molecules, and nucleotides; conservation; intrinsic disorder; secondary structure; alpha-helical and beta-barrel transmembrane segments; signal-peptides; variant effect) in seconds. The structure prediction provided by LambdaPP—leveraging ColabFold and computed in minutes—is based on MMseqs2 multiple sequence alignments. All other feature prediction methods are based on the pLM ProtT5. Queried by a protein sequence, LambdaPP computes protein and residue predictions almost instantly for various phenotypes, including 3D structure and aspects of protein function. LambdaPP is freely available for everyone to use under embed.predictprotein.org , the interactive results for the case study can be found under https://embed.predictprotein.org/o/Q9NZC2 . The frontend of LambdaPP can be found on GitHub ( github.com/sacdallago/embed.predictprotein.org ), and can be freely used and distributed under the academic free use license (AFL-2). For high-throughput applications, all methods can be executed locally via the bio-embeddings ( bioembeddings.com ) python package, or docker image at ghcr.io/bioembeddings/bio_embeddings , which also includes the backend of LambdaPP.
Keywords:artificial intelligence  protein annotation  protein function prediction  protein language models  protein structure prediction  web server
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号