World's first: Molecular Heart open-sources a new AI algorithm that tackles protein side chain prediction and sequence design

Jun 05, 2023, 06:10 PM

The formation of protein structure and function depends largely on interactions between side chain atoms. Accurate protein side chain prediction (PSCP) is therefore a key step in solving both protein structure prediction and protein design. However, earlier structure prediction work focused mostly on the main chain (backbone), and side chain structure prediction has remained a difficult problem that is not fully solved.

Recently, Xu Jinbo's team at Molecular Heart introduced AttnPacker, a new deep learning architecture for PSCP that brings significant improvements in speed, memory efficiency and overall accuracy. It is described as the best-performing side chain structure prediction algorithm currently known, and as the world's first AI algorithm that can predict protein side chains and design sequences at the same time.

The paper was published in the Proceedings of the National Academy of Sciences (PNAS), and the pre-trained model, source code and inference scripts have been open-sourced on GitHub.

  • Paper link: https://www.pnas.org/doi/10.1073/pnas.2216438120#supplementary-materials
  • Open source link: https://github.com/MattMcPartlon/AttnPacker
Background

Proteins are chains of amino acids folded into three-dimensional structures, and their structure is divided into the main chain and the side chains. Differences in side chains have a large impact on protein structure and function, especially biological activity. With a clear picture of the side chain structure, scientists can determine the three-dimensional structure of a protein more accurately, analyze protein-protein interactions, and carry out rational protein design. In drug design, this helps scientists find suitable binding sites between drugs and receptors faster and more accurately, and even optimize or design binding sites as needed. In enzyme optimization, scientists can modify sequences so that multiple side chains take part in the catalytic reaction, giving more efficient and more specific catalysis.

Most current protein structure prediction algorithms focus on the main chain, and side chain structure prediction remains an open problem. Neither popular structure prediction algorithms such as AlphaFold2 nor dedicated side chain methods such as DLPacker and RosettaPacker are satisfactory in accuracy or speed, which in turn limits protein design.

Traditional methods such as RosettaPacker rely on energy optimization: they first group the possible side chain conformations into discrete rotamers, then search over the rotamers available for each amino acid to find the combination with the lowest energy. These methods differ mainly in the choice of rotamer library, energy function and energy minimization procedure, and their accuracy is limited by the search heuristics and discrete sampling they use. Deep learning methods also exist, such as DLPacker, which formulates PSCP as an image-to-image translation problem and uses a U-Net architecture. However, their prediction accuracy and speed are still not ideal.
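
To make the rotamer-search idea concrete, here is a minimal toy sketch in Python. It is not RosettaPacker's algorithm: the rotamer library, the residue names and both energy terms are invented for illustration, and the search is plain brute force over every combination.

```python
from itertools import product

# Hypothetical toy rotamer library: residue name -> candidate chi1 angles (degrees).
ROTAMERS = {
    "res1": [-60.0, 60.0, 180.0],
    "res2": [-60.0, 180.0],
    "res3": [60.0, 180.0],
}


def self_energy(angle):
    # Made-up single-residue term that favours -60 degrees.
    return abs(angle + 60.0) / 120.0


def pair_energy(a, b):
    # Made-up interaction: penalise sequence neighbours that pick the same angle.
    return 2.0 if a == b else 0.0


def total_energy(angles):
    energy = sum(self_energy(a) for a in angles)
    energy += sum(pair_energy(angles[i], angles[i + 1]) for i in range(len(angles) - 1))
    return energy


def best_packing(library):
    # Enumerate every combination of discrete rotamers and keep the lowest energy.
    names = list(library)
    best = min(product(*(library[n] for n in names)), key=total_energy)
    return dict(zip(names, best)), total_energy(best)


assignment, energy = best_packing(ROTAMERS)
print(assignment, energy)
```

Even in this toy, the number of combinations is the product of the rotamer counts per residue, which is why practical packers fall back on search heuristics, and the answer can only ever be one of the discrete angles in the library.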

Method

AttnPacker is an end-to-end deep learning method for predicting protein side chain coordinates. It models side chain interactions jointly, so the directly predicted side chain structures are more physically plausible, with fewer atomic clashes and more ideal bond lengths and angles.

Specifically, AttnPacker introduces a deep graph transformer architecture that exploits the geometric and relational aspects of PSCP. Inspired by AlphaFold2, Molecular Heart proposes position-aware triangle updates, which refine pairwise features by computing triangle attention and triangle multiplicative updates within a graph-based framework. With this approach, AttnPacker needs significantly less memory and can support a higher-capacity model. Molecular Heart also explores several SE(3)-equivariant attention mechanisms and proposes an equivariant transformer architecture for learning from 3D points.
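
As a rough illustration of what a triangle multiplicative update computes, the sketch below implements only the core sum of the AlphaFold2-style "outgoing edges" update, in plain Python with one scalar feature per residue pair. It is not AttnPacker's position-aware variant: the learned projections, gating, layer normalization and the graph-based restriction are all omitted, and the function name is hypothetical.

```python
def triangle_multiply_outgoing(pair):
    """Core of an 'outgoing edges' triangle multiplicative update.

    pair[i][j] is a single scalar feature for the residue pair (i, j).
    The new value for (i, j) combines the edges i->k and j->k over every
    third residue k, so each pair is informed by the triangles it sits in.
    """
    n = len(pair)
    return [
        [sum(pair[i][k] * pair[j][k] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


pair = [[1.0, 2.0], [3.0, 4.0]]
updated = triangle_multiply_outgoing(pair)
print(updated)
```

The dense form touches every (i, j, k) triple, so its cost grows cubically with the number of residues. Limiting k to a neighbourhood of each residue is one plausible way a graph-based formulation could reduce that cost, though the exact scheme AttnPacker uses is not described here.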

The AttnPacker workflow. Protein backbone coordinates and sequence are taken as input, and a spatial feature graph and an equivariant basis are derived from the coordinates. The feature graph is processed by an invariant graph transformer module and then passed to an equivariant TFN-Transformer, which outputs predicted side chain coordinates, a confidence score for each residue, and optionally a designed sequence. The predicted coordinates are post-processed to remove steric clashes and ensure idealized geometry.
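
The first step of this workflow, deriving spatial features from backbone coordinates, can be sketched as follows. This is an assumption-laden illustration rather than AttnPacker's feature pipeline: it uses only one made-up alpha-carbon coordinate per residue, and the helper names and the choice of a k-nearest-neighbour graph are hypothetical.

```python
import math


def pairwise_distances(coords):
    # Rotation- and translation-invariant pair feature: Euclidean distance.
    return [[math.dist(a, b) for b in coords] for a in coords]


def nearest_neighbors(dist, k):
    # For each residue, keep the indices of its k closest other residues.
    graph = []
    for i, row in enumerate(dist):
        others = sorted((j for j in range(len(row)) if j != i), key=lambda j: row[j])
        graph.append(others[:k])
    return graph


# Made-up alpha-carbon coordinates (angstroms) for four residues.
ca_coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (7.6, 3.8, 0.0)]

dist = pairwise_distances(ca_coords)
graph = nearest_neighbors(dist, k=2)
print(graph)
```

Distances do not change when the whole structure is rotated or shifted, which is why features of this kind suit the invariant part of such a model, while the coordinates themselves are handled by the equivariant part.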

Effect

In terms of prediction performance, AttnPacker improves both accuracy and efficiency on native and non-native backbone structures. It also keeps predictions physically plausible: deviations from ideal bond lengths and angles are negligible, and very few steric clashes are produced.

Molecular Heart compared AttnPacker with current state-of-the-art methods (SCWRL4, FASPR, RosettaPacker and DLPacker) on the CASP13 and CASP14 native and non-native protein backbone datasets. The results show that AttnPacker significantly outperforms traditional side chain prediction methods on the CASP13 and CASP14 native backbones, with an average reconstruction RMSD more than 18% lower than the next-best method on each test set. AttnPacker also outperforms the deep learning method DLPacker, reducing average RMSD by more than 11% while significantly improving side chain dihedral accuracy. Beyond accuracy, AttnPacker produces significantly fewer atomic clashes than the other methods.
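
For readers unfamiliar with the metric, the sketch below shows how a reconstruction RMSD and a relative reduction between two methods can be computed. The coordinates and RMSD values are invented for illustration and are not taken from the paper. No superposition step is included, on the assumption that predicted and reference side chains already share the same fixed backbone frame.

```python
import math


def rmsd(predicted, reference):
    # Root-mean-square deviation over matched atom coordinates.
    if len(predicted) != len(reference):
        raise ValueError("atom lists must have the same length")
    squared = sum(math.dist(p, r) ** 2 for p, r in zip(predicted, reference))
    return math.sqrt(squared / len(reference))


def relative_reduction(baseline, new):
    # Fraction by which `new` is lower than `baseline`.
    return (baseline - new) / baseline


# Made-up coordinates: every predicted atom is off by 1 angstrom along z.
predicted = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
reference = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(rmsd(predicted, reference))

# Made-up average RMSDs for two methods; prints the relative reduction.
print(relative_reduction(1.00, 0.82))
```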

Side chain structure prediction results of each algorithm on CASP13 and CASP14 target proteins, given the native backbone structure. Asterisks indicate average clash values lower than those of the native structures: 56.0, 5.9 and 0.4 for CASP13, and 80.4, 7.9 and 2.5 for CASP14.

On the CASP13 and CASP14 non-native backbones, AttnPacker is also significantly more accurate than the other methods and again produces significantly fewer atomic clashes.
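
One common way to count atomic clashes is to flag atom pairs that sit closer than some fraction of the sum of their van der Waals radii. The sketch below follows that general idea and is not the paper's exact protocol: the tolerance fractions and the atom list are assumptions, and a real check would also skip covalently bonded pairs. The radii are the commonly quoted Bondi values.

```python
import math
from itertools import combinations

# Bondi van der Waals radii in angstroms.
VDW_RADIUS = {"C": 1.70, "N": 1.55, "O": 1.52}


def count_clashes(atoms, fraction=1.0):
    """Count atom pairs closer than fraction * (r_a + r_b).

    atoms is a list of (element, (x, y, z)) tuples. Bonded pairs are NOT
    excluded here, which a real clash check would need to do.
    """
    clashes = 0
    for (elem_a, pos_a), (elem_b, pos_b) in combinations(atoms, 2):
        limit = fraction * (VDW_RADIUS[elem_a] + VDW_RADIUS[elem_b])
        if math.dist(pos_a, pos_b) < limit:
            clashes += 1
    return clashes


# Made-up atoms: C and O are 3.0 angstroms apart, N is far away.
atoms = [("C", (0.0, 0.0, 0.0)), ("O", (3.0, 0.0, 0.0)), ("N", (10.0, 0.0, 0.0))]
print(count_clashes(atoms, fraction=1.0))
print(count_clashes(atoms, fraction=0.9))
```

Tightening the fraction keeps only the more severe overlaps, so the clash count drops as the tolerance becomes stricter.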

Side chain structure prediction results of each algorithm on CASP13 and CASP14 target proteins, given the non-native backbone structure. Asterisks indicate average clash values lower than those of the corresponding native structures: 34.6, 2.2 and 0.5 for CASP13, and 40.0, 2.7 and 0.7 for CASP14.

AttnPacker dispenses with the discrete rotamer library and the computationally expensive conformational search and sampling steps, and instead uses the 3D geometry of the backbone directly to compute all side chain coordinates in parallel. Compared with the deep learning method DLPacker and the traditional RosettaPacker, AttnPacker is far more computationally efficient, cutting inference time by more than 100 times.

Time comparison of different PSCP methods: relative time needed to reconstruct the side chain atoms of all 83 CASP13 target proteins.

AttnPacker performs equally well in protein design. Molecular Heart trained an AttnPacker variant for sequence and side chain co-design that achieves native sequence recovery rates comparable to current state-of-the-art methods while also producing highly accurate side chain packing. Validation with Rosetta shows that structures designed by AttnPacker generally have sub-native (lower) Rosetta energies.
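
Native sequence recovery is usually understood as the fraction of positions where the designed sequence has the same amino acid as the native one. A minimal sketch under that assumption, with invented sequences and a hypothetical function name:

```python
def sequence_recovery(native, designed):
    # Fraction of aligned positions where the designed residue equals the native one.
    if len(native) != len(designed):
        raise ValueError("sequences must have the same length")
    if not native:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(native, designed) if a == b)
    return matches / len(native)


# Made-up one-letter amino acid sequences differing at a single position.
native_seq = "ACDEFG"
designed_seq = "ACDQFG"
recovery = sequence_recovery(native_seq, designed_seq)
print(recovery)
```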

ESMFold scTM and pLDDT metrics were used to compare native protein sequences with sequences generated by AttnPacker in order to evaluate generation quality, and the results show a strong correlation.

In addition to its effectiveness and efficiency, AttnPacker has a very practical advantage: it is easy to use. AttnPacker needs only a protein structure file to run. By contrast, OPUS-Rota4 (28) requires a voxel representation of the atomic environment from DLPacker, logits and secondary structure from trRosetta100, and constraint files from the output of OPUS-CM. In addition, because AttnPacker predicts side chain coordinates directly, its output is fully differentiable, which benefits downstream tasks such as optimization or protein-protein interaction prediction. "Good prediction quality, high efficiency and ease of use will help AttnPacker be widely adopted in research and industry," Professor Xu Jinbo said.

Summary

1. AttnPacker is an SE(3)-equivariant model that directly predicts sequence and side chain coordinates. It can be used for protein side chain structure prediction as well as protein sequence design, and is a pioneering work.

2. AttnPacker is more accurate than the other methods tested, far more efficient, and easy to use.
