Official implementation of "Generating 3D Molecules for Target Protein Binding"

Last update: Dec 07, 2022

Generating 3D Molecules for Target Protein Binding

This is the official implementation of the GraphBP method proposed in the following paper.

Meng Liu, Youzhi Luo, Kanji Uchino, Koji Maruhashi, and Shuiwang Ji. "Generating 3D Molecules for Target Protein Binding".

Requirements

The key dependencies are listed below, with the versions we used in parentheses. Our full environment setup is available in environment.yml.

PyTorch (1.9.0)
PyTorch Geometric (1.7.2)
rdkit-pypi (2021.9.3)
biopython (1.79)
openbabel (3.3.1)
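
To compare an existing environment against the versions listed above, a small check along these lines may help. This is a sketch, not part of the repository: the distribution names (`torch`, `torch-geometric`, and so on) are assumptions about how each package is registered with pip, and conda installs may register different names.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in this README. The distribution names are assumptions
# about how each package is registered with pip; adjust for conda installs.
EXPECTED = {
    "torch": "1.9.0",
    "torch-geometric": "1.7.2",
    "rdkit-pypi": "2021.9.3",
    "biopython": "1.79",
    "openbabel": "3.3.1",
}


def check_versions(expected):
    """Return {name: (expected_version, installed_version_or_None)}."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None  # not installed, or registered under another name
        report[name] = (wanted, installed)
    return report


if __name__ == "__main__":
    for name, (wanted, installed) in check_versions(EXPECTED).items():
        # startswith tolerates local build suffixes such as "+cu111".
        ok = installed is not None and installed.startswith(wanted)
        status = "ok" if ok else "check"
        print(f"{name}: expected {wanted}, found {installed} [{status}]")
```

A package reported as `None` is either missing or installed under a different distribution name, so it is worth checking by hand before reinstalling.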

Preparing Data

Download and extract the CrossDocked2020 dataset:

wget https://bits.csb.pitt.edu/files/crossdock2020/CrossDocked2020_v1.1.tgz -P data/crossdock2020/
tar -C data/crossdock2020/ -xzf data/crossdock2020/CrossDocked2020_v1.1.tgz
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_train0_fixed.types -P data/crossdock2020/
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_test0_fixed.types -P data/crossdock2020/

Note: (1) Extracting the archive can take a long time; it is much faster on an SSD. (2) Several samples in the training set cannot be processed by our code. We therefore recommend replacing the it2_tt_0_lowrmsd_mols_train0_fixed.types file with a new one from which these samples have been removed. The new file is available here.

Split data files:

python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_train0_fixed.types data/crossdock2020
python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_test0_fixed.types data/crossdock2020
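
Since the split commands above each take one of the downloaded .types files as input, it can be useful to confirm that both files are in place first. The helper below is a hypothetical convenience, not part of the repository; it only checks that the two file names used in this README exist and are non-empty.

```python
from pathlib import Path

# File names taken from the download commands in this README.
TYPES_FILES = (
    "it2_tt_0_lowrmsd_mols_train0_fixed.types",
    "it2_tt_0_lowrmsd_mols_test0_fixed.types",
)


def missing_types_files(data_dir):
    """Return the expected .types files that are absent or empty in data_dir."""
    root = Path(data_dir)
    missing = []
    for name in TYPES_FILES:
        path = root / name
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(name)
    return missing


if __name__ == "__main__":
    absent = missing_types_files("data/crossdock2020")
    print("all .types files present" if not absent else f"missing: {absent}")
```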

Run

Train GraphBP from scratch:

CUDA_VISIBLE_DEVICES=${your_gpu_id} python main.py

Note: GraphBP can be trained on a 48GB GPU with batchsize=16. Our trained model is available here.

Generate atoms in the 3D space with the trained model:

CUDA_VISIBLE_DEVICES=${your_gpu_id} python main_gen.py

Postprocess and then save the generated molecules:

CUDA_VISIBLE_DEVICES=${your_gpu_id} python main_eval.py
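
The three stages above (training, generation, evaluation) can also be chained from one script. The following is a minimal sketch under assumptions: `stage_env` and `run_pipeline` are hypothetical helper names, and it assumes the scripts are run from the repository root with `python` on the path. It sets `CUDA_VISIBLE_DEVICES` the same way the shell commands do and stops at the first stage that fails.

```python
import os
import subprocess

# Script names come from the commands in this README, in the order given.
STAGES = ("main.py", "main_gen.py", "main_eval.py")


def stage_env(gpu_id, base_env=None):
    """Copy the environment and restrict CUDA to a single device."""
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return env


def run_pipeline(gpu_id, dry_run=True):
    """Build the stage commands; run them in order unless dry_run is set."""
    commands = [["python", script] for script in STAGES]
    if not dry_run:
        env = stage_env(gpu_id)
        for cmd in commands:
            # check=True raises CalledProcessError, so later stages are skipped.
            subprocess.run(cmd, env=env, check=True)
    return commands
```

Calling `run_pipeline(0)` only returns the commands it would run; pass `dry_run=False` to execute them on GPU 0.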

Reference

@article{liu2022graphbp,
      title={Generating 3D Molecules for Target Protein Binding},
      author={Meng Liu and Youzhi Luo and Kanji Uchino and Koji Maruhashi and Shuiwang Ji},
      journal={arXiv preprint arXiv:2204.09410},
      year={2022},
}

Owner

DIVE Lab, Texas A&M University
