当前位置:网站首页>Chapter9 : De Novo Molecular Design with Chemical Language Models
Chapter9 : De Novo Molecular Design with Chemical Language Models
2022-08-10 11:42:00 【UniversalNature】
reading notes of《Artificial Intelligence in Drug Design》
1.Introduction

- These molecular representations are human-made models designed to capture certain properties. The molecules possess the syntactic properties and semantic properties.
- Several factors contribute to the popularity of SMILES in the context of deep learning:
- SMILES are strings, which renders them suitable as inputs to sequence modeling algorithms.
- Compared to other string-based molecular representations such as InChI, SMILES strings have a straightforward syntax. This permissive syntax allows for a certain “flexibility of expression”.
- SMILES are easily legible and interpretable by humans.
- The tool example demonstrates how deep learning methods can be employed to generate sets of new SMILES strings, inspired by the structures of four known retinoid X receptor (RXR) modulators, using a recently developed method, the bidirectional molecule generation with alternate learning (BIMODAL). The program code is freely available here.
2.Materials
2.1.Computational Methods
- All calculations were performed using Python 3.7.4 in Jupyter Notebooks. The models rely on PyTorch and RDKit.
- After installing Anaconda and Git, we can run the code below:
git clone https://github.com/ETHmodlab/de_novo_design_RNN.git
cd <path\to\folder>
conda env crate -f environment.yml
conda activate de_novo
cd example
jupyter notebook
2.2.Data
- To emulate a realistic scenario, we provide a tool molecule library containing four RXR modulators (Fig. 2). Molecule 1 is bexarotene, a pharmacological RXR agonist. Molecules 2–4 were obtained from ChEMBL and have a potency on RXR (expressed as EC50, IC50, Ki, or Kd) below0.8μM. This set of bioactive compounds (available in the repository, under “/exam- ple/fine_tuning.csv”) will be used to generate a focused library of de novo designs.

边栏推荐
- The author of open source also has a life problem
- 三星计划2023年开始在越南生产半导体零部件
- 嘉为蓝鲸荣获工信部“数字技术融合创新应用解决方案”
- 吃透Chisel语言.36.Chisel实战之以FIFO为例(一)——FIFO Buffer和Bubble FIFO的Chisel实现
- 厚积薄发!安全狗再次获得科技成果转化认证!
- IDC第一的背后,阿里云在打造怎样的一朵“视频云”?
- SMIC CIM localization project suspended?Rising software: not shut down, changed to remote development!
- 态路小课堂丨如何为CXP光模块选择光纤跳线?
- 托米的咒语
- 中芯CIM国产化项目暂停?上扬软件:未停摆,改为远程开发!
猜你喜欢

技术人必看!数据治理是什么?它对数据中台建设重要吗?

You have a Doubaqiong thesaurus, please check it

APP automation testing practice based on UiAutomator2+PageObject mode

基于PLECS的离网(孤岛)并联逆变器的Droop Control下垂控制仿真

时间序列的数据分析(五):简单预测法

你有一份斗破苍穹词库,请查收

16. Getting Started with Pytorch Lightning

Go 事,Gopher 要学的数字类型,变量,常量,运算符 ,第2篇

一文详解 implementation api embed

Nocalhost - Making development more efficient in the cloud-native era
随机推荐
Network sockets (UDP and TCP programming)
IDC第一的背后,阿里云在打造怎样的一朵“视频云”?
LeetCode 362. Design Hit Counter
面试美团被问到了Redis,搞懂这几个问题,让你轻松吊打面试官
常量及数据类型你还记得多少?
LeetCode 109. 有序链表转换二叉搜索树
配置swagger
Network Fundamentals (Section 1)
网络套接字(UDP和TCP编程)
LeetCode 237. Delete a node in a linked list
Can CLIP also do segmentation tasks?The University of Göttingen proposed a model CLIPSeg that uses text and image prompts to perform three segmentation tasks at the same time, draining CLIP capabiliti
HDU 4135: Co-prime (the principle of inclusion and exclusion)
What are some useful performance testing tools recommended? Performance testing report charging standards
阿里架构师整理一份企业级SSM架构实战文档,让你熟悉底层原理
Centos7 environment uses Mysql offline installation package to install Mysql5.7
不止跑路,拯救误操作rm -rf /*的小伙儿
Excel function formulas - HLOOKUP function
SMIC CIM localization project suspended?Rising software: not shut down, changed to remote development!
Analysis of the implementation principle of UUID from the perspective of source code
LeetCode 25. A set of K flipped linked lists