当前位置:网站首页>Chapter9 : De Novo Molecular Design with Chemical Language Models
Chapter9 : De Novo Molecular Design with Chemical Language Models
2022-08-10 11:42:00 【UniversalNature】
reading notes of《Artificial Intelligence in Drug Design》
1.Introduction
- These molecular representations are human-made models designed to capture certain properties. The molecules possess the syntactic properties and semantic properties.
- Several factors contribute to the popularity of SMILES in the context of deep learning:
- SMILES are strings, which renders them suitable as inputs to sequence modeling algorithms.
- Compared to other string-based molecular representations such as InChI, SMILES strings have a straightforward syntax. This permissive syntax allows for a certain “flexibility of expression”.
- SMILES are easily legible and interpretable by humans.
- The tool example demonstrates how deep learning methods can be employed to generate sets of new SMILES strings, inspired by the structures of four known retinoid X receptor (RXR) modulators, using a recently developed method, the bidirectional molecule generation with alternate learning (BIMODAL). The program code is freely available here.
2.Materials
2.1.Computational Methods
- All calculations were performed using Python 3.7.4 in Jupyter Notebooks. The models rely on PyTorch and RDKit.
- After installing Anaconda and Git, we can run the code below:
git clone https://github.com/ETHmodlab/de_novo_design_RNN.git
cd <path\to\folder>
conda env crate -f environment.yml
conda activate de_novo
cd example
jupyter notebook
2.2.Data
- To emulate a realistic scenario, we provide a tool molecule library containing four RXR modulators (Fig. 2). Molecule 1 is bexarotene, a pharmacological RXR agonist. Molecules 2–4 were obtained from ChEMBL and have a potency on RXR (expressed as EC50, IC50, Ki, or Kd) below0.8μM. This set of bioactive compounds (available in the repository, under “/exam- ple/fine_tuning.csv”) will be used to generate a focused library of de novo designs.
边栏推荐
- bat脚本——提取多个文件夹到指定路径
- 孩子自律性不够?猿辅导:计划表要注意“留白”给孩子更多掌控感
- 技术人必看!数据治理是什么?它对数据中台建设重要吗?
- LeetCode 146. LRU Cache
- 配置druid数据源「建议收藏」
- 一文详解 implementation api embed
- Database management tool: dynamic read-write separation
- codevs 2370 Small room tree (LCA)
- LeetCode 109. Sorted Linked List Conversion Binary Search Tree
- LeetCode 369. Plus One Linked List(链表加1)
猜你喜欢
技术人必看!数据治理是什么?它对数据中台建设重要吗?
Can CLIP also do segmentation tasks?The University of Göttingen proposed a model CLIPSeg that uses text and image prompts to perform three segmentation tasks at the same time, draining CLIP capabiliti
IDC第一的背后,阿里云在打造怎样的一朵“视频云”?
面试美团被问到了Redis,搞懂这几个问题,让你轻松吊打面试官
16、Pytorch Lightning入门
蚂蚁金服+拼多多+抖音+天猫(技术三面)面经合集助你拿大厂offer
OPNsense安装配置Zenarmor
three.js模糊玻璃效果
APP automation testing practice based on UiAutomator2+PageObject mode
22年BATJ大厂必问面试题(复盘):JVM+微服务+多线程+锁+高并发
随机推荐
CodeForces - 628D (digital dp)
LeetCode 25. K 个一组翻转链表
Redis常用命令
再有人问你分布式事务,把这篇扔给他
人脸考勤是选择人脸比对1:1还是人脸搜索1:N?
Threshold-based filtering buffer management scheme in a shared buffer packet switch core part of the paper
托米的咒语
Accumulated and thin hair!Safety Dog has once again obtained the certification of scientific and technological achievements transformation!
基于PLECS的离网(孤岛)并联逆变器的Droop Control下垂控制仿真
LeetCode 237. 删除链表中的节点
It is rumored that Samsung 3nm has won the second customer, and the current production capacity is in short supply
16、Pytorch Lightning入门
Licking Exercise - 63 Find all anagrams in a string
Does your child lack self-discipline?Ape Counseling: Pay attention to "blank" in the schedule to give children more control
The 6th "Blue Hat Cup" National College Student Network Security Skills Competition Semi-Final Part WriteUp
Microchip launched a high-performance 77GHz millimeter-wave radar chip, and has received tens of thousands of orders before mass production
search--09
面试官:你们是如何保证接口的幂等性?
Ssm framework construction process [easy to understand]
LeetCode 61. Rotating linked list