Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Integration:  dvc git github
naisuu d39d3f00a5
Add predictor training script
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
3443680585
Update to use drugex v2 algorithm properly with updates to agent
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
51a6d390e6
Add in generator from drugex+r
2 years ago
317e9a0a05
Add all modifications along with RL loop
2 years ago
a3b5aea98e
Add DVC tracked models and update pipeline for predictors
2 years ago
3443680585
Update to use drugex v2 algorithm properly with updates to agent
2 years ago
src
d39d3f00a5
Add predictor training script
2 years ago
63c8e32e31
revamp to using stoned
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
317e9a0a05
Add all modifications along with RL loop
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
63c8e32e31
revamp to using stoned
2 years ago
d4cac7861c
Update dvc.lock
2 years ago
a3b5aea98e
Add DVC tracked models and update pipeline for predictors
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
70458fd311
Update environment yaml file for unix
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

Rxitect

License Code style: black version

Rxitect was created for the purpose of experimenting with the development and implementation of retrosynthesis engines within molecular generators for the goal of de novo drug design; the focus of my Master's thesis. The code in this repository is based on DrugEx, released by Xuhan Liu (First Author) and Gerard J.P. van Westen (Correspondent Author) on March 8th, 2021. The same license terms apply for this repository, and can be found in the LICENSE file.

Getting started

After cloning this repository, make sure you have a conda distribution installed. We recommend miniforge for licensing reasons, but anaconda/miniconda will work as well.

Installing the environment

  • conda env create -f environment.yml
  • conda activate rxitect
  • pip install -e .

Mamba

Mamba is a drop-in replacement for conda, it's much faster at resolving environments and is recommended. You can install it by running conda install -c conda-forge mamba. In case you want to use this from the start, replace conda with mamba in the instructions above.

Data Version Control (DVC)

This project uses DVC to version control large data files and trained models on DagsHub. To use DVC, run the following commands:

  • conda install -c conda-forge mamba
  • mamba install -c conda-forge dvc

Known development issues

  • black requires specific versions of typing-extensions, so you may need to run pip install typing-extensions --upgrade first.
  • If you are developing on windows, we recommend to download pytorch manually to fit your needs, e.g.: conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch.
  • If you run into issues after importing torch, try running conda install -c defaults intel-openmp -f.
  • If you are developing on a mac, you may run into issues with xgboost. To fix this, you need to have cmake installed, which can be done by running the following commands (assuming you have brew installed): brew install gcc@11, followed by brew install cmake. Note that because RA Score has a hard dependency on tensorflow-gpu to run their pretrained models, development on a mac is currently limited to just the base functionality of Rxitect (unless you have a CUDA-compatible GPU).
  • there's a recent (as of February 7, 2022) bug in rdkit where you could not import it or its modules properly if you have boost-cpp=1.74.0=h359cf19_6. If you run into this issue, update it by conda/mamba installing boost-cpp again.

Additional information

The paper that accompanies the original DrugEx code can be found here.

Tip!

Press p or to see the previous file or, n or to see the next file

About

A deep reinforcement learning-based drug molecule generator focused on generation of molecules using SELFIES to exploit the guarantee of valid molecular structures.

Collaborators 1

Comments

Loading...