Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Integration:  dvc git github
naisuu b858c2bcc5
Fix overwriting chembl features file
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
53e9f21b73
Rollback dvc dependency on yml file, add perm. to git.
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
51a6d390e6
Add in generator from drugex+r
2 years ago
9a2df1adcc
Update dvc pipeline
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
src
b858c2bcc5
Fix overwriting chembl features file
2 years ago
63c8e32e31
revamp to using stoned
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
63c8e32e31
revamp to using stoned
2 years ago
6f1068041c
Add dvc.lock with updated smiles hashes
2 years ago
53e9f21b73
Rollback dvc dependency on yml file, add perm. to git.
2 years ago
3fc1694deb
Big functionality update, add dvc, lightning etc.
2 years ago
70458fd311
Update environment yaml file for unix
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
30c42a7f9d
Setup skeleton of entire project
2 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

Rxitect

License Code style: black version

Rxitect was created for the purpose of experimenting with the development and implementation of retrosynthesis engines within molecular generators for the goal of de novo drug design; the focus of my Master's thesis. The code in this repository is based on DrugEx, released by Xuhan Liu (First Author) and Gerard J.P. van Westen (Correspondent Author) on March 8th, 2021. The same license terms apply for this repository, and can be found in the LICENSE file.

Getting started

After cloning this repository, make sure you have a conda distribution installed. We recommend miniforge for licensing reasons, but anaconda/miniconda will work as well.

Installing the environment

  • conda env create -f environment.yml
  • conda activate rxitect
  • pip install -e .

Mamba

Mamba is a drop-in replacement for conda, it's much faster at resolving environments and is recommended. You can install it by running conda install -c conda-forge mamba. In case you want to use this from the start, replace conda with mamba in the instructions above.

Data Version Control (DVC)

This project uses DVC to version control large data files and trained models on DagsHub. To use DVC, run the following commands:

  • conda install -c conda-forge mamba
  • mamba install -c conda-forge dvc

Known development issues

  • black requires specific versions of typing-extensions, so you may need to run pip install typing-extensions --upgrade first.
  • If you are developing on windows, we recommend to download pytorch manually to fit your needs, e.g.: conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch.
  • If you run into issues after importing torch, try running conda install -c defaults intel-openmp -f.
  • If you are developing on a mac, you may run into issues with xgboost. To fix this, you need to have cmake installed, which can be done by running the following commands (assuming you have brew installed): brew install gcc@11, followed by brew install cmake. Note that because RA Score has a hard dependency on tensorflow-gpu to run their pretrained models, development on a mac is currently limited to just the base functionality of Rxitect (unless you have a CUDA-compatible GPU).
  • there's a recent (as of February 7, 2022) bug in rdkit where you could not import it or its modules properly if you have boost-cpp=1.74.0=h359cf19_6. If you run into this issue, update it by conda/mamba installing boost-cpp again.

Additional information

The paper that accompanies the original DrugEx code can be found here.

Tip!

Press p or to see the previous file or, n or to see the next file

About

A deep reinforcement learning-based drug molecule generator focused on generation of molecules using SELFIES to exploit the guarantee of valid molecular structures.

Collaborators 1

Comments

Loading...