Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Mohammed Salman 78385d1c1d
Delete preprocess.py
2 years ago
..
0dd491d218
Added the vocabulary for regressionmodeling
3 years ago
baa0fefa3c
Update finetuneRegr.py
2 years ago
52c7ca8b2d
Bugfixes necessary to make fineTuneRegr.py run
3 years ago
0dd491d218
Added the vocabulary for regressionmodeling
3 years ago
a2b4edf02e
Add Regression tasks
3 years ago

README.md

You have to be logged in to leave a comment. Sign In

Training of regression models from pre-trained MolBART models

The prop_bart_vocab.txt file is the vocabulary for regression modeling of the Chemformer. This vocabulary except from the bart_vocab.txt vocabulary contains the tokens for the 133 QSAR tasks as gene symbols, and the three physical chemistry properties.

The finetune_regression_modules.py file contains classes of the Data Helpers and Models for training the Chemformer on downstream Regression tasks. Most of these classes inherit from classes which are created in molbart implementation.

The scripts trainRegr.py, finetuneRegr.py are for training and fine-tuning respectively; the Encoder part of the Chemformer on a number of regression tasks simultaneously.

Each script can be run using python -m molbart.example_scripts.finetune_regression.<script_name> <args>.

See the ArgumentParser args in each file for more details on each argument.

Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...