Source Code Classification

nl2ml_notebook_parser.py - script that parses Kaggle notebooks and converts them to JSON/CSV/Pandas.
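As a rough illustration of this kind of parsing (not the actual contents of nl2ml_notebook_parser.py), the sketch below pulls code cells out of a notebook in the standard `.ipynb` JSON layout and serializes them to CSV. The function names `extract_code_cells` and `rows_to_csv`, the column names, and the toy notebook are all assumptions made for this example.

```python
import csv
import io
import json


def extract_code_cells(notebook_json):
    """Return one dict per code cell of a notebook given as a JSON string."""
    notebook = json.loads(notebook_json)
    rows = []
    for index, cell in enumerate(notebook.get("cells", [])):
        if cell.get("cell_type") != "code":
            continue  # skip markdown and raw cells
        source = cell.get("source", "")
        # In .ipynb files "source" may be a single string or a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        rows.append({"cell_index": index, "source": source})
    return rows


def rows_to_csv(rows):
    """Serialize the extracted rows to CSV text (hypothetical column names)."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["cell_index", "source"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Toy notebook invented for this example, not real Kaggle data.
example_notebook = json.dumps({
    "cells": [
        {"cell_type": "markdown", "source": ["# Load data"]},
        {"cell_type": "code",
         "source": ["import pandas as pd\n", "df = pd.read_csv('train.csv')"]},
        {"cell_type": "code", "source": "model.fit(X, y)"},
    ]
})

rows = extract_code_cells(example_notebook)
print(json.dumps(rows, indent=2))  # JSON view of the code cells
print(rows_to_csv(rows))           # CSV view of the same rows
```

The same list of dicts could be passed to `pandas.DataFrame(rows)` if a DataFrame is preferred.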

NL2ML BERT Distances.ipynb - notebook with experiments on whether distances between BERT embeddings are meaningful when the input tokens are tokenized source code chunks.
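One common way to compare embeddings is cosine distance. The sketch below is a minimal pure-Python version; it is an assumption that the notebook uses this metric, and the short vectors are made-up stand-ins for real BERT embeddings.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity for two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine distance is undefined for zero vectors")
    return 1.0 - dot / (norm_u * norm_v)


# Made-up 4-dimensional vectors standing in for embeddings of code chunks.
chunk_a = [0.2, 0.1, 0.9, 0.4]
chunk_b = [0.2, 0.1, 0.8, 0.5]
chunk_c = [0.9, 0.7, 0.1, 0.0]

print(cosine_distance(chunk_a, chunk_b))  # similar direction, small distance
print(cosine_distance(chunk_a, chunk_c))  # different direction, larger distance
```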

NL2ML_BERT_Classifier.ipynb - notebook with preprocessing and training pipeline.

NL2ML_Regex_Labeling + LogReg.ipynb - notebook that labels data with regular expressions and builds a logistic regression (logreg) model.
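To give a feel for regex-based labelling of code chunks, here is a minimal sketch. The label names and patterns in `LABEL_PATTERNS` are hypothetical and chosen only for illustration; the notebook's actual label set and rules may differ.

```python
import re

# Hypothetical label -> pattern mapping, invented for this example.
LABEL_PATTERNS = {
    "data_loading": re.compile(r"\bread_csv\s*\(|\bopen\s*\("),
    "model_training": re.compile(r"\.fit\s*\("),
    "visualization": re.compile(r"\bplt\.\w+|\.plot\s*\("),
}


def label_chunk(code):
    """Return every label whose pattern matches the code chunk."""
    return [label for label, pattern in LABEL_PATTERNS.items()
            if pattern.search(code)]


chunks = [
    "df = pd.read_csv('train.csv')",
    "model.fit(X_train, y_train)",
    "plt.hist(df['age'])",
    "x = 1",
]

for chunk in chunks:
    print(label_chunk(chunk), "<-", chunk)
```

Labels produced this way could then serve as training targets for a logistic regression classifier; that step is not shown here.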
