3 Branches

.dvc

2f0a4b49a1

DVC Init

3 years ago

.ipynb_checkpoints

82b810b974

last checkpoints added

3 years ago

code2vec

97c5b91e60

code2vec folder added

3 years ago

data

graph

78e469d2ff

Logreg trained on the new regex (graph_v2)

3 years ago

models

.gitattributes

bdb584c714

Get rid of csv in Git LFS

3 years ago

.gitignore

b1fcb7eafc

gitignore updated

3 years ago

Comments vs commented code.ipynb

a4b1957697

in-code comments classification added

3 years ago

README.md

a1740b38d7

Update README.md

3 years ago

bert_classifier.ipynb

45a0bcfcdb

changed names

3 years ago

bert_distances.ipynb

45a0bcfcdb

changed names

3 years ago

data.dvc

c11ff2b3b2

Kaggle datasets has been updated from the latest parsing

3 years ago

kaggle.sh

fc9e9f9c28

ramazyant files added

3 years ago

kaggle_parser.ipynb

fc9e9f9c28

ramazyant files added

3 years ago

logreg_classifier.ipynb

78e469d2ff

Logreg trained on the new regex (graph_v2)

3 years ago

metrics.csv

10d7a0e2a4

Logreg trained on the new regex (graph_v2)

3 years ago

models.dvc

37eaaa3400

models added

3 years ago

nl2ml_notebook_parser.py

8ba024a8c5

no message

4 years ago

params.yml

10d7a0e2a4

Logreg trained on the new regex (graph_v2)

3 years ago

predict_tag.ipynb

ca45532636

bugs fixed; variables names changed;

3 years ago

regex.ipynb

78e469d2ff

Logreg trained on the new regex (graph_v2)

3 years ago

svm_classifier.ipynb

65746108fc

pipelines optimized

3 years ago

svm_train.py

440cab265f

TFIDF_DIR fixed

3 years ago

DagsHub Storage

Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

You have to be logged in to leave a comment.

Source Code Classification

This is an old repo of NL2ML-project of the Laboratory of Big Data Analysis of Higher School of Economics (HSE LAMBDA).

The project page - https://www.notion.so/NL2ML-Corpus-1ed964c08eb049b383c73b9728c3a231

The repo is currently migrating to the HSE LAMBDA GitLab - https://gitlab.com/lambda-hse/nl2ml

Project Goals:

The current short-term goal is to build a model that will be able to classify a source code chunk and to specify where the detected class is exactly in the chunk (tag segmentation).

The global goal is to build a model that will be able to generate code using a text of the task in english.

README.md

Source Code Classification

Project Goals:

Contents:

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

levin / source_code_classification mirror of https://github.com/whatevernevermindbro/source_code_classification

README.md

Source Code Classification

Project Goals:

Contents:

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

levin
/
source_code_classification
mirror of https://github.com/whatevernevermindbro/source_code_classification