Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
swarajpande4 68c03e2523
Added building instructions in README
2 years ago
ca45a9c4f7
Initial commit
2 years ago
3a6765cd7d
A lot of changes
2 years ago
362858fad7
Added the CNN Model
2 years ago
e83eed4c29
Added Support Vector Classifier with PCA20
2 years ago
362858fad7
Added the CNN Model
2 years ago
616693093c
Added the CNN Notebook
2 years ago
ca45a9c4f7
Initial commit
2 years ago
ca45a9c4f7
Initial commit
2 years ago
68c03e2523
Added building instructions in README
2 years ago
362858fad7
Added the CNN Model
2 years ago
e83eed4c29
Added Support Vector Classifier with PCA20
2 years ago
3a6765cd7d
A lot of changes
2 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

Digit Classification

An ML / MLOps classification project on MNIST - Database of Handwritten Digits implemented using convolutional neural network (ConvNet / CNN) with the help of PyTorch and DVC libraries.

DAGsHub Link (for experimentation and pipelining): Click Here!


To build from source

  1. Install Git and DVC.

  2. Clone the repository.

    git clone https://github.com/swarajpande4/Digit-Classification.git
    
    cd Digit-Classification/
    
  3. Set up virtual environment for python.

    pip install virtualenv
    
    virtualenv venv/
    
    source venv/bin/activate
    
    pip install -r requirements.txt
    
  4. Run the following command to execute the pipeline after making changes to the code/ scripts.

    dvc repro
    
  5. Deactivate the virtual environment.

    deactivate
    

Files and Directory Structure

.
├── .github/workflows
    └── cml.yaml                    // CML workflow for GitHub Actions
├── code
    ├── eval.py                     // Evaluation metrics script
    ├── featurization.py            // Featurization script
    ├── get_data.py                 // Fetches the datasets for CML container (GitHub Actions Job)
    ├── model_class.py              // Model Class script
    └── train_model.py              // Trains the model instance
├── data
    ├── model.pkl                   (DVC)
    ├── norms_params.json           (DVC)
    ├── processed_test_data.npy     (DVC)
    ├── processed_train_data.npy    (DVC)
    ├── test_data.csv               (DVC)
    ├── train_data.csv              (DVC)
    ├── train_data.csv.dvc
    └── test_data.csv.dvc        
├── metrics 
    ├── confmat.png                 // Confusion matrix displayed by GitHub Actions
    ├── eval.json                   // Evaluation metrics for pipeline 
    ├── metrics.txt                 // Evaluation metrics displayed by Github Actions                       
    └── train_metric.json           
├── notebook
    └── notebook.ipynb              // Jupyter Notebook 
├── dvc.lock
├── dvc.yaml
├── requirements.txt
└── README.md

Tip!

Press p or to see the previous file or, n or to see the next file

About

An ML / MLOps classification project on MNIST - Database of Handwritten Digits implemented using CNN with the help of PyTorch and DVC libraries.

Collaborators 1

Comments

Loading...