## Use existing model

1. Set up the environment.

    ```bash
    export venv_name="venv"
    make venv name=${venv_name} env="prod"
    source ${venv_name}/bin/activate
    ```

2. Pull the latest model.

    ```bash
    dvc pull experiments
    tagifai fix-artifact-metadata
    ```

3. Run the application.

    ```bash
    make app env="dev"
    ```

You can interact with the API directly or explore via the generated documentation at http://0.0.0.0:5000/docs.
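Since the interactive docs at /docs are generated from an OpenAPI schema, one quick way to see which endpoints are available is to fetch the raw schema. The /openapi.json path below is an assumption based on FastAPI's defaults rather than something documented here:

```bash
# Fetch the OpenAPI schema backing the /docs page (path assumes FastAPI defaults).
curl http://0.0.0.0:5000/openapi.json

# Optionally list just the endpoint paths (requires jq).
curl -s http://0.0.0.0:5000/openapi.json | jq '.paths | keys'
```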

## Update model (CI/CD)

Coming soon after the CI/CD lesson, where the entire application will be retrained and deployed when we push new data (or trigger a manual reoptimization/training). The deployed model, along with performance comparisons to previously deployed versions, will be ready on a PR to push to the main branch.

## Update model (manual)

1. Set up the development environment.

    ```bash
    export venv_name="venv"
    make venv name=${venv_name} env="dev"
    source ${venv_name}/bin/activate
    ```

2. Pull versioned data.

    ```bash
    dvc pull data/tags.json
    dvc pull data/projects.json
    ```

3. Optimize using the distributions specified in tagifai.main.objective. This also writes the best model's params to config/params.json.

    ```bash
    tagifai optimize \
        --params-fp config/params.json \
        --study-name optimization \
        --num-trials 100
    ```

We'll cover how to train using compute instances on the cloud from Amazon Web Services (AWS) or Google Cloud Platform (GCP) in later lessons. In the meantime, if you don't have access to GPUs, check out the optimize.ipynb notebook for how to train on Colab and transfer the results to your local machine. We essentially run optimization, then train the best model and download and transfer its artifacts.
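If you go the Colab route, one simple way to transfer the artifacts (a sketch, not the notebook's exact steps) is to archive the experiments directory in Colab, download it, and unpack it locally:

```bash
# In the Colab notebook (illustrative; adjust the path to wherever artifacts are written):
zip -r experiments.zip experiments/

# After downloading experiments.zip through the Colab file browser, locally:
unzip experiments.zip -d .
```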

4. Train a model (and save all its artifacts) using the params from config/params.json and publish metrics to metrics/performance.json. You can view the entire run's details inside experiments/{experiment_id}/{run_id} or via the API (GET /runs/{run_id}).

    ```bash
    tagifai train-model \
        --params-fp config/params.json \
        --experiment-name best \
        --run-name model \
        --publish-metrics  # save to metrics/performance.json
    ```
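    With the API running (make app env="dev"), you can fetch the same run details over HTTP. The host and port mirror the address above; the run ID is a placeholder:

    ```bash
    # <run_id> is a placeholder; use the run ID reported by train-model.
    curl http://0.0.0.0:5000/runs/<run_id>
    ```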
5. Predict tags for an input sentence. It'll use the best model saved from train-model, but you can also specify a run-id to choose a specific model.

    ```bash
    tagifai predict-tags --text "Transfer learning with BERT"  # test with the CLI app
    make app env="dev"  # run the API and test if you want
    ```
6. View improvements. Once you're done training the best model using the current data version, best hyperparameters, etc., we can view the performance difference.

    ```bash
    tagifai diff --commit-a workspace --commit-b HEAD
    ```
7. Commit to git. This will clean and update versioned assets (data, experiments), run tests, apply styling, etc.

    ```bash
    git add .
    git commit -m ""
    # pre-commit hooks (dvc, tests, style, clean, etc.) will execute
    git push origin main
    ```
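If you want to run the hooks without committing (assuming the project uses the pre-commit framework, which the hook behavior above suggests), you can invoke them directly:

```bash
# Run all configured pre-commit hooks against the whole repo
# (assumes pre-commit is installed and configured via .pre-commit-config.yaml).
pre-commit run --all-files
```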