Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Task:  text-to-image generation Data Domain:  nlp computer vision Framework:  pytorch Integration:  git
08d9bc3c35
initial commit
1 year ago
08d9bc3c35
initial commit
1 year ago
59eb516145
Move the DagsHub login to the training script, so the docker image doesn't need to have the token built into it
1 year ago
59eb516145
Move the DagsHub login to the training script, so the docker image doesn't need to have the token built into it
1 year ago
bf03d3d951
Update README.md
1 year ago
08d9bc3c35
initial commit
1 year ago
da17d099d4
Add MLflow logging
1 year ago
8fb75009dd
Add function to calculate the average loss of the unets
1 year ago
c0b7b67b2d
Get latest version of dagshub
1 year ago
fbb9afd6eb
fix a bug where the path wasn't being sent to `log_artifact`, but rather a function
1 year ago
Storage Buckets

README.md

You have to be logged in to leave a comment. Sign In

Minimagen training

This repo contains the files needed to build a Docker image, which can train Minimagen on LAION-Aesthetics V2.

It does so by using the Direct Data Access (DDA) to stream the large dataset to the Docker container at runtime.

This allows you to:

  • Avoid having to manually copy data to a training server, and
  • Allow you to train on a subset of the data, by letting you programmatically decide which data points to use

The training script will log training parameters, metrics, and artifacts to an MLflow server connected to a DagsHub repo of your choice.

Build the Docker image

To build the Docker image as is, run:

docker build -t minimagen .

Start training

To start training, spin up a Docker container and pass in the appropriate environment variable and training parameters:

docker run --rm --name minimagen-test \
        --gpus all \
        -e DAGSHUB_TOKEN=<token> \
        -e DAGSHUB_USER_NAME=<username> \
        -e DAGSHUB_REPO_NAME=<repo_to_write_to> \
        minimagen:latest --BATCH_SIZE 2 --TIMESTEPS 25 --TESTING

Here:

  • DAGSHUB_TOKEN is your access token to DagsHub (generated here)
  • DAGSHUB_USER_NAME is the user name for the DagsHub account you want to use
  • DAGSHUB_REPO_NAME is the repo name to send MLflow training parameters, metrics, and artifacts to
  • BATCH_SIZE is the batch size to run with
  • TIMESTEPS is the number of steps to train on
  • TESTING limits the number of images to train/validate on to 16 (to test the pipeline)
Tip!

Press p or to see the previous file or, n or to see the next file

About

A repository that can generate a Docker image to train Minimagen

https://dagshub.com/blog/use-kubeflow-with-dagshub/
Collaborators 1

Comments

Loading...