1 Branches

parameters

08d9bc3c35

initial commit

1 year ago

.gitignore

08d9bc3c35

initial commit

1 year ago

Dockerfile

59eb516145

Move the DagsHub login to the training script, so the docker image doesn't need to have the token built into it

1 year ago

Dockerfile.mac

59eb516145

Move the DagsHub login to the training script, so the docker image doesn't need to have the token built into it

1 year ago

README.md

bf03d3d951

Update README.md

1 year ago

data.py

08d9bc3c35

initial commit

1 year ago

helper.py

da17d099d4

Add MLflow logging

1 year ago

loss.py

8fb75009dd

Add function to calculate the average loss of the unets

1 year ago

requirements.txt

c0b7b67b2d

Get latest version of dagshub

1 year ago

train.py

fbb9afd6eb

fix a bug where the path wasn't being sent to `log_artifact`, but rather a function

1 year ago

DagsHub Storage

You have to be logged in to leave a comment.

Minimagen training

This repo contains the files needed to build a Docker image, which can train Minimagen on LAION-Aesthetics V2.

It does so by using the Direct Data Access (DDA) to stream the large dataset to the Docker container at runtime.

This allows you to:

Avoid having to manually copy data to a training server, and
Allow you to train on a subset of the data, by letting you programmatically decide which data points to use

The training script will log training parameters, metrics, and artifacts to an MLflow server connected to a DagsHub repo of your choice.

Build the Docker image

To build the Docker image as is, run:

docker build -t minimagen .

Start training

To start training, spin up a Docker container and pass in the appropriate environment variable and training parameters:

docker run --rm --name minimagen-test \
        --gpus all \
        -e DAGSHUB_TOKEN=<token> \
        -e DAGSHUB_USER_NAME=<username> \
        -e DAGSHUB_REPO_NAME=<repo_to_write_to> \
        minimagen:latest --BATCH_SIZE 2 --TIMESTEPS 25 --TESTING

Here:

DAGSHUB_TOKEN is your access token to DagsHub (generated here)
DAGSHUB_USER_NAME is the user name for the DagsHub account you want to use
DAGSHUB_REPO_NAME is the repo name to send MLflow training parameters, metrics, and artifacts to
BATCH_SIZE is the batch size to run with
TIMESTEPS is the number of steps to train on
TESTING limits the number of images to train/validate on to 16 (to test the pipeline)

Tip!

Press p or to see the previous file or, n or to see the next file

README.md

Minimagen training

Build the Docker image

Start training

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

yonomitt / Minimagen

README.md

Minimagen training

Build the Docker image

Start training

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

yonomitt
/
Minimagen