# ViT Imagenet1K fine tuning from Imagenet21K classification training:
# This example trains with batch_size = 32 * 8 GPUs, total 256.
# Training time on 8 x GeForce RTX A5000 is 52min / epoch.
# ViT Large : 85.64 (Final averaged model)
#
# Log and tensorboard at s3://deci-pretrained-models/vit_large_cutmix_randaug_v2_lr=0.03/
# Instructions:
# 0. Make sure that the data is stored in dataset_params.dataset_dir or add "dataset_params.data_dir=<PATH-TO-DATASET>" at the end of the command below (feel free to check ReadMe)
# 1. Move to the project root (where you will find the ReadMe and src folder)