# This example trains with batch_size = 64 * 4 GPUs, total 256.
# Training time on 4 X GeForce RTX 3090 Ti is 10min / epoch, total time ~ 20h 22m (DistributedDataParallel).
# Reach => 72.05 Top1 accuracy.
#
# Log and tensorboard at s3://deci-pretrained-models/repvggg-a0-imagenet-tensorboard/
#
# Instructions:
# 0. Make sure that the data is stored in dataset_params.dataset_dir or add "dataset_params.data_dir=<PATH-TO-DATASET>" at the end of the command below (feel free to check ReadMe)
# 1. Move to the project root (where you will find the ReadMe and src folder)