Study 🤔
I did a quick study to examine the effect of varying batch size on YOLOv5 trainings. The study trained YOLOv5s on COCO for 300 epochs with --batch-size at 8 different values: [16, 20, 32, 40, 64, 80, 96, 128].
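For reference, a run matrix like this could be launched with something along the lines of the sketch below. The exact commands, weights, and run names used for the study are my assumptions rather than details from this post; the train.py flags shown (--data, --cfg, --weights, --epochs, --batch-size, --name) are the standard ones.

```python
# Hypothetical sketch of launching the 8 trainings; not the exact commands used in the study.
import subprocess

for bs in [16, 20, 32, 40, 64, 80, 96, 128]:
    subprocess.run(
        [
            "python", "train.py",
            "--data", "coco.yaml",
            "--cfg", "yolov5s.yaml",
            "--weights", "",                # assume training from scratch
            "--epochs", "300",
            "--batch-size", str(bs),
            "--name", f"yolov5s_bs{bs}",    # hypothetical run name
        ],
        check=True,
    )
```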
We've tried to make the training code batch-size agnostic, so that users get similar results at any batch size. This means users on an 11 GB 2080 Ti should be able to produce the same results as users on a 24 GB 3090 or a 40 GB A100, with smaller GPUs simply using smaller batch sizes.
We do this by scaling the loss with batch size, and also by scaling weight decay with batch size. At batch sizes below 64 we accumulate gradients over several batches before optimizing, and at batch sizes of 64 or larger we optimize after every batch.
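As a rough illustration of that logic, here is a minimal self-contained sketch of the batch-size compensation, using a dummy model and loss rather than the real detection model. The names `nbs` and `accumulate` and the 0.0005 base weight decay follow train.py conventions, but this is a simplification, not the actual training code (for example, real training applies weight decay only to selected parameter groups).

```python
# Minimal sketch of YOLOv5-style batch-size compensation, with a dummy model/loss.
import torch
import torch.nn as nn

batch_size = 16
nbs = 64                                                 # nominal batch size
accumulate = max(round(nbs / batch_size), 1)             # accumulate gradients over this many batches
weight_decay = 0.0005 * batch_size * accumulate / nbs    # weight decay scaled to the effective batch size

model = nn.Linear(10, 1)                                 # stand-in for the detection model
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, weight_decay=weight_decay)
criterion = nn.MSELoss()                                 # stand-in for the detection loss

for i in range(8):                                       # stand-in for the dataloader loop
    imgs, targets = torch.randn(batch_size, 10), torch.randn(batch_size, 1)
    loss = criterion(model(imgs), targets) * batch_size  # loss scaled by batch size
    loss.backward()
    if (i + 1) % accumulate == 0:                        # optimizer steps once per `accumulate` batches
        optimizer.step()
        optimizer.zero_grad()
```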
Results 😃
Initial results vary significantly with batch size, but final results are nearly identical (good!).
Closeup of mAP@0.5:0.95:
One oddity that stood out is val objectness loss, which did vary with batch size. I'm not sure why, as val box and val cls losses did not vary much, and neither did the 3 train losses. I don't know what this means or whether there's any cause for concern (or room for improvement).