confused about the train loss、size_average and the performance.

Hi, @hirotomusiker.
I come here again. As the title said, I am confused about the train loss、size_average and the performance. I have train the original darknet repo and this repo on my own dataset (3 classes). And I want to share the results here.
The params are same: MAXITER: 6000, STEPS: (4800, 5400), IMGSIZE: 608 (both for train and test).
With darknet, I gain the mAP@0.5 as 79.0, and the final loss was 0.76 (avg).
![image](https://user-images.githubusercontent.com/38886481/75874319-6a2bc400-5e4c-11ea-85e6-c56ca37f9fd6.png)
With this repo, the mAP@0.5 was 76.9, and the final loss was 4.7 (total).
![image](https://user-images.githubusercontent.com/38886481/75874382-8891bf80-5e4c-11ea-9365-435f1378128e.png)
It seens that with this repo, the loss is harder to converge. So I changed the params for this repo (MAXITER: 8000, STEPS: (6400, 7200)), and gain the mAP@0.5 as 78.3, and the final loss was 8.2 (total).
![image](https://user-images.githubusercontent.com/38886481/75874430-9e9f8000-5e4c-11ea-8a68-a0cd5b3862c4.png)
![image](https://user-images.githubusercontent.com/38886481/75874443-a65f2480-5e4c-11ea-9104-0dc13fe6549d.png)
So I have some questions.
1. the performance seens different, may be caused by the shuffle of the dataset?
2. the loss of this repo is larger and harder to converge compared to the darknet. What's the reason?
3. in [#44](https://github.com/DeNA/PyTorch_YOLOv3/issues/44#issuecomment-557457023), you haved talked about the param ```size_average``` and said that the loss of darknet is also high?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

confused about the train loss、size_average and the performance. #58

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

confused about the train loss、size_average and the performance. #58

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions