This repository was archived by the owner on Jun 15, 2022. It is now read-only.

confused about the train loss, size_average and the performance. #58

@chengcchn

Hi, @hirotomusiker.
I'm back again. As the title says, I am confused about the training loss, size_average, and the performance. I have trained both the original darknet repo and this repo on my own dataset (3 classes), and I want to share the results here.
The parameters were the same for both: MAXITER: 6000, STEPS: (4800, 5400), IMGSIZE: 608 (for both training and testing).
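For readers unfamiliar with these settings, here is a minimal sketch of how a step schedule of this kind is commonly interpreted: the learning rate is multiplied by a fixed factor each time an iteration listed in STEPS is reached. The base learning rate of 0.001 and the decay factor of 0.1 are assumptions for illustration only and are not stated in this issue; `lr_at` is a hypothetical helper, not a function from either repo.

```python
def lr_at(iteration, base_lr, steps, gamma=0.1):
    """Step decay: scale base_lr by gamma once for every milestone already reached."""
    passed = sum(1 for s in steps if iteration >= s)
    return base_lr * gamma ** passed


BASE_LR = 0.001          # assumed value, not given in the issue
STEPS = (4800, 5400)     # milestones quoted in the issue
MAXITER = 6000           # iteration budget quoted in the issue

# Print the learning rate just before and after each milestone.
for it in (0, 4799, 4800, 5399, 5400, MAXITER - 1):
    print(f"iter {it:5d}: lr = {lr_at(it, BASE_LR, STEPS):.6f}")
```

Under these assumptions, the milestones sit at 80% and 90% of MAXITER, so most of the training runs at the base learning rate.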
With darknet, I got an mAP@0.5 of 79.0, and the final loss was 0.76 (avg).
image
With this repo, the mAP@0.5 was 76.9, and the final loss was 4.7 (total).
image
It seems that the loss is harder to converge with this repo, so I changed its parameters (MAXITER: 8000, STEPS: (6400, 7200)) and got an mAP@0.5 of 78.3, with a final loss of 8.2 (total).
image
image
So I have some questions.

  1. The performance seems different. Could this be caused by the shuffling of the dataset?
  2. The loss of this repo is larger and harder to converge than darknet's. What is the reason?
  3. In #44, you talked about the size_average parameter and said that the loss of darknet is also high?
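
As background for questions 2 and 3, here is a minimal sketch of why a loss reported as "total" and a loss reported as "avg" are not directly comparable. In PyTorch, the deprecated `size_average` argument of the loss functions selects between averaging (`True`, now `reduction='mean'`) and summing (`False`, now `reduction='sum'`). The sketch below mimics those two reductions in plain Python. The predictions and targets are made-up numbers, and `reduce_loss` is a hypothetical helper; nothing here is taken from either repo, and it does not show that this effect explains the gap between the numbers above.

```python
def squared_errors(preds, targets):
    """Per-element squared error, before any reduction is applied."""
    return [(p - t) ** 2 for p, t in zip(preds, targets)]


def reduce_loss(errors, size_average):
    """Average the errors if size_average is True, otherwise sum them."""
    total = sum(errors)
    return total / len(errors) if size_average else total


# Made-up values for illustration only.
preds = [0.9, 0.2, 0.4, 0.7]
targets = [1.0, 0.0, 0.5, 1.0]
errors = squared_errors(preds, targets)

loss_sum = reduce_loss(errors, size_average=False)
loss_mean = reduce_loss(errors, size_average=True)

print(f"summed loss:   {loss_sum:.4f}")
print(f"averaged loss: {loss_mean:.4f}")
print(f"ratio:         {loss_sum / loss_mean:.1f}")
```

The same per-element errors give a summed loss that is larger than the averaged one by a factor equal to the number of elements, so the absolute value of a loss depends on how it is reduced as well as on how well the model fits.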
