Skip to content

Conversation

glenn-jocher
Copy link
Member

@glenn-jocher glenn-jocher commented Nov 10, 2021

πŸ› οΈ PR Summary

Made with ❀️ by Ultralytics Actions

🌟 Summary

Enhancement to duplicate label removal logic in dataset verification.

πŸ“Š Key Changes

  • Improved the method for identifying and removing duplicate labels in datasets.
  • Maintained integrity of the corresponding segmentations when duplicates are removed.

🎯 Purpose & Impact

  • πŸ› οΈ Purpose: The new code is designed to more accurately identify and remove duplicate labels, which can cause issues during model training.
  • πŸ“ˆ Impact: This change helps ensure that data used for training machine learning models is clean and error-free, potentially improving model accuracy and reliability for users.
  • 🧹 Users can expect a more streamlined data cleaning process with reduced chances of encountering problems due to duplicate annotations in their datasets.

@glenn-jocher glenn-jocher merged commit 6903251 into master Nov 10, 2021
@glenn-jocher glenn-jocher deleted the fix/segments branch November 10, 2021 23:15
BjarneKuehl pushed a commit to fhkiel-mlaip/yolov5 that referenced this pull request Aug 26, 2022
* `np.unique()` sort fix for segments

* Update datasets.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant