MOSE: Complex Video Object Segmentation Dataset

Quick Links

🔥 MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

If you want to test your VOS model's performance in real-world complex scenarios, MOSEv2 is the right choice. Here are some cases from MOSEv2.

MOSEv1: A New Dataset for Video Object Segmentation in Complex Scenes

News

[2025/08/07] MOSEv2 dataset has been released! 🔥🎉🚀✨🎊🌟💫🎈
[2023/02/09] MOSEv1 dataset has been released!

Download

MOSEv2 Dataset

🤗 Hugging Face
☁️ Baidu Pan (pwd: p2m6)
☁️ Google Drive
☁️ OneDrive

MOSEv1 Dataset

🤗 Hugging Face
☁️ OneDrive
☁️ Google Drive
☁️ Baidu Pan (pwd: MOSE)

File Structure

The dataset follows a similar structure as DAVIS and Youtube-VOS. The dataset consists of two parts: JPEGImages which holds the frame images, and Annotations which contains the corresponding segmentation masks. The frame images are numbered using five-digit numbers. Annotations are saved in color-pattlate mode PNGs like DAVIS.

Please note that while annotations for all frames in the training set are provided, annotations for the validation set will only include the first frame.

<train/valid.tar>
│
├── Annotations
│ │ 
│ ├── <video_name_1>
│ │ ├── 00000.png
│ │ ├── 00001.png
│ │ └── ...
│ │ 
│ ├── <video_name_2>
│ │ ├── 00000.png
│ │ ├── 00001.png
│ │ └── ...
│ │ 
│ ├── <video_name_...>
│ 
└── JPEGImages
  │ 
  ├── <video_name_1>
  │ ├── 00000.jpg
  │ ├── 00001.jpg
  │ └── ...
  │ 
  ├── <video_name_2>
  │ ├── 00000.jpg
  │ ├── 00001.jpg
  │ └── ...
  │ 
  └── <video_name_...>

BibTeX

Please consider to cite MOSE if it helps your research.

@article{MOSEv2,
    title={{MOSEv2}: A More Challenging Dataset for Video Object Segmentation in Complex Scenes},
    author={Ding, Henghui and Ying, Kaining and Liu, Chang and He, Shuting and Jiang, Xudong and Jiang, Yu-Gang and Torr, Philip HS and Bai, Song},
    journal={arXiv preprint arXiv:2508.05630},
    year={2025}
}

@inproceedings{MOSE,
  title={{MOSE}: A New Dataset for Video Object Segmentation in Complex Scenes},
  author={Ding, Henghui and Liu, Chang and He, Shuting and Jiang, Xudong and Torr, Philip HS and Bai, Song},
  booktitle={ICCV},
  year={2023}
}

License

MOSE is licensed under a CC BY-NC-SA 4.0 License. The data of MOSE is released for non-commercial research purpose only.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
MOSEv1		MOSEv1
MOSEv2		MOSEv2
assets/mosev2		assets/mosev2
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MOSE: Complex Video Object Segmentation Dataset

Quick Links

News

Download

MOSEv2 Dataset

MOSEv1 Dataset

File Structure

BibTeX

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

henghuiding/MOSE-api

Folders and files

Latest commit

History

Repository files navigation

MOSE: Complex Video Object Segmentation Dataset

Quick Links

News

Download

MOSEv2 Dataset

MOSEv1 Dataset

File Structure

BibTeX

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages