EOV-MMP-VidVRD

This repository accompanies the paper "End-to-End Open-Vocabulary Video Visual Relationship Detection using Multi-modal Prompting" (EOV-MMP). The provided link contains the datasets and the models obtained from the first three training steps. The complete training and testing code will be released after the paper is officially published.

To-Do List

1. End-to-End Model Inference

Make the end-to-end model ready for inference (target: before Oct 30, 2025). I have been very busy with my internship and autumn recruitment recently and have not had time to organize the code. If you need the code before then, please send an email to [email protected].

2. Object Detection Training

Prepare the object detection part for training.

3. Relationship Detection Training

Get the relationship detection part ready for training.

4. End-to-End Training

Prepare for end-to-end training.
