Skip to content

wangyongqi558/EOV-MMP-VidVRD

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

EOV-MMP-VidVRD

This repository is for the paper End-to-End Open-Vocabulary Video Visual Relationship Detection using Multi-modal Prompting (EOV-MMP). The provided link contains the datasets and models obtained from the first three training steps. The complete training and testing code will be released after the paper is officially published.

To-Do List

1. End-to-End Model Inference

Make the end to end model ready for inference (Before Oct 30, 2025. I've been extremely busy with my internship and autumn recruitment recently, so I really don't have time to organize the code. If you do have a code requirement, please send an email to [email protected].)

2. Object Detection Training

Prepare the object detection part for training.

3. Relationship Detection Training

Get the relationship detection part ready for training.

4. End-to-End Training

Prepare for end-to-end training.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages