RemDet

Official PyTorch implementation of "RemDet: Rethinking Efficient Model Design for UAV Object Detection" [AAAI 2025]

Abstract

Object detection in Unmanned Aerial Vehicle (UAV) images has emerged as a focal area of research, which presents two significant challenges: i) objects are typically small and dense within vast images; ii) computational resource constraints render most models unsuitable for real-time deployment. Current real-time object detectors are not optimized for UAV images, and complex methods designed for small object detection often lack real-time capabilities. To address these challenges, we propose a novel detector, RemDet (Reparameter efficient multiplication Detector). Our contributions are as follows: 1) Rethinking the challenges of existing detectors for small and dense UAV images, and proposing information loss as a design guideline for efficient models. 2) We introduce the ChannelC2f module to enhance small object detection performance, demonstrating that high-dimensional representations can effectively mitigate information loss. 3) We design the GatedFFN module to provide not only strong performance but also low latency, effectively addressing the challenges of real-time detection. Our research reveals that GatedFFN, through the use of multiplication, is more cost-effective than feed-forward networks for high-dimensional representation. 4) We propose the CED module, which combines the advantages of ViT and CNN downsampling to effectively reduce information loss. It specifically enhances context information for small and dense objects. Extensive experiments on large UAV datasets, Visdrone and UAVDT, validate the real-time efficiency and superior performance of our methods. On the challenging UAV dataset VisDrone, our methods not only provided state-of-the-art results, improving detection by more than 3.4, but also achieve 110 FPS on a single 4090.

Introduction

In this paper, we propose a simple yet efficient model.

The overview of the proposed RemDet.

Comparison of the core innovations of the proposed RemDet:

Main results

Object Detection Performance for VisDrone2019:

Model	AP	AP₅₀	AP₇₅	AP_S	AP_M	AP_L	#Params	FLOPs	Log
RemDet-Tiny	21.8	37.1	21.9	12.7	33.0	44.5	3.2M	4.6G	log & pth
RemDet-S	24.7	41.5	25.0	15.4	36.7	47.0	11.9M	16.0G	log & pth
RemDet-M	27.3	44.7	28.2	17.3	40.5	57.8	23.3M	34.4G	log & pth
RemDet-L	29.3	47.4	30.3	18.7	43.4	55.8	35.3M	66.7G	log & pth
RemDet-X	29.9	48.3	31.0	19.5	44.1	58.6	74.1M	112G	log & pth

Object Detection Performance for COCO2017:

Model	AP	AP₅₀	AP₇₅	AP_S	AP_M	AP_L	Log
RemDet-Tiny	39.5	55.8	42.8	21.0	43.9	54.0	log
RemDet-S	45.5	62.8	49.6	27.8	50.5	60.0	log
RemDet-M	49.8	66.9	54.0	32.8	54.7	65.0	log

Object Detection

Environments

conda create -n remdet -y python=3.11
pip3 install -y pytorch==2.2.0 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121  # cu121
pip install -r ./requirements.txt &&
pip install albumentations==1.4.4 timm  &&
pip install -U openmim &&
mim install mmengine &&
mim install mmcv==2.2.0 &&
pip install -v -e .

Prepare VisDrone2019 Dataset

Download and extract VisDrone2019 dataset in the following directory structure:

├── VisDrone2019-DET-COCO
    ├── images
        ├── train
            ├── 0000002_00005_d_0000014.jpg
            ├── ...
        ├── val
            ├── 0000001_02999_d_0000005.jpg
            ├── ...
    ├── annotations
        ├── VisDrone2019-DET_train_coco.json
        ├── VisDrone2019-DET_val_coco.json

Download UAVDT from [Baidu Drive][Google Drive].

Train

Train with 8 GPUs:

bash tools/dist_train.sh config_remdet/remdet/remdet_x-300e_coco.py 8 --amp --work-dir work_dir/remdet_x

Acknowledgements

We thank but not limited to following repositories for providing assistance for our research:

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
config_remdet		config_remdet
configs		configs
demo		demo
mmdet		mmdet
requirements		requirements
resources		resources
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
model-index.yml		model-index.yml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RemDet

Introduction

Main results

Object Detection Performance for VisDrone2019:

Object Detection Performance for COCO2017:

Object Detection

Environments

Prepare VisDrone2019 Dataset

Train

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

HZAI-ZJNU/RemDet

Folders and files

Latest commit

History

Repository files navigation

RemDet

Introduction

Main results

Object Detection Performance for VisDrone2019:

Object Detection Performance for COCO2017:

Object Detection

Environments

Prepare VisDrone2019 Dataset

Train

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages