[arXiv:2408.09764] Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms Xiao Wang, Shiao Wang, Pengpeng Shao, Bo Jiang, Lin Zhu, Yonghong Tian [arXiv]
Human Action Recognition (HAR) stands as a pivotal research domain in both computer vision and artificial intelligence, with RGB cameras dominating as the preferred tool for investigation and innovation in this field. However, in real-world applications, RGB cameras encounter numerous challenges, including light conditions, fast motion, and privacy concerns. Consequently, bio-inspired event cameras have garnered increasing attention due to their advantages of low energy consumption, high dynamic range, etc. Nevertheless, most existing event-based HAR datasets are low resolution (
A demo video can be found by clicking the image below:
A Spatial-Temporal Scanning framework for Event Stream-based Human Action Recognition.
conda create -n evmamba python=3.10.13
conda activate evmamba
pip install torch==2.0.0 torchvision==0.15.1 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
Please refer to Vmamba to install the following packages:
cd kernels/selective_scan && pip install .
Install the required packages in mmaction
pip install -U openmim
mim install mmengine
pip install mmcv==2.0.0 -f https://download.openmmlab.com/mmcv/dist/cu118/torch2.0/index.html
pip install -v -e .
Download pre-trained vssmbase_dp06_ckpt_epoch_241 and put it under $/pretrained_models
Then, put the HAR dataset Celex-HAR in ./data
You can modify the paths of pretrained_model and dataset by editing
EVMamba\mmaction\models\backbones\classification\config_b.py
EVMamba\configs\recognition\EVMamba\CeleX-HAR.py
# train & test
bash train.sh
bash dist_train.sh  (For Distributed Training)
- BaiduYun:
 
💾 Baidu Netdisk link: https://pan.baidu.com/s/1yvJje7DqIn8qT9fmQMbeeQ?pwd=wsad code:wsad
- Dropbox:
 
💾 [Dropbox URL]
The directory should have the following format:
├── CeleX-HAR (124,625 videos (training subset: 99,642 videos;  testing subset: 24,983 videos;))
    ├── CeleX_HR (377.32GB)
        ├── rawframes
            ├── action_001_pull_up_weeds
                ├── action_001_20220221_110904108_EI_70M
                    ├── 0000.png
                    ├── 0001.png
                    ├── 0002.png
                    ├── ...
                ├── action_001_20220221_110910254_EI_70M
                ├── ...
            ├── action_002_take_somebody's_pulse
            ├── action_003_move_the_chair
            ├── ...
    ├── celex_voxel (15.55GB)
        ├── action_001_pull_up_weeds
            ├── action_001_20220221_110904108_EI_70M.mat
            ├── action_001_20220221_110910254_EI_70M.mat
            ├── ...
        ├── action_002_take_somebody's_pulse
        ├── action_003_move_the_chair
        ├── ...If you have any questions about this work, please leave an issue. Also, please give us a star if you think this paper helps your research.
@article{wang2024celexhar,
  title={Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms},
  author={Wang, Xiao and Wang, Shiao and Shao, Pengpeng and Jiang, Bo and Zhu, Lin and Tian, Yonghong},
  journal={arXiv preprint arXiv:2408.09764},
  year={2024}
}

  


