Attention Masked Dataset Condensation

Dataset condensation was proposed in the paper Dataset Condensation with Gradient Matching. It aims to condense a large training set into a small synthetic set such that a model trained on the synthetic set achieves test performance comparable to a model trained on the full training set.
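To make the gradient-matching idea concrete, here is a minimal pure-Python sketch of a matching loss. It assumes a simplified distance, one cosine distance per layer summed over layers. The function names and toy numbers are illustrative only and are not taken from this repository, whose exact distance function may differ.

```python
import math

def cosine_distance(u, v, eps=1e-12):
    """Return 1 - cos(u, v) for two equal-length gradient vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v + eps)

def gradient_matching_loss(real_grads, syn_grads):
    """Sum per-layer cosine distances (assumed, simplified matching loss).

    Each argument is a list holding one flattened gradient vector per layer.
    """
    if len(real_grads) != len(syn_grads):
        raise ValueError("both gradient lists need one entry per layer")
    return sum(cosine_distance(g_r, g_s) for g_r, g_s in zip(real_grads, syn_grads))

# Toy gradients for a hypothetical two-layer model (made-up numbers).
real = [[1.0, 0.0], [0.5, 0.5, 0.0]]
aligned = [[2.0, 0.0], [1.0, 1.0, 0.0]]      # same directions, different scale
opposed = [[-1.0, 0.0], [-0.5, -0.5, 0.0]]   # opposite directions

loss_aligned = gradient_matching_loss(real, aligned)
loss_opposed = gradient_matching_loss(real, opposed)
```

In this sketch the loss is close to zero when the synthetic gradients point in the same direction as the real ones, regardless of scale, and grows as the directions diverge. Condensation would update the synthetic images to reduce this value.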

Our paper proposes applying attention masking to input images so that the model learns only from the important visual parts of each image. Future work could extend this approach to background removal or debiasing. Our paper was accepted to IPIU2022.

Method

Figure 1: The proposed method uses an attention map to mask each input so that only its important visual parts remain. After training a classification model with cross-entropy (CE) loss on the masked images only, we synthesize a small training set that produces gradients similar to those of the given masked images.
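The masking step can be sketched as follows. This is a minimal illustration that assumes the attention map is binarized with a fixed threshold and multiplied element-wise with a single-channel image. The threshold value, function names, and toy data are assumptions, and the repository's actual mask format may differ.

```python
def attention_mask(attention, threshold=0.5):
    """Binarize a 2-D attention map: 1 where attention >= threshold, else 0.

    The threshold of 0.5 is an assumed value chosen for illustration.
    """
    return [[1 if a >= threshold else 0 for a in row] for row in attention]

def apply_mask(image, mask):
    """Zero out pixels whose mask entry is 0 (single-channel image)."""
    return [
        [pixel * keep for pixel, keep in zip(img_row, mask_row)]
        for img_row, mask_row in zip(image, mask)
    ]

# Toy 2x3 image and attention map (made-up numbers).
image = [[0.2, 0.9, 0.4],
         [0.7, 0.5, 0.1]]
attention = [[0.1, 0.8, 0.6],
             [0.3, 0.5, 0.0]]

mask = attention_mask(attention)
masked = apply_mask(image, mask)
```

Pixels with low attention are set to zero, so a classifier trained on `masked` only sees the regions the attention map marks as important.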

Setup

Install the packages listed in requirements.txt. The attention masks can be downloaded here.

Proposed model - Table 1

python main.py --dataset CIFAR10 --model AlexCifarNet --ipc 10
# --ipc (images/class): 1, 10, 20, 30, 40, 50

Proposed model + Differentiable Siamese Augmentation (DSA) - Table 2

python main.py --dataset CIFAR10 --model AlexCifarNet --ipc 10 --method DSA --init real --dsa_strategy color_crop_cutout_flip_scale_rotate
# --ipc (images/class): 1, 10, 20, 30, 40, 50
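To run every listed --ipc setting in one go, a small helper script along these lines may be convenient. It only builds the command lines shown above; the helper name `build_command` and the script itself are hypothetical and not part of this repository.

```python
import shlex

# --ipc values listed in the comments above.
IPC_VALUES = [1, 10, 20, 30, 40, 50]

# Extra flags copied from the DSA command above.
DSA_FLAGS = [
    "--method", "DSA",
    "--init", "real",
    "--dsa_strategy", "color_crop_cutout_flip_scale_rotate",
]

def build_command(ipc, extra_flags=()):
    """Return the argument list for one run of main.py (hypothetical helper)."""
    base = ["python", "main.py", "--dataset", "CIFAR10",
            "--model", "AlexCifarNet", "--ipc", str(ipc)]
    return base + list(extra_flags)

commands = [build_command(ipc) for ipc in IPC_VALUES]
dsa_commands = [build_command(ipc, DSA_FLAGS) for ipc in IPC_VALUES]

# Print the commands so they can be reviewed before launching anything.
for cmd in commands + dsa_commands:
    print(shlex.join(cmd))
```

Each list entry can be passed to `subprocess.run(cmd, check=True)` to launch the runs sequentially.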

Performance

	DD	DC	Ours
1 img/cls	-	24.2	24.5
10 img/cls	36.8	39.1	40.1

Table 1: Test accuracy (%) of AlexNet trained from scratch on 1 or 10 synthetic images per class. DD denotes Dataset Distillation and DC denotes Dataset Condensation with Gradient Matching.
