Improve post-processing performance #10159

soof-golan · 2024-12-09T16:02:23Z

What does this PR do?

* Use multiplication instead of division in VaeImageProcessor.denormalize
* Avoid splitting and re-stacking tensors to reduce memory bandwidth and CPU-GPU syncs
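
As a rough illustration of the first point, the sketch below compares the two scalar forms of the denormalization. It assumes denormalize maps values from [-1, 1] to [0, 1] and clamps the result; the function names are hypothetical and are not the library's. Because 0.5 is a power of two, `x * 0.5` and `x / 2` round to the same floating-point value, so the change should not alter outputs.

```python
def denormalize_div(x: float) -> float:
    """Division form: x / 2 + 0.5, clamped to [0, 1]."""
    return min(max(x / 2 + 0.5, 0.0), 1.0)


def denormalize_mul(x: float) -> float:
    """Multiplication form: x * 0.5 + 0.5, clamped to [0, 1]."""
    return min(max(x * 0.5 + 0.5, 0.0), 1.0)


samples = [-1.5, -1.0, -0.3, 0.0, 0.1, 0.7, 1.0, 2.0]
# Both forms agree exactly on every sample, including out-of-range inputs.
assert all(denormalize_div(x) == denormalize_mul(x) for x in samples)
```

Whether the multiplication is measurably faster on a given device is the PR's claim; this sketch only checks that the two forms agree.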

HuggingFaceDocBuilderDev · 2024-12-09T16:17:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

hlky · 2024-12-09T16:18:25Z

Thanks for your contribution @soof-golan! Can you run make style?

soof-golan · 2024-12-09T16:26:01Z

> Thanks for your contribution @soof-golan! Can you run make style?

@hlky Done!

yiyixuxu · 2024-12-09T22:50:07Z

src/diffusers/image_processor.py

+
+        # De-normalizing a batch and selectively torch.stack'ing the results turns out to be
+        # significantly faster than performing a lot of smaller denormalizations
+        denormalized = self.denormalize(images)


I think there is some context to that: for sd1.5, each image in the batch may have a different value for do_denormalize. See the code here:

diffusers/src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py

Line 1075 in 6131a93

image = self.image_processor.postprocess(image, output_type=output_type, do_denormalize=do_denormalize)

Most of the new pipelines (sdxl, sd3, flux) do not pass do_denormalize from the pipeline, i.e. do_denormalize is None here. See SDXL: https://github.com/huggingface/diffusers/blob/main/src/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl.py#L1304

That case is already batch processed for all the new pipelines, so I think this line is sufficient here:

    if do_denormalize is None:
        return self.denormalize(images) if self.config.do_normalize else images
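
To make the per-image case concrete, here is a minimal sketch of how a list of flags could end up differing within one batch. It assumes the sd1.5 pipeline derives the flags from a per-image safety-checker result; the helper name `build_do_denormalize` and the `has_nsfw_concept` input are assumptions for illustration and do not come from this thread.

```python
from typing import List, Optional


def build_do_denormalize(has_nsfw_concept: Optional[List[bool]], batch_size: int) -> List[bool]:
    """Hypothetical helper: return one denormalize flag per image in the batch."""
    if has_nsfw_concept is None:
        # No safety-checker result: denormalize every image.
        return [True] * batch_size
    # Skip denormalization for images the safety checker flagged.
    return [not flagged for flagged in has_nsfw_concept]


flags = build_do_denormalize([False, True, False], batch_size=3)
print(flags)  # [True, False, True]
```

With flags like these, the batch is mixed, which is why the per-image path cannot simply be dropped for sd1.5.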

yiyixuxu · 2024-12-09T22:53:44Z

src/diffusers/image_processor.py

-        image = torch.stack(
-            [self.denormalize(image[i]) if do_denormalize[i] else image[i] for i in range(image.shape[0])]
-        )
+        image = self._denormalize_conditionally(image, do_denormalize)


    if do_denormalize is None:
        image = self.denormalize(image) if self.config.do_normalize else image
    else:
        image = torch.stack(
            [self.denormalize(image[i]) if do_denormalize[i] else image[i] for i in range(image.shape[0])]
        )
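
The branching suggested above can be sketched without torch, using plain Python lists as stand-ins for image tensors. This is only an illustration of the control flow under that assumption; `denormalize_batch` and its `do_normalize` parameter are hypothetical names, not the library's API.

```python
from typing import List, Optional

Image = List[float]  # stand-in for one image tensor with values in [-1, 1]


def denormalize(image: Image) -> Image:
    """Map values from [-1, 1] to [0, 1] and clamp."""
    return [min(max(v * 0.5 + 0.5, 0.0), 1.0) for v in image]


def denormalize_batch(
    images: List[Image],
    do_denormalize: Optional[List[bool]],
    do_normalize: bool = True,
) -> List[Image]:
    if do_denormalize is None:
        # Uniform case: one decision for the whole batch. With real tensors this
        # would be a single batched call rather than a Python loop.
        return [denormalize(img) for img in images] if do_normalize else images
    # Mixed case: each image follows its own flag.
    return [denormalize(img) if flag else img for img, flag in zip(images, do_denormalize)]


batch = [[-1.0, 1.0], [0.0, 0.5]]
print(denormalize_batch(batch, None))           # [[0.0, 1.0], [0.5, 0.75]]
print(denormalize_batch(batch, [True, False]))  # [[0.0, 1.0], [0.0, 0.5]]
```

The point of the split is that only the mixed case needs per-image work; pipelines that pass None take the batched path.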

soof-golan · 2024-12-10T10:28:31Z

@yiyixuxu I've opened up #10170 with only the simpler optimization, leaving the homogeneous case untouched.

Choose whichever you see fit for the scope of the project :)

hlky · 2024-12-10T18:11:20Z

Closed by #10170

Improve post-processing performance

61fc97b

* Use multiplication instead of division
* Avoid splitting and re-stacking tensors to reduce memory bandwidth and CPU-GPU syncs

soof-golan force-pushed the perf-denormlization-vectorization branch from f94fd19 to 61fc97b Compare December 9, 2024 16:25

hlky approved these changes Dec 9, 2024

hlky added the close-to-merge label Dec 9, 2024

yiyixuxu reviewed Dec 9, 2024

soof-golan mentioned this pull request Dec 10, 2024

Improve post-processing performance #10170

Merged

hlky closed this Dec 10, 2024
