Provide a way to extract XMP metadata (png & webp & tiff only for now) #2567

1c3t3a · 2025-08-28T16:32:42Z

XMP is a common metadata format, and several image formats supported by this crate can carry XMP metadata. Similar to the ICC profile and Exif metadata, we extend the ImageDecoder trait to provide this functionality.

For now this is only implemented for PNG, WebP, and TIFF.

This is related to #2568.
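To illustrate the shape of the change described above, here is a minimal sketch of a decoder trait extended with an XMP accessor that defaults to "no metadata". The names `MetadataDecoder`, `NoMetadataDecoder`, and `FakePngDecoder` are hypothetical stand-ins and the `io::Result<Option<Vec<u8>>>` return type is an assumption; this is not the crate's actual trait definition.

```rust
use std::io;

/// Hypothetical stand-in for a decoder trait; NOT the crate's real definition.
pub trait MetadataDecoder {
    /// Raw ICC profile bytes, if the format carries any.
    fn icc_profile(&mut self) -> io::Result<Option<Vec<u8>>> {
        Ok(None)
    }

    /// Raw XMP packet bytes, if the format carries any.
    /// The default lets decoders without XMP support compile unchanged.
    fn xmp_metadata(&mut self) -> io::Result<Option<Vec<u8>>> {
        Ok(None)
    }
}

/// A decoder that does not override anything and so reports no metadata.
pub struct NoMetadataDecoder;

impl MetadataDecoder for NoMetadataDecoder {}

/// A decoder that found an XMP packet while parsing (illustrative only).
pub struct FakePngDecoder {
    pub xmp: Option<Vec<u8>>,
}

impl MetadataDecoder for FakePngDecoder {
    fn xmp_metadata(&mut self) -> io::Result<Option<Vec<u8>>> {
        Ok(self.xmp.clone())
    }
}
```

With a default method, adding the accessor is not a breaking change for existing decoder implementations: only formats that actually support XMP need to override it.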

src/codecs/png.rs

XMP is a common metadata format, and several image formats supported by this crate can carry XMP metadata. Similar to the ICC profile and Exif metadata, we extend the ImageDecoder trait to provide this functionality. For now this is only implemented for PNG.

1c3t3a · 2025-09-01T10:30:46Z

Yehaa, I rebased after #2574, and this looks ready to go from my side. Any further points to address, @197g?

1c3t3a · 2025-09-01T16:57:57Z

Sorry for the back-and-forth. I decided to add TIFF now as well, since the dependency roll pulled in the fix for processing the Bytes in TIFF :)

197g

Actually, having more decoders support it makes the addition much more convincing as common behavior, and that set in particular is really core.

Shnatsel · 2025-09-01T21:02:36Z

zune-jpeg crate also supports XMP, but only in the 0.5.x series which is yet to see a stable release, while image is still on 0.4.x.

It would be great to coordinate with the zune-jpeg author to clear the remaining issues and cut a release.

fintelia · 2025-09-01T21:26:43Z

This PR unfortunately makes it harder to fix the memory limit handling for the PNG decoder, because it requires PngDecoder::new to read all the text chunks into memory before we know what the memory limit should be. Not impossible to fix, but potentially quite tricky.

Shnatsel · 2025-09-01T21:34:27Z

Isn't that what we added the Seek bound for?

1c3t3a · 2025-09-01T21:59:56Z

I am also happy to move the metadata extraction to the png crate if it fits better there. That way we could just peek the keyword and otherwise seek over the contents?

197g · 2025-09-01T22:34:36Z

> Isn't that what we added the Seek bound for?

Yes, since we've moved to ignore ancillary chunks that are broken we might also partially ignore them when they exceed memory limits. And by partially I mean keep a list of offsets where they occur so that their contents can be selectively retrieved. That should combine well with interfaces to read such chunks without fully allocating them in memory (also planned for chunks we want to ignore). We could add a flag to do so preemptively while keeping only a prefix in memory, enough to determine if they are relevant for XMP or Adobe embeds. I think there are a lot of potential variants that won't break functionality (with minor coordination to use those features in image as they are released maybe).

fintelia · 2025-09-01T22:53:47Z

Thought about this a bit more, and I think there's a way we can implement this without needing invasive changes to the png crate chunk parsing state machine. The main point would be storing the positions of text chunks during initial parsing, and only seeking to each one and reading its label if the xmp_metadata method is called. libpng defaults to only storing 1000 text chunks, so we could similarly bound the worst-case behavior.
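The idea above can be sketched roughly as follows: one pass records where text chunks live without reading their contents, and a later call seeks back and reads only a short prefix of each to check the keyword. This is a sketch under stated assumptions, not the png crate's API: `ChunkPos`, `index_itxt_chunks`, `find_xmp`, and `MAX_TEXT_CHUNKS` are hypothetical names, CRCs are not validated, only uncompressed iTXt chunks are handled, and a real implementation would check the chunk length against the memory limit before allocating.

```rust
use std::io::{self, Read, Seek, SeekFrom};

const PNG_SIGNATURE: [u8; 8] = [0x89, b'P', b'N', b'G', 0x0D, 0x0A, 0x1A, 0x0A];
/// iTXt keyword under which XMP packets are stored in PNG files.
const XMP_KEYWORD: &[u8] = b"XML:com.adobe.xmp";
/// Assumed cap on remembered text chunks, mirroring the libpng default cited above.
const MAX_TEXT_CHUNKS: usize = 1000;

/// Position and length of one iTXt chunk's data (hypothetical helper type).
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct ChunkPos {
    pub offset: u64,
    pub len: u32,
}

/// Walk the chunk list and record iTXt positions without reading their data.
pub fn index_itxt_chunks<R: Read + Seek>(r: &mut R) -> io::Result<Vec<ChunkPos>> {
    let mut sig = [0u8; 8];
    r.read_exact(&mut sig)?;
    if sig != PNG_SIGNATURE {
        return Err(io::Error::new(io::ErrorKind::InvalidData, "not a PNG"));
    }
    let mut found = Vec::new();
    loop {
        // Each chunk starts with a 4-byte big-endian length and a 4-byte type.
        let mut header = [0u8; 8];
        r.read_exact(&mut header)?;
        let len = u32::from_be_bytes([header[0], header[1], header[2], header[3]]);
        let kind = [header[4], header[5], header[6], header[7]];
        let offset = r.stream_position()?;
        if &kind == b"iTXt" && found.len() < MAX_TEXT_CHUNKS {
            found.push(ChunkPos { offset, len });
        }
        if &kind == b"IEND" {
            return Ok(found);
        }
        // Skip the chunk data and its 4-byte CRC (not validated in this sketch).
        r.seek(SeekFrom::Current(i64::from(len) + 4))?;
    }
}

/// Seek to each recorded chunk, read just enough to check the keyword, and
/// read the full body only for the first uncompressed XMP chunk.
pub fn find_xmp<R: Read + Seek>(r: &mut R, chunks: &[ChunkPos]) -> io::Result<Option<Vec<u8>>> {
    // Keyword, its NUL terminator, compression flag, compression method.
    let prefix_len = XMP_KEYWORD.len() + 3;
    for pos in chunks {
        if (pos.len as usize) < prefix_len {
            continue;
        }
        r.seek(SeekFrom::Start(pos.offset))?;
        let mut prefix = vec![0u8; prefix_len];
        r.read_exact(&mut prefix)?;
        let (keyword, rest) = prefix.split_at(XMP_KEYWORD.len());
        // rest[0] is the NUL terminator, rest[1] the compression flag.
        if keyword != XMP_KEYWORD || rest[0] != 0 || rest[1] != 0 {
            continue;
        }
        let mut body = vec![0u8; pos.len as usize - prefix_len];
        r.read_exact(&mut body)?;
        // Skip the NUL-terminated language tag and translated keyword.
        let mut parts = body.splitn(3, |&b| b == 0);
        let (_lang, _translated) = (parts.next(), parts.next());
        return Ok(parts.next().map(|text| text.to_vec()));
    }
    Ok(None)
}
```

In this arrangement the indexing pass holds only a fixed-size record per text chunk, so the worst-case memory before the limit is known is bounded by `MAX_TEXT_CHUNKS` entries, and the cost of reading chunk contents is paid only when XMP is actually requested.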

1c3t3a force-pushed the xmp-metadata branch from 9a30dc4 to cb0935d · August 28, 2025 18:11

1c3t3a changed the title from "Provide a way to extract XMP metadata (png-only for now)" to "Provide a way to extract XMP metadata (png & webp only for now)" · Aug 29, 2025

1c3t3a force-pushed the xmp-metadata branch 3 times, most recently from 4cdedb2 to 53f7700 · September 1, 2025 10:27

Commit c0d1781: Implement retrieving XMP metadata for Webp

1c3t3a force-pushed the xmp-metadata branch from 53f7700 to c0d1781 · September 1, 2025 10:30

1c3t3a force-pushed the xmp-metadata branch from dc27e0f to 82fdd4c · September 1, 2025 16:25

1c3t3a changed the title from "Provide a way to extract XMP metadata (png & webp only for now)" to "Provide a way to extract XMP metadata (png & webp & tiff only for now)" · Sep 1, 2025

1c3t3a force-pushed the xmp-metadata branch from 82fdd4c to 86fff31 · September 1, 2025 16:46

Commit 2686db5: Implement retrieving XMP metadata for tiff

1c3t3a force-pushed the xmp-metadata branch from 86fff31 to 2686db5 · September 1, 2025 16:48

197g approved these changes · Sep 1, 2025

197g merged commit 9afe256 into image-rs:main · Sep 1, 2025
32 checks passed
