CLIP skip experiment on style coadapter #50
hafriedlander
started this conversation in
Show and tell
Replies: 1 comment
-
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I noticed sometimes when using style images that semantic elements from the style image would affect the result - for instance, a photo with many people would result in people in the output, even if the sketch image and prompt are for something else (i.e. a motorcycle).
I wondered if using earlier hidden layers from the clip image encoding would help.
The answer is "sort of". It's a sometimes useful technique, but also has some drawbacks.
Here's a couple of examples:
Starry night style, final layer results in many stars in the sky, penultimate layer reduces them while still giving a painterly style (but the colors shift too, undesirably)
Style source:
Final hidden layer:
Penultimate layer:
A random image from unsplash, when using the "Deliberate" model you get a lot of the houses from the style image appearing in the result. Penultimate still gives a similar style without repeating those elements. This is probably my best success so far with this technique:
Style source:
Final hidden layer:
Penultimate layer:
Beta Was this translation helpful? Give feedback.
All reactions