Replies: 1 comment 2 replies
-
You might also want to try using the matrix before/after softmax. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi!
I was wondering if it's possible to directly measure similarity between any 2 frames (as a real number).
The idea is simple - since in the process of memory matching XMem takes top-k most similar frames to the query frame, it should be pretty straightforward to measure this "similarity" explicitly.
However, I'm stuck on implementation. The affinity matrix of size
[N*H*W x H*W]
implies that every single pixel in the output image/feature map is a linear combination of every single pixel in all theN
memory frames.So I have 2 questions:
A
would be of sizeHW x HW
, so what do the individual entries mean? IfA[i, j]
is a large positive number, does this mean that the pixeli
in image 1 is very similar to the pixelj
in image 2?Beta Was this translation helpful? Give feedback.
All reactions