Target layers of ViT models #513

WangYZ1608 · 2024-07-05T07:14:21Z

Hi @jacobgil. It's a nice work! Thanks to you and all the contributors behind it.

I tried replacing norm1 with block[-1].attn or norm2 without any other changes, but the gradient seems to be 0, as shown below (I test with GradCAM and LayerCAM. And I could get satisfactory result when using norm1). What is the reason for this?

Also, I would like to ask, how to visualize the class token? And how to use CAM in an image containing multiple instances?

original image

result

zermatt-luo · 2024-07-24T02:18:24Z

Hello, did you resolve this issue? I have a similar problem!

sophmrtn · 2024-09-19T10:06:46Z

I also have the same issue!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Target layers of ViT models #513

Target layers of ViT models #513

WangYZ1608 commented Jul 5, 2024

zermatt-luo commented Jul 24, 2024

sophmrtn commented Sep 19, 2024

Target layers of ViT models #513

Target layers of ViT models #513

Comments

WangYZ1608 commented Jul 5, 2024

zermatt-luo commented Jul 24, 2024

sophmrtn commented Sep 19, 2024