Accessing image x text features #362

adrian-dalessandro · 2024-10-10T07:09:22Z

I'm interested in inspecting the image crossed with text features during the fusion step. However, when I extract them it appears the multi scale image features are concatenated and have the approximate shape (B, 10000+, 256). There isn't a square number of image patches so I can't just reshape it to (B,H,W,256). How can I parse out the multi-scale features.

ParthaNanjanagud · 2024-12-17T03:09:47Z

Hey were you able to find the solution to your issue ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accessing image x text features #362

Accessing image x text features #362

adrian-dalessandro commented Oct 10, 2024

ParthaNanjanagud commented Dec 17, 2024

Accessing image x text features #362

Accessing image x text features #362

Comments

adrian-dalessandro commented Oct 10, 2024

ParthaNanjanagud commented Dec 17, 2024