
How to get last-layer hidden states $H_{link}$ during testing? #11

Closed
xushilin1 opened this issue Jun 30, 2024 · 7 comments

Comments

@xushilin1

As mentioned in your paper, the Super-Link Queries are automatically added after the input embeddings of the routing token. However, during testing, the user's input prompt does not include any routing token. How can you send the Super-Link Queries to the MLLM and obtain the corresponding hidden states $H_{link}$?

@wjn922
Collaborator

wjn922 commented Jul 2, 2024

Thanks for your question.

During testing, we rely on the LLM to interpret the user's input prompt and output the appropriate routing token when needed. That's why we need to construct instruction templates for the different tasks and finetune the LLM, as specified in Sec. 3.2 (1) and Appendix E.

Here is an example for detection:
USER: Where can we locate the dog in the image?
ASSISTANT: The detection results for dog [DET] are presented.
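
A minimal sketch of how such a detection training sample might be assembled, so the LLM learns to emit the routing token in its response. The function name, template strings, and dict layout here are illustrative assumptions, not the repository's actual code:

```python
# Sketch only: the routing token [DET] appears in the target response, so the
# finetuned LLM learns to generate it by itself at test time.
DET_TOKEN = "[DET]"

def build_detection_sample(category: str) -> list[dict]:
    user_prompt = f"Where can we locate the {category} in the image?"
    target_response = f"The detection results for {category} {DET_TOKEN} are presented."
    return [
        {"role": "USER", "content": user_prompt},
        {"role": "ASSISTANT", "content": target_response},
    ]

print(build_detection_sample("dog"))
```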

@xushilin1
Author

xushilin1 commented Jul 2, 2024

During training, you input [DET] and the corresponding super-link queries $Q_{link}$ into the LLM to obtain $H_{link}$, which is then sent to the downstream decoder.

During testing, since the input prompt does not include [DET] and $Q_{link}$, how can you get $H_{link}$?

Is it correct that during training, the downstream decoders receive $H_{link}$ while during testing they receive $Q_{link}$?

Is there any inconsistency in the input of downstream decoders during training and testing?

@wjn922
Collaborator

wjn922 commented Jul 2, 2024

During testing, the LLM will output [DET], and we immediately append the $Q_{link}$ after it. Then, in the current generation step, the input_embeds will expand from [1, C] to [1 + num_embeds, C]. We can still get the last-layer hidden states $H_{link}$ during testing.

This part is the code for handling the super-link queries, which works well for both training and testing:

# NOTE: special operation for the [emb] tokens, this works well for both train and generation (use_cache=True)
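
For readers without the linked code at hand, here is a simplified sketch of that operation, assuming a HuggingFace-style causal LM that accepts inputs_embeds and returns hidden states. The names (SuperLinkWrapper, emb_token_id, num_embs, super_link_queries) are illustrative assumptions, not the repository's actual identifiers:

```python
import torch
import torch.nn as nn

class SuperLinkWrapper(nn.Module):
    """Sketch only: replaces the [EMB] placeholder embeddings with learnable
    super-link queries and gathers their last-layer hidden states."""

    def __init__(self, llm, hidden_size: int, num_embs: int, emb_token_id: int):
        super().__init__()
        self.llm = llm                      # HuggingFace-style causal LM (assumed)
        self.emb_token_id = emb_token_id    # id of the [EMB] placeholder token (assumed)
        self.num_embs = num_embs
        # Learnable super-link queries Q_link, shared across samples.
        self.super_link_queries = nn.Parameter(torch.randn(num_embs, hidden_size))

    def forward(self, input_ids, inputs_embeds, attention_mask=None):
        # Wherever an [EMB] placeholder appears, overwrite its embedding with the
        # corresponding learnable query. During generation, the same replacement is
        # applied to the positions appended right after the LLM emits [DET].
        emb_mask = input_ids == self.emb_token_id            # [B, L] bool
        num_groups = int(emb_mask.sum().item()) // self.num_embs
        inputs_embeds = inputs_embeds.clone()
        inputs_embeds[emb_mask] = self.super_link_queries.repeat(
            num_groups, 1
        ).to(inputs_embeds.dtype)

        out = self.llm(inputs_embeds=inputs_embeds,
                       attention_mask=attention_mask,
                       output_hidden_states=True)
        # H_link: last-layer hidden states at the query positions; these are the
        # features passed on to the downstream task decoder.
        h_link = out.hidden_states[-1][emb_mask]             # [num_groups * num_embs, C]
        return h_link
```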

@pangzss

pangzss commented Jul 8, 2024

During testing, the LLM will output [DET], and we immediately append the $Q_{link}$ after it. Then, in the current generation step, the input_embeds will expand from [1, C] to [1 + num_embeds, C]. We can still get the last-layer hidden states $H_{link}$ during testing.

This part is the code for handling the super-link queries, which works well for both training and testing:

# NOTE: special operation for the [emb] tokens, this works well for both train and generation (use_cache=True)

Does this mean that during training the earlier super-link embeddings do not attend to the later ones because of the causal attention mask, but during testing the different super-link embeddings attend to each other, since one forward pass is used to obtain all their hidden states?

@wjn922
Collaborator

wjn922 commented Jul 8, 2024

During both training and testing, the LLM always uses the causal mask.
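
A toy check of this point (illustrative only, arbitrary example sizes for prompt_len and num_embs): with a causal mask, appending all query embeddings in a single forward pass gives each query exactly the visibility it would have if the positions were processed one step at a time, so training and generation see the same attention pattern.

```python
import torch

prompt_len, num_embs = 5, 3
seq_len = prompt_len + num_embs
# Lower-triangular causal mask: position i may attend to positions 0..i only.
causal = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))

for i in range(num_embs):
    row = causal[prompt_len + i]
    visible = row[prompt_len:].sum().item()
    print(f"query {i} attends to the prompt and to queries 0..{visible - 1}")
# query 0 attends to the prompt and to queries 0..0
# query 1 attends to the prompt and to queries 0..1
# query 2 attends to the prompt and to queries 0..2
```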

@haofuly

haofuly commented Oct 15, 2024

Hi @wjn922,
Thanks for your detailed reply. I am not sure how the [EMB] token plays a role during training. Can you show an example illustrating the relationship between [EMB] and the emb_embeddings_det tensor? And where should we insert the emb_embeddings_det tensor into input_ids during training?
Thanks!
