[fix] VisualBERT returns attention tuple #1036 #1
PROBLEM: The default value of `output_attentions` in the `forward()` definition of `BertEncoderJit` (in `mmf/modules/hf_layers.py`) is `False`. So even if the user/developer specifies `output_attentions = True` in the config, the default `False` is used instead, and VisualBERT returns an empty tuple for attentions.

FIX: Set the default of `output_attentions` to `None` in `BertEncoderJit`'s `forward()` definition, and update `output_attentions` to `self.output_attentions` when it is not passed as an argument (i.e. it is `None`). Now `output_attentions` takes the value of `self.output_attentions`, which was initialized from the config when the `BertEncoderJit` class was instantiated.

The issue with `output_hidden_states` was the same, and it was fixed in the same way.

Tested locally.
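The fallback pattern described above can be sketched as follows. This is a minimal illustration, not the actual MMF code: the class name `EncoderSketch` and its constructor are hypothetical stand-ins for `BertEncoderJit`, which reads these flags from its config.

```python
class EncoderSketch:
    """Hypothetical stand-in for BertEncoderJit's flag handling."""

    def __init__(self, output_attentions, output_hidden_states):
        # Flags captured from the config at instantiation time.
        self.output_attentions = output_attentions
        self.output_hidden_states = output_hidden_states

    def forward(self, output_attentions=None, output_hidden_states=None):
        # Before the fix these defaulted to False, silently overriding
        # the config. Defaulting to None lets the config value apply
        # whenever the caller does not pass an explicit argument.
        if output_attentions is None:
            output_attentions = self.output_attentions
        if output_hidden_states is None:
            output_hidden_states = self.output_hidden_states
        return output_attentions, output_hidden_states


enc = EncoderSketch(output_attentions=True, output_hidden_states=True)
print(enc.forward())                          # config values used: (True, True)
print(enc.forward(output_attentions=False))   # explicit argument wins: (False, True)
```

The key point is that `None` acts as a sentinel for "not specified", so an explicit `False` from the caller is still respected while the config-derived attribute fills in the gap.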