
Slight difference between implementation and paper. #6

Open
AI-Guru opened this issue Nov 6, 2019 · 2 comments
Comments


AI-Guru commented Nov 6, 2019

Namaste!

I think this is my final issue :D

In the implementation of the classifier, you start with single-head attention, then apply the spatial transformation followed by multi-head attention. In the paper, on the other hand, you start with the transformation.

I have the feeling that I have missed something. What is your opinion?
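The ordering discrepancy described above can be sketched as follows. This is a minimal illustration only; the function names (`single_head_attn`, `spatial_transform`, `multi_head_attn`) are placeholders, not the repository's actual identifiers, and the point is simply that the two orderings generally compute different things:

```python
def classifier_as_implemented(x, single_head_attn, spatial_transform, multi_head_attn):
    # implementation (as read from the code): single-head attention first,
    # then the spatial transform, then multi-head attention
    x = single_head_attn(x)
    x = spatial_transform(x)
    return multi_head_attn(x)

def classifier_as_in_paper(x, single_head_attn, spatial_transform, multi_head_attn):
    # paper (as read from the figure): spatial transform first
    x = spatial_transform(x)
    x = single_head_attn(x)
    return multi_head_attn(x)
```

Since the attention and the learned transform do not commute in general, the two pipelines are only equivalent in special cases.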


AI-Guru commented Nov 6, 2019

Another difference, correct me if I am wrong: the implementation of attn_feature does not match the paper 100%. In particular, "edge_feature = input_feature_tiled - neighbors" seems to be missing from the figure.
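For reference, the line quoted above corresponds to a DGCNN-style edge feature: each point's feature is tiled across its k nearest neighbors and the neighbor features are subtracted. A minimal NumPy sketch, assuming this reading (the variable names follow the issue; the shapes and kNN construction are my assumption, not the repository's code):

```python
import numpy as np

def edge_features(points, k=2):
    """points: (N, C) array; returns (N, k, C) edge features x_i - x_j."""
    # pairwise squared distances between all points
    d2 = ((points[:, None, :] - points[None, :, :]) ** 2).sum(axis=-1)
    # indices of the k nearest neighbors, excluding the point itself
    idx = np.argsort(d2, axis=1)[:, 1:k + 1]
    neighbors = points[idx]                                          # (N, k, C)
    input_feature_tiled = np.repeat(points[:, None, :], k, axis=1)   # (N, k, C)
    edge_feature = input_feature_tiled - neighbors                   # the quoted line
    return edge_feature
```

If this subtraction is indeed absent from the figure, the picture would describe raw neighbor features rather than relative (edge) features.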


AI-Guru commented Nov 9, 2019

Yes. The paper mentions that a single-head attention step happens as part of the spatial transform.
