
score of softmax on Text4k; linformer-256 & nystrom-64 don't work #15

Open
ZiweiHe opened this issue Mar 24, 2022 · 1 comment


ZiweiHe commented Mar 24, 2022

Hi,

Thanks for the excellent work!

I found a few issues in my trials (I didn't change anything in the code):

  1. Using softmax attention on Text4k, I got ~63.7 accuracy instead of the 65.02 reported in your paper.
  2. With linear attention on Text4k, I got ~64 accuracy, which is even higher than the vanilla transformer. Did you get the same result on your side?
  3. The attention types linformer-256 and nystrom-64 don't work; the errors are either dimension mismatches or config key errors. It seems that not all attention types run successfully in the released code. I didn't try all the choices, though.

Thank you for your time; I look forward to your reply.

Ziwei

mlpen (Owner) commented Mar 28, 2022

Are you using the code from LRA? That config file is an example. To run LRA with other attention mechanisms, you can modify "attn_type" (see the possible attention methods in the code) and add the settings specific to that attention type.
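For illustration, a minimal sketch of what such a config change might look like, assuming a Python config dict in the style of the LRA code; the key names below ("attn_type", "linformer_k", "num_landmarks", and the rest) are assumptions, not the repository's actual schema, so check them against the real config file in the repo.

```python
# Hypothetical sketch of switching attention types in an LRA-style config.
# All key names here are illustrative; verify them against the repository's
# actual LRA config file before running.

base_model = {
    "attn_type": "softmax",   # vanilla transformer attention
    "num_layers": 2,
    "num_head": 2,
    "embedding_dim": 64,
    "transformer_dim": 64,
    "max_seq_len": 4096,      # Text4k uses 4k-token sequences
}

# Linformer needs its projection dimension in addition to attn_type.
linformer_256 = dict(base_model, attn_type="linformer", linformer_k=256)

# Nystrom attention needs the number of landmark points.
nystrom_64 = dict(base_model, attn_type="nystrom", num_landmarks=64)
```

The point is that each attention variant carries its own extra hyperparameters, so changing "attn_type" alone is usually not enough; missing variant-specific settings would explain the config key errors reported above.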
