Make force_fp32_for_softmax
arg in MultiHeadDotProductAttention
u…
#13331
Job | Run time |
---|---|
28s | |
2s | |
34s | |
11s | |
16s | |
33s | |
5m 21s | |
7m 7s | |
4m 1s | |
5m 25s | |
6m 53s | |
3m 53s | |
6m 39s | |
8m 27s | |
12m 50s | |
1h 2m 40s |