-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Have you considered TokenMix in hidden layers? #6
Comments
I did try it. |
Yep, I am indeed effectively talking about Manfold Mixup for TokenMix/CutMix. The Manifold Mixup paper is very interesting and thank you so much for bringing it up. I haven't tried the hidden layer version of CutMix/TokenMix. If you have tried hidden-layer TokenMix, could you share the results? |
OK. |
Great! Do you want to share the results on this issue or somewhere else? |
@jihaonew Did you get any results? |
In addition to TokenMix before the first Transformer block, have you considered or tried TokenMix in the middle of the model?
The text was updated successfully, but these errors were encountered: