Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Have you considered TokenMix in hidden layers? #6

Open
cmsflash opened this issue Sep 10, 2022 · 5 comments
Open

Have you considered TokenMix in hidden layers? #6

cmsflash opened this issue Sep 10, 2022 · 5 comments

Comments

@cmsflash
Copy link

In addition to TokenMix before the first Transformer block, have you considered or tried TokenMix in the middle of the model?

@jihaonew
Copy link
Collaborator

I did try it.
If I understand correctly, it is close to Manifold Mixup. I believe this will be an interesting extension of CutMix/TokenMix (etc) in the feature space.
Have you tried CutMix in the feature space?

@cmsflash
Copy link
Author

Yep, I am indeed effectively talking about Manfold Mixup for TokenMix/CutMix. The Manifold Mixup paper is very interesting and thank you so much for bringing it up. I haven't tried the hidden layer version of CutMix/TokenMix. If you have tried hidden-layer TokenMix, could you share the results?

@jihaonew
Copy link
Collaborator

jihaonew commented Sep 22, 2022

OK.

@cmsflash
Copy link
Author

Great! Do you want to share the results on this issue or somewhere else?

@cmsflash
Copy link
Author

@jihaonew Did you get any results?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants