Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Subject: Inquiry About Lightweight Feature Extraction with Your Attention Mechanism #16

Open
Zhangyuhaoo opened this issue Jul 10, 2024 · 1 comment

Comments

@Zhangyuhaoo
Copy link

I hope this message finds you well. I recently read your impressive paper on [SwiftFormer: Efficient Additive Attention for Transformer-based
Real-time Mobile Vision Applications], and I must say I was truly amazed by your work.

I am currently working on a task related to feature point extraction and matching, and my focus is on developing lightweight models. I am particularly interested in whether it would be feasible to replace the standard self-attention mechanisms in backbone networks with the attention mechanism you proposed in your research.

I would be grateful for your insights or suggestions on this approach. I apologize for any inconvenience my inquiry might cause and look forward to your response.

Thank you very much, and best wishes.

@Amshaker
Copy link
Owner

Hi @Zhangyuhaoo,

Thank you for your kind words regarding SwiftFormer. I am delighted to hear that you found the work impressive.

Regarding your task on feature point extraction and matching with a focus on lightweight models, I believe integrating the additive attention mechanism from SwiftFormer could be a promising approach. SwiftFormer’s additive attention mechanism is designed to be computationally efficient while maintaining performance, making it suitable for real-time mobile vision applications. The extracted feature maps will contain rich spatial information, which is crucial for accurate feature point extraction.

However, if the order of the points is critical, you will need to consider how to incorporate positional embeddings effectively into SwiftFormer. This will ensure that the model retains the necessary spatial order information.

I hope this helps. Please feel free to reach out if you have any further questions or need additional insights.

Best regards,

Abdelrahman

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants