[Feature Request] Video VL Support #1290

Open
kleineluka opened this issue Dec 29, 2024 · 0 comments
Describe the Issue
New multimodal models support not only image captioning (which Kobold already implements) but also video captioning. For examples, see Qwen2-VL or Apollo (which is built on Qwen).

Additional Information:
For the UI, a simple "Add video" button beside the "Add img" button would suffice, although I believe getting it working with the API is more important. If there is already a way to achieve this with Kobold and I'm mistaken, please let me know!

Thank you for all the hard work! ^_^
