[Feature Request] Video VL Support #1290

kleineluka · 2024-12-29T14:47:23Z

Describe the Issue
Newer multimodal models support not only image captioning (which Kobold implements) but also video captioning. For examples, see Qwen2-VL or Apollo (which is built on Qwen).

Additional Information:
For the UI implementation, a simple "Add video" button beside the "Add img" button would suffice, although I believe getting it working with the API is more important. If there is already a way to achieve this with Kobold and I'm mistaken, please let me know!

Thank you for all the hard work! ^_^
