Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

图片识别的函数接口 #10

Open
1487879421888 opened this issue Mar 18, 2024 · 2 comments
Open

图片识别的函数接口 #10

1487879421888 opened this issue Mar 18, 2024 · 2 comments

Comments

@1487879421888
Copy link

作者你好,我之前看到一个,使用3.5的key,然后进行函数调用,可以实现,你任意发一张图片,然后可以识别,返回结果。请问作者知道这个不?

@devcxl
Copy link
Owner

devcxl commented Mar 18, 2024

我去翻了下文档,这个功能可以做到。但是gpt-3.5-turbo是不具备视觉能力的,得用gpt-4-vision-preview模型

@devcxl
Copy link
Owner

devcxl commented Mar 21, 2024

目前为止,开源的image2text模型依赖比较多,需要transformers torch等框架的支持,依赖比较复杂庞大,为了保持项目的精简,使用开源模型进行识图的功能后期会新开一个项目为此项目提供相应可选的image2text-api。当前阶段后续会添加gpt-vision-preview的支持。
如果你能找到一些支持识图的api我也可以接入支持

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants