Consider bring VLM into RagFlow project? #1725
simoncai519
started this conversation in
Ideas
Replies: 0 comments 1 reply
-
Hi, actually VLM has already been integrated already. There are a bag of multi modal models including openai, openrouter, tongyi-qianwen, gemini, stepfun, and you can decide which one to use. These models will convert the images into text. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
In generation, VLM can answer questions based on picture stored in knowledge base, instead of parsed text from image. Has this been considered?
Beta Was this translation helpful? Give feedback.
All reactions