vercel/modelfusion

Add support for Ollama multimodal

suavelizard opened this issue ยท 6 comments

Draft PR for multimodal Ollama: ollama/ollama#1216

This would add Ollama as a supported provider alongside structure generation: https://github.com/lgrammel/modelfusion#generate-structure

@suavelizard thanks for letting me know. I plan to add Ollama multi-modal, once it is available in the Ollama API (similar to Llama.cpp)

Can you add more examples of using ollama with tools that make it be able to do more like for example aa ollama ts file with a search tool or to get more creative a ollama model with a stable diffusion tool to generate imags from query etc. If u get what im trying to say. Best i can explain it lol

Can you add more examples of using ollama with tools that make it be able to do more like for example aa ollama ts file with a search tool or to get more creative a ollama model with a stable diffusion tool to generate imags from query etc. If u get what im trying to say. Best i can explain it lol

Yeah an image generation tool has been on my list. There are existing tools and you can also define your own. Feel free to contribute working examples!

Multi-modal models are now available on the ollama main branch, there is also a pre-release https://github.com/jmorganca/ollama/releases/tag/v0.1.15. Would be great to have Modelfusion support it.

Basic multi-modal support was added w/ ModelFusion v0.97: https://github.com/lgrammel/modelfusion/releases/tag/v0.97.0

Thank you ๐Ÿ˜„