Add support for Ollama multimodal

Question

Add support for Ollama multimodal

suavelizard opened this issue a year ago · 6 comments

Draft PR for multimodal Ollama: ollama/ollama#1216

This would add Ollama as a supported provider alongside structure generation: https://github.com/lgrammel/modelfusion#generate-structure

Answer 1 · 2023-11-27T09:49:39.000Z

@suavelizard thanks for letting me know. I plan to add Ollama multi-modal, once it is available in the Ollama API (similar to Llama.cpp)

Answer 2 · 2023-11-29T01:13:46.000Z

Can you add more examples of using ollama with tools that make it be able to do more like for example aa ollama ts file with a search tool or to get more creative a ollama model with a stable diffusion tool to generate imags from query etc. If u get what im trying to say. Best i can explain it lol

Answer 3 · 2023-12-02T10:02:55.000Z

Can you add more examples of using ollama with tools that make it be able to do more like for example aa ollama ts file with a search tool or to get more creative a ollama model with a stable diffusion tool to generate imags from query etc. If u get what im trying to say. Best i can explain it lol

Yeah an image generation tool has been on my list. There are existing tools and you can also define your own. Feel free to contribute working examples!

Answer 4 · 2023-12-13T16:31:31.000Z

Multi-modal models are now available on the ollama main branch, there is also a pre-release https://github.com/jmorganca/ollama/releases/tag/v0.1.15. Would be great to have Modelfusion support it.

Answer 5 · 2023-12-14T18:41:12.000Z

Basic multi-modal support was added w/ ModelFusion v0.97: https://github.com/lgrammel/modelfusion/releases/tag/v0.97.0

Answer 6 · 2023-12-14T22:03:39.000Z

Thank you 😄