(question) How to use image generation/vision ?

Question

(question) How to use image generation/vision ?

Closed this issue 23 days ago · 6 comments

Hi there !

I want to start playing with image generation/vision but I couldn't find any documentation. I found this which helps a bit : https://github.com/quarkiverse/quarkus-langchain4j/blob/main/integration-tests/openai/src/main/java/org/acme/example/openai/images/ImageResource.java

I tried to find also how it is implemented for the devUI but got a bit lost in the codebase 😅

Can I use @RegisterAIService for an image generation endpoint ?

Answer 1 · 2024-06-19T10:00:02.000Z

AI services aren't supported for image content right now AFAIK.

There are two types of models that can work with images:

ImageModel is purely text-to-image - for this, we support OpenAI's DALL-E models, example at https://github.com/quarkiverse/quarkus-langchain4j/blob/0.15.1/integration-tests/openai/src/main/java/org/acme/example/openai/images/ImageResource.java#L33
some ChatModels support passing images into a ChatMessage in the form of ImageContent, as is shown in https://github.com/quarkiverse/quarkus-langchain4j/blob/0.15.1/integration-tests/openai/src/main/java/org/acme/example/openai/images/ImageResource.java#L40

In the Dev UI, we have this 'Image' page that appears if you have the OpenAI extension and uses an ImageModel underneath.
The Chat page in the Dev UI doesn't yet support images, but I'm working on it.

Answer 2 · 2024-06-19T10:02:39.000Z

@jmartisk Thanks a lot for the detailed (and super fast) reply !

Answer 3 · 2024-06-19T10:10:31.000Z

@jmartisk And could you point me where the devUI image page is implemented, I'm interested how you pass the different options like resolution, model etc ?

Answer 4 · 2024-06-19T10:17:40.000Z

You need an application that uses OpenAI (for example, the samples/chatbot app from this repo will do) and make sure that you look under the 'OpenAI' card, not the general one, because right now the support is specific to OpenAI. The URL will be like http://localhost:8080/q/dev-ui/io.quarkiverse.langchain4j.quarkus-langchain4j-openai/images

Answer 5 · 2024-06-19T10:18:39.000Z

And the backing bean that performs the actual calls to the model is at https://github.com/quarkiverse/quarkus-langchain4j/blob/main/model-providers/openai/openai-vanilla/runtime/src/main/java/io/quarkiverse/langchain4j/openai/runtime/devui/OpenAiImagesJsonRPCService.java

Answer 6 · 2024-06-19T10:22:20.000Z

And the backing bean that performs the actual calls to the model is at https://github.com/quarkiverse/quarkus-langchain4j/blob/main/model-providers/openai/openai-vanilla/runtime/src/main/java/io/quarkiverse/langchain4j/openai/runtime/devui/OpenAiImagesJsonRPCService.java

That was exactly what I was looking for ! Thx !