Skip to content

Conversation

@dennemark
Copy link

Hi,
i added bit of documentation for image generation in open webui.
I have skipped adding images, since that would involve adding them to the assets repository.

But if you need:

grafik Image-Gen Native-Image-Gen Image-Gen-Chat

Copilot AI and others added 3 commits January 30, 2026 20:15
Updated the image generation section with improved formatting and clarity, including links to documentation and configuration steps.
@dennemark
Copy link
Author

Now the LLM model might not be aware of its image generation capabilities, since the integration approach in Open WebUI does not call a tool, but just first the image gen endpoint and then the text gen endpoint, i assume:

grafik

@jeremyfowers
Copy link
Contributor

Note to self: try this out with https://huggingface.co/mradermacher/PromptBridge-0.6b-Alpha-GGUF as the SD prompt model

@dennemark
Copy link
Author

dennemark commented Jan 31, 2026

Ah I just tried out the native tool calling with Qwen Next. It kind of does that prompt bridge stuff.

grafik And Qwen3-VL-30B-A3B grafik GLM 4.7 Flash (you see also how it check the available tools in thinking) grafik GPT OSS 120B grafik

I have to update the docs a bit, since for native tool calling, the image gen still needs to be toggled. Otherwise the tool wont be available to the LLM.

@jeremyfowers
Copy link
Contributor

Both approaches worked for me!

image image

Copy link
Contributor

@jeremyfowers jeremyfowers left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Instructions worked great for me! Thanks very much for the contribution @dennemark !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants