Dealing with VLM

Hello,

When dealing with a VLM, the content of the prompt / chat template is not pure text but contains PIL images or base64-encoded images. The prompt formatter that feeds the reflective LLM is therefore limited, since it converts its output to text. Is there an easy way to override this function? I guess it would be even better to adapt this function so that it outputs a chat-template format (OpenAI format) that is then fed to the reflective LLM, instead of a pure string.
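To make the request concrete, here is a minimal sketch of what I have in mind. It is not the library's API: `build_reflection_messages` and `to_image_url` are hypothetical names, and the `(kind, value)` input format is my own assumption. It only relies on the OpenAI-style message layout, where `content` is a list of `text` and `image_url` parts. Images given as raw bytes or bare base64 strings are assumed to be PNG.

```python
import base64
import io
from typing import Any


def to_image_url(image: Any) -> str:
    """Turn an image into a data URL (hypothetical helper).

    Accepts raw bytes, a base64 string, an existing data URL, or any
    object with a PIL-style ``save(fp, format=...)`` method.
    Assumption: bytes and bare base64 strings hold PNG data.
    """
    if isinstance(image, str):
        if image.startswith("data:"):
            return image  # already a data URL, pass through unchanged
        payload = image  # assumed to be base64-encoded already
    elif isinstance(image, bytes):
        payload = base64.b64encode(image).decode("ascii")
    elif hasattr(image, "save"):
        # Duck-typed PIL image: serialize to PNG in memory.
        buffer = io.BytesIO()
        image.save(buffer, format="PNG")
        payload = base64.b64encode(buffer.getvalue()).decode("ascii")
    else:
        raise TypeError(f"unsupported image type: {type(image).__name__}")
    return f"data:image/png;base64,{payload}"


def build_reflection_messages(parts, role: str = "user") -> list:
    """Build one OpenAI-style chat message from mixed text/image parts.

    ``parts`` is a list of ("text", str) or ("image", image) tuples;
    this input format is an assumption made for the sketch.
    """
    content = []
    for kind, value in parts:
        if kind == "text":
            content.append({"type": "text", "text": value})
        elif kind == "image":
            content.append(
                {"type": "image_url", "image_url": {"url": to_image_url(value)}}
            )
        else:
            raise ValueError(f"unknown part kind: {kind!r}")
    return [{"role": role, "content": content}]
```

The returned list could then be passed as `messages` to any OpenAI-compatible chat endpoint, so the images reach the reflective LLM instead of being flattened into a string.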

Dealing with VLM #121
