Multimodal support missing

#101
by dashesy - opened

It is critical for agent works to have image input support

Sign up or log in to comment