Multimodal inputs

Hi all,
I want to know how do I provide both image and text as inputs to a single model.It would be great if I could get any help.Thanks in advance.

1 Like