Cradawx@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

ShareGPT4V - New multi-modal model, improves on LLaVA

sharegpt4v.github.io

1

ShareGPT4V - New multi-modal model, improves on LLaVA

sharegpt4v.github.io

Cradawx@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

ShareGPT4V

sharegpt4v.github.io

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Chat

justletmefuckinggo@alien.topB
link
fedilink
English
arrow-up
1·
2 years ago
im new here. but is this true multimodality, or is it the llm communicating with a vision model?

and what are those 4 models being benchmark tested here for exactly?