justletmefuckinggo@alien.top to LocalLLaMA@poweruser.forum • ShareGPT4V - New multi-modal model, improves on LLaVA · English · 1 year ago
I'm new here, but is this true multimodality, or is it the LLM communicating with a vision model?
And what exactly are those 4 models being benchmarked for here?