• justletmefuckinggo@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    10 months ago

    im new here. but is this true multimodality, or is it the llm communicating with a vision model?

    and what are those 4 models being benchmark tested here for exactly?