What with the ongoing turmoil at OAI, has anyone found an alternative to their vision
endpoint that offers comparable functionality? I am aware of LLaVA, which still seems immature, but are there any commercial offerings?
https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/neva-22b
https://replicate.com/joehoover/instructblip-vicuna13b/api
Here are a couple that haven’t been mentioned; they’re quite a lot weaker than GPT-4V though, as is to be expected from small models.