Vectara’s Hallucination Evaluation Model and leaderboard were launched last week. I noticed that Mistral has a hallucination rate of 9.4%, compared to 5.6% for Llama2. Any thoughts?
Source: https://github.com/vectara/hallucination-leaderboard
How is it possible that Llama2 13B and 7B have a lower hallucination rate than Claude?