Is the Open LLM Leaderboard a reliable source? yi:34B is at the top, but I get better results with the neural-chat:7B model

grigio@alien.top · 2 years ago

Is the Open LLM Leaderboard a reliable source? yi:34B is at the top, but I get better results with the neural-chat:7B model.

USM-Valor@alien.top · 2 years ago

I’ve had the same experience with the Yi finetunes. I tried them on single-turn generations and they were very promising. However, starting with one from scratch, I ran into a ton of repetition and looping. Some models need a very tight set of sampling parameters to perform well, whereas others will function well under almost any sane settings. I’m thinking Yi leans towards the former, which will have users thinking it is inferior to simpler but more flexible models.
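
For anyone wondering what "parameters" means here, this is a rough sketch of two common sampling knobs: a repetition penalty and temperature. The function names and the values (1.3, 0.7) are made up for illustration, not recommended Yi settings, and real backends may implement the penalty a bit differently.

```python
import math

def apply_repetition_penalty(logits, previous_tokens, penalty=1.1):
    """Make token ids that were already generated less likely.

    Hypothetical helper: positive logits are divided by the penalty,
    negative ones multiplied, so both move towards "less likely".
    """
    adjusted = list(logits)
    for token_id in set(previous_tokens):
        if adjusted[token_id] > 0:
            adjusted[token_id] /= penalty
        else:
            adjusted[token_id] *= penalty
    return adjusted

def softmax_with_temperature(logits, temperature=0.7):
    """Turn logits into probabilities; lower temperature sharpens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Toy vocabulary of three tokens; tokens 0 and 2 were already generated.
logits = [2.0, 1.0, -1.0]
history = [0, 0, 2]

baseline = softmax_with_temperature(logits)
penalized = softmax_with_temperature(
    apply_repetition_penalty(logits, history, penalty=1.3)
)
```

With the penalty applied, the already-used token 0 loses probability and the unused token 1 gains some. A model that only behaves inside a narrow range of such values will loop or repeat when a frontend's defaults fall outside it.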