So RWKV 7b v5 is 60% trained now, saw that multilingual parts are better than mistral now, and the english capabilities are close to mistral, except for hellaswag and arc, where its a little behind. all the benchmarks are on rwkv discor, and you can google the pro/cons of rwkv, though most of them are v4.
Thoughts?
I tested it. It understands Persian, but not so well, it also hallucinates people.
and Mistral doesn’t?
keep in mind that the demo is for 3B model, and the post is about 7B, which I expect to be way better