Qwen-72B released

PookaMacPhellimen@alien.top · 2 years ago

Qwen-72B released

PookaMacPhellimen@alien.top · 2 years ago

https://preview.redd.it/sdofti9odg3c1.jpeg?width=1792&format=pjpg&auto=webp&s=d6f56d56c3596924ea61e1e5429018c0222907d2

Amazing capabilities on some benchmarks if true.

Disastrous_Elk_6375@alien.top · 2 years ago

big if true

a_slay_nub@alien.top · 2 years ago

Bit disappointed by the coding performance but it is a general use case model. It’s insane how good gpt 3.5 is for how fast it is.

ambient_temp_xeno@alien.top · 2 years ago

Apparently the chat version has about 64 for humaneval.

Secret_Joke_2262@alien.top · 2 years ago

What do these tests mean for LLM? There are many values, and I see that in most cases qwen is better than gpt4. In others it is worse or much worse

rileyphone@alien.top · 2 years ago

All the cases it is better than GPT-4 are benchmarks involving Chinese language. OpenAI is going to have a hard time getting access to extensive Chinese language datasets so it’s not surprising a 72B model can beat GPT-4, though it’s still impressive in it’s own right.

Qwen-72B released

Qwen-72B released

Qwen/Qwen-72B · Hugging Face