Qwen-72B released (huggingface.co)
Posted by PookaMacPhellimen@alien.top to LocalLLaMA@poweruser.forum · English · 10 months ago
There’s an audio multimodal too
https://github.com/QwenLM/Qwen-Audio
I couldn't understand it. Is this true audio understanding (can it differentiate a helicopter from a fire engine, or a dog bark, for example), or does it just transcribe speech to text and feed that to the model?
It's the former: it works on the audio data itself.
So you can ask it about sentiment, have it determine whether someone is giggling, crying, or laughing, and maybe even detect a condescending or flirtatious tone, etc.
Use cases??
Maybe for audio that has both sounds and words? For example, if you want to summarize a concert or something.