New Microsoft codediffusion paper suggests GPT-3.5 Turbo is only 20B, good news for open source models?

obvithrowaway34434@alien.top · 1 year ago

New Microsoft codediffusion paper suggests GPT-3.5 Turbo is only 20B, good news for open source models?

Combinatorilliance@alien.top · 1 year ago

I think it’s plausible. GPT-3.5 isn’t ultra smart. It’s very good most of the time, but it has clear limitations.

Seeing what Mistral achieved with 7B, I’m sure we can get something similar to GPT-3.5 in 20B given state-of-the-art training and data. I’m sure OpenAI is also using some tricks that haven’t been released to the public.