New Microsoft codediffusion paper suggests GPT-3.5 Turbo is only 20B, good news for open source models?

obvithrowaway34434@alien.top · 2 years ago

New Microsoft codediffusion paper suggests GPT-3.5 Turbo is only 20B, good news for open source models?

BalorNG@alien.top · 2 years ago

Given how good 7b Mistral is in my personal experience, it seems that a model 3x its size can BE GPT3.5 Turbo is no longer implausible.

artelligence_consult@alien.top · 2 years ago

It is given the age - if you would build it today, with what research has shown now - yes, but GPT 3.5 predates that, It would indicate a brutal knowledge advantage of OpenAi compared to published knowledge.

ironic_cat555@alien.top · 2 years ago

GPT 3.5 turbo was released on March 1 2023, for what it’s worth. Which makes it not a very old model.

artelligence_consult@alien.top · 2 years ago

Only if you assume that 3.5 TURBO is not a TURBO version of GPT 3.5 THAT would make the RELEASE in March 2022, likely with 6 months or more of training and tuning. So, you say that when they did the turbo version, they started fresh, with new training data and an approach based on the MS ORCA papers which were released in June, and still did not change the version number?

Let me say your assumption bare a thread of logic.

ironic_cat555@alien.top · 2 years ago

Oh it’s a TURBO version you say? Is that a technical term? I never said whatever you seem to think I said.

Fun_Analyst_1234@alien.top · 2 years ago

I think so too. I really hope those guys are funded to improve the model. Serious talent in that team.