Why didn't gpt4 work at first and how did they "fix it"?

Amgadoz@alien.top · 2 years ago

In no specific order:

Zephyr B OpenHermes2.5 OpenChat3.5

Amgadoz@alien.top · 2 years ago

Why didn't gpt4 work at first and how did they "fix it"?

Amgadoz@alien.top · 2 years ago

u/ehartford

Amgadoz@alien.top · 2 years ago

It’s source available, not open source. They don’t even allow commercial usage.

Amgadoz@alien.top · 2 years ago

Important: researcher only, non commercial license.

Amgadoz@alien.top · 2 years ago

I think Karpatgy’s nanogpt series is a great alternative to this.

https://youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ&feature=shared

Amgadoz@alien.top · 2 years ago

OpenAI hires ml experts AND software engineers. For the first few years at OpenAI, he was much more of the second as his knowledge of ml was very minimal.

Amgadoz@alien.top · 2 years ago

Are your finetunes full training or lora/qlora?

Amgadoz@alien.top · 2 years ago

Do they plan on offering Mi200 or MI300 to the public?

Can’t wait to try out a GPU with more than 80gb vram

Amgadoz@alien.top · 2 years ago

How about T4 GPU or something like 3090 from runpod? the 3090 costs around 0.5$ per hour which is around 350 dollars per month and it gives you 24 GB which should be enough for t4

Amgadoz@alien.top · 2 years ago

Is Goliath that good? Is it that better than all of the Llama2-70B tunes that’s worth the hardware investments needed for running it?

Amgadoz@alien.top · 2 years ago

dolphin-2.2-yi-34b released

Amgadoz@alien.top · 2 years ago

I wouldn’t call it “Arabic”. It doesn’t handle Arabic text well.

Amgadoz@alien.top · 2 years ago

Oh Today I discovered there’s a programming language called zig.

Amgadoz@alien.top · 2 years ago

Can I dm you for career related questions?

Amgadoz@alien.top · 2 years ago

I really like this pattern!

Someone starts a cool ml project in python. Another person re-creates it more efficiently in c++.

Then someone else rebuilds it in rust.

Amgadoz@alien.top · 2 years ago

Well, two areas where Llama can improve are:

Multilingual capabilities
Mixture of Experts architecture

Amgadoz@alien.top · 2 years ago

Yes, every ML practitioner knows him but probably the prompt engineer aren’t aware of his massive contributions to the field.