- 2 Posts
- 16 Comments
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Are there people who have ran MI25s, MI60s, etc for LLMs?
1·2 years agou/ehartford
It’s source available, not open source. They don’t even allow commercial usage.
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Orca 2: Teaching Small Language Models How to ReasonEnglish
1·2 years agoImportant: researcher only, non commercial license.
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Many people here are probably getting started on machine learning, here's a blog post from Greg Brockman who recently left OpenAI with Sam Altman. I found this quite inspiring.English
1·2 years agoI think Karpatgy’s nanogpt series is a great alternative to this.
https://youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ&feature=shared
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Many people here are probably getting started on machine learning, here's a blog post from Greg Brockman who recently left OpenAI with Sam Altman. I found this quite inspiring.English
1·2 years agoOpenAI hires ml experts AND software engineers. For the first few years at OpenAI, he was much more of the second as his knowledge of ml was very minimal.
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Top 3B model is a distilled llama 7BEnglish
1·2 years agoAre your finetunes full training or lora/qlora?
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Microsoft announced the Maia 100 AI Accelerator Chip. It's also expanding the use of the AMD MI300 in it's datacenters. Is this the beginning of the end of CUDA dominance?English
1·2 years agoDo they plan on offering Mi200 or MI300 to the public?
Can’t wait to try out a GPU with more than 80gb vram
How about T4 GPU or something like 3090 from runpod? the 3090 costs around 0.5$ per hour which is around 350 dollars per month and it gives you 24 GB which should be enough for t4
Is Goliath that good? Is it that better than all of the Llama2-70B tunes that’s worth the hardware investments needed for running it?
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Introducing Jais 30B, the latest open source Arabic-English model developed by Core42 & CerebrasEnglish
1·2 years agoI wouldn’t call it “Arabic”. It doesn’t handle Arabic text well.
Amgadoz@alien.topBtoMachine Learning@academy.garden•[P] I replicated micrograd in C++ and added more functionalityEnglish
1·2 years agoOh Today I discovered there’s a programming language called zig.
Amgadoz@alien.topBtoMachine Learning@academy.garden•[D] How much does everyone make doing AI? I make $140k.English
1·2 years agoCan I dm you for career related questions?
Amgadoz@alien.topBtoMachine Learning@academy.garden•[P] I replicated micrograd in C++ and added more functionalityEnglish
1·2 years agoI really like this pattern!
Someone starts a cool ml project in python. Another person re-creates it more efficiently in c++.
Then someone else rebuilds it in rust.
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Llama 3 will be released in the first quarter of 2024?English
1·2 years agoWell, two areas where Llama can improve are:
- Multilingual capabilities
- Mixture of Experts architecture
Amgadoz@alien.topBto
LocalLLaMA@poweruser.forum•Google Brain cofounder says Big Tech companies are lying about the risks of AI wiping out humanity because they want to dominate the marketEnglish
1·2 years agoYes, every ML practitioner knows him but probably the prompt engineer aren’t aware of his massive contributions to the field.
In no specific order:
Zephyr B OpenHermes2.5 OpenChat3.5