- 2 Posts
- 16 Comments
It’s source available, not open source. They don’t even allow commercial usage.
Amgadoz@alien.top to LocalLLaMA@poweruser.forum • Orca 2: Teaching Small Language Models How to Reason • English • 2 years ago
Important: research-only, non-commercial license.
Amgadoz@alien.top to LocalLLaMA@poweruser.forum • Many people here are probably getting started on machine learning, here's a blog post from Greg Brockman who recently left OpenAI with Sam Altman. I found this quite inspiring. • English • 2 years ago
I think Karpathy’s nanoGPT series is a great alternative to this.
https://youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ&feature=shared
Amgadoz@alien.top to LocalLLaMA@poweruser.forum • Many people here are probably getting started on machine learning, here's a blog post from Greg Brockman who recently left OpenAI with Sam Altman. I found this quite inspiring. • English • 2 years ago
OpenAI hires ML experts AND software engineers. For his first few years at OpenAI, he was much more of the latter, as his knowledge of ML was very minimal.
Are your finetunes full training or LoRA/QLoRA?
Amgadoz@alien.top to LocalLLaMA@poweruser.forum • Microsoft announced the Maia 100 AI Accelerator Chip. It's also expanding the use of the AMD MI300 in it's datacenters. Is this the beginning of the end of CUDA dominance? • English • 2 years ago
Do they plan on offering the MI200 or MI300 to the public?
Can’t wait to try out a GPU with more than 80 GB of VRAM.
How about a T4 GPU, or something like a 3090 from RunPod? The 3090 costs around $0.50 per hour, which is around 350 dollars per month, and it gives you 24 GB, which should be enough for T4.
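For anyone who wants to check the monthly figure, here is a minimal sketch of the arithmetic. It assumes the instance runs around the clock for a 30-day month at a flat hourly rate; the function name `monthly_cost` and the 30-day default are illustrative assumptions, not anything from RunPod's pricing.

```python
HOURS_PER_DAY = 24


def monthly_cost(hourly_rate_usd: float, days: int = 30) -> float:
    """Cost of keeping one instance running 24 hours a day for `days` days."""
    return hourly_rate_usd * HOURS_PER_DAY * days


# 0.5 USD/hour, running non-stop for an assumed 30-day month
cost = monthly_cost(0.5)
print(f"${cost:.0f} per month")
```

At $0.50 per hour this comes to $360 for a 30-day month, in line with the rough 350 dollars quoted above. Running the instance only part of the day lowers the bill proportionally.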
Is Goliath that good? Is it so much better than all of the Llama2-70B tunes that it’s worth the hardware investment needed to run it?
Amgadoz@alien.top to LocalLLaMA@poweruser.forum • Introducing Jais 30B, the latest open source Arabic-English model developed by Core42 & Cerebras • English • 2 years ago
I wouldn’t call it “Arabic”. It doesn’t handle Arabic text well.
Amgadoz@alien.top to Machine Learning@academy.garden • [P] I replicated micrograd in C++ and added more functionality • English • 2 years ago
Oh, today I discovered there’s a programming language called Zig.
Amgadoz@alien.top to Machine Learning@academy.garden • [D] How much does everyone make doing AI? I make $140k. • English • 2 years ago
Can I DM you with career-related questions?
Amgadoz@alien.top to Machine Learning@academy.garden • [P] I replicated micrograd in C++ and added more functionality • English • 2 years ago
I really like this pattern!
Someone starts a cool ML project in Python. Another person re-creates it more efficiently in C++.
Then someone else rebuilds it in Rust.
Amgadoz@alien.top to LocalLLaMA@poweruser.forum • Llama 3 will be released in the first quarter of 2024? • English • 2 years ago
Well, two areas where Llama can improve are:
- Multilingual capabilities
- Mixture of Experts architecture
Amgadoz@alien.top to LocalLLaMA@poweruser.forum • Google Brain cofounder says Big Tech companies are lying about the risks of AI wiping out humanity because they want to dominate the market • English • 2 years ago
Yes, every ML practitioner knows him, but the prompt engineers probably aren’t aware of his massive contributions to the field.
In no specific order:
- Zephyr B
- OpenHermes2.5
- OpenChat3.5