https://huggingface.co/deepnight-research
I’m not affiliated with this group at all, I was just randomly looking for any new big merges and found these.
100B model: https://huggingface.co/deepnight-research/saily_100B
220B model: https://huggingface.co/deepnight-research/Saily_220B
600B model: https://huggingface.co/deepnight-research/ai1
They have some big claims about the capabilities of their models, but the two best ones are unavailable to download. Maybe we can help convince them to release them publicly?
so it sounds like for the 600b they just finetuned llama2 again with the same stuff Llama2 was trained with, just more of it…
RefinedWeb
Opensource code from GitHub