nvidia released a new 8B base model (and a few fine-tunes), albeit under a restrictive license.
https://huggingface.co/nvidia/nemotron-3-8b-base-4k
Happily, they did specify enough details about their training regimen for the model to be a useful data-point.
They also note that they trained on all the training sets for all the popular benchmarks, which…at least they’re honest about.
You must log in or register to comment.
I feel like I just inadvertently sold my soul for access to an 8b model with all that agreement clicking.
an 8b model? surely releasing larger ones is good for their own game :/