starkiller1298@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 1 year ago

Rocket 🦝 - smol model that overcomes models much larger in size

1

Rocket 🦝 - smol model that overcomes models much larger in size

starkiller1298@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 1 year ago

We’re proud to introduce Rocket-3B 🦝, a state-of-the-art 3 billion parameter model!

🌌 Size vs. Performance: Rocket-3B may be smaller with its 3 billion parameters, but it punches way above its weight. In head-to-head benchmarks like MT-Bench and AlpacaEval, it consistently outperforms models up to 20 times larger.

https://preview.redd.it/fxmz9sl1ls1c1.png?width=1273&format=png&auto=webp&s=63c3838cf4f01f7efcad9ec92b97c1e493111842

🔍 Benchmark Breakdown: In MT-Bench, Rocket-3B achieved an average score of 6.56, excelling in various conversation scenarios. In AlpacaEval, it notched a near 80% win rate, showcasing its ability to produce detailed and relevant responses.

https://preview.redd.it/rpgaknn3ls1c1.png?width=1280&format=png&auto=webp&s=6d2d7543f1459ceae7f96ad05ea064e8f8076517

🛠️ Training: The model is fine-tuned from Stability AI’s StableLM-3B-4e1t, employing Direct Preference Optimization (DPO) for enhanced performance.

📚 Training Data: We’ve amalgamated multiple public datasets to ensure a comprehensive and diverse training base. This approach equips Rocket-3B with a wide-ranging understanding and response capability.

👩‍💻 Chat format: Rocket-3B follows the ChatML format.

For an in-depth look at Rocket-3B, visit Rocket-3B’s HugginFace page

Chat

uti24@alien.topB
link
fedilink
English
arrow-up
1·
1 year ago
Tried gguf format of this model from huggingface and they just wont load.
- 3m84rk@alien.topB
  link
  fedilink
  English
  arrow-up
  1·
  1 year ago
  I tried both GGUF models currently on HF. Same result.
  
  Curious to try this out when it’s working!
- those2badguys@alien.topB
  link
  fedilink
  English
  arrow-up
  1·
  1 year ago
  Same, even the model from the bloke that was released hours ago wouldn’t work :-(
- brobruh211@alien.topB
  link
  fedilink
  English
  arrow-up
  1·
  1 year ago
  The latest version of KoboldCpp v1.50.1 now loads this model properly.