Longjumping-Bake-557@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

Why is no one releasing 70b models?

1

Why is no one releasing 70b models?

Longjumping-Bake-557@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

There has been a lot of movement around and below the 13b parameter bracket in the last few months but it’s wild to think the best 70b models are still llama2 based. Why is that?

We have 13b models like 8bit bartowski/Orca-2-13b-exl2 approaching or even surpassing the best 70b models now

Chat

Exotic-Estimate8355@alien.topB
link
fedilink
English
arrow-up
1·
2 years ago
$1/hour for an A100 ? Where? I can barely get one in GCE and it’s almost 4$ / hr
- toothpastespiders@alien.topB
  link
  fedilink
  English
  arrow-up
  1·
  2 years ago
  I’d like to know too if there’s one for exactly $1. Even half a buck or so difference builds up over time.
  
  But runpod’s close at least, at $1.69/hour.
- __JockY__@alien.topB
  link
  fedilink
  English
  arrow-up
  1·
  2 years ago
  Yes, but you don’t have Meta’s purchasing power to rent 10,000 GPUs for a month. Economies of scale, my friend!