Fun_Tangerine_1086@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 1 year ago

Why is Mistral-7b so capable? Any ideas re: dataset?

1

Why is Mistral-7b so capable? Any ideas re: dataset?

Fun_Tangerine_1086@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 1 year ago

So Mistral-7b is a pretty impressive 7B param model … but why is it so capable? Do we have any insights into its dataset? Was it trained very far beyond the scaling limit? Any attempts at open reproductions or merges to scale up # of params?

Chat

Dorialexandre@alien.topB
link
fedilink
English
arrow-up
1·
1 year ago
My current hunch is that they use a lot of non easily accessible online ressources (including a specific archive owned by someone named Anna).
- Hulksulk666@alien.topB
  link
  fedilink
  arrow-up
  1·
  1 year ago
  oh, anna !