Life_Ask2806@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

in the context of evaluating LLMs, what do these scores technically mean?

3

1

in the context of evaluating LLMs, what do these scores technically mean?

Life_Ask2806@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 years ago

3

when we benchmark different LLMs on different datasets (MMLU, TriviaQA, MATH, HellaSwag, etc.), what are the the signification of these scores? the accuracy? another metric? how can i know the metrics of each dataset (MMLU, etc.)

https://preview.redd.it/5glmddnwsb3c1.png?width=2158&format=png&auto=webp&s=fcaf6e55d62445f3007380f06649455b29f8b2ec

Chat

shaman-warrior@alien.topB
link
fedilink
English
arrow-up
1·
2 years ago
Everything is common sense reasoning, we need better definitions