Look at this: apart from Llama 1, all the other "base" models will likely answer "language" after "As an AI". That suggests Meta, Mistral AI, and 01-ai (the company that made Yi) trained their "base" models on GPT instruct datasets to inflate benchmark scores and make it look like the "base" models had a lot of potential. We got duped hard on that one.
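You can run this probe yourself with a few lines. Here's a minimal sketch using Hugging Face `transformers`: feed a base model the prompt "As an AI" and inspect its top next-token candidates. The model name is just an example, swap in whichever base checkpoint you want to test. A raw pretraining-only model has no strong reason to rank " language" first; a model that saw GPT-style instruct data usually will.

```python
# Minimal contamination probe (assumes `transformers`, `torch`, `accelerate` installed)
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "mistralai/Mistral-7B-v0.1"  # example checkpoint; any HF base model works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.float16, device_map="auto"
)

inputs = tokenizer("As an AI", return_tensors="pt").to(model.device)
with torch.no_grad():
    next_token_logits = model(**inputs).logits[0, -1]  # logits for the next token

top5 = torch.topk(next_token_logits, 5).indices
print([tokenizer.decode(t) for t in top5])  # does " language" top the list?
```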
So it turns out you just need to train on GPT output to get better benchmarks, lol. Not to mention there's a chance GPT models are contaminated with benchmark test data themselves. "Distillation" went a little too far. Easy VC money, though; I would do the same.