vatsadev@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 1 year ago

Why not test all models for training on the test data with Min-K% Prob?

So there's Detect Pretrain Data (https://swj0419.github.io/detect-pretrain.github.io/), which lets you test whether a model has been pretrained on a given text. Why don't we just run it on every model submitted to the leaderboard, using the benchmark test sets as the text, and reject the ones it flags? It would end the “train on test” issue.
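For anyone who hasn't looked at it, here is a rough sketch of the Min-K% Prob score as I understand it: take the model's log-probability for each token of the text, keep the k% of tokens with the lowest values, and average them. A higher average suggests the text may have been seen in training. The function names, the default k, and the threshold below are my own placeholders, not values from the linked project, and getting the per-token log-probs out of an actual model is left out.

```python
def min_k_prob(token_logprobs, k=0.2):
    """Average log-prob of the k fraction of lowest-probability tokens.

    token_logprobs: per-token log-probabilities the model assigned to the text.
    k: fraction of tokens to keep (0.2 is an assumed default, not a tuned value).
    """
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    # Keep at least one token even for very short texts.
    n = max(1, int(len(token_logprobs) * k))
    lowest = sorted(token_logprobs)[:n]
    return sum(lowest) / n


def looks_seen(token_logprobs, k=0.2, threshold=-4.0):
    """Flag a text as possibly seen in training.

    The threshold is hypothetical; in practice it would have to be
    calibrated per model on texts known to be seen and unseen.
    """
    return min_k_prob(token_logprobs, k) > threshold


# Toy example with made-up log-probs: two very unlikely tokens drag the score down.
example = [-0.1, -0.2, -5.0, -0.3, -7.0, -0.1, -0.4, -0.2, -0.3, -0.1]
print(min_k_prob(example))
print(looks_seen(example))
```

A leaderboard check would run something like this over each benchmark test example and look at how many get flagged, rather than trusting any single score.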

mcmoose1900@alien.topB · 1 year ago

Ask on the huggingface leaderboard page!

The HF staff do seem to read it, and they have an interest in weeding out “contaminated” models (they have already flagged a few).