• shibe5@alien.topB
    1 year ago

    With so many models available, most developers and users can evaluate only a small subset themselves, and that subset has to be chosen from performance data that is already published. At that stage, picking the models with, for example, the highest MMLU scores is one way to go about it.
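
    A rough sketch of that shortlisting step, assuming you already have benchmark scores collected somewhere. The model names and scores below are made-up placeholders (not real results), and `shortlist` is a hypothetical helper, not part of any library:

```python
# Placeholder leaderboard rows: names and scores are invented for illustration.
leaderboard = [
    {"model": "model-a", "mmlu": 0.62},
    {"model": "model-b", "mmlu": 0.55},
    {"model": "model-c", "mmlu": 0.70},
    {"model": "model-d", "mmlu": 0.48},
]


def shortlist(rows, metric="mmlu", k=2):
    """Return the k rows with the highest value of `metric`."""
    return sorted(rows, key=lambda row: row[metric], reverse=True)[:k]


# Keep only the top-scoring models for hands-on evaluation.
candidates = shortlist(leaderboard, metric="mmlu", k=2)
print([row["model"] for row in candidates])
```

    The shortlist is only a starting point: the models that survive this cut still need to be tested on your own tasks.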