• cegras@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    11 months ago

    What is the size of ChatGPT or the biggest LLMs compared to the dataset? (Not being rhetorical, genuinely curious)

    • StartledWatermelon@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      11 months ago

      GPT-4: 1.76 trillion parameters, about 6.5* trillion tokens in the dataset.

      • could be twice that, the leaks weren’t crystal clear. The above number is more likely though.