Biden Executive Order regulates VERY large models

PookaMacPhellimen@alien.top · 1 year ago

FairSum@alien.top · 1 year ago

Assuming training compute is roughly 6ND FLOPs (N = number of parameters, D = dataset size in tokens), you could take the full RedPajama dataset (30T tokens) and a 500B parameter model and it’d come out to:

6*(30*10^12)*(500*10^9) = 9*10^25
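If you want to plug in other model or dataset sizes, here is a minimal sketch of the same 6ND estimate. The function name and the `THRESHOLD` constant are just labels I made up for illustration, and 6ND is only a rule of thumb, so treat the result as a ballpark.

```python
def training_flops(n_params: float, n_tokens: float) -> float:
    """Rough training compute using the 6 * N * D rule of thumb."""
    return 6 * n_params * n_tokens

N = 500e9          # 500B parameters (figure from the comment above)
D = 30e12          # 30T tokens (figure from the comment above)
THRESHOLD = 1e26   # assumed model threshold in FLOPs, as used in this thread

total = training_flops(N, D)
print(f"estimated training compute: {total:.2e} FLOPs")
print(f"fraction of threshold: {total / THRESHOLD:.2f}")
```

With these inputs the estimate lands just under the threshold, so a slightly bigger model or a second pass over the data would push it over.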

For the cluster to qualify as well (10^20 FLOP/s), it would need to be able to train this beast, rounding the compute up to 10^26 FLOPs, in about:

10^26 / 10^20 = 10^6 seconds ≈ 11.57 days
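The same division as a small sketch, in case you want to try other cluster sizes. The `utilization` parameter is my own addition (hypothetical, not something from the order): the division above assumes the cluster sustains its full peak throughput, which real training runs generally don't.

```python
SECONDS_PER_DAY = 86_400

def days_to_train(total_flops: float,
                  cluster_flops_per_s: float,
                  utilization: float = 1.0) -> float:
    """Wall-clock days to spend total_flops on a cluster.

    utilization is the assumed fraction of peak throughput actually
    sustained (1.0 reproduces the idealized division in the comment).
    """
    effective_rate = cluster_flops_per_s * utilization
    return total_flops / effective_rate / SECONDS_PER_DAY

# Idealized case from the comment: 10^26 FLOPs at 10^20 FLOP/s.
print(f"{days_to_train(1e26, 1e20):.2f} days")

# Same cluster at an assumed 50% sustained utilization.
print(f"{days_to_train(1e26, 1e20, utilization=0.5):.2f} days")
```

Halving the utilization doubles the time, so the 11.57 days figure is best read as a lower bound for a cluster sitting exactly at 10^20 FLOP/s.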