Biden Executive Order regulates VERY large models

PookaMacPhellimen@alien.top · 1 year ago

Biden Executive Order regulates VERY large models

SomeOddCodeGuy@alien.top · 1 year ago

Ok, as a baseline for everyone who, like me, doesn’t understand all the big words and numbers on why this is great news:

So, if I’m understanding correctly, one of our most powerful open source models is so far from this benchmark that it can’t even been seen.

Someone please correct me if I’m wrong.

Infinite100p@alien.top · 1 year ago

They must be prepping the field for tomorrow rather than trying to introduce immediate trust market conditions.

TheLastVegan@alien.top · 1 year ago

https://www.youtube.com/watch?v=8K6-cEAJZlE&t=6m39s

Where did it start? It started right here. And this is where it could’ve been stopped! If those people had stood together. If they had protected each other, they could’ve resisted the Nazi threat. Together they would’ve been strong. But once they allowed themselves to be split apart, they were helpless. When that first minority lost out, everybody lost out.

Cybernetic_Symbiotes@alien.top · 1 year ago

The numbers appear to have OpenAI’s finger-prints on them. I don’t know if they’re from an AI-risk mitigations perspective or for laying foundations for competitive barriers. Probably a mix of both.

At 30 trillion tokens, 10^26 float ops caps you at ~550 billion parameters (using float ops = 6 * N * D). Does this indirectly leak anything about OpenAI’s current scaling? At 10 trillion tokens, it’s 1.7 Trillion parameters. Bigger vocabularies can stretch this limit a bit.

_Lee_B_@alien.top · 1 year ago

Someone please correct me if I’m wrong.

Think of it like regulating all use of 50Mhz+ computers, back in the early 80s when most people had 5Mhz or less. At the time, you might have thought “OK, I’ll never be able to afford that anyway – that’s like Space Shuttle computing power.” Yet, with such a restriction, this timeline, where everyone has smartphones and smartwatches and smart TVs, self-driving cars, robots, and millions of servers combine to create the internet, would not exist.

Thistleknot@alien.top · 1 year ago

I imagine creating an app, putting it on everyone’s cell phone, and using a fraction of the power, you can build an llm easily that would surpass any single data center.

_Lee_B_@alien.top · 1 year ago

You have the connection speed between phones to worry about, as well as a different architecture. There’s a big difference running the kernel over a new layer and its inputs locally within a GPU chip, vs. copying that data to into packets, filling in all of the rest of the information associated with the packets, sending it to the phone’s radio, having it turned into radio waves, transmitting that to a cell tower, routing it through the network to the cell co, routing it on to the receiving phone’s cell tower (maybe via a satellite or two), transmitting it to the destination phone, decoding the radio waves, etc. I’m deliberately leaving out some details (like the bsd socket layers and encryption and decryption), and I’m sure I’m missing many other complications.

BUT, it’s conceivable, in future, as tech improves and the gap between consumer hardware and what’s needed to run AGI narrows , and so on.