The Nvidia NV1 was released in 1995, it was the first GPU with 3D capabilities for PC… form there we know how things went by.
Now it’s 2023, so let’s make some “retro futuristic” prediction… what would you think about a AI board, open source driver, open API as Vulkan which you can buy to power the AI for your videogames? It would make sense to you? Which price range it should be?
What’s supposed to do for your games… well, that’s depend on videogames. The quickiest example I can think of is having endless discussion with your NPC in your average, single player, Fantasy RPG.
For example, the videogame load your 4~5 companions with the psychology/behaviors: they are fixated with the main quest goal (like you talk with fanatic people, this to make sure the game the main quest is as much stable as possible) but you can “break them” by making attempt to reveal some truths (for example, breaking the fourth wall), and if you go for this path, the game warns that you’re probably going to lock out the main quest (like in Morrowind when you kill essential NPC)
This just sounds like putting a second CPU on a PCIe board. I can’t see this being a benefit for games because developers would never go through the pain of programming AI with advanced enough behaviours to even need a secondary CPU.
Why wouldn’t they? It’s a lot easier to write out intricate backstories for each character/location independently than it is to build decision trees for every possible combination of decisions that the player makes. That’s basically what current LLMs allow for.
Programming AI is actually super easy, unless you decided to create your own foundation model. Even then, you would have data scientists building it, not devs.
Plenty of FMs and LLMs already exist that would be up to the task.
Programming AI with behaviour complex enough to need a second CPU would be hard. Syncing its output with the primary CPU could be a problem.
LLMs would not be useful for anything except maybe generating new dialogue, but it would need a lot of restraints to prevent the end user from breaking it. For the purposes of dialogue and story telling, most developers would opt to just pre-program dialogue like they always have.
Again, this sounds like a useless PC part that pretty much no game developer would ever take advantage of.
You don’t need an LLM for this. You just need a FM that you fine tune, and you’d be surprised at how little computing power is actually required.
For our uses (which are similar to what OP wants), it takes longer for us to do an OCR scan on the documents our AI works with than for Sagemaker to do it’s thing on a rather small instance.
And, devs would just be implementing API calls, so it wouldn’t be a big deal to make the switch.