If CPU processing is painfully slow, and GPUs cost a fortune to get enough memory for larger models, I'm wondering whether an APU could deliver some of that GPU speed while using cheaper system RAM to fit the larger models in memory. With 128 GB of RAM, that's roughly the VRAM of five 24 GB 3090s/4090s, before even allowing for overhead!
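
As a rough sanity check on that claim, here's a quick back-of-envelope sketch; the parameter counts and bytes-per-weight figures are illustrative assumptions, not benchmarks:

```python
# Back-of-envelope: how much model fits in 128 GB of system RAM vs 24 GB cards.
# Parameter counts and quantization sizes below are assumptions for illustration.

def model_size_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate weight footprint in GB (ignores KV cache and runtime overhead)."""
    return params_billion * 1e9 * bytes_per_param / 1e9

ram_gb = 128        # system RAM an APU could address
gpu_vram_gb = 24    # a single 3090/4090-class card

for name, params_b in [("7B", 7), ("70B", 70), ("180B", 180)]:
    fp16 = model_size_gb(params_b, 2.0)   # 16-bit weights
    q4 = model_size_gb(params_b, 0.5)     # ~4-bit quantized weights
    print(f"{name}: fp16 ~{fp16:.0f} GB, 4-bit ~{q4:.0f} GB")

print(f"128 GB RAM ~= {ram_gb / gpu_vram_gb:.1f}x the VRAM of one 24 GB card")
```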

Does anyone have current APU benchmarks vs CPU/GPU? And can the GPU side of an APU actually be used to get a speedup over plain CPU inference?

I’ve been seeing a lot of claims that the Ryzen 8000 series is going to compete with low-end GPUs; some people think all the way up to a 3060.

If it’s possible, this might be the new cheapest way to get large models running?

  • ccbadd@alien.topB

    I didn’t think so either about the 3D V-Cache, until the article a few days ago about getting 10x the performance from a RAM drive. If it works for RAM drives, then surely we can figure out a way to use that performance for inferencing.

    • FlishFlashman@alien.topB

      It’s not going to help, because the model data is much larger than the cache and the access pattern is basically long sequential reads (see the rough numbers sketched at the end of the thread).

      • rarted_tarp@alien.topB

        It might help a little for LLMs, since the attention KV cache is reused on every decoding step, but it’s still highly unlikely to make a difference; see the sizing sketch below.
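
To put rough numbers behind the cache argument above: if decoding is memory-bandwidth-bound, every generated token streams the full weight set once, so a ~100 MB on-die cache can't hold the working set the way it can for a small RAM-drive benchmark. The bandwidth and model-size figures below are assumptions for illustration only:

```python
# Rough estimate of decode speed if inference is memory-bandwidth-bound:
# each generated token reads the full set of weights once, so
# tokens/s ~= usable bandwidth / weight size. All figures are illustrative assumptions.

MODEL_GB = 35.0     # e.g. a ~70B model quantized to ~4 bits per weight
L3_GB = 0.096       # ~96 MB total L3 on an X3D-class CPU

for label, bandwidth_gb_s in [
    ("dual-channel DDR5, ~80 GB/s", 80.0),          # typical desktop APU/CPU
    ("3090/4090-class GDDR6X, ~1000 GB/s", 1000.0),
]:
    print(f"{label}: ~{bandwidth_gb_s / MODEL_GB:.1f} tokens/s")

# The cache covers only a tiny fraction of what each token has to stream:
print(f"L3 covers ~{100 * L3_GB / MODEL_GB:.1f}% of the weights read per token")
```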
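And on the KV-cache point: what gets reused each decoding step is small relative to the weights, and it still outgrows any on-die cache after a few hundred tokens. A rough sizing sketch, with the model dimensions below being assumptions loosely matching a 7B-class transformer:

```python
# Rough KV-cache sizing for a 7B-class transformer (dimensions are assumptions).
# Per token, each layer stores one K and one V vector of size hidden_dim.

layers = 32
hidden_dim = 4096
bytes_per_value = 2      # fp16
context_tokens = 4096

kv_bytes_per_token = layers * 2 * hidden_dim * bytes_per_value
kv_total_gb = kv_bytes_per_token * context_tokens / 1e9

print(f"KV cache per token: ~{kv_bytes_per_token / 1024:.0f} KiB")
print(f"KV cache at {context_tokens} tokens: ~{kv_total_gb:.1f} GB")
# ~2 GB of KV cache at full context vs ~14 GB of fp16 weights for a 7B model,
# and a ~96 MB L3 only holds a couple hundred tokens' worth of KV entries.
```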