Following the release of Dimensity 9300 and Snapdragon 8 Gen 3 (S8G3) phones, I expect LLMs running on mobile phones to grow in popularity, as quantized 3B or 7B models can already run on high-end phones released within the last five years. But even though it is possible, there are a few concerns, including power consumption and storage size. I’ve seen posts about successfully running LLMs on mobile devices, but I seldom see people discussing future trends. What are your thoughts?
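For a rough sense of the storage concern, here is a back-of-envelope sketch. It assumes the weights dominate the file size and ignores file metadata, mixed-precision layers, and the runtime KV cache, so real quantized files will differ somewhat. The function name `model_size_gb` and the bit widths are my own choices for illustration.

```python
def model_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (10^9 bytes).

    Assumption: every parameter is stored at the same bit width;
    metadata and runtime memory (KV cache, activations) are ignored.
    """
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


if __name__ == "__main__":
    # Compare the two model sizes mentioned above at common bit widths.
    for params in (3, 7):
        for bits in (16, 8, 4):
            size = model_size_gb(params, bits)
            print(f"{params}B @ {bits}-bit: ~{size:.1f} GB")
```

By this estimate a 7B model at 4 bits per weight needs about 3.5 GB for the weights alone, which is workable on a phone's storage but a large share of its RAM.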
Theoretically doable, practically unlikely. Battery life will take a significant hit, and the 3B/7B models don’t provide THAT much benefit to justify that hit.
It is something to consider in the future, though. Like, 5 years from now we will probably have SoCs that are efficient enough to do it live.