Following the release of Dimensity 9300 and S8G3 phones, I am expecting growth in popularity of LLMs running on mobile phones, as quantized 3B or 7B models can already run on high-end phones from five years ago or later. But despite it being possible, there are a few concerns, including power consumption and storage size. I’ve seen posts about successfully running LLMs on mobile devices, but seldom see people discussing about future trends. What are your thoughts?

  • sshan@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    Hard to make a broad use case here until power consumption drops. Best approach is still push to cloud.

    Edge cases like robotics / cars / high availbility likely exist though and could be big niche.