Has Anyone Successfully Utilized the Neural Networks API on Android for LLMS with EdgeTPU?

dewijones92@alien.top · 1 year ago

Has Anyone Successfully Utilized the Neural Networks API on Android for LLMS with EdgeTPU?

phree_radical@alien.top · 1 year ago

I dipped my toes in while comparing different methods of running Whisper on Android, and learned that they don’t intend developers to use NNAPI directly, but instead use a solution like TensorFlow Lite or PyTorch Mobile, which detects support and implements delegates which it may decide to use depending on the most efficient scenario. A developer needs to convert/“optimize” a model so that it doesn’t use any unsupported operations, but there’s also size considerations, like the TPU and other areas probably don’t have that much memory just yet