Need help setting up a cost-efficient llama v2 inference API for my micro saas app

m1ss1l3@alien.top · 2 years ago

Need help setting up a cost-efficient llama v2 inference API for my micro saas app

noobgolang@alien.top · 2 years ago

Also, the build is done 100% in public, with the source code on the page. You can check the Actions button to see it; there is nothing hidden here.

MannowLawn@alien.top · 2 years ago

Thanks, I'll have a look. It seems very promising for my use case as well. Btw, is Nitro different from the download you have on the main page? Nitro seems to be only for Apple M1 models, while the main page mentions M2 models as well?

noobgolang@alien.top · 2 years ago

m1 models of apple and on main page it mentions m2 models as well?

Yeah, the arm64 Mac build should be able to run on all Macs with M1 and M2 chips. We also have a CUDA version in the release.

MannowLawn@alien.top · 2 years ago

Cheers! I'll keep a close watch on this, nice work!