ggerganov to LocalLLaMA@poweruser.forum • Need help setting up a cost-efficient llama v2 inference API for my micro saas app • 1 year ago
I just wrote a post today about serving 7B models with `llama.cpp` from cheap AWS instances - might be useful:
https://github.com/ggerganov/llama.cpp/discussions/4225
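
As a rough illustration (not taken from the linked post; the host, port, and default settings below are assumptions based on llama.cpp's server example, which exposes a `/completion` endpoint), querying a running server from Python might look like:

```python
import requests  # assumes a llama.cpp server instance is reachable at this address

# Hypothetical address: the llama.cpp server example listens on port 8080 by default
LLAMA_SERVER_URL = "http://localhost:8080/completion"

def complete(prompt: str, n_predict: int = 128) -> str:
    """Send a completion request to a running llama.cpp server and return the generated text."""
    resp = requests.post(
        LLAMA_SERVER_URL,
        json={"prompt": prompt, "n_predict": n_predict},
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json()["content"]

if __name__ == "__main__":
    print(complete("Briefly explain what a 7B parameter model is:"))
```

For the instance sizing, quantization choices, and cost numbers, see the linked discussion.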