Hi all,
Just curious if anybody knows what kind of hardware power is required to build a LLaMA server that can serve multiple users at once.
Any discussion is welcome :)
Hugging Face's Text Generation Inference (TGI) can handle concurrency out of the box; it batches incoming requests, so you mainly just need to back it with enough GPUs.
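As a rough sketch of what "multiple users" looks like against TGI (assuming you already have a container running locally on port 8080 serving a Llama model; the model ID and ports here are just examples), you can simulate concurrent users with plain threads and TGI's `/generate` endpoint:

```python
# Sketch: simulate several concurrent users hitting a local TGI server.
# Assumes TGI was started beforehand, e.g. with something like:
#   docker run --gpus all -p 8080:80 ghcr.io/huggingface/text-generation-inference \
#       --model-id meta-llama/Llama-2-7b-chat-hf
import concurrent.futures
import requests

TGI_URL = "http://localhost:8080/generate"  # assumed local endpoint

def ask(prompt: str) -> str:
    # TGI's /generate takes {"inputs": ..., "parameters": {...}}
    resp = requests.post(
        TGI_URL,
        json={"inputs": prompt, "parameters": {"max_new_tokens": 64}},
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json()["generated_text"]

prompts = [f"User {i}: explain GPUs in one sentence." for i in range(8)]

# Each thread plays one user; TGI batches the in-flight requests on the GPU.
with concurrent.futures.ThreadPoolExecutor(max_workers=8) as pool:
    for answer in pool.map(ask, prompts):
        print(answer)
```

How many simultaneous users this sustains comes down to GPU memory (model weights plus KV cache per request) and compute, so the honest answer to the original question is: it scales with the GPUs you put behind it.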