Title says it all. Why spend so much effort finetuning and serving models locally when any closed-source model will do the same for cheaper in the long run? Is it a philosophical argument? (As in freedom vs free beer.) Or are there practical cases where a local model does better?
Where I’m coming from is the requirement of a copilot, primarily for code but maybe for automating personal tasks as well, and wondering whether to put down the $20/mo for GPT4 or roll out my own personal assistant and run it locally (I have an M2 Max, so compute wouldn’t be a huge issue).
https://github.com/oobabooga/text-generation-webui
How much ram do you have? It matters a lot.
For a BIG simplification, think of the largest model you can run (measured in billions of parameters; for example, 13B means 13 billion) as roughly 50-60% of your RAM in GB.
If you have 16 GB, you can run a 7B model, for example.
If you have 128 GB, you can run a 70B model.
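The rule of thumb above can be sketched in a few lines of Python. The 50-60% fraction and the function name are just illustrative choices, not anything standard; real limits depend on the quantization level and context size:

```python
def max_model_size_b(ram_gb: float, fraction: float = 0.55) -> float:
    """Rough estimate of the largest model (in billions of
    parameters) a machine can run, using the rule of thumb that
    model size in B is about 50-60% of RAM in GB.

    `fraction` is a hypothetical tuning knob: 0.5 is conservative,
    0.6 optimistic; actual headroom depends on quantization and
    context length.
    """
    return ram_gb * fraction

# Print the estimate for a few common Mac RAM configurations.
for ram in (16, 32, 64, 128):
    print(f"{ram} GB RAM -> roughly {max_model_size_b(ram):.0f}B parameters")
```

With the default fraction this gives about 9B for 16 GB and about 70B for 128 GB, in line with the examples above (the 16 GB case lands on 7B once you leave headroom for the OS and context).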