tl;dr: I’m considering building a budget machine for tinkering with LLMs, but I’m not sure whether it’s a good idea or how to go about it.

For context: I work in a university department. I currently have access to a 2080 Ti on a shared machine, and we’re in the process of acquiring a small server with 2 L40 cards. So for any larger experiments, I will be able to use this shared machine.

However, I think I would like to have my own small machine for tinkering: trying different models and techniques, playing around, and preparing larger experiments to be run on the server. My focus is on teaching and education, not on state-of-the-art research.

Aiming for a good amount of VRAM, the 4060 Ti 16 GB seems like the most obvious choice; I also like its low power requirements (for both energy and cooling). But the card seems to have a poor reputation overall. I’m also not sure where the current sweet spot for CPU and memory lies – I’ve completely lost track of Intel’s and AMD’s generations over the last few years.

Some additional comments regarding common opinions:

  • I simply like having my own hardware, and cloud services seem to be more expensive in the long run.
  • There isn’t really a good market for used GPUs where I’m located (Singapore), so the common suggestion to go with a used 3090 doesn’t really work.

Any good suggestions, or am I being naive with my idea of a budget machine? Thanks a lot!

  • sshan@alien.topB · 11 months ago

    Mistral 7B is very good and can be run on 8 GB of VRAM. It was blazing fast on my 3070. I have a 4090 as well, and for all intents and purposes it’s indistinguishable.

    Right now Mistral 7B competes with the best 13B-parameter models. Unless you plan on using code LLMs, there aren’t many new 30B-parameter models that matter that much.

    I have a 3070 on my Proxmox home server with, I think, only 2 physical cores and 16 GB of RAM allocated, and I’m getting 40+ tokens per second.

    You wouldn’t be future-proofed, but it would work fine now; see the sketch below for roughly what that setup looks like.
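
    A minimal sketch of what that can look like in practice, assuming the Hugging Face mistralai/Mistral-7B-Instruct-v0.1 checkpoint and 4-bit quantization via bitsandbytes so the weights fit comfortably in 8 GB of VRAM (the model ID and generation settings are illustrative, not the commenter’s exact setup):

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    model_id = "mistralai/Mistral-7B-Instruct-v0.1"  # assumed checkpoint

    # 4-bit NF4 quantization keeps the weights around 4 GB on the GPU,
    # leaving headroom for the KV cache on an 8 GB card.
    bnb_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.float16,
    )

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=bnb_config,
        device_map="auto",  # place layers on the GPU automatically
    )

    prompt = "Explain the difference between VRAM and system RAM in one paragraph."
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=200)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))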

  • Herr_Drosselmeyer@alien.topB · 11 months ago

    The 4060 Ti 16 GB has a bad reputation because it doesn’t provide any real improvement for gaming over the regular 4060, but that’s of no concern to us.

  • ttkciar@alien.topB · 11 months ago

    You can absolutely do interesting and useful things with very little hardware using quantized models, especially if you don’t mind inference being slow. My preferred quantization is Q4_K_M (with GGUF and llama.cpp).

    I started with a spare Lenovo ThinkPad T560 with 8 GB of RAM, which handled 7B models no problem. That was a $120 eBay purchase. Once I was hooked, I shifted to one of the Dell T7910s in the homelab and moved up to larger models.

    I’m still not using a GPU for anything. It’s all been CPU inference, which is slow but otherwise great; a sketch of that setup follows below.
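
    A minimal sketch of that CPU-only setup, using the llama-cpp-python bindings for llama.cpp with a Q4_K_M GGUF file (the file path, model choice, and thread count are placeholders – adapt them to whatever GGUF you download and however many cores you have):

    from llama_cpp import Llama

    llm = Llama(
        model_path="./models/mistral-7b-instruct.Q4_K_M.gguf",  # placeholder path
        n_ctx=4096,      # context window
        n_threads=8,     # CPU threads; tune to your physical core count
        n_gpu_layers=0,  # 0 = pure CPU inference, no GPU required
    )

    out = llm(
        "Summarize in two sentences why quantization reduces memory use.",
        max_tokens=200,
    )
    print(out["choices"][0]["text"])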

    You could get just about any $300 desktop, put a decent GPU in it, and enjoy fast inference (16 GB of VRAM will allow fast inference with 13B models, and 24 GB should allow heavily quantized 30B; the rough VRAM math is sketched below). The most expensive bit is the GPU.

    See this sub’s wiki for more detailed hardware tips.
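
    For reference, a rough back-of-the-envelope estimate behind those VRAM numbers (weights only, plus a flat allowance for overhead; the figures are approximations, not benchmarks, since real usage also depends on context length and the KV cache):

    def approx_vram_gb(params_billion: float, bits_per_weight: float,
                       overhead_gb: float = 1.5) -> float:
        """Weights (params * bits / 8 bytes) plus a flat overhead allowance."""
        weights_gb = params_billion * 1e9 * bits_per_weight / 8 / 1024**3
        return weights_gb + overhead_gb

    for params, bits in [(7, 4.5), (13, 4.5), (30, 4.5), (13, 16)]:
        print(f"{params}B at ~{bits} bits/weight: ~{approx_vram_gb(params, bits):.1f} GB")

    # Roughly: 7B ~5 GB, 13B ~8 GB, 30B ~17 GB at 4-bit-ish quantization,
    # versus ~26 GB for 13B at fp16, which is why quantized 13B fits a 16 GB
    # card and heavily quantized 30B fits 24 GB.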

  • a_beautiful_rhind@alien.topB · 11 months ago

    They sell P40s on AliExpress and eBay that ship from China. Fill some used box with those and use llama.cpp. You can also try your hand with the dirt-cheap AMD MI25. P100s are an option too if you want better FP16.

    It all depends on what you want to do, what is importable/available, and your budget.