What is the best current uncensored Storytelling LLM that can run with 32gb system ram and 8 gb Vram PC?

Acrobatic_Internal_2@alien.top · 10 months ago

What is the best current uncensored Storytelling LLM that can run with 32gb system ram and 8 gb Vram PC?

uti24@alien.top · 10 months ago

Interesting, everyone suggesting 7B models, but you can run much better models using not only your GPU memory, so I would highly recommend mxlewd-l2-20b its very smart, its fantastic for writing scenes and such.

Kevinswelt@alien.top · 10 months ago

At 20 words per minute… Oh the joys of CPU interference

IXAbdullahXI@alien.top · 10 months ago

I personally like and use echidna-tiefigther-25. There’s also another good one which is Openhermes-2.5-Mistral.

zware@alien.top · 10 months ago

If you want speed, you’ll want to use Mistral-7B-OpenOrca-GPTQ with ExLLama v2, that’ll give you around 40-45 tokens per second. TheBloke/Xwin-MLewd-13B-v0.2-GGUF to trade speed for quality (llama.cpp)