Help for the n00b? Optimal loader parameters...

AirwolfPL@alien.top · 10 months ago

Help for the n00b? Optimal loader parameters...

AirwolfPL@alien.top · 10 months ago

Thanks for the informative answer. I will take a look at GGUF models (although I’m not sure yet how to split them between cpu/gpu yet (I will take a look at llama.cpp parameters).

Kevinswelt@alien.top · 10 months ago

You can find a n-gpu-layers slider, when you select llama.cpp. You can just input the max amount if you want everything on the GPU. Otherwise the model you loaded will say how many layers it has during loading in the terminal.