Civil_Ranger4687@alien.top to LocalLLaMA@poweruser.forum • Is there any way to speed up the MythoMax-L2-13B on a 6GB GPU?
11 months ago
Never use the Q8_0 versions of GGUFs unless most or all of the model fits comfortably in your VRAM. The Q6_K version is much smaller and almost the same quality.
For your setup, I would use mythomax-l2-13b.Q4_K_M.gguf.
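To put rough numbers on that, here is a back-of-the-envelope sketch. The bits-per-weight figures are approximations for llama.cpp quant types, not exact file sizes; the 40-layer count assumes the standard Llama 2 13B architecture; and `reserve_gb` is a made-up allowance for context and overhead, so treat the output as a ballpark only.

```python
# Approximate bits per weight for each quant type (rough figures, not exact).
BITS_PER_WEIGHT = {"Q8_0": 8.5, "Q6_K": 6.56, "Q4_K_M": 4.85}


def estimate_size_gb(n_params_billion, quant):
    """Rough model size in GB (10^9 bytes) for a given quantization."""
    total_bits = n_params_billion * 1e9 * BITS_PER_WEIGHT[quant]
    return total_bits / 8 / 1e9


def layers_that_fit(model_gb, n_layers, vram_gb, reserve_gb=1.5):
    """Estimate how many layers fit in VRAM, assuming equal-sized layers.

    reserve_gb is a hypothetical allowance for the KV cache and other overhead.
    """
    per_layer_gb = model_gb / n_layers
    usable_gb = max(vram_gb - reserve_gb, 0.0)
    return min(n_layers, int(usable_gb // per_layer_gb))


if __name__ == "__main__":
    for quant in BITS_PER_WEIGHT:
        size = estimate_size_gb(13, quant)
        n = layers_that_fit(size, n_layers=40, vram_gb=6)
        print(f"{quant}: ~{size:.1f} GB, roughly {n}/40 layers on a 6 GB GPU")
```

None of these fit entirely in 6 GB, but the smaller the quant, the larger the share of the model that can sit on the GPU, which is where the speedup comes from.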
Yeah, there’s so much to learn; I’m still figuring a lot out too.
Good tip for settings: Play around mostly with temperature, top-p, and min-p.
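If it helps to see what those three knobs actually do, here is a toy sketch in plain Python. It is not how any particular backend implements them (the function names are mine, and real samplers differ in the order they apply filters), just the general idea on a four-token example.

```python
import math


def apply_temperature(logits, temperature):
    """Softmax over logits / temperature. Higher temperature flattens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def min_p_filter(probs, min_p):
    """Drop tokens whose probability is below min_p times the top token's probability."""
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    total = sum(kept)
    return [p / total for p in kept]


def top_p_filter(probs, top_p):
    """Keep the most likely tokens until their cumulative probability reaches top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept = [0.0] * len(probs)
    cumulative = 0.0
    for i in order:
        kept[i] = probs[i]
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(kept)
    return [p / total for p in kept]


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.0, -3.0]  # made-up scores for four candidate tokens
    for temperature in (0.7, 1.0, 1.5):
        probs = apply_temperature(logits, temperature)
        print(temperature, [round(p, 3) for p in min_p_filter(probs, 0.1)])
```

Lower temperature and a tighter top-p or higher min-p all make output more predictable; going the other way makes it more varied and more prone to going off the rails.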