How much does Quantization actually impact models? - KL Divergence Tests

kindacognizant@alien.top · 2 years ago

JealousAmoeba@alien.top · 2 years ago

Would I get better results in general by running a 7B model with Q8, or a 13B model with Q4/Q5? My laptop can do either.

I’m guessing the quantized 13B model will be better but has anyone ever benchmarked 7B vs 13B for different levels of quantization?

LOLatent@alien.top · 2 years ago

I’m in the exact same boat, if you get an answer, pls lettus know! 7b q8 or 13b q4?