Lower quality responses with GPTQ model vs GGUF?

Civil_Ranger4687@alien.top · 1 year ago

Lower quality responses with GPTQ model vs GGUF?

FieldProgrammable@alien.top · 1 year ago

I think exl2 is being let down by the number of quants that are using wikitext as the quantization dataset, even when it is obvious that this is completely mismatched to the model’s fine tuning. Activation order based quantization needs good measurement data to make the correct decisions on quantization.

If however you see the quantization data fits with the fine tune then the effects would be completely the opposite.