How much does Quantization actually impact models? - KL Divergence Tests

kindacognizant@alien.top · 2 years ago

dnsod_si666@alien.top · 2 years ago

You could also use this to measure different models against each other, right? And more generally, use it as a model benchmark.
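For what it's worth, here is a rough sketch of what that comparison could look like, assuming you can dump per-token logits from each model over the same text and that the models share a vocabulary (otherwise the distributions are not directly comparable). The function names and the toy logits are made up for illustration; this is not the script used in the original post.

```python
import math

def softmax(logits):
    """Turn raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) in nats; terms with p_i == 0 contribute nothing."""
    return sum(
        pi * math.log(pi / max(qi, eps))
        for pi, qi in zip(p, q)
        if pi > 0.0
    )

def mean_kl(reference_logits, candidate_logits):
    """Average KL(reference || candidate) over all token positions."""
    kls = [
        kl_divergence(softmax(r), softmax(c))
        for r, c in zip(reference_logits, candidate_logits)
    ]
    return sum(kls) / len(kls)

# Hypothetical logits: 2 token positions, vocabulary of 3 tokens.
reference = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.3]]
candidate_a = [[1.9, 1.1, 0.1], [0.6, 2.4, 0.3]]  # close to the reference
candidate_b = [[0.1, 1.0, 2.0], [2.5, 0.5, 0.3]]  # far from the reference

score_a = mean_kl(reference, candidate_a)
score_b = mean_kl(reference, candidate_b)
print(f"candidate_a: {score_a:.4f} nats, candidate_b: {score_b:.4f} nats")
```

A lower mean KL means the candidate's next-token distribution stays closer to the reference. Note that this only says how far a model is from the chosen reference, not which of the two is better.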

-Separate idea- Also, isn’t getting the true probabilities useful anyway? Then we could have the training process be:

Like instead of training twice (sequence to probabilities):

So you would be training on less data, which would reduce training costs and whatnot.