Proposed Alternative to Repetition Penalty - Noisy Sampling

kindacognizant@alien.top · 10 months ago

Proposed Alternative to Repetition Penalty - Noisy Sampling

WolframRavenwolf@alien.top · 10 months ago

imagine a language model that was tasked to do trivial math problems, and a user always involved the number 3 in his first 5 questions. After a certain amount of context, it will bias against using the number 3 in the solution even if if it is correct.

I used to think that, but one of the Transformers devs (Joao Gante from HF) told me that it is “only applied at most once per token” within the repetition penalty range, so it doesn’t matter how often the number 3 appears in the first 5 questions, as long as the repetition penalty is a “reasonable value (e.g. 1.2 or 1.3)”, it won’t have a negative impact on tokens the model is reasonably sure about. So for trivial math problems, and other such situations, repetition penalty is not a problem.

Same with other tokens like EOS, newlines, punctuation, etc. - if the repetition penalty would affect them negatively, we’d quickly see lots of problems. So it’s not preventing the output of tokens the model is sure about, it’s trying to prevent repetition in cases the token isn’t that predetermined.

Just something non-obvious to keep in mind.

Proposed Alternative to Repetition Penalty - Noisy Sampling

Proposed Alternative to Repetition Penalty - Noisy Sampling

Noisy Sampling

- Context Free

- Scales with Confidence