Proposed Alternative to Repetition Penalty - Noisy Sampling

kindacognizant@alien.top · 10 months ago

Proposed Alternative to Repetition Penalty - Noisy Sampling

andrewlapp@alien.top · 10 months ago

Very interesting idea. If you can create a simple benchmark (really just any prompt applied to a noisy 7B model) and demonstrate a reduction in repetition compared to baseline, this method will proliferate across the open LLM development ecosystem.

Looking forward to seeing your implementation!

kindacognizant@alien.top · 10 months ago

Applying Gaussian noise randomization to the logits with a gaussian deviation factor of 1.0 is totally coherent at top k = 1 (aka it’s picking the top token post randomization) on my Lora that I trained that I’m doing testing on and I haven’t seen repetition issues thus far. How might I test this? Like what are your best benchmark ideas?

andrewlapp@alien.top · 10 months ago

Here are some factors that may help induce repetition:

1. Llama 2 7B, Mistral 7B, or Yi 6B variant
1. Use a lossy quantization such as Q2_K (2 bit), Q4_0 (4 bit), or GPTQ (4 bit)
1. Use a sequence length of at least 1024 tokens, if not 2048
1. Use a text corpus with a lot of repetition, e.g. https://github.com/Lyrics/lyrics

Additionally, you should use lm-evaluation-harness to test for any degradation in performance in common benchmarks.

aseichter2007@alien.top · 10 months ago

rep penalty off, repeat a ton of text over and over, use the wrong instruct to make it sperg out, and watch to see deviations in the regular output, if I understand from my quick look, you should eventually have some outliers as you increase the strength of the deviation even with top k = 1. Am I sane or out of my depth?

Proposed Alternative to Repetition Penalty - Noisy Sampling

Proposed Alternative to Repetition Penalty - Noisy Sampling

Noisy Sampling

- Context Free

- Scales with Confidence