DPO models seem to be pretty good

lemon07r@alien.top · 11 months ago

DPO models seem to be pretty good

sebo3d@alien.top · 11 months ago

I tried DPOpenHermes from TheBloke(Q6 GGUF version) and i love it but i think there’s an issue with an EOS token as for some reason the model just keep generating text way past where it should logically stop. I see myself using it more but i hope there will be an update that adresses the EOS issue.

Feztopia@alien.top · 11 months ago

There is already an updated version that is supposed to fix that (with additional training on top which lowered it’s overall capabilities apparently). I don’t know if TheBloke has it already. But I see the first set of dpo models as test runs the next ones should fix the issues (except for NeuralHermes, maybe it’s already good, I didn’t hear much feedback about it).