For roleplay purposes, Goliath-120b is absolutely thrilling me

tenmileswide@alien.top · 1 year ago

For roleplay purposes, Goliath-120b is absolutely thrilling me

Sabin_Stargem@alien.top · 1 year ago

I tend to use models with at least 16k context. Goliath 120b q2 was coherent, but was also very much out of character when telling the NSFW bust massage story. “Yeahyeah” and other lingo. Probably quite good at a lower context, but 16k definitely isn’t the proper fit for Goliath.

The search for the Goldilocks Model continues.

Ok_Relationship_9879@alien.top · 1 year ago

Which models do you find to be good at 16k context for story writing?

Sabin_Stargem@alien.top · 1 year ago

I don’t think any small models are actually good for that usecase, at least not for serious writing. The best we got access to are probably Mistral finetunes (up to 32k), and Yi-34b, but Yi doesn’t have any finetunes yet. An Dolphin should on the way for Yi, IIRC.

In any case, my favorite 7b model tend to be franken merges, which stitch together an assortment. This allows the resulting model to be able to grasp a wider range of topics. At the moment, the best for this size is likely Undi’s Toppy, which is uncensored is well rounded.

The issue with Mistral 7b and small models is that they tend to lose flavor over time, and the logic also gets weaker. Coherent, but the ‘X’ factor is gone.