Goliath-120B - quants and future plans

AlpinDale@alien.top · 2 years ago

Goliath-120B - quants and future plans

ReturningTarzan@alien.top · 2 years ago

If anyone has suggestions, please let me know. Cheers!

The suggestion I’d give, apart from finetuning, would just be to do some actual tests. Construct some scenarios that test the model’s ability to “show not tell” and so on, and contrast with smaller models and/or with a “null hypothesis” Frankenstein model where the added layers are just random matrices, etc.

Ideally, if there’s nothing you can do to objectively measure the model’s performance, try to set up a blind test of some sort to see if users actually prefer the Frankenstein model over the two models it was spliced together from.

Not to disparage the project or anything, but confirmation bias is a real thing, and it’s especially rampant in the LLM space.

AlpinDale@alien.top · 2 years ago

>confirmation bias
That’s true. The model is up on the Kobold Horde if anyone wants to give it a try.