Asking for tips on how to use base models instead of instruct/chat-tuned models

noeda@alien.top · 2 years ago


FullOf_Bad_Ideas@alien.top · 2 years ago

In my opinion, Yi-34B and Llama 2 70B are pretty bad in their raw state. Llama 1 65B is pretty good raw. The Llama 2 models are not truly raw bases: they clearly recognize instruction prompts and have refusals ingrained, so they don't really behave like base models. I am not aware of any non-instruct storywriting fine-tunes, but the idea sounds exciting. If I can find a small storywriting dataset, I can try to train Yi-34B or Mistral on it.

Base Yi-34B and Mistral fall into repetitive patterns quickly, and Llama 1 65B sometimes starts outputting Python code out of nowhere, but it should still be your best bet for a raw storywriting model.