Asking for tips on how to use base models instead of instruct/chat tuned models

noeda@alien.top · 2 years ago


phree_radical@alien.top · 2 years ago

I think a base model is preferable in many cases for developers, particularly if instruction-following abilities don’t cut it, or you worry about instruction injection, or you just want to make sure the text you get isn’t bent into the curves of the “helpful” fine-tuning distribution

It’s easy to recommend a base model for targeted generations that leverage its pattern-following ability. After a handful of examples in the prompt you get what you want, almost as if those examples were fine-tuning data. I went through my history for examples of few-shot completion: classification, rewrite sentence copying style, classify, basic Q&A example, fact check yes/no, rewrite copying style and sentiment, extract list of musicians, classify user intent, tool choice, rewrite copying style again, flag/filter objectionable content, detect subject changes, classify profession, extract customer feedback into json, write using specified words, few-shot cheese information, answer questions from context, classify sentiment w/ probabilities, summarize, replace X in conversation
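To make the few-shot pattern concrete, here is a minimal sketch of how such a prompt might be assembled for sentiment classification. The function names, the "Text"/"Sentiment" labels, and the example reviews are all made up for illustration; the resulting string would be sent to whatever completion endpoint you use.

```python
def build_few_shot_prompt(examples, query, input_label="Text", output_label="Sentiment"):
    """Format labeled examples plus one unlabeled query as a completion prompt."""
    blocks = [f"{input_label}: {text}\n{output_label}: {label}" for text, label in examples]
    # The last block stops right after the output label so the model fills in the answer.
    blocks.append(f"{input_label}: {query}\n{output_label}:")
    return "\n\n".join(blocks)


def first_line(completion):
    """Keep the first non-empty line; a base model tends to keep inventing more examples."""
    for line in completion.splitlines():
        if line.strip():
            return line.strip()
    return ""


# Hypothetical examples, not taken from the thread.
examples = [
    ("The screen is gorgeous and setup took two minutes.", "positive"),
    ("Support never answered my emails.", "negative"),
]
prompt = build_few_shot_prompt(examples, "The battery died after a week.")
print(prompt)
```

Because a base model simply continues the pattern, the completion usually needs trimming; `first_line` is one simple way to do that if your generator has no stop-sequence option.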

Most of that is aimed at developers, though, and many of those use cases require a temperature of 0
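For anyone wondering why temperature 0 matters there: it is usually treated as greedy decoding, so the same prompt gives the same answer every time. A rough sketch of the idea over a toy list of logits follows; `sample_token` is a hypothetical helper, not any particular generator's API.

```python
import math
import random


def sample_token(logits, temperature, rng=random):
    """Pick a token index from raw logits at the given temperature."""
    if temperature == 0:
        # Greedy decoding: always take the highest-scoring token, so output is repeatable.
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max before exponentiating for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


toy_logits = [0.1, 2.0, 1.5]  # made-up scores for a three-token vocabulary
greedy_pick = sample_token(toy_logits, temperature=0)
sampled_pick = sample_token(toy_logits, temperature=1.0)
```

With classification or extraction prompts you generally want the single most likely label, which is what the greedy branch returns.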

For long-form writing, on the other hand, you’ve found some hindrances. First, results will benefit a great deal from longer context. Second, you’ll probably get some looping patterns you can avoid by increasing repetition penalties in your generator
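As a rough picture of what a repetition penalty does, here is one common formulation applied to a toy list of logits. This is a sketch under that assumption; your generator may implement the setting differently, and `apply_repetition_penalty` is a hypothetical name.

```python
def apply_repetition_penalty(logits, generated_ids, penalty=1.2):
    """Return a copy of logits with already-generated tokens made less likely."""
    adjusted = list(logits)
    for token_id in set(generated_ids):
        score = adjusted[token_id]
        # With penalty > 1, dividing a positive score or multiplying a negative one lowers it.
        adjusted[token_id] = score / penalty if score > 0 else score * penalty
    return adjusted


toy_logits = [2.0, -1.0, 0.5]  # made-up scores for a three-token vocabulary
already_generated = [0, 1, 0]  # token ids produced so far
penalized = apply_repetition_penalty(toy_logits, already_generated, penalty=2.0)
```

Tokens that have not appeared yet keep their original score, so raising the penalty nudges the model away from loops without touching fresh vocabulary.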

Finetunes for storywriting do seem like a good idea, I found at least this one

Capital-Alps5626@alien.top · 2 years ago

https://www.reddit.com/r/LocalLLaMA/comments/17yxoxv/local_llm_for_hot_dog_or_not_hot_dog_kind_of_fact/

Would you say your advice in this post is applicable to my post? I think I’m in this same camp. I don’t want to go through the hundreds of fine-tuned models. I just want to talk to the model with the kinds of things you’ve mentioned.

Then why do people fine-tune for instruction? Perhaps what I’m really asking is: how do you fine-tune a model for instruction? Is there a document or a set of steps?

AutomataManifold@alien.top · 2 years ago

That’s a good point about few-shot prompting: the big thing about GPT-3 and instruction training was that it allowed for zero-shot prompting (i.e., prompting with zero examples). But if we’re manually prompting a base model, there’s no reason not to provide those examples, and you get dramatically improved performance versus the same model with no examples.