Is LLaMA-1-65B or LLaMA-2-70B more creative at storytelling ?

nuvalab@alien.top · 1 year ago

That sounds like CPU speed. What you see from `watch nvidia-smi -d -n 0.1` while you’re running inference ?

nuvalab@alien.top · 1 year ago

Thanks for writing this, it’s an interesting idea and very relevant to the issue that I am trying to solve too - creative writing, which definitely hates repetition, and very interested to try out what you proposed once it’s available :)

One technical question for this approach: Wouldn’t it change the original distribution of training data / output, specially in case where there is one and obviously good one next token to choose from? I can see the value when multiple next tokens are all considered great with close probability, but curious how would it behave otherwise in terms of consistency in correctness.

nuvalab@alien.top · 1 year ago

That’s an interesting idea … in my experience anything <1 works, >1.2 goes wild and for things we expect to be a bit more deterministic, setting it to 0 is preferred.

What’s your best setup and temperature for creative writing ?

nuvalab@alien.top · 1 year ago

Is LLaMA-1-65B or LLaMA-2-70B more creative at storytelling ?