A Lecture on How to Scale Open LLMs to GPT4 Level

johnolafenwa@alien.top · 3 years ago

A Lecture on How to Scale Open LLMs to GPT4 Level

klenen@alien.top · 3 years ago

Thank you!

xadiant@alien.top · 3 years ago

Really cool, will check the video out. Since we found an actually qualified person though, let me ask a few layman questions, hope you have time to answer them!

sampling methods. Most of them look simple, but we still don’t really know how to tune them. Do you think novel sampling methods or specific combinations could improve output quality by a lot?

For instance, beam search. Does beam search provide a linear improvement in quality as you go up or not?

Do you think ideal numbers for temperature, top_k and top_p are context or model based, or both?