Hello!
By popular demand, I am planning a fine-tune of https://huggingface.co/dreamgen/opus-v0-7b on top of Yi-34B, and I wonder whether to use the 200K variant as the base.
The regular Yi-34B seems slightly better than Yi-34B-200K on standard benchmarks, but I wonder how the 200K version “feels” in practice and whether its drop in short-context performance is worth it, given that the regular version can already be used up to 32K tokens.
Did anyone try an analysis of these 2 models on various sequence lengths (<4K, <8K, <16K, etc.)?
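Something like this rough sketch is what I have in mind: per-token loss of both base models on the same long text at a few context lengths. The repo ids, the wikitext test split, and the exact lengths are just placeholders, and you would obviously need hardware that fits a 34B model at these sequence lengths:

```python
# Minimal sketch (assumptions: repo ids, dataset, and lengths are illustrative).
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

MODELS = ["01-ai/Yi-34B", "01-ai/Yi-34B-200K"]  # assumed HF repo ids
LENGTHS = [4096, 8192, 16384, 32768]            # context lengths to probe

# One long evaluation text; wikitext-103 test is just a convenient stand-in.
text = "\n\n".join(
    load_dataset("wikitext", "wikitext-103-raw-v1", split="test")["text"]
)

for name in MODELS:
    tok = AutoTokenizer.from_pretrained(name, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        name,
        torch_dtype=torch.bfloat16,
        device_map="auto",
        trust_remote_code=True,  # may or may not be needed depending on transformers version
    )
    ids = tok(text, return_tensors="pt").input_ids.to(model.device)

    for n in LENGTHS:
        chunk = ids[:, :n]
        with torch.no_grad():
            # labels=input_ids gives the mean next-token cross-entropy
            loss = model(chunk, labels=chunk).loss
        print(f"{name} @ {n} tokens: loss={loss.item():.3f}, "
              f"ppl={torch.exp(loss).item():.2f}")

    del model
    torch.cuda.empty_cache()
```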
The regular 34B “feels” like it ignores my prompt a lot.
It’s supposed to be a base model, not an instruction-finetuned model. That’s how base models generally behave, unless they are sold as base but are actually finetuned (e.g., the Llama 2 base models).
I felt this too. It does seem to “grab on,” though, when you give it a longer context to continue.