permalip@alien.topBtoLocalLLaMA@poweruser.forum•Brand New Mistral 16k Context Size Models got released last night from NurtureAI!English
1·
1 year agoThere is nothing “true” context length about MistralLite. You are essentially removing the sliding window by doing what Amazon or Yarn is doing.
It has 32k, they mention it in their config “max_position_embeddings”: 32768. This is the sequence length.
https://preview.redd.it/5r2c9592vr0c1.png?width=256&format=png&auto=webp&s=be88f25168e3cec16cbe7f9aad15f678edf97e99