https://huggingface.co/TheBloke/MistralLite-7B-GGUF
This is supposed to be a 32k-context finetune of Mistral. I've tried the recommended Q5 quant in both GPT4All and LM Studio, and it works for normal short prompts but hangs and produces no output when I crank the context length up to 8k+ for data cleaning. I tried it CPU-only (machine has 32GB of RAM, so that should be plenty) and hybrid, with the same bad outcome both times. Curious if there are some undocumented RoPE settings that need to be adjusted.
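Not sure if GPT4All/LM Studio exposed this at the time, but if you can run the GGUF through llama.cpp directly, its CLI lets you override the RoPE frequency base, which long-context finetunes often change. A sketch below; the file name, context size, and `--rope-freq-base` value are all illustrative, so check the model card / `config.json` for the actual `rope_theta` before trusting any of them:

```shell
# Illustrative llama.cpp invocation -- paths and the rope-freq-base value
# are assumptions, not confirmed settings for this model.
./main \
  -m ./mistrallite.Q5_K_M.gguf \
  -c 16384 \
  --rope-freq-base 1000000 \
  -p "$(cat long_prompt.txt)"
```

If the GGUF was converted with the right metadata baked in, llama.cpp should pick the base up automatically and the override shouldn't be needed; if the conversion predates that metadata, the override is the usual workaround.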
Anyone get this to work with long prompts? Otherwise, what do y’all recommend for 32k+ context with good performance on data augmentation/cleaning, with <20B params for speed?
You can try our hosted version and see if you get better results out of it.