Having a hard time setting deepseek coder instruct to work

iChrist@alien.top · 1 year ago

Having a hard time setting deepseek coder instruct to work

iChrist@alien.top · 1 year ago

I found the fix for this issue (Tested by me only, thanks to u/FullOf_Bad_Ideas for the suggestion)

reduce the Repetition penalty to 1, the code will be much better, and closely resemble what is generated on the website. (tested multiple times with pong and snake)

YearZero@alien.top · 1 year ago

Yeah I basically turn the temperature to 0.1, disable every sampler, and turn the temperature as low as the GUI will allow (I have mine at 1). I’m using Koboldcpp 1.50 and deepseek-coder-instruct 33b is working very well for me. If it’s not on par with GPT-4, it’s incredibly close. I tested the 7b model and it’s pretty good, but it does mess up more frequently requiring you to fix its mistakes. 33b gives me workable code more often than not.

I’ve been testing it on a bunch of different problems on this site: https://www.w3resource.com/index.php

…and it seems to ace everything I throw at it. Granted those aren’t particularly challenging problems, but still, it’s very consistent, which means I can use it for work reliably (I work mainly with SQL). The 16k context doesn’t hurt either!

Now I just wish I had more than 8gb vram, cuz I’m getting like 1.3 Tokens per second, so I have to be super patient.

Also I just added it to my VSCode using the Continue extension (while the model runs using llamacpp). It works beautifully there too (once you configure the prompt correctly to what the model expects). If you use VSCode at all, you can now have a really good copilot for free.

AfterAte@alien.top · 1 year ago

Oh nice! I’ll have to try those settings and compare with the StarChat preset in Oobabooga. I hear ya, I get 1t/s too… it’s unbearable.

YearZero@alien.top · 1 year ago

I miss the days when high-end gpu’s were like $400-500! I’m not made of moneys, and I also use a laptop, so the most I could buy right now would be 16gb vram anyway. I’ll probably save up and wait for next gen and see if they make any headway there.

iChrist@alien.top · 1 year ago

Thank you for the response!

Il try to adjust the temp too, how can I disable samplers in oobabooga? what is the setting?

Is there a way to set rep penalty lower than 1?

YearZero@alien.top · 1 year ago

unfortunately I haven’t used ooba in a few months so I can’t tell you, but in koboldcpp it just tells you what values disable the samplers.