Finaly i finish my thesis defense and got chance for upgrade my laptop ram to 20 gb, that so far best thing i can do, i currently run 7b mistral with it with koboldcpp but speed is… kinda slow 0.3 token per second sometime it at peak 0.8 what wrong here ? or should i try ooboga instead or gpt4free ?
the real solution here is a new laptop buddy.
or using the cloud.