DreamGen Opus — Uncensored model for story telling and chat / RP

DreamGenX@alien.top · 2 years ago

DreamGen Opus — Uncensored model for story telling and chat / RP

sharockys@alien.top · 2 years ago

Thank you for sharing! I am going to learn cooking with it.

DreamGenX@alien.top · 2 years ago

I hope it will be something tasty! :)

trollsalot1234@alien.top · 2 years ago

I have no idea how your model is but that prompting guide is probably the nicest one I’ve seen, so kudos on that.

The_One_Who_Slays@alien.top · 2 years ago

Sooo, how’s the model?

harrro@alien.top · 2 years ago

Any GGUF quantized download available?

DreamGenX@alien.top · 2 years ago

It’s here!

DreamGenX@alien.top · 2 years ago

Great news, the great /u/TheBloke is working on this!

https://preview.redd.it/uqebzbr1q8zb1.png?width=2175&format=png&auto=webp&s=46ab334fa4b2b3cabab7d36461f991edfd2e8a60

DreamGenX@alien.top · 2 years ago

There was a bug on the website where the first time the “Continue” would not work if you did not refresh, should work now even though the editor is quite janky still, sorry for that :(

(can’t wait for AI to take over React from me :P)

mcmoose1900@alien.top · 2 years ago

I was going to suggest you triain on Yi 34B 200K instead of Llama 70B, as my biggest issue with storytelling models is slamming into the context limit.

…But I just remembered that Yi has a stupid noncommercial license https://huggingface.co/01-ai/Yi-34B-200K/blob/main/LICENSE

Ugh. I hope a long context ~34B comes out that doesn’t have such an ugly license.

AbsorbingCrocodile@alien.top · 2 years ago

Why only 7B?

Revolutionalredstone@alien.top · 2 years ago

Awesome l

Becoming your own AI company has never been easier 😊

Dazzling_Ad1507@alien.top · 2 years ago

Very cool website and model!

DreamGenX@alien.top · 2 years ago

Thank you!

deccan2008@alien.top · 2 years ago

Currently seems very expensive. Use of 7b models is effectively available for free in many places, Openrouter, Agnaistic, etc. Seems ridiculous that you don’t get unlimited usage even with a subscriptions.

DreamGenX@alien.top · 2 years ago

I agree, I hope I can make things cheaper with better utilization. You have to consider that a single GPU is not used 100% the time, so there’s a lot of waste. And due to lack of scale, I also do not get any special pricing on the GPUs. The more users, the closer the utilization will be to 100%, and the better GPU pricing. (For instance, I heard that on Google Cloud, enterprise customers can negotiate the on-demand GPU price down to the regular spot price for some of the GPUs)

Proud-Point8137@alien.top · 2 years ago

Is this the first fully uncensored mistral 7b?

trollsalot1234@alien.top · 2 years ago

Alright, super technical review time: I got this running on the potato I connect to Reddit with, even though I usually only try gguf and only on days when the sun is shining and God seems happy. It made my Gtx 1070 ti cry (see I told you I would be technical!), but it worked. Then I altered a demo prompt, and it wrote me a story at about 1 token every 3 seconds where Little Red Riding Hood drank pee. So I’m giving this model a score of 8.6 dead babies, which is better than Tiefighter.

vitlaska@alien.top · 2 years ago

Amazing. Reminds me of my favorite story testing prompt: [insert character] tricking Dr. Manhattan into drinking their piss at an Irish Pub. Can’t wait to try it out with this one.

DreamGenX@alien.top · 2 years ago

Wow, amazing, thanks for giving it a try GGUF and other quants are coming, so your computer should have an easier time soon! :)

What’s the maximum possible dead babies score? :D

mcmoose1900@alien.top · 2 years ago

but Llama 2 70B version is in the works

Might I suggest you use Yi-34B-200K instead? Or maybe later?

The problem I always have with storytelling models is slamming into the context limit, but Yi is already storytelling well out to 42K tokens for me, with just a basic Alpaca LoRA.

Healthy_Cry_4861@alien.top · 2 years ago

Looking forward to the release of 70b!

Shaggy07tr@alien.top · 2 years ago

is the context length 2048?

DreamGenX@alien.top · 2 years ago

The training data had example of up to 4096 tokens. The model should also work beyond that, but I did not do a deep analysis of degradation.

DreamGen Opus — Uncensored model for story telling and chat / RP

DreamGen Opus — Uncensored model for story telling and chat / RP

How to try it out

Using vLLM

Using DreamGen.com website (free)

What’s next