Exclusively 70B models. Current favorite is:
- Role-playing: lzlv 70B GPTQ on gptq-4bit-32g-actorder_True
Although ask me again a week from now and my answer will probably change. That’s how quick improvements are.
Okay, but when are they going to get Amazon Alexa into an actual AI?
Toner held firm in her belief that Altman shouldn’t be at the helm of OpenAI after Sutskever reversed course, and said during those initial reinstatement discussions that because the company charter charges its board with creating AI that “benefits all of humanity,” it was more consistent with that mission that the company be destroyed in Altman’s absence than see him as its chief executive again.
https://www.yahoo.com/news/openai-board-apparently-seething-rage-195257117.html
Wow, so this bitch wanted to bring the whole company down rather than allow Sam Altman back on. Unbelievable. EA advocates can walk the plank.
It’s like when Tony Stark was in a cave and made a prototype Iron Man suit.
Damn, no 13B?
Then they should be furious with Sutskever for wanting to slow things down. Slowing things down is not in the best interest of their shareholders. Sutskever needs to go now, and Sam Altman should be reinstated. Bring on the singularity.
We discover it was Jimmy Apples sending us inferences all this time.
*In steps his replacement.*
Sam: “Understandable, have a nice day.”
According to TheBloke, the Sequence Length is 8192 ctx, so I’m assuming 8192 is its default and it can extend up to 200k ctx via alpha_scale?
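For anyone wondering what alpha_scale actually does under the hood, here’s a minimal sketch of the NTK-aware RoPE adjustment that exllama-style loaders expose as alpha_value / alpha_scale. The head_dim=128 and base=10000 constants are Llama-family assumptions on my part, and the 200k figure is the commenter’s, so treat this as intuition rather than a spec:

```python
# Minimal sketch of the NTK-aware RoPE "alpha" trick that exllama-style
# loaders expose as alpha_value / alpha_scale. head_dim=128 and base=10000
# are Llama-family assumptions, and the alpha -> usable-context mapping is
# rough, so treat the printout as intuition, not a guarantee of 200k ctx.

def ntk_rope_base(alpha: float, base: float = 10000.0, head_dim: int = 128) -> float:
    """Adjusted RoPE base (theta) for a given alpha value."""
    return base * alpha ** (head_dim / (head_dim - 2))

if __name__ == "__main__":
    native_ctx = 8192
    for alpha in (1.0, 2.0, 4.0, 8.0):
        print(f"alpha={alpha:>4}: rope base ~{ntk_rope_base(alpha):>13,.0f}, "
              f"very roughly ~{int(native_ctx * alpha):,} usable ctx")
```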
I would shit a brick if they say, “Oh by the way everyone, we dropped a new Mistral model just now.”
For real-time uses like Voxta+VaM, EXL2 4-bit is better.
Wow, I didn’t expect to see a Virt-a-Mate reference. You left no stone unturned and are doing God’s work.
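On the EXL2 4-bit point above: time to first token is what makes or breaks voice setups like Voxta. Here’s a rough way to measure it, assuming the quant is already served through an OpenAI-compatible completions endpoint (for example text-generation-webui’s API or TabbyAPI); the URL, port, and payload fields are placeholders, not a documented Voxta+VaM config:

```python
# Rough time-to-first-token check, a sketch only. It assumes the EXL2 quant
# is already served through an OpenAI-compatible completions endpoint (for
# example text-generation-webui's API or TabbyAPI); the URL, port, and
# payload fields are placeholders, not a documented Voxta+VaM setup.
import json
import time

import requests

API_URL = "http://127.0.0.1:5000/v1/completions"  # hypothetical local endpoint


def stream_completion(prompt: str, max_tokens: int = 64) -> None:
    """Stream tokens and report time to first token, the metric voice apps care about."""
    payload = {"prompt": prompt, "max_tokens": max_tokens, "stream": True}
    start = time.time()
    first_token_at = None
    with requests.post(API_URL, json=payload, stream=True, timeout=120) as resp:
        for line in resp.iter_lines():
            if not line or not line.startswith(b"data: "):
                continue
            data = line[len(b"data: "):]
            if data == b"[DONE]":
                break
            if first_token_at is None:
                first_token_at = time.time()
            chunk = json.loads(data)
            print(chunk["choices"][0].get("text", ""), end="", flush=True)
    if first_token_at is not None:
        print(f"\n\ntime to first token: {first_token_at - start:.2f}s")


if __name__ == "__main__":
    stream_completion("Stay in character and greet the player in one short sentence.")
```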
lol I had to look up what EOY meant. Thought it was some kind of tech convention.
I find it comical that it took this long to get a proper dissection of what these settings meant, and to no surprise it spiked to 387 upvotes in 13 hours.
Plus, isn’t GPT-3.5-Turbo multimodal? There’s no way a 7B can outperform that.
Every time I went on Horde, it was typically models people could run on an RTX 4090 or 3090. The problem is I too own an RTX 4090, so I don’t see why someone should gain credits when there are no 70B models to spend them on.
If I were to attempt to fit this on a $0.79/hr Runpod instance (A6000, 48 GB VRAM, 50 GB RAM, 8 vCPU), what’s my best option? Is it even possible?
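Rough back-of-the-envelope math for that, assuming “this” means a 70B-class quant; the layer/head numbers below are Llama-2-70B’s and the fp16 KV cache at 4k ctx plus zero runtime overhead is a simplification, so treat it as a sketch rather than a promise that it loads:

```python
# Back-of-the-envelope VRAM estimate, a sketch only. It assumes "this" is a
# 70B-class quant, uses Llama-2-70B geometry for the KV cache (80 layers,
# GQA with 8 KV heads, head_dim 128, fp16 cache at 4k ctx), and ignores
# activation/runtime overhead, so real usage will be somewhat higher.

GIB = 1024 ** 3


def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    return params_billion * 1e9 * bits_per_weight / 8 / GIB


def kv_cache_gib(ctx: int, layers: int = 80, kv_heads: int = 8,
                 head_dim: int = 128, bytes_per_el: int = 2) -> float:
    # factor of 2 for keys and values
    return 2 * ctx * layers * kv_heads * head_dim * bytes_per_el / GIB


if __name__ == "__main__":
    for bpw in (3.0, 4.0, 4.65):  # common GPTQ/EXL2 bit widths
        total = weights_gib(70, bpw) + kv_cache_gib(4096)
        verdict = "should fit" if total < 48 else "will NOT fit"
        print(f"{bpw:.2f} bpw: ~{total:.1f} GiB -> {verdict} in a 48 GB A6000")
```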
Did we use to spell “we” as “wee?”
At the moment my picks haven’t changed, but Wolfram released a good rankings list that makes me want to test Tess-XL-v1.0-120b and Venus-120b.
I’m using lzlv GPTQ via ST’s Default + Alpaca prompt and didn’t have misspelling issues. Wolfram did notice misspelling issues when using the Amy preset (e.g., “sacrficial”), so maybe switch the preset?
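For anyone comparing presets outside of SillyTavern, the Alpaca format in question is roughly the standard template below; ST’s actual Default/Alpaca preset wraps it with its own system prompt, names, and stop strings, so this is a generic sketch rather than ST’s exact output:

```python
# Generic Alpaca-style prompt template (an approximation; SillyTavern's
# preset adds its own system prompt, character names, and stop strings).
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def build_prompt(instruction: str) -> str:
    return ALPACA_TEMPLATE.format(instruction=instruction)


if __name__ == "__main__":
    print(build_prompt("Continue the roleplay as the character described above."))
```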