3 Posts
0 Comments

Joined 11 months ago

Cake day: November 8th, 2023

You are not logged in. If you use a Fediverse account that is able to follow users, you can follow this user.

OverviewCommentsPosts

Covid-Plannedemic_@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 10 months ago

What's the simplest way to run speculative decoding?

0

1

What's the simplest way to run speculative decoding?

Covid-Plannedemic_@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 10 months ago

0

Covid-Plannedemic_@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 10 months ago

Yi-23B-Llama: Distil version of Yi-34B-Llama

1

Yi-23B-Llama: Distil version of Yi-34B-Llama

Covid-Plannedemic_@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 10 months ago

Covid-Plannedemic_@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 10 months ago

Training on the rephrased test set is all you need: 13B models can reach GPT-4 performance in benchmarks with no contamination detectable by traditional methods

1

Training on the rephrased test set is all you need: 13B models can reach GPT-4 performance in benchmarks with no contamination detectable by traditional methods

Covid-Plannedemic_@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 10 months ago