esotericloop@alien.top to LocalLLaMA@poweruser.forum • Questions on Attention Sinks and Their Usage in LLM Models • 1 year ago
See, you’re attending to the initial token across all layers and heads. :P
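(As an aside, the joke points at the real attention-sink observation: early tokens tend to receive a disproportionate share of attention mass across layers and heads. A minimal sketch of how one might check this, assuming the Hugging Face transformers library and GPT-2 purely for illustration; the model choice and the averaging scheme are my assumptions, not anything from the original post.)

```python
# Hypothetical check of the attention-sink effect: measure how much attention
# every layer/head assigns to the very first token of the sequence.
# Model (gpt2) and prompt are illustrative assumptions, not from the post.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
# "eager" attention is requested so that attention weights are returned.
model = AutoModelForCausalLM.from_pretrained("gpt2", attn_implementation="eager")
model.eval()

text = "Attention sinks keep the KV cache stable during streaming generation."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# outputs.attentions is a tuple with one (batch, heads, seq, seq) tensor per layer.
for layer_idx, attn in enumerate(outputs.attentions):
    # Column 0 is the attention each query position pays to the initial token;
    # average over query positions to get a per-head "sink share".
    sink_share = attn[0, :, :, 0].mean(dim=-1)  # shape: (num_heads,)
    print(f"layer {layer_idx:2d}: mean attention on token 0 per head = "
          f"{[round(x, 3) for x in sink_share.tolist()]}")
```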