Ok_Relationship_9879@alien.topB to LocalLLaMA@poweruser.forum · English · 11 months ago

GPT-4's 128K context window tested

This fella tested the new 128K context window and had some interesting findings.

* GPT-4’s recall performance started to degrade above 73K tokens

* Low recall performance was correlated with the fact to be recalled being placed between 7% and 50% document depth

* If the fact was at the beginning of the document, it was recalled regardless of context length
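For anyone who wants to poke at this themselves, here is a rough sketch of how a needle-in-a-haystack grid like this could be set up. It is not the tester's actual code. The names `insert_needle`, `run_grid`, and `ask_model` are made up for illustration, whitespace-separated words stand in for real tokens, and `ask_model` is a placeholder for whatever function sends a prompt to your model and returns its reply.

```python
from typing import Callable, Dict, List, Tuple


def insert_needle(filler: List[str], needle: str, depth_pct: float, context_len: int) -> str:
    """Build a context of `context_len` words with `needle` placed at `depth_pct`.

    Assumption: words approximate tokens. A real test would count tokens
    with the model's own tokenizer.
    """
    needle_words = needle.split()
    room = context_len - len(needle_words)
    if room < 0:
        raise ValueError("context_len is too small to hold the needle")
    # Repeat the filler text until it fills the space left for it.
    body = [filler[i % len(filler)] for i in range(room)]
    # 0% puts the needle at the very start, 100% at the very end.
    cut = round(room * depth_pct / 100)
    return " ".join(body[:cut] + needle_words + body[cut:])


def run_grid(
    ask_model: Callable[[str], str],
    filler: List[str],
    needle: str,
    expected: str,
    lengths: List[int],
    depths: List[float],
) -> Dict[Tuple[int, float], bool]:
    """Try every (context length, depth) pair and record whether the reply contains `expected`."""
    results = {}
    for n in lengths:
        for d in depths:
            prompt = insert_needle(filler, needle, d, n)
            answer = ask_model(prompt)
            # Crude scoring: a substring match on the expected answer.
            results[(n, d)] = expected.lower() in answer.lower()
    return results
```

Plotting `results` as a heatmap of context length against depth would give the same kind of picture as the findings above. The substring check is the weakest part of this sketch, so a stricter grader may be worth swapping in.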

Any thoughts on what OpenAI is doing to its context window behind the scenes? Which process or processes they’re using to expand the context window, for example.

He also says in the comments that at 64K and lower, retrieval was 100%. That’s pretty impressive.

https://x.com/GregKamradt/status/1722386725635580292?s=20

Tiny_Arugula_5648@alien.topB
10 months ago
Their needle-in-a-haystack test isn’t very compelling. Sure, no test is flawless, but with a random out-of-context fact placed at different points in the context window, there are a lot of reasons why the model would fail to retrieve it.