[D] What's this new Q* algorithm in relation to the OpenAI breakthrough?

to4life4@alien.top · 3 years ago

AdEarly832@alien.top · 3 years ago

So, what about https://arxiv.org/pdf/2310.04406v1.pdf ? "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models". To me, it looks very similar. The paper reports 94.4% pass@1 on HumanEval with GPT-4.

FernandoMM1220@alien.top · 3 years ago

I assume it's similar to Q-learning.

The star implies it's a pathfinding algorithm (as in A*), which Q-learning already kind of is.
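
To make the "Q-learning is kind of pathfinding" point concrete, here is a minimal sketch of tabular Q-learning on a toy grid. Everything in it is an assumption for illustration (the grid layout, the -1 step cost, `GAMMA`, `ALPHA`, and the helper names), and it says nothing about what Q* actually is. To keep the run deterministic, it sweeps over every state-action pair instead of sampling episodes with exploration.

```python
# Toy grid: S = start, G = goal, # = wall, . = free cell (hypothetical layout).
GRID = ["S.#.",
        "..#.",
        "....",
        "#..G"]
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
GAMMA = 0.9   # discount factor (assumed value)
ALPHA = 1.0   # full-step learning rate; fine here because moves are deterministic


def find(ch):
    """Return the (row, col) of the first cell containing ch."""
    for r, row in enumerate(GRID):
        c = row.find(ch)
        if c != -1:
            return (r, c)
    raise ValueError(ch)


def step(state, action):
    """Move one cell; stay in place if the move hits a wall or the border."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < len(GRID) and 0 <= c < len(GRID[0])) or GRID[r][c] == "#":
        return state
    return (r, c)


def train(sweeps=50):
    """Apply the Q-learning update to every (state, action) pair, repeatedly."""
    goal = find("G")
    states = [(r, c) for r in range(len(GRID))
              for c in range(len(GRID[0])) if GRID[r][c] != "#"]
    q = {(s, a): 0.0 for s in states for a in range(len(ACTIONS))}
    for _ in range(sweeps):
        for s in states:
            if s == goal:
                continue  # terminal state, nothing to learn
            for a in range(len(ACTIONS)):
                nxt = step(s, ACTIONS[a])
                best_next = 0.0 if nxt == goal else max(
                    q[(nxt, b)] for b in range(len(ACTIONS)))
                target = -1.0 + GAMMA * best_next  # every move costs 1
                q[(s, a)] += ALPHA * (target - q[(s, a)])
    return q


def greedy_path(q, max_steps=50):
    """Follow the highest-valued action from S until G (or a step limit)."""
    state, goal = find("S"), find("G")
    path = [state]
    while state != goal and len(path) <= max_steps:
        a = max(range(len(ACTIONS)), key=lambda i: q[(state, i)])
        state = step(state, ACTIONS[a])
        path.append(state)
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

With a constant step cost, the learned Q-values order states by their distance to the goal, so following the greedy action traces a shortest route. That is the sense in which Q-learning overlaps with pathfinding; unlike A*, it learns those values from repeated updates rather than searching with a heuristic.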

Western-Image7125@alien.top · 3 years ago

“All you need is attention” made me twitch my eyes a little