Handling Generic Queries in RAG Applications

tinyburger1@alien.top · 11 months ago

Handling Generic Queries in RAG Applications

DarthNebo@alien.top · 11 months ago

The way to do this is to generate a bunch of hypothetical questions from the FAQ, index these in the vDB

Then for the user prompt do a two stage inference with very small CTX size which only determines if the user is asking a question related to items specifically mentioned on the FAQ. Then you can retrieve the relevant FAQ section or source document accordingly only if the score is within a threshold