Chunking and storing structured data and vectors for RAG

Smerfj@alien.top · 1 year ago

grumpy_autist@alien.top · 1 year ago

@smerfj - I’m currently researching the same problem. You can find some information in the LlamaIndex project docs. What you probably need is a so-called composite index that combines a vector database with a knowledge graph linking particular bits of knowledge or text paragraphs together. Alternatively, you can try restricting the vector search to chunks computed from one particular document.
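
To make the "restrict the search to one document" idea concrete, here is a minimal pure-Python sketch. It is not the LlamaIndex API; the `Chunk` class, the `search` function, the `doc_id` field, and the toy 2-d vectors are all made up for illustration, and a real setup would use an embedding model plus whatever metadata filter your vector store offers.

```python
import math
from dataclasses import dataclass


@dataclass
class Chunk:
    doc_id: str          # hypothetical metadata field: which document the chunk came from
    text: str
    vector: list         # toy embedding; a real one would come from an embedding model


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search(chunks, query_vec, top_k=3, doc_id=None):
    """Rank chunks by similarity; if doc_id is given, only that document is searched."""
    candidates = [c for c in chunks if doc_id is None or c.doc_id == doc_id]
    ranked = sorted(candidates, key=lambda c: cosine(c.vector, query_vec), reverse=True)
    return ranked[:top_k]


chunks = [
    Chunk("manual_a", "Pump A max pressure is 10 bar", [1.0, 0.0]),
    Chunk("manual_a", "Pump A warranty terms", [0.0, 1.0]),
    Chunk("manual_b", "Pump B max pressure is 16 bar", [0.9, 0.1]),
]

query = [1.0, 0.1]  # pretend this encodes "what is the max pressure?"
anywhere = search(chunks, query, top_k=1)
only_a = search(chunks, query, top_k=1, doc_id="manual_a")
print(anywhere[0].text)
print(only_a[0].text)
```

Without the filter, the closest chunk here comes from the other manual; with `doc_id="manual_a"` the search can only return chunks from the document you actually care about.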

I suspect that knowledge graphs are “the shit” because they let you store and query really small but highly relevant pieces of data without overflowing the LLM context and slowing it down.
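
A rough sketch of what I mean by small, relevant pieces: facts stored as (subject, relation, object) triples, with a lookup that follows links for a limited number of hops and stops at a fact budget so the prompt stays short. The `TinyGraph` class, its method names, and the sample facts are hypothetical and only for illustration; a real project would use a proper graph store.

```python
from collections import defaultdict


class TinyGraph:
    """Toy knowledge graph: subject -> list of (relation, object) pairs."""

    def __init__(self):
        self.edges = defaultdict(list)

    def add(self, subject, relation, obj):
        self.edges[subject].append((relation, obj))

    def facts_about(self, entity, max_hops=1, max_facts=5):
        """Collect facts reachable from entity, breadth-first, within a budget."""
        facts, frontier, seen = [], [entity], {entity}
        for _ in range(max_hops):
            next_frontier = []
            for node in frontier:
                for relation, obj in self.edges.get(node, []):
                    if len(facts) >= max_facts:
                        return facts  # budget reached: keep the LLM context small
                    facts.append(f"{node} {relation} {obj}")
                    if obj not in seen:
                        seen.add(obj)
                        next_frontier.append(obj)
            frontier = next_frontier
        return facts


graph = TinyGraph()
graph.add("Pump A", "has max pressure", "10 bar")
graph.add("Pump A", "is made by", "Acme")
graph.add("Acme", "is based in", "Oslo")
graph.add("Pump B", "has max pressure", "16 bar")

one_hop = graph.facts_about("Pump A", max_hops=1)
two_hops = graph.facts_about("Pump A", max_hops=2)
print(two_hops)
```

The facts returned are a handful of short strings rather than whole paragraphs, which is where the context savings would come from. Unrelated entries such as the "Pump B" fact never reach the prompt.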

Smerfj@alien.top · 1 year ago

Thanks for the pointers. Since I aim to use local models eventually, I’ll take any efficiency I can squeeze out.