Generally if what you want is to impart new knowledge what you want is a embedding.
Assuming it is a large amount of data you will want a vector db.
Using retrieval augmented generation, RAG.
This is better explained by this guy 16 days ago
https://www.reddit.com/r/LocalLLaMA/comments/17qse19/comment/k8e7fvx/
It’s not that CPUs are slow it’s that typically RAM that the CPU is connected to is slow.
That’s why unified memory is fast it’s just faster and connected to the CPU.