And they’d lose out on every single thing they built in-house.
- Their MLOPs and data scraping infra? Gone
- Their results and experiments (like millions of iterations)? Gone
- They would take 1-2 yrs to catch back up and would lose the lead.
Don’t assume Google, Inflection, Anthropic etc. are more than 1 yr behind.
So do engineering innovations make sense for discussions?
I would love for someone to do extensive experimentation on best practices for RAG- including implications on cost, latency and performance (precision), etc.
It’s not the very latest per se, but knowing the differences and tricks to reduce wall time would be amazing.