Apr 11, 2024
Besides cost, the other bottleneck is time: if the model has to process millions of tokens on every request, it will obviously be slower.
Moreover, I feel it's counterintuitive. Which is the better option: passing millions of the SAME words in every API call, or somehow passing them once and then querying them as many times as we want?
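To make the contrast concrete, here is a minimal sketch of the two call patterns. The client class is a stub standing in for an LLM API; it is not any real provider's SDK, and the method names (`complete`, `ingest`) are assumptions made purely for illustration.

```python
# A stub client contrasting "resend the context every call" with "pass it once, query many times".
# Purely hypothetical -- no real provider SDK or caching API is assumed.

class StubLLMClient:
    """Hypothetical client: `complete` would normally hit an LLM API endpoint."""

    def __init__(self) -> None:
        self._cache: dict[str, str] = {}

    def complete(self, prompt: str, context_ref: str | None = None) -> str:
        context = self._cache.get(context_ref, "") if context_ref else ""
        # Cost and latency scale roughly with the total input length processed here.
        return f"[answer derived from {len(context) + len(prompt)} chars of input]"

    def ingest(self, context: str) -> str:
        """One-time upload of a large context; returns a handle for reuse."""
        cache_id = f"ctx-{len(self._cache)}"
        self._cache[cache_id] = context
        return cache_id


LARGE_CONTEXT = "lorem ipsum " * 1_000_000  # stand-in for millions of tokens of documents
client = StubLLMClient()

# Pattern 1: resend the full context on every call -- paid for and reprocessed each time.
answer_1 = client.complete(prompt=LARGE_CONTEXT + "\n\nQ: What changed in v2?")

# Pattern 2: pass the context once, then send only short questions against it.
cache_id = client.ingest(LARGE_CONTEXT)  # one-time cost
answer_2 = client.complete(prompt="Q: What changed in v2?", context_ref=cache_id)
```

In the first pattern every request carries the full context, so both the bill and the latency grow with it on each call; in the second, only the short question is sent per call once the context has been ingested.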