Although, atm I am only using retrieval without any LLM involved. Might try integrating if it significantly improves UX without compromising speeds.