| HN Mirror



	by marcklingen 1003 days ago
	We struggled with this ourselves while building LLM-based products and then open-sourced our observability and monitoring tool [1]. Many teams use it to trace RAG pipelines and agents in production, run custom evals on the production traces (focused on hallucination), and compare metrics across releases or customers. Feel free to DM me if there is something specific you are trying to solve; happy to help. [1] https://github.com/langfuse/langfuse