How effective are LLMs at triaging issues? Has anyone found success using them to find the root cause? I've only been able to triage effectively for toy examples.
Wild Moose just made a blog post[0] about this. They found that putting things into foundation models wasn't cutting it, that you had to have small finely-tuned models along with deterministic processes to use AI for RCA.
LogClaw algorithm is the moat here that flags logs first. Those only flagged usually less than 10% of the logs are analyzed by LLM. LLM is great at finding root cause if the logs are clear and detailed. So the LLM heavily depends on the quality of your logs. So if your logs are rich with info, it will have a better insights at understanding it.
[0] https://www.wildmoose.ai/post/micro-agents-ai-powered-invest...