|
|
|
|
|
by lmeyerov
115 days ago
|
|
My intuition is it comes down to error-correcting codes. We're dealing with lossy systems that get off track, so including parity bits helps. Ex: <message>...</message> helps keep track. Even better? <message78>...</message78>. That's ugly xml, but great for LLMs. Likewise, using standard ontologies for identifiers (ex: we'll do OCSF, AT&CK, & CIM for splunk/kusto in louie.ai), even if they're not formally XML. For all these things... these intuitions need backing by evals in practice, and part of why I begrudgingly flipped from JSON to XML |
|