by stingraycharles 54 days ago
How does it handle “unredaction” in responses? E.g. let’s say the LLM does something with the document. You redacted its input, so it emits redacted content. Now what?
2 comments

The proxy keeps a two-way mapping between each identified piece of PII and its redaction tag, e.g. Jane Doe <-> <PERSON_1>, so the process is reversible: redaction tags in the LLM response are replaced with the original values, and it should feel transparent on the user's end. I'll add a more detailed example to the README to make this clear.
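To make the round trip concrete, here is a rough sketch of what such a mapping could look like. This is not the proxy's actual code: the `RedactionMap` class, its method names, and the `(value, kind)` entity format are all made up for illustration, and PII detection itself is assumed to happen elsewhere.

```python
import re


class RedactionMap:
    """Hypothetical two-way mapping between PII values and placeholder tags."""

    def __init__(self):
        self.value_to_tag = {}
        self.tag_to_value = {}
        self.counters = {}

    def tag_for(self, value, kind):
        # Reuse the same tag when the same value shows up again.
        if value not in self.value_to_tag:
            n = self.counters.get(kind, 0) + 1
            self.counters[kind] = n
            tag = f"<{kind}_{n}>"
            self.value_to_tag[value] = tag
            self.tag_to_value[tag] = value
        return self.value_to_tag[value]

    def redact(self, text, entities):
        # entities: (value, kind) pairs from some PII detector (assumed, not shown).
        # Longer values go first so "Jane Doe" is replaced before "Jane".
        for value, kind in sorted(entities, key=lambda e: -len(e[0])):
            text = text.replace(value, self.tag_for(value, kind))
        return text

    def unredact(self, text):
        # Swap known tags back to the saved values; unknown tags stay as-is.
        return re.sub(
            r"<[A-Z]+_\d+>",
            lambda m: self.tag_to_value.get(m.group(0), m.group(0)),
            text,
        )


mapping = RedactionMap()
prompt = mapping.redact(
    "Email Jane Doe at jane@example.com.",
    [("Jane Doe", "PERSON"), ("jane@example.com", "EMAIL")],
)
reply = mapping.unredact("Sure, I drafted a note to <PERSON_1>.")
```

Here `prompt` is what would be sent to the LLM, and `reply` is what the user would see. A tag the model invents (say `<PERSON_9>`) has no saved value, so this sketch leaves it in place rather than guessing.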
The way I handled it is by giving each redaction tag an ID, which gets translated back to the saved PII in the output.