Hacker News new | ask | show | jobs
by theptip 390 days ago
This is a great case study. I wonder how hard o3 would find it to build a minimal repro for these vulns? This would of course make it easier to identify true positives and discard false positives.

This is I suppose an area where the engineer can apply their expertise to build a validation rig that the LLM may be able to utilize.