|
|
|
|
|
by aksophist
601 days ago
|
|
What is a false positive rate? Is it when the agent falsely passes or falsely “finds a bug”? And regardless of which: why don’t you include the other as a key metric? I’m not aware of any evals or shared metrics. But measuring a testing agents performance seems pretty important. What is your tool’s FPR on your golden suite? |
|