by jeremyloy_wt
114 days ago
> we as humans can guide the LLM toward a rigorous test suite, rather than one that has a lot of "coverage" but doesn't actually provide sound guarantees about behavior. I have a hard enough time getting humans to write tests like this…