Hacker News
user: BalinKing
created: 2017-01-02
karma: 854

Third-year Ph.D. student at CMU, working on programming languages and formal verification; CS undergrad at Caltech (BS '23, Venerable); ex-professional software developer.

https://github.com/jgrosso

submissions:

ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases
2 points | 0 comments