by kristianp
584 days ago
> It keeps complaining that GitHub is spelled like Github, when it isn't. I feel like this is unfair. That's the only thing it got wrong? But we want it to pass all of our evals, even ones that perhaps a dictionary would be better at solving? Or even an LLM augmented with a dictionary.

LLMs have their place, and they will forever change how we think about UX and other things, but we need to realize that you really can't create a public-facing solution without significant safeguards if you don't want egg on your face.