by azakai
46 days ago
A carb-counting app might call these frontier models through their APIs and then run its own analysis on the responses. It could check whether different models agree, or whether repeated calls to the same model agree, and how much variance there is. So it would be more accurate to test the apps rather than the APIs, unless the goal is to warn people who just open ChatGPT and ask there.
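To make that concrete, here is a rough sketch of the kind of agreement check such an app could run. It is only an illustration, not how any real app works: `summarize_estimates`, the `max_cv` threshold, and the input format (a dict of model name to a list of carb estimates in grams from repeated calls) are all assumptions made up for this example, and no actual API is called.

```python
from statistics import mean, stdev


def summarize_estimates(estimates_by_model, max_cv=0.15):
    """Summarize repeated carb estimates (grams) from several models.

    estimates_by_model: hypothetical dict of model name -> list of estimates.
    max_cv: assumed threshold on the cross-model coefficient of variation.
    """
    per_model = {}
    for model, values in estimates_by_model.items():
        per_model[model] = {
            "mean": mean(values),
            # Sample standard deviation needs at least two values.
            "stdev": stdev(values) if len(values) > 1 else 0.0,
        }

    # Compare the models with each other via their mean estimates.
    model_means = [stats["mean"] for stats in per_model.values()]
    overall = mean(model_means)
    spread = stdev(model_means) if len(model_means) > 1 else 0.0

    if overall > 0:
        cv = spread / overall
    else:
        cv = 0.0 if spread == 0 else float("inf")

    return {
        "per_model": per_model,
        "overall_mean": overall,
        "cross_model_cv": cv,
        "models_agree": cv <= max_cv,
    }


if __name__ == "__main__":
    # Made-up numbers, purely to show the call shape.
    print(summarize_estimates({"model_a": [40, 42, 44], "model_b": [41, 43, 45]}))
```

An app doing something like this could decline to show a number, or show a range, when the models disagree, which is behavior you would never see by testing a single raw API call.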

(And of course it would also be far more tedious to submit each picture 500 times manually through an app and log each response by hand than to use a script designed to collect the data automatically, as fast as API rate limits permit.)