by jarulraj
978 days ago
Since we do not have ground truth, we only checked accuracy qualitatively and have no quantitative metrics. We did find a significant drop in accuracy with GPT-3.5 compared to GPT-4. Are you measuring accuracy with data wrangling prompts? Would love to learn more about that.
I'm skeptical of any claim that "A works better than B" without some numbers to back it up.