by whymauri
499 days ago
After prompt optimization with something like DSPy and a good eval set, it's significantly faster and just about as good. Occasionally it gets higher accuracy on held-out data than human labelers who are given a policy or documentation, e.g. for customer support cases.