by ajhai
1206 days ago
Congrats on the launch! I'm glad to see all the tooling coming up in this space. Regarding tests, how do you evaluate the generated completions? Letting users run a set of tests against a prompt and showing the completions for visual inspection is a good start, but IMHO it doesn't scale once the app is in production with a large corpus of tests. Something we are exploring right now is generating a similarity/divergence score between generated completions to make this easier at scale. Disclosure: we are building something very similar at Promptly (https://trypromptly.com), based on our experience using GPT-3 at MakerDojo.
p.s. it's cool to hear from another company that's helping expand this market!
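
To make the similarity/divergence idea above a bit more concrete, here is a rough sketch of one way it could work. This is not how Promptly does it; it just assumes a plain character-level similarity from Python's standard `difflib`, and the names `divergence_score`, `flag_divergent`, and the 0.5 threshold are made up for illustration. A real setup would more likely compare embeddings or use a task-specific metric.

```python
from difflib import SequenceMatcher
from itertools import combinations
from statistics import mean


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] using difflib's ratio."""
    return SequenceMatcher(None, a, b).ratio()


def divergence_score(completions: list[str]) -> float:
    """Mean pairwise dissimilarity; 0.0 means all completions are identical."""
    pairs = list(combinations(completions, 2))
    if not pairs:
        # Zero or one completion: nothing to compare against.
        return 0.0
    return 1.0 - mean(similarity(a, b) for a, b in pairs)


def flag_divergent(results: dict[str, list[str]], threshold: float = 0.5) -> list[str]:
    """Return the names of tests whose completions diverge more than threshold.

    `results` maps a (hypothetical) test name to the completions collected
    for that test across runs or prompt versions.
    """
    return [
        name
        for name, completions in results.items()
        if divergence_score(completions) > threshold
    ]
```

With something like this, only the tests returned by `flag_divergent` would need a human to look at them, instead of eyeballing every completion.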