by timfsu 1226 days ago
This is great, thanks for sharing. I've been pretty reluctant to use davinci in production due to costs, but the other models seem significantly less capable. It seems like there's still a lot to learn about working with these models at scale.
2 comments

Yes, definitely give this a go. For our use case davinci is prohibitively expensive in production, so we literally cannot use it given the number of requests we make. Interestingly, I saw some OpenAI documentation the other day at the YC event which basically said that with a large enough dataset (I recall >= 30k examples), all of the models (yes, even ada) start to perform similarly, so bear that in mind!
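A minimal sketch of the arithmetic behind "prohibitively expensive at our request volume". The prices, model labels, and traffic figures here are placeholders made up for illustration, not real OpenAI rates or anyone's actual usage; plug in the current price sheet and your own numbers.

```python
def monthly_cost(requests_per_day, tokens_per_request, price_per_1k_tokens, days=30):
    """Estimated spend in dollars for one model over `days` days."""
    total_tokens = requests_per_day * tokens_per_request * days
    return total_tokens / 1000 * price_per_1k_tokens


# Placeholder per-1k-token prices (hypothetical, NOT real rates).
PLACEHOLDER_PRICES = {"large_model": 0.02, "small_model": 0.002}

if __name__ == "__main__":
    # Hypothetical workload: 100k requests/day at ~500 tokens each.
    for name, price in PLACEHOLDER_PRICES.items():
        cost = monthly_cost(100_000, 500, price)
        print(f"{name}: ${cost:,.2f} per month")
```

Cost scales linearly with both request volume and price per token, so a model that is 10x cheaper per token is 10x cheaper per month at the same traffic. That gap is the reason to check whether a fine-tuned smaller model is good enough.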
My impression is that the workflow is supposed to be davinci for prototyping, then fine-tuning and downgrading to curie in several steps as you polish prompts and break down tasks (see https://github.com/openai/openai-cookbook/blob/main/techniqu...).

I haven't gotten beyond davinci for prototyping, though. It's cheap enough that a month of tinkering with a chatbot cost $7 in January.