by rsiqueira 1030 days ago
This means the cost would be more than US$ 1 for every 16 user interactions at the fine-tuning prices (per 1K tokens): 16 interactions * ($0.012 * 4 input + $0.016 * 1 output) = US$ 1.02

It is indeed 8x more expensive than the standard gpt-3.5 API. I'm basing the comparison on my own use case, where my users consume about 4k input tokens (context plus chat history) and almost 1k output tokens per interaction.
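
For anyone who wants to plug in their own token counts, here is a minimal sketch of the arithmetic above. The fine-tuned prices are the ones quoted in this comment, per 1K tokens. The standard prices are not taken from a price list; they are derived from the 8x ratio stated here, so treat them as an assumption. The function name and the rounding of "almost 1k" output tokens to 1k are also just for illustration.

```python
# Fine-tuned prices in US$ per 1K tokens, as quoted in the comment.
FINE_TUNED_INPUT = 0.012
FINE_TUNED_OUTPUT = 0.016

# Assumption: standard prices are inferred from the stated 8x ratio.
PRICE_RATIO = 8


def cost_per_interaction(input_price, output_price, input_k=4, output_k=1):
    """Cost in US$ of one interaction; token counts are in thousands."""
    return input_price * input_k + output_price * output_k


fine_tuned = cost_per_interaction(FINE_TUNED_INPUT, FINE_TUNED_OUTPUT)
standard = cost_per_interaction(
    FINE_TUNED_INPUT / PRICE_RATIO, FINE_TUNED_OUTPUT / PRICE_RATIO
)

interactions = 16
print(f"fine-tuned: ${interactions * fine_tuned:.2f}")
print(f"standard:   ${interactions * standard:.2f}")
```

With the defaults (4k input, 1k output), 16 interactions come to about $1.02 fine-tuned versus about $0.13 on the standard API under these assumptions.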