
Before considering cost (and you might've already done this), I'd try running a 5-shot prompt for your use case with GPT-3.5, MPT-30B, and Falcon-40B. That way you can get a sense of how performance compares without having to go through fine-tuning. My guess is that 3.5 might still be significantly better at 5-shot. That said, what you're really comparing is base 3.5 against fine-tuned MPT-30B/Falcon-40B, so for a fairer comparison (until 3.5 fine-tuning is available) you could give the open models more shots as a rough stand-in for fine-tuning: something like 2-shot with 3.5 and 10-shot with Falcon and MPT.
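
To make that concrete, here's a rough sketch of how you might build the uneven-shot prompts. Everything in it is an assumption for illustration: the `build_k_shot_prompt` helper, the `Input:`/`Output:` template, the toy sentiment examples, and the model labels are all made up, and it deliberately stops short of calling any model API, since that part depends on how you're hosting each model.

```python
from typing import Dict, List, Tuple


def build_k_shot_prompt(examples: List[Tuple[str, str]], query: str, k: int) -> str:
    """Build a plain-text prompt from the first k (input, output) pairs plus the query.

    The "Input:/Output:" template is an arbitrary choice, not a required format.
    """
    if k > len(examples):
        raise ValueError(f"asked for {k} shots but only {len(examples)} examples given")
    blocks = [f"Input: {x}\nOutput: {y}" for x, y in examples[:k]]
    # Leave the final output blank so the model completes it.
    blocks.append(f"Input: {query}\nOutput:")
    return "\n\n".join(blocks)


# Hypothetical labelled examples; swap in real ones from your own use case.
EXAMPLES: List[Tuple[str, str]] = [
    ("The battery died after a week.", "negative"),
    ("Setup took two minutes, love it.", "positive"),
    ("Arrived broken and support ignored me.", "negative"),
    ("Does exactly what it says.", "positive"),
    ("Way too loud for a bedroom.", "negative"),
    ("Best purchase I made this year.", "positive"),
    ("The app crashes on every launch.", "negative"),
    ("Solid build and fast shipping.", "positive"),
    ("Stopped charging after a month.", "negative"),
    ("My kids use it every day.", "positive"),
]

# Shot counts from the suggestion above: fewer for 3.5, more for the open models.
# The keys are just labels here, not real API model identifiers.
SHOTS_PER_MODEL: Dict[str, int] = {"gpt-3.5": 2, "mpt-30b": 10, "falcon-40b": 10}

QUERY = "Screen scratches far too easily."

prompts: Dict[str, str] = {
    name: build_k_shot_prompt(EXAMPLES, QUERY, k) for name, k in SHOTS_PER_MODEL.items()
}

if __name__ == "__main__":
    for name, prompt in prompts.items():
        print(f"--- {name} ({SHOTS_PER_MODEL[name]}-shot) ---")
        print(prompt)
        print()
```

You'd then send each prompt to the matching model and score the completions on a held-out set. For the plain 5-shot comparison, set every value in `SHOTS_PER_MODEL` to 5.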