| Every time I take a careful look at someone's handpicked example of GPT getting it right there's at least one serious mistake. > Florence, Italy: Florence is a beautiful city with a rich history and culture. It is only a 90-minute train ride from Rome, making it a convenient stopover point. Florence also has plenty of cafes, co-working spaces, and other amenities that are conducive to a workathon. Mistake: The purpose is to split up a long journey. Florence would split it into a 1.5h day and a 10.5h day. > Marseille, France: Marseille is a bustling port city on the Mediterranean coast of France. It is about a 3.5-hour train ride from Rome and has a vibrant cultural scene and a variety of co-working spaces and cafes. You could spend the day exploring the city's many museums, parks, and markets. Mistake: Factually wrong about the travel time, per Google maps GPT is off by a factor of 3. Mistake: The purpose is to split up a long journey. Marseille would split it into a 10.5h day and a 2h day. Mistake: Forgot about the workathon requirement |
GPT3.5 suggested a list of Florence, Nice, Genoa and Marseille. When I asked it for a breakdown of the travel times, it got them pretty close ("Can you give me rough travel times for each of these options?") given the large variations in travel time depending on specific timing and transfers for several of these.
When I then asked it for somewhere closer to the midway point, it didn't come through (it suggested Pisa, which is about as bad as Florence), Turin, which is an alternative to Genoa, but doesn't fix the imbalance, or Lyon which is about as bad as the Pisa option but in the opposite direction (fast train to Montpellier), but the options also aren't in aggregate much worse.
The problem here is that there aren't as far as I can tell any very balanced options. You can try to do e.g. Ventimiglia or Sanremo (which GPT4 gave when I pressed it on splitting it more evenly), which will give you a more evenly distributed travel time, but because the overall travel time will be longer they're not at all obvious options.
You also seems to focus on catching it out rather than getting a result. If you want precision from the first query, then you need to give far more precise questions.
You need to treat it as a conversation, not a brief to be answered with a report. If you insist on treating it as a brief you will not get good results out of it. Your loss, in that case.
For cases where a simple lookup will work, it's a waste. Just use Google. For cases where there is in fact not a single perfect option, and where you need to weigh pro and con, it works well with the caveat that you do indeed need to be careful and check specifics. The same way I'd check specifics if I had a conversation about this with a friend.
With plugins, so it can verify precise details we can expect a significant leap in capability here.
But even for now, the answers I got to this were ones I was happy with, and another reason for me to use it more.