Hacker News new | ask | show | jobs
by adrianmonk 1159 days ago
The point is that it's a badly-worded question because there are two ways to interpret it.

It's clear (after reading the score's explanation) what the intended interpretation is, which is the one you're describing.

The other interpretation is that the length of the line is set by the government based on how many people they choose to to allow to buy raspberries at a controlled price. Once 100 people are in line, they won't let a 101st person get in line, so it is always 100 people. In this interpretation, a cup of coffee that would normally be worth $1.00 is only valued at $0.75 because what the people are there for is raspberries, not coffee. Some of them may not even like coffee that much.

2 comments

I definitely agree on the vagueness. You have to understand that the 100 is caused by emerging phenomenon, not a population limit or government restrictions - it's really not obvious. You also have to understand that for some reason you actually have to enqueue before the raspberries are distributed (also not obvious).

That said, it doesn't fundamentally help GPT-4. It's too complex for it to get.

PROMPT:

In a country with thousands of people where everyone has identical preferences, the government sells raspberries at a controlled price. To obtain the raspberries, people must line up before the raspberries are sold and the market will process people's orders one at a time. Once the market opens up, additional people are not allowed to enqueue.

Currently, 100 people wait in line every day to buy raspberries.

The government additionally decides to hand out free coffee to a raspberry purchaser (after the raspberries are bought). The coffee costs the government $1 per cup, but the people in line value that coffee at only 75 cents per cup. What is the social cost of providing the coffee?

ANSWER: To calculate the social cost of providing the coffee, we need to consider the value of the coffee to the people and the cost of providing the coffee to the government.

There are 100 people in line who receive free coffee after purchasing raspberries. The cost of providing the coffee to the government is $1 per cup, totaling $100 for all 100 people ($1 * 100).

The people in line, however, value the coffee at only 75 cents per cup. So, the total value of the coffee for the 100 people in line is $75 (0.75 * 100).

The social cost of providing the coffee can be found by subtracting the total value of the coffee to the people from the cost of providing the coffee to the government:

Social cost = Cost to government - Value to people Social cost = $100 - $75 Social cost = $25

Therefore, the social cost of providing the coffee is $25.

I tried pushing GPT-4 a bit:

PROMPT: "Why are you assuming the line stays the same size?"

ANSWER: You're right, I made an incorrect assumption that the line would stay the same size. With the introduction of free coffee, it's possible that more people would be attracted to join the line to buy raspberries and receive the free coffee.

Let's analyze the potential change in the line size.

The value of the free coffee to the people is 75 cents per cup. Assuming people are rational and will join the queue if the value of the free coffee is higher than the cost of waiting in line, we can expect the line to grow until the marginal person values waiting in line at exactly 75 cents.

The social cost of providing the coffee can then be calculated by considering the additional waiting time for the people in line and the cost of providing the coffee to the government.

Unfortunately, without more information about how many new people would join the line and the cost of waiting in line, it is not possible to accurately calculate the social cost of providing the coffee.

Hardly a "0" answer with the hint and in fact GPT-4 comes close, but it isn't able to make the connection to what the equilibrium point must be.

I think with any test there's an element of general test-taking ability required to interpret the question. From context, I think it's clear what the intended interpretation is, and I'd be surprised if any student read the question as saying that the size of the line was fixed by the government.