Hacker News new | ask | show | jobs
by energy123 45 days ago
> All SOTA model providers are losing money.

Source? I only read one article on this topic and they approximated gross margins at 50%.

> When users run Opus, they are essentially renting a GPU cluster worth half a million dollars for a $100/$200 subscription.

They use a large batch size, you're sharing the GPU with many other people.

1 comments

Gross margin calculated on API token pricing with discounted training and hardware deprecation.

I am no so sure about batch sizes, chatgpt napkin calculation for 5T model show 10-300 sessions.