Hacker News new | ask | show | jobs
by ed25519FUUU 1687 days ago
I got a very surprising bill from GCP. I used bigquery to query the google public patent database[1]. I thought I was using enough predicates for it to effectively be a point query (using patent ID, etc).

Turns out each single query was a full table scan of the 4TB dataset and cost about $5 each! I luckily stopped after 4 queries once my curiosity was satisfied, but I could have easily spent hundreds of dollars. It wasn't until some time later I saw my bill and realized what happened.

They did do a "one-time charge reversal" for me. I didn't really feel satisfied with that response. I wanted them to admit such datasets are a huge liability for users.

1. https://github.com/google/patents-public-data

1 comments

They should have just billed you - they reversed the charges (for your own issue) and you aren't happy.

I think folks are used to being pretty catered to.

And yes, being able to play with 4TB databases in the cloud and huge GPU instances is a "liability" - you pay for what you use.

Google actually gives you a lot of data about jobs folks are running - check out

https://cloud.google.com/bigquery/docs/information-schema-jo...

and total bytes processed if you want to do estimates of a per job cost etc.