|
|
|
|
|
by ed25519FUUU
1687 days ago
|
|
I got a very surprising bill from GCP. I used bigquery to query the google public patent database[1]. I thought I was using enough predicates for it to effectively be a point query (using patent ID, etc). Turns out each single query was a full table scan of the 4TB dataset and cost about $5 each! I luckily stopped after 4 queries once my curiosity was satisfied, but I could have easily spent hundreds of dollars. It wasn't until some time later I saw my bill and realized what happened. They did do a "one-time charge reversal" for me. I didn't really feel satisfied with that response. I wanted them to admit such datasets are a huge liability for users. 1. https://github.com/google/patents-public-data |
|
I think folks are used to being pretty catered to.
And yes, being able to play with 4TB databases in the cloud and huge GPU instances is a "liability" - you pay for what you use.
Google actually gives you a lot of data about jobs folks are running - check out
https://cloud.google.com/bigquery/docs/information-schema-jo...
and total bytes processed if you want to do estimates of a per job cost etc.