by httparchive
846 days ago
I used this data as a grad student, back when there was no fee for it, so my main concern is that students will get hit with charges large enough that they can't buy groceries. The website carries the Internet Archive logo, looks like a public resource for researchers, and used to be free to use. My point is that the HTTP Archive should make it clear this is a paid product from Google Cloud, not a "public service".
https://github.com/HTTPArchive/httparchive.org/blob/main/doc...
There are multiple notes about cost. In particular, this one stands out.
> Note: The size of the tables you query are important because BigQuery is billed based on the number of processed data. There is 1TB of processed data included in the free tier, so running a full scan query on one of the larger tables can easily eat up your quota. This is where it becomes important to design queries that process only the data you wish to explore
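One way to act on that note is to ask BigQuery how much data a query would process before actually running it. Below is a minimal sketch, assuming the `google-cloud-bigquery` Python client is installed and credentials are configured. The table name is a hypothetical placeholder, and treating the "1TB" free tier as 1 TiB is my assumption, so check the current pricing page for the exact figure.

```python
# Sketch: estimate a query's processed bytes with a dry run before paying for it.
# Assumption: the free tier quoted in the docs ("1TB") is treated here as 1 TiB.
FREE_TIER_BYTES = 1024 ** 4


def fraction_of_free_tier(bytes_processed: int,
                          quota_bytes: int = FREE_TIER_BYTES) -> float:
    """Return the share of the (assumed) free quota a query would consume."""
    return bytes_processed / quota_bytes


def estimate_bytes(sql: str) -> int:
    """Dry-run the query: BigQuery validates it and reports the bytes it
    would process, without executing it."""
    # Imported lazily so the helper above works without the client installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    config = bigquery.QueryJobConfig(dry_run=True, use_query_cache=False)
    job = client.query(sql, job_config=config)
    return job.total_bytes_processed


if __name__ == "__main__":
    # Hypothetical table name: substitute the table you actually want to query.
    sql = "SELECT url FROM `your-project.your_dataset.your_table` LIMIT 10"
    needed = estimate_bytes(sql)
    print(f"Would process {needed} bytes, "
          f"{fraction_of_free_tier(needed):.1%} of the assumed free quota")
```

Note that `LIMIT` does not generally reduce the bytes scanned. Selecting fewer columns and filtering on a partitioned column are what usually bring the estimate down.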