by ElPeque
540 days ago
Interesting! This is a little different though. This dataset is part of the AWS Open Data program, so it is freely available from S3. By running the API within AWS, you get a massive latency and bandwidth advantage. So with GribStream you are pushing the computation closer to where the big data is and only downloading the small data. GribStream also uses a custom grib2 parser that lets it extract the data in a streaming fashion, using very little memory. It makes a huge difference if you need to extract time series for a handful of coordinates over months at a time. Cheers!
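To make the "streaming fashion, using very little memory" point concrete, here is a rough sketch of walking the messages in a GRIB2 byte stream without ever holding a whole message in memory. This is not GribStream's actual parser, which is not shown in this thread. It relies only on the standard GRIB2 indicator section (16 bytes: the ASCII marker `GRIB`, two reserved bytes, a discipline byte, an edition byte, and an 8-byte big-endian total message length). The names `iter_grib2_messages` and `fake_message` are made up for illustration, and the sketch assumes messages sit back to back with no padding between them.

```python
import io
import struct


def iter_grib2_messages(stream, chunk_size=64 * 1024):
    """Yield (discipline, total_length) for each GRIB2 message in a binary stream.

    Only the 16-byte indicator section is decoded; the rest of each message
    is read and discarded in chunks of at most chunk_size bytes, so memory
    use stays flat regardless of message size.
    Assumption: messages are contiguous (no padding between them).
    """
    while True:
        header = stream.read(16)
        if len(header) < 16:
            return  # clean end of stream (or trailing bytes too short to be a message)
        if header[:4] != b"GRIB":
            raise ValueError("not positioned at a GRIB message boundary")
        discipline, edition = header[6], header[7]
        if edition != 2:
            raise ValueError(f"expected GRIB edition 2, got {edition}")
        (total_length,) = struct.unpack(">Q", header[8:16])

        # Skip the remaining sections in bounded chunks.
        remaining = total_length - 16
        while remaining > 0:
            chunk = stream.read(min(chunk_size, remaining))
            if not chunk:
                raise EOFError("stream ended in the middle of a message")
            remaining -= len(chunk)
        yield discipline, total_length


def fake_message(discipline, payload):
    """Build a synthetic message (hypothetical helper, header plus opaque body)."""
    body = payload + b"7777"  # GRIB2 messages end with the ASCII marker 7777
    header = b"GRIB" + b"\x00\x00" + bytes([discipline, 2])
    return header + struct.pack(">Q", 16 + len(body)) + body


if __name__ == "__main__":
    data = fake_message(0, b"a" * 10) + fake_message(2, b"b" * 100_000)
    for discipline, length in iter_grib2_messages(io.BytesIO(data)):
        print(discipline, length)
```

A real extractor would decode the sections it needs instead of discarding them, but the access pattern is the same: the reader works on any file-like object, including a streamed HTTP or S3 response body, and only a small buffer is live at any time.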
|
How does this compare to using Xarray on a netCDF dataset?