by simonw
463 days ago
|
|
OK the way you're publishing the data with Parquet and making it accessible through DuckDB is spectacular. Your README shows R and Python examples: https://github.com/dfsnow/opentimes?tab=readme-ov-file#using... I got it working with the `duckdb` terminal tool like this:

  INSTALL httpfs;
  LOAD httpfs;
  ATTACH 'https://data.opentimes.org/databases/0.0.1.duckdb' AS opentimes;
  SELECT origin_id, destination_id, duration_sec
  FROM opentimes.public.times
  WHERE version = '0.0.1'
    AND mode = 'car'
    AND year = '2024'
    AND geography = 'tract'
    AND state = '17'
    AND origin_id LIKE '17031%'
  LIMIT 10;
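
For anyone who would rather script this than type it into the terminal, here is a rough Python sketch of the same query. It assumes the third-party `duckdb` package is installed; the helper names `build_times_query` and `fetch_times` are made up for illustration and are not from the OpenTimes README.

```python
def build_times_query(state, origin_prefix, limit=10):
    """Return (sql, params) for the same filters as the terminal query.

    Hypothetical helper: state and origin prefix are bound as parameters,
    the remaining filters are copied verbatim from the SQL in the comment.
    """
    sql = (
        "SELECT origin_id, destination_id, duration_sec "
        "FROM opentimes.public.times "
        "WHERE version = '0.0.1' AND mode = 'car' AND year = '2024' "
        "AND geography = 'tract' AND state = ? AND origin_id LIKE ? "
        f"LIMIT {int(limit)}"
    )
    return sql, [state, origin_prefix + "%"]


def fetch_times(state="17", origin_prefix="17031", limit=10):
    """Attach the remote database and run the query (needs network access)."""
    import duckdb  # third-party: pip install duckdb

    con = duckdb.connect()  # in-memory database
    con.execute("INSTALL httpfs")
    con.execute("LOAD httpfs")
    con.execute(
        "ATTACH 'https://data.opentimes.org/databases/0.0.1.duckdb' AS opentimes"
    )
    sql, params = build_times_query(state, origin_prefix, limit)
    return con.execute(sql, params).fetchall()
```

Calling `fetch_times()` should return a list of `(origin_id, destination_id, duration_sec)` tuples, assuming the remote database is reachable.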
|
|
I've been thinking about how to swap it in as a backend for Datasette (maybe as a plugin?), but it seems inherently riskier, as it needs to at the very least be able to read a folder to list all the CSVs available for my use case. If I could hook that up with its native S3 support I'd be unstoppable (at work).
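
To make the folder-listing part concrete, here is a minimal sketch of what I mean, using only the Python standard library. It lists the CSVs in a local folder and generates one DuckDB view definition per file via DuckDB's `read_csv_auto` table function. The function name `csv_view_statements` is hypothetical, and this is not an existing Datasette plugin API.

```python
from pathlib import Path


def csv_view_statements(folder):
    """Return one CREATE VIEW statement per CSV file found in `folder`.

    Hypothetical helper: the view name is the file name without its
    extension, and quotes are doubled so odd file names stay valid SQL.
    """
    statements = []
    for path in sorted(Path(folder).glob("*.csv")):
        view = path.stem.replace('"', '""')
        file = str(path).replace("'", "''")
        statements.append(
            f"CREATE VIEW \"{view}\" AS SELECT * FROM read_csv_auto('{file}');"
        )
    return statements
```

The statements are only built here, not executed, so whatever wraps this can decide which folder it is allowed to read before handing anything to DuckDB.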