Hacker News new | ask | show | jobs
Show HN: Socrata Roulette – run random SQL on a random government dataset (splitgraph.github.io)
5 points by mildbyte 1284 days ago
1 comments

It's a little slow. Any way you can speed it up?

Always a fan of splitgraph.

It's possible! Currently this is running GROUP BY queries using Socrata's query API on the original government data portal. We're adding the ability to import data from these sources into a columnar format in the future, either into Splitgraph itself or syncing the data out into Seafowl (https://seafowl.io/) which uses Parquet and is much faster.

Technically, the ability is already there (you can add a dataset to Splitgraph and select Socrata as a source if you know the dataset ID), but it's not as turnkey as landing on a dataset page and clicking a button. More to come!