by alexatkeplar 4980 days ago
At SnowPlow we break home-grown analytics down into five stages:

     Track -> Collect -> ETL -> Store -> Analyse
SnowPlow straddles all five stages - and the data is in non-proprietary formats throughout.

Have a look at https://github.com/snowplow/snowplow if you want to find out more...

1 comments

Nice, I've gotten close to building this exact data flow from scratch and it was not fun.

You're just missing step 6 ("-> Present"). I'd build a really simple jQuery DataTables template that presents the output of a Hive query, if only to have some screenshots for the non-technical people involved in the decision.
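
To make the "Present" idea concrete, here is a rough sketch of the first half of such a template: turning query output into a plain HTML table that DataTables could then enhance. It assumes the Hive query output has been saved as tab-separated text with a header row as the first line. The function name `hive_tsv_to_html_table` and the table id `hive-results` are made up for illustration and are not part of SnowPlow or Hive.

```python
import html


def hive_tsv_to_html_table(tsv_text, table_id="hive-results"):
    """Render tab-separated query output as an HTML table.

    Assumption: the first non-empty line holds the column names.
    The <thead>/<tbody> split is there because DataTables expects it.
    """
    # Split into rows and cells, skipping blank lines.
    rows = [line.split("\t") for line in tsv_text.splitlines() if line]
    safe_id = html.escape(table_id)
    if not rows:
        return f'<table id="{safe_id}"></table>'

    header, body = rows[0], rows[1:]
    # Escape every cell so query results cannot inject markup.
    head_cells = "".join(f"<th>{html.escape(cell)}</th>" for cell in header)
    body_rows = "".join(
        "<tr>" + "".join(f"<td>{html.escape(cell)}</td>" for cell in row) + "</tr>"
        for row in body
    )
    return (
        f'<table id="{safe_id}">'
        f"<thead><tr>{head_cells}</tr></thead>"
        f"<tbody>{body_rows}</tbody>"
        "</table>"
    )


if __name__ == "__main__":
    # Hypothetical sample data, not real query results.
    sample = "page\tviews\n/home\t42\n/pricing\t7\n"
    print(hive_tsv_to_html_table(sample))
```

The page that embeds this table would still need to load jQuery and DataTables itself and then call something like `$('#hive-results').DataTable()` to get sorting and paging.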

Thanks jparker, and you're totally right - we are still missing 6. -> Present :-) We will get round to it - it should be easier once we have connected Infobright as a storage option alongside Hive...