Hacker News new | ask | show | jobs
by khaledh 4496 days ago
Very good article. It aligns with our envisioned architecture for our next-gen analytics platform.

So far our decision is to keep the raw events in Cassandra, and pre-aggregate most data for fast reads. Just wondering about your decision to not store raw events in Cassandra, and use raw files for that, and using Cassandra only for storing Hadoop analysis results. Do you think this decision may affect you later if you ever decide to support real-time analytics?