|
|
|
|
|
by cle
3410 days ago
|
|
I think this is only half the story. There are other use cases other than mere size that can necessitate "big data" solutions. E.g. timeliness, resiliency, maintainability... If you are building production data processing systems that have constraints on data size, latency, resiliency, scheduling, dependency management, etc., you might be better off with a "big data" system. Even if the data could all fit on a beefy box. This was a painful lesson for me to learn. |
|
That leaves resiliency and etc. I can't answer etc., but—how is resilience helped with a big data solution? That seems like Lampson's distributed system: more machines, but you need k-of-n, k>1. Better to just mirror to two machines with the data in RAM.