by Cloven
5132 days ago
"At scale in production, adding a new node took days to complete all the handoffs..." That's a bit of a head-scratcher. What is happening during those 'days', and what is the primary limiting factor? I keep meaning to get into Riak, but then stuff like this keeps popping up, where the system has crazy moments that are impossible to reason about coherently.
The operational challenge I infer from this is that they waited to add that node until they really needed it, expecting that adding the node would give them quick relief from their scaling issue. Instead, they got relief a few days later, once the node was fully integrated.
Solution: don't wait until the last minute to add nodes.