by nano_o
2288 days ago
Great idea and great work! A couple of nitpicks: it would be nice to see what happens when the leader fails. Optimizing for the case of a stable leader might have an impact on recovery time. Another important aspect of fault tolerance is whether you can really survive any minority crashing. For example, if only the strictly necessary number of nodes keeps up with the leader and most of those crash, the system will have a really hard time recovering: the slow nodes have accumulated a backlog and now need to catch up before the system can continue operating. A performance number that does not take those things into account may not be very realistic. Nevertheless, the idea is pretty good.
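To make the backlog concern concrete, here is a rough sketch of the arithmetic. It assumes a generic majority-quorum replicated log and that the leader itself survives. The names `quorum_size`, `catch_up_needed`, and `match_index` are hypothetical and not taken from the project under discussion. It counts how many log entries must still be shipped to live followers before a majority is fully caught up again.

```python
from typing import Dict, Optional, Set


def quorum_size(n: int) -> int:
    """Majority quorum for a cluster of n nodes."""
    return n // 2 + 1


def catch_up_needed(
    leader_index: int,
    match_index: Dict[str, int],
    crashed: Set[str],
) -> Optional[int]:
    """Entries to ship before a majority (leader included) is caught up.

    match_index maps each follower to the last log index it has stored.
    Assumes the leader is alive. Returns None if too few nodes are alive.
    """
    n = len(match_index) + 1                # followers plus the leader
    followers_needed = quorum_size(n) - 1   # the leader counts toward the quorum
    # Lag of every live follower, smallest lag first.
    live_lags = sorted(
        leader_index - idx
        for node, idx in match_index.items()
        if node not in crashed
    )
    if len(live_lags) < followers_needed:
        return None                         # no majority is alive
    # Cheapest option: catch up the least-lagging live followers.
    return sum(live_lags[:followers_needed])


# Hypothetical 5-node cluster: only "a" and "b" keep up with the leader.
followers = {"a": 1_000_000, "b": 1_000_000, "c": 200_000, "d": 100_000}
healthy = catch_up_needed(1_000_000, followers, crashed=set())
degraded = catch_up_needed(1_000_000, followers, crashed={"a", "b"})
print(healthy, degraded)
```

With these made-up numbers, nothing needs to be shipped while "a" and "b" are alive, but once that minority crashes, both slow nodes must be brought fully up to date before a majority holds the whole log again. That is the recovery cost a steady-state throughput number would not show.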