|
|
|
|
|
by moatra
4036 days ago
|
|
if you want to build a new derived datastore, you can just start a new consumer
at the beginning of the log, and churn through the history of the log, applying
all the writes to your datastore.
For high-throughput environments with lots of appends to the log, how do you get around the ever-increasing size of your log file? I know the traditional answer is to take a periodic snapshot and compact the previous data, but is that built in to tools like Kafka? |
|
The other cleanup policy is to just have a retention time. After X minutes/days/weeks segments of the log are simply deleted.