Hacker News new | ask | show | jobs
by OkPin 336 days ago
Fascinating root cause: a missing CHECK_FOR_INTERRUPTS() left pg_create_logical_replication_slot basically unkillable on hot standbys. Simple fix, but huge impact.

Makes me wonder how many other Postgres processes might ignore SIGTERM under edge conditions. Do folks here test signal handling during failovers or replica maintenance? Seems like something worth adding to chaos tests.

1 comments

Is that the root cause though? Why did this process get into a position that it needed to be manually killed/restarted, isn't that the real problem?