by bmurphy1976 458 days ago
This is part of hardening a system for production, which means making it easy to operate:

* Make sure the logs are actionable

* Make sure the logs are readable

* Make sure you are collecting operational metrics

* Make sure the metrics are useful

* Make sure you have error handling

* Make sure you have alerting

* Make sure you document how to support the application

* Make sure you have knobs and levers you can pull in an emergency to change the system's behavior or fix things

* Make sure you have vetted the system for security issues

etc.
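
To make a few of these concrete, here is a rough Python sketch of how actionable logs, a basic metric, error handling, and an emergency lever might fit together in one request handler. Everything in it is made up for illustration: the `FEATURE_BILLING` environment variable, the `METRICS` counter, and the `handle_request` / `charge` names are assumptions, not anything from a particular system, and a real service would likely export metrics and read its switches from somewhere more robust than the environment.

```python
import json
import logging
import os
from collections import Counter

# Hypothetical in-process metrics store; a real system would export these.
METRICS = Counter()


def feature_enabled(name, default=True):
    """Emergency lever: read an on/off switch from the environment at call time."""
    raw = os.environ.get(f"FEATURE_{name.upper()}")
    if raw is None:
        return default
    return raw.strip().lower() in ("1", "true", "on", "yes")


def log_event(logger, level, message, **context):
    """Emit one JSON line so the log is readable and machine-parseable."""
    line = json.dumps({"message": message, **context}, sort_keys=True)
    logger.log(level, line)
    return line


def handle_request(logger, order_id, charge):
    """Process one request with error handling, a metric, and an actionable log."""
    if not feature_enabled("billing"):
        # The lever was pulled: skip the risky work but keep counting.
        METRICS["requests.skipped"] += 1
        return "skipped"
    try:
        charge(order_id)
    except Exception as exc:
        METRICS["requests.failed"] += 1
        # Actionable: says what failed, for which order, and what to do next.
        log_event(
            logger,
            logging.ERROR,
            "charge failed",
            order_id=order_id,
            error=str(exc),
            action="retry the order or set FEATURE_BILLING=off",
        )
        return "failed"
    METRICS["requests.ok"] += 1
    return "ok"
```

The point of the `action` field is that whoever is on call can read the log line and know what to try next, and the switch is checked on every call so flipping it does not need a redeploy (assuming something can change the process environment or you swap in a config store).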