by torqu3e
1068 days ago
TLDR - use Grafana Cloud and configure the agent to do what you need. There should be almost no need to check SSH access separately if the machine is up, but yes, you can add a process check for sshd.

Bit of a longer dump for an answer...

I've been running services at home for way too long now, and my day job is running the cloud for large businesses, so I'm tired of special-snowflake, hand-rolled solutions that are prone to breaking. My idea of well-run infrastructure is that I should be able to walk away from it, hands off, for extended periods of time and it just keeps running and self-heals. To that effect, this is what I've come down to:

- 3-node k8s cluster on a bunch of random mini NUCs
- GitHub repo with Helm charts/manifests hooked to ArgoCD (runs on the cluster) for CD. All changes get checked into the repo and auto-deploy to the cluster. https://www.argonaut.dev/ is an option if you don't want to run your own ArgoCD
- Grafana Cloud free tier for shipping machine/cluster metrics and monitoring. Alerting is via Pushover; you can use email too
- Uptime Kuma on a fly.io free instance for inbound HTTP/DNS/cert etc. monitoring from the outside, hooked to Techulus Push/Pushover for alerting
- Terraform for DNS/Cloudflare management, run via the Terraform Cloud offering so those deploys are automated too
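If you do still want an explicit SSH check on top of the agent's process monitoring, here is a rough sketch of one way to do it from another box. This is not part of the setup above, just an illustration: it opens a TCP connection and looks for the `SSH-` identification string that an SSH server sends first. The function names and the `nuc1.lan` hostname are made up for the example.

```python
import socket
from typing import Optional


def ssh_banner(host: str, port: int = 22, timeout: float = 3.0) -> Optional[str]:
    """Return the server's SSH identification line, or None if it can't be read.

    Hypothetical helper: connects over TCP and reads the first line the
    server sends, which for an SSH server starts with "SSH-".
    """
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.settimeout(timeout)
            data = sock.recv(256)
    except OSError:
        # Covers connection refused, unreachable host, and timeouts.
        return None
    first_line = data.split(b"\n", 1)[0].strip().decode("ascii", errors="replace")
    return first_line if first_line.startswith("SSH-") else None


def is_ssh_up(host: str, port: int = 22, timeout: float = 3.0) -> bool:
    """True if something that looks like an SSH server answered on host:port."""
    return ssh_banner(host, port, timeout) is not None


if __name__ == "__main__":
    # "nuc1.lan" is a placeholder hostname, swap in your own machine.
    print("ssh up" if is_ssh_up("nuc1.lan") else "ssh down")
```

This only tells you the daemon is accepting connections, not that logins work, so treat it as a complement to the process check rather than a replacement.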