by seiji
5513 days ago
Almost. They all suffer from inflexible data models (how many are using SQL and rrdtool in that matrix?), death at scale (what happens when you go from 10 to 500 to 3000 to 10000 servers? Across three data centers? With transient Xen servers?), lack of UI design, and lack of community involvement (because of that massive comparison grid). That's not even considering broken models for alerting (a server dies at 3am. Should it page you? No, because you have 200 of the same servers in the same role and the load balancer will compensate), historical logging, trending, and event aggregation/dedup. It's a big problem, but making flexible tools from the ground up with sensible defaults can go a long way towards helping everyone. We can fix this. We can make the redis of monitoring.
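To make the 3am example concrete, here is a rough sketch of role-aware paging. It is not taken from any existing tool: `RoleStatus`, `should_page`, and the 10% threshold are all made-up names and numbers for illustration, and a real system would pull the counts from its own inventory.

```python
from dataclasses import dataclass


@dataclass
class RoleStatus:
    """Hypothetical summary of one server role (e.g. all web frontends)."""
    role: str
    total: int  # servers assigned to this role
    down: int   # servers currently failing their health check


def should_page(status: RoleStatus, max_down_fraction: float = 0.1) -> bool:
    """Page a human only when enough of a role is down to threaten capacity.

    max_down_fraction is an assumed, tunable threshold, not a recommendation.
    """
    if status.total == 0:
        # Nothing is registered for this role at all, so treat it as urgent.
        return True
    return status.down / status.total > max_down_fraction


# One of 200 identical web servers dies: record it, but let people sleep.
print(should_page(RoleStatus("web", total=200, down=1)))  # False
# The only database server dies: wake someone up.
print(should_page(RoleStatus("db", total=1, down=1)))     # True
```

The point is only that the paging decision looks at the role as a whole rather than at the single host that failed; the failed host can still be logged or handed to automation.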
From what I've read, alerting is quite flexible and customisable. I agree that a server dying at 3am is not that important, but it should still be a valid alert that triggers an API call to the host to start a new server (not sure if that's possible; alerts seem to be shell based).
Here's what your offering needs to top, since it's what I've been considering lately: http://www.zabbix.com/features.php
I'd love more competition, but even you point out that community involvement suffers because there's already so much competition. That will soon include your offering.
Disclaimer: I started researching server monitoring a few weeks ago and have been considering Zabbix since last week.
Edit: The one issue I find is the lack of web transaction monitoring like New Relic has: http://newrelic.com/features/performance-analytics
You can see it in action with average response time: http://blog.tstmedia.com/news_article/show/86942
As far as I know, no open network monitoring service offers it.