We are currently planning to upgrade our monitoring system (Opsview) due to scalability issues, and I was wondering what you would recommend for monitoring 5000 hosts and 35000 services. We would like to use a monitoring system that
Another option to consider is check_mk, which we use in our shop. The check_mk developers wrapped a layer of Python around the Nagios notification engine, so you no longer need to deal with the tedium of Nagios config files: they are all generated automatically, either from actions in a GUI or from a single configuration file.

Check_mk has a benchmarking page showing that it scales to more hosts than you specified: https://mathias-kettner.de/checkmk_checkmk_benchmarks.html

For an architecture diagram of how it uses Nagios for alerting and Python for scanning: http://mathias-kettner.com/check_mk.html

If an included agent isn't available, new ones can be written. We are quite happy with the solution. We've replaced Cricket, Cacti, Nagios, Observium, and a little bit of SmokePing with this almost all-in-one tool.
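To give a rough idea of what writing your own check can look like, here is a minimal sketch of a check_mk "local check" in Python. It assumes the local check convention, where a script in the agent's local directory prints one line per service in the form "status name perfdata text" (status 0 = OK, 1 = WARN, 2 = CRIT). The service name Root_Disk_Usage, the thresholds, and the function names are made up for illustration, and the install path depends on your agent setup.

```python
#!/usr/bin/env python3
"""Sketch of a check_mk local check reporting root filesystem usage."""
import shutil


def local_check_line(name, used_pct, warn=80.0, crit=90.0):
    """Format one local-check line: '<status> <name> <perfdata> <text>'."""
    if used_pct >= crit:
        status = 2  # CRIT
    elif used_pct >= warn:
        status = 1  # WARN
    else:
        status = 0  # OK
    label = ("OK", "WARN", "CRIT")[status]
    # Perfdata uses the Nagios-style 'metric=value;warn;crit' form.
    perfdata = f"used_pct={used_pct:.1f};{warn};{crit}"
    return f"{status} {name} {perfdata} {label} - {used_pct:.1f}% used"


def disk_used_pct(path="/"):
    """Return the percentage of the filesystem at 'path' that is in use."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.used / usage.total


if __name__ == "__main__":
    # Hypothetical service name; it must not contain spaces.
    print(local_check_line("Root_Disk_Usage", disk_used_pct("/")))
```

Once the script is executable and sitting in the agent's local directory, the new service should be picked up on the next service discovery for that host.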