Infrastructure monitoring is essential to proper system administration. Infrastructure consists of many components, starting with the basics such as server hardware, network components, storage, uninterruptible power supplies and backup equipment, but also environmental factors such as temperature and air humidity in server rooms. On top of the hardware, software is the next layer, starting with operating systems: resources such as CPU, memory, storage and network, and the state of essential services. The next level is applications and chained applications, for example the mail queues of a mail server or the databases on a SQL Server instance.
Today, many monitoring products are available; many of these are tailored to special purposes and don’t cover all aspects of an Infrastructure.
I have noticed in recent years that many organizations are searching for a single product that can monitor as many components as possible. Because most organizations nowadays run their workloads on virtualized infrastructure, this poses an extra challenge for the monitoring software.
Years ago, when I worked as a sysadmin and virtualization was at a very early stage, my favorite monitoring setup was a combination of the following products: Nagios, Cacti and an advanced syslog server.
Nagios has its origins as an open source product. Due to its open source nature, many products have been derived from Nagios, for example OP5, Opsview, Groundwork, Check_MK and many more.
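To give a feel for what a single check looks like in this family of tools, here is a minimal sketch in Python of a disk usage check. It assumes the exit code convention used by Nagios plugins (0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN). The function names and the 80/90 percent thresholds are my own illustrative choices, not something taken from Nagios or Check_MK.

```python
import shutil

# Exit codes following the Nagios plugin convention.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL"}


def classify_usage(used_pct, warn=80.0, crit=90.0):
    """Map a usage percentage to a status code (thresholds are hypothetical)."""
    if used_pct >= crit:
        return CRITICAL
    if used_pct >= warn:
        return WARNING
    return OK


def check_disk(path="/", warn=80.0, crit=90.0):
    """Return (status, message) for the filesystem that contains `path`."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        # The path is missing or unreadable, so no verdict is possible.
        return UNKNOWN, f"DISK UNKNOWN - {exc}"
    if usage.total == 0:
        return UNKNOWN, f"DISK UNKNOWN - {path} reports zero total size"
    used_pct = 100.0 * usage.used / usage.total
    status = classify_usage(used_pct, warn, crit)
    return status, f"DISK {LABELS[status]} - {used_pct:.1f}% used on {path}"
```

A real plugin would print the message on a single line and exit with the status code, so that the monitoring server can pick up both.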
Some time ago Check_MK caught my attention, mainly because of its versatility and its ability to monitor diverse infrastructure including VMware vSphere.
In this and subsequent blog posts, I will investigate the potential of Check_MK, in particular the possibilities to monitor vSphere and other VMware products. Since many excellent articles have already been written about installation and configuration, I will not repeat these steps. Where needed, I will include references to the articles that I used to build my proof of concept and describe the issues that I encountered.