Change is a constant in today’s increasingly complex IT
environment. There are internal and external demands
for speed, availability and reliability — not to mention the
continual pressure to do more with less.
Infrastructure availability and reliability are not
competitive advantages. Instead, they are essential
to the survival of any Web business. Web pages
that won’t load, unresponsive applications and
back-end systems that fail to complete transactions
cost website owners in revenue, productivity
and reputation. Even a delay by one second in
page-loading time can double your bounce rate.
Ouch, that hurts!
IT staffs are getting smaller while infrastructures
are now more complex. In addition to servers and
databases, your infrastructure includes networking
gear, applications and storage. Cloud computing,
virtualization, clustering and failover
systems add to the complexity.
Outages and service degradation can happen
at multiple points, leading to downtime, resource
inefficiency and lost revenue. With so many devices,
your entire technology stack must be monitored
to assure the continuity of your Web
business.
IT monitoring solutions are a critical aide in
optimizing today’s infrastructure performance. A
monitoring system can proactively discover and
help accelerate remediation of problems such as
erratic network connections, slow webpage
downloads, database inefficiencies and email delivery
problems.
However, installing, configuring and maintaining
monitoring tools can be daunting, especially
for organizations with small IT staffs. For
each device in your stack, you have to know the
following things:
• what needs to be monitored
• how to collect the right data
• how to display that data in a useful, visual form
• at what point alert thresholds should be set
• how to set up escalating alert chains
These are onerous requirements for teams
already overburdened with critical IT operational
issues.
Bad medicine
While monitoring is essential, choosing the
wrong system or approach can add to rather than
alleviate the pain. Consider the experience of a
social media website called Ranker.com.
Ranker makes it easy to create lists on any
topic imaginable. Visitors then vote and rank their
favorites. The company launched in 2009, was associated
with the hot tech startup accelerator
LaunchPad LA, and now has 3.5 million unique
visitors a month.
Ranker had a monitoring system in place, but
the rapid growth it was undergoing was not helping
the website meet its business goals for availability
and performance. There were two major
issues, according to Premesh Purayil, Ranker’s V.P.
of engineering.
“Whenever we added a new server to our environment,
we had to jump through hoops to add
the new server into monitoring,” he says.
There was also the issue of outages not always
being detected and escalated appropriately,
causing significant impact to Ranker’s traffic
and growth.
“There were times where the site might be
down for an hour before someone realized it,” Purayil
explains.
Ranker’s IT monitoring challenges are not
unique. In most organizations, a “sys-admin” lacks
the time and the operational experience to figure
out what matters on a new device or software.
With the fast rate of technology changes, not
everyone can be expert in everything in their infrastructure.
In addition, even the best sys-admins
are human. For example, in the heat of resolving
a technical issue, the sys-admin may create a new
storage volume.
With higher-priority issues to solve, they intend
to add the new object into monitoring but
often forget. Six months later, the volume fills
up, all hell breaks loose, and everyone wonders
why the monitoring system didn’t warn them.
Even when problems are identified, without
good historical data, the engineers could easily
devote from a full day of developer time to trace
the issue.
Perhaps a cure?
A monitoring system adds value if the benefits of
the system are greater than the acquisition, implementation
and operational costs. Generally, the
benefits obtained from a good monitoring system
will be increased uptime, fewer service degradations
and the elimination of downtime.
Two notable outcomes are reduced staff time
to investigate performance issues and better utilization
of, and transparency into, IT performance
intelligence throughout the organization.
A monitoring system that is easy to set up and
does not require a dedicated sys-admin improves
the efficiency of an IT organization. The ideal
monitoring system UI should be intuitive, empowering
the entire IT staff with real-time visibility
into IT performance in order to quickly
identify and respond to issues.
Your monitoring system should collect a wide
variety of data, from all devices in the IT infrastructure,
and that information should be able to
be assessed in a way that provides operational efficiency
to the business. For example, the overall
performance of device types, of data center, collocated
or hosted IT and cloud services, should
be accessible by those who need access to the intelligence.
The information should be made visible
in graphical format and allow for automated
reporting that makes it easy to spot trends, correlate
issues and be utilized for decision-making
throughout the IT organization.
Your selection criteria should be driven
by business values, except for specific technical requirements
such as the ability to monitor a specific
protocol. Once you’ve identified monitoring-solution
options, weigh the benefits against the acquisition,
plus implementation, plus the operational
ownership costs of each option.
After the assessment and discussions, however,
your selection of a monitoring system comes
down to one simple statement: Forget about features.
Pick the monitoring system that adds the
most value to your business.
About the Author: kevin McGibben, CEO, Logic Monitor. Kevin brings strong global experience in market strategy and channel implementation. He previously founded 32 South to assist mobile and new-media technology companies in strategically expanding into international markets. Kevin also founded firms in the VOIP, consumer products and Web 2.0 industries. His experience includes international marketing, channel development and general management positions with technology companies Fujistu, TEKELEC and CIDCO.