Brownouts Vs. Blackouts.

NetworksFirst.com has recently created an online “Impact of Network Downtime” calculator, which you can use to estimate how much money it would cost if your network went down.  It makes a compelling case for fault management and worrying about outages.

However, the cost of poor application performance is harder to quantify – or at least, requires more sophisticated tools and data – than the cost of fault.  That may be the reason that many companies still consider fault management, and not performance management, to be the core responsibility of the IT team. Our most recent research conducted with Ashton, Metzler & Associates bears this out:

Fifty percent of respondents indicated that they measure and report on the mean time to repair (MTTR) for a network or application outage. However, only thirty percent confirmed they actually measure and report on the MTTR for degraded application performance, revealing a continuing legacy of fault and availability management over performance management.

As technology has improved, fault performance problems have, for the most part, been solved.  It’s no longer a distinguishing feature for a network service provider to promise 99.999% uptime.  The next big challenge is maintaining good performance throughout the network.

But in many ways, it’s a hard sell, because unlike a fault cost calculator, it’s difficult to show you exactly why you need performance management tools until you have the more nuanced calculation of what poor performance costs your business. What’s the difference in employee productivity when an application is 10% slower, 50% slower?

These types of metrics have typically been calculated for customer facing applications like Web retailers, but getting the data for internal IT users has been far less popular since it’s considered a soft cost in some arenas. But it really starts to add up if you pay attention.

One NetQoS customer said their typical critical business application “brownout” (before deploying NetQoS products) cost them $6000 per hour and they had about 20 of these per year, each taking about six hours to isolate and resolve. That’s $720k gone per year due to poor application performance ($6k * 6 hours * 20 events = $720k/year). True, the brownout costs less per hour than most estimates you see for out-and-out downtime, but they occur a lot more frequently.

It took some investigation and understanding on the customer end to establish the value of different applications, who was using them, and then run the numbers, but now they have some idea of the cost of all of those shades of gray between up and down and this helps them justify their investments in technology and process improvements to reduce the brownouts as well as the blackouts.

This is why vendors, such as ourselves, are willing to come out and have a conversation and demo with your company.

But even so, consider this idea as an inaccurate but useful shorthand in the form of a Zen koan: If the network is so slow that nothing gets done, is it any different than if the network were down all together?  And what is the difference between a network down for half a day than a network that takes twice as long to get anything done for a full day?

And if a computer goes down in the woods, but no one receives an error message, did it really have an error at all? And what is the sound of one router crashing?

, , ,

No comments yet.

Leave a Reply