Cloud SLAs: Does Your Deal Leave You Exposed?
The potential for cloud service outages raises a basic, but important, question: Would a more stringent SLA have helped your organization avoid problems?
Of course, an SLA can do nothing to prevent an outage caused by a force majeure event, such as Hurricane Isaac. Rather, an SLA is a promise. If a cloud vendor fails to live up to that promise, there are certain, often exclusive, remedies available to the customer, usually in the form of service credits. Sometimes, if negotiated into the SLA, you'll even have right to terminate the cloud services agreements. For example, this right of termination might be applicable if your vendor fails to provide services consistent with those promises over a certain agreed-upon period of time, such as three consecutive months, or four (non-consecutive) months in any six-month period, or six months in any 12-month period.
Many cloud vendors will not offer an SLA. Most will provide one if asked. Some will agree to negotiate the SLA, if only along the edges. No reputable vendor will agree to rip up its SLA and accept one proposed by any one customer. If there will be any changes, they will be financial (e.g., a bigger service credit) and not operational. The vendor will cite administrative reasons why proposed variations will not be accommodated. It will also point to the prospect of reputational damage as a further motivator to provide good service.
Cloud SLAs: Beware The Litany of Exceptions
SLAs are subject to a litany of exceptions, including force majeure. Force majeure events excuse performance failures. As a customer, you'd be able to ask for some modification of the force majeure exception. For example, your SLA can state that force majeure can only be used as an excuse for service failure if the outage could not have been prevented by the vendor taking precautionary measures. These precautionary measures could include having in place a backup generator, or perhaps even a backup to the backup, accompanied by periodic testing of one or both generators. In practice, few vendors would agree to anything so intrusive. If they did, as a customer you would probably wonder about their ability to deliver. Further, SLAs are not really intended to address force majeure, but rather outages due to poor management or technical failures.
Some observers posit that a cloud vendor that offers redundant data centers - in locations not affected by a particular force majeure event -- may alleviate, if not preclude, an outage of cloud services. Perhaps. But keep in mind that cloud providers often make such backup options available only for an additional cost. Redundant data centers mean redundant costs for you.
For some cloud services, especially Infrastructure-as-a-Service (IaaS), such redundancies are particularly attractive to customers because your costs can be tied to actual usage. But even if there is another data center in a different region, there is still the question of whether your cloud service provider can assure seamless failover in the event of an outage.
For other cloud services - especially Software-as-a-Service (SaaS) -- true redundancy becomes significantly more complicated. The application (including any customizations to it), and underlying infrastructure would need to be fully replicated.
In either case, however, the location of the redundant data center can be a major sticking point. There are a variety of reasons - including export controls, intellectual property protection and regulatory compliance - why you might not want your data transferred outside the United States. If your cloud provider can't assure you that the redundant data center is U.S.-based, then be prepared to use another vendor. Otherwise, you'll be running the risk of degraded availability due to a lack of data center redundancy where you need it.
In fairness to cloud vendors, your risks of outages do not magically disappear if you decide to keep running your enterprise software on-premise or in your own data centers. In fact, in many cases, it's questionable whether an organization's internal team can address such risks better than experienced cloud vendors.
Certainly, the use-case matters. For example, if internal email is down for a period of time, It's inconvenient but unlikely to have significant impact on your revenues.. On the other hand, if the e-commerce Website that accounts for the vast majority of your company's sales goes down, revenue and brand impact will be immediate and damaging. In the latter case, you'll want to carefully consider whether to host that Website yourself, especially if you have a proven disaster recovery plan and considerable resources.