I contend that Software As a Service offerings are not less reliable than on premise solutions as measured by uptime. As an example lets compare the following scenarios
Scenario 1 : SaaS
Lets assume that a SaaS provider has 1,000 customers. Lets assume that this provider has a track record of 1 severe outage every year and the outage lasts 30 minutes on average. All customers are impacted since the outage is severe. Therefore every customer experiences 30 minutes of downtime every year.
Scenario 2: On Premise
Lets assume that the 1,000 customers have an on premise system (same or similar solution in complexity/offering) to the SaaS scenario. Following are the questions that need to be answered
- What is the likelihood that a particular customers on premise system will experience a similarly severe outage as the SaaS provider?. Lets say that one believes it to be 0.50. So a system is half as likely to go down if on premise versus in the cloud. So 500 of the 1000 customers will experience the outage.
- Now here is the key : Even if you believe the above to be true, what is the likelihood that the on premise IT support staff will be trained in high availability and have failover mechanisms(other than a restart), have a NOC etc for the on premise solution(s) - in order to bring up the system within 30 minutes? I would contend that this will become a multi hour outage as the large majority of the internal IT staff are just not trained to respond to emergencies. In addition they just do not have the tools at their disposal to deal with severe outages such as data center failovers, redirecting traffic etc. Also they do not know the product intimately as the SaaS provider.
So what does this all mean?
- It means that even if you believe that SaaS offerings may be less reliable one better take into account the notification times and restore times into account when comparing the two options.
- Measure the operational capabilities of your organization. It is very likely the emergency response procedures for the on premise solution have not been defined or practiced.
- Even if you think that the On premise system will be 2 times more reliable than the SaaS offering, that advantage means nothing if your restore time is more than twice as long as the restore time of the SaaS provider.
- Uptime should be considered within the context of an organizations own operational abilities.
The above quantified
Following are the key parameters that need to be thought through and measured over a defined period of time.
(Probability of Severe Outage) * (Notification Time + Restore Time) = Expected Downtime
Plugging in numbers one may get
(1) * (1 min + 29 min) = 30 minutes. - For SaaS
(0.5)*(20 min + 180 min) = 100 minutes – For On Premise
The above example shows that an on premise system is more reliable than a SaaS offering but has lower uptime. Even if one gives the on premise systems the edge in reliability(debatable), that does not automatically mean that this will result in higher uptime. Restore times(failovers) are the key drivers for high uptime.
Filed under: Software As A Service Tagged: | ha, operations, saas
Hey!!! Welcome to the blogging world. First post!
It’s not the quantifiable numbers when talking to folks who are considering SAAS, it is the feeling of loss of control.
Hi Alan,
I agree with you. SAAS gets a bad name due to this perception issue. Some people would rather have downtime that they are directly responsible for than having downtime via a SAAS provider, even if the downtime experienced is less with the SAAS provider. Gradually the rational heads will win out. Over time the uptime delta between on premise and SAAS solutions will grow to a level where more and more businesses will make the jump. In addition, as this availability delta continues to grow the cost for on premise solutions to keep up will become prohibitive. A results oriented management will be able to move past this feeling of loss of control.
Who controls who? I would say, a machine that is hanging and forces you to wake up in the middle of the night so you can recycle it is controlling you… not the other way around