High Availability With A Hosted SaaS Solution

by | Jul 15 | CAKE News

In a real-time, extremely demanding industry like ours, time very truly is money. For CAKE, a service interruption would not only carry the universal reputation degradation that would affect any other online service, but would also manifest in quantifiable dollars lost.

So, what are some of the ways modern “always on” solutions are architected? Let’s take a look at some of the considerations of creating a highly available online service from scratch.

At the bottom of our stack we have the actual server and network hardware the software runs on. A very common way of providing both fault-tolerance and the ability to scale horizontally is the use of a load balancer. These devices sit in between your users and your webservers. They distribute incoming requests to any one of several servers that are equipped to field that request. For example, let’s say you host a website and you would like to ensure you can continue to serve your customers even if an entire web server goes offline. A load balancer can automatically detect if a web server has failed and will no longer route incoming requests to it. This is seamless to your users so your service continues uninterrupted. One thing to consider here is how much capacity you have in the way of available servers if one or more fail. If you had four web servers handling all the load for your web service and one fails unexpectedly, are the remaining three capable of handling the entire load? How quickly can you scale back out?

Taking another step back, what would happen if the data center your servers are located in experiences a major environmental or network related outage? All your meticulous planning, server redundancy, and load balancing goes right out the window if your users aren’t able to reach any of it. Geographic separation is another important piece of the puzzle. Ideally, you would have your entire redundant server stack architecture replicated in at least one other data center.

A commonly overlooked aspect of this part of the design is just how much geographic separation is enough. There have been many instances where solutions are deployed in different locations in the same city or state. This makes it easier for management if you need staff to physically service both data centers but it isn’t quite as resilient as one might think. While rare, it is not unheard of for an upstream network provider to experience a major regional problem. This can and has resulted in multiple, seemingly separate data centers being unreachable simultaneously. Additionally, the same capacity issue arises. If both data centers were sharing the load for your service and one is suddenly not available, will the doubled demand on the remaining site effectively overload it?

Now let’s take a look at the top layer. You’ve got your multiple data centers full of load balancers and redundant servers and everything is working great. Your next decision is what type of failover model to use. There are a few strategies to choose from that would be appropriate here. The most common in this use case would be active-active and active-passive (warm stand-by). In an active-active scenario, we would have all of our data centers participating in serving up our product. As the name suggests in the other scenario, we would have our other data center running in stand-by mode with no traffic going to it. If there was a problem, we could change our DNS records to point traffic to the other data center instead.

The option we will go with here is an active-active solution. This offers us an additional feature that comes in really handy in a latency sensitive product like a tracking solution. If we have both of our data centers fielding requests, we can actually route end users to the data center closest to them. There are a number of enterprise level DNS providers that offer this latency based routing feature. When someone attempts to access one of our URLs, the DNS service looks at their IP address and then answers their request with the IP address of our site closest to them. Another really cool technology we can leverage here is the ability for the DNS service to monitor your data centers health. Just like the load balancer can detect failed web servers and stop routing to them, the DNS service can detect failed data centers, and stop sending traffic there.

As you can tell by this high-level overview, there is an ever-evolving landscape of new technologies and techniques. Like any good service provider, at CAKE we love to stay abreast of the best and most reliable way of providing the best-in-class service to our customers!

Happy architecting!

Author

Garth Harris

Garth Harris

As COO of the Affiliate Marketing Group, Garth's focus is to drive growth and adoption through the marketing and product teams while providing excellent customer service within the onboarding and support departments. During his 15 years in the affiliate marketing industry, Garth has held a wide range of roles, from client services manager to senior director of product engineering and, most recently, general manager at CAKE. These experiences have helped him build a deep understanding of his customers and their businesses. Outside of work, Garth is happiest behind a grill or at the beach with his family.

Related Articles

2025 Year in review - CAKE Product Updates
Dec 09 2025

Year in Review: 2025 Product Updates

As we wrap up the year, we want to take a moment to thank our...
Batch Processing
Nov 06 2025

Streamline Link Generation and Data Cleanup with New Batch Actions in CAKE

New in CAKE: Two bulk actions, Batch Link Generation and Batch Data...
Saas customer support
Aug 14 2025

Experience Excellence With CAKE’s Top-Tier SaaS Customer Success

Have you ever encountered an issue with a tech provider or software...
Better Together: CAKE and TUNE Align Under One Unified Affiliate Marketing Vision
Jul 29 2025

Better Together: CAKE and TUNE Align Under One Unified Affiliate Marketing Vision

CAKE and TUNE, the most trusted platforms in the industry, have come...
CAKE SOC 2 Type 2 and SOC 1 Type 2 Compliance for 2025
Apr 07 2025

CAKE Achieves SOC 2 Type 2 and SOC 1 Type 2 Certification

We are proud to announce CAKE has successfully achieved its Service...
Domains Management
Sep 04 2024

Domains Management Simplified: Introducing CAKE’s Latest Innovation

  In today’s fast-paced landscape of digital marketing, staying...
New ideas
Jun 03 2024

Discover What’s New in CAKE

CAKE is packed with powerful features designed for you to remain...
James Boden
May 23 2023

Spotlight Q&A With James Boden, CAKE’s Commercial Director (EMEA, MENA, APAC)

In this spotlight Q&A we’re delighted to introduce James Boden,...
Mar 14 2023

Introducing CAKE Pay – A Better Way to Pay Partners

We're thrilled to offer a new, fully integrated and automated...
Jan 11 2023

2022 – A Year in Review

Looking back on 2022, we are both overwhelmed with gratitude for our...