Over half of all internet traffic is now carried by CDNs (Content Delivery Networks)1, and many more companies are considering a CDN. In particular, online video, rich web media, file downloads, gaming, and e-commerce content can benefit from a CDN. And the stakes are high. Today’s digital audience will abandon a video that rebuffers 3 times, a website that takes more than 3 seconds to load, a file download that takes too long to complete.
CDN delivery improves performance, but a CDN is only as a strong as its weakest link. In many CDN environments, CDN origin storage is the weakest link, a critical but underappreciated element of the content delivery chain. The right origin storage can dramatically improve response time, reliability, overall cost and management overhead.
This paper introduces the basic concepts, explores challenges, and outlines best practices for CDN origin storage. It illuminates ways that origin storage can have a significant positive or negative impact on online delivery performance and therefore on user experience. It also explores impact on business operations, staff resources, and capital or operational expenses.
The open internet by itself could not handle the demands of today’s traffic, amounting to billions of video streams, rich web pages, file downloads, game updates, online shopping pageviews, and more every day. Enter content delivery networks, which make the modern internet possible by improving the performance and reliability of digital delivery.
CDNs use edge caching of popular content—replicating popular assets to multiple locations around the edges of the Internet. By storing content in multiple geographically-distributed locations, users can access content from a site that is close to them, rather than needing to go to a single location that may be far away.
The fastest digital content delivery response generally takes place on what is called a cache hit. Cache hit means the asset is found in edge cache and can be retrieved and delivered quickly without having to go back to the original storage location of the content. As shown in Fig. 1: Delivery on Cache Hit, an audience member requests an asset (1), the CDN finds the asset in edge cache and delivers it (2).
Figure 1: Delivery on Cache Hit.
Requested asset is found in edge cache.
On cache miss, the CDN needs to retrieve the asset from origin storage. As shown in Fig. 2: Delivery on Cache Miss, an audience member requests an asset that is not in edge cache (1). The CDN requests the asset from origin storage (dotted line below 2). Origin storage processes the request and serves the asset to the CDN (solid line below 2). Then the CDN can deliver it to the audience member (3).
Figure 2: Delivery on Cache Miss with External Origin Storage.
Requested asset is not found in edge cache.
Note: Fig. 2 shows origin storage that is external to the CDN, requiring request and delivery to traverse the open internet.
All told, there are billions of cache misses per day.
Some companies spread their content delivery across multiple CDN vendors, a practice known as multi-CDN. However, the probability of a cache miss is generally higher in multi-CDN environments. As content is divided across multiple CDNs, a given asset is less likely to be found in a given vendor’s cache. Origin storage is a more important factor in multi-CDN environments. Best practices become more of a challenge because of this diverse nature.
Origin storage is a critical component of any CDN infrastructure, but it can be complex to efficiently deploy and maintain. Depending on how it’s implemented, origin storage can cost time, resources, and money while delivering poor customer experience. Fortunately, if you know the secrets it’s easy to do right.
According to Gartner, “Talent has now been recognized globally as the single biggest issue standing in the way of CIOs achieving their objectives.”2
If that’s the case in your organization, it’s critical that you deploy your talent for the highest benefit. Cost-benefit decisions often hinge on what may be unique advantages for your business vs. what may be available on the open market. It makes sense to develop unique capabilities. However in the case of CDN origin storage, some companies select general-purpose storage, then end up spending significant resources on application-specific development, management and optimization.
One of Gartner’s recommendations is “considering customers, citizens, vendors and partners as extensions (and digital accelerators) of the talent platform”. A vendor skilled in CDN best practices can automate key tasks associated with CDN setup, content upload, content distribution, management and delivery. Benefits of such automation include more efficient use of staff time and resources, and higher quality delivery performance to users.
Uploading large amounts of content into storage can be time-consuming and expensive. Certain limitations can make the overall process worse. If upload sites do not have high-speed access and are far from the storage location, upload times can increase exponentially. If high-speed file transfer protocols to upload content to the storage are limited or not available, workflow productivity and success rate can suffer. Impact can include additional staff overhead, lengthy delays before content is available, failed uploads, and higher costs of bandwidth and fees.
The highest efficiency upload uses high-speed links to a nearby location and an optimized file transfer protocol. Many organizations want to upload from multiple locations using different protocols. A solution that can operate at scale is required.
The estimated cost of an unplanned outage is $9,000 per minute3. Businesses are in a constant struggle to avoid such an outage.
In early 2017, a cloud storage outage cost an estimated $150 million to the S&P 500 alone4. The total impact including small and medium business was far higher. Even though this outage took place in a single location in the Eastern US, the impact was global. However, the losses and the global impact were preventable. The companies impacted by this outage relied on a single storage location—a single point of failure. When that storage location became unavailable, the businesses’ digital services lost key functionality. With no backup or failover in place, the service outage became catastrophic.
Without redundancy, if one storage location fails, the entire content delivery value chain goes down.
Eliminating a single point of failure—and the risks associated—requires the practices of redundancy, automatic failover, and in the case of content delivery, geographic distribution. These are well known, tried-and-true practices, but they may not be easy to implement without significant expenditure of resource. Selecting a proven, tested solution with built-in redundancy can save organizations manpower and time, while providing exceptional user experience to the desired audiences.
Storage failover needs to be rapid, reliable, and robust. If it takes your team significant time and effort to detect the issue and implement the failover procedures, the impact to your business could be significant.
Where is your audience located? Whether you’re targeting a limited geography or a global audience, the location of origin storage matters. Latency routinely jumps an order of magnitude from in-region to intercontinental—from tens of milliseconds to hundreds5. Throughput can also be affected: “with standard TCP connections the round trip time between client and server (i.e. ping time) imposes an upper limit on maximum throughput”6. So, geographical placement of origin storage is an important factor in delivering a high-quality user experience.
If you’re relying on a single origin storage location or region, it could be halfway around the world from much of your audience. In addition to sheer distance, both the request and the retrieval may be hampered by internet traffic routing, network hops, time spent waiting in router queues, and other delays. Delivery will be sluggish at best, and may experience unacceptable delay or failure.
Placing content close to your audience is a fundamental CDN concept for improving performance. Best practices for origin storage adopt this same concept, and replicate content in multiple geographic locations that are close to the audience that will be requesting the content. Ideally, this replication should be automated to minimize manual intervention and the possibility of errors. These practices provide better response time and availability while minimizing overhead.
Abandon rates are a KPI for most digital businesses. 78% of people will stop watching online video if it rebuffers three times. Similar stats exist for file downloads, web online shopping, gaming, and other payloads. Slow retrieval from origin storage is a frequent contributor to slow performance.
For both signaling and content planes, latency becomes cumulative. Delay can be introduced by server load, congestion on the open internet, traversing multiple networks, authentication, device queue latency, reroutes, and outages. With each link or handoff, compounding impacts can hamper user experience. In addition, management complexity, service issues, and expense can also be affected.
To minimize abandon rates, recognize the impact that origin storage can have. Plan to access content from origin storage with minimal technology fragmentation.
Migrating content from one storage platform to another can consume significant resources. It’s not just the cost of the migration itself, it’s also the time and effort involved in managing migration. You may have a large library of videos, many files to deliver, a website with many assets, a lot of items in your online store, or a lot of files in your gaming system. How do you know which ones to move and in what order? The staff time drain can be significant, involving data gathering, traffic reporting and analysis, determining which assets to move and in what order, setting up the process, managing the migration, developing necessary scripts or other code, and monitoring progress—all with manual operations and oversight involved.
Best practices for streamlining content migration center around improving efficiency via automation. In migrating content, automation can eliminate the overhead involved, especially if intelligent processes can be applied.
It could cost an organization $500,000 to implement multi-region stability with high availability and best practices7. For example, a large cloud storage provider offers a detailed document with general design principles, best practices and guidance for architecting solutions on their platform. But in the end, you need to spend your resources to design, architect, build, test, and deploy an origin storage solution.
The opportunity cost may be even more significant than the cost of the storage. Designing, building, deploying and managing a custom CDN storage consumes significant resources. Those are resources that won’t be available for higher-value projects that set you apart from your competition, achieve key business objectives, and improve your efficiency.
Improved total cost of ownership, time to market, delivery quality, reliability, and customer satisfaction can all result from deploying a proven solution. In addition, the freedom to allocate resources to higher-value projects converts opportunity cost into opportunity. Finally, where customization is required, it’s faster, more efficient and more reliable to customize a proven platform than to build a custom solution from scratch.
Limelight Origin Storage Services are optimized for the unique needs of CDN origin storage. Process automation saves staff time and resources and reduces errors. Delivery automation ensures exceptional user experience, with best-of-breed performance and resiliency on origin storage retrieval. Tight integration with the Limelight Orchestrate Platform, including PoP collocation and private fiber network transport, combines with intelligent best-site serving to provide the highest possible performance on cache miss.
Limelight automates key operations specific to the unique needs of CDN origin storage, so you can spend your time and resources on higher value tasks that are unique to your business.
At no extra cost, Limelight applies high-availability best practices to both populating content and delivering it.
Placing content closer to the audience is a primary way CDNs accelerate content delivery. Limelight extends this concept to origin storage, with a menu of regional replication policies to choose from. Choose policies based on where your target users are concentrated, and Limelight does the rest, distributing content automatically to the appropriate worldwide storage locations.
Tight integration of origin storage with the content delivery infrastructure improves delivery performance. Limelight Origin Storage is collocated in many of the same Points of Presence as the edge devices that are delivering content. Our private fiber backbone network transfers content faster than the open internet, whether for replication or delivery.
Figure 3: Delivery on Cache Miss with Limelight Origin Storage.
Limelight’s technology stack is platform-agnostic. Unlike other vendors, Limelight doesn’t employ custom scripting languages, non-standard authentication modules, or other tricks to lock you in to a vendor. Limelight views a customer relationship as a partnership.
Limelight’s unique Intelligent Ingest automates the migration of content from current storage to Limelight’s high-performance Origin Storage via HTTP. Intelligent Ingest has two modes, which can be combined for flexibility.
In addition to Intelligent Ingest that automates content migration, Limelight offers worldwide upload locations and multiple upload methods, all at no extra cost. Upload methods include a web-based GUI, Limelight’s API which supports multi-part uploads, Aspera, and legacy protocols including FTP, SCP, SFTP, RSYNC, and FTP-SSL.
Simply upload your content to a nearby location. Limelight automatically replicates the content based on your policies. Your content is transferred across Limelight’s private fiber backbone to Origin Storage locations that are close to your users. No matter where your content is created, Limelight makes it fast and easy to automatically position content in storage locations close to where it will be delivered.
Limelight has made a significant investment in developing, testing, and optimizing the world’s leading CDN origin storage system. Limelight Origin Storage is a proven turnkey solution that can be easily configured to support any content delivery workflow. And if you have complex workflow challenges, Limelight’s Advanced Services team can become a valuable extension of your team. Limelight’s experts can analyze your existing workflow, optimize your storage configuration, and train your team—reducing costs to you and accelerating your time to market.
For those companies that are using or considering deploying multiple CDN’s, Limelight has the ideal origin storage solution. Origin storage is even more important in a multi-CDN environment, because there’s a significantly higher chance of cache miss on a given request. Limelight Origin Storage can be used in a multi-CDN workflow as the origin storage for all of your CDNs. With Limelight Origin Storage, you get the benefits of workflow automation and flexibility, high performance and availability at global scale, and a private fiber network for transfer and handoff to over 1,000 peering partners for delivery to other CDN’s.
Origin storage is a critical yet often underappreciated element of the CDN value chain. The quality of content delivery and user experience—the reason companies use a CDN in the first place—can suffer greatly due to poorly optimized origin storage. Typical cloud storage is not optimized for content delivery workflows. Developing these critical CDN capabilities on top of general-purpose storage requires significant time and effort that can drain staff and financial resources that could be much better spent elsewhere.
Limelight Origin Storage Services are optimized for the unique requirements of global content delivery through the Limelight Orchestrate Platform or a multi-CDN environments. They can save resources while providing exceptional audience experience. The solution integrates best practices, including CDN-focused automation, redundancy, geographic content distribution, automatic best-site serving with failover, automated migration, and tight integration with delivery infrastructure.
Limelight Networks has worked with many of the largest brands in the world to implement CDN origin storage. Whether you currently use Limelight CDN or not, we invite you to learn how you could gain significant efficiency and performance with Limelight Origin Storage by signing up for a free trial.
The Limelight Orchestrate Platform is built upon a global, private backbone network with the speed, capacity, and availability to deliver the experiences today’s audiences demand. This industry-leading Platform includes integrated content delivery, web acceleration, origin storage, video management, cloud security, and support services. The unique combination of global private infrastructure, advanced software, and expert services surpasses other CDNs, to enable today’s and tomorrow’s workflows and put audience experience first.
1Cisco white paper "The Zettabyte Era: Trends and Analysis", June 2017
4Estimate by cyber-risk startup Cyence Inc.
5Global ping statistics from wondernetwork.com
6"The Surprising Effect of Distance on Download Speed", httpwatch.com referencing Aladdin Nassar