Blockbuster Success: Scaling a Streaming Platform with Reliability and RedundancyDownload Case Study
All images provided by Kanopy.
Dave Barney, Chief Technology Officer, Kanopy
“We’re rapidly accelerating the amount of content we onboard. Had reliability not become an issue, cost containment very quickly would have. Backblaze and the Bandwidth Alliance helped us attain both.”
Kanopy, a video streaming platform, serves more than 4,000 libraries and 45 million library patrons worldwide. They used an all-in-one video processing, storage, and delivery provider to manage 25,000+ videos, but reliability became an issue after missing files led to service disruptions. They needed cost-effective origin storage and safe backups they could count on.
After moving to Backblaze B2 Cloud Storage in less than three days via our cloud to cloud migration service, Kanopy unlocked a partner network that allowed them to use best-of-breed providers. Now, films are transferred from Kanopy’s video processing infrastructure directly to Backblaze B2 where they’re then served to users through Cloudflare’s content delivery network.
As a key part of their new tech stack, Backblaze B2 gives Kanopy the ability to scale efficiently and rapidly accelerate content onboarding. Backblaze B2’s reliability means Kanopy can provide a seamless customer experience. Taken together, Backblaze allows Kanopy to deepen their value proposition by providing thoughtful films their millions of users can’t find elsewhere.
Founded in 2008, Kanopy is a video streaming platform providing academic institutions and public libraries with essential films that foster learning and conversation. Today, they serve more than 4,000 library customers, providing free, on-demand access to video content for 45 million patrons worldwide. Their mission is to ensure that everyone has access to enriching films that bridge cultural boundaries, spark discussion, and expand worldviews.
We Have a Reliability Problem
Dave Barney, Chief Technology Officer at Kanopy, often describes the company as “Netflix for libraries.” It’s an easy comparison, until you realize Kanopy has 25,000+ titles under management, many thousands more than Netflix. That extraordinary depth is their key value proposition. Where Netflix entertains the masses with content that appeals to the biggest audiences, Kanopy serves the professor who needs a hard to find 1960s title for their existentialism in French New Wave cinema class, or the film buff looking for a long-lost favorite.
Rather than compete with Netflix, Kanopy pursued a blue ocean market in academic and, later, public libraries. The libraries pay for the service, offering patrons free access to films that can’t be found anywhere else in the world. “On the public library side, we regularly hear from users saying, ‘I can’t believe you have this film’ or ‘I haven’t seen this film in 30 years,’” Barney noted. Unlike Netflix, when a title’s popularity dips, Kanopy doesn’t immediately retire it. Instead, they’ve ramped up content onboarding, adding the equivalent of Netflix’s entire library, if not more, every year, with plans to continue expanding.
That growth made the reliability and cost of their storage solution all the more important. “We don’t have deep pockets like Netflix, even though we have to keep a lot more data spinning. With all that content, reliable and inexpensive storage are two very important things from an operations and cost-efficiency standpoint,” Barney explained. When customers started complaining about reliability issues, the development team knew that they had to make a change to their legacy storage and distribution infrastructure if they wanted their expansion plans to succeed.
"We don't have deep pockets like Netflix, even though we have to keep a lot more data spinning. With all that content, reliable and inexpensive storage are two very important things from an operations and cost-efficiency standpoint."
Dave Barney, Chief Technology Officer, Kanopy
What We’ve Got Here Is a Failure to Back Up
Founded in 2008 in Australia, Kanopy’s early delivery model paralleled Netflix—they shipped videos on DVDs. As technology evolved, they relocated their headquarters to San Francisco and transitioned to streaming video, engaging an all-in-one video processing, storage, and content delivery network (CDN) provider to manage their media.
“It was a trusted provider, so nothing should have gone wrong,” Pierre-Antoine Tible, Lead Video Software Engineer, said. “With time, we discovered things can go wrong.”
In order for a film to be streamed without delays or buffering, it must first be transcoded, or broken up into smaller, compressed files known as “chunks.” A feature-length film may translate to thousands of five to 10-second chunks, and losing just one can cause playback issues that disrupt the customer viewing experience.
“We weren’t able to recover the files [our legacy provider] lost, so it was disabling HD for the films,” Tible acknowledged. “On top of that, if we passively checked the CDN, it said the file was there. We had no way to discover missing files unless a user complained.” Once they discovered a missing file, finding the original source file and transcoding it again took time and resources. That’s if they could find it at all—one thing their legacy CDN didn’t provide was backups. If the file couldn’t be located in their primary storage, it was gone.
“It was very bad in terms of engineering, provisioning, and user experience,” Tible reported. For a video streaming service of Kanopy’s scale, a poor user experience was not acceptable. Nor was operating without a solid backup plan. They resolved to look for better cloud partners to increase reliability and redundancy.
The Beginning of a Beautiful Friendship:
Kanopy + Backblaze
First, Kanopy had to find a way to move their data out of their legacy provider and make it S3 compatible in order to structure their tech stack the way they wanted. Their legacy CDN used its own storage type, so Kanopy’s development team wrote a script themselves to move the data to AWS, where they planned to set up their video processing infrastructure.
While AWS works well for compute resources like transcoding and encryption, Kanopy’s AWS bills would skyrocket if they used Amazon S3 for origin storage. They stream out 15,000+ unique titles each month, or roughly half of their total collection, which would incur massive egress fees through Amazon S3. It was not an option for storing production data long term, but making their data S3 compatible allowed them to consider other S3-compatible providers.
Prioritizing reliability above all else, Kanopy tested a few solutions for origin storage, including Backblaze B2 Cloud Storage and IBM. IBM also offered S3 compatible APIs, so the data would have been easy to move, but using IBM for storage meant implementing a CDN they didn’t have experience with. “We were already using Cloudflare, so it was configured for production. We didn’t want to use something wholly new,” Tible said.
After running a proof of concept, Backblaze proved more reliable and gave them the ability to use their preferred CDN, Cloudflare, to continue delivering content around the globe. Kanopy completed the migration of production data from Amazon S3 to Backblaze B2 using Cloud to Cloud Migration, moving 150 TB in less than three days and discovering a few final missing files in the process.
The Partner Network That Dreams Are Made Of
Kanopy’s vendor-agnostic, multi-cloud tech stack provides them the flexibility to structure the most cost-effective workflows. They use IBM Aspera to receive videos from contract suppliers like Paramount or HBO onto their video processing infrastructure on AWS. There, they transcode and encrypt the files for digital rights management purposes, generating upwards of 10,000 chunks per film in some cases. They send the transcoded chunks from AWS to their Backblaze B2 origin storage using Flexify.IO, a data transfer partner, and save an archive copy on Amazon Deep Glacier, giving them much-needed redundancy in addition to their origin store copy on Backblaze B2.
Since Backblaze is already configured with Cloudflare, data stored on Backblaze B2 automatically feeds into Cloudflare's CDN. This allows content to live in Backblaze B2, yet be delivered with Cloudflare’s low latency and high speed. “Video needs to have a quick startup time,” Tible said. “With Backblaze over Cloudflare, we didn’t have any issues.”
The Backblaze partner ecosystem allowed Kanopy’s development team to structure their tech stack using integrated, best-of-breed providers. “Backblaze’s partners were important,” Tible confirmed. Barney added, “The Backblaze partners, especially the integration with Cloudflare, were fortunate wins.”
"Video needs to have a quick startup time. With Backblaze over Cloudflare, we didn’t have any issues."
Pierre-Antoine Tible, Lead Video Software Engineer, Kanopy
Here’s Looking at the Next 25,000 Movies, Kid
“Backblaze was a natural fit for us because it was more reliable and less expensive, enabling us to get ahead of where the business is going.”
Dave Barney, Chief Technology Officer, Kanopy
Though reliability was a main driver in Kanopy’s efforts to overhaul their tech stack, looking forward, Barney sees their new system enabling Kanopy to scale even further. Benefitting from the Bandwidth Alliance, Kanopy pays $0 in egress fees to transfer data from Backblaze to Cloudflare. The Bandwidth Alliance is a group of cloud and networking companies that discount or waive data transfer fees for shared customers, of which Backblaze and Cloudflare are both members. Backblaze also offers simple pricing that allows Kanopy to achieve redundancy below their expected budget. “We’re rapidly accelerating the amount of content we onboard. Had reliability not become an issue, cost containment very quickly would have. Backblaze and the Bandwidth Alliance helped us attain both,” Barney attested. “Backblaze was a natural fit for us because it was more reliable and less expensive, enabling us to get ahead of where the business is going.”
Adding even more titles deepens their value proposition and enables them to support independent film. As independent cinemas continue to close across the world, their service fills that void and gives independent filmmakers a channel for reaching consumers. How many titles will Kanopy add in the future? 10,000? 25,000? Every movie ever made? The great thing about their service is that you can find out. You just need to renew your library card.
Backblaze B2 Cloud Storage is purpose-built for ease, instant access to files and data, and infinite scalability. It seamlessly supports workflows via hundreds of third-party software integrations, or through direct APIs and CLIs. At only $5/TB of object storage per month (a fraction of the cost of the largest solutions), Backblaze B2 is priced so users don’t have to choose between what matters and what doesn’t when it comes to backup, archive, data organization, workflow streamlining, and more.