“What is Cloud Storage?” is a series of posts for business leaders and entrepreneurs interested in using the cloud to scale their business without wasting millions of capital on infrastructure. Despite being relatively simple, information about “the Cloud” is overrun with frustratingly unclear jargon. These guides aim to cut through the hype and give you the information you need to convince stakeholders that scaling your business in the cloud is an essential next step. We hope you find them useful, and will let us know what additional insight you might need.” –The Editors
What is Cloud Storage?
The term cloud storage is used in popular media as though everyone knows exactly what it means. But ask anyone to list and define the different types of cloud storage, and you’re likely to get some blank looks. This, despite the fact that understanding the different varieties of cloud is essential to deciding the storage solution that is right for your business. With that in mind, we’re going to take a look at the three primary types of cloud storage. Below, you’ll find a quick and easy-to-use field guide to the three basic types of cloud storage being used today: object, file, and block storage.
Your business has certain needs. Maybe you need to share content with a number of contributors, producers, or editors based around the world. Or possibly you have a complex and huge database of sales metrics you need to process or manipulate that is stressing your on-site capabilities. Or you might simply have data you need to archive. Regardless, while people are quick to recommend the “cloud” for any business scenario involving data, you need to know which cloud is right for your scenario. Read on to learn more.
The Three Types of Cloud Storage
In cloud storage, the definition of an ‘object’ is pretty simple. Object storage is literally some assemblage of data with one unique identifier and an infinite amount of metadata.
Maybe that doesn’t sound so simple after all. Let’s break it down into its components to try to make it clearer.
This data that makes up an object could be anything—an advertising jingle’s audio file, the photo album from your company party, a 300-page software manual, or simply a related grouping of bits and bytes.
When that data is added to object storage, it typically receives an identifier that is referred to as a Universally Unique Identifier (UUID) or a Globally Unique Identifier (GUID). These identifiers are 128-bit integers. In layman’s terms, this means that the identifier—the “name” of the object—is a complex number, of sorts. The identifier is so complex, in fact, that it allows for each identifier to be considered unique.
The third and final component of an object is its metadata—literally “the data about the data”—which can be any information that is used to classify or characterize the data in a particular object. This metadata could be the jingle’s name, a collection of the geographical coordinates where a set of digital pictures were taken, or the name of the author who wrote the user manual.
The Advantages of Object Storage
The primary advantages of object storage—and the reason it’s used by the majority of cloud storage providers—is that it enables the storage of massive amounts of unstructured data while still maintaining easy data accessibility. The greater amount of storage is achieved thanks to its flat structure—by using GUIDs instead of the hierarchies characteristic of file storage or block storage, object storage allows for infinite scalability. In other words, by doing away with structure, there’s more room for data.
The higher level of accessibility is largely thanks to the metadata, which is infinitely customizable. Think of the metadata as a set of labels for your data. Because this metadata can be refined and rewritten and expanded infinitely, the objects in object storage can easily be reorganized and scaled, based on different metadata criteria.
This last point is what makes object storage so popular for backup and archiving functions. Metadata’s unrestricted nature allows storage administrators to easily implement their own policies for data preservation, retention, and deletion, making it easier to reinforce data and create better “disaster recovery” strategies like Reed-Solomon Erasure Coding and Vault Architecture.
The Primary Uses of Object Storage
The main use cases for object storage include:
- Storage of unstructured data like multimedia files
- Storage of large data sets
- Storage of large quantities of media assets like video footage as an archive in place of local tape drives
The prime use cases for object storage generally include storing large amounts of data that businesses need to access only periodically. For instance, if your business does a lot of production work in any medium, you probably need a lot of space to store your finished projects after their useful life is complete, but you probably also need access to the files in case you or a client need them again in the future.
This sort of accessible archive is perfect for object storage because the data doesn’t need to be highly structured. For example, KLRU, the Austin Public Television station responsible for broadcasting the famous musical showcase “Austin City Limits,” recently opted to migrate their 40+ year archive of footage into cloud storage. Object storage provided a cheap, but reliable, archive for all of their work. And their ability to organize the content with metadata meant they could easily distribute it to their network of licensees (or anyone else interested in using the content).
The scalability and flexibility of object storage has made it the go-to choice for many businesses who are transitioning to cloud solutions. That said, the relative complexity of the naming schema for the objects—that 128-bit identifier isn’t exactly user-friendly for most of us—and the metadata management approach can prove to be too complex or ill suited for certain use-cases. For media production companies and agencies this will often lead to the use of third party software (including Media Asset Managers (MAM) and Digital Asset Managers (DAM) that layer organizational schema over the top of the object store. When this doesn’t work, many turn to file storage, which we’ll discuss next.
For administrators in need of a friendlier user interface but smaller storage requirements—think millions of files, instead of billions—file storage might be the answer.
So what is file storage? In file storage, the data is stored in files. These files are, in turn, organized in folders, and these folders are then arranged into directories and subdirectories in a hierarchical fashion. To access a file, users or machines only need the path from directory to subdirectory to folder to file.
Because all the data stored in such a system is already organized in a hierarchical directory tree, like the files on your hard drive, it’s therefore easy to name, delete, or otherwise manipulate files without any additional interface. If you have used practically any operating system (whether Windows or Apple iOS, or whatever else), then you’re likely already familiar with these types of file and folder trees and are more than capable of working within them.
The Advantages of File Storage
The approachability of file storage is often seen as its primary advantage. But, using file storage in the cloud adds one key element: sharing. In cloud file storage, like on an individual computer, an administrator can easily set access as well as editing permissions across files and trees such that security and version control are far easier to manage. This allows for easy access sharing and thereby easy collaboration.
The disadvantage of file storage systems, however, is that if you plan for your data to grow, there is a certain point at which the hierarchy and permissions will become complex enough to slow the system significantly.
The Use Cases for File Storage
Common use cases for file storage are:
- Storage of files for an office or directory in a content repository
- Storage of files in a small development or data center environment that is a cost effective option for local archiving
- Storage of data that requires data protection and easy deployment
Generally speaking, discrete amounts of structured data work well in file storage systems. If this describes your organization’s data profile, and you need robust sharing, cloud file storage could be right for you. Specific examples would include businesses that require web-based applications. In this instance, a file storage system can allow multiple users who need to manipulate files at the same time the access they need, while also clearly delineating who can make changes. Another example is data analytics operations, which often require multiple servers to modify multiple files at the same time. These requirements make file storage systems a good solution for that use case as well.
Now that you have a better idea of the differences between object and file storage, let’s take a look at block storage and its special use cases.
A lot of cloud-based enterprise workloads are currently using block storage. In this type of system, data is broken up into pieces called blocks, and then stored across a system that can be physically distributed to maximize efficiency. Each block receives a unique identifier, which allows the storage system to put the blocks back together when the data they contain are needed.
The Advantages of Block Storage
A block storage system in the cloud is used in scenarios where it’s important to be able to quickly retrieve and manipulate data, with an operating system accessing these data points directly across block volumes.
Block storage also decouples data from user environments, allowing that data to be spread across multiple environments. This creates multiple paths to the data and allows the user to retrieve it quickly. When a user or application requests data from a block storage system, the underlying storage system reassembles the data blocks and presents the data to the user or application.
The primary disadvantages of block storage are its lack of metadata, which limits organizational flexibility, and its higher price and complexity—as compared to the other solutions we’ve discussed.
The Use Cases for Block Storage
Primary use cases for block storage are:
- Storage of databases
- Storage for RAID volumes
- Storage of data for critical systems that impact business operations
- Storage of data as file systems for operating systems for virtualization software vendors
The relatively fast, reliable performance of block storage systems make them the preferred technology for databases. For the same reason block storage works well for databases, it also provides good support for enterprise applications: for transaction-based business applications, block storage ensures users are serviced quickly and reliably. Virtual machine file systems (VMFS) like VMware also tend to use block storage because of the way data is distributed across multiple volumes.
Making a Choice Between Different Types of Cloud Storage
So which cloud storage system is right for you? Block or file storage could be useful if you’re dealing with a lot of data that members of a team have to change frequently. You might find that block storage works best for you if you need to store an organized collection of data that you can access quickly. File storage has the advantage that the data is easy to manipulate directly without a custom-built interface. But if you need highly scalable storage units for relatively unstructured data, that is where object storage shines. Whatever path you decide, now you have a sense of the use cases, advantages, and disadvantages of different storage types to weigh your next step into the cloud storage ecosystem.