What Object Storage Is – and What It is Not

Posted on January 17, 2014 by Eric Slack

Object storage has been around for a number of years but is now getting more attention. This is due to a couple of factors: the amount of data, especially file data, is continuing to grow and the length of time it must be retained is increasing. The need for storage systems that can keep up with this demand for long-term capacity is greater than ever and storage vendors are responding with object-based storage systems. But what exactly is object storage and why is it a better fit for scalable file storage than traditional storage architectures?

What Object Storage Is

Object storage is an architecture in which data is stored in discrete ‘buckets’ (called objects) instead of in large volumes of address space (like block storage) or in a hierarchy of directories and folders (like NAS). Each object is assigned a unique identifier called an “object ID number” (OID), which is compiled into a flat index and searched to access that object’s files. This structure provides a simple, efficient way to organize and access files, compared to navigating a folder hierarchy. Instead, users or applications basically look up the OID and fetch the data required using an offset into the object.

This architecture also requires a minimum of metadata, the information that’s used to organize data sets, compared with a traditional file system. This metadata efficiency means object storage generates less overhead storing and handling files. But object storage is also more flexible than traditional storage as systems can be configured with custom metadata fields to support advanced functionality.

The object-based storage software architecture is also ideal for a modular storage topology. Since each object is a discrete ‘container’, large data sets are easily divided into subsets and stored in a scale-out fashion, on multiple storage modules or “nodes”. This logical cluster of nodes can then be physically separated to provide greater data protection.

Object storage systems are accessed using a REST-based interface, using lower-level PUT and GET commands. This allows applications to directly access data without using traditional file system protocols and for object storage to be easily connected to via the internet.

What Object Storage is Not

Object storage is not a storage system, per se, but an architecture as described above, one that can be integrated into storage systems in many different configurations. Some object-based solutions are software-only that run on user-supplied hardware. Others are appliances, typically 1U or 2U nodes often supplied as turnkey systems that leverage commodity hardware.

Object storage is not a file system either, or a NAS; however, object-based storage systems are often used in the same large file-storage environments as scale-out NAS solutions. These storage systems typically contain a file system layer that essentially performs a protocol translation, mapping a file name to the object that contains that file.

Object storage is not erasure coding, a data resiliency process that uses redundant blocks of data and a parity-like calculation to recover from an infrastructure failure. As with the file system discussion above, many object storage systems include features like erasure coding and data distribution (or dispersion), both process that are well suited to the object-based architecture.

Conclusion

Object-based storage systems have some unique characteristics which can provide compelling advantages for storage vendors designing systems to handle the deluge of unstructured data that many companies are facing. In the future posts we’ll discuss what these characteristics are, how they’re being leveraged by storage vendors and where object storage systems are being used.

Click Here To Sign Up For Our Newsletter

About Eric Slack

Eric is an Analyst with Storage Switzerland and has over 25 years experience in high-technology industries. He’s held technical, management and marketing positions in the computer storage, instrumentation, digital imaging and test equipment fields. He has spent the past 15 years in the data storage field, with storage hardware manufacturers and as a national storage integrator, designing and implementing open systems storage solutions for companies in the Western United States. Eric earned degrees in electrical/computer engineering from the University of Colorado and marketing from California State University, Humboldt. He and his wife live in Colorado and have twins in college.

Tagged with: Architecture, File system, Metadata, NAS, Object, Object ID Number, Object Storage
Posted in Blog

4 comments on “What Object Storage Is – and What It is Not”

What Object Storage Is – and What It is Not | TwinStrata says:

January 22, 2014 at 9:49 am

[…] Click here to read the whole article storageswiss.com […]
Where Object Storage is Used | Storage Swiss - Storage Switzerland says:

January 30, 2014 at 9:36 am

[…] previous columns we’ve discussed what exactly object storage is (and is not) and what advantages this technology brings to the storage infrastructure. It’s more scalable than […]
2014 Next Generation Object Storage Summit Kicks Off | Storage Swiss - Storage Switzerland says:

March 4, 2014 at 1:09 pm

[…] Los Angeles – Storage Switzerland and a host of other industry influencers are spending the next day or so at the Next Generation Object Storage Summit (NGOSS). As is usually the case at these events, we opened with the ongoing argument about how the industry should explain what object storage is and why an organization might want to use it. My colleague, Eric Slack, has gone to great lengths to frame this up in his recent Object Storage series. […]
Do SMB Data Centers Need Object Storage? | Storage Swiss - Storage Switzerland says:

March 5, 2014 at 1:23 pm

[…] I strongly disagree with. You should care and you should have a reasonable understanding as to what object storage is and why it is better. You may never interface with the object storage layer directly but knowing […]

Comments are closed.