The problem with the term “data archiving” is it implies a method to store data that hasn’t been accessed in a while and likely will never be accessed again. The of goal of archiving is to reduce costs by moving this data to storage that, while slower to respond to access requests, is significantly less expensive. The problem is a subset of this archived data that is actually accessed quite frequently, often called the “Active Archive”. This data needs to be on media that can respond much quicker to a recall demand than a traditional archive.
What Data Goes in an Active Archive?
Active archive data is typically data that is not accessed frequently like production data and it is data that as a data set is too large to cost effectively store on production-class storage systems. This could be video surveillance data, media and entertainment data and data used to fuel HPC projects. It can also be data that has specific retention requirements the organization has to adhere to. In all cases the data has either specific monetary value or its loss could potentially cost the organization money.
As we discussed in our webinar, “The Showdown for Data Preservation: Iron Mountain vs. The Cloud”, this data needs to be preserved to make sure when it is needed in the future it is readable. But this data also needs to be accessible. As we discuss in this StorageShort, acceptable recall times on preserved data are much faster than traditional legacy archives.
Addressing the recall expectation of the aActive aArchive is critical. The faster and more seamless the recall effort is the more aggressive the organization can be in moving data to the archive. This may mean a more expensive archive storage area than traditional archive. If the recall is slow with a high operational expense then the organization will be less likely to move data off of production storage, which means they don’t save anything.
Adopting an archive storage area the is more expensive than traditional archive storage but still less expensive than production storage, which creates a happy medium that still saves the organization a significant amount of money as well as creating an archive it will actually use.
Join us for our on demand webinar, “The Showdown for Data Preservation: Iron Mountain vs. The Cloud”, and learn all about data preservation and where the best place to store this important data set is.