archive

What is an archive?

An archive is a collection of data moved to a repository for long-term retention, either to keep it separate for compliance reasons or to free up primary storage media. It can be a simple list of files or a set of files organized under a directory or catalog structure, depending on how a particular program supports archiving.

Websites and File Transfer Protocol (FTP) sites that provide downloadable software sometimes refer to the list of downloadable files as an archive or archives.

Backup vs. archive

Data backup and archiving are similar, but they serve different purposes: backup supports recovery, while archiving supports long-term retention.

Differences between backup and archive.

Backups are copies of data stored for recovery in case of corruption or other data loss. These copies are typically created using replication or mirroring and are updated as files change. Backup storage needs to perform well enough to restore data quickly. Backups are often stored as blocks to facilitate the recovery of large amounts of data at one time.

Archived data is not a copy, but rather inactive and rarely altered data that needs to be retained for long periods. Performance is less critical in archive storage. Rather than being stored in blocks, archived data is usually stored as files or objects with metadata attached, so that granular access to the data is possible.
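To illustrate how attached metadata enables granular access, the following is a minimal sketch rather than any product's actual format. The ArchivedObject class, the find_by_metadata helper and the metadata fields (department, retain_until) are hypothetical names chosen for the example.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class ArchivedObject:
    """Hypothetical archive record: payload bytes plus descriptive metadata."""

    key: str
    data: bytes
    metadata: dict = field(default_factory=dict)

    def __post_init__(self):
        # Record a checksum and size so the content can be verified later.
        self.metadata.setdefault("sha256", hashlib.sha256(self.data).hexdigest())
        self.metadata.setdefault("size_bytes", len(self.data))


def find_by_metadata(objects, **criteria):
    """Return the objects whose metadata matches every key/value in criteria."""
    return [
        obj
        for obj in objects
        if all(obj.metadata.get(k) == v for k, v in criteria.items())
    ]


# Illustrative data only; keys and metadata values are made up.
store = [
    ArchivedObject(
        "invoices/2019-03.pdf",
        b"invoice bytes",
        {"department": "finance", "retain_until": "2029-03-31"},
    ),
    ArchivedObject(
        "hr/handbook-v1.txt",
        b"handbook bytes",
        {"department": "hr", "retain_until": "2026-01-01"},
    ),
]
```

Calling find_by_metadata(store, department="finance") selects individual objects by their metadata without reading the payloads themselves, which is the kind of granular lookup that block-level backup storage does not offer directly.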

Archive storage options

Archive storage typically needs to store large amounts of data for long periods at a low cost. Tape, object storage and cloud storage are commonly used for archived data.

Enterprise data archiving tools

Archiving software enables data to move from production storage to archive storage as needed.

Many archiving software products can automatically offload data to the archived storage location based on user-created policies or as the data becomes less frequently accessed. Some archiving software connects directly to a cloud provider, while other software helps tape or object storage act as an extension of the disk used to store production data.
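A policy of this kind can be sketched in a few lines. The example below is an assumption-laden illustration, not how any particular archiving product works: the function name archive_stale_files and its parameters are hypothetical, the archive location is a plain directory assumed to sit outside the source tree, and "less frequently accessed" is approximated by last-modified time because access times are not reliably recorded on every file system.

```python
import shutil
import time
from pathlib import Path

SECONDS_PER_DAY = 86_400


def archive_stale_files(source_dir, archive_dir, max_idle_days, now=None):
    """Move files not modified within max_idle_days into archive_dir.

    The directory layout under source_dir is preserved in archive_dir.
    Returns the relative paths (POSIX style) of the files that were moved.
    """
    source = Path(source_dir)
    archive = Path(archive_dir)
    now = time.time() if now is None else now
    cutoff = now - max_idle_days * SECONDS_PER_DAY

    moved = []
    # sorted() materializes the listing first, so moving files is safe.
    for path in sorted(source.rglob("*")):
        if not path.is_file():
            continue
        if path.stat().st_mtime < cutoff:
            relative = path.relative_to(source)
            target = archive / relative
            target.parent.mkdir(parents=True, exist_ok=True)
            shutil.move(str(path), str(target))
            moved.append(relative.as_posix())
    return moved
```

For example, archive_stale_files("/data/production", "/mnt/archive", max_idle_days=90) would move anything untouched for roughly three months; both paths are placeholders. Commercial tools typically add scheduling, stubs or links left behind in production storage, and retention rules, none of which this sketch attempts.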

In many cases, archiving and backup software are integrated. To improve response times when data is accessed, some software can also cache segments of archived data on disk, while the majority remains on object or tape storage.
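The caching idea can be sketched as a small least-recently-used (LRU) cache placed in front of a slow archive tier. This is an illustration under stated assumptions: SegmentCache and fetch_from_archive are hypothetical names, LRU eviction is one possible policy rather than one the software described here is known to use, and an in-memory dictionary stands in for the disk cache.

```python
from collections import OrderedDict


class SegmentCache:
    """Small LRU cache in front of a slow archive tier (hypothetical)."""

    def __init__(self, fetch_from_archive, capacity=2):
        # fetch_from_archive: callable that takes a segment id and returns
        # its bytes from tape or object storage (assumed to be slow).
        self._fetch = fetch_from_archive
        self._capacity = capacity
        self._segments = OrderedDict()
        self.archive_reads = 0

    def read(self, segment_id):
        if segment_id in self._segments:
            # Cache hit: mark the segment as most recently used.
            self._segments.move_to_end(segment_id)
            return self._segments[segment_id]

        # Cache miss: go to the archive tier and remember the result.
        self.archive_reads += 1
        data = self._fetch(segment_id)
        self._segments[segment_id] = data
        if len(self._segments) > self._capacity:
            # Evict the least recently used segment.
            self._segments.popitem(last=False)
        return data
```

Repeated reads of the same segment are served from the cache, so only the first read and any read after eviction reach the archive tier. The archive_reads counter makes that visible when experimenting with different capacities.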

Editor's note: This article was revised in 2023 by TechTarget editors to improve the reader experience.

This was last updated in October 2023
