When applied to large file systems, traditional backup software presents three inherent problems:
An alternative is to use the LibSAM library for the Sun StorageTek Storage Archive Manager (SAM), a paradigm for data management that goes beyond backup.
What Is the LibSAM Library?
Designed for use with Sun StorageTek SAM and Sun StorageTek QFS
software, the LibSAM library or API allows you to manage data in a
The model employed is client-server: A client process makes requests to a server process. The server processes the requests and returns the processing status to the client. In the simplest case, which applies to LibSAM, the server and client run on the same machine. Therefore, all requests are local and translate into system calls to the kernel.
Basic Concepts and Implementation
Before this article delves into the details of how LibSAM is used to overcome the limitations of traditional backup mechanisms, you should understand some basic concepts associated with each function. This article first discusses the four major components of the Sun StorageTek SAM archive management system: archiving, releasing, staging, and recycling.
Archiving
Archiving, the process of backing up a file by copying it from a file system to archive media, is typically the first component. The archive media can be a removable media cartridge or a disk partition of another file system. Figure 1 shows the basic components of the archiving process.
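As a rough illustration of archiving as "copy the file to archive media," the following Python sketch copies one file into several directories that stand in for archive media. It is a conceptual model only and does not call LibSAM; the function name `archive_file` and the directory names are hypothetical.

```python
import shutil
import tempfile
from pathlib import Path


def archive_file(source: Path, media_dirs: list[Path]) -> list[Path]:
    """Copy `source` onto each stand-in archive medium; return the copy paths."""
    copies = []
    for media in media_dirs:
        media.mkdir(parents=True, exist_ok=True)
        target = media / source.name
        shutil.copy2(source, target)  # copies data and preserves timestamps
        copies.append(target)
    return copies


def demo() -> list[Path]:
    """Create a small file in a temporary 'file system' and archive it twice."""
    root = Path(tempfile.mkdtemp())
    source = root / "fs" / "report.dat"
    source.parent.mkdir()
    source.write_bytes(b"patient record")
    # Hypothetical media: one disk archive and one directory standing in for a cartridge.
    return archive_file(source, [root / "disk_archive", root / "cartridge_01"])
```

Calling `demo()` leaves the original file in place and returns the paths of the two archive copies, which mirrors the idea that archiving adds copies rather than moving the data.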
To archive a file managed by a
Given a pathname for a file,
As a best practice, an application should create and maintain four copies of each archive set. These copies should be stored on their own media to protect against data loss. The copy managed directly by the file system is referred to as the
on-disk copy, which is stored in the disk cache. Note that you can
configure the archiving parameters in the
Releasing
After a period of time during which a file's data has not been accessed or modified, the software releases the file's copy on disk. Releasing is the process of making disk cache space available by identifying archived files for which all archive copies have been made and freeing their online disk space. This frees disk space to allow creation of other files or retrieval of other file data from the archive media. The releaser is the second major component of Sun StorageTek SAM. Figure 2 illustrates the releasing process. Note: The releaser can release only files that have at least one active archived copy.
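The release rule described above (idle files only, and only if at least one archive copy exists) can be sketched as a small Python model. This is an illustration under assumed names, not the releaser's actual algorithm: `ManagedFile`, `release_idle_files`, and the idle-time threshold are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ManagedFile:
    name: str
    archive_copies: int   # active archive copies that exist for this file
    idle_days: int        # days since the data was last accessed or modified
    online: bool = True   # True while the data blocks occupy the disk cache


def release_idle_files(files, min_idle_days):
    """Release every idle, online file that has at least one archive copy.

    Returns the names of the files whose disk cache space was freed.
    """
    released = []
    for f in files:
        if f.online and f.idle_days >= min_idle_days and f.archive_copies >= 1:
            f.online = False       # data blocks freed; the file entry itself remains
            released.append(f.name)
    return released
```

In this model a file with zero archive copies is never released, no matter how long it has been idle, because releasing it would discard the only copy of its data.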
To release a file managed by a
Given a pathname for a file, the
Staging
Staging is the process of copying file data from an archive copy back into the disk cache. When a file whose data blocks have been released is accessed, the staging software automatically copies the file data -- or a portion of it -- back to the online disk cache. The stager is the third major component of Sun StorageTek SAM. Figure 3 illustrates this concept.
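The automatic stage-on-access behavior can be modeled in a few lines of Python. The sketch below is a simplified illustration, not LibSAM code: the class `CachedFile` and its methods are hypothetical, and it stages the whole file rather than a portion of it.

```python
class CachedFile:
    """A file whose data lives in an archive copy and, optionally, in the disk cache."""

    def __init__(self, archive_data: bytes):
        self._archive = archive_data   # stand-in for the copy on archive media
        self._cache = None             # None means the data blocks are released
        self.stage_count = 0           # how many times the data was staged

    def release(self) -> None:
        """Free the disk cache copy; the archive copy is untouched."""
        self._cache = None

    def read(self) -> bytes:
        """Return the file data, staging it from the archive first if needed."""
        if self._cache is None:
            self._cache = self._archive   # stage: copy archive data into the cache
            self.stage_count += 1
        return self._cache
```

The first `read()` after a release triggers one stage; later reads are served from the cache until the file is released again.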
To stage a file managed by a
Given a pathname for a file,
Table 3 shows the full list of staging options.
Recycling
Over a period of time, archived copies can become redundant because of
their age and can be deleted from archive media to create space for
newer versions or copies. Archived file data is stored in a
Recycling conserves archive media by reclaiming expired archive data, thus making room for additional file data to be archived. Recycling is the fourth and last major component of Sun StorageTek SAM. Figure 4 illustrates this concept.
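As a minimal sketch of what "reclaiming expired archive data" means, the following Python model drops expired copies from a volume and reports the space recovered. It is a deliberate simplification under assumed names (`ArchiveCopy`, `recycle`) and does not represent how the Sun StorageTek SAM recycler is implemented.

```python
from dataclasses import dataclass


@dataclass
class ArchiveCopy:
    file_name: str
    size: int        # bytes occupied on the archive volume
    expired: bool    # True once a newer version or copy supersedes this one


def recycle(volume: list[ArchiveCopy]) -> tuple[list[ArchiveCopy], int]:
    """Return the copies still needed and the number of bytes reclaimed."""
    kept = [c for c in volume if not c.expired]
    reclaimed = sum(c.size for c in volume if c.expired)
    return kept, reclaimed
```

The reclaimed byte count is the room made available for additional file data on that volume.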
Recycling is a process that typically does not require user interaction. Often, the recycler is invoked through root's
Real-Life Example
To put the pieces together, consider the example of a hospital, in which large volumes of data about patients are generated and subsequently sit unused for long periods of time. (Download the example program.) A patient's hospital records might have the following data storage history:
Overcoming Limitations of Traditional Backup Software
In current backup methodologies, files are copied repeatedly at each full backup cycle, regardless of whether they have changed. With the Sun StorageTek Storage Archive Manager (SAM) software, a file never needs to be copied again unless it changes, because it is protected by the four copies made to different media. Should a file change after it is archived, the advanced file system in Sun StorageTek SAM software automatically makes new copies to new media or new locations, protecting the new versions of this file again, without having to wait for a full or incremental backup that traditional backup software requires. After the new copy is made successfully, the file's metadata is updated to reflect its new location and media. This technique of automatically copying files only after they have been created or modified effectively eliminates the need for a specific period to back up data and can significantly decrease administrator overhead.
In addition, the software features the capability to copy and archive from disk to disk, allowing organizations to develop more resilient disaster-recovery scenarios by copying to remote sites.
Restoring a file system protected by Sun StorageTek SAM software is also extremely fast when compared to traditional backup software because only the metadata needs to be restored before the file system can be mounted and used. This can take minutes rather than the hours or days that a full restoration from tape requires. Once the metadata is restored and the file systems mounted, all references to the data are satisfied from the Sun StorageTek SAM archive. Files are migrated back to online storage as they are accessed, providing transparent access to the actual file data from near-line storage. And Sun StorageTek SAM software's read-behind feature enables users to begin reading the file even before it is fully restored, significantly benefiting users who need to access large files.
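The difference between repeated full backups and copying files only when they change can be made concrete with a back-of-the-envelope calculation. The Python sketch below is illustrative only: the function names, the uniform change rate, and the figures in the usage note are assumptions, not measurements from Sun StorageTek SAM.

```python
def full_backup_gb(total_gb: float, cycles: int) -> float:
    """Data written when every cycle copies the whole file system once."""
    return total_gb * cycles


def archive_on_change_gb(total_gb: float, cycles: int,
                         change_rate: float, copies: int = 4) -> float:
    """Data written when files are copied only after they change.

    The first cycle archives everything `copies` times; each later cycle
    archives only the fraction `change_rate` of the data, again `copies` times.
    """
    initial = total_gb * copies
    per_cycle = total_gb * change_rate * copies
    return initial + per_cycle * (cycles - 1)
```

For a hypothetical 1,000 GB file system over 52 cycles with 2 percent of the data changing per cycle, the model gives 52,000 GB written by full backups versus about 8,080 GB written by copy-on-change archiving with four copies.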
Last but not least, because Sun StorageTek SAM software copies only new and changed files, only the tape space for these files is necessary. The software can easily keep up with the rapid scaling and explosion of data in today's economy by never having to perform a full backup of all old and new data. Not only does this reduce total cost of ownership by eliminating the need to add media to preserve old and stale data, it also saves valuable time and money by limiting administrator overhead and reducing the number of tape devices that are required to accomplish a full backup within a given time.
Additional Features
Enhanced policy-based administration and security features include quotas and access control lists (ACLs) to control space consumption and data access. In addition, Sun StorageTek SAM includes the ability to manage
very large files by making use of segments, as well as the ability to
extend continuous archiving capabilities to remote sites with SAM
Remote,
Table 4 shows the LibSAM API's functions and descriptions at a glance.
All the APIs in LibSAM, except for
For more details about each library routine, see the corresponding man page for that routine. Library routines contained in LibSAM are found in section 3 of the downloadable man pages.
Summary
Sun StorageTek SAM is often combined with Sun StorageTek QFS software. With this combination, also known as Sun SAM-QFS, Sun presents a new approach that helps organizations manage information assets according to their business needs. The software enables dynamic archiving, reduced backup windows, and fast recovery to help enhance productivity and improve resource use. It consolidates innovative archiving and backup methodologies in a high-performance file system with virtually unlimited scalability. The software replaces traditional backups to improve storage resource use for applications in which data needs to be available continuously and quickly restored in the event of a business disruption. Administrators can set automatic archiving policies to determine when, where, and how information is stored, ensuring cost-effective management of large volumes of data. Metadata archiving and read-behind features help enterprises recover from business disruptions in minutes or hours, as opposed to days, and they let users begin reading files even before they are fully restored. When you put it all together, Sun StorageTek SAM-QFS software enables enterprises to get great value from their information, meeting demanding business requirements across a wide array of applications, regulations, user needs, and corporate policies, while delivering lower overall costs.
References and Download Information
Download: LibSAM man pages
About the Authors
Svati Chandra is a staff member of the Sun StorageTek SAM-QFS development team. The Sun StorageTek SAM-QFS development team works toward providing a cost-effective, high-performance, end-to-end solution for information lifecycle management (ILM).