CA2804372C - De-duplication based backup of file systems - Google Patents

De-duplication based backup of file systems Download PDF

Info

Publication number
CA2804372C
CA2804372C CA2804372A CA2804372A CA2804372C CA 2804372 C CA2804372 C CA 2804372C CA 2804372 A CA2804372 A CA 2804372A CA 2804372 A CA2804372 A CA 2804372A CA 2804372 C CA2804372 C CA 2804372C
Authority
CA
Canada
Prior art keywords
file system
data blocks
request
files
point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CA2804372A
Other languages
French (fr)
Other versions
CA2804372A1 (en
Inventor
Jedidiah Yueh
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Delphix Corp
Original Assignee
Delphix Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=45467721&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=CA2804372(C) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Delphix Corp filed Critical Delphix Corp
Publication of CA2804372A1 publication Critical patent/CA2804372A1/en
Application granted granted Critical
Publication of CA2804372C publication Critical patent/CA2804372C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/11File system administration, e.g. details of archiving or snapshots
    • G06F16/128Details of file system snapshots on the file-level, e.g. snapshot creation, administration, deletion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1415Saving, restoring, recovering or retrying at system level
    • G06F11/1435Saving, restoring, recovering or retrying at system level using file system or storage system metadata
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1448Management of the data involved in backup or backup restore
    • G06F11/1453Management of the data involved in backup or backup restore using de-duplication of the data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/84Using snapshots, i.e. a logical point-in-time copy of the data

Abstract

File system backups are performed by copying information describing changes in the file system since a previous point in time. To restore data, a virtual restored file system (VRFS) structure is created corresponding to a snapshot of data copied from the file system that is stored in the backup file system. A client can read the data stored in the backup file system pointed at by the VRFS structure as well as write to the backup file system. Multiple clients can share blocks of data stored on the backup file system via the same VRFS structure or via multiple VRFS structures. If a client writes to a data block pointed at by a VRFS, a copy of the data block is made to which the client makes changes so that the data in the original data blocks is saved for other clients.

Description

DE-DUPLICATION BASED BACKUP OF FILE SYSTEMS
Inventors:
Jedidiah Yueh BACKGROUND
[0002] This disclosure relates generally to backup and restore of file systems, and in particular to storage efficient backup.
[0003] File systems provide permanent storage for data that is critical to an organization.
Therefore, file systems form an important part of an organization's information technology. File systems are used for persistent storage of data by applications used in an organization, for example, email servers, document management systems, source code management systems and the like. Organizations make copies of the data stored in the file systems to avoid loss of data in case the original data in the file system is destroyed or corrupted. The process of making copies of the data is called a backup. Conventional systems that perform backups of data store the data in storage devices with large storage capacity, for example, magnetic tapes.
These devices typically have slow access time. Conventional systems restore data from the backup storage in case of loss of data stored in the original file system. The process of restoring data of a large file system can take significant amount of time resulting in loss of business during the time the data is unavailable.
SUMMARY
[0004] Embodiments perform backups of a file system by copying changed blocks of data from the file system to a storage system. In case of loss of data or if the source file system is unavailable, a virtual restored file system (VRFS) structure is created using blocks of data stored in the storage system. The VRFS structure points at data blocks copied at various points in time.
Since only changed blocks of data are stored in the storage system, the amount of data copied during backup is small. Also restoration of data is efficient because creation of the VRFS structure takes significantly less time than conventional restoring of file systems from a backup storage device. Furthermore, a client can directly read or write in the storage system while the source file system is down, instead of waiting for data to be restored on another file system.
[0005] In an embodiment, the backup system receives data blocks for a plurality of point-in-time copies of a source file system. The backup system stores the data blocks in a storage system. A data block can be associated with multiple point-in-time copies, for example, if a data block did not change since a previous point-in-time copy was obtained, the data block may be associated with the previous point-in-time copy as well as with a subsequent point-in-time copy.
The backup system receives a request to restore information from the source file system for a target system. The backup system generates a virtual restored file system by linking a set of files to stored data blocks of the storage system and mounting the set of files on the target system.
[0006] In an embodiment, the backup file system performs backups of multiple source file systems. In an embodiment, a client can update the data stored on the backup file system pointed at by the VRFS structure. Multiple clients can share blocks of data stored on the backup file system via the same VRFS structure or via multiple VRFS structures. If a client writes to a data block pointed at by a VRFS, the changed data block is copied. This allows the original data blocks to be used by other clients.
10006a1 In another embodiment, there is provided a method for performing backup of file systems, the method comprising: receiving a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system that changed since a previous point-in-time copy was extracted, the source file system comprising at least a source file; storing the set of data blocks on a storage system, the stored set of data blocks comprising one or more versions of a data block, each version corresponding to a point-in-time copy;
and receiving a request to restore information obtained from the source file system for a target system. The method further comprises, responsive to receiving the request to restore, creating a virtual restored file system comprising a set of files including a restored file corresponding to the source file, the creating comprising: linking the restored file to a plurality of the data blocks stored on the storage system, the plurality of data blocks comprising at least a first data block associated with a first point in time copy and a second data block associated with a second point in time copy; and mounting the set of files to the target system to allow the target system to access the set of files, the mounted set of files comprising the virtual restored file system.
10006b] In another embodiment, there is provided a method for performing backups of file systems, the method comprising receiving requests to perform backups of a plurality of source file systems, each request identifying a source file system, the source file system comprising at least a source file. The method further comprises, for each request received, performing backup of an associated source file system, comprising: receiving a set of data blocks for a plurality of point-in-time copies of the associated source file system each point-in-time copy of the associated source file-system obtained by extracting one or more data blocks from the associated source file-system that changed since a previous point-in-time copy was extracted; and storing on a storage system, the set of data blocks, the stored set of data blocks comprising one or more versions of a data block, each version corresponding to a point-in-time copy. The method further comprises receiving a request to restore information obtained from a particular source file system, the request to restore identifying a target system. The method further comprises, responsive to receiving the request to restore the particular source file system, creating a virtual restored file system comprising a set of files including a restored file corresponding to the source file, the creating comprising: linking the restored file to a plurality of stored data blocks stored on the storage system, the plurality of data blocks comprising at least a first data block associated with a first point in time copy and a second data block associated with a second point in time copy; and mounting the set of files to the target system to allow the target system to access the set of files, the mounted set of files comprising the virtual restored file system.
10006c1 In another embodiment, there is provided a computer-readable storage medium storing computer-executable instructions that when executed by at least one processor cause the at least one processor to backup a file system by: receiving a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system that changed since a previous point-in-time copy was extracted, the source file system comprising at least a source file; and storing on a storage system, the set of data blocks. The at least one processor is further 2a caused to: receive a request to restore information from the source file system for a target system;
and responsive to receiving the request to restore, create a virtual restored file system comprising a set of files including a restored file corresponding to the source file. The creating comprises:
linking the restored file to a plurality of the data blocks stored on the storage system, the plurality of data blocks comprising at least a first data block associated with a first point in time copy and a second data block associated with a second point in time copy; and mounting the set of files to the target system to allow the target system to access the set of files, the mounted set of files comprising the virtual restored file system.
[0006d] In another embodiment, there is provided a method for performing backup of file systems. The method involves receiving, by a storage system, a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system, the source file system comprising at least a source file. The method further involves storing, by the storage system, the received set of data blocks, wherein one or more stored data blocks associated with a point-in-time copy are shared with other point-in-time copies, wherein the storage system transforms data of a received data block to a format distinct from a format of the received data block and stores the transformed data. The method further involves receiving a first request to restore information associated with a first point-in-time copy for a first target system, and responsive to receiving the request to restore information obtained from the source file system, provisioning a first virtual restored file system comprising a set of files including a restored file corresponding to the source file, the restored file comprising a plurality of stored data blocks, the plurality of stored data blocks comprising stored data blocks associated with two or more distinct point-in-time copies. The provisioning involves mounting the set of files to the first target system to provide read and write access to the set of files, the mounted set of files comprising the first virtual restored file system. The method further involves receiving a second request to restore information associated with a second point-in-time copy for a second target system, and responsive to receiving the second request, provisioning a second virtual restored file system, wherein the second virtual restored file system shares one or more data blocks with the first virtual restored file system.
2b [0006e] In another embodiment, there is provided a computer-readable storage medium storing computer-executable instructions that when executed at least one processor cause at least one processor to perform a backup of file systems. The instructions cause the at least one processor to implement a block copy manager configured to receive a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system, the source file system comprising at least a source file. The instructions further cause the at least one processor to implement a storage allocation manager module configured to store the received set of data blocks on a storage system, wherein one or more stored data blocks associated with a point-in-time copy are shared with other point-in-time copies, wherein the instructions comprise instructions to: transform data of a received data block to a format distinct from a format of the received data block, and store the transformed data. The instructions further cause the at least one processor to implement a virtual restored file system manager configured to:
receive a first request to restore infon-nation associated with a first point-in-time copy for a first target system;
and responsive to receiving the request to restore information obtained from the source file system, provision a first virtual restored file system comprising a set of files including a restored file corresponding to the source file, the restored file comprising a plurality of stored data blocks, the plurality of stored data blocks comprising stored data blocks associated with two or more distinct point-in-time copies. The provisioning includes mounting the set of files to the first target system to provide read and write access to the set of files, the mounted set of files comprising the first virtual restored file system. The instructions further cause the at least one processor to receive a second request to restore information associated with a second point-in-time copy for a second target system, and responsive to receiving the second request, provision a second virtual restored file system, wherein the second virtual restored file system shares one or more data blocks with the first virtual restored file system.
[0006f] In another embodiment, there is provided a computer-implemented system for performing backup of file systems. The system includes one or more computer processors in communication with the computer readable medium described above, and configured to execute the instructions stored on the computer readable medium on the one or more computer processors, to back up a file system.
2c BRIEF DESCRIPTION OF THE DRAWINGS
[0007] FIG. 1(a) (prior art) shows a conventional approach towards performing backups of file systems using a backup storage.
[0008] FIG. 1(b) (prior art) shows a conventional approach towards making data stored in a backup storage available for use in case of loss of data of the original file system.
[0009] FIG. 2(a) is a diagram illustrating how backups are performed using a de-duplication based backup system, in accordance with an embodiment.
[0010] FIG. 2(b) is a diagram illustrating how data backed up using a de-duplication based backup system is made available for use by a client in case of loss of data, in accordance with an embodiment.
[0011] FIG. 3 is a diagram illustrating how blocks of data copied at different points in time are organized in a backup file system to allow provisioning of a VRFS, in accordance with an embodiment.
[0012] FIG. 4 illustrates how data blocks stored in a backup file system can be shared by file structures created for different VRFS structures, in accordance with an embodiment.
[0013] FIG. 5 illustrates a system architecture for a de-duplication based backup system, in accordance with an embodiment.
2d 100141 FIG. 6 is a flowchart of a process for receiving data from a production server for performing backups using a de-duplication based backup system, in accordance with an embodiment.
[0015] FIG. 7 is a flowchart of a process for provisioning a VRFS structure corresponding to a given point-in-time, in accordance with an embodiment.
[0016] FIG. 8 illustrates an embodiment of a computing machine that can read instructions from a machine-readable medium and execute the instructions in a processor or controller.
[0017] The figures depict various embodiments for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein may be employed.
DETAILED DESCRIPTION
[0018] FIG. 1(a) shows a conventional approach towards performing backups of file systems storing data, for example, data used by production servers 115. Multiple file systems 110 can be backed up using a single backup system 100. Clients 140 access the data stored in file systems 110 by interacting 145 with the production server 115. The client 140 that accesses the file system 115 can be executing on a computer system separate from the production server 115 or it can be a program executing within the production server 115. The data from the file systems 110 is periodically copied 160 to the backup system 100. The backup system 100 stores the backup data it receives on a backup storage device 130, for example, a magnetic tape storage device.
[0019] FIG. 1 and the other figures use like reference numerals to identify like elements. A
letter after a reference numeral, such as "110A," indicates that the text refers specifically to the element having that particular reference numeral. A reference numeral in the text without a following letter, such as "110," refers to any or all of the elements in the figures bearing that reference numeral (e.g. "110" in the text refers to reference numerals "110A"
and/or "110B" in the figures).
[0020] FIG. 1(b) illustrates the conventional approach towards making data available from a backup system 100 using a restore operation. If there is a system failure 125 of the production server 115(a) or the file system 110(a), the client 140 is unable to use the data of the file system 110(a). The latest backup of the data from file system 110(a) is typically retrieved 165 by the backup system 100 from the backup storage 130 and restored 170 onto a file system 110(c). The client 140 can use 175 the data available on the file system 110(e), which represents a recent copy of the data from the file system 110(a).
Typically, the performance of backup storage devices 130 is slower compared to storage devices used for file systems 110 of production system. As a result, the overall time taken to restore the data from the backup storage device 130 to the file system 110(c) can be significant, resulting in loss of potential business during the restore operation.
[0021] FIG. 2 illustrates a de-duplication based backup system 200 that eliminates the slow restore operation 170 for making data available in case of file system failures. As shown in FIG. 2(a), the de-duplication based backup system 200 receives from the file systems 110, blocks of data that have changed since a previous point in time.
A block of data can store any information in the file system including data, executable program code, or instructions. In one embodiment, the changed blocks of data are received periodically. In these embodiments, each set of data blocks received by the de-duplication based backup system 200 corresponds to data blocks that changed since the last time the blocks were extracted from the file system 110. The operation of receiving the blocks of data from the production server is also referred to as the changed-block copy operation. In an embodiment, the changed blocks of data can be sent to the de-duplication based backup system 200 by an agent running on the production server 110. That agent can either identify changed files and portions of files (data blocks), by either operating at the file system level or at a volume level.
The data blocks may or may not be fixed in size.
[0022] Alternatively, the de-duplication based backup system 200 sends a request to execute application programming interface (API) calls using libraries on the production server 115 to retrieve the changed blocks. In this configuration there is no need to have a backup agent on the production server 115 since the process is initiated and controlled by the de-duplication based backup system 200. A de-duplication based backup system 200 that does not require a backup agent on the production server is called an agent-less backup system. The de-duplication based backup system 200 stores the changed blocks of data in a backup file system 210. Typically, the amount of data corresponding to the changed blocks of the file system 110 is significantly less than the entire data of the file system 110. As a result the amount of data stored in the backup file system 210 can be significantly less than the amount corresponding to a full backup of the file system as shown in FIG.
1. That amount can be further reduced via compression or other data reduction techniques, such as using hashes as pointers to already stored identical blocks. Accordingly, the backup file system 210 can utilize a storage device that is significantly faster than a typical backup storage device 130. In an embodiment, the backup file system can be stored on a storage device that is at least as fast as the storage device used by the file system 110. For example, the backup file system 210 can be stored on a hard drive or a solid state drive (S SD). Storing the backup data on a fast storage device enables the possibility of allowing applications running on clients140 to directly access the data stored in the de-duplication based backup system 200 in case the original file system 110 isn't available without significantly degrading the performance of those applications.
[0023] FIG. 2(b) illustrates how a de-duplication based backup system 200 makes data available for a client 140 to use. If there is a failure 125 of the file system 110(a) or the production server 115(a) the client 140 can directly access the data stored in the de-duplication based backup system 200. The de-duplication based backup system 200 presents a set of data blocks that correspond to the latest data of the file system 110(a). In an embodiment, the client can request the de-duplication based backup system 200 to provide access to data corresponding to a particular point in time that is different from the latest point in time. The data blocks corresponding to a point in time may be presented to a server and operating system as either a network mounted file system (e.g. over NFS), or as a disk volume (e.g. over FC or iSCSI).
100241 The set of data blocks representing the file system 110 at a particular point in time may comprise data blocks copied by the de-duplication based backup system 200 from the file system 110 at different points in time depending on when the data block was changed.
The set of data blocks corresponding to the data of the file system 110(a) can be made available to the client 140 via a file-sharing system 220 that allows the client 140 to access the backup file system 210. The operation of making the set of data blocks available is called provisioning of a virtual restored file system (VRFS) corresponding to the data of the failed file system 110(a). For example, the file-sharing system 220 may provide access to the backup file system 210 via network file system (NFS) mount operation. A
portion of the backup file system 210 may be NFS-mounted to the client 140, thereby allowing client 140 to access files stored on the mounted portion of the backup file system 210. The client 140 can read as well as write to the mounted portion of the backup file system 210.
All or part of the file system at a specific point in time may be mounted.
[0025] The process illustrated by FIG. 2(b) eliminates the expensive data restore operation 170 illustrated in FIG. 1(b) that requires copy 170 of a large amount of data from a backup storage device. The backup file system 210 is used to store the backup copies and also can be accessed directly by the client 140 for read/write purposes. A
virtual restored file system (VRFS) structure is created that allows the client 140 to access a snapshot of the data stored on the backup file system 210 that is copied from the file system 110.
The time required to provision a VRFS corresponding to the failed file system 110(a) is significantly less than the time required for a restore operation 170. In an embodiment, all of the steps to mount the file system or volume can be fully automated, by having a toolkit on the target server (the toolkit comprising software modules), which has the privileges to mount the file system or volume. In an embodiment, when the production server 115(a) or file system 110(a) that previously failed starts working again, the changes made to the backup file system 210 by the client 140 are transferred to the production server 115(a) and applied to the file system 110(a). In an embodiment, applying the changes comprises copying the data blocks that were changed in the de-duplication based backup system 200 over to the file system 110a and replacing the corresponding data blocks of the file system 110a with the received data blocks. Subsequently, the client 140 can revert back to using the file system 110(a) of the production server 115(a).
[0026] FIG. 3 is a diagram illustrating how blocks of data copied at different points in time are organized in a backup file system to allow provisioning of a VRFS.
The file system 110 shown in FIG. 4 comprises data blocks Fl, F2, F3, and F4. Assume time TO
is the first point-in-time at which the backup file system 210 is used to backup data from file system 110. At time TO all blocks from the file system 110 are copied 315(a) to the backup file system and stored as data blocks F11, F12, F13, and F14 respectively. Assume that between time TO and Ti, blocks F2 and F4 are changed. At time Ti, only data blocks F2 and F4 are copied 315(a) from file system 110 to the backup file system 210 and stored as blocks F22 and F24 respectively. Similarly, assuming data blocks F3 and F4 change between time Ti and T2, at time T2, these blocks are copied to the backup file system 210 and stored as blocks F33 and F34 respectively.
[0027] If the file system 110 fails after time T2, a snapshot of data from the file system 110 corresponding to the point in time T2 can be made available to a client.
The data structure used for provisioning a VRFS corresponding to point T2 is shown as a set 320 of pointers to the data blocks. Each pointer Pi points at the latest copy of data block i modified before time T2. For example, pointer P1 points at data block Fll copied at time TO, pointer P2 points as block F22 copied at time Ti, and pointer P3, P4 point at blocks F33, F34 copied at time T2. If the client 140 writes to a data block pointed at by the VRFS
320, a copy of the corresponding data block is made. The client 140 subsequently reads/writes to the copy of the data block. This allows the original copy of the block stored in the backup file system 210 to be saved for use by other clients, for example, to provision other VRFS
structures.

[0028] FIG. 4 illustrates how multiple snapshots of the file system 110 can be supported at the same time by the backup file system 210 if necessary. The VRFS 320(a) corresponding to the file system snapshot for time T2 points to the copies of data blocks obtained on or before time T2. For example, pointer V1 points to block Fll copied at time TO, pointer V2 points to block F22 copied at time Ti, and pointers V3, V4 that point at blocks F33, F34 respectively. On the other hand, VRFS 320(a) corresponding to the file system snapshot for time Ti points to the latest copy of a data block copied on or before time Ti. Accordingly, pointer Ul points to block Fll copied at time TO, pointer U2 points to block F22 copied at time Ti, pointer U3 points to block F13 copied at time TO
and pointer U4 points at block F24 copied at time Ti. Data blocks of the backup file system 210 can be shared by multiple VRFS structures corresponding to different points in time.
For example, block F22 is shared by the VRFS corresponding to time Ti as well as VRFS
corresponding to time T2. Furthermore, a VRFS can be shared by multiple clients 140. In one embodiment, these different points in time can all be mounted by a single target server or by multiple target servers. In one embodiment, multiple points in time can be mounted on a single server under multiple mount points or under a single mount point, with directories representing multiple points in time.
[0029] The changes made to the backup file system 210 using a VRFS 320 can be merged back to the source file system 110. However if there are multiple copies of a data block that have been updated by clients 140, the updates to the data block may conflict with each other. Merging these copies of the data block to the original file system 110 may require further analysis of the copies of the data block to determine if the same portion of the data block is updated differently in the two copies. Conflicts may have to be resolved to determine which update from a set of conflicting updates is applied to the data while merging changes from the backup file system 210 to the original file system 110.
[0030] In some embodiments, a filter driver can monitor changes to data blocks and store the history of the changes using a suitable format. In some embodiments, variable sized data segments are copied from the file system 110 to the backup file system 210. A
variable sized data segments can be smaller than a complete data block if the amount of data that changed since the last copy operation is small. As a result, a copy of variable sized segments can take less time and can be performed more frequently. For example, an operation that copies all the changed blocks since a previous changed-block copy operation can be performed once a day but the backup of the history of changes can be performed every few minutes. A restore of a file system for a given time Tm can be performed by taking data blocks corresponding to the most recent block copy taken at time Tn before the time Tm. The changes corresponding to history of changes copied after Tn and before Tm can be applied to the data copied by the changed-block copy operation to obtain a snapshot of data close to a time point before time Tm. In some embodiments, combinations of variable sized data segments and fixed sized blocks are retrieved from the file system 110 and stored on the backup file system in either compressed form or without compression. In some embodiments, changed blocks representing several different points in time are stored local to the original file system, for instance if the original file system is not connected to the network, and then sent in a batch job while maintaining the information that allows recreation of the data corresponding to multiple points in time.
System Architecture 100311 FIG. 5 shows a high level block diagram illustrating a system environment suitable for performing backups of file systems using a de-duplication based backup system 200 and making the data stored in the backup system available to clients. The system environment comprises one or more production server 115, a de-duplication based backup system 200, an administration system 540, and one or more clients 140. Systems shown in FIG. 3 can communicate with each other if necessary via a network.
[0032] A production server 115 is a computer system that performs operations on data stored in a file system 110. For example, the production server 115 may comprise an email server, or a database server, an enterprise resource planning (ERP) application, a mobile device, or any other application that stores data. In one embodiment, a production server 115 refers to any computer system that stores data on a file system 110. The production server 115 comprises a file system 110, a backup agent 345, and a file system library 385. In alternative configurations, different and/or additional modules can be included in a production server 115.
[0033] In some embodiments, the file system 110 may be a local storage of the production server 115 or a network attached storage. The file system library 385 provides APIs useful for extracting information from the file system 110. The backup system 200 calls the APIs of the file system library 385 to copy data from file system 110 to the backup system 200. In one embodiment, the backup system 200 calls the file system library 385 APIs to determine the number of blocks of the file system that changed since a given point in time to determine whether to obtain the changed blocks. In another embodiment, the backup system 200 periodically copies at a fixed interval, blocks of file system 110 that changed since the last time a similar operation was performed. In one embodiment, the file system library 385 mounts the file system 110 of the production server 115 on the de-duplication based backup system 200 using a file sharing system similar to the file-sharing system 220.
Mounting file system 110 on the backup system 200 allows transfer of data stored on the file system 110 to the backup system 200 using file system APIs.
[0034] In some embodiments the de-duplication based backup system 200 comprises one or more de-duplication based backup systems 200 that share a mapping of all the blocks to prevent duplicates from being stored across all the de-duplication based backup systems 200.
For example, use of multiple de-duplication based backup systems 200 provides fault-tolerance or high-availability of the systems in case of failures. In some embodiments, the de-duplication based backup systems 200 add structured redundancy via mirroring, RAID, or other mechanism to protect the data stored in the de-duplication based backup systems 200.
[0035] In some embodiments, the production server 115 includes a backup agent 545 that coordinates transfer of data blocks from the file system 110 to the de-duplication based backup system 200. For example, the backup agent 545 can periodically monitor data blocks of the file system 110 for changes to the data blocks to identify data blocks to be copied to the backup system 200. In one embodiment, the backup agent 545 writes changed blocks of file system 110 to the backup system 200 at a fixed time interval. In another embodiment, the backup agent 545 writes changed blocks of file system 110 to the backup system 200 if the number of changed blocks exceeds a threshold number or the size of the changed data exceeds a threshold value. In another embodiment all change blocks are sent as they are stored and then stored in the de-duplication backup system as a log of change blocks that can be applied to create different points in time.
100361 Some embodiments may not include all the modules shown in FIG. 5, for example, the backup agent 345. The copy of data from the file system 110 that does not include the backup agent 545 is initiated by the block copy manager 510. These embodiments are called agent-less de-duplication based backup systems. These embodiments are beneficial since they require less maintenance for production server 115 by eliminating the installation and maintenance of the backup agent 545. In some embodiments the agent-less system requires less load on the production server 115 by eliminating the backup agent 545 that can consume system resources. On the other hand, a system that uses a backup agent 545 may require fewer communications between the de-duplication based backup system 200 and the production system 115 since the block copy manager 510 is not required to request backup data from the production server 115 each time. In a system where read,/write operations on the file system 110 are less frequent, the backup agent 545 can significantly reduce the number of messages exchanged between the production server 115 and the de-duplication based backup system 200.
[0037] In one embodiment, the file system library 585 includes a filter driver that monitors all update operations of the file system 110. The filter driver monitors all the read/write operations performed on the file system 110. The filter driver includes the logic to track the updates made to the file systems and determine when to copy the updates to the backup system 200. The filter driver can copy the changed blocks of the file system 110 to the backup system 200 either on a periodic basis or when the amount of changed data or number of changed blocks exceeds a threshold.
[0038] The data stored in the backup file system 210 can be exposed to a client 140, for example, by using a file-sharing system 220. For example, the data stored in the backup file system 210 can be mounted on a computer system running the client 140. The client 140 can be an application or a program running on a computer system. The de-duplication based backup system 200 retrieves information available in the file system 110 of the production server 115 and stores it in the backup file system 210. The information retrieved from the file system 110 comprises data that changed since a previous retrieval for a subset of the file system 110 or for the entire disk storing the file system 110. In some embodiments, the file system 110 can be mounted as an iSCSI volume that represents a network disk (instead of a network file system). If the information corresponding to the entire disk is retrieved from the file system 110, the retrieved information can be used to boot or start up a new computer system.
In one embodiment, the information retrieved from the file system 110 can be a portion of the file system 110 that is associated with an application, for example, an email server. In some embodiments, APIs provided by an application can be used to monitor and process changes to the file system made by the application, for example, history of changes made by the application can be obtained and stored. Systems and methods for retrieving data associated with database systems to a database storage systems are described in the US. Application No.
12/603,541 filed on October 21, 2009, now US 8,150,808.
[0039] In some embodiments, the information backed up from the file system 110 can include a set of one or more files. The de-duplication based backup system 200 retrieves changed blocks of data from the set of files and copies them onto the backup file system 210.

In some embodiments, the production server 115 can have applications that have file handles open for files stored in the file system 110 when the block copy manager 510 or the backup agent 545 needs to copy data from the file system 110. An open file handle for a file 10a indicates that the application can be in the process of changing the data in the file.
Depending on the state of the application, the data of the file may not be in a consistent state.
In these embodiments a previous state of the data blocks of the file system 110 that represents a consistent state of the file system is used to perform the backup.
[0040] The de-duplication based backup system 200 includes an auto-discovery manager 520, a VRFS manager 575, a block copy manager 510, a file sharing manager 570, a storage allocation manager 565, and a backup file system 210. In alternative configurations, different and/or additional modules can be included in the de-duplication based backup system 200.
The block copy manager 510 interacts with the production server 115 by sending a request to the file system library 585 to retrieve information representing a point-in-time copy of the file system 110 or a portion of the file system 110. The block copy manager 510 stores the retrieved information in the backup file system 210. The VRFS manager 575 builds the data structures to create a virtual restored file system 420 based on blocks of data stored in the backup file system 210. The storage allocation manager 565 allocates storage for the information received to store the information in the backup file system 210.
The storage allocation manager 565 keeps track of the various versions of each block of data that may be obtained from the production server 115. The storage allocation manager 565 also copies blocks of data if necessary, for example, when a client writes to a data block. If a block of data is copied for read-only purposes, the storage allocation manager 565 allocates only sufficient storage to keep a pointer of reference to the exiting block of data. However, if an attempt to write to the copied block of data is made, the storage allocation manager 565 allocates sufficient storage to make a copy of the block of data to avoid updating the original block of data. In some embodiments, the data blocks retrieved by the de-duplication based backup system 200 may be compressed before they are stored in the backup file system 210.
[0041] The file sharing manager 570 allows files stored in the de-duplication based backup system 200 to be shared across computers that may be connected with the de-duplication based backup system 200 over the network. The file sharing manager 570 uses the file-sharing system 220 for sharing files with clients 140. An example of a system for sharing files is a network file system (NFS). A system for sharing files may utilize fiber channel Storage area networks (FC-SAN) or network attached storage (NAS) or combinations and variations thereof. The system for sharing files may be based on small computer system interface (SCSI) protocol, internet small computer system interface (iSCSI) protocol, fiber channel protocols or other similar and related protocols. In some embodiments, the de-duplication based backup system 200 may utilize a logical volume manager. In an embodiment, files are organized in a format emulating a given file system disk layout, such as the file system of WINDOWSTM operating system called NTFS or the UN EXTNI
File system (UFS).
[00421 The auto-discovery manager 520 allows the de-duplication based backup system 200 to find file systems that need backups for a giver computer system or a set of computer systems. For example, the de-duplication based backup system 200 may be provided with information identifying a computer system, for example, the network address or a machine.
The de-duplication based backup system 200 can automatically discover various file systems used by the computer system that may need backup. In some embodiments, the de-duplication based backup system 200 is provided information identifying applications that need backup on the computer system. The de-duplication based backup system 200 can identify all the relevant the files that are used by the identified applications that need backup. For example, the de-duplication based backup system 200 may be requested to perform backups of an email server on a given machine. The auto-discovery manager 520 can identify the files used by the email server to store email related data as well as email server configuration information that needs to be backed up. In an embodiment, the auto-discovery manager 520 can take information identifying a set of computer systems and discover the file systems associated with all the computer systems that need backup. In some embodiments, the auto-discovery manager 520 is provided access to a network to perform discovery of various machines on the network including desktop computers, laptop computers, servers etc. and the file systems on the discovered computers for which backup needs to be performed. In an embodiment, the information discovered by the auto-discovery manager 520 is reviewed by a system administrator to determine the set of file systems that need to be backed up to the backup file system 210.
10043] The administration system 540 comprises a user interface 595 that allows a system administrator to perform administrative operations for example, specifying production systems that need to be backed up or client systems 140 that need access to the backup file system 210 in case of data loss. In some embodiments, the user interface 595 allows a system administrator to configure parameters for the auto-discovery manager 520 to perform discovery of file systems 110 for backup. The user interface 595 also allows a system administrator to specify the frequency at which data is copied from the production server 115 for performing backup.
[0044] FIG. 6 shows a flowchart of the process illustrating how a block copy manager 510 receives changed blocks from the production server 115 and saves them. The block 12a copy manager 510 sends 610 a request for blocks changed since a previous changed-block copy operation to the production server 115. In some embodiments, the request may include information identifying the files for which change information is retrieved or information identifying a portion of the file system 110 for which changed blocks need to be retrieved. In some embodiments, the request comprises a remote call to execute instructions (or code) in the file system library 585. The file system library 585 executes instructions to identify the changed data blocks and sends the requested data to the de-duplication based backup system 200. The block copy manager 510 receives 620 the information comprising the changed blocks. The block copy manager 510 requests the storage allocation manager 565 to save 630 the blocks of data in a storage efficient manner. The block copy manager 510 waits for the next iteration of the changed-block copy operation and repeats the above steps after the waiting period. In one embodiment special processing can be applied either before, during, or after the backup process or before, during, or after the mounting process to make a point-in-time copy available. Special processes can include but are not be limited to encryption, identification and elimination of confidential data, elimination of unnecessary or temporary data (such as empty blocks), or other processes.
[0045] In some embodiments, the backup agent 545 of the production server determines whether to initiate the changed-block copy operation. For example, the backup agent 545 can monitor the changes made to a portion of the file system 110, for example, a set of files. If the amount of changes as measured by the number of blocks changed or an amount of information indicating history of changes exceeds a threshold value, the backup agent may determine that a changed-block copy operation needs to be initiated.
In another embodiment, even though the amount of changes may not exceed a threshold value, the backup agent 545 may initiate a changed-block copy operation if the time since a previous changed-block copy operation exceeds a threshold value. The backup agent can send the information comprising changed blocks of data or history of changes to the block copy manager 510. The block copy manager 510 sends a request to the storage allocation manager 565 to save the information. The storage allocation manager 565 saves the information in the backup file system 210.
[0046] FIG. 7 illustrates how a virtual restored file system structure can be created for a client 140 based on the backup information saved in the backup file system 210. The VRFS
manager 575 receives 700 a request to create a VRFS for a given time point Tn, for example, in case there is a failure of a file system 110 that was backed up. The VRFS
manager 575 identifies 710 the recent most block copy for a time point Tj that is before time Tn. In some embodiments, if information describing history of changes that occurred between the time periods Tj and Tn is available, the information is identified 720. Based on the block copy for time point Tj and the history of changes, a read write file structure is created 730 by making storage efficient copy of data blocks. For example, the file structure may include pointers to blocks of data stored in backup file system 210. In some embodiments, data changes based on history of changes are applied to the blocks of data to obtain a copy of the data block corresponding to a more recent point in time compared to Tj. A handle to the file structure created is sent 740 to the client 140. The client can perform read or write operations using the VRFS file structure.
Alternative Embodiments [0047] In some embodiments, the de-duplication based backup system 200 can be used as a production system in case the original production server 115 fails. The de-duplication based backup system 200 provides a consolidated storage system for several applications or file systems from several computer systems. All applications that can be executed on the original file systems 110 can be executed using the backup file system 210. If backup file system 210 is used as a consolidated file system, use of a high-performance storage device (for example, solid-state drive (SSD)) to store the backup file system 210 improves the performance of all the clients 140 using the data stored in the backup file system 210. This allows the enterprise to improve performance of several clients 140 by investing in a single consolidated system instead of a large number of individual systems. For example, replacing the hard drives on several machines by SSD can be lot more expensive than replacing the drive of a single consolidated system. In some embodiments, the backup file system 210 can be used as a high-availability system that can be used by clients, for example, when the production server 115 is not available, even if there is no loss of data stored in file system 110.
[0048] In some embodiments added storage caching layers can improve the performance of the backup file system, especially if multiple blocks are shared across different systems accessing different points in time or different file systems.
[0049] The entire backup file system 210 can be copied to a conventional backup storage for performing a backup of the backup file system 210 (for example, as shown in FIG. 1). As a result, a conventional backup storage operation can also be performed for a set of file systems 110. The backup of the backup file system 210 using conventional backup techniques can be much simpler process than performing individual backups of all the file systems 110 of the various production servers 115 of an enterprise. This simplicity is
14 obtained due to the fact that a single system is being backed up and also because the amount of data being backed up is smaller since it represents only the changed blocks of data from the original file systems which can be further reduced in size by using data compression. In some embodiments the backup file system can be replicated in whole or in part to another storage device.
[0050] In some embodiments, instead of creating a VRFS structure for a client to use, the data represented by the VRFS can be exported to a different file system and the client allowed to access the exported data stored in the other file system. Exporting the VRFS data can be beneficial, for example, if the applications using the data require large amount resources that are not available in the de-duplication based backup system 200.
Computing Machine Architecture [0051] FIG. 8 is a block diagram illustrating components of an example machine able to read instructions from a machine-readable medium and execute them in a processor (or controller). Specifically, FIG. 8 shows a diagrammatic representation of a machine in the example form of a computer system 800 within which instructions 824 (e.g., software) for causing the machine to perform any one or more of the methodologies discussed herein may be executed. In alternative embodiments, the machine operates as a standalone device or may be connected (e.g., networked) to other machines. In a networked deployment, the machine may operate in the capacity of a server machine or a client machine in a server-client network environment, or as a peer machine in a peer-to-peer (or distributed) network environment.
[0052] The machine may be a server computer, a client computer, a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a cellular telephone, a smartphone, a web appliance, a network router, switch or bridge, or any machine capable of executing instructions 824 (sequential or otherwise) that specify actions to be taken by that machine. Further, while only a single machine is illustrated, the term "machine" shall also be taken to include any collection of machines that individually or jointly execute instructions 824 to perform any one or more of the methodologies discussed herein.
[0053] The example computer system 800 includes a processor 802 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), a digital signal processor (DSP), one or more application specific integrated circuits (A STCs), one or more radio-frequency integrated circuits (RFICs), or any combination of these), a main memory 804, and a static memory 806, which are configured to communicate with each other via a bus 808.
The computer system 800 may further include graphics display unit 810 (e.g., a plasma display panel (PDP), a liquid crystal display (LCD), a projector, or a cathode ray tube (CRT)). The computer system 800 may also include alphanumeric input device 812 (e.g., a keyboard), a cursor control device 814 (e.g., a mouse, a trackball, a joystick, a motion sensor, or other pointing instrument), a storage unit 816, a signal generation device 818 (e.g., a speaker), and a network interface device 820, which also are configured to communicate via the bus 808.
[0054] The storage unit 816 includes a machine-readable medium 822 on which is stored instructions 824 (e.g., software) embodying any one or more of the methodologies or functions described herein. The instructions 824 (e.g., software) may also reside, completely or at least partially, within the main memory 804 or within the processor 802 (e.g., within a processor's cache memory) during execution thereof by the computer system 800, the main memory 804 and the processor 802 also constituting machine-readable media. The instructions 824 (e.g., software) may be transmitted or received over a network 826 via the network interface device 820.
[0055] While machine-readable medium 822 is shown in an example embodiment to be a single medium, the term "machine-readable medium" should be taken to include a single medium or multiple media (e.g., a centralized or distributed database, or associated caches and servers) able to store instructions (e.g., instructions 824). The term "machine-readable medium" shall also be taken to include any medium that is capable of storing instructions (e.g., instructions 824) for execution by the machine and that cause the machine to perform any one or more of the methodologies disclosed herein. The term "machine-readable medium" includes, but not be limited to, data repositories in the form of solid-state memories, optical media, and magnetic media.
Additional Configuration Considerations [0056] The foregoing description of certain embodiments has been presented for the purpose of illustration; it is not intended to be exhaustive or limiting to the precise forms disclosed.
Persons skilled in the relevant art can appreciate that many modifications and variations are possible in light of the above disclosure.
[0057] Some portions of this description describe embodiments in terms of algorithms and symbolic representations of operations on information. These algorithmic descriptions and representations are commonly used by those skilled in the data processing arts to convey the substance of their work effectively to others skilled in the art. These operations, while described functionally, computationally, or logically, are understood to be implemented by computer programs or equivalent electrical circuits, microcode, or the like.
Furthermore, it has also proven convenient at times, to refer to these arrangements of operations as modules, without loss of generality. The described operations and their associated modules may be embodied in software, firmware, hardware, or any combinations thereof.
[0058] Any of the steps, operations, or processes described herein may be performed or implemented with one or more hardware or software modules, alone or in combination with other devices. In one embodiment, a software module is implemented with a computer program product comprising a computer-readable medium containing computer program code, which can be executed by a computer processor for performing any or all of the steps, operations, or processes described.
[0059] Embodiments may also relate to an apparatus for performing the operations herein.
This apparatus may be specially constructed for the required purposes, and/or it may comprise a general-purpose computing device selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a tangible computer readable storage medium or any type of media suitable for storing electronic instructions, and coupled to a computer system bus. Furthermore, any computing systems referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.
[0060] Embodiments may also relate to a computer data signal embodied in a carrier wave, where the computer data signal includes any embodiment of a computer program product or other data combination described herein. The computer data signal is a product that is presented in a tangible medium or carrier wave and modulated or otherwise encoded in the carrier wave, which is tangible, and transmitted according to any suitable transmission method.
[0061] Finally, the language used in the specification has been principally selected for readability and instructional purposes, and it may not have been selected to delineate or circumscribe the inventive subject matter. It is therefore intended that the scope of the invention be limited not by this detailed description, but rather by any claims that issue on an application based hereon.

Claims (34)

EMBODIMENTS IN WHICH AN EXCLUSIVE PROPERTY OR PRIVILEGE IS
CLAIMED ARE DEFINED AS FOLLOWS:
1. A method for performing backup of file systems, the method comprising:
receiving a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system that changed since a previous point-in-time copy was extracted, the source file system comprising at least a source file;
storing the set of data blocks on a storage system, the stored set of data blocks comprising one or more versions of a data block, each version corresponding to a point-in-time copy;
receiving a request to restore information obtained from the source file system for a target system; and responsive to receiving the request to restore, creating a virtual restored file system comprising a set of files including a restored file corresponding to the source file, the creating comprising:
linking the restored file to a plurality of the data blocks stored on the storage system, the plurality of data blocks comprising at least a first data block associated with a first point in time copy and a second data block associated with a second point in time copy, and mounting the set of files to the target system to allow the target system to access the set of files, the mounted set of files comprising the virtual restored file system.
2. The method of claim 1, further comprising:
sending a request to receive the one or more data blocks for a point-in-time copy of the source file system.
3. The method of claim 1, wherein the access to the set of files allows read access and write access to information stored in the plurality of stored data blocks.
4. The method of claim 1, wherein the plurality of point-in-time copies of the source file system are received based on a predetermined schedule associated with the source file system.
5. The method of claim 1, further comprising:
receiving a request to read data from the virtual restored file system;
identifying a stored data block from the plurality of stored data blocks; and sending information from the stored data block.
6. The method of claim 1, further comprising:
receiving a request to write data to the restored file of the virtual restored file system;
identifying a stored data block from the plurality of data blocks linked to the restored file;
creating a copy of the stored data block and linking the restored file to the copy of the stored data block; and updating the copy of the stored data block based on the request to write.
7. The method of claim 1, further comprising:
receiving a request to merge updates to the virtual restored file system back with the source file system; and sending stored data blocks of the plurality of stored data blocks that have been updated to the source file system.
8. The method of claim 1, wherein the request to restore information is a first request to restore information, the target system is a first target system, the virtual restored file system is a first virtual restored file system, the set of files is a first set of files, the restored file is a first restored file, and the plurality of stored data blocks is a first plurality of stored data blocks, the method further comprising:
receiving a second request to restore information obtained from the source file system, the second request identifying a second target system; and responsive to receiving the second request to restore, creating a second virtual restored file system comprising a set of files linked to data blocks of the storage system including a second restored file corresponding to the source tile, the creating comprising:
linking the second restored file to a second plurality of stored data blocks stored on the storage system, the second plurality of data blocks distinct from the first plurality of data blocks but sharing at least one data block with the first plurality of data blocks, and mounting the second set of files to the second target system to allow the second target system to access the second set of files, the mounted second set of files comprising the second virtual restored file system.
9. The method of claim 1, wherein the storage system is a first storage system, the method comprising:
receiving a request to export the virtual restored file system to a second storage system; and sending the plurality of stored data blocks to the second storage system.
10. The method of claim 1, wherein the request to restore is associated with a restore time point, wherein the plurality of stored data blocks comprise data blocks received as part of point-in-time copies of the source file system extracted before the restore time.
11. A method for performing backups of file systems, the method comprising:
receiving requests to perform backups of a plurality of source file systems, each request identifying a source file system, the source file system comprising at least a source file;
for each request received, performing backup of an associated source file system, comprising:
receiving a set of data blocks for a plurality of point-in-time copies of the associated source file system each point-in-time copy of the associated source file-system obtained by extracting one or more data blocks from the associated source file-system that changed since a previous point-in-time copy was extracted, and storing on a storage system, the set of data blocks, the stored set of data blocks comprising one or more versions of a data block, each version corresponding to a point-in-time copy; and receiving a request to restore information obtained from a particular source file system, the request to restore identifying a target system; and responsive to receiving the request to restore the particular source file system, creating a virtual restored file system comprising a set of files including a restored file corresponding to the source file, the creating comprising:
linking the restored file to a plurality of stored data blocks stored on the storage system, the plurality of data blocks comprising at least a first data block associated with a first point in time copy and a second data block associated with a second point in time copy, and mounting the set of files to the target system to allow the target system to access the set of files, the mounted set of files comprising the virtual restored file system.
12. The method of claim 11, further comprising:
sending a request to receive the one or more data blocks for a point-in-time copy of the particular source file system.
13. The method of claim 11, further comprising:
receiving a request to write data to the virtual restored file system;
identifying a stored data block from the plurality of stored data blocks; and updating the stored data block based on the request to write.
14. The method of claim 11, wherein the request to restore information is a first request to restore information, the target system is a first target system, the set of files is a first set of files, the virtual restored file system is a first virtual restored file system, and the plurality of stored data blocks is a first plurality of stored data blocks, the method further comprising:
receiving a second request to restore information obtained from the particular source file system, the request identifying a second target system; and responsive to receiving the second request to restore, creating a second virtual restored file system comprising a set of files linked to data blocks of the storage system including a second restored file corresponding to the source file, the creating comprising:
linking the second restored file to a second plurality of stored data blocks stored on the storage system, the second plurality of data blocks distinct from the first plurality of data blocks but sharing at least one data block with the first plurality of data blocks, and mounting the second set of files to the second target system to allow the second target system to access the second set of files, the mounted second set of files comprising the second virtual restored file system.
15. The method of claim 14, wherein one or more of the stored data blocks associated with the first set of files are also associated with the second set of files.
16. The method of claim 14, further comprising:
receiving a request to write data to the restored file of the first virtual restored file system;
identifying a stored data block from the first plurality of stored data blocks linked to the restored file;

responsive to determining that the stored data block is associated with the second virtual restored file system, creating a copy of the stored data block and linking the restored file to the copy of the stored data block; and updating the copy of the stored data block based on the request to write.
17. The method of claim 11, wherein the request to restore is associated with a restore time point, wherein the plurality of stored data blocks comprise data blocks received as part of point-in-time copies extracted before the restore time.
18. A computer-readable storage medium storing computer-executable instructions that when executed by at least one processor cause the at least one processor to backup a file system by:
receiving a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system that changed since a previous point-in-time copy was extracted, the source file system comprising at least a source file;
storing on a storage system, the set of data blocks;
receiving a request to restore information from the source file system for a target system; and responsive to receiving the request to restore, creating a virtual restored file system comprising a set of files including a restored file corresponding to the source file, the creating comprising:
linking the restored file to a plurality of the data blocks stored on the storage system, the plurality of data blocks comprising at least a first data block associated with a first point in time copy and a second data block associated with a second point in time copy, and mounting the set of files to the target system to allow the target system to access the set of files, the mounted set of files comprising the virtual restored file system.
19. The computer readable medium of claim 18, wherein the instructions further cause the at least one processor to:
receive a request to write data to the restored file of the virtual restored file system;
identify a stored data block from the plurality of data blocks linked to the restored file;
create a copy of the stored data block and linking the restored file to the copy of the stored data block; and update the copy of the stored data block based on the request to write.
20. The computer readable medium of claim 18, wherein the request to restore information is a first request to restore information, the target system is a first target system, the virtual restored file system is a first virtual restored file system, the set of files is a first set of files, the restored file is a first restored file, and the plurality of stored data blocks is a first plurality of stored data blocks, and wherein the instructions further cause the at least one processor to:
receive a second request to restore information obtained from the source file system, the second request identifying a second target system; and responsive to receiving the second request to restore, create a second virtual restored file system comprising a second set of files linked to data blocks of the storage system including a second restored file corresponding to the source file, the creating comprising:
linking the second restored file to a second plurality of stored data blocks stored on the storage system, the second plurality of data blocks distinct from the first plurality of data blocks but sharing at least one data block with the first plurality of data blocks, and mounting the second set of files to the second target system to allow the second target system to access the second set of files, the mounted second set of files comprising the second virtual restored file system.
21. The computer readable medium of claim 18, wherein the instructions further cause the at least one processor to associate the request to restore with a restore time point, wherein the plurality of stored data blocks comprise data blocks received as part of point-in-time copies extracted before the restore time.
22. A method for performing backup of file systems, the method comprising:
receiving, by a storage system, a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system, the source file system comprising at least a source file;
storing, by the storage system, the received set of data blocks, wherein one or more stored data blocks associated with a point-in-time copy arc shared with other point-in-time copies, wherein the storage system transforms data of a received data block to a format distinct from a format of the received data block and stores the transformed data;
receiving a first request to restore information associated with a first point-in-time copy for a first target system;

responsive to receiving the request to restore information obtained from the source file system, provisioning a first virtual restored file system comprising a set of files including a restored file corresponding to the source file, the restored file comprising a plurality of stored data blocks, the plurality of stored data blocks comprising stored data blocks associated with two or more distinct point-in-time copies, the provisioning comprising:
mounting the set of files to the first target system to provide read and write access to the set of files, the mounted set of files comprising the first virtual restored file system;
receiving a second request to restore information associated with a second point-in-time copy for a second target system; and responsive to receiving the second request, provisioning a second virtual restored file system, wherein the second virtual restored file system shares one or more data blocks with the first virtual restored tile system.
23. The method of claim 22, wherein the plurality of point-in-time copies of the source file system are received based on a predetermined schedule associated with the source file system.
24. The method of claim 22, wherein the received data blocks comprise changed blocks from the source file system since a previous point-in-time copy was obtained.
25. The method of claim 22, further comprising:
receiving a request to write data to the first virtual restored file system;
identifying a stored data block associated with the first virtual restored file system;
creating a copy of the stored data block and associating the stored data block with the first virtual restored file system; and updating the copy of the stored data block based on the request to write.
26. The method of claim 22, wherein the storage system is a first storage system, and wherein the method further comprises:
receiving a request to export the first virtual restored file system to a second storage system; and sending the plurality of stored data blocks to the second storage system.
27. The method of claim 22, wherein transforming the data of a received data block comprises compressing data stored in the received data block.
28. A computer-readable storage medium storing computer-executable code comprising instructions that when executed by at least one processor cause the at least one processor to perform a backup of file systems, by causing the at least one processor to:
implement a block copy manager configured to:
receive a set of data blocks for a plurality of point-in-time copies of a source file system, each point-in-time copy of the source file-system obtained by extracting one or more data blocks from the source file-system, the source file system comprising at least a source file;
implement a storage allocation manager module configured to:
store the received set of data blocks on a storage system, wherein one or more stored data blocks associated with a point-in-time copy are shared with other point-in-time copies, wherein instructions to store comprise instructions to: transform data of a received data block to a format distinct from a format of the received data block, and store the transformed data;

implement a virtual restored file system manager configured to:
receive a first request to restore information associated with a first point-in-time copy for a first target system;
responsive to receiving the request to restore information obtained from the source file system, provision a first virtual restored file system comprising a set of files including a restored file corresponding to the source file, the restored file comprising a plurality of stored data blocks, the plurality of stored data blocks comprising stored data blocks associated with two or more distinct point-in-time copies, wherein provisioning comprises:
mounting the set of files to the first target system to provide read and write access to the set of files, the mounted set of files comprising the first virtual restored file system;
receive a second request to restore information associated with a second point-in-time copy for a second target system; and responsive to receiving the second request, provision a second virtual restored file system, wherein the second virtual restored file system shares one or more data blocks with the first virtual restored file system.
29. The computer-readable storage medium of claim 28, wherein the instructions cause the at least one processor to cause the plurality of point-in-time copies of the source file system to be received based on a predetermined schedule associated with the source file system.
30. The computer-readable storage medium of claim 28, wherein the instructions cause the at least one processor to cause the received data blocks to comprise changed blocks from the source file system since a previous point-in-time copy was obtained.
31. The computer-readable storage medium of claim 28, wherein the instructions that cause the at least one processor to implement the virtual restored file system manager include instructions that cause the at least one processor to:
receive a request to write data to the first virtual restored file system;
identify a stored data block associated with the first virtual restored file system;
create a copy of the stored data block and associating the stored data block with the first virtual restored file system; and update the copy of the stored data block based on the request to write.
32. The computer-readable storage medium of claim 28, wherein the storage system is a first storage system and wherein the instructions that cause the at least one processor to implement the virtual restored file system manager include instructions that cause the at least one processor to:
receive a request to export the first virtual restored file system to a second storage system; and send the plurality of stored data blocks to the second storage system.
33. The computer-readable storage medium of claim 28, wherein transforming the data of a received data block comprises compressing data stored in the received data block.
34. A computer-implemented system for performing backup of file systems, the system comprising:
one or more computer processors in communication with the computer-readable storage medium of any one of claims 28 - 33; and configured to execute the instructions stored on said computer-readable storage medium on the one or more computer processors, to back up a file system.
CA2804372A 2010-07-15 2011-07-15 De-duplication based backup of file systems Active CA2804372C (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US36465210P 2010-07-15 2010-07-15
US61/364,652 2010-07-15
US13/183,131 US8548944B2 (en) 2010-07-15 2011-07-14 De-duplication based backup of file systems
US13/183,131 2011-07-14
PCT/US2011/044209 WO2012009650A1 (en) 2010-07-15 2011-07-15 De-duplication based backup of file systems

Publications (2)

Publication Number Publication Date
CA2804372A1 CA2804372A1 (en) 2012-01-19
CA2804372C true CA2804372C (en) 2017-09-26

Family

ID=45467721

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2804372A Active CA2804372C (en) 2010-07-15 2011-07-15 De-duplication based backup of file systems

Country Status (6)

Country Link
US (2) US8548944B2 (en)
EP (1) EP2593858B1 (en)
CN (1) CN103098016B (en)
AU (1) AU2011278970B2 (en)
CA (1) CA2804372C (en)
WO (1) WO2012009650A1 (en)

Families Citing this family (102)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8484162B2 (en) 2008-06-24 2013-07-09 Commvault Systems, Inc. De-duplication systems and methods for application-specific data
US8930306B1 (en) 2009-07-08 2015-01-06 Commvault Systems, Inc. Synchronized data deduplication
US8578109B2 (en) 2010-09-30 2013-11-05 Commvault Systems, Inc. Systems and methods for retaining and using data block signatures in data protection operations
US8577851B2 (en) 2010-09-30 2013-11-05 Commvault Systems, Inc. Content aligned block-based deduplication
US8843489B2 (en) 2010-11-16 2014-09-23 Actifio, Inc. System and method for managing deduplicated copies of data using temporal relationships among copies
US8417674B2 (en) 2010-11-16 2013-04-09 Actifio, Inc. System and method for creating deduplicated copies of data by sending difference data between near-neighbor temporal states
US8402004B2 (en) 2010-11-16 2013-03-19 Actifio, Inc. System and method for creating deduplicated copies of data by tracking temporal relationships among copies and by ingesting difference data
US9858155B2 (en) 2010-11-16 2018-01-02 Actifio, Inc. System and method for managing data with service level agreements that may specify non-uniform copying of data
US8904126B2 (en) 2010-11-16 2014-12-02 Actifio, Inc. System and method for performing a plurality of prescribed data management functions in a manner that reduces redundant access operations to primary storage
US9020900B2 (en) 2010-12-14 2015-04-28 Commvault Systems, Inc. Distributed deduplicated storage system
US8954446B2 (en) 2010-12-14 2015-02-10 Comm Vault Systems, Inc. Client-side repository in a networked deduplicated storage system
US8745003B1 (en) 2011-05-13 2014-06-03 Emc Corporation Synchronization of storage using comparisons of fingerprints of blocks
US8782003B1 (en) * 2011-05-13 2014-07-15 Emc Corporation Synchronization of storage using log files and snapshots
US9311327B1 (en) 2011-06-30 2016-04-12 Emc Corporation Updating key value databases for virtual backups
US8849777B1 (en) * 2011-06-30 2014-09-30 Emc Corporation File deletion detection in key value databases for virtual backups
US9158632B1 (en) 2011-06-30 2015-10-13 Emc Corporation Efficient file browsing using key value databases for virtual backups
US8843443B1 (en) 2011-06-30 2014-09-23 Emc Corporation Efficient backup of virtual data
US8949829B1 (en) 2011-06-30 2015-02-03 Emc Corporation Virtual machine disaster recovery
US8671075B1 (en) * 2011-06-30 2014-03-11 Emc Corporation Change tracking indices in virtual machines
US8849769B1 (en) 2011-06-30 2014-09-30 Emc Corporation Virtual machine file level recovery
US9229951B1 (en) 2011-06-30 2016-01-05 Emc Corporation Key value databases for virtual backups
WO2013019869A2 (en) 2011-08-01 2013-02-07 Actifio, Inc. Data fingerpringting for copy accuracy assurance
US9477739B2 (en) * 2011-09-23 2016-10-25 Hybrid Logic Ltd System for live-migration and automated recovery of applications in a distributed system
US10311027B2 (en) 2011-09-23 2019-06-04 Open Invention Network, Llc System for live-migration and automated recovery of applications in a distributed system
GB2495079A (en) 2011-09-23 2013-04-03 Hybrid Logic Ltd Live migration of applications and file systems in a distributed system
US10331801B2 (en) 2011-09-23 2019-06-25 Open Invention Network, Llc System for live-migration and automated recovery of applications in a distributed system
US9547705B2 (en) * 2011-09-23 2017-01-17 Hybrid Logic Ltd System for live-migration and automated recovery of applications in a distributed system
US8949197B2 (en) 2011-10-31 2015-02-03 Oracle International Corporation Virtual full backups
US8874522B2 (en) * 2011-10-31 2014-10-28 Symantec Corporation Managing backups of data objects in containers
US9075810B2 (en) * 2012-01-24 2015-07-07 Symantec Corporation Volume and partial volume merge to synchronize to non-homogeneous drive layouts
US20130227352A1 (en) 2012-02-24 2013-08-29 Commvault Systems, Inc. Log monitoring
US20130339310A1 (en) 2012-06-13 2013-12-19 Commvault Systems, Inc. Restore using a client side signature repository in a networked storage system
CA2877284A1 (en) 2012-06-18 2013-12-27 Actifio, Inc. Enhanced data management virtualization system
US9529808B1 (en) 2012-07-16 2016-12-27 Tintri Inc. Efficient and flexible organization and management of file metadata
US9152325B2 (en) 2012-07-26 2015-10-06 International Business Machines Corporation Logical and physical block addressing for efficiently storing data
US9098513B1 (en) * 2012-08-27 2015-08-04 Trend Micro Incorporated Methods and systems for differencing orderly dependent files
US9665591B2 (en) 2013-01-11 2017-05-30 Commvault Systems, Inc. High availability distributed deduplicated storage system
US9286163B2 (en) 2013-01-14 2016-03-15 International Business Machines Corporation Data recovery scheme based on data backup status
WO2014125582A1 (en) * 2013-02-13 2014-08-21 株式会社日立製作所 Storage device and data management method
US9817835B2 (en) 2013-03-12 2017-11-14 Tintri Inc. Efficient data synchronization for storage containers
US9606729B2 (en) * 2013-03-15 2017-03-28 Skyera, Llc Apparatus and method for insertion and deletion in multi-dimensional to linear address space translation
AU2014265979A1 (en) 2013-05-14 2015-12-10 Actifio, Inc. Efficient data replication and garbage collection predictions
US9110847B2 (en) * 2013-06-24 2015-08-18 Sap Se N to M host system copy
JP6248435B2 (en) * 2013-07-04 2017-12-20 富士通株式会社 Storage device and storage device control method
KR101693683B1 (en) * 2013-07-09 2017-01-06 델픽스 코퍼레이션 Virtual database rewind
US20150081644A1 (en) * 2013-07-16 2015-03-19 Openpeak Inc. Method and system for backing up and restoring a virtual file system
US9053216B1 (en) 2013-08-09 2015-06-09 Datto, Inc. CPU register assisted virtual machine screenshot capture timing apparatuses, methods and systems
US10628378B2 (en) 2013-09-03 2020-04-21 Tintri By Ddn, Inc. Replication of snapshots and clones
US9158658B2 (en) * 2013-10-15 2015-10-13 International Business Machines Corporation Detecting merge conflicts and compilation errors in a collaborative integrated development environment
US20150142748A1 (en) 2013-11-18 2015-05-21 Actifio, Inc. Computerized methods and apparatus for data cloning
US10146784B2 (en) * 2014-01-02 2018-12-04 International Business Machines Corporation Defragmenting files having file blocks in multiple point-in-time copies
US10146783B2 (en) 2014-01-02 2018-12-04 International Business Machines Corporation Using file element accesses to select file elements in a file system to defragment
CN103761328B (en) * 2014-01-29 2017-02-15 上海网达软件股份有限公司 In-process virtual file system based on single database and single file
US9720778B2 (en) 2014-02-14 2017-08-01 Actifio, Inc. Local area network free data movement
US10380072B2 (en) 2014-03-17 2019-08-13 Commvault Systems, Inc. Managing deletions from a deduplication database
US9633056B2 (en) 2014-03-17 2017-04-25 Commvault Systems, Inc. Maintaining a deduplication database
US9792187B2 (en) 2014-05-06 2017-10-17 Actifio, Inc. Facilitating test failover using a thin provisioned virtual machine created from a snapshot
US9594636B2 (en) 2014-05-30 2017-03-14 Datto, Inc. Management of data replication and storage apparatuses, methods and systems
US9852026B2 (en) 2014-08-06 2017-12-26 Commvault Systems, Inc. Efficient application recovery in an information management system based on a pseudo-storage-device driver
US11249858B2 (en) 2014-08-06 2022-02-15 Commvault Systems, Inc. Point-in-time backups of a production application made accessible over fibre channel and/or ISCSI as data sources to a remote application by representing the backups as pseudo-disks operating apart from the production application and its host
CN105447033B (en) * 2014-08-28 2019-06-11 国际商业机器公司 The method and apparatus of initial copy are generated in duplication initialization
WO2016044403A1 (en) 2014-09-16 2016-03-24 Mutalik, Madhav Copy data techniques
US10379963B2 (en) 2014-09-16 2019-08-13 Actifio, Inc. Methods and apparatus for managing a large-scale environment of copy data management appliances
US9575673B2 (en) 2014-10-29 2017-02-21 Commvault Systems, Inc. Accessing a file system using tiered deduplication
US9639701B1 (en) * 2015-03-31 2017-05-02 EMC IP Holding Company LLC Scheduling data protection operations based on data activity
US9934265B2 (en) 2015-04-09 2018-04-03 Commvault Systems, Inc. Management of log data
US10339106B2 (en) 2015-04-09 2019-07-02 Commvault Systems, Inc. Highly reusable deduplication database after disaster recovery
US10001934B2 (en) * 2015-05-08 2018-06-19 Ricoh Company, Ltd. Information processing apparatus, information processing system, and information processing method
US20160350391A1 (en) 2015-05-26 2016-12-01 Commvault Systems, Inc. Replication using deduplicated secondary copy data
US9766825B2 (en) 2015-07-22 2017-09-19 Commvault Systems, Inc. Browse and restore for block-level backups
KR102294669B1 (en) 2015-08-12 2021-08-27 삼성전자 주식회사 Electronic device for compressing data and method thereof
US10496598B2 (en) * 2015-09-29 2019-12-03 Blackberry Limited Data access control based on storage validation
US9935973B2 (en) 2015-12-16 2018-04-03 Carbonite, Inc. Systems and methods for automatic detection of malicious activity via common files
US10083299B2 (en) * 2015-12-16 2018-09-25 Carbonite, Inc. Systems and methods for automatic snapshotting of backups based on malicious modification detection
US10061663B2 (en) 2015-12-30 2018-08-28 Commvault Systems, Inc. Rebuilding deduplication data in a distributed deduplication data storage system
US10430434B2 (en) 2016-01-20 2019-10-01 Delphix Corporation Managing transformed snapshots in a storage system
US10296368B2 (en) 2016-03-09 2019-05-21 Commvault Systems, Inc. Hypervisor-independent block-level live browse for access to backed up virtual machine (VM) data and hypervisor-free file-level recovery (block-level pseudo-mount)
US10846024B2 (en) 2016-05-16 2020-11-24 Commvault Systems, Inc. Global de-duplication of virtual disks in a storage platform
US10795577B2 (en) 2016-05-16 2020-10-06 Commvault Systems, Inc. De-duplication of client-side data cache for virtual disks
US11436088B2 (en) * 2016-08-10 2022-09-06 Netapp, Inc. Methods for managing snapshots in a distributed de-duplication system and devices thereof
US11347691B2 (en) * 2016-08-10 2022-05-31 Netapp, Inc. Methods for managing storage in a distributed de-duplication system and devices thereof
US10585760B2 (en) 2016-10-19 2020-03-10 International Business Machines Corporation File name level based file search and restoration from block level backups of virtual machines
CN112068992A (en) * 2016-10-28 2020-12-11 华为技术有限公司 Remote data copying method, storage device and storage system
US10740193B2 (en) 2017-02-27 2020-08-11 Commvault Systems, Inc. Hypervisor-independent reference copies of virtual machine payload data based on block-level pseudo-mount
US11550665B2 (en) 2017-06-02 2023-01-10 Apple Inc. Techniques for preserving clone relationships between files
US11449389B2 (en) 2017-06-02 2022-09-20 Apple Inc. Techniques for performing incremental data backups
US10664352B2 (en) 2017-06-14 2020-05-26 Commvault Systems, Inc. Live browsing of backed up data residing on cloned disks
US10333631B2 (en) * 2017-08-24 2019-06-25 Rohde & Schwarz Gmbh & Co. Kg Test arrangement and test method
TWI691860B (en) * 2018-10-23 2020-04-21 財團法人工業技術研究院 Method and computer system for preventing malicious software from attacking files of the computer system and corresponding non-transitory computer readable storage medium
CN109542681B (en) * 2018-11-15 2021-11-26 郑州云海信息技术有限公司 Time navigation system based on file backup, and visual data recovery method and device
US11010258B2 (en) 2018-11-27 2021-05-18 Commvault Systems, Inc. Generating backup copies through interoperability between components of a data storage management system and appliances for data storage and deduplication
US11698727B2 (en) 2018-12-14 2023-07-11 Commvault Systems, Inc. Performing secondary copy operations based on deduplication performance
US20200327017A1 (en) 2019-04-10 2020-10-15 Commvault Systems, Inc. Restore using deduplicated secondary copy data
US11455281B2 (en) * 2019-04-19 2022-09-27 EMC IP Holding Company LLC Generating and morphing a collection of files in a folder/sub-folder structure that collectively has desired dedupability, compression, clustering and commonality
US11100064B2 (en) 2019-04-30 2021-08-24 Commvault Systems, Inc. Automated log-based remediation of an information management system
US11463264B2 (en) 2019-05-08 2022-10-04 Commvault Systems, Inc. Use of data block signatures for monitoring in an information management system
US20210173811A1 (en) 2019-12-04 2021-06-10 Commvault Systems, Inc. Optimizing the restoration of deduplicated data stored in multi-node replicated file systems
CN111090824B (en) * 2019-12-23 2023-09-19 百度国际科技(深圳)有限公司 Content processing method and device
US11645161B2 (en) * 2020-03-26 2023-05-09 Hewlett Packard Enterprise Development Lp Catalog of files associated with snapshots
US11687424B2 (en) 2020-05-28 2023-06-27 Commvault Systems, Inc. Automated media agent state management
US11574050B2 (en) 2021-03-12 2023-02-07 Commvault Systems, Inc. Media agent hardening against ransomware attacks
US20220398232A1 (en) * 2021-06-14 2022-12-15 Microsoft Technology Licensing, Llc Versioned metadata using virtual databases

Family Cites Families (89)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4853843A (en) 1987-12-18 1989-08-01 Tektronix, Inc. System for merging virtual partitions of a distributed database
US6970434B1 (en) 1995-06-07 2005-11-29 Broadcom Corporation Hierarchical communication system providing intelligent data, program and processing migration
US5680618A (en) 1993-05-26 1997-10-21 Borland International, Inc. Driver query and substitution for format independent native data access
JP3751018B2 (en) 1993-06-03 2006-03-01 ネットワーク・アプライアンス・インコーポレイテッド LightAnywhere file system layout
US5680608A (en) 1995-02-06 1997-10-21 International Business Machines Corporation Method and system for avoiding blocking in a data processing system having a sort-merge network
US5634053A (en) 1995-08-29 1997-05-27 Hughes Aircraft Company Federated information management (FIM) system and method for providing data site filtering and translation for heterogeneous databases
US5842222A (en) 1996-10-04 1998-11-24 Taiwan Semiconductor Manufacturing Company, Ltd. Production information system enhanced for availability
US6304882B1 (en) 1998-05-05 2001-10-16 Informix Software, Inc. Data replication system and method
JP2000047919A (en) 1998-07-30 2000-02-18 Hitachi Ltd Virtual database replication system
US7197491B1 (en) 1999-09-21 2007-03-27 International Business Machines Corporation Architecture and implementation of a dynamic RMI server configuration hierarchy to support federated search and update across heterogeneous datastores
US6557012B1 (en) 2000-04-22 2003-04-29 Oracle Corp System and method of refreshing and posting data between versions of a database table
US6523036B1 (en) 2000-08-01 2003-02-18 Dantz Development Corporation Internet database system
US7072916B1 (en) 2000-08-18 2006-07-04 Network Appliance, Inc. Instant snapshot
US7512673B2 (en) 2001-01-11 2009-03-31 Attune Systems, Inc. Rule based aggregation of files and transactions in a switched file system
US20020143764A1 (en) 2001-04-03 2002-10-03 Martin Andrew R. Data management system and method for intercepting and changing database instructions between a database back end and an application front end
DE50101548D1 (en) 2001-05-17 2004-04-01 Presmar Peter Virtual database of heterogeneous data structures
US7373364B1 (en) 2002-03-05 2008-05-13 Network Appliance, Inc. System and method for creating a point-in-time restoration of a database file
US7225204B2 (en) 2002-03-19 2007-05-29 Network Appliance, Inc. System and method for asynchronous mirroring of snapshots at a destination using a purgatory directory and inode mapping
US7340489B2 (en) 2002-04-10 2008-03-04 Emc Corporation Virtual storage devices
JP2003316522A (en) 2002-04-26 2003-11-07 Hitachi Ltd Computer system and method for controlling the same system
US7107385B2 (en) 2002-08-09 2006-09-12 Network Appliance, Inc. Storage virtualization by layering virtual disk objects on a file system
JP4124331B2 (en) 2002-09-17 2008-07-23 株式会社日立製作所 Virtual volume creation and management method for DBMS
US7243093B2 (en) 2002-11-27 2007-07-10 International Business Machines Corporation Federated query management
US6883083B1 (en) 2002-12-20 2005-04-19 Veritas Operating Corporation System and method for maintaining and accessing information regarding virtual storage devices
WO2004064441A1 (en) 2003-01-14 2004-07-29 Telefonaktiebolaget Lm Ericsson (Publ) Resource allocation management
US7457982B2 (en) 2003-04-11 2008-11-25 Network Appliance, Inc. Writable virtual disk of read-only snapshot file objects
US7269607B2 (en) 2003-09-29 2007-09-11 International Business Machines Coproartion Method and information technology infrastructure for establishing a log point for automatic recovery of federated databases to a prior point in time
US7346923B2 (en) 2003-11-21 2008-03-18 International Business Machines Corporation Federated identity management within a distributed portal server
US7334095B1 (en) 2004-04-30 2008-02-19 Network Appliance, Inc. Writable clone of read-only volume
US7334094B2 (en) 2004-04-30 2008-02-19 Network Appliance, Inc. Online clone volume splitting technique
US7409511B2 (en) 2004-04-30 2008-08-05 Network Appliance, Inc. Cloning technique for efficiently creating a copy of a volume in a storage system
US7953749B2 (en) 2004-05-11 2011-05-31 Oracel International Corporation Providing the timing of the last committed change to a row in a database table
US7653665B1 (en) 2004-09-13 2010-01-26 Microsoft Corporation Systems and methods for avoiding database anomalies when maintaining constraints and indexes in presence of snapshot isolation
US20060106893A1 (en) * 2004-11-02 2006-05-18 Rodger Daniels Incremental backup operations in storage networks
GB0428108D0 (en) 2004-12-23 2005-01-26 Ibm Storage system with multiple copy targeting
US7757056B1 (en) 2005-03-16 2010-07-13 Netapp, Inc. System and method for efficiently calculating storage required to split a clone volume
US7631021B2 (en) 2005-03-25 2009-12-08 Netapp, Inc. Apparatus and method for data replication at an intermediate node
US7539836B1 (en) 2005-04-18 2009-05-26 Netapp, Inc. Method and system for configuring a data storage object
US9152823B2 (en) 2005-04-22 2015-10-06 Storagecraft Technology Corporation Systems, methods, and computer readable media for computer data protection
US7822758B1 (en) 2005-04-22 2010-10-26 Network Appliance, Inc. Method and apparatus for restoring a data set
CA2613419C (en) 2005-06-24 2014-12-02 Syncsort Incorporated System and method for virtualizing backup images
US20070260628A1 (en) 2006-05-02 2007-11-08 Tele Atlas North America, Inc. System and method for providing a virtual database environment and generating digital map information
US20080037553A1 (en) 2005-12-22 2008-02-14 Bellsouth Intellectual Property Corporation Systems and methods for allocating bandwidth to ports in a computer network
JP4822889B2 (en) 2006-03-20 2011-11-24 富士通株式会社 Database integrated reference program, database integrated reference method, and database integrated reference device
US7747831B2 (en) 2006-03-20 2010-06-29 Emc Corporation High efficiency portable archive and data protection using a virtualization layer
US7590660B1 (en) 2006-03-21 2009-09-15 Network Appliance, Inc. Method and system for efficient database cloning
CA2648428C (en) 2006-04-07 2017-11-21 Data Storage Group Data compression and storage techniques
US7653794B2 (en) 2006-05-08 2010-01-26 Microsoft Corporation Converting physical machines to virtual machines
US7809769B2 (en) 2006-05-18 2010-10-05 Google Inc. Database partitioning by virtual partitions
US7849114B2 (en) 2006-06-19 2010-12-07 International Business Machines Corporation Method, system, and program product for generating a virtual database
US7921077B2 (en) 2006-06-29 2011-04-05 Netapp, Inc. System and method for managing data deduplication of storage systems utilizing persistent consistency point images
US7587563B1 (en) 2006-07-11 2009-09-08 Network Appliance, Inc. Method and system to make a read-only file system appear to be writeable
US7856424B2 (en) 2006-08-04 2010-12-21 Apple Inc. User interface for backup management
US8311988B2 (en) 2006-08-04 2012-11-13 Apple Inc. Consistent back up of electronic information
JP5068062B2 (en) 2006-10-30 2012-11-07 インターナショナル・ビジネス・マシーンズ・コーポレーション System, method, and program for integrating databases
US7827366B1 (en) 2006-10-31 2010-11-02 Network Appliance, Inc. Method and system for providing continuous and long-term data protection for a dataset in a storage system
US8255915B1 (en) 2006-10-31 2012-08-28 Hewlett-Packard Development Company, L.P. Workload management for computer system with container hierarchy and workload-group policies
US20080104146A1 (en) 2006-10-31 2008-05-01 Rebit, Inc. System for automatically shadowing encrypted data and file directory structures for a plurality of network-connected computers using a network-attached memory with single instance storage
US20080154989A1 (en) 2006-12-21 2008-06-26 Boxicom, Inc. (Dba 3X Systems) Data backup system and method associated therewith
US8935206B2 (en) 2007-01-31 2015-01-13 Hewlett-Packard Development Company, L.P. Snapshots in distributed storage systems
WO2008121873A1 (en) 2007-03-29 2008-10-09 Vmware, Inc. Synchronization and customization of a clone computer
US7760643B2 (en) 2007-04-09 2010-07-20 Telcordia Technologies, Inc. Automatic policy change management scheme for DiffServ-enabled MPLS networks
US7953946B2 (en) 2007-04-16 2011-05-31 Microsoft Corporation Controlled anticipation in creating a shadow copy
US8775663B1 (en) 2007-04-25 2014-07-08 Netapp, Inc. Data replication network traffic compression
US8010900B2 (en) 2007-06-08 2011-08-30 Apple Inc. User interface for electronic backup
US7669023B2 (en) 2007-07-10 2010-02-23 Hitachi, Ltd. Power efficient storage with data de-duplication
WO2009032711A1 (en) 2007-08-29 2009-03-12 Nirvanix, Inc. Policy-based file management for a storage delivery network
US20090132616A1 (en) 2007-10-02 2009-05-21 Richard Winter Archival backup integration
US7877357B1 (en) 2007-10-12 2011-01-25 Netapp, Inc. Providing a simulated dynamic image of a file system
US7996636B1 (en) 2007-11-06 2011-08-09 Netapp, Inc. Uniquely identifying block context signatures in a storage volume hierarchy
US8082234B2 (en) 2007-11-19 2011-12-20 Teradata Us, Inc. Closed-loop system management method and process capable of managing workloads in a multi-system database environment
CN101441582A (en) * 2007-11-20 2009-05-27 北京和力记易科技有限公司 Backup and recovery method of computer data object and system and program product for implementing the same
US7779051B2 (en) 2008-01-02 2010-08-17 International Business Machines Corporation System and method for optimizing federated and ETL'd databases with considerations of specialized data structures within an environment having multidimensional constraints
US20090177697A1 (en) 2008-01-08 2009-07-09 International Business Machines Corporation Correlation and parallelism aware materialized view recommendation for heterogeneous, distributed database systems
WO2009098909A1 (en) 2008-02-04 2009-08-13 Nec Corporation Virtual appliance assignment system
CN101286127B (en) * 2008-05-08 2010-06-02 华中科技大学 Multi-fork diary memory continuous data protecting and restoration method
US8532973B1 (en) 2008-06-27 2013-09-10 Netapp, Inc. Operating a storage server on a virtual machine
US8037032B2 (en) 2008-08-25 2011-10-11 Vmware, Inc. Managing backups using virtual machines
US8311985B2 (en) 2008-09-16 2012-11-13 Quest Software, Inc. Remote backup and restore system and method
US9542222B2 (en) 2008-11-14 2017-01-10 Oracle International Corporation Resource broker system for dynamically deploying and managing software services in a virtual environment based on resource usage and service level agreement
US20100131959A1 (en) 2008-11-26 2010-05-27 Spiers Adam Z Proactive application workload management
US8452930B2 (en) * 2009-03-27 2013-05-28 Hitachi, Ltd. Methods and apparatus for backup and restore of thin provisioning volume
US8280858B2 (en) 2009-06-29 2012-10-02 Oracle America, Inc. Storage pool scrubbing with concurrent snapshots
US10120767B2 (en) 2009-07-15 2018-11-06 Idera, Inc. System, method, and computer program product for creating a virtual database
US8341119B1 (en) 2009-09-14 2012-12-25 Netapp, Inc. Flexible copies having different sub-types
US8161077B2 (en) 2009-10-21 2012-04-17 Delphix Corp. Datacenter workflow automation scenarios using virtual databases
US8150808B2 (en) 2009-10-21 2012-04-03 Delphix Corp. Virtual database system
US9106591B2 (en) 2009-12-24 2015-08-11 Delphix Corporation Adaptive resource management using survival minimum resources for low priority consumers
US8244685B2 (en) * 2010-02-24 2012-08-14 Autonomy, Inc. Data restoration utilizing forward and backward deltas

Also Published As

Publication number Publication date
CN103098016A (en) 2013-05-08
US8548944B2 (en) 2013-10-01
AU2011278970A1 (en) 2013-02-21
US9514140B2 (en) 2016-12-06
WO2012009650A1 (en) 2012-01-19
EP2593858A4 (en) 2014-10-08
EP2593858B1 (en) 2016-04-13
CA2804372A1 (en) 2012-01-19
EP2593858A1 (en) 2013-05-22
AU2011278970B2 (en) 2015-02-12
US20120016839A1 (en) 2012-01-19
CN103098016B (en) 2015-03-25
US20140025637A1 (en) 2014-01-23

Similar Documents

Publication Publication Date Title
CA2804372C (en) De-duplication based backup of file systems
US11263173B2 (en) Transaction log index generation in an enterprise backup system
US11500669B2 (en) Live recovery of virtual machines in a public cloud computing environment
US10540236B2 (en) System and method for multi-hop data backup
US10642633B1 (en) Intelligent backups with dynamic proxy in virtualized environment
US10114706B1 (en) Backup and recovery of raw disks [RDM] in virtual environment using snapshot technology
EP3367233B1 (en) Datacenter workflow automation scenarios using virtual databases
US9495435B2 (en) System and method for intelligent database backup
KR101617339B1 (en) Virtual database system
US7523277B1 (en) Transient point-in-time images for continuous data protection
US20130325810A1 (en) Creation and expiration of backup objects in block-level incremental-forever backup systems
US10324807B1 (en) Fast native file system creation for backup files on deduplication systems
US20220358087A1 (en) Technique for creating an in-memory compact state of snapshot metadata
US11853267B2 (en) Uninterrupted file level backup operation using a virtual object file queue
US20220382647A1 (en) Leveraging metadata of a deduplication storage system to perform an efficient restore of backup data

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20160316