WO2000008563A1 - Redundant, asymmetrically parallel disk cache for a data storage system - Google Patents
Redundant, asymmetrically parallel disk cache for a data storage system
- Publication number
- WO2000008563A1 (application PCT/US1999/015830)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cache
- disk
- data
- write
- backup
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11C—STATIC STORES
- G11C29/00—Checking stores for correct operation ; Subsequent repair; Testing stores during standby or offline operation
- G11C29/70—Masking faults in memories by using spares or by reconfiguring
- G11C29/74—Masking faults in memories by using spares or by reconfiguring using duplex memories, i.e. using dual copies
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/1666—Error detection or correction of the data by redundancy in hardware where the redundant component is memory or memory area
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0866—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches for peripheral storage systems, e.g. disk cache
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/22—Employing cache memory using specific memory technology
- G06F2212/222—Non-volatile memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/31—Providing disk cache in a specific location of a storage system
- G06F2212/312—In storage controller
Definitions
- the invention relates to the field of redundant data storage systems, and in particular to a data storage system architecture having a redundant, asymmetrical disk cache.
- RAID redundant array of independent/inexpensive disks
- the RAID system architecture provides a large amount of data storage in a reasonably reliable manner.
- RAID system architectures are set forth in the paper entitled “A Case for Redundant Arrays of Inexpensive Disks (RAID)”, Patterson et al., Proc. ACM SIGMOD, June 1988. These architectures include RAID-1, RAID-2, RAID-3, RAID-4 and RAID-5.
- U.S. Patent 5,526,482 entitled “Storage Device Array Architecture With Copyback Cache” briefly discusses each of these architectures, and in the interest of brevity a similar discussion will not be repeated herein.
- each of the RAID system architectures includes a plurality of disks that are controlled by a RAID disk controller.
- a central processing unit CPU
- the controller directs how the information shall be stored on the plurality of disks to ensure that a failure of any one of the disks will not cause the system to lose information.
- NVRAM cache, e.g., battery backed RAM
- dual-copy caches i.e., a primary cache and a backup cache
- a failure in one cache does not cause the loss of data. That is, the other cache is intact which ensures the integrity of the data stored on disk.
- the controller writes the data independently into the two caches.
- An object of the present invention is to provide a data storage system which employs an inexpensive, redundant write cache.
- a data storage system includes redundant write caches, a disk controller and an array of disks.
- One of the redundant write caches is a primary write cache and another is a backup write cache having a hybrid memory structure comprising a cache-disk space mapped to at least one of the disks.
- the cache-disk space may be located on a single disk within the disk array, or distributed over a number of the plurality of disks in the array.
- the array of disks may be configured as a RAID architecture.
- the data storage system of the present invention preferably employs a conventional, fast-write-fast-read primary write cache and a non-volatile, hybrid memory backup write cache.
- the backup write cache includes NVRAM, and small and random writes are buffered in a least recently used (LRU) cache located in the NVRAM. If the LRU cache is full, LRU data blocks in the LRU cache are transferred into a segment buffer which is also located in the backup cache NVRAM to make room for new write requests. When the segment buffer is full, its entire contents are written into the cache-disk space.
- LRU least recently used
- the backup write cache may achieve the same write speed as the primary cache.
- the redundant write caches are asymmetric since the primary write cache and the backup write cache have different structures.
- the system is relatively inexpensive since the amount of NVRAM in the backup cache is relatively small, ranging from hundreds of KB to several MB, and the cost of the disk space for the cache-disk space is significantly less than that of a large amount of NVRAM.
- the caching arrangement of the present invention has a significant reliability advantage over conventional single NVRAM write caches, and a cost advantage over dual-copy NVRAM caches.
- the present invention provides a cost-effective architecture for very large write caches capable of masking the effects of small writes for high-end data storage systems that would otherwise have to use dual-copy, identical NVRAM caches.
- Fig. 1 is a functional block diagram illustration of a data storage system architecture according to the present invention
- Fig. 2 is a functional block diagram illustration of a backup write cache
- Fig. 3 is a flow chart illustration of the processing steps associated with the destaging process
- Fig. 4 is a functional block diagram illustration of an alternative embodiment data storage system
- Fig. 5 is a pictorial illustration of the cache-disk memory space
- Fig. 6 illustrates a triple redundant data storage system
- Fig. 7 illustrates yet another triple data storage system.
- Fig. 1 illustrates a functional block diagram of a data storage system 10.
- the system 10 communicates with a central processing unit (CPU) 12 via a bus 14 (e.g., a small computer systems interface (SCSI) bus).
- the system 10 includes a controller 16 which communicates with a read cache 18, a primary write cache 20 and a backup write cache 22.
- the primary write cache 20 and the backup write cache 22 contain redundant data. As known, if either of these two caches fails, the data can be reconstructed from the other.
- Each of the caches 18, 20, 22 reads data from or writes data to a plurality of disks 26-30.
- the plurality of disks 26-30 are configured as a redundant array of independent/inexpensive disks (RAID).
- the primary write cache 20 generally employs NVRAM for storage (e.g., battery backed RAM) to ensure data is retained in the event of a power failure (or other system failures).
- the primary write cache 20 may employ DRAM rather than NVRAM.
- the backup write cache 22 has a hierarchical, hybrid-memory structure.
- Fig. 2 illustrates a functional block diagram of the backup write cache 22.
- the cache has a hybrid memory structure that includes NVRAM 40, volatile RAM 41 (preferably DRAM) and a cache-disk space 42.
- the size of the NVRAM 40 ranges from several hundred kB to several MB, while the cache-disk space 42 has a capacity of several hundred MB.
- the sizes discussed herein are used only for their relative sizes, and one of ordinary skill will recognize that the present invention is not limited to the sizes discussed herein.
- the backup cache NVRAM 40 is partitioned to provide a least recently used (LRU) cache 43, a plurality of segment buffers 44, 46 and a hash table 48.
- the recently accessed data resides in the LRU cache 43.
- the less recently accessed data is kept in the cache-disk space 42, and is preferably organized into a plurality of disk segments 56-60 similar to a log-structured file system such as Sprite LFS and the BSD LFS.
- Each disk segment 56-60 contains a number of slots each of which can hold one data block. In one embodiment, each block is 32 kb.
- Data blocks stored in the segments 56-60 are addressed by their segment IDs and slot IDs.
- Data blocks stored in the LRU cache 43 are addressed by their logical block addresses (LBAs).
- the hash table 48 contains location information for each of the valid data blocks in the backup cache 22. Specifically, the location information identifies whether a block is in the LRU cache 43 or in the cache-disk space 42, as well as the address of the information in the LRU cache 43 or the cache-disk space 42.
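The location lookup described above can be pictured with a small sketch. This is only an illustration under assumptions, not the patent's implementation: the class names `LruLocation`, `DiskLocation` and `BackupHashTable`, and the choice to key the table by logical block address, are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Union


@dataclass(frozen=True)
class LruLocation:
    """Block resides in the NVRAM LRU cache, addressed by its LBA."""
    lba: int


@dataclass(frozen=True)
class DiskLocation:
    """Block resides in the cache-disk space, addressed by segment and slot."""
    segment_id: int
    slot_id: int


Location = Union[LruLocation, DiskLocation]


class BackupHashTable:
    """Hypothetical index of every valid block held by the backup cache."""

    def __init__(self) -> None:
        self._entries: Dict[int, Location] = {}

    def put_in_lru(self, lba: int) -> None:
        # Record that the block now lives in the LRU cache.
        self._entries[lba] = LruLocation(lba)

    def move_to_disk(self, lba: int, segment_id: int, slot_id: int) -> None:
        # Record that the block was moved to a slot of a disk segment.
        self._entries[lba] = DiskLocation(segment_id, slot_id)

    def lookup(self, lba: int) -> Optional[Location]:
        # None means the backup cache holds no valid copy of the block.
        return self._entries.get(lba)

    def invalidate(self, lba: int) -> None:
        # Drop the entry, e.g. after the block has been destaged to disk.
        self._entries.pop(lba, None)
```

A single table answers both questions the text raises: whether a block is in the LRU cache or the cache-disk space, and at which address.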
- the backup cache 22 also includes a disk segment table 61 which is preferably located in the volatile RAM 41.
- the disk segment table 61 is discussed in detail hereinafter.
- the cache-disk space 42 is preferably physically located on a portion of the disk 30.
- assigning the cache-disk space 42 to disk advantageously implements the cache-disk space 42 without additional system hardware cost, since the plurality of disks 26-30 may include one or more spare disks that are available in the event an active disk fails.
- the spare disks are not used during normal operation, and it is therefore difficult to detect whether one of the spares has failed.
- by mapping the cache-disk space 42 to a spare disk (e.g., disk 30), it is easier to detect whether the spare disk is working properly, thus reducing the possibility of a latent failure of a spare disk.
- the disk controller 16 invalidates any data copy in the read cache 18, and sends the data to be written to both the primary write cache 20 and the LRU cache 43 of the backup write cache 22. If there is space available in the write caches 20, 22, the data is copied to the available space immediately. To indicate that the data block is located in the backup LRU cache 43, the controller 16 creates a hash entry which is stored in the hash table 48. Once the data is written into both the primary write cache 20 and the backup write cache 22, the controller 16 sends an acknowledgment to the CPU 12 signaling that the write request is complete.
- If there is no space left in the primary write cache 20, the controller 16 first tries to discard a "clean" block of data from the cache 20 to make room for the new request.
- a "clean" block of data is defined as a cache resident block of data that has already been successfully copied to disk. However, if the controller 16 cannot find a "clean" block to discard, it executes a foreground destaging process (to be described in detail hereinafter) which writes a least-recently-used data block to the array of disks 24. When the least-recently-used block is safely written into the array of disks 24, the space that was used for the least-recently-used data block in the primary write cache 20 is freed for the incoming request. The controller 16 also invalidates the associated data in the backup write cache 22.
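The admission policy of the primary write cache can be sketched as follows. This is a simplified illustration under assumptions: the class `PrimaryWriteCache`, the callbacks `write_to_disk` and `invalidate_backup`, and the use of an ordered dictionary to track recency are all hypothetical, and the parity handling of the real destaging process is omitted.

```python
from collections import OrderedDict
from typing import Callable


class PrimaryWriteCache:
    """Hypothetical model: discard a clean block first, else destage the LRU block."""

    def __init__(self, capacity: int,
                 write_to_disk: Callable[[int, bytes], None],
                 invalidate_backup: Callable[[int], None]) -> None:
        self.capacity = capacity
        self.write_to_disk = write_to_disk
        self.invalidate_backup = invalidate_backup
        # lba -> [data, dirty]; the least recently used entry comes first.
        self.blocks: "OrderedDict[int, list]" = OrderedDict()

    def write(self, lba: int, data: bytes) -> None:
        if lba in self.blocks:
            del self.blocks[lba]          # rewrite: re-insert as most recent
        elif len(self.blocks) >= self.capacity:
            self._make_room()
        self.blocks[lba] = [data, True]   # new data is dirty until destaged

    def mark_clean(self, lba: int) -> None:
        # Called once a block has been successfully copied to disk.
        self.blocks[lba][1] = False

    def _make_room(self) -> None:
        # First choice: discard a clean block, which costs no disk write.
        clean = next((k for k, (_, dirty) in self.blocks.items() if not dirty), None)
        if clean is not None:
            del self.blocks[clean]
            return
        # Otherwise: foreground destage of the least recently used block.
        lba, (data, _) = self.blocks.popitem(last=False)
        self.write_to_disk(lba, data)
        self.invalidate_backup(lba)
```

In this sketch the backup copy is invalidated only on the destage path, on the assumption that a clean block's backup copy was already invalidated when the block was first written to disk.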
- the controller 16 selects an empty one of the segment buffers 44, 46 and designates it as the "current" segment buffer.
- An LRU cache resident data block is then copied to the "current" segment buffer and the corresponding entries in the hash table 48 and the disk segment table 61 are modified to indicate that the data block is now in the "current" segment buffer.
- the disk segment table 61 contains information regarding which slots and disk segments are free, and can be used to cache data. Space within the LRU cache 43 is then available to accept incoming requests. Subsequent write requests may continue to evict least-recently-used blocks from the LRU cache 43 and transfer the blocks to the "current" segment buffer until it is full.
- the controller 16 designates another segment buffer as the "current" segment buffer and continues to accept data from the LRU cache 43. The controller then writes the contents of the full segment buffer into the cache-disk space 42, preferably in one large write during background.
- Since the full segment buffer is written to the cache-disk space 42 in one large write instead of many small writes, it is quickly made available again when the disk write finishes. Therefore, the NVRAM 40 and the cache-disk space 42 of the backup write cache 22 collectively appear to the controller 16 as a large uniform NVRAM write cache, whose write performance is analogous to the primary write cache 20.
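The write path of the backup cache (LRU cache, then segment buffer, then cache-disk space) can be sketched as below. This is a minimal model under stated assumptions: the class `BackupWriteCache` and its fields are hypothetical, only one segment buffer is modelled, and the segment write happens synchronously rather than in background with a second buffer taking over as "current".

```python
from collections import OrderedDict
from typing import Dict, List, Tuple


class BackupWriteCache:
    """Hypothetical model of the hybrid backup cache write path."""

    def __init__(self, lru_capacity: int, slots_per_segment: int) -> None:
        self.lru_capacity = lru_capacity
        self.slots_per_segment = slots_per_segment
        self.lru: "OrderedDict[int, bytes]" = OrderedDict()  # oldest entry first
        self.current: List[Tuple[int, bytes]] = []           # "current" segment buffer
        self.cache_disk: List[List[Tuple[int, bytes]]] = []  # one entry per disk segment
        # lba -> ("lru",) | ("buffer", slot) | ("disk", segment, slot).
        # The hash table is authoritative; a stale copy left in a segment is ignored.
        self.hash_table: Dict[int, tuple] = {}

    def write(self, lba: int, data: bytes) -> None:
        if lba in self.lru:
            self.lru.move_to_end(lba)
        elif len(self.lru) >= self.lru_capacity:
            self._evict_to_segment_buffer()
        self.lru[lba] = data
        self.hash_table[lba] = ("lru",)

    def _evict_to_segment_buffer(self) -> None:
        # Move the least recently used block into the current segment buffer.
        lba, data = self.lru.popitem(last=False)
        self.current.append((lba, data))
        self.hash_table[lba] = ("buffer", len(self.current) - 1)
        if len(self.current) == self.slots_per_segment:
            self._flush_segment()

    def _flush_segment(self) -> None:
        # The whole buffer goes to the cache-disk space as one large write.
        segment_id = len(self.cache_disk)
        self.cache_disk.append(list(self.current))
        for slot, (lba, _) in enumerate(self.current):
            if self.hash_table.get(lba) == ("buffer", slot):
                self.hash_table[lba] = ("disk", segment_id, slot)
        self.current = []
```

Batching evicted blocks into one segment-sized write is what lets a disk-backed cache accept small random writes at close to NVRAM speed.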
- the size of the NVRAM resident segment buffers 44, 46 directly affects the write efficiency. That is, the larger the segment buffers 44, 46, the smaller the write overheads caused by disk seeking and rotational latencies.
- as the size of the segment buffers 44, 46 increases, the LRU cache 43 becomes smaller, since the segment buffers 44, 46 and the LRU cache 43 are both physically resident in NVRAM. Therefore, there is a trade-off between segment buffer size and LRU cache size.
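The trade-off can be made concrete with simple arithmetic. The helper below is a hypothetical illustration; the 1 MiB NVRAM total, the 32 KiB block size and the zero hash-table overhead used in the example are assumptions chosen for round numbers, not values taken from the specification.

```python
def lru_block_capacity(nvram_bytes: int,
                       num_segment_buffers: int,
                       segment_buffer_bytes: int,
                       block_bytes: int,
                       hash_table_bytes: int = 0) -> int:
    """Number of data blocks left for the LRU cache after the NVRAM is
    partitioned into segment buffers and a hash table (hypothetical model)."""
    remaining = nvram_bytes - num_segment_buffers * segment_buffer_bytes - hash_table_bytes
    return max(0, remaining // block_bytes)


KIB = 1024
# Assumed 1 MiB of NVRAM with four 128 KiB segment buffers and 32 KiB blocks.
small_buffers = lru_block_capacity(1024 * KIB, 4, 128 * KIB, 32 * KIB)
# Doubling the segment buffers to 256 KiB consumes the whole NVRAM.
large_buffers = lru_block_capacity(1024 * KIB, 4, 256 * KIB, 32 * KIB)
```

Under these assumed sizes the LRU cache holds 16 blocks with 128 KiB buffers and none with 256 KiB buffers, which shows why the buffers cannot simply be made as large as the write efficiency would suggest.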
- When the controller 16 receives a read request from the CPU 12, the read cache 18 and the primary write cache 20 are searched. If there is a cache hit, the data is immediately read. However, if there is a cache miss, the LRU block (not shown) in the read cache 18 is discarded and its buffer space is freed. The requested data is then read from the array of disks 24 and stored in the freed LRU block of the read cache 18.
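The read path can be sketched in a few lines. This is an illustrative model only: the function `read_block`, the dictionary-based caches and the `read_capacity` parameter are hypothetical names, and the disk array is reduced to a mapping from block address to data.

```python
from collections import OrderedDict
from typing import Dict


def read_block(lba: int,
               read_cache: "OrderedDict[int, bytes]",
               primary_write_cache: Dict[int, bytes],
               disks: Dict[int, bytes],
               read_capacity: int) -> bytes:
    """Serve a read from the read cache or the primary write cache; on a miss,
    replace the LRU block of the read cache with data fetched from disk."""
    if lba in read_cache:
        read_cache.move_to_end(lba)        # hit: refresh recency
        return read_cache[lba]
    if lba in primary_write_cache:
        return primary_write_cache[lba]    # hit: newest data not yet on disk
    if len(read_cache) >= read_capacity:
        read_cache.popitem(last=False)     # miss: discard the LRU block
    data = disks[lba]
    read_cache[lba] = data
    return data
```

Note that the backup write cache never appears in this function, so its slower cache-disk reads cannot affect read latency.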
- the backup cache 22 is generally not involved in read operations.
- the destaging process involves moving "dirty" data resident in the primary write cache 20 to disk.
- Dirty data is defined as data that is stored in the primary write cache, but not yet stored on disk.
- Destaging is performed in background when the controller 16 detects an idle period, or when the number of "dirty" blocks in the primary write cache 20 exceeds a predetermined value (i.e., a high water mark).
- Fig. 3 is a flow chart illustration of processing steps 70 associated with the destaging process.
- This process 70 is executed by the controller regularly in background, and on an as-needed basis in foreground.
- Upon entry to the process 70, the controller 16 (Fig. 1) executes step 72 to identify "dirty" blocks of data resident in the primary write cache 20 (Fig. 1).
- Step 74 is then executed to read the old data and the parity value associated with the old data.
- the old data and the old parity value are read from disk.
- a new parity value based upon the identified "dirty" data and the old data is then computed in step 76, and the new data and parity value are written to the plurality of disks 24 in step 78.
- step 80 is performed to designate the blocks of data that were written to disk in step 78 as "clean".
- the controller 16 executes step 82 to invalidate the copy of the data stored in the backup write cache 22 (Fig. 1) that is identical to the data written to disk in step 78.
- If the data to be invalidated in the backup write cache 22 is resident in the NVRAM LRU buffer 43, the controller 16 releases the LRU buffer 43 as necessary. However, if the data to be invalidated is resident in one of the segment buffers 44, 46, or one of the disk segments 56-60, the controller 16 invalidates the data by marking the corresponding segment slot stored in the disk segment table 61 (Fig. 2) as "invalid". The controller 16 also deletes the corresponding hash entry resident in the hash table 48.
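Steps 72 through 82 of the destaging process can be sketched as follows. The text says only that the new parity is computed from the dirty data and the old data; the byte-wise XOR update used here (new parity = old parity XOR old data XOR new data) is the conventional RAID-4/RAID-5 read-modify-write rule and is an assumption, as are the names `xor_blocks`, `destage` and `stripe_of` and the dictionary-based model of the disks.

```python
from typing import Callable, Dict, List


def xor_blocks(*blocks: bytes) -> bytes:
    """Byte-wise XOR of equally sized blocks."""
    size = len(blocks[0])
    if any(len(b) != size for b in blocks):
        raise ValueError("blocks must have equal length")
    out = bytearray(size)
    for block in blocks:
        for i, byte in enumerate(block):
            out[i] ^= byte
    return bytes(out)


def destage(primary: Dict[int, dict],
            data_disks: Dict[int, bytes],
            parity_disk: Dict[int, bytes],
            stripe_of: Callable[[int], int],
            backup_hash: Dict[int, tuple]) -> List[int]:
    """Write every dirty block of the primary cache to disk and return their LBAs."""
    # Step 72: identify the dirty blocks in the primary write cache.
    dirty = [lba for lba, entry in primary.items() if entry["dirty"]]
    for lba in dirty:
        new_data = primary[lba]["data"]
        stripe = stripe_of(lba)
        # Step 74: read the old data and the old parity from disk.
        old_data = data_disks[lba]
        old_parity = parity_disk[stripe]
        # Step 76: compute the new parity (assumed XOR update rule).
        new_parity = xor_blocks(old_parity, old_data, new_data)
        # Step 78: write the new data and the new parity.
        data_disks[lba] = new_data
        parity_disk[stripe] = new_parity
        # Step 80: the block is now clean.
        primary[lba]["dirty"] = False
        # Step 82: invalidate the redundant copy in the backup write cache.
        backup_hash.pop(lba, None)
    return dirty
```

The backup cache is touched only through its index in step 82, so no data is read from or written to the cache-disk space during destaging.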
- the destaging process 70 preferably runs continuously until the controller processing returns to foreground. Alternatively, the destaging process may only run continuously during background until the number of dirty blocks in the primary cache falls below the predetermined low water-mark. During background the controller may also execute known tasks such as built-in-test (BIT).
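The water-mark variant of background destaging amounts to a simple hysteresis rule, sketched below. The function name `should_destage` and its parameters are hypothetical, and the sketch models only the alternative in which destaging stops once the dirty count falls below the low water mark.

```python
def should_destage(dirty_blocks: int,
                   destaging: bool,
                   high_water: int,
                   low_water: int,
                   idle: bool = False) -> bool:
    """Decide whether background destaging should run (hypothetical policy).

    Destaging starts when the controller is idle or the dirty count exceeds
    the high water mark, and, once started, continues until the dirty count
    falls below the low water mark.
    """
    if destaging:
        return dirty_blocks >= low_water
    return idle or dirty_blocks > high_water
```

Using two thresholds rather than one keeps the controller from switching destaging on and off repeatedly when the dirty count hovers near a single limit.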
- the destaging process 70 is called when the controller 16 (Fig. 1) receives a new write request and the entire primary write cache 20 is occupied by
- the destaging process 70 (Fig. 3) is initiated to free space for the incoming request.
- the backup cache is not read or written to during either the background or foreground destaging process. Therefore the slower read speed of the cache-disk will not affect the destaging performance.
- Fig. 4 illustrates a functional block diagram of an alternative embodiment data storage system 90.
- This alternative embodiment system 90 is substantially the same as the system 10 illustrated in Fig. 1, with the principal exception that the cache-disk space 42 (Fig. 2) is mapped into a plurality of disks 92-95, rather than one disk (e.g., disk 30 - Fig. 1). Specifically, the cache-disk space 42 is logically partitioned into sub-spaces that are distributed over the plurality of disks 92-95.
- the cache-disk space 42 includes a plurality of sub-spaces 42a- 42d that are mapped into the plurality of the disks.
- the first sub-space 42a which includes a first plurality of disk segments 56-58, is mapped to disk #1 92 (Fig. 4).
- the second sub-space 42b includes a second plurality of disk segments 98-100 and is mapped to disk #2 93 (Fig. 4).
- cache disk sub-space 42d includes a plurality of segments 102-104 and is mapped to disk #N 95 (Fig. 4).
- the cache-disk space 42 is subdivided and mapped into the plurality of disks 92-95 (Fig. 4).
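One way to picture the distributed mapping is a function from a logical segment number to a disk and a segment within that disk's sub-space. The contiguous, equal-sized partitioning below is an assumption made for illustration; the function `locate_segment` and its zero-based numbering are hypothetical and do not correspond to the reference numerals of the figures.

```python
from typing import Tuple


def locate_segment(segment_id: int,
                   segments_per_subspace: int,
                   num_disks: int) -> Tuple[int, int]:
    """Map a logical cache-disk segment to (disk index, segment within sub-space).

    Assumes each disk holds one sub-space of equal size and that logical
    segments are assigned to sub-spaces contiguously.
    """
    disk_index, local_segment = divmod(segment_id, segments_per_subspace)
    if not 0 <= disk_index < num_disks:
        raise ValueError("segment lies outside the logical cache-disk space")
    return disk_index, local_segment
```

With three segments per sub-space and four disks, logical segment 4 lands on the second disk as its second segment, i.e. `locate_segment(4, 3, 4)` returns `(1, 1)`.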
- the controller 16 may automatically reconfigure the system 10 illustrated in Fig. 1 to the system 90 in Fig. 4 if the spare disk 30 (Fig. 1) becomes an active disk due to a disk failure.
- the controller reconfigures the backup write cache 22 to operate using a logical cache-disk space that is partitioned into subspaces and mapped into the active disks since the dedicated spare disk may no longer be available.
- the data written into the cache partitions on the disks is not involved in parity operations.
- the segment buffer size cannot be too large because segment writes must compete with normal reads on the data disks. Large segment buffer sizes may result in lower read performance.
- Computer based simulations have shown that using four 128 KB segment buffers achieves good system performance.
- Fig. 6 illustrates another alternative embodiment data storage system 110.
- This data storage system is a triple redundant system.
- the system includes a primary cache 112 and two backup caches 114,116.
- the primary cache 112 and the two backup caches 114,116 provide triple redundancy protection.
- Since the backup caches 114, 116 include non-volatile memory, they provide dual redundancy in the event of a power failure.
- dedicated cache disks 118, 120 cooperate with the backup write caches 114,116 respectively.
- Fig. 7 illustrates yet another alternative embodiment data storage system 130.
- This system 130 is substantially the same as the system 110 illustrated in Fig. 6, with the principal exception that the cache-disk space is mapped into a plurality of disks 132-135.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU49907/99A AU4990799A (en) | 1998-08-04 | 1999-07-14 | Redundant, asymmetrically parallel disk cache for a data storage system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/129,042 US6243795B1 (en) | 1998-08-04 | 1998-08-04 | Redundant, asymmetrically parallel disk cache for a data storage system |
US09/129,042 | 1998-08-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2000008563A1 (en) | 2000-02-17 |
Family
ID=22438211
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1999/015830 WO2000008563A1 (en) | 1998-08-04 | 1999-07-14 | Redundant, asymmetrically parallel disk cache for a data storage system |
Country Status (3)
Country | Link |
---|---|
US (1) | US6243795B1 (en) |
AU (1) | AU4990799A (en) |
WO (1) | WO2000008563A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002029575A2 (en) * | 2000-09-29 | 2002-04-11 | Emc Corporation | System and method for hierarchical data storage in a log structure |
WO2017216887A1 (en) * | 2016-06-15 | 2017-12-21 | 株式会社日立製作所 | Information processing system |
Families Citing this family (104)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3568110B2 (en) * | 1999-10-15 | 2004-09-22 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Cache memory control method, computer system, hard disk drive, and hard disk controller |
US6851024B1 (en) * | 2000-08-17 | 2005-02-01 | Hewlett-Packard Development Company, L.P. | Exclusive caching in computer systems |
US6629198B2 (en) * | 2000-12-08 | 2003-09-30 | Sun Microsystems, Inc. | Data storage system and method employing a write-ahead hash log |
US6507893B2 (en) * | 2001-01-26 | 2003-01-14 | Dell Products, L.P. | System and method for time window access frequency based caching for memory controllers |
ATE347140T1 (en) * | 2001-03-15 | 2006-12-15 | Rhode Island Education | REMOTE ONLINE INFORMATION BACKUP SYSTEM |
US20040158687A1 (en) * | 2002-05-01 | 2004-08-12 | The Board Of Governors For Higher Education, State Of Rhode Island And Providence Plantations | Distributed raid and location independence caching system |
WO2002088961A1 (en) * | 2001-05-01 | 2002-11-07 | The Board Of Governors For Higher Education, State Of Rhode Island And Providence Plantations | Distributed raid and location independence caching system |
JP4209108B2 (en) * | 2001-12-20 | 2009-01-14 | 株式会社日立製作所 | Storage device control method, storage device used in this method, disk array device, and disk controller |
US7159085B2 (en) * | 2002-04-19 | 2007-01-02 | Emc Corporation | Method of allowing point-in-time view of data on a disk |
JP2003345520A (en) * | 2002-05-24 | 2003-12-05 | Hitachi Ltd | Disk array system and cache control method in disk array system |
US7028154B2 (en) | 2002-06-18 | 2006-04-11 | Hewlett-Packard Development Company, L.P. | Procedure to reduce copy time for data backup from short-term to long-term memory |
US7080277B2 (en) * | 2002-08-29 | 2006-07-18 | International Business Machines Corporation | Method and apparatus for recovery of a logical volume in a multi copy storage system |
AU2002951471A0 (en) * | 2002-09-18 | 2002-10-03 | Platypus Technology Australia Pty Ltd | Redundant array of solid-state storage device modules |
US7035974B2 (en) * | 2002-11-06 | 2006-04-25 | Synology Inc. | RAID-5 disk having cache memory implemented using non-volatile RAM |
US7051156B2 (en) * | 2002-11-06 | 2006-05-23 | Synology Inc. | Raid-5 disk having cache memory |
US6922754B2 (en) * | 2002-12-09 | 2005-07-26 | Infabric Technologies, Inc. | Data-aware data flow manager |
US20040250039A1 (en) * | 2003-06-05 | 2004-12-09 | Hanks D. Mitchel | System and method for using swappable storage for high data content multi-source data storage |
US7171516B2 (en) * | 2003-09-29 | 2007-01-30 | International Business Machines Corporation | Increasing through-put of a storage controller by autonomically adjusting host delay |
US7155586B1 (en) | 2003-12-30 | 2006-12-26 | Emc Corporation | Method of allowing point-in-time view of data on a disk using a map on cache disk |
US7536593B2 (en) * | 2004-03-05 | 2009-05-19 | International Business Machines Corporation | Apparatus, system, and method for emergency backup |
US7370163B2 (en) * | 2004-05-03 | 2008-05-06 | Gemini Storage | Adaptive cache engine for storage area network including systems and methods related thereto |
US20060036904A1 (en) * | 2004-08-13 | 2006-02-16 | Gemini Storage | Data replication method over a limited bandwidth network by mirroring parities |
US7457980B2 (en) * | 2004-08-13 | 2008-11-25 | Ken Qing Yang | Data replication method over a limited bandwidth network by mirroring parities |
US7636814B1 (en) | 2005-04-28 | 2009-12-22 | Symantec Operating Corporation | System and method for asynchronous reads of old data blocks updated through a write-back cache |
DE602005023273D1 (en) * | 2005-04-29 | 2010-10-14 | St Microelectronics Srl | An improved cache system |
JP2007059014A (en) * | 2005-08-26 | 2007-03-08 | Sony Corp | Information recorder, information recording method and computer program |
US7411757B2 (en) * | 2006-07-27 | 2008-08-12 | Hitachi Global Storage Technologies Netherlands B.V. | Disk drive with nonvolatile memory having multiple modes of operation |
KR100800484B1 (en) * | 2006-11-03 | 2008-02-04 | 삼성전자주식회사 | Data store system including the buffer for non-volatile memory and the buffer for disk, and data access method of the data store system |
US7730347B1 (en) | 2007-01-03 | 2010-06-01 | Board Of Governors For Higher Education, State Of Rhode Island And Providence Plantations | Data recovery system and method including a disk array architecture that provides recovery of data to any point of time |
US8239706B1 (en) | 2007-01-03 | 2012-08-07 | Board Of Governors For Higher Education, State Of Rhode Island And Providence Plantations | Data retrieval system and method that provides retrieval of data to any point in time |
KR100881187B1 (en) | 2007-01-16 | 2009-02-05 | 삼성전자주식회사 | Hybrid hard disk drive, computer system including hybrid HDD, and flash memory DMA circuit of hybrid HDD |
JP2009075759A (en) * | 2007-09-19 | 2009-04-09 | Hitachi Ltd | Storage device, and method for managing data in storage device |
US8589729B1 (en) * | 2007-09-28 | 2013-11-19 | Emc Corporation | Data preservation system and method |
JP5398551B2 (en) * | 2008-02-08 | 2014-01-29 | 富士通株式会社 | Backup method, controller, and disk array system |
US8250310B2 (en) * | 2008-07-31 | 2012-08-21 | International Business Machines Corporation | Assigning data to NVRAM of shared access hybrid hard drives |
US9176883B2 (en) | 2009-04-30 | 2015-11-03 | HGST Netherlands B.V. | Storage of data reference blocks and deltas in different storage devices |
US9413527B2 (en) | 2009-04-30 | 2016-08-09 | HGST Netherlands B.V. | Optimizing signature computation and sampling for fast adaptive similarity detection based on algorithm-specific performance |
US9582222B2 (en) | 2009-04-30 | 2017-02-28 | Western Digital Technologies, Inc. | Pre-cache similarity-based delta compression for use in a data storage system |
CN101562543B (en) * | 2009-05-25 | 2013-07-31 | 阿里巴巴集团控股有限公司 | Cache data processing method and processing system and device thereof |
US20110218967A1 (en) * | 2010-03-08 | 2011-09-08 | Microsoft Corporation | Partial Block Based Backups |
US8738880B2 (en) | 2010-08-17 | 2014-05-27 | International Business Machines Corporation | Throttling storage initialization for data destage |
JP5520747B2 (en) * | 2010-08-25 | 2014-06-11 | 株式会社日立製作所 | Information device equipped with cache and computer-readable storage medium |
US8850114B2 (en) | 2010-09-07 | 2014-09-30 | Daniel L Rosenband | Storage array controller for flash-based storage devices |
US9280477B2 (en) | 2010-12-13 | 2016-03-08 | Seagate Technology Llc | Data storage management in a memory device |
US20120278527A1 (en) * | 2011-04-26 | 2012-11-01 | Byungcheol Cho | System architecture based on hybrid raid storage |
US20120278550A1 (en) * | 2011-04-26 | 2012-11-01 | Byungcheol Cho | System architecture based on raid controller collaboration |
CN103019966B (en) * | 2011-09-22 | 2015-08-19 | 乔鼎资讯股份有限公司 | A kind of data storage system and management method thereof comprising backup body internal memory |
WO2014098839A1 (en) * | 2012-12-19 | 2014-06-26 | Hewlett-Packard Development Company | Nvram path selection |
US9298398B2 (en) | 2013-04-16 | 2016-03-29 | International Business Machines Corporation | Fine-grained control of data placement |
US9329938B2 (en) | 2013-04-16 | 2016-05-03 | International Business Machines Corporation | Essential metadata replication |
US9619404B2 (en) * | 2013-04-16 | 2017-04-11 | International Business Machines Corporation | Backup cache with immediate availability |
US9423981B2 (en) | 2013-04-16 | 2016-08-23 | International Business Machines Corporation | Logical region allocation with immediate availability |
US9104332B2 (en) | 2013-04-16 | 2015-08-11 | International Business Machines Corporation | Managing metadata and data for a logical volume in a distributed and declustered system |
US9104597B2 (en) | 2013-04-16 | 2015-08-11 | International Business Machines Corporation | Destaging cache data using a distributed freezer |
US9298617B2 (en) * | 2013-04-16 | 2016-03-29 | International Business Machines Corporation | Parallel destaging with replicated cache pinning |
US9507671B2 (en) * | 2013-09-16 | 2016-11-29 | Globalfoundries Inc. | Write cache protection in a purpose built backup appliance |
US9323630B2 (en) | 2013-09-16 | 2016-04-26 | HGST Netherlands B.V. | Enhanced data recovery from data storage devices |
JP6036736B2 (en) * | 2014-03-31 | 2016-11-30 | 日本電気株式会社 | Cache device, storage device, cache control method, and storage control program |
CN103970486A (en) * | 2014-05-22 | 2014-08-06 | 浪潮(北京)电子信息产业有限公司 | Method and system for data storage |
US9594628B2 (en) * | 2014-07-30 | 2017-03-14 | Segate Technology Llc | Operational vibration compensation through media cache management |
US9904688B2 (en) | 2014-09-30 | 2018-02-27 | International Business Machines Corporation | Buffering and replicating data written to a distributed storage system |
US9916325B2 (en) | 2014-09-30 | 2018-03-13 | International Business Machines Corporation | Quick initialization of data regions in a distributed storage system |
US10001932B2 (en) * | 2015-06-09 | 2018-06-19 | Western Digital Technologies, Inc. | Enhanced redundant caching for shingled magnetic recording devices in data storage drive assemblies |
US10230809B2 (en) * | 2016-02-29 | 2019-03-12 | Intel Corporation | Managing replica caching in a distributed storage system |
US9501364B1 (en) * | 2016-03-18 | 2016-11-22 | Storagecraft Technology Corporation | Hybrid image backup of a source storage |
US10120808B2 (en) * | 2016-04-22 | 2018-11-06 | Arm Limited | Apparatus having cache memory disposed in a memory transaction path between interconnect circuitry and a non-volatile memory, and corresponding method |
CN108733313B (en) * | 2017-04-17 | 2021-07-23 | 伊姆西Ip控股有限责任公司 | Method, apparatus and computer readable medium for establishing multi-level flash cache using a spare disk |
US10503409B2 (en) * | 2017-09-27 | 2019-12-10 | Alibaba Group Holding Limited | Low-latency lightweight distributed storage system |
US10496548B2 (en) | 2018-02-07 | 2019-12-03 | Alibaba Group Holding Limited | Method and system for user-space storage I/O stack with user-space flash translation layer |
US10831404B2 (en) | 2018-02-08 | 2020-11-10 | Alibaba Group Holding Limited | Method and system for facilitating high-capacity shared memory using DIMM from retired servers |
US11379155B2 (en) | 2018-05-24 | 2022-07-05 | Alibaba Group Holding Limited | System and method for flash storage management using multiple open page stripes |
WO2020000136A1 (en) | 2018-06-25 | 2020-01-02 | Alibaba Group Holding Limited | System and method for managing resources of a storage device and quantifying the cost of i/o requests |
US11061735B2 (en) | 2019-01-02 | 2021-07-13 | Alibaba Group Holding Limited | System and method for offloading computation to storage nodes in distributed system |
US10922234B2 (en) | 2019-04-11 | 2021-02-16 | Alibaba Group Holding Limited | Method and system for online recovery of logical-to-physical mapping table affected by noise sources in a solid state drive |
US11169873B2 (en) | 2019-05-21 | 2021-11-09 | Alibaba Group Holding Limited | Method and system for extending lifespan and enhancing throughput in a high-density solid state drive |
US10860223B1 (en) | 2019-07-18 | 2020-12-08 | Alibaba Group Holding Limited | Method and system for enhancing a distributed storage system by decoupling computation and network tasks |
US11126561B2 (en) | 2019-10-01 | 2021-09-21 | Alibaba Group Holding Limited | Method and system for organizing NAND blocks and placing data to facilitate high-throughput for random writes in a solid state drive |
US11042307B1 (en) | 2020-01-13 | 2021-06-22 | Alibaba Group Holding Limited | System and method for facilitating improved utilization of NAND flash based on page-wise operation |
US11449455B2 (en) | 2020-01-15 | 2022-09-20 | Alibaba Group Holding Limited | Method and system for facilitating a high-capacity object storage system with configuration agility and mixed deployment flexibility |
US10923156B1 (en) | 2020-02-19 | 2021-02-16 | Alibaba Group Holding Limited | Method and system for facilitating low-cost high-throughput storage for accessing large-size I/O blocks in a hard disk drive |
US10872622B1 (en) | 2020-02-19 | 2020-12-22 | Alibaba Group Holding Limited | Method and system for deploying mixed storage products on a uniform storage infrastructure |
US11150986B2 (en) | 2020-02-26 | 2021-10-19 | Alibaba Group Holding Limited | Efficient compaction on log-structured distributed file system using erasure coding for resource consumption reduction |
US11144250B2 (en) | 2020-03-13 | 2021-10-12 | Alibaba Group Holding Limited | Method and system for facilitating a persistent memory-centric system |
US11200114B2 (en) | 2020-03-17 | 2021-12-14 | Alibaba Group Holding Limited | System and method for facilitating elastic error correction code in memory |
US11385833B2 (en) | 2020-04-20 | 2022-07-12 | Alibaba Group Holding Limited | Method and system for facilitating a light-weight garbage collection with a reduced utilization of resources |
US11281575B2 (en) | 2020-05-11 | 2022-03-22 | Alibaba Group Holding Limited | Method and system for facilitating data placement and control of physical addresses with multi-queue I/O blocks |
US11461262B2 (en) | 2020-05-13 | 2022-10-04 | Alibaba Group Holding Limited | Method and system for facilitating a converged computation and storage node in a distributed storage system |
US11494115B2 (en) | 2020-05-13 | 2022-11-08 | Alibaba Group Holding Limited | System method for facilitating memory media as file storage device based on real-time hashing by performing integrity check with a cyclical redundancy check (CRC) |
US11218165B2 (en) | 2020-05-15 | 2022-01-04 | Alibaba Group Holding Limited | Memory-mapped two-dimensional error correction code for multi-bit error tolerance in DRAM |
US11507499B2 (en) | 2020-05-19 | 2022-11-22 | Alibaba Group Holding Limited | System and method for facilitating mitigation of read/write amplification in data compression |
US11556277B2 (en) | 2020-05-19 | 2023-01-17 | Alibaba Group Holding Limited | System and method for facilitating improved performance in ordering key-value storage with input/output stack simplification |
US11263132B2 (en) | 2020-06-11 | 2022-03-01 | Alibaba Group Holding Limited | Method and system for facilitating log-structure data organization |
US11422931B2 (en) | 2020-06-17 | 2022-08-23 | Alibaba Group Holding Limited | Method and system for facilitating a physically isolated storage unit for multi-tenancy virtualization |
US11354200B2 (en) | 2020-06-17 | 2022-06-07 | Alibaba Group Holding Limited | Method and system for facilitating data recovery and version rollback in a storage device |
US11354233B2 (en) | 2020-07-27 | 2022-06-07 | Alibaba Group Holding Limited | Method and system for facilitating fast crash recovery in a storage device |
US11372774B2 (en) | 2020-08-24 | 2022-06-28 | Alibaba Group Holding Limited | Method and system for a solid state drive with on-chip memory integration |
US11487465B2 (en) | 2020-12-11 | 2022-11-01 | Alibaba Group Holding Limited | Method and system for a local storage engine collaborating with a solid state drive controller |
US11734115B2 (en) | 2020-12-28 | 2023-08-22 | Alibaba Group Holding Limited | Method and system for facilitating write latency reduction in a queue depth of one scenario |
US11416365B2 (en) | 2020-12-30 | 2022-08-16 | Alibaba Group Holding Limited | Method and system for open NAND block detection and correction in an open-channel SSD |
US11847324B2 (en) | 2020-12-31 | 2023-12-19 | Pure Storage, Inc. | Optimizing resiliency groups for data regions of a storage system |
US11614880B2 (en) | 2020-12-31 | 2023-03-28 | Pure Storage, Inc. | Storage system with selectable write paths |
US11726699B2 (en) | 2021-03-30 | 2023-08-15 | Alibaba Singapore Holding Private Limited | Method and system for facilitating multi-stream sequential read performance improvement with reduced read amplification |
US11461173B1 (en) | 2021-04-21 | 2022-10-04 | Alibaba Singapore Holding Private Limited | Method and system for facilitating efficient data compression based on error correction code and reorganization of data placement |
US11476874B1 (en) | 2021-05-14 | 2022-10-18 | Alibaba Singapore Holding Private Limited | Method and system for facilitating a storage server with hybrid memory for journaling and data storage |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5341493A (en) * | 1990-09-21 | 1994-08-23 | Emc Corporation | Disk storage system with write preservation during power failure |
US5526482A (en) * | 1991-01-04 | 1996-06-11 | Emc Corporation | Storage device array architecture with copyback cache |
US5535381A (en) * | 1993-07-22 | 1996-07-09 | Data General Corporation | Apparatus and method for copying and restoring disk files |
US5577226A (en) * | 1994-05-06 | 1996-11-19 | Eec Systems, Inc. | Method and system for coherently caching I/O devices across a network |
US5588129A (en) * | 1994-02-09 | 1996-12-24 | Ballard; Clinton L. | Cache for optical storage device and method for implementing same |
US5787466A (en) * | 1996-05-01 | 1998-07-28 | Sun Microsystems, Inc. | Multi-tier cache and method for implementing such a system |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5909700A (en) * | 1996-12-23 | 1999-06-01 | Emc Corporation | Back-up data storage facility incorporating filtering to select data items to be backed up |
US5594885A (en) * | 1991-03-05 | 1997-01-14 | Zitel Corporation | Method for operating a cache memory system using a recycled register for identifying a reuse status of a corresponding cache entry |
JP2810593B2 (en) | 1992-05-13 | 1998-10-15 | 三菱電機株式会社 | Storage device |
US5689678A (en) | 1993-03-11 | 1997-11-18 | Emc Corporation | Distributed storage array system having a plurality of modular control units |
US5522031A (en) | 1993-06-29 | 1996-05-28 | Digital Equipment Corporation | Method and apparatus for the on-line restoration of a disk in a RAID-4 or RAID-5 array with concurrent access by applications |
US5666512A (en) | 1995-02-10 | 1997-09-09 | Hewlett-Packard Company | Disk array having hot spare resources and methods for using hot spare resources to store user data |
US6021408A (en) * | 1996-09-12 | 2000-02-01 | Veritas Software Corp. | Methods for operating a log device |
US5896492A (en) * | 1996-10-28 | 1999-04-20 | Sun Microsystems, Inc. | Maintaining data coherency between a primary memory controller and a backup memory controller |
1998
- 1998-08-04: US application US09/129,042, patent US6243795B1; status: not active (Expired - Lifetime)
1999
- 1999-07-14: AU application AU49907/99A, patent AU4990799A; status: not active (Abandoned)
- 1999-07-14: WO application PCT/US1999/015830, patent WO2000008563A1; status: active (Application Filing)
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002029575A2 (en) * | 2000-09-29 | 2002-04-11 | Emc Corporation | System and method for hierarchical data storage in a log structure |
WO2002029575A3 (en) * | 2000-09-29 | 2002-06-13 | Emc Corp | System and method for hierarchical data storage in a log structure |
GB2383450A (en) * | 2000-09-29 | 2003-06-25 | Emc Corp | System and method for hierarchical data storage |
GB2383450B (en) * | 2000-09-29 | 2004-10-13 | Emc Corp | System and method for hierarchical data storage |
US6865650B1 (en) | 2000-09-29 | 2005-03-08 | Emc Corporation | System and method for hierarchical data storage |
WO2017216887A1 (en) * | 2016-06-15 | 2017-12-21 | 株式会社日立製作所 | Information processing system |
JPWO2017216887A1 (en) * | 2016-06-15 | 2019-01-17 | 株式会社日立製作所 | Information processing system |
US10853268B2 (en) | 2016-06-15 | 2020-12-01 | Hitachi, Ltd. | Parity generating information processing system |
Also Published As
Publication number | Publication date |
---|---|
AU4990799A (en) | 2000-02-28 |
US6243795B1 (en) | 2001-06-05 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US6243795B1 (en) | Redundant, asymmetrically parallel disk cache for a data storage system | |
US20100100664A1 (en) | Storage system | |
US6341331B1 (en) | Method and system for managing a raid storage system with cache | |
US5586291A (en) | Disk controller with volatile and non-volatile cache memories | |
US7130956B2 (en) | Storage system including hierarchical cache metadata | |
US8255627B2 (en) | Secondary cache for write accumulation and coalescing | |
US5617530A (en) | Storage device array architecture with copyback cache | |
US7840848B2 (en) | Self-healing cache operations | |
US5309451A (en) | Data and parity prefetching for redundant arrays of disk drives | |
US7035974B2 (en) | RAID-5 disk having cache memory implemented using non-volatile RAM | |
US7130957B2 (en) | Storage system structure for storing relational cache metadata | |
US6105103A (en) | Method for mapping in dynamically addressed storage subsystems | |
US7930588B2 (en) | Deferred volume metadata invalidation | |
US20090210620A1 (en) | Method to handle demand based dynamic cache allocation between SSD and RAID cache | |
US6901477B2 (en) | Provision of a victim cache within a storage cache hierarchy | |
US20070260846A1 (en) | Methods of prefetching data in data storage systems | |
JP2017151982A (en) | System and method for caching in data storage subsystem | |
JP2015535116A (en) | Memory segment remapping to address fragmentation | |
US8566525B2 (en) | Techniques for managing data in a storage controller | |
US9626260B2 (en) | Read/write cache device and method persistent in the event of a power failure | |
US6772289B1 (en) | Methods and apparatus for managing cached CRC values in a storage controller | |
JP5176854B2 (en) | Disk array subsystem, cache control method for disk array subsystem, and program | |
JP2023152247A (en) | Storage system and storage control method | |
He et al. | VC-RAID: A large virtual NVRAM cache for software do-it-yourself RAID |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| AK | Designated states | Kind code of ref document: A1; Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZW |
| AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101) | |
| REG | Reference to national code | Ref country code: DE; Ref legal event code: 8642 |
| 122 | EP: PCT application non-entry in European phase | |