WO1995003580A1 - Method for rapid recovery from a network file server failure - Google Patents

Method for rapid recovery from a network file server failure Download PDF

Info

Publication number
WO1995003580A1
WO1995003580A1 PCT/US1994/007046 US9407046W WO9503580A1 WO 1995003580 A1 WO1995003580 A1 WO 1995003580A1 US 9407046 W US9407046 W US 9407046W WO 9503580 A1 WO9503580 A1 WO 9503580A1
Authority
WO
WIPO (PCT)
Prior art keywords
computer
mass storage
file server
computer system
storage device
Prior art date
Application number
PCT/US1994/007046
Other languages
French (fr)
Inventor
Richard Ohran
Terry Dickson
Original Assignee
Vinca Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vinca Corporation filed Critical Vinca Corporation
Priority to AU72115/94A priority Critical patent/AU7211594A/en
Publication of WO1995003580A1 publication Critical patent/WO1995003580A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2094Redundant storage or storage space
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1415Saving, restoring, recovering or retrying at system level
    • G06F11/1438Restarting or rejuvenating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/40Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass for recovering from a failure of a protocol instance or entity, e.g. service redundancy protocols, protocol state redundancy or protocol service redirection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2089Redundant storage control functionality
    • G06F11/2092Techniques of failing over between control units
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2097Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements maintaining the standby controller/processing unit updated
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C29/00Checking stores for correct operation ; Subsequent repair; Testing stores during standby or offline operation
    • G11C29/70Masking faults in memories by using spares or by reconfiguring
    • G11C29/74Masking faults in memories by using spares or by reconfiguring using duplex memories, i.e. using dual copies

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method for providing rapid recovery from a network file server failure through the use of a backup computer system. Unlike other redundant file server configurations, this method does not require the backup computer system to be running the file server operating system. Instead, it runs a special mass storage access program (201) that communicates with a mass storage emulator program on the network file server (202), making the disks (or other mass storage devices) on the backup computer system appear like they were disks on the file server computer. By mirroring data by writing to both the mass storage of the file server and through the mass storage emulator and mass storage access program to the disks on the backup computer, a copy of the data on the file server computer is made (203). In the event of failure of the file server computer (204), the backup computer can be restarted as a file server (205), using the copy on its disks. The same method can be utilized to restore normal system operation when the failure has been corrected.

Description

METHOD FOR RAPID RECOVERY FROM A NETWORK FILE SERVER FAILURE
Microfiche Appendix This specification includes a MICROFICHE APPENDIX which is intended to be a part of the disclosure herein. The MICROFICHE APPENDIX contains 1 page of microfiche and a total of 69 frames. The MICROFICHE APPENDIX includes computer source code used in the implementation of one preferred embodiment of the invention.
Background of the Invention Field of the Invention. This invention relates to network file server computer systems, and in particular to the methods used to recover from a computer failure in a system with a plurality of computer systems, each with its own mass storage devices. Description of Related Art. It is often desirable to provide continuous operation of computer systems, particularly file servers which support a number of user workstations or personal computers. To achieve this continuous operation, it is necessary for the computer system to be tolerant of software and hardware problems or faults. This is generally done by having redundant computers and mass storage devices, such that a backup computer or disk drive is immediately available to take over in the event of a fault. A number of techniques for implementing a fault- tolerant computer system are described in Major et al., United States Patent 5,157,663 (which is hereby incorporated by reference in its entirety) and its cited references. In particular, the invention of Major provides a redundant network file server capable of recovering from the failure of either the computer or the mass storage device of one of the file servers. The file server operating system is run on each computer system in the network file server, with each computer system cooperating to produce the redundant network file server. This technique has been used by Novell to implement its SFT-III fault-tolerant file server product. There are a number of reasons why the use of a redundant network file server such as that described in Major may be undesirable. As can be seen from the description in Major, the software needed to provide such a redundant network file server is considerably more complex than the software of the present invention. This can result in a lower reliability due the increased presence of programming errors ("bugs") in the complex software. Also, the processing time required to handle a client request may be increased by the complexity of the redundant network file server software, when compared to a single-processor network file server. Finally, license restrictions or other limitations may make it infeasible or uneconomical to run a redundant network file server instead of a normal network file server.
Summary of the Invention It is an object of this invention to provide rapid recovery from a network file server failure without the complex software of a redundant network file server. This is achieved by having a second, backup computer system with its own mass storage device (generally a magnetic disk). This backup computer is connected by an appropriate means for communications to the file server computer, allowing the transmission of information (such as commands and data) between the two computers. A mass storage emulator, running like a device driver on the file server computer, sends information to a mass storage access program on the backup computer. The mass storage access program performs the requested operation (read, write, etc.) on the mass storage system connected to the backup computer, and returns the result to the mass storage emulator on the file server computer. This makes the mass storage device on the backup computer look like another mass storage device on the file server computer. The data mirroring option of the file server operating system can be activated (or, if the operating system does not support data mirroring, a special device driver that provides data mirroring can be used), so that a copy of all data written to the mass storage device directly connected to the file server will also be written to the mass storage device on the backup computer, through the mass storage emulator and mass storage access programs. When a failure is detected in the file server computer system, the backup computer can be restarted as a file server computer. The mass storage device of the backup computer will contain a copy of the information on the mass storage device of the failed file server, so the new file server can start with approximately the same data as when the previous file server failed. It is a further object of this invention to improve the reliability of a redundant network file server computer system by reducing the complexity of the software when compared to the software of a redundant network file server. As will be clear from the discussion of the preferred embodiment of the invention, the programs for the mass storage emulator on the file server computer and the mass storage access on the backup computer are considerably less complex than a full redundant file server operating system. It is a further object of this invention to improve the performance of a network file server by improving the performance of the mass storage used by the network file server. Because of the simplicity of the mass storage emulator and the mass storage access program, as well as the fact that the backup computer is only running the mass storage access program and not a file server operating system that must also handle network requests and other activity, the performance of the emulated mass storage device may exceed the performance of a mass storage device directly attached to the file server computer. This is particularly true if the mass storage access program is expanded to provide caching of information and the full memory of the backup computer (less that occupied by a simple operating system and the mass storage access program) is used as a cache. These and other features of the invention will be more readily understood upon consideration of the attached drawings and of the following detailed description of those drawings and the presently preferred embodiments of the invention.
Brief Description of the Drawings Figure 1 illustrates a computer configuration on which the method of the invention runs. Figure 2 is a flow diagram showing the steps of the method of the invention.
Detailed Description of the Invention Referring to Figure 1, which illustrates a representative computer configuration on which the method of the invention runs, it can be seen that there are two computer systems 110 and 120. The first computer system 110 is running a file server operating system (such as Novell NetWare®). Computer system 110 includes computer 112 connected to network 101 through interface 111 (and its associated software), and mass storage device 114 connected through controller 113 (and its associated software). These represent the standard components of a network file server. In the case of NetWare, computer 112 is generally a PC- compatible computer based on an Intel 386 or 486 processor, network 101 can be an ethernet (so that interface 111 is an ethernet interface), and mass storage device 114 is an SCSI or IDE magnetic disk connected through an appropriate controller 113. Computer 122 would also be a PC-compatible computer, so that it could also run the same NetWare file server operating system as computer 112. Network 101 could also be implemented as a token ring, Arcnet, or any other network technology, such network technology being known to those skilled in the art. The mass storage devices of the invention should not be viewed as limited to magnetic disk drives, but can also be implemented using optical discs, magnetic tape drives, or any other medium capable of handling the read and write requests of the particular computer system. Added to the standard network file server to support the method of this invention are a backup computer system 120 and a means 102 for communicating between computer system 110 and computer system 120. Computer system 120 has components similar to computer system 110. Computer system 120 can be connected to network 101 through interface 121, although it is not necessary for computer system 120 to actually be connected to network 101 during normal operation. Computer 122 is connected to interface 121 and to mass storage device 124 through controller 123. While it is not necessary for computer system 120 to have identical components to computer system 110, in many cases that will be the case. In other cases, computer system 120 may be an older, slower system previously used as a file server but replaced with computer system 110. All that is required of computer system 120 is that it be capable of running the file server operating system in case of the failure of computer system 110, and that its mass storage device 124 be of sufficient capacity to hold the data mirrored from mass storage device 114. Communications means 102 provides a link between computer systems 110 and 120. Computer 112 is connected to communications means 102 through attachment 115, and computer 122 is connected to communications means 102 through attachment 125. Communications means 102 can be implemented using a variety of techniques, well-known to those skilled in the art. In the preferred embodiment, a high- speed serial point-to-point link is used. An alternative would be to use the serial communications ports of computers 112 and 122, programmed to run at a high data rate, or the parallel interfaces of computers 112 and 122. Another alternative is for communications means 102 to be a virtual circuit or channel carried on network 101. In this latter case, communications means 102 would really be network 101, attachment 115 would really be interface 111, and attachment 125 would really be interface 121. It is important that communication means 102 provide data transfer at rates comparable to the data rate of mass storage device 124 so that it does not limit the performance of the system. The method of this invention is not dependant on the particular implementation of communications means 102. Figure 2 is a flow diagram showing the steps of the method of the invention. In step 201, a special program — the mass storage access program — is run on computer system 120. The mass storage access program receives commands from computer system 110 over communications means 102. Based on those commands, the mass storage access program accesses mass storage device 124 to perform the operation specified in the command received from computer system 110. The results of the accessing of mass storage device 124 is returned to computer system 110 over communications means 102. The mass storage access program can be enhanced to provide a cache of data on mass storage device 124. The implementation of such a cache function is well-known in the art, consisting of keeping a copy of the most recently accessed information of mass storage device 124 in the memory of computer 122. When a read command is received, it is not necessary to access mass storage device 124 if a copy of the data is in the cache. Since, in the preferred embodiment, computer 122 has a large memory (it must be large enough to run the file server operating system) and the mass storage access program is quite small, there is a large amount of memory available for the cache. This means that many entries will be in the cache, and the chance of finding a block being read in the cache is higher than would be normal for a similar cache in a file server operating system. In step 202, coincidentally with the running of the mass storage access program on computer system 120, another program — the mass storage emulator - - is installed on computer system 110. The mass storage emulator takes mass storage requests from the file server operating system running on computer system 110 and sends them as commands over communications means 102 to computer system 120, where they are processed by the mass storage access program, as discussed above. When results from a command are received from the mass storage access program over communications means 102 by the mass storage emulator, they are returned to the file server operating system, much as the result of a normal mass storage request would be returned. In this way, the mass storage access program and the mass storage emulator cooperate to make it appear to the file server operating system that mass storage device 124 is directly connected to computer 112 on computer system 110. In the preferred embodiment of this invention, the mass storage access program is a conventional program running under the disk operating system of personal computer 122. The disk storage emulator is a NetWare Loadable Module (NLM), much like the device driver for a disk drive. Copies of the source code for the mass storage access program and the mass storage emulator are given in the microfiche appendix. In step 203, mirroring of data is initiated. When data is being mirrored on two or more mass storage devices, whenever data is to be written it is written to all mass storage devices taking part in the mirroring, at the same location on each mass storage device. (The location may be relative to the start of the mass storage device, or to the start of a partition or contiguous portion of the mass storage device, as appropriate to the way the mass storage device has been formatted and is being used.) Data can be read from any mass storage device taking part in the mirroring, since each mass storage device contains identical data. Mirroring may be an integral function of the file server operating system, so that no special program is necessary for implementing disk mirroring as part of the method of this invention. Step 203 only requires the activation or starting of mirroring on the part of the file server operating system. This is the case in the preferred embodiment of the invention, operating with NetWare and using the mirroring facilities of that file server operating system. If the file server operating system does not provide mirroring, a separate mirroring module will have to be implemented. Such a mirroring module, whose implementation should be obvious to one skilled in the art, will take each write request and pass it to the driver for each mass storage device taking part in the mirroring. For mass storage device 124 on computer system 120, the driver will be the mass storage emulator, discussed above. When successful completion of the write request has been received from all mass storage devices taking part in the mirroring, the mirroring module will indicate successful completion to. the file server operating system. For read requests, the mirroring module can direct the read request to any of the mass storage devices, since all contain identical data. Generally, the read request will be directed to the mass storage device which is first available to handle the request. As part of the initiating of mirroring, it is necessary to assure that each mass storage device taking part in mirroring has the same contents at the start of mirroring. This can be done by designating one of the mass storage devices as the master, and making a copy of the master mass storage device's data to all other mass storage devices taking part in the mirroring. An alternative approach is to have a timestamp indicating when the last change was made to the data on a mass storage device. If the timestamp on a mass storage device is the same as the timestamp on the master mass storage device, it will not be necessary to make a new copy of the data. At step 204, the method of this invention waits until a failure of file server computer system 110 is detected. Such a failure could come from the failure of either hardware (such as computer 112 or mass storage device 114) or software (such as the file server operating system) . Although means for automatically detecting such a failure may be used, in the preferred embodiment such failure is detected by a system operator or workstation user by noticing that file server requests are no longer being handled by computer system 110. It is not difficult for a user to determine there is a problem with file server computer system 110; in most cases, a user workstation will stop working and "hang" while it waits for a file server request that will never be honored. In step 205, when a failure of computer system 110 has been detected, if computer system 120 is not currently connected to network 101 through interface 121, a connection to the network 101 is made. This can be done either by activating interface 121 or physically connecting interface 121 to network 101, as appropriate. In step 206, when computer system 120 has been connected to network 101, a file server operating system is loaded into computer 122 and executed, making computer system 120 a file server computer system. New file server computer system 120 now responds to requests received from network 101 as failed file server computer system 110 did before its failure. The file server operating system executing on computer 122 accesses mass storage device 124 to respond to the requests. Note that because mass storage device 124 received data through the mass storage emulator and mass storage access program while file server computer system 110 was operating, mass storage device 124 contains a copy of the data stored on mass storage device 114 prior to the failure of computer system 120. (Because of timing, the last few write operations may not have occurred on all mass storage devices taking part in mirroring, but the file server operating system is capable of handling these small differences.) Because a copy of the mass storage data of failed file server computer system 110 is immediately available to new file server computer system 120, the time necessary to recover from a file server failure is minimal. When the fault that caused the failure of computer system 120 has been corrected, fault- tolerant operation can be restored. Depending on the relative capabilities of computer systems 110 and 120, one of two techniques can be employed. Both involve the same method steps as were discussed above. If the two computer systems have components of similar speed and capacity, there is no reason not to continue using computer system 120 as the file server computer. In this case, computer system 110 can now be treated as the backup computer system. The mass storage access program is run on computer system 110, the mass storage emulator is installed on computer system 120, and mirroring is initiated on the file server operating system running on computer system 120. As part of the initiating of mirroring, any data written to mass storage device 124 during the time computer system 110 was not available is now copied to mass storage device 114 though the mass storage emulator, communications mean 102, and the mass storage access program. Alternatively, if computer system 120 is less capable than computer system 110, it will be desirable to make computer system 110 the file server computer system when the failure has been corrected. To accomplish this, two approaches are possible. In the first approach, computer system 110 is brought up as the backup computer system, running the mass storage access program, as discussed above. When mass storage device 114 contains a copy of the data on mass storage device 124, computer system 110 can be restarted as the file server (running the "file server operating system) and computer system 120 can be restarted as the backup computer in accordance with the method discussed above. Alternatively, when the failure of computer system 110 has been corrected, computer system 120 can be restarted as backup computer system, running the mass storage access program, and computer- system 110 can be restarted as the file server computer, running the file server operating system and the mass storage emulator. When mirroring is initiated, it will be determined by the timestamps stored on each of mass storage devices 114 and 124 that the data on mass storage device 114 is out of date. The file server operating system will read the data on mass storage device 124 (though the mass storage emulator, communications means 102, and the mass storage access program) . It will also copy the data from mass storage device 124 to. mass storage device 114 until they contain identical data. It is to be understood that the above described embodiments are merely illustrative of numerous and varied other embodiments which may constitute applications of the principles of the invention. Such other embodiments may be readily devised by those skilled in the art without departing from the spirit or scope of this invention and it is our intent they be deemed within the scope of our invention.

Claims

Claims We claim: 1. A method for rapid recovery from a network file server failure, operating on a computer configuration that includes: a first computer system, comprising: (a) a first computer executing a file server operating system adapted to respond to requests received from a network, (b) a first interface connecting said first computer to said network, and (c) a first mass storage device connected to said first computer; a second computer system, comprising: (a) a second computer capable of executing said file server operating system, (b) a second interface capable of connecting said second computer to said network, and (c) a second mass storage device connected to said second computer; and means for communicating between said first computer system and said second computer system, the recovery method of the invention comprising: (A) running a mass storage access program on said second computer, said mass storage access program receiving commands from said first computer over said communicating means, accessing said second mass storage device as specified by said commands, and returning said commands' results to said first computer over said communicating means; (B) installing a mass storage emulator on said first computer for use by said file server operating system, said mass storage emulator taking mass storage requests from said file server operating system, sending commands to said second computer system over said communicating means, and returning results of said commands received over said communicating means from said second computer to said file server operating system; (C) initiating mirroring of data by writing data both to said first mass storage device and through said mass storage emulator and said mass storage access program to said second mass storage device; (D) when a failure of said first computer system is detected, stopping said mass storage access program on said second computer; and then (E) if said second computer is not currently connected to said network, connecting said second computer to said network through said second interface; and then (F) executing said file server operating system on said second computer, said file server operating system responding to requests received from said network and accessing data stored on said second mass storage device. 2. A method as in claim 1, wherein said first computer system and said second computer system are each PC-compatible computers. 3. A method as in claim 2 wherein said file server operating system is Novell NetWare. 4. A method as in claim 1, wherein said network is an ethernet. 5. A method as in claim 1, wherein said network is a token ring. 6. A method as in claim 1, wherein said first mass storage device is a magnetic disk. 7. A method as in claim 1, wherein said second mass storage device is a magnetic disk. 8. A method as in claim 1, wherein said means for communicating between said first computer system and said second computer system is a high-speed serial point-to-point link. 9. A method as in claim 1, wherein said means for communicating between said first computer system and said second computer system uses the serial communications ports of said first computer and said second computer. 10. A method as in claim 1, wherein said means for communicating between said first computer system and said second computer system uses the parallel interfaces of said first computer and said second computer. 11. A method as in claim 1, wherein said means for communicating between said first computer system and said second computer system is said network. 12. A method as in claim 1, wherein said mass storage access program provides a cache of data on said second mass storage device. 13. A method as in claim "1, wherein said mass storage access program and said mass storage emulator makes said second mass storage device of said second computer system look like another mass storage device on said first computer system. 14. A method as in claim 1, wherein said mirroring of data is performed by said file server operating system. 15. A method as in claim 1, wherein said mirroring of data is performed by a mirroring module. 16. A method as in claim 1, wherein said method is also used for restoration of fault-tolerant operation. 17. A method as in claim 16, wherein said second computer system becomes a file server computer system and said first computer system becomes a backup computer system. 18. A method as in claim 16, wherein said first computer system again becomes a file server computer and said second computer system becomes a backup computer system. 19. A method for rapid recovery from a network file server failure, the method operating on a computer system that includes: a first computer system, comprising: (a) a first computer capable of executing a file server operating system adapted to respond to requests received from a network, (b) a first interface adapted to connect said first computer to said network, and (c) a first mass storage device capable of being connected to said first computer; a second computer system, comprising: (a) a second computer capable of executing said file server operating system, (b) a second interface capable of connecting said second computer to said network, and (c) a second mass storage device capable of being connected to said second computer; and means for communicating between said first computer system and said second computer system, the recovery method of the invention comprising: (A) running a mass storage access program on said second computer, said mass storage access program being capable of receiving commands from said first computer over said communicating means, accessing said second mass storage device as specified by said commands, and returning said commands' results to said first computer over said communicating means; (B) installing a mass storage emulator on said first computer for use by said file server operating system, said mass storage emulator taking mass storage requests from said file server operating system, sending commands to said second computer system over said communicating means, and returning results of said commands received over said communicating means from said second computer to said file server operating system; (C) initiating mirroring of data by writing data both to said first mass storage device and through said mass storage emulator and said mass storage access program to said second mass storage device; (D) when a failure of said first computer system is detected, stopping said mass storage access program on said second computer; (E) connecting said second computer to said network through said second interface as necessary; (F) executing said file server operating system on said second computer, said file server operating system being capable of responding to requests received from said network and accessing data stored on said second mass storage device.
PCT/US1994/007046 1993-07-20 1994-06-21 Method for rapid recovery from a network file server failure WO1995003580A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU72115/94A AU7211594A (en) 1993-07-20 1994-06-21 Method for rapid recovery from a network file server failure

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US9475593A 1993-07-20 1993-07-20
US08/094,755 1993-07-20

Publications (1)

Publication Number Publication Date
WO1995003580A1 true WO1995003580A1 (en) 1995-02-02

Family

ID=22246992

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1994/007046 WO1995003580A1 (en) 1993-07-20 1994-06-21 Method for rapid recovery from a network file server failure

Country Status (2)

Country Link
AU (1) AU7211594A (en)
WO (1) WO1995003580A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6088721A (en) * 1998-10-20 2000-07-11 Lucent Technologies, Inc. Efficient unified replication and caching protocol
WO2001011464A2 (en) * 1999-05-28 2001-02-15 Sentillion, Inc. Context management server appliance
US7346648B1 (en) 1999-05-28 2008-03-18 Sentillion, Inc. Context management server appliance
GB2444287A (en) * 2006-12-02 2008-06-04 David Peter Neupert Backup Server System
US7409577B2 (en) 2001-05-25 2008-08-05 Neverfail Group Limited Fault-tolerant networks
US8103906B1 (en) 2010-10-01 2012-01-24 Massoud Alibakhsh System and method for providing total real-time redundancy for a plurality of client-server systems
CN103907094A (en) * 2011-10-31 2014-07-02 国际商业机器公司 Serialization of access to data in multi-mainframe computing environments
EP2776928A4 (en) * 2011-11-11 2015-09-23 Level 3 Communications Llc Systems and methods for automatic replacement and repair of communications network devices
US10326645B2 (en) 2011-11-11 2019-06-18 Level 3 Communications, Llc System and methods for configuration management
US10997042B2 (en) 2011-11-11 2021-05-04 Level 3 Communications, Llc Systems and methods for configuration management

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4823256A (en) * 1984-06-22 1989-04-18 American Telephone And Telegraph Company, At&T Bell Laboratories Reconfigurable dual processor system
US5079740A (en) * 1987-01-12 1992-01-07 Ncr Corporation System and method of providing an automatic back-up primary terminal for a cluster of secondary terminals
US5157663A (en) * 1990-09-24 1992-10-20 Novell, Inc. Fault tolerant computer system
US5235700A (en) * 1990-02-08 1993-08-10 International Business Machines Corporation Checkpointing mechanism for fault-tolerant systems
US5295258A (en) * 1989-12-22 1994-03-15 Tandem Computers Incorporated Fault-tolerant computer system with online recovery and reintegration of redundant components
US5307481A (en) * 1990-02-28 1994-04-26 Hitachi, Ltd. Highly reliable online system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4823256A (en) * 1984-06-22 1989-04-18 American Telephone And Telegraph Company, At&T Bell Laboratories Reconfigurable dual processor system
US5079740A (en) * 1987-01-12 1992-01-07 Ncr Corporation System and method of providing an automatic back-up primary terminal for a cluster of secondary terminals
US5295258A (en) * 1989-12-22 1994-03-15 Tandem Computers Incorporated Fault-tolerant computer system with online recovery and reintegration of redundant components
US5235700A (en) * 1990-02-08 1993-08-10 International Business Machines Corporation Checkpointing mechanism for fault-tolerant systems
US5307481A (en) * 1990-02-28 1994-04-26 Hitachi, Ltd. Highly reliable online system
US5157663A (en) * 1990-09-24 1992-10-20 Novell, Inc. Fault tolerant computer system

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6088721A (en) * 1998-10-20 2000-07-11 Lucent Technologies, Inc. Efficient unified replication and caching protocol
WO2001011464A2 (en) * 1999-05-28 2001-02-15 Sentillion, Inc. Context management server appliance
WO2001011464A3 (en) * 1999-05-28 2002-05-10 Sentillion Inc Context management server appliance
US7346648B1 (en) 1999-05-28 2008-03-18 Sentillion, Inc. Context management server appliance
US7409577B2 (en) 2001-05-25 2008-08-05 Neverfail Group Limited Fault-tolerant networks
US7788524B2 (en) 2001-05-25 2010-08-31 Neverfail Group Limited Fault-tolerant networks
GB2444287A (en) * 2006-12-02 2008-06-04 David Peter Neupert Backup Server System
GB2444287B (en) * 2006-12-02 2011-04-27 David Peter Neupert Server backup system
US8103906B1 (en) 2010-10-01 2012-01-24 Massoud Alibakhsh System and method for providing total real-time redundancy for a plurality of client-server systems
US8689038B2 (en) 2010-10-01 2014-04-01 Massoud Alibakhsh System and method for providing total real-time redundancy for a plurality of client-server systems
CN103907094A (en) * 2011-10-31 2014-07-02 国际商业机器公司 Serialization of access to data in multi-mainframe computing environments
EP2776928A4 (en) * 2011-11-11 2015-09-23 Level 3 Communications Llc Systems and methods for automatic replacement and repair of communications network devices
US9817709B2 (en) 2011-11-11 2017-11-14 Level 3 Communications, Llc Systems and methods for automatic replacement and repair of communications network devices
US10326645B2 (en) 2011-11-11 2019-06-18 Level 3 Communications, Llc System and methods for configuration management
US10592330B2 (en) 2011-11-11 2020-03-17 Level 3 Communications, Llc Systems and methods for automatic replacement and repair of communications network devices
US10997042B2 (en) 2011-11-11 2021-05-04 Level 3 Communications, Llc Systems and methods for configuration management

Also Published As

Publication number Publication date
AU7211594A (en) 1995-02-20

Similar Documents

Publication Publication Date Title
US5978565A (en) Method for rapid recovery from a network file server failure including method for operating co-standby servers
JP3149325B2 (en) Method and associated system for forming a consistency group to provide disaster recovery functionality
US5263154A (en) Method and system for incremental time zero backup copying of data
US6237008B1 (en) System and method for enabling pair-pair remote copy storage volumes to mirror data in another storage volume
USRE37364E1 (en) Method and system for sidefile status polling in a time zero backup copy process
US5497483A (en) Method and system for track transfer control during concurrent copy operations in a data processing storage subsystem
US7266637B1 (en) Storage management system
US5379398A (en) Method and system for concurrent access during backup copying of data
US5448718A (en) Method and system for time zero backup session security
US7020669B2 (en) Apparatus, method and system for writing data to network accessible file system while minimizing risk of cache data loss/ data corruption
JP3151008B2 (en) Disk sector analysis method
EP1426863B1 (en) Method and apparatus for maintaining consistency of data stored in a group of mirroring devices
US7389396B1 (en) Bounding I/O service time
US6782416B2 (en) Distributed and geographically dispersed quorum resource disks
EP0802485B1 (en) Host-based RAID-5 and NV-RAM integration and method therefor
JPH08305500A (en) Storage controller,data storage system containing it and double-pair suppression method
US20050071372A1 (en) Autonomic infrastructure enablement for point in time copy consistency
JPH07239799A (en) Method for provision of remote data shadowing and remote data duplex system
EP1031084A1 (en) Computer system transparent data migration
EP2078254A2 (en) Apparatus, system, and method for an improved synchronous data mirror swap
AU7786894A (en) File Backup System
JPS638835A (en) Trouble recovery device
US6732232B2 (en) Adaptive resource allocation in multi-drive arrays
US6754682B1 (en) Method and apparatus for enabling consistent ancillary disk array storage device operations with respect to a main application
US7254683B2 (en) Speculative data mirroring apparatus method and system

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AT AU BB BG BR BY CA CH CN CZ DE DK ES FI GB HU JP KP KR KZ LK LU LV MG MN MW NL NO NZ PL PT RO RU SD SE SK UA UZ VN

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: CA