CN102880520A - Method for system to automatically recover from failure - Google Patents

Method for system to automatically recover from failure Download PDF

Info

Publication number
CN102880520A
CN102880520A CN2012103177076A CN201210317707A CN102880520A CN 102880520 A CN102880520 A CN 102880520A CN 2012103177076 A CN2012103177076 A CN 2012103177076A CN 201210317707 A CN201210317707 A CN 201210317707A CN 102880520 A CN102880520 A CN 102880520A
Authority
CN
China
Prior art keywords
time
malfunction
recovered under
automatically recovered
method automatically
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012103177076A
Other languages
Chinese (zh)
Other versions
CN102880520B (en
Inventor
汪公玉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Gongjin Electronics Co Ltd
Original Assignee
Shenzhen Gongjin Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Gongjin Electronics Co Ltd filed Critical Shenzhen Gongjin Electronics Co Ltd
Priority to CN201210317707.6A priority Critical patent/CN102880520B/en
Publication of CN102880520A publication Critical patent/CN102880520A/en
Application granted granted Critical
Publication of CN102880520B publication Critical patent/CN102880520B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a method for a system to automatically recover from a failure, which has the following steps: according to the actual situation of the system hardware, the related processing function of a watchdog is defined at the initialized position of bootloader, and watchdog counter progressing is added to avoid the system serial port hanging; and the watchdog is automatically fed when the counter value reaches 10 minutes, and finally the system is automatically rebooted. The method adopts the watchdog to detect whether the system software fails, and automatically reboots the computer in case that the failure is detected, so as to recover the system software.

Description

The method that a kind of system is recovered under the malfunction automatically
Technical field
The invention belongs to the software fault process field, particularly the automatic method of recovering of the software of the industrial control systems such as a kind of focus AP, server, telephone system and production line.
Background technology
At present, occur causing the software error of systemic breakdown not fearful in the desktop computer that has the people to operate, because the user only needs restarting systems to get final product, it only can cause losing of low volume data.Yet for the software that operates in industrial control system, system must not have to recover fault under the condition of human intervention.This characteristic is very crucial in both cases: a kind of is the high-efficiency system, such as server, telephone system and production line etc.; Another kind is highly-reliable system, will damage because in a single day mistake appears in this system, such as automobile, Medical Devices, Industry Control, automatically-controlled door etc.These industrial control systems generally all use two cover hardware, the identical software of operation on every cover hardware, and one primary, and another is for subsequent use, when host apparatus software broke down, host apparatus withdrawed from, and stand-by equipment pushes up up immediately, at this moment, main equipment only need to reset and just can recover.In addition, focus AP can not cause ap to hang when carrying out any operation in 10 minutes after start enters bootloader, and can not carry out any operation this moment to ap.Need to manually restart ap and enter again the relevant operative technique problem of u-boot execution, use inconvenient.
Summary of the invention
Can cause ap to hang for solving when present focus AP did not carry out any operation in 10 minutes after start enters bootloader, can not carry out any operation this moment to ap.Need to manually restart ap and enter again u-boot and carry out relevant operative technique problem, use inconvenient deficiency, the invention provides the method that a kind of system is recovered under the malfunction automatically.
The present invention for the technical scheme of finishing its technical purpose and adopting is: the method that a kind of system is recovered under the malfunction automatically, the method is according to the actual conditions of system hardware, by the initialized local relevant treatment function that defines house dog at bootloader, then process according to causing system's serial port failure event handling to add the house dog counter; By the time automatic dog feeding when counter values to 10 minute, the final system autoboot.
Further, the method that above-mentioned system is recovered under malfunction automatically: described system is focus AP or server, telephone system and production line for the worker.
Further, the method that above-mentioned system is recovered under malfunction automatically: may further comprise the steps:
Step M1, the initialized entrance function of use bootloader;
Step M2, check under the u-boot pattern whether serial ports has the character input, if the character input is arranged then receive character, do respective handling according to the definition of character, process otherwise enter step M3;
Step M3, detection do not have the time of character input, process if reach setting-up time then enter step M4, detect judgement otherwise carry out next time;
Step M4, execution reset order make system's autoboot.
Further, the method that above-mentioned system is recovered under malfunction automatically: among the described step M3, detection does not have the time of character input to judge whether to arrive setting-up time by the count counting.
Further, the method that above-mentioned system is recovered under malfunction automatically: setting-up time is 10 minutes.
Whether the present invention breaks down by the mode detection system software that employing arranges house dog, and when having detected the fault generation, the autoboot system makes system software recover normal.
Below by with specific embodiments and the drawings the present invention is further detailed.
Description of drawings
Accompanying drawing 1 is the embodiment of the invention 1 process flow diagram.
Embodiment
As shown in Figure 1, present embodiment is a kind of focus AP when not carrying out any operation in 10 minutes after start enters bootloader and cause ap to hang, in the time of can not carrying out any operation to ap, and the method that focus AP software fault is recovered automatically.
According to the actual conditions of hardware, by the initialized local relevant treatment function that defines house dog at bootloader,
In the present embodiment, have following code in the place that enters bootloader, house dog and related function are carried out initialization, specific code is as follows:
watchdog_open();
mem_malloc_init();
serial_init();
eth_initialize(gd->bd);
watchdog_close();
Wherein the first row is opened house dog and is processed function, then is the initialization function that carries out the hardware such as internal memory, serial ports, Ethernet interface, is at last to close house dog to process function
Then process according to causing the event handling of ap serial port failure to add the house dog counter.By the time automatic dog feeding, then ap autoboot when counter values to 10 minute.Here, house dog is to process by counter, when counter calculates 10 minutes values of setting, after calling corresponding command process such as reset and ordering.The counting of original 10 minutes values clear 0. that calculate and then circulation front is processed.
Committed step such as Fig. 1, realize being described as follows:
The initialized entrance function of step M1:bootloader comprises the initialization that house dog opens and closes, and causes in addition the processing of the network interface card initialization failure function that program hangs.
Step M2: check under the boot pattern whether serial ports has the character input, receives character, does respective handling according to the definition of character.Otherwise entering step M3 processes.
Step M3: if serial ports never receives the arrival of character, just do the count counting and judge whether to arrive 10 minutes.Enter step M4 processing if reach 10 minutes, otherwise carry out next time cycle criterion.
Step M4: if the count counting was above 10 minutes.Just carry out the reset order and make the ap autoboot.Attention: use the reset order, because when starting kernel, need also to judge that with which kernel and other judgement be with which kernel and other judgement.

Claims (5)

1. the method automatically recovered under the malfunction of a system, it is characterized in that: the method is according to the actual conditions of system hardware, by the initialized local relevant treatment function that defines house dog at bootloader, then process according to causing system's serial port failure event handling to add the house dog counter; By the time automatic dog feeding when counter values to 10 minute, the final system autoboot.
2. the method automatically recovered under the malfunction of system according to claim 1, it is characterized in that: described system is focus AP or server, telephone system and production line for the worker.
3. the method automatically recovered under the malfunction of system according to claim 2 is characterized in that: may further comprise the steps:
Step M1, the initialized entrance function of use bootloader;
Step M2, check under the boot pattern whether serial ports has the character input, if the character input is arranged then receive character, do respective handling according to the definition of character, process otherwise enter step M3;
Step M3, detection do not have the time of character input, process if reach setting-up time then enter step M4, detect judgement otherwise carry out next time;
Step M4, execution reset order make system's autoboot.
4. the method automatically recovered under the malfunction of system according to claim 3 is characterized in that: among the described step M3, detect the time that does not have the character input to judge whether to arrive setting-up time by the count counting.
5. according to claim 3 or the method automatically recovered under the malfunction of 4 described systems, it is characterized in that: setting-up time is 10 minutes.
CN201210317707.6A 2012-08-31 2012-08-31 A kind of method that system is automatically recovered from malfunction Active CN102880520B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210317707.6A CN102880520B (en) 2012-08-31 2012-08-31 A kind of method that system is automatically recovered from malfunction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210317707.6A CN102880520B (en) 2012-08-31 2012-08-31 A kind of method that system is automatically recovered from malfunction

Publications (2)

Publication Number Publication Date
CN102880520A true CN102880520A (en) 2013-01-16
CN102880520B CN102880520B (en) 2015-07-29

Family

ID=47481857

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210317707.6A Active CN102880520B (en) 2012-08-31 2012-08-31 A kind of method that system is automatically recovered from malfunction

Country Status (1)

Country Link
CN (1) CN102880520B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106792848A (en) * 2016-12-13 2017-05-31 惠州Tcl移动通信有限公司 A kind of WIFI hot spot reset processing method and system based on mobile terminal

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030204792A1 (en) * 2002-04-25 2003-10-30 Cahill Jeremy Paul Watchdog timer using a high precision event timer
CN101271414A (en) * 2008-05-06 2008-09-24 江苏东大金智建筑智能化系统工程有限公司 Full-range dog-feeding method of embedded system
CN101739305A (en) * 2010-02-09 2010-06-16 太仓市同维电子有限公司 Operating system kernel level real-time dongle monitoring device and monitoring method thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030204792A1 (en) * 2002-04-25 2003-10-30 Cahill Jeremy Paul Watchdog timer using a high precision event timer
CN101271414A (en) * 2008-05-06 2008-09-24 江苏东大金智建筑智能化系统工程有限公司 Full-range dog-feeding method of embedded system
CN101739305A (en) * 2010-02-09 2010-06-16 太仓市同维电子有限公司 Operating system kernel level real-time dongle monitoring device and monitoring method thereof

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106792848A (en) * 2016-12-13 2017-05-31 惠州Tcl移动通信有限公司 A kind of WIFI hot spot reset processing method and system based on mobile terminal

Also Published As

Publication number Publication date
CN102880520B (en) 2015-07-29

Similar Documents

Publication Publication Date Title
CN105843694B (en) The controlled restoration methods and device of error message between independently operable processor
US8468389B2 (en) Firmware recovery system and method of baseboard management controller of computing device
CN108121559B (en) Configuration file pushing method, server and storage medium
WO2016145888A1 (en) Display screen processing method and device
US8438423B1 (en) Invalid setup recovery
CN107111595B (en) Method, device and system for detecting early boot errors
CN104461786A (en) Recovery method and system for Android system
CN104063477A (en) Processing method and processing device for startup abnormalities of embedded system
CN103345434B (en) The data back up method of a kind of display device and device
CN101430669A (en) Exception recovery method and device for apparatus with network interface card and memory device
CN109582506A (en) The restoration methods and device of touch screen failure, host equipment and touch-control system
CN102880527A (en) Data recovery method of baseboard management controller
CN114116280A (en) Interactive BMC self-recovery method, system, terminal and storage medium
CN105468390A (en) BOOT online upgrade apparatus and method
CN106874126A (en) Host process method for detecting abnormality in a kind of software development
CN102880520B (en) A kind of method that system is automatically recovered from malfunction
CN116820827B (en) Control method and system of substrate management controller of node server
CN102012848A (en) Method for resetting parameters of baseboard management controller
CN105912414A (en) Method and system for server management
CN105760276B (en) Reboot orders are improved to restart reliability and increase the method for the reset log
CN110347532A (en) Multi-partitioned systems backup method and device
CN104268026A (en) Monitoring and management method and device for embedded system
CN101369238A (en) Exception monitoring and reset processing method for USB equipment
CN107179911B (en) Method and equipment for restarting management engine
CN101571817A (en) Mainboard as well as recovering method and startup method of basic input output system thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant