CN103095533A - Timed monitoring method in cloud calculating system platform - Google Patents

Timed monitoring method in cloud calculating system platform

Info

Publication number
CN103095533A
Authority
CN
China
Prior art keywords
monitoring
cloud computing
module
system platform
computing system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013100566791A
Other languages
Chinese (zh)
Inventor
王帅
高滨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd
Priority to CN2013100566791A
Publication of CN103095533A
Current legal status: Pending

Abstract

The invention provides a timed monitoring method in a cloud computing system platform. The method adds a prediction mechanism to the timed monitoring system of a cloud computing system so that the platform can provide service stably and continuously. Based on the historical performance data stored in the cloud computing system platform, and in combination with a monitoring scheduling module, the system monitors the timed tasks in the platform. A data acquisition module in the platform collects the performance parameters and alarm logs of all business modules. A failure prediction module statistically analyzes the information collected by each monitored node group, generates a system performance prediction analysis, and sends it to the system administrator. The administrator can optimize the system according to the analysis and so prevent failures. The method thereby provides an early-warning and preventive measure for the continuous operation of the cloud computing system platform.

Description

Timed monitoring method in a cloud computing system platform
Technical field
The present invention relates to timed monitoring strategies in a cloud computing system platform, and specifically to a timed monitoring method in a cloud computing system platform.
Background art
Cloud computing uses virtualization technology as its means and the network as its carrier, takes the form of infrastructure, platform, and software services, and uses inexpensive computing resources to give users fast, reliable, and efficient service. A cloud computing platform system is the system that manages the various resources on a cloud computing platform. The cloud computing system platform pools large-scale infrastructure, data storage, and software into a resource pool for mutual sharing and collaboration, and abstracts layered services on top of it. Cloud services free users from the implementation details of the underlying layers: computation and storage are simply handed to the cloud, as conveniently as using water and electricity.
Timed monitoring is an important component of the cloud computing system platform. It is the prerequisite for operations such as system management, resource scheduling, network analysis, and fault detection. The task scheduling within the monitoring technology itself involves many kinds of work, including multi-threaded concurrency, the generation and parsing of time rules for execution, maintenance and recovery in various scenarios, and thread pool maintenance.
In summary, monitoring in the cloud computing system platform plays an important role in improving the platform's quality of service.
Summary of the invention
The purpose of the invention is to provide a timed monitoring method in a cloud computing system platform.
Monitoring in most current cloud computing systems first analyzes the collected data and then stores it in a database. All of this processing takes place after an event has occurred, so it lags behind the event and can easily affect the normal use of cloud computing services.
The purpose of the invention is achieved as follows. A prediction mechanism is provided for the timed monitoring system in a cloud computing system so that the cloud computing system platform can provide service more stably and continuously. The system is based on the historical performance data stored in the cloud computing system platform and, in combination with the monitoring scheduling module, monitors the timed tasks in the platform. The system comprises 1) a data acquisition module and 2) a failure prediction monitoring module, wherein:
1) the data acquisition module collects data from each timed business module in the cloud computing system platform;
2) the failure prediction monitoring module adopts a hierarchical design. Monitoring is divided into several related groups, and each monitoring child node is a sub-branch of a parent node in the layer above. Each child node not only handles the monitoring items in its own group, but also reports the monitoring status to its parent node and passes monitoring commands down to the leaf nodes below it. After statistically analyzing the information collected by each child monitoring node, the failure prediction monitoring module predicts which business modules of the system may fail and at what specific time, and feeds the prediction back to the root monitoring node, which warns the system administrator in advance, so that the cloud computing system platform provides continuous, high-quality service.
The data acquisition module in the cloud computing system platform is responsible for collecting the performance parameters and alarm logs of each business module.
The failure prediction module of the system statistically analyzes the information collected by each monitored node group, generates a system performance prediction analysis, and submits it to the system administrator. The system administrator can optimize the system according to the analysis to prevent failures, which provides a good early-warning and preventive measure for the continuous operation of the cloud computing system platform.
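The data acquisition role described above can be illustrated with a minimal Python sketch. The patent does not specify any data format or storage interface, so the names `Sample` and `HistoryStore`, the field names, and the in-memory storage are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Sample:
    """One collected record for a business module (field names are hypothetical)."""
    module: str
    timestamp: float            # seconds on any monotonic scale
    metrics: Dict[str, float]   # performance parameters, e.g. {"cpu": 0.42}
    alarms: List[str] = field(default_factory=list)  # alarm log lines


class HistoryStore:
    """In-memory stand-in for the platform's historical performance database."""

    def __init__(self) -> None:
        self._by_module: Dict[str, List[Sample]] = {}

    def append(self, sample: Sample) -> None:
        """Store one collected sample under its business module."""
        self._by_module.setdefault(sample.module, []).append(sample)

    def series(self, module: str, metric: str) -> List[Tuple[float, float]]:
        """Return (timestamp, value) pairs for one metric, oldest first."""
        rows = self._by_module.get(module, [])
        return sorted((s.timestamp, s.metrics[metric]) for s in rows if metric in s.metrics)

    def alarm_count(self, module: str) -> int:
        """Count the alarm log lines recorded for a module."""
        return sum(len(s.alarms) for s in self._by_module.get(module, []))


# Example: two samples collected from a hypothetical "billing" module.
store = HistoryStore()
store.append(Sample("billing", 10.0, {"cpu": 0.50}))
store.append(Sample("billing", 20.0, {"cpu": 0.65}, alarms=["queue depth high"]))
```

Keeping performance parameters and alarm logs in the same record is one possible design; it lets a later analysis step read both from a single history per module.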
The beneficial effects of the invention are:
(1) Isolation of abnormal faults. When one monitoring node becomes abnormal, the fault can be confined to a certain scope and does not affect the operation of other business modules, which improves the availability of the system;
(2) Failure prediction analysis. By analyzing the historical performance data of the system platform, early warning is given of problems that may occur in the system.
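Benefit (1) can be pictured with a short Python sketch in which a failure in one monitoring probe is recorded but never propagates to the others. The function `poll_all`, the probe names, and the result layout are hypothetical; the patent does not describe how isolation is implemented.

```python
from typing import Callable, Dict


def poll_all(monitors: Dict[str, Callable[[], float]]) -> Dict[str, dict]:
    """Poll every monitor; a failing monitor is recorded, not propagated."""
    results: Dict[str, dict] = {}
    for name, probe in monitors.items():
        try:
            results[name] = {"ok": True, "value": probe()}
        except Exception as exc:  # confine the fault to this one node
            results[name] = {"ok": False, "error": str(exc)}
    return results


def broken_probe() -> float:
    """Simulates a monitoring node that has become abnormal."""
    raise RuntimeError("agent unreachable")


report = poll_all({
    "storage": lambda: 0.31,
    "network": broken_probe,
    "compute": lambda: 0.77,
})
```

In this sketch the `network` entry carries the error text while `storage` and `compute` are still polled normally, which is the behavior the benefit statement describes.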
Brief description of the drawings
Fig. 1 is a schematic diagram of the architecture of traditional monitoring in a cloud computing system platform;
Fig. 2 is a schematic diagram of the structure of the timed monitoring system in the cloud computing system platform.
Detailed description
The method of the invention is described in detail below with reference to the accompanying drawings.
A prediction mechanism is provided for the timed monitoring system structure in a cloud computing system, so that the cloud computing system platform can provide service more stably and continuously.
The method is based on the historical performance data stored in the cloud computing system platform and, in combination with the monitoring scheduling module, monitors the timed tasks in the platform. The strategy comprises: 1. a data acquisition module; 2. a failure prediction monitoring module.
(1) Data acquisition module. The data acquisition module mainly collects data from each timed business module in the cloud computing system platform;
(2) Failure prediction monitoring module. The failure prediction monitoring module adopts a hierarchical architecture. Monitoring is divided into several related groups, and each monitoring child node is a sub-branch of a parent node in the layer above. Each child node not only handles the monitoring items in its own group, but also reports the monitoring status to its parent node and passes monitoring commands down to the leaf nodes below it. After statistically analyzing the information collected by each child monitoring node, the failure prediction monitoring module predicts which business modules of the system may fail and at what specific time, and feeds the prediction back to the root monitoring node, which warns the system administrator in advance. This ensures that the cloud computing system platform provides continuous, high-quality service.
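The hierarchical arrangement described in item (2) can be sketched as a small tree in Python. This is only an illustration under stated assumptions: the class `MonitorNode`, its method names, and the node names in the example are hypothetical, and the patent does not say how status reports or commands are actually transported.

```python
from typing import Dict, List, Optional


class MonitorNode:
    """A node in the monitoring tree; leaves watch one group of timed tasks."""

    def __init__(self, name: str, parent: Optional["MonitorNode"] = None) -> None:
        self.name = name
        self.parent = parent
        self.children: List["MonitorNode"] = []
        self.readings: Dict[str, float] = {}  # latest value per metric
        self.received: List[str] = []         # commands delivered to this leaf
        if parent is not None:
            parent.children.append(self)

    def record(self, metric: str, value: float) -> None:
        """Handle a monitoring item locally, within this node's own group."""
        self.readings[metric] = value

    def report(self) -> Dict[str, Dict[str, float]]:
        """Summarize this subtree; a parent calls this to collect child status."""
        summary = {self.name: dict(self.readings)}
        for child in self.children:
            summary.update(child.report())
        return summary

    def command(self, text: str) -> None:
        """Push a monitoring command down to every leaf below this node."""
        if not self.children:
            self.received.append(text)
        for child in self.children:
            child.command(text)


# Example tree: root -> group-a -> two virtual machine leaves.
root = MonitorNode("root")
group_a = MonitorNode("group-a", root)
vm1 = MonitorNode("vm-1", group_a)
vm2 = MonitorNode("vm-2", group_a)
vm1.record("cpu", 0.9)
root.command("sample every 30s")
status = root.report()
```

Status flows upward through `report()` and commands flow downward through `command()`, so the root node ends up holding the combined view that the prediction step would analyze.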
Embodiment:
The process is described below with a specific example:
The cloud computing system platform runs on a base platform. The base platform contains one virtual data center, and there are ten virtual machines under the virtual data center. The data acquisition module of the cloud computing system platform collects information on the timed tasks of the virtual data center and stores it in a database to form historical records. Based on the collected data and the analysis of the prediction module, the monitoring module predicts the timed business modules in which a problem is about to occur and the time at which it will occur, and sends the prediction to the system administrator. The administrator can respond and adjust in time according to the prediction, so that the timed monitoring service of the whole cloud computing system platform is not affected.
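The patent does not name the statistical method used by the prediction module. As one possible illustration, the Python sketch below fits a straight line to a metric's history by ordinary least squares and estimates when the fitted line would reach a limit. The function name `predict_crossing`, the choice of a linear trend, and the sample values are all assumptions, not the patented method.

```python
from typing import List, Optional, Tuple


def predict_crossing(history: List[Tuple[float, float]], threshold: float) -> Optional[float]:
    """Fit value = a + b*t by least squares and return the time t at which the
    fitted line reaches `threshold`, or None if there is no upward trend."""
    n = len(history)
    if n < 2:
        return None  # not enough history to fit a line
    mean_t = sum(t for t, _ in history) / n
    mean_v = sum(v for _, v in history) / n
    var_t = sum((t - mean_t) ** 2 for t, _ in history)
    if var_t == 0:
        return None  # all samples share one timestamp
    slope = sum((t - mean_t) * (v - mean_v) for t, v in history) / var_t
    if slope <= 0:
        return None  # metric is flat or falling, so no crossing is predicted
    intercept = mean_v - slope * mean_t
    return (threshold - intercept) / slope


# Hypothetical CPU load history for one virtual machine: (time, load).
history = [(0.0, 0.5), (10.0, 0.6), (20.0, 0.7)]
eta = predict_crossing(history, threshold=0.9)
```

With this sample history the load rises by 0.01 per time unit, so the estimated crossing of the 0.9 limit falls near t = 40. A root monitoring node could forward such an estimate to the administrator as the predicted time point.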
Apart from the technical features described in this specification, everything is technology known to those skilled in the art.

Claims (3)

1. A timed monitoring method in a cloud computing system platform, characterized in that a prediction mechanism is provided for the timed monitoring system in a cloud computing system so that the cloud computing system platform can provide service more stably and continuously; the system is based on the historical performance data stored in the cloud computing system platform and, in combination with the monitoring scheduling module, monitors the timed tasks in the platform; the system comprises 1) a data acquisition module and 2) a failure prediction monitoring module, wherein:
1) the data acquisition module collects data from each timed business module in the cloud computing system platform;
2) the failure prediction monitoring module adopts a hierarchical design; monitoring is divided into several related groups, and each monitoring child node is a sub-branch of a parent node in the layer above; each child node not only handles the monitoring items in its own group, but also reports the monitoring status to its parent node and passes monitoring commands down to the leaf nodes below it; after statistically analyzing the information collected by each child monitoring node, the failure prediction monitoring module predicts which business modules of the system may fail and at what specific time, and feeds the prediction back to the root monitoring node, which warns the system administrator in advance, so that the cloud computing system platform provides continuous, high-quality service.
2. The method according to claim 1, characterized in that the data acquisition module in the cloud computing system platform is responsible for collecting the performance parameters and alarm logs of each business module.
3. The method according to claim 1, characterized in that the failure prediction module of the system statistically analyzes the information collected by each monitored node group, generates a system performance prediction analysis, and submits it to the system administrator; the system administrator can optimize the system according to the analysis to prevent failures, which provides a good early-warning and preventive measure for the continuous operation of the cloud computing system platform.
CN2013100566791A 2013-02-22 2013-02-22 Timed monitoring method in cloud calculating system platform Pending CN103095533A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013100566791A CN103095533A (en) 2013-02-22 2013-02-22 Timed monitoring method in cloud calculating system platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013100566791A CN103095533A (en) 2013-02-22 2013-02-22 Timed monitoring method in cloud calculating system platform

Publications (1)

Publication Number Publication Date
CN103095533A true CN103095533A (en) 2013-05-08

Family

ID=48207691

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013100566791A Pending CN103095533A (en) 2013-02-22 2013-02-22 Timed monitoring method in cloud calculating system platform

Country Status (1)

Country Link
CN (1) CN103095533A (en)

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103346906A (en) * 2013-06-19 2013-10-09 华南师范大学 Intelligent operation and maintenance method and system based on cloud computing
CN103401699A (en) * 2013-07-18 2013-11-20 深圳先进技术研究院 Cloud data center security monitoring early warning system and method
CN103475696A (en) * 2013-08-23 2013-12-25 汉柏科技有限公司 System and method for monitoring state of cloud computing cluster server
CN103744977A (en) * 2014-01-13 2014-04-23 浪潮(北京)电子信息产业有限公司 Monitoring method and monitoring system for cloud computing system platform
CN103825779A (en) * 2014-02-21 2014-05-28 南京邮电大学 Method for monitoring state of cloud data center
CN103957116A (en) * 2014-03-31 2014-07-30 昆明理工大学 Decision-making method and system of cloud failure data
CN104184819A (en) * 2014-08-29 2014-12-03 城云科技(杭州)有限公司 Multi-hierarchy load balancing cloud resource monitoring method
CN104301159A (en) * 2014-11-13 2015-01-21 中国建设银行股份有限公司 Monitoring method and system of server cluster
CN104378262A (en) * 2013-12-13 2015-02-25 国家计算机网络与信息安全管理中心 Intelligent monitoring analyzing method and system under cloud computing
CN104486445A (en) * 2014-12-30 2015-04-01 北京天云融创软件技术有限公司 Distributed extendable resource monitoring system and method based on cloud platform
CN104796294A (en) * 2015-05-07 2015-07-22 上海逸云信息科技发展有限公司 Cloud acceleration network monitoring system and method
CN104866380A (en) * 2015-06-18 2015-08-26 北京搜狐新媒体信息技术有限公司 Method and device for processing state transition of cluster management system
CN105099815A (en) * 2015-06-26 2015-11-25 北京奇虎科技有限公司 Cloud disk monitoring method and cloud disk monitoring device
CN105119765A (en) * 2015-09-30 2015-12-02 浪潮(北京)电子信息产业有限公司 Intelligent processing fault system architecture
CN105184886A (en) * 2015-09-01 2015-12-23 浪潮集团有限公司 Cloud data center intelligence inspection system and cloud data center intelligence inspection method
WO2015196885A1 (en) * 2014-06-27 2015-12-30 阿里巴巴集团控股有限公司 Method and apparatus for acquiring and storing performance data of cloud computing system
US9385934B2 (en) 2014-04-08 2016-07-05 International Business Machines Corporation Dynamic network monitoring
CN106357478A (en) * 2016-09-30 2017-01-25 郑州云海信息技术有限公司 Server cluster monitoring method and system
CN107251485A (en) * 2014-12-30 2017-10-13 康博泰公司 The service quality of the raising of cellular radio access networks
CN108196985A (en) * 2017-12-29 2018-06-22 中国电子科技集团公司信息科学研究院 A kind of storage system failure prediction method and device based on intelligent predicting
US10043194B2 (en) 2014-04-04 2018-08-07 International Business Machines Corporation Network demand forecasting
CN109726077A (en) * 2018-12-21 2019-05-07 中冶建筑研究总院有限公司 A kind of Enterprise Project lightweight safety management control data platform
US10361924B2 (en) 2014-04-04 2019-07-23 International Business Machines Corporation Forecasting computer resources demand
US10439891B2 (en) 2014-04-08 2019-10-08 International Business Machines Corporation Hyperparameter and network topology selection in network demand forecasting
US10713574B2 (en) 2014-04-10 2020-07-14 International Business Machines Corporation Cognitive distributed network
CN117527523A (en) * 2023-11-23 2024-02-06 广东堡塔安全技术有限公司 Cloud computing-based server security monitoring system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110145836A1 (en) * 2009-12-12 2011-06-16 Microsoft Corporation Cloud Computing Monitoring and Management System
CN102882909A (en) * 2011-07-15 2013-01-16 易云捷讯科技(北京)有限公司 Cloud computing service monitoring system and method thereof
CN102420869A (en) * 2011-12-02 2012-04-18 浪潮集团有限公司 Security monitoring method for cloud data center
CN102868736A (en) * 2012-08-30 2013-01-09 浪潮(北京)电子信息产业有限公司 Design and implementation method of cloud computing monitoring framework, and cloud computing processing equipment

Cited By (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103346906B (en) * 2013-06-19 2016-07-13 华南师范大学 A kind of intelligent O&M method and system based on cloud computing
CN103346906A (en) * 2013-06-19 2013-10-09 华南师范大学 Intelligent operation and maintenance method and system based on cloud computing
CN103401699A (en) * 2013-07-18 2013-11-20 深圳先进技术研究院 Cloud data center security monitoring early warning system and method
CN103475696A (en) * 2013-08-23 2013-12-25 汉柏科技有限公司 System and method for monitoring state of cloud computing cluster server
CN104378262A (en) * 2013-12-13 2015-02-25 国家计算机网络与信息安全管理中心 Intelligent monitoring analyzing method and system under cloud computing
CN103744977A (en) * 2014-01-13 2014-04-23 浪潮(北京)电子信息产业有限公司 Monitoring method and monitoring system for cloud computing system platform
CN103825779A (en) * 2014-02-21 2014-05-28 南京邮电大学 Method for monitoring state of cloud data center
CN103825779B (en) * 2014-02-21 2016-10-05 南京邮电大学 A kind of cloud data center method for monitoring state
CN103957116B (en) * 2014-03-31 2017-12-01 昆明理工大学 A kind of decision-making technique and system of cloud fault data
CN103957116A (en) * 2014-03-31 2014-07-30 昆明理工大学 Decision-making method and system of cloud failure data
US10043194B2 (en) 2014-04-04 2018-08-07 International Business Machines Corporation Network demand forecasting
US10361924B2 (en) 2014-04-04 2019-07-23 International Business Machines Corporation Forecasting computer resources demand
US10650396B2 (en) 2014-04-04 2020-05-12 International Business Machines Corporation Network demand forecasting
US11082301B2 (en) 2014-04-04 2021-08-03 International Business Machines Corporation Forecasting computer resources demand
US10250481B2 (en) 2014-04-08 2019-04-02 International Business Machines Corporation Dynamic network monitoring
US10771371B2 (en) 2014-04-08 2020-09-08 International Business Machines Corporation Dynamic network monitoring
US10257071B2 (en) 2014-04-08 2019-04-09 International Business Machines Corporation Dynamic network monitoring
US9385934B2 (en) 2014-04-08 2016-07-05 International Business Machines Corporation Dynamic network monitoring
US11848826B2 (en) 2014-04-08 2023-12-19 Kyndryl, Inc. Hyperparameter and network topology selection in network demand forecasting
US10439891B2 (en) 2014-04-08 2019-10-08 International Business Machines Corporation Hyperparameter and network topology selection in network demand forecasting
US10693759B2 (en) 2014-04-08 2020-06-23 International Business Machines Corporation Dynamic network monitoring
US9705779B2 (en) 2014-04-08 2017-07-11 International Business Machines Corporation Dynamic network monitoring
US9722907B2 (en) 2014-04-08 2017-08-01 International Business Machines Corporation Dynamic network monitoring
US10713574B2 (en) 2014-04-10 2020-07-14 International Business Machines Corporation Cognitive distributed network
CN105242873A (en) * 2014-06-27 2016-01-13 阿里巴巴集团控股有限公司 Method and apparatus for acquiring and storing performance data of cloud computing system
CN105242873B (en) * 2014-06-27 2018-06-01 阿里巴巴集团控股有限公司 The acquisition of the performance data of cloud computing system and storage method and device
WO2015196885A1 (en) * 2014-06-27 2015-12-30 阿里巴巴集团控股有限公司 Method and apparatus for acquiring and storing performance data of cloud computing system
CN104184819B (en) * 2014-08-29 2017-12-05 城云科技(中国)有限公司 Multi-layer load balancing cloud resource monitoring method
CN104184819A (en) * 2014-08-29 2014-12-03 城云科技(杭州)有限公司 Multi-hierarchy load balancing cloud resource monitoring method
CN104301159B (en) * 2014-11-13 2019-01-25 中国建设银行股份有限公司 A kind of monitoring method and system of server cluster
CN104301159A (en) * 2014-11-13 2015-01-21 中国建设银行股份有限公司 Monitoring method and system of server cluster
CN107251485A (en) * 2014-12-30 2017-10-13 康博泰公司 The service quality of the raising of cellular radio access networks
CN104486445B (en) * 2014-12-30 2017-03-22 北京天云融创软件技术有限公司 Distributed extendable resource monitoring system based on cloud platform
CN104486445A (en) * 2014-12-30 2015-04-01 北京天云融创软件技术有限公司 Distributed extendable resource monitoring system and method based on cloud platform
CN104796294A (en) * 2015-05-07 2015-07-22 上海逸云信息科技发展有限公司 Cloud acceleration network monitoring system and method
CN104866380A (en) * 2015-06-18 2015-08-26 北京搜狐新媒体信息技术有限公司 Method and device for processing state transition of cluster management system
CN104866380B (en) * 2015-06-18 2018-07-06 北京搜狐新媒体信息技术有限公司 A kind for the treatment of method and apparatus of the state conversion of cluster management system
CN105099815B (en) * 2015-06-26 2019-02-26 北京奇虎科技有限公司 Cloud disk monitoring method and device
CN105099815A (en) * 2015-06-26 2015-11-25 北京奇虎科技有限公司 Cloud disk monitoring method and cloud disk monitoring device
CN105184886A (en) * 2015-09-01 2015-12-23 浪潮集团有限公司 Cloud data center intelligence inspection system and cloud data center intelligence inspection method
CN105119765B (en) * 2015-09-30 2018-06-29 浪潮(北京)电子信息产业有限公司 A kind of Intelligent treatment fault system framework
CN105119765A (en) * 2015-09-30 2015-12-02 浪潮(北京)电子信息产业有限公司 Intelligent processing fault system architecture
CN106357478B (en) * 2016-09-30 2019-08-02 郑州云海信息技术有限公司 A kind of server cluster monitoring method and system
CN106357478A (en) * 2016-09-30 2017-01-25 郑州云海信息技术有限公司 Server cluster monitoring method and system
CN108196985A (en) * 2017-12-29 2018-06-22 中国电子科技集团公司信息科学研究院 A kind of storage system failure prediction method and device based on intelligent predicting
CN109726077A (en) * 2018-12-21 2019-05-07 中冶建筑研究总院有限公司 A kind of Enterprise Project lightweight safety management control data platform
CN117527523A (en) * 2023-11-23 2024-02-06 广东堡塔安全技术有限公司 Cloud computing-based server security monitoring system

Similar Documents

Publication Publication Date Title
CN103095533A (en) Timed monitoring method in cloud calculating system platform
CN110782370B (en) Comprehensive operation and maintenance management platform for power dispatching data network
CN107330056B (en) Wind power plant SCADA system based on big data cloud computing platform and operation method thereof
CN106651633B (en) Power utilization information acquisition system based on big data technology and acquisition method thereof
US10116165B2 (en) Powergrid operation and supervision system
CN105323111A (en) Operation and maintenance automation system and method
CN110033186A (en) A kind of huge hydroelectric power plant's intelligence prison disc system
CN103684916A (en) Method and system for intelligent monitoring and analyzing under cloud computing
CN104182902A (en) Monitoring method based on centralized operation and maintenance system of dispatching data network
CN102081622A (en) Method and device for evaluating system health degree
CN106878466B (en) A kind of Hydropower Unit data management and equipment control unified platform
Bellavista et al. Machine learning for predictive diagnostics at the edge: An IIoT practical example
US10539934B2 (en) Outage and switch management for a power grid system
CN102903010A (en) Support vector machine-based abnormal judgment method for safety production cloud service platform orientating industrial and mining enterprises
CN106251069A (en) Dynamic on-demand routing inspection method based on big data analysis and association
CN102930372A (en) Data analysis method for association rule of cloud service platform system orienting to safe production of industrial and mining enterprises
CN111091240A (en) Public institution electric power energy efficiency monitoring system and service method
Ahmad et al. Optimization‐based workload distribution in geographically distributed data centers: A survey
CN105022823B (en) A kind of cloud service performance early warning event generation method based on data mining
CN105825314A (en) Monitoring information analysis method and system based on centralized operation and maintenance mode
CN102903009A (en) Malfunction diagnosis method based on generalized rule reasoning and used for safety production cloud service platform facing industrial and mining enterprises
CN106649034B (en) Visual intelligent operation and maintenance method and platform
CN110713090A (en) System and method for realizing real-time monitoring of abnormal state of multi-target elevator
CN117076882A (en) Dynamic prediction management method for cloud service resources
CN108989456B (en) A kind of network implementation approach based on big data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130508

WD01 Invention patent application deemed withdrawn after publication