US8170845B2 - Method and apparatus for automatic performance modeling with load dependent service times and overheads - Google Patents

Method and apparatus for automatic performance modeling with load dependent service times and overheads Download PDF

Info

Publication number
US8170845B2
US8170845B2 US12/236,824 US23682408A US8170845B2 US 8170845 B2 US8170845 B2 US 8170845B2 US 23682408 A US23682408 A US 23682408A US 8170845 B2 US8170845 B2 US 8170845B2
Authority
US
United States
Prior art keywords
function
server
service time
modeled
term
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related, expires
Application number
US12/236,824
Other versions
US20100076733A1 (en
Inventor
Dinesh Kumar
Li Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ServiceNow Inc
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Priority to US12/236,824 priority Critical patent/US8170845B2/en
Assigned to INTERNATIONAL BUSINESS MACHINES CORPORATION reassignment INTERNATIONAL BUSINESS MACHINES CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: KUMAR, DINESH, ZHANG, LI
Publication of US20100076733A1 publication Critical patent/US20100076733A1/en
Application granted granted Critical
Publication of US8170845B2 publication Critical patent/US8170845B2/en
Assigned to MIDWAY TECHNOLOGY COMPANY LLC reassignment MIDWAY TECHNOLOGY COMPANY LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: INTERNATIONAL BUSINESS MACHINES CORPORATION
Assigned to SERVICENOW, INC. reassignment SERVICENOW, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MIDWAY TECHNOLOGY COMPANY LLC
Expired - Fee Related legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/147Network analysis or design for predicting network behaviour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • G06F11/3419Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment by assessing time
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3447Performance evaluation by modeling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/87Monitoring of transactions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0852Delays
    • H04L43/0864Round trip delays

Definitions

  • the present disclosure relates generally to performance modeling of information technology (IT) systems, and more particularly, to performance modeling of transaction-based distributed software applications.
  • Performance modeling of computer networks can be an important tool in the capacity planning of computer networks. Performance models of complex software and hardware network architectures can aid in accurately predicting their performance for varying data traffic patterns and workloads.
  • Transaction-based software applications generate data traffic that includes request/response transaction pairs.
  • Transactions may include requests for service by a computer system. These requests can include collections of sub-requests.
  • a purchase request can include visiting a particular webpage for a product, clicking the buy button, entering account information, clicking the submit button, etc.
  • a transaction-based software application that is deployed over an arbitrary computer network architecture may be considered a transaction-based distributed software application.
  • Each request and response associated with a transaction of the transaction-based distributed software application can be exchanged between different server entities of the computer network.
  • a performance model of the transaction-based distributed software application deployed over the computer network can be used to predict the performance of the application and take measures to improve it if possible.
  • the model may reveal that a server of the network has an unacceptably high response time in servicing requests of the application, suggesting that an upgrade of that server may improve the application's performance.
  • Performance models based on traditional queuing theory need certain model parameters to compute performance metrics.
  • the model parameters include service times of different transaction classes and central processing unit (CPU) overheads of computers within a given network.
  • the performance metrics may include average transactional response time, average number of transactions (or jobs) waiting to be processed in a buffer queue, etc.
  • Other conventional techniques that make use of simulations and manual calibrations can also be used to compute these and other similar performance metrics from the model parameters. However, none of these techniques can be used practically if the service time parameters are not known in advance.
  • Another known conventional technique uses inferencing to generate a load-independent service time and CPU overhead based performance model of an arbitrary computer network architecture. Inferencing allows one to compute the service time parameters from readily available measurement data on end-to-end-response times, overall CPU utilizations, and workload arrival rates. However, this inferencing technique is not reliable unless the arriving transactional workload is stationary.
  • Real world network traffic and the transactional workload arriving at transaction-based software applications is non-stationary in nature when a window of time is considered. Further, state of the art transaction-based software applications are quite complex in design. Accordingly, end-to-end transactional requests and responses may incur variable service times that depend on the total arriving non-stationary workload.
  • An exemplary embodiment of the present invention includes a method for modeling performance of an information technology system having one or more servers for serving a number of types of transactions.
  • the method includes modeling a service time of each transaction type at each server and a processor overhead at each server as one of a polynomial, exponential, or logarithmic function of an average arrival rate of each transaction type at the corresponding server to generate respective service time and processor overhead functions and inferring optimal values of coefficients in these service time and processor overhead functions to generate a performance model of the information technology system.
  • An exemplary embodiment of the present invention includes a method for automated performance modeling of an information technology system.
  • the method includes monitoring transactions exchanged between a plurality of servers of an information technology system to measure an average arrival rate of each transaction type at each server, modeling a service time of each transaction type at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate a service time function, modeling a processor overhead at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate a processor overhead function, inferring optimal values of coefficients in the service time and processor overhead functions by minimizing a quadratic objective function, determining the service time and processor overhead functions using the inferred optimal values of the coefficients, and generating a performance model of the information technology system from the determined service time and processor overhead functions.
  • An exemplary embodiment of the present invention includes a system for modeling performance of an information technology system having one or more servers for serving a number of types of transactions.
  • the system includes a monitoring module, a modeling module, and an enhanced inferencing module.
  • the monitoring module monitors the transactions exchanged between the servers to measure an average arrival rate of each transaction type at each server.
  • the modeling module models a service time of each transaction type at each server and a processor overhead at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate service time and processor overhead functions.
  • the enhanced inferencing module infers optimal values of coefficients in the service time and processor overhead functions to generate a performance model of the information technology system.
  • FIG. 1 illustrates an exemplary e-business site architecture which may be modeled in accordance with an exemplary embodiment of the present invention
  • FIG. 2 illustrates an exemplary queuing model corresponding to the e-business site depicted in FIG. 1 ;
  • FIG. 3 illustrates a high-level flow chart of a method of modeling performance of an information technology system according to an exemplary embodiment of the present invention
  • FIG. 4 illustrates a high-level block diagram of a system for modeling performance of an information technology system according to an exemplary embodiment of the present invention.
  • FIGS. 1-4 Methods and systems for modeling performance of an information technology system are presented herein with reference to FIGS. 1-4 . It is to be understood that the methods described herein may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. In particular, at least a portion of the present invention may be implemented as an application comprising program instructions that are tangibly embodied on one or more program storage devices (e.g., hard disk, magnetic floppy disk, RAM, ROM, CD ROM, etc.) and executable by any device or machine comprising suitable architecture, such as a general purpose digital computer having a processor, memory, and input/output interfaces.
  • program storage devices e.g., hard disk, magnetic floppy disk, RAM, ROM, CD ROM, etc.
  • suitable architecture such as a general purpose digital computer having a processor, memory, and input/output interfaces.
  • Exemplary embodiments of the present invention may be implemented in a distributed data processing environment in which end-to-end response time and CPU utilization measurements are compiled and used for the automatic model building.
  • a distributed data processing environment may be used for modeling an online production environment for a commercial Web site.
  • FIG. 1 illustrates an exemplary e-business site architecture which may be modeled in accordance with an exemplary embodiment of the present invention.
  • a production IT system 100 includes e-business servers, which may be organized to have a multi-tiered architecture and clients 110 , 111 , and 112 .
  • This system 100 is distributed via the Internet 131 ; however, any network or other distributed system may be employed.
  • a firewall 109 is employed for security; however each system may include additional or alternate network configurations.
  • a site 101 may include two separate clusters of servers, including a portal cluster 103 for authentication-related requests and a cluster of business transaction servers 105 for processing business related requests.
  • a portal cluster 103 for authentication-related requests
  • a cluster of business transaction servers 105 for processing business related requests.
  • Incoming requests to the site can be routed to a collection of front-end servers 121 - 122 by a load balancing router 107 .
  • the e-business workload is composed of transactions and requests to the e-business servers.
  • This interaction includes authentication transactions such as login, and business transactions such as browsing a catalog, searching for products, adding items to a shopping cart, proceeding to check out, etc.
  • Each of these transactions may use the site's resources differently.
  • Transactions such as browsing may only involve the front-end application servers 121 - 122 to fetch static pages, which is relatively inexpensive, while other transactions such as searching or checking out may involve composition of a dynamic page or multiple queries to the database that need a large amount of processing time and involve both the front-end application servers 121 - 122 and the back-end database servers 123 - 124 .
  • user navigational patterns vary from person to person. Some users may spend all their time browsing and searching, while some frequent buyers may buy directly without any searching.
  • the e-business workload mix may not be well known. Even though log files recorded by the system can provide some information about load mix, they can be incomplete.
  • a group of geographically distributed probing stations may be used to monitor the IT system's performance. Such probing stations may periodically send out probing requests for different business transactions and measure the end-to-end response times. The end-to-end response times therefore include network delays, which can be different for each geographical location.
  • Modeling becomes more challenging as the workload (e.g., number of jobs or transactions handled per unit of time) for an IT system used in production environment is not stationary.
  • the workload has peak/off-peak volume, daily/weekly patterns, which can be volatile. It is thus a challenging task to assess an IT system's capability of delivering end-to-end performance assurance across the entire IT environment, given the variety of system architectures, numerous applications with different functions, and the vast diversity in user behavior.
  • a queuing model is presented.
  • FIG. 2 with continued reference to FIG. 1 , a high-level multi-class queuing network model 200 is described.
  • This form of model captures resources and delay effects and provides traceability between the performance measures and the system architecture.
  • Each resource component that incurs delays will be modeled by a generic service station with queuing effect.
  • Such a generic service station could have any number of servers. For example, if the delay incurred at the firewall 109 is constant and non-negligible, one could then model the firewall 109 as an infinite server station with constant service time.
  • the transactions can be characterized and profiled into different classes, so that requests within each class would follow similar paths through the various server stations and incur similar service demands at each station along the path.
  • profiling can be based on prior engineering knowledge or after a workload analysis. For example, a login transaction maybe different from a buy transaction and these different transaction sessions would visit a different set of server stations and make different resource demands.
  • network 231 corresponds to Internet 131 in FIG. 1 ;
  • network dispatcher 207 corresponds to router 107 ;
  • clients 210 correspond to clients 110 - 112 and
  • servers 221 - 224 correspond to servers 121 - 124 .
  • End-to-end delays of a particular job may be described as a total response time (e.g., an end-to-end response time) that a user experiences from the time the user issues a transaction to the time the user receives the complete response. Therefore, the client end-to-end response times include the network delay at the network 231 and the delays at the Web servers 221 - 224 .
  • an end-to-end response time R can be represented by equation (1) as follows:
  • a transaction class refers to a grouping of transactions with similar characteristics.
  • the transaction classes may include transactions for searching, buying, logging on, etc.
  • the buying transaction class may include transactions that are responsible for purchasing a particular product.
  • Different transaction classes typically have different service requirements.
  • the above equations may be repeatedly applied to a system that includes multiple computers (e.g., machines) and multiple classes of transactions.
  • the parameter C is used to denote the index set of all transaction classes that may have different service requirements or different service level agreement (SLA) targets.
  • the parameter L is used to denote the index set of all the physical machines that are part of the given network.
  • the parameter ⁇ tilde over ( ⁇ ) ⁇ c is used to denote the rate (e.g., workload) at which transactions of a transaction class c ⁇ C arrive from an external source into the network (e.g., through any machine).
  • the parameter ⁇ i c is used to denote the rate at which transactions of a transaction class c ⁇ C arrive at a machine i ⁇ L from within the network.
  • the parameter ⁇ i c represents the effective or average arrival rate of transactions of a class c arriving at machine i, whereas the parameter ⁇ tilde over ( ⁇ ) ⁇ c represents the total arrival rate of transactions of a class c being generated by a source outside the network.
  • ⁇ tilde over ( ⁇ ) ⁇ c is located at some jth position (j ⁇ L) when class c transactions enter the network through machine j
  • I represents the identity matrix
  • P c represents the traffic routing probability matrix.
  • P c [P ij c ] is the probability that traffic corresponding to transactions of class c leaves a machine i and goes to machine j. It is assumed that P c is well defined for a given network architecture (i.e., the values P ij c are given).
  • the parameter s i c denotes the service time of a class c transaction at machine i. Equations (1) and (2) above can be repeatedly applied to obtain the service time s i c using the following inferencing technique.
  • the response time R c of traffic class c and utilization u i of machine i may be represented respectively by equations (4) and (5) as follows:
  • d net c denotes the total network delay incurred by a class c transaction
  • o i cpu denotes the total CPU overhead at machine i
  • ⁇ i c ⁇ i c / ⁇ tilde over ( ⁇ ) ⁇ c
  • P i is the total number of processors (e.g., CPUs) in machine i.
  • the arriving workload e.g., arrival rate of transactions
  • the computers of the network may incur additional processing overheads when mean transactional workload is high as compared to when the mean transactional workload is low.
  • the extra processing overheads may increase with increasing workload, for example, due to higher context-switching and a higher number of interrupts being generated by a network interface card (NIC) at the CPU of each server.
  • NIC network interface card
  • Service times for a particular transaction class at a given machine and total CPU overhead at a given machine may be more at higher mean workloads as compared to a lower mean. Further, an empirical analysis of an actual system has shown that service times and CPU/computer overheads may incur either polynomial, exponential, or logarithmic increases with increasing mean arriving workload. It has been further observed that the sum arriving workload over all transaction classes works as an approximation to incorporate dependence of service times and CPU overheads on the workload of individual transaction classes.
  • FIG. 3 illustrates a high-level flow chart of method for modeling performance of an information technology system that includes one or more servers.
  • the method includes the selection of one of a polynomial function 312 , an exponential function 313 , or a logarithmic function 314 for modeling service time (S 311 ).
  • the method further includes the selection of one of a polynomial function 322 , an exponential function 323 , or a logarithmic function 324 for modeling CPU overhead (S 321 ).
  • the selected functions need not be of the same type.
  • the polynomial function 312 may have been selected for modeling the service time
  • the exponential function 323 may have been selected for modeling the CPU overhead.
  • Service times for transactions of class c at each server may be expressed as a service time function s i c ( ⁇ i c
  • the CPU overheads may also be expressed as a CPU overhead function o i cpu (g ⁇ i c
  • the functions are considered to be ⁇ i c
  • c ⁇ C ⁇ ⁇ c ⁇ i c and g ⁇ i c
  • c ⁇ C ⁇ ⁇ c ⁇ i c .
  • the service time function at machine i for a given transaction class c maybe expressed as one of equations 9, 10, or 11 as follows:
  • the service time function would be expressed by equation (9). Further, if the exponential function 313 or logarithmic function 314 were previously selected, the service time function would be respectively expressed as equations (10) or (11).
  • the CPU overhead function at machine i may be expressed as one of equations (12), (13), or (14) as follows:
  • the CPU overhead function would be expressed by equation (12). Further, if the exponential function 323 or logarithmic function 324 were previously selected, the CPU overhead function would be respectively expressed as equations (13) or (14).
  • the corresponding optimization problem would then comprise a quadratic objective function in the set of variables ⁇ a p,i c , b q,i , d net c , e c , e i
  • a first term of the quadratic objective function of equation (6) incorporates the measured end-to-end response times ⁇ tilde over (R) ⁇ c ( 341 ) and the second term incorporates the measured server utilizations ⁇ i ( 342 ).
  • the minimizing solution to the optimization problem would yield optimal values of coefficients in the service time and CPU overhead functions (i.e., a p,i c , b q,i ) (S 340 ).
  • the determined coefficient values from this enhanced inferencing technique result in purely sum arriving workload dependent service time and CPU overhead formulations.
  • the coefficient values can be plugged into the service time and CPU overhead functions to generate a performance model of the system.
  • FIG. 4 An exemplary system for generating a performance model for an information technology system having one or more servers is illustrated in FIG. 4 , according to an exemplary embodiment of the present invention.
  • the system 410 includes a monitoring module 411 , a modeling module 412 , and an enhanced inferencing module 413 .
  • the system 410 may include an analysis module 415 and/or a graphical user interface 414 .
  • the monitoring module 411 monitors transactions exchanged between servers 401 , 402 , 403 , and 40 N of an information technology system (e.g., the system illustrated in FIG. 1 ) connected through a network 420 .
  • the monitoring is used to collect and/or measure average arrival rates of each transaction type at each server.
  • the monitoring module 411 may be further used to monitor the transactions to measure end-to-end response times amongst the servers for each transaction type and utilizations of each server.
  • the modeling module 412 models a service time of each transaction type at each server and a CPU overhead at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate service time and CPU overhead functions.
  • the enhanced inferencing module 413 infers optimal values of coefficients in the service time and CPU overhead functions to generate a performance model 416 of the information technology system.
  • the enhanced inferencing module 413 may infer the coefficient values using the above described enhanced inferencing technique.
  • the coefficient values may be inferred by minimizing a quadratic objective function.
  • the quadratic objective function includes a first term based on a relative error e c between the measured end-to-end response times ⁇ tilde over (R) ⁇ c and modeled end-to-end response times R c and a second term based on a relative error e i between the measured utilizations ⁇ i and modeled utilizations u i .
  • the modeled end-to-end response times R c may be based on the service time function, the measured utilizations ⁇ i , and network delays d net c associated with the exchange of each transaction through each server.
  • the modeled utilizations u i may be based on the service time function, the CPU overhead function and the total number of processors P i at each server.
  • the analysis module 414 analyzes the generated performance model 416 to determine whether improvements should be made to the information technology system.
  • the system 410 may further include a graphical user interface to display the suggested improvements to a user.

Abstract

A method for modeling performance of an information technology system having one or more servers for serving a number of types of transactions includes modeling a service time of each transaction type at each server and a processor overhead at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate service time and processor overhead functions and inferring optimal values of coefficients in the service time and processor overhead functions to generate a performance model of the information technology system.

Description

BACKGROUND
1. Technical Field
The present disclosure relates generally to performance modeling of information technology (IT) systems, and more particularly, to performance modeling of transaction-based distributed software applications.
2. Discussion of Related Art
Performance modeling of computer networks can be an important tool in the capacity planning of computer networks. Performance models of complex software and hardware network architectures can aid in accurately predicting their performance for varying data traffic patterns and workloads.
Transaction-based software applications generate data traffic that includes request/response transaction pairs. Transactions may include requests for service by a computer system. These requests can include collections of sub-requests. For example, a purchase request can include visiting a particular webpage for a product, clicking the buy button, entering account information, clicking the submit button, etc. A transaction-based software application that is deployed over an arbitrary computer network architecture may be considered a transaction-based distributed software application. Each request and response associated with a transaction of the transaction-based distributed software application can be exchanged between different server entities of the computer network.
A performance model of the transaction-based distributed software application deployed over the computer network can be used to predict the performance of the application and take measures to improve it if possible. For example, the model may reveal that a server of the network has an unacceptably high response time in servicing requests of the application, suggesting that an upgrade of that server may improve the application's performance.
Performance models based on traditional queuing theory need certain model parameters to compute performance metrics. The model parameters include service times of different transaction classes and central processing unit (CPU) overheads of computers within a given network. The performance metrics may include average transactional response time, average number of transactions (or jobs) waiting to be processed in a buffer queue, etc. Other conventional techniques that make use of simulations and manual calibrations can also be used to compute these and other similar performance metrics from the model parameters. However, none of these techniques can be used practically if the service time parameters are not known in advance.
Another known conventional technique uses inferencing to generate a load-independent service time and CPU overhead based performance model of an arbitrary computer network architecture. Inferencing allows one to compute the service time parameters from readily available measurement data on end-to-end-response times, overall CPU utilizations, and workload arrival rates. However, this inferencing technique is not reliable unless the arriving transactional workload is stationary.
Real world network traffic and the transactional workload arriving at transaction-based software applications is non-stationary in nature when a window of time is considered. Further, state of the art transaction-based software applications are quite complex in design. Accordingly, end-to-end transactional requests and responses may incur variable service times that depend on the total arriving non-stationary workload.
Thus, there is a need for methods and systems that can accurately model system performance when the arriving transactional workload is non-stationary.
SUMMARY
An exemplary embodiment of the present invention includes a method for modeling performance of an information technology system having one or more servers for serving a number of types of transactions. The method includes modeling a service time of each transaction type at each server and a processor overhead at each server as one of a polynomial, exponential, or logarithmic function of an average arrival rate of each transaction type at the corresponding server to generate respective service time and processor overhead functions and inferring optimal values of coefficients in these service time and processor overhead functions to generate a performance model of the information technology system.
An exemplary embodiment of the present invention includes a method for automated performance modeling of an information technology system. The method includes monitoring transactions exchanged between a plurality of servers of an information technology system to measure an average arrival rate of each transaction type at each server, modeling a service time of each transaction type at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate a service time function, modeling a processor overhead at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate a processor overhead function, inferring optimal values of coefficients in the service time and processor overhead functions by minimizing a quadratic objective function, determining the service time and processor overhead functions using the inferred optimal values of the coefficients, and generating a performance model of the information technology system from the determined service time and processor overhead functions.
An exemplary embodiment of the present invention includes a system for modeling performance of an information technology system having one or more servers for serving a number of types of transactions. The system includes a monitoring module, a modeling module, and an enhanced inferencing module. The monitoring module monitors the transactions exchanged between the servers to measure an average arrival rate of each transaction type at each server. The modeling module models a service time of each transaction type at each server and a processor overhead at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate service time and processor overhead functions. The enhanced inferencing module infers optimal values of coefficients in the service time and processor overhead functions to generate a performance model of the information technology system.
BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS
Exemplary embodiments of the invention can be understood in more detail from the following descriptions taken in conjunction with the accompanying drawings in which:
FIG. 1 illustrates an exemplary e-business site architecture which may be modeled in accordance with an exemplary embodiment of the present invention;
FIG. 2 illustrates an exemplary queuing model corresponding to the e-business site depicted in FIG. 1;
FIG. 3 illustrates a high-level flow chart of a method of modeling performance of an information technology system according to an exemplary embodiment of the present invention; and
FIG. 4 illustrates a high-level block diagram of a system for modeling performance of an information technology system according to an exemplary embodiment of the present invention.
DETAILED DESCRIPTION
Methods and systems for modeling performance of an information technology system are presented herein with reference to FIGS. 1-4. It is to be understood that the methods described herein may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. In particular, at least a portion of the present invention may be implemented as an application comprising program instructions that are tangibly embodied on one or more program storage devices (e.g., hard disk, magnetic floppy disk, RAM, ROM, CD ROM, etc.) and executable by any device or machine comprising suitable architecture, such as a general purpose digital computer having a processor, memory, and input/output interfaces. It is to be further understood that, because some of the constituent system components and process steps depicted in the accompanying figures may be implemented in software, the connections between system modules (or the logic flow of method steps) may differ depending upon the manner in which the present invention is programmed. Given the teachings herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations of the present invention.
Exemplary embodiments of the present invention may be implemented in a distributed data processing environment in which end-to-end response time and CPU utilization measurements are compiled and used for the automatic model building. For example, a distributed data processing environment may be used for modeling an online production environment for a commercial Web site.
FIG. 1 illustrates an exemplary e-business site architecture which may be modeled in accordance with an exemplary embodiment of the present invention. Referring to FIG. 1, a production IT system 100 includes e-business servers, which may be organized to have a multi-tiered architecture and clients 110, 111, and 112. This system 100 is distributed via the Internet 131; however, any network or other distributed system may be employed. In this example, a firewall 109 is employed for security; however each system may include additional or alternate network configurations.
For example, a site 101 may include two separate clusters of servers, including a portal cluster 103 for authentication-related requests and a cluster of business transaction servers 105 for processing business related requests. Within each cluster, there are two or more tiers, front-end servers 121-122, and back-end database servers 123-124. Incoming requests to the site can be routed to a collection of front-end servers 121-122 by a load balancing router 107.
The e-business workload is composed of transactions and requests to the e-business servers. Consider for example an enterprise online shopping scenario. This interaction includes authentication transactions such as login, and business transactions such as browsing a catalog, searching for products, adding items to a shopping cart, proceeding to check out, etc. Each of these transactions may use the site's resources differently.
Transactions such as browsing may only involve the front-end application servers 121-122 to fetch static pages, which is relatively inexpensive, while other transactions such as searching or checking out may involve composition of a dynamic page or multiple queries to the database that need a large amount of processing time and involve both the front-end application servers 121-122 and the back-end database servers 123-124. In addition, user navigational patterns vary from person to person. Some users may spend all their time browsing and searching, while some frequent buyers may buy directly without any searching.
For a production system 100, the e-business workload mix may not be well known. Even though log files recorded by the system can provide some information about load mix, they can be incomplete. A group of geographically distributed probing stations may be used to monitor the IT system's performance. Such probing stations may periodically send out probing requests for different business transactions and measure the end-to-end response times. The end-to-end response times therefore include network delays, which can be different for each geographical location.
Modeling becomes more challenging as the workload (e.g., number of jobs or transactions handled per unit of time) for an IT system used in production environment is not stationary. The workload has peak/off-peak volume, daily/weekly patterns, which can be volatile. It is thus a challenging task to assess an IT system's capability of delivering end-to-end performance assurance across the entire IT environment, given the variety of system architectures, numerous applications with different functions, and the vast diversity in user behavior.
To ensure the feasibility of the modeling framework, yet still capture the characteristics of a complex e-business infrastructure, a queuing model is presented. Referring to FIG. 2 with continued reference to FIG. 1, a high-level multi-class queuing network model 200 is described. This form of model captures resources and delay effects and provides traceability between the performance measures and the system architecture. Each resource component that incurs delays will be modeled by a generic service station with queuing effect. Such a generic service station could have any number of servers. For example, if the delay incurred at the firewall 109 is constant and non-negligible, one could then model the firewall 109 as an infinite server station with constant service time.
The transactions can be characterized and profiled into different classes, so that requests within each class would follow similar paths through the various server stations and incur similar service demands at each station along the path. Such profiling can be based on prior engineering knowledge or after a workload analysis. For example, a login transaction maybe different from a buy transaction and these different transaction sessions would visit a different set of server stations and make different resource demands.
The system described with reference to FIG. 1 can be modeled as an open queuing network 200 of FIG. 2. In FIG. 2, network 231 corresponds to Internet 131 in FIG. 1; network dispatcher 207 corresponds to router 107; clients 210 correspond to clients 110-112 and servers 221-224 correspond to servers 121-124.
Since the delays incurred at the network 231 are non-negligible, such delay effects can be modeled as delay servers. End-to-end delays of a particular job (or transaction) may be described as a total response time (e.g., an end-to-end response time) that a user experiences from the time the user issues a transaction to the time the user receives the complete response. Therefore, the client end-to-end response times include the network delay at the network 231 and the delays at the Web servers 221-224.
From queuing theory, for a single transaction class (or type) hitting a single server, an end-to-end response time R can be represented by equation (1) as follows:
R = s 1 - u , ( 1 )
where, u is represented by equation (2) as follows:
u=λs  (2).
In the above equations, s represents the service time of the given transaction class on the server, R represents the response time of the given transaction class on the server, λ represents the arrival rate of all such transactions on the server, and u represents the utilization of the server. A transaction class refers to a grouping of transactions with similar characteristics. For example the transaction classes may include transactions for searching, buying, logging on, etc. For example, the buying transaction class may include transactions that are responsible for purchasing a particular product. Different transaction classes typically have different service requirements. The above equations may be repeatedly applied to a system that includes multiple computers (e.g., machines) and multiple classes of transactions.
It is assumed that there are K transaction classes and M physical machines in a given network. The parameter C is used to denote the index set of all transaction classes that may have different service requirements or different service level agreement (SLA) targets. The parameter L is used to denote the index set of all the physical machines that are part of the given network. The parameter {tilde over (γ)}c is used to denote the rate (e.g., workload) at which transactions of a transaction class cεC arrive from an external source into the network (e.g., through any machine). The parameter λi c is used to denote the rate at which transactions of a transaction class cεC arrive at a machine iεL from within the network. The parameter λi c represents the effective or average arrival rate of transactions of a class c arriving at machine i, whereas the parameter {tilde over (γ)}c represents the total arrival rate of transactions of a class c being generated by a source outside the network. The effective arrival rate may be computed from the total arrival rate by solving the open Jackson network traffic balance equations in equation (3) as follows:
cεC,λ cc(I−P c),  (3)
where λc=[λ1 c, λ2 c, . . . , λM c], the vector γc=[0, 0, . . . , {tilde over (γ)}c, . . . 0], {tilde over (γ)}c is located at some jth position (jεL) when class c transactions enter the network through machine j, I represents the identity matrix, and Pc represents the traffic routing probability matrix. For example, Pc=[Pij c] is the probability that traffic corresponding to transactions of class c leaves a machine i and goes to machine j. It is assumed that Pc is well defined for a given network architecture (i.e., the values Pij c are given).
The parameter si c denotes the service time of a class c transaction at machine i. Equations (1) and (2) above can be repeatedly applied to obtain the service time si c using the following inferencing technique. The response time Rc of traffic class c and utilization ui of machine i, may be represented respectively by equations (4) and (5) as follows:
c C , i L α i c s i c 1 - u ~ i + d net c = R c , ( 4 ) i L , c C λ i c P i s i c + o i cpu = u i , ( 5 )
where, dnet c denotes the total network delay incurred by a class c transaction, oi cpu denotes the total CPU overhead at machine i, αi ci c/{tilde over (γ)}c and Pi is the total number of processors (e.g., CPUs) in machine i. One may then seek to minimize the sum of the squares of relative errors ec and ei between the analytical entities Rc and ui given by equations (4) and (5) and their corresponding mean measurement values {tilde over (R)}c and ũi obtained from the real network. The corresponding optimization problem would then comprise a quadratic objective function in the set of variables {si c, dnet c, oi cpu, ec, ei, cεC, iεL}, where the quadratic objective function is represented by equation (6) as follows:
min c C ( e c R ~ c ) 2 + i L ( e i u ~ i ) 2 ( 6 )
with the following set of linear constraints that may be represented by equations (7) and (8) as follows:
cεC,R c +e c ={tilde over (R)} c  (7)
iεL,u i +e i i,  (8)
where, Rc and ui may be given respectively by equations (4) and (5). The solution to the optimization problem represented by the quadratic objective function of equation (6) and the linear constraints of equations (7) and (8) yields the service times si c and CPU overheads oi cpu.
However, the above optimization problem assumes that the arriving workload (e.g., arrival rate of transactions) will be stationary. When the arriving workload is non-stationary, the computers of the network may incur additional processing overheads when mean transactional workload is high as compared to when the mean transactional workload is low. The extra processing overheads may increase with increasing workload, for example, due to higher context-switching and a higher number of interrupts being generated by a network interface card (NIC) at the CPU of each server.
Service times for a particular transaction class at a given machine and total CPU overhead at a given machine may be more at higher mean workloads as compared to a lower mean. Further, an empirical analysis of an actual system has shown that service times and CPU/computer overheads may incur either polynomial, exponential, or logarithmic increases with increasing mean arriving workload. It has been further observed that the sum arriving workload over all transaction classes works as an approximation to incorporate dependence of service times and CPU overheads on the workload of individual transaction classes.
Accordingly, exemplary methods and systems for modeling system performance presented herein consider the service times and CPU overheads as increasing polynomial, exponential, or logarithmic functions of the total arriving workload. FIG. 3 illustrates a high-level flow chart of method for modeling performance of an information technology system that includes one or more servers. The method includes the selection of one of a polynomial function 312, an exponential function 313, or a logarithmic function 314 for modeling service time (S311). The method further includes the selection of one of a polynomial function 322, an exponential function 323, or a logarithmic function 324 for modeling CPU overhead (S321). The selected functions need not be of the same type. For example, the polynomial function 312 may have been selected for modeling the service time, while the exponential function 323 may have been selected for modeling the CPU overhead.
Service times for transactions of class c at each server may be expressed as a service time function si c(ƒ{λi c|cεC}) for any given function ƒ by incorporating measured arrival rates λi c of each transaction class c at the corresponding server i into the selected function (e.g., 312, 313, or 314) (S315). The CPU overheads may also be expressed as a CPU overhead function oi cpu (g {λi c|cεC}) for any given function g by incorporating the measured arrival rates λi c 330 of each transaction class c into the selected function (e.g., 322, 323, or 324) (S325). For example, the functions are considered to be ƒ{λi c|cεC}=Σcλi c and g{λi c|cεC}=Σcλi c.
The service time function at machine i for a given transaction class c maybe expressed as one of equations 9, 10, or 11 as follows:
s i c ( c λ i c ) = a 0 , i c + a 1 , i c c λ i c + a 2 , i c ( c λ i c ) 2 + + a n , i c ( c λ i c ) n , ( 9 ) s i c ( c λ i c ) = a 0 , i c + a 1 , i c c λ i c + a 2 , i c ( c λ i c ) 2 + + a n , i c ( c λ i c ) n , ( 10 ) s i c ( c λ i c ) = a 0 , i c + a 1 , i c log c λ i c + a 2 , i c ( log c λ i c ) 2 + + a n , i c ( log c λ i c ) n , ( 11 )
where, n represents the degree of the functions. For example, if the polynomial function 312 was previously selected, the service time function would be expressed by equation (9). Further, if the exponential function 313 or logarithmic function 314 were previously selected, the service time function would be respectively expressed as equations (10) or (11).
The CPU overhead function at machine i may be expressed as one of equations (12), (13), or (14) as follows:
o i cpu ( c λ i c ) = b 0 , i + b 1 , i c λ i c + b 2 , i ( c λ i c ) 2 + + b m , i ( c λ i c ) m , ( 12 ) o i cpu ( c λ i c ) = b 0 , i + b 1 , i c λ i c + b 2 , i ( c λ i c ) 2 + + b m , i ( c λ i c ) m , ( 13 ) o i cpu ( c λ i c ) = b 0 , i + b 1 , i log c λ i c + b 2 , i ( log c λ i c ) 2 + + b m , i ( log c λ i c ) m , ( 14 )
where, m represents the degree of the function. For example, if the polynomial function 322 was previously selected, the CPU overhead function would be expressed by equation (12). Further, if the exponential function 323 or logarithmic function 324 were previously selected, the CPU overhead function would be respectively expressed as equations (13) or (14).
The previous equations (4) and (5) for response time Rc of a traffic class c and utilization ui of machine i can then be re-written incorporating the service time function si ccλi c) and the CPU overhead function oi cpucλi c) as shown respectively in equations (15) and (16) as follows:
c C , i L α i c s i c ( c λ i c ) 1 - u ~ i + d net c = R c , ( 15 ) i L , c C λ i c P i s i c ( c λ i c ) + o i cpu ( c λ i c ) = u i . ( 16 )
The corresponding optimization problem would then comprise a quadratic objective function in the set of variables {ap,i c, bq,i, dnet c, ec, ei|pε{0, 1, . . . , n},qε{0, 1, . . . , m},cεC, iεL}, where the quadratic objective function is represented by equation 6, linear constraints may be represented by equations (7) and (8), and Rc and ui can instead given by equations (15) and (16). For example, a first term of the quadratic objective function of equation (6) incorporates the measured end-to-end response times {tilde over (R)}c (341) and the second term incorporates the measured server utilizations ũi (342). The minimizing solution to the optimization problem would yield optimal values of coefficients in the service time and CPU overhead functions (i.e., ap,i c, bq,i) (S340).
The determined coefficient values from this enhanced inferencing technique result in purely sum arriving workload dependent service time and CPU overhead formulations. The coefficient values can be plugged into the service time and CPU overhead functions to generate a performance model of the system.
An exemplary system for generating a performance model for an information technology system having one or more servers is illustrated in FIG. 4, according to an exemplary embodiment of the present invention. The system 410 includes a monitoring module 411, a modeling module 412, and an enhanced inferencing module 413. The system 410 may include an analysis module 415 and/or a graphical user interface 414.
The monitoring module 411 monitors transactions exchanged between servers 401, 402, 403, and 40N of an information technology system (e.g., the system illustrated in FIG. 1) connected through a network 420. The monitoring is used to collect and/or measure average arrival rates of each transaction type at each server. The monitoring module 411 may be further used to monitor the transactions to measure end-to-end response times amongst the servers for each transaction type and utilizations of each server.
The modeling module 412 models a service time of each transaction type at each server and a CPU overhead at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate service time and CPU overhead functions.
The enhanced inferencing module 413 infers optimal values of coefficients in the service time and CPU overhead functions to generate a performance model 416 of the information technology system. The enhanced inferencing module 413 may infer the coefficient values using the above described enhanced inferencing technique. The coefficient values may be inferred by minimizing a quadratic objective function. As shown in equations (6)-(8), the quadratic objective function includes a first term based on a relative error ec between the measured end-to-end response times {tilde over (R)}c and modeled end-to-end response times Rc and a second term based on a relative error ei between the measured utilizations ũi and modeled utilizations ui. As shown in equation (15), the modeled end-to-end response times Rc may be based on the service time function, the measured utilizations ũi, and network delays dnet c associated with the exchange of each transaction through each server. As shown in equation (16), the modeled utilizations ui may be based on the service time function, the CPU overhead function and the total number of processors Pi at each server.
The analysis module 414 analyzes the generated performance model 416 to determine whether improvements should be made to the information technology system. The system 410 may further include a graphical user interface to display the suggested improvements to a user.
It is to be understood that the particular exemplary embodiments disclosed above are illustrative only, as the invention may be modified and practiced in different but equivalent manners apparent to those skilled in the art having the benefit of the teachings herein. Furthermore, no limitations are intended to the herein described exemplary embodiments, other than as described in the claims below. It is therefore evident that the particular exemplary embodiments disclosed herein may be altered or modified and all such variations are considered within the scope and spirit of the disclosure.

Claims (23)

1. A method for modeling performance of an information technology system having a server for serving a class of transactions, the method comprising:
modeling a service time for servicing transactions of the class on the server as one of a polynomial, exponential, or logarithmic function that comprises a first term including an average arrival rate of transactions of the class at the server raised to a first power and a second term including the average arrival rate raised to a second power higher than the first to generate a service time function;
modeling a processor overhead time of a processor of the server as one of a polynomial, exponential, or logarithmic function that comprises a third term including the average arrival rate and a fourth term including the average arrival rate raised to a fourth power higher than the third to generate a processor overhead function;
optimizing coefficients of the terms of the service time function and coefficients of the terms of the processor overhead function; and
generating a performance model of the information technology system using the resulting service time and processor overhead functions.
2. The method of claim 1, wherein the optimizing comprises minimizing a quadratic objective function,
wherein the quadratic objective function comprises:
a first term based on measured end-to-end response times of each transaction of the class; and
a second term based on a measured utilization of the server.
3. The method of claim 2, wherein the first term includes an error term that represents a relative error between the measured end-to-end response times and modeled end-to-end response times that are based on the service time function and the second term includes an error term that represents a relative error between the measured utilization and a modeled utilization based on the processor overhead function.
4. A method for automated performance modeling of an information technology system, the method comprising:
monitoring transactions exchanged between a plurality of servers of an information technology system to measure an average arrival rate of each transaction type at each server;
modeling a service time for servicing transactions of each transaction type at each server as a function that comprises a first term including an average arrival rate of the transactions of the corresponding transaction type at the corresponding server raised to a first power and a second term including the average arrival rate raised to a second higher power to generate a service time function;
modeling a processor overhead time of a processor at each server as one of a polynomial, an exponential, or logarithmic function of the average arrival rate of the transactions of the corresponding transaction type at the corresponding server to generate a processor overhead function;
optimizing a coefficient in a term in the service time function and a coefficient in a term in the processor overhead function by minimizing a quadratic objective function; and
generating a performance model of the information technology system from the resulting service time and processor overhead functions.
5. The method of claim 4, wherein the monitoring further comprises monitoring the transactions to measure end-to-end response times between the server and another server for each transaction class and utilizations of each server.
6. The method of claim 5, wherein the quadratic objective function comprises a first term based on a relative error between the measured end-to-end response times and modeled end-to-end response times and a second term based on a relative error between the measured utilizations and modeled utilizations.
7. The method of claim 6, wherein the modeled end-to-end response times are based on the service time function.
8. The method of claim 7, wherein the modeled end-to-end response times are further based on the measured utilizations.
9. The method of claim 8, wherein the modeled end-to-end response times are further based on network delays associated with the exchange of each transaction type through the servers.
10. The method of claim 6, wherein the modeled utilizations are based on the service time function.
11. The method of claim 10, wherein the modeled utilizations are further based on the processor overhead function.
12. The method of claim 11 wherein the modeled utilizations are further based on a total number of processors in the servers.
13. The method of claim 4, wherein the processor overhead time is based on the processor engaging in context-switching.
14. The method of claim 13, wherein the processor overhead time is further based on the processor servicing interrupts.
15. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for modeling performance of an information technology system including a plurality of servers, the method steps comprising:
monitoring transactions exchanged between a plurality of servers of an information technology system to measure an average arrival rate of each transaction type at each server;
modeling a service time of each transaction type for servicing transactions at each server as one of a polynomial, exponential, or logarithmic function that comprises a first term including a summation of arrival rates of the transactions of the corresponding transaction type at the corresponding server that is raised to a first power and a second term including the summation raised to a second power higher than the first to generate a service time function;
modeling a processor overhead time of a processor at each server as one of a polynomial, exponential, or logarithmic function of the average arrival rate of each transaction type at the corresponding server to generate a processor overhead function;
optimizing a coefficient in a term in the service time function and a coefficient in a term in the processor overhead function by minimizing a quadratic objective function; and
generating a performance model of the information technology system from the resulting service time and processor overhead functions.
16. The program storage device of claim 15, wherein the monitoring further comprises monitoring the transactions to measure end-to-end response times amongst the servers for each transaction type and utilizations of each server.
17. The program storage device of claim 16, wherein the quadratic objective function comprises a first term based on a relative error between the measured end-to-end response times and modeled end-to-end response times and a second term based on a relative error between the measured utilizations and modeled utilizations.
18. The program storage device of claim 17, wherein the modeled end-to-end response times are based on the service time function.
19. The program storage device of claim 18, wherein the modeled end-to-end response times are further based on the measured utilizations.
20. The program storage device of claim 19, wherein the modeled end-to-end response times are further based on network delays associated with the exchange of each transaction type through the servers.
21. The program storage device of claim 17, wherein the modeled utilizations are based on the service time function.
22. The program storage device of claim 21, wherein the modeled utilizations are further based on the processor overhead function.
23. The program storage device of claim 22, wherein the modeled utilizations are further based on a total number of processors in the servers.
US12/236,824 2008-09-24 2008-09-24 Method and apparatus for automatic performance modeling with load dependent service times and overheads Expired - Fee Related US8170845B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/236,824 US8170845B2 (en) 2008-09-24 2008-09-24 Method and apparatus for automatic performance modeling with load dependent service times and overheads

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US12/236,824 US8170845B2 (en) 2008-09-24 2008-09-24 Method and apparatus for automatic performance modeling with load dependent service times and overheads

Publications (2)

Publication Number Publication Date
US20100076733A1 US20100076733A1 (en) 2010-03-25
US8170845B2 true US8170845B2 (en) 2012-05-01

Family

ID=42038535

Family Applications (1)

Application Number Title Priority Date Filing Date
US12/236,824 Expired - Fee Related US8170845B2 (en) 2008-09-24 2008-09-24 Method and apparatus for automatic performance modeling with load dependent service times and overheads

Country Status (1)

Country Link
US (1) US8170845B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10146656B2 (en) 2016-02-16 2018-12-04 Tata Consultancy Services Limited Service demand based performance prediction using a single workload

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8689037B2 (en) * 2009-12-16 2014-04-01 Qualcomm Incorporated System and method for asynchronously and independently controlling core clocks in a multicore central processing unit
US9104411B2 (en) 2009-12-16 2015-08-11 Qualcomm Incorporated System and method for controlling central processing unit power with guaranteed transient deadlines
US8775830B2 (en) 2009-12-16 2014-07-08 Qualcomm Incorporated System and method for dynamically controlling a plurality of cores in a multicore central processing unit based on temperature
US9128705B2 (en) * 2009-12-16 2015-09-08 Qualcomm Incorporated System and method for controlling central processing unit power with reduced frequency oscillations
US9563250B2 (en) * 2009-12-16 2017-02-07 Qualcomm Incorporated System and method for controlling central processing unit power based on inferred workload parallelism
US8909962B2 (en) * 2009-12-16 2014-12-09 Qualcomm Incorporated System and method for controlling central processing unit power with guaranteed transient deadlines
US9176572B2 (en) 2009-12-16 2015-11-03 Qualcomm Incorporated System and method for controlling central processing unit power with guaranteed transient deadlines
US20110145559A1 (en) * 2009-12-16 2011-06-16 Thomson Steven S System and method for controlling central processing unit power with guaranteed steady state deadlines
US8650426B2 (en) 2009-12-16 2014-02-11 Qualcomm Incorporated System and method for controlling central processing unit power in a virtualized system
US20120066375A1 (en) * 2010-03-11 2012-03-15 InMon Corp. Combining Data Associated with Hardware Resources and Application Transactions
US10355945B2 (en) * 2016-09-21 2019-07-16 International Business Machines Corporation Service level management of a workload defined environment
US10474563B1 (en) * 2016-12-28 2019-11-12 Wells Fargo Bank, N.A. System testing from production transactions
US11005731B2 (en) * 2017-04-05 2021-05-11 Cisco Technology, Inc. Estimating model parameters for automatic deployment of scalable micro services

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5465354A (en) 1992-03-19 1995-11-07 Hitachi, Ltd. Method and apparatus for job execution prediction and control and method for job execution situation display
US5946662A (en) * 1996-03-29 1999-08-31 International Business Machines Corporation Method for providing inventory optimization
US6393480B1 (en) 1999-06-21 2002-05-21 Compuware Corporation Application response time prediction
US20020082807A1 (en) 2000-12-23 2002-06-27 Turicchi Thomas Edwin Method for service level estimation in an operating computer system
US6567767B1 (en) 2000-09-19 2003-05-20 Unisys Corporation Terminal server simulated client performance measurement tool
US20030126254A1 (en) * 2001-11-26 2003-07-03 Cruickshank Robert F. Network performance monitoring
US20040010674A1 (en) * 2002-07-11 2004-01-15 International Business Machines Corporation Lazy deregistration protocol for a split socket stack
US6691067B1 (en) * 1999-04-07 2004-02-10 Bmc Software, Inc. Enterprise management system and method which includes statistical recreation of system resource usage for more accurate monitoring, prediction, and performance workload characterization
US20040059544A1 (en) * 2001-08-06 2004-03-25 Itzhak Smocha Software system and methods for analyzing the performance of a server
US20050086335A1 (en) * 2003-10-20 2005-04-21 International Business Machines Corporation Method and apparatus for automatic modeling building using inference for IT systems
US20050114583A1 (en) * 2001-12-27 2005-05-26 John Beale Low-overhead processor interfacing
US20050240935A1 (en) 2004-04-07 2005-10-27 Krishnan Ramanathan Method of distributing load amongst two or more computer system resources
US6973415B1 (en) * 2003-11-12 2005-12-06 Sprint Communications Company L.P. System and method for monitoring and modeling system performance
US20070168494A1 (en) 2005-12-22 2007-07-19 Zhen Liu Method and system for on-line performance modeling using inference for real production it systems
US7369967B1 (en) * 2004-12-27 2008-05-06 Sprint Communications Company L.P. System and method for monitoring and modeling system performance

Patent Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5465354A (en) 1992-03-19 1995-11-07 Hitachi, Ltd. Method and apparatus for job execution prediction and control and method for job execution situation display
US5946662A (en) * 1996-03-29 1999-08-31 International Business Machines Corporation Method for providing inventory optimization
US6691067B1 (en) * 1999-04-07 2004-02-10 Bmc Software, Inc. Enterprise management system and method which includes statistical recreation of system resource usage for more accurate monitoring, prediction, and performance workload characterization
US6393480B1 (en) 1999-06-21 2002-05-21 Compuware Corporation Application response time prediction
US6567767B1 (en) 2000-09-19 2003-05-20 Unisys Corporation Terminal server simulated client performance measurement tool
US20020082807A1 (en) 2000-12-23 2002-06-27 Turicchi Thomas Edwin Method for service level estimation in an operating computer system
US20040059544A1 (en) * 2001-08-06 2004-03-25 Itzhak Smocha Software system and methods for analyzing the performance of a server
US20030126254A1 (en) * 2001-11-26 2003-07-03 Cruickshank Robert F. Network performance monitoring
US20050114583A1 (en) * 2001-12-27 2005-05-26 John Beale Low-overhead processor interfacing
US20040010674A1 (en) * 2002-07-11 2004-01-15 International Business Machines Corporation Lazy deregistration protocol for a split socket stack
US20050086335A1 (en) * 2003-10-20 2005-04-21 International Business Machines Corporation Method and apparatus for automatic modeling building using inference for IT systems
US7296256B2 (en) 2003-10-20 2007-11-13 International Business Machines Corporation Method and apparatus for automatic modeling building using inference for IT systems
US6973415B1 (en) * 2003-11-12 2005-12-06 Sprint Communications Company L.P. System and method for monitoring and modeling system performance
US7107187B1 (en) * 2003-11-12 2006-09-12 Sprint Communications Company L.P. Method for modeling system performance
US20050240935A1 (en) 2004-04-07 2005-10-27 Krishnan Ramanathan Method of distributing load amongst two or more computer system resources
US7369967B1 (en) * 2004-12-27 2008-05-06 Sprint Communications Company L.P. System and method for monitoring and modeling system performance
US20070168494A1 (en) 2005-12-22 2007-07-19 Zhen Liu Method and system for on-line performance modeling using inference for real production it systems
US7739099B2 (en) * 2005-12-22 2010-06-15 International Business Machines Corporation Method and system for on-line performance modeling using inference for real production IT systems

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Managing Versions of Web Documents in a Transaction-time Web Server". Dyreson, Curtis, et al. May 17-22, 2004. *
"Queueing Model Based Network Server Performance Control", Sha, Lui, et al. IEEE 2002. *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10146656B2 (en) 2016-02-16 2018-12-04 Tata Consultancy Services Limited Service demand based performance prediction using a single workload

Also Published As

Publication number Publication date
US20100076733A1 (en) 2010-03-25

Similar Documents

Publication Publication Date Title
US8170845B2 (en) Method and apparatus for automatic performance modeling with load dependent service times and overheads
Stewart et al. Exploiting nonstationarity for performance prediction
US7739099B2 (en) Method and system for on-line performance modeling using inference for real production IT systems
US7720955B1 (en) Determining performance of an application based on transactions
CN100391159C (en) Method and apparatus for automatic modeling building using inference for IT systems
US7805496B2 (en) Automatic generation of hybrid performance models
US8452862B2 (en) Apparatus and method for capacity planning for data center server consolidation and workload reassignment
US20080221941A1 (en) System and method for capacity planning for computing systems
US8224624B2 (en) Using application performance signatures for characterizing application updates
Zhang et al. A regression-based analytic model for capacity planning of multi-tier applications
US20100094992A1 (en) Capacity Planning Of Multi-tiered Applicatons From Application Logs
Almeida et al. Capacity planning an essential tool for managing Web services
US8332507B2 (en) Method for determining service demands in a network load balanced scenario
Nambiar et al. Transaction processing performance council (TPC): state of the council 2010
Kalbasi et al. Dec: Service demand estimation with confidence
Liew et al. Cloudguide: Helping users estimate cloud deployment cost and performance for legacy web applications
Zhang et al. Workload service requirements analysis: A queueing network optimization approach
Gupta et al. Long range dependence in cloud servers: a statistical analysis based on google workload trace
Rolia et al. Resource demand modeling for multi-tier services
Ramakrishnan et al. Setting realistic think times in performance testing: A practitioner's approach
Gokhale et al. Performance and availability analysis of an e-commerce site
Uhlir et al. Latency-based benchmarking of cloud service providers
Almeida Capacity planning for web services techniques and methodology
Haak et al. Autonomic benchmarking for cloud infrastructures: an economic optimization model
Chhetri et al. Smart CloudMonitor-providing visibility into performance of black-box clouds

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION,NEW YO

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KUMAR, DINESH;ZHANG, LI;REEL/FRAME:021579/0823

Effective date: 20080820

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW Y

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KUMAR, DINESH;ZHANG, LI;REEL/FRAME:021579/0823

Effective date: 20080820

STCF Information on status: patent grant

Free format text: PATENTED CASE

REMI Maintenance fee reminder mailed
AS Assignment

Owner name: MIDWAY TECHNOLOGY COMPANY LLC, MICHIGAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:037704/0257

Effective date: 20151231

AS Assignment

Owner name: SERVICENOW, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MIDWAY TECHNOLOGY COMPANY LLC;REEL/FRAME:038324/0816

Effective date: 20160324

FPAY Fee payment

Year of fee payment: 4

SULP Surcharge for late payment
FEPP Fee payment procedure

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FEPP Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

LAPS Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20200501