WO2001074042A2 - Lexical analysis of telephone conversations with call center agents - Google Patents

Lexical analysis of telephone conversations with call center agents Download PDF

Info

Publication number
WO2001074042A2
WO2001074042A2 PCT/US2001/009131 US0109131W WO0174042A2 WO 2001074042 A2 WO2001074042 A2 WO 2001074042A2 US 0109131 W US0109131 W US 0109131W WO 0174042 A2 WO0174042 A2 WO 0174042A2
Authority
WO
WIPO (PCT)
Prior art keywords
call
calls
features
call center
lexical content
Prior art date
Application number
PCT/US2001/009131
Other languages
French (fr)
Other versions
WO2001074042A3 (en
Inventor
Jonathan Hood Young
Sean D. True
David Meyer Ray
Jakob Wahlberg
Bradley Ray Howes
Megan M. Mca'nulty
John A. Morse
Mark Jackson
Original Assignee
Dragon Systems, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dragon Systems, Inc. filed Critical Dragon Systems, Inc.
Priority to EP01918909A priority Critical patent/EP1273159A2/en
Priority to AU2001245927A priority patent/AU2001245927A1/en
Publication of WO2001074042A2 publication Critical patent/WO2001074042A2/en
Publication of WO2001074042A3 publication Critical patent/WO2001074042A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • H04M3/5175Call or contact centers supervision arrangements

Definitions

  • This invention relates to speech recognition.
  • call centers to handle phone calls with customers.
  • call centers employ multiple agents to handle technical support calls, customer orders, and so forth.
  • Call centers often provide scripts and other techniques to ensure that calls are handled consistently and in the manner desired by the organization.
  • Some organizations record telephone conversations between agents and customers to monitor customer service quality, for legal purposes, and for other reasons.
  • organizations also record calls within an organization such as one call center agent asking a question of another agent.
  • Buried within the collection of recorded calls from a call center are customer comments, suggestions, and other information of interest in making decisions regarding marketing, technical support, engineering, call center management, and other issues.
  • many centers instruct agents to ask specific questions of customers and to log their responses into a database. Summary
  • the invention features a method of analyzing a collection of calls at one or more call center stations.
  • the method includes receiving lexical content of a telephone call handled by a call center agent, the lexical content being identified by a speech recognition system and identifying one or more features of the telephone call based on the received lexical content.
  • the method also includes storing the one or more identified features along with one or more identified features of another telephone call, collectively analyzing the stored features of the telephone calls, and reporting results of the analyzing.
  • Embodiments may include one or more of the following features.
  • the method may include receiving acoustic data signals corresponding to the telephone call, and performing speech recognition on the received acoustic data to determine the lexical content of the call.
  • the method may include receiving descriptive information for a call such as the call duration, call time, caller identification, and agent identification. Identifying features may be performed based on the descriptive information.
  • One or the features may include a term frequency feature, a readability feature, a script-adherence feature, and/or feature classifying utterances (e.g., classifying an utterance as at least one of the following: a question, an answer, and a hesitation).
  • a term frequency feature e.g., classifying an utterance as at least one of the following: a question, an answer, and a hesitation.
  • the method may further include receiving identification of a speaker of identified lexical content.
  • the identification may be determined.
  • the features may include a feature measuring agent speaking time, a feature measuring caller speaking time.
  • the analysis may include representing at least some of the calls in a vector space model.
  • the analysis may further include determining clusters of calls in the vector space model, for example, using k-means clustering.
  • the analysis may further include tracking clusters of calls over time (e.g., identifying new clusters and/or identifying changes in a cluster).
  • the analysis may further include using the vector space model to identify calls similar to a call having specified properties, for example, to identify calls similar to a specified call.
  • the analyzing may include receiving an ad-hoc query (e.g., a Boolean query) and ranking calls based on the query. Such a ranking may include determining the term frequency of terms in call and/or determining the term frequency of terms in a corpus of calls and using an inverse document frequency statistic.
  • the collectively analyzing may include using a natural language processing technique.
  • the method may include storing audio signal data for at least some of the calls for subsequent playback.
  • the collectively analyzing may include identifying call topics handled by call center agents and/or determining the performance of call center agents.
  • the invention features software disposed on a computer readable medium, for use at a call center having one or more agents handling calls at one or more call center stations.
  • the software includes instructions for causing a processor to receive lexical content of a telephone call handled by a call center agent, the lexical content being identified by a speech recognition system, identify one or more features of the telephone call based on the received lexical content, store the identified features along with the identified features of other telephone calls, collectively analyze the features of telephone calls, and report the analysis.
  • FIG. 1 is a diagram of a call center that uses speech recognition to identify terms spoken during calls between agents and customers.
  • FIG. 2 is a flowchart of a process for identifying call features and using the identified features to generate reports and to respond to queries.
  • FIG. 3 is a flowchart of a process for identifying call features.
  • FIG. 4 is a diagram of a vector space having call features as dimensions.
  • FIG. 5 is a diagram of clusters in vector space.
  • FIG. 6 is a flowchart of a process for using a vector space representation of calls to produce reports and respond to queries.
  • FIG. 1 shows an example of a call center 100 that enables a team of phone agents to handle calls to and from customers.
  • the center 100 uses speech recognition systems 108a- 108n to automatically "transcribe" agent/customer conversations.
  • Call analysis software 122 analyzes the transcriptions generated by the speech recognition systems to identify different features of each call. For example, the software 122 can identify the topics discussed between an agent and a customer and can gauge how well the agent handled the call. The software 122 can also perform statistical analysis of these features to produce reports identifying trends and anomalies.
  • the system 100 enables call managers to gather important information from each dialog with a customer. For example, by constructing queries and reviewing statistical reports of the calls, a call manager can identify product or documentation weaknesses and agents needing additional training.
  • FIG. 1 shows call center stations 106a-106n (e.g., personal computers in a PBX (Private Branch Exchange)) receiving voice signals from both customer phones 102a-102n and agent headsets 104a-104n. Instead of acting as simple conduits
  • FIG. 1 shows call center stations 106a-106n (e.g., personal computers in a PBX (Private Branch Exchange)) receiving voice signals from both customer phones 102a-102n and agent headsets 104a-104n.
  • the stations 106a-106n record the acoustic signals of each call, for example, as PC ".wav" sound files.
  • Speech recognition systems 108a-108n such as NaturallySpeakingTM 4.0 from Dragon SystemsTM of Newton, Massachusetts, process the sound files to identify each call's lexical content (e.g., words, phrases, and other vocalizations such as "um” and "er”).
  • the speech recognition systems 108a-108n use trained speaker models (i.e., models tailored to the speech of a particular speaker) to improve recognition performance. For example, when a system 108a-108n can identify an agent (e.g., from the station used) or a customer (e.g., using caller ID or a product license number), the system 108a-108n may load a speech model previously trained for the identified speaker.
  • the stations 106a-106n send the acoustic signals 116 and the lexical content 118 of each call 114 to a server 110.
  • the server 110 stores this information in a database 112 for analysis and future retrieval.
  • the server 110 also may receive descriptive information 120 for each call, such as agent comments entered at the station, the time of day of the call, the identification of the agent handling the call, and the identification of the customer (e.g., the customer's name from caller ID or the customer's product license number).
  • the server 110 can request the descriptive information, for example, through an API (application programming interface) provided by the stations 106a-106n or by a centralized call switching system.
  • API application programming interface
  • a call manager's computer 124 provides a graphical user interface that enables the manager to construct and submit queries, view the response of the software
  • Another call center may have an architecture substantially different from that of the call center 100 shown in FIG. 1.
  • the server 110 could perform some or all of the speech recognition.
  • call analysis software 122 need not reside on the call server 110, but may instead reside on the client.
  • FIG. 2 shows a process 200 for analyzing a collection of calls such as calls collected at the call center shown in FIG. 1.
  • These techniques are not limited to the handling of call center conversations, but instead can be used to analyze recorded telephone conversations regardless of their origin.
  • the techniques can analyze financial conference calls, interviews (conducted, for example, by a remote medical advisor, a market researcher, or a journalist), 911 calls, and lawyer-client conversations.
  • the process 200 receives the acoustic signals of a call and the results of speech recognition (e.g., the lexical content).
  • Speech recognition can produce a list of identified terms (e.g., words and/or phrases), when the term was spoken (e.g., start and end time offsets into the sound file), and the speech recognition system's confidence 206 in the system's identification of the term.
  • the system may also list the speaker of each term.
  • a number of hardware and software techniques can be used to identify a speaker. For example, some call center stations provide one output for an agent's voice and another for a customer's voice. In such cases, identifying the speaker is a simple matter of identifying the output carrying the speech. In other configurations, such as those that only provide a single output with the combined voices of agent and customer, hardware and/or software can separate agent and customer voices. For example, a feed-forward loop can subtract the signal from the agent's headset microphone from the signal of the agent's and customer's voice combined, leaving only the signal of the customer's voice. In other embodiments, the speaker 208 of a term can be determined using software speaker identification techniques.
  • the process 200 can identify different call features (step 202). For example, the process 200 can score each call for the presence of any of a list of profane word spoken by the agent and/or customer. A number of other features are described below.
  • the process 200 adds the call features to the corpus (entire collection) of calls previously processed (step 204). Thereafter, the process 200 can receive user queries specifying Boolean or SQL (Structure Query Language) combinations of features (step 206) and can respond to these queries with matches or near matches (step 208). For example, a call manager may look for heated conversations caused by a customer's being on hold too long with an SQL query of "select * from CallFeatures where ((CustomerProfanity > 3) and (HoldDuration > 1:00))." To speed query responses, the process may construct an inverted index (not shown) listing features and the different calls having those features.
  • SQL Structure Query Language
  • the software may use more sophisticated techniques to rank query results.
  • the software may maintain statistics on the entire collection of calls. For example, the software may maintain the document frequency (df) of terms (e.g., the number of calls including a particular term).
  • df document frequency
  • a less evenly distributed word e.g., a term appearing in fewer calls
  • a less evenly distributed word may be more telling of call content. That is, the word "try” may appear in many calls, but the term "transducer” may appear in a handful of calls.
  • calls having query terms with lower Rvalues may provide a more telling indication of the call's subject matter and may be ranked higher than other calls listed in response to a query.
  • the software can also track the proximity of terms.
  • the process 200 may analyze call features using more sophisticated statistical approaches (step 210). This enables the software to generate reports (step 212) characterizing the distributions of calls and permits even more abstract queries (e.g., "find calls like this one").
  • FIG. 3 shows a process 300 for identifying different features of a call.
  • portions of a call may be analyzed to determine whether the portion corresponds to a question, answer, or hesitation (step 302).
  • the number of questions, answers, and/or hesitations spoken by an agent and/or customer can form a score or scores for analysis.
  • scores can help call center managers identify agents who may not be fully up to speed on a particular matter. For example, agents needing additional training may exhibit hesitation or ask more questions than other agents.
  • Speech may be categorized using analysis of acoustic signals and/or the corresponding lexical content. For example, analysis of the intonation
  • each utterance can indicate the type of utterance. That is, in
  • the process 300 can also determine a score for a call feature that measures the correspondence of the agent's speech with the provided script (step 304). That is, the process 300 can determine for each agent utterance, whether it follows the logical pattern of a previously specified script. For example, the system might determine how closely an agent followed a script, whether the agent repeated questions, backed up, or whether portions of the script were skipped in this call. Sophisticated systems might include scripts that fork and rejoin. The score may be adjusted to be more or less tolerant of deviations from the script.
  • the process 300 may determine a "readability" score for the agent's speech (step 306) to ensure agents do not overwhelm such callers with technical jargon.
  • readability formulas readability scores based on the measures such as the number of syllables per word, the number of words per sentence, and/or the number of letters per work.
  • the "Kincaid" score can be computed as: ⁇ [11.8*(syllables per word)]+[0.39*(words per sentence)] ⁇ .
  • Other scores include the Automated Readability Index, the Coleman-Liau score, the Flesch Index, and the Fog Index.
  • the process 300 may also determine other features such as the total speaking time by the agent and the customer (step 308). Similarly, the process 300 may determine the speaking rate (e.g., syllables per second) (step 310). These features may be used, for example, to identify agents spending too much time on some calls or hurrying through others. The process also may derive features from combinations of other features. For example, a "Bad Call" score may be determined by (Profanity Score / Duration of Call).
  • the process 300 may also identify features based on the number of occurrences of terms in a call (step 312). For example, the process 300 may count the number of times a product name is spoken during a call.
  • Categorization sorts calls into different predefined bins based on the features of the calls. For example, call categories can include "Regarding product X”, “Simple Broker Purchase or Sale”, “Request for literature”,
  • clustering does not make assumptions about call categories, but instead lets calls clump into groups by natural divisions in their feature values. Both clustering and categorization can use a "vector space model" to group calls.
  • FIG. 4 shows a very simple vector space 400 having three-dimensions 402,
  • Each dimension 402, 404, 406 represents a feature of a call.
  • the x-axis 402 measures the number of times a customer says "software"; the y-axis
  • each call whether ten-minutes or ten-seconds long, can be plotted as a single point (or vector) in the space 400 by merely counting up the number of times the selected words were spoken.
  • point 408 corresponds to a call where a customer said "the new microphone is not as good as the old microphone.” Since the word "microphone” was spoken twice and the words "install” and "software” were not spoken at all, the call has coordinates of (0,2,0).
  • FIG. 4 shows a three-dimensional vector space.
  • the vector space is not limited to three-dimensions, but can instead have w-dimensions where n is the number of different features of a call.
  • a call manager can control the number of dimensions, for example, by configuring the statistical analysis system to focus on certain features, words, or sets of words (e.g., profanity, product names, and/or words associated with common problems).
  • the n may be the number of different words in the English language.
  • stemming reduces the number of dimensions by truncating words to common roots. That is, “laughing”, “laughs”, and “laughter” may all truncate to "laugh", reducing the number of dimensions by three.
  • a "stop list” of common words such as articles and prepositions can also significantly reduce the number of dimensions representing call content.
  • synonym-sets can reduce dimensions by providing a single dimension for terms with similar meanings. For example, “headphones”, “headset”, or “mic” are all synonyms with “microphone.” Thus, a system can eliminate dimensions by counting appearance of "headphones", “headset”, “mic” as appearances of "microphone."
  • a term e.g., a word or a phrase
  • tf the term's frequency
  • the term frequency roughly gauges how salient a word is within a call. The higher the term frequency, the more likely it is that the term is a good description of the document content.
  • Term frequency is usually dampened by a function (e.g., ⁇ ) since
  • the term frequency statistic can reflect the confidences of the speech recognition system for each term to reflect uncertainty in identification during recognition. For example, instead of adding up the number of times a term appears in lexical content, a process can sum the speech recognition systems confidences in each term.
  • weighting can be improved using document frequency statistics. For example, / ⁇ (inverse document frequency) expressions, combine tf values of a call with ⁇ (document frequency) values. For example, the feature value for a word may be computed using:
  • Weight (1 + log (t/word)) log (NumDocs/ WO rd).
  • Such an expression embodies the notion that a sliding scale exists between term frequency within a document and the term's comparative rareness in a corpus.
  • Plotting calls in vector space enables quick mathematical comparison of the calls. For example, the angle formed by two "call" vectors is also a good estimate of topical similarity. That is, the smaller the angle the more similar the calls. Alternatively, the geometric distance between vector space points may provide an indication of topical similarity.
  • a call manager can request all calls resembling a specified call.
  • analysis software can plot the specified call and rank similar calls based on their distance from the specified call.
  • seed category points in the vector space, software can categorize calls based on their proximity to a particular seed. For example, different seeds may correspond to different products.
  • each group 500, 502 has a "centroid", C, 504, 506.
  • Each centroid 504, 506 is the "center of gravity” of its respective cluster. The centroid 504, 506 may not correspond to a particular call.
  • each group 500, 502 also has a medoid, a "prototypical" group member that is closest to the centroid.
  • a wide variety of clustering algorithms can partition the points into groups 500, 502.
  • the K-means clustering algorithm begins with an initial set of cluster points. Each point is assigned to the nearest cluster center. The algorithm then re-computes cluster centers by re-determining cluster centroids. Each point is then reassigned to the nearest cluster center and cluster centers are recomputed. Iterations can continue as long as each iteration improves some measure of cluster quality (e.g., average distance of cluster points to their cluster centroids).
  • clustering algorithms include "bottom-up” algorithms that form partitions by starting with individual points and grouping the most similar ones (e.g., those closest together) and “top-down” algorithms that form partitions by starting with all the points and dividing them into groups.
  • Many clustering algorithms may produce different numbers of clusters for different sets of points, depending on their distribution in the vector space. Tracking the number of clusters over time can provide valuable information to a call manager. For example, dissipation of a "microphone" problem cluster may indicate that a revision to a manual addressed the problem. Similarly, a "software installation” cluster may emerge when upgrades are distributed. The software can monitor the number of points in a cluster over time. When a new cluster appears, the software may automatically notify a manager, for example, by sending e-mail including an "audio bookmark" to the cluster's medoid call.
  • any call feature (e.g., one of those shown in FIG. 3) may be used as a hyperdimension axis.
  • a vector space may include a time-of-day feature. This may show that certain problems prompt calls during the workday while others prompt calls at night.
  • FIG. 6 shows processes 600, 610 that implement some of the capabilities described above.
  • process 600 may plot each call in vector space based on the respective call features (step 602).
  • the process 600 may, in turn, form clusters or categorize the calls based on their vector space coordinates (step 604). From the clusters and/or categorizations, the process 600 can generate a report (step 606) identifying call grouping properties, size, and development over time.
  • another process 610 can use the vector space representation of a collection of calls to provide a "query-by-example" capability.
  • the process may receive a description of a point in vector space (step 612), for example, by user specification of a particular call, and may then identify calls similar to the specified call (step 614).
  • Process 600 may provide a user interface that enables a call center manager to configure call analysis and to prepare and submit queries.
  • the user interface can enable a manager to identify different call categories and characteristics of these categories (e.g., a Boolean expression that is "True" when a call falls in a particular category or a vector space location corresponding to the category).
  • the user interface and analysis software may enable a manager to limit searches to calls belonging to a cluster or category or having a particular feature (e.g., only calls about product X handled by a particular agent).
  • the user interface may also present a ranked list of calls or categories corresponding to a query, generate statistical reports, permit navigation to individual calls, enable users to listen to individual calls, search for keywords within the calls, and customize the set of statistical reports
  • the described techniques may be applied to calls of any origin.
  • the techniques are not limited to any particular hardware or software configuration; they may find applicability in any computing or processing environment.
  • the techniques may be implemented in hardware or software, or a combination of the two.
  • the techniques are implemented in computer programs executing on programmable computers that each include a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and one or more output devices.
  • Program code is applied to data entered using the input device to perform the functions described and to generate output information.
  • the output information is applied to one or more output devices.
  • Each program is preferably implemented in a high level procedural or object oriented programming language to communicate with a computer system.
  • the programs can be implemented in assembly or machine language, if desired.
  • the language may be a compiled or interpreted language.
  • Each such computer program is preferable stored on a storage medium or device (e.g., CD-ROM, hard disk or magnetic diskette) that is readable by a general or special purpose programmable computer for configuring and operating the computer when the storage medium or device is read by the computer to perform the procedures described in this document.
  • a storage medium or device e.g., CD-ROM, hard disk or magnetic diskette
  • the system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner.

Abstract

A method of analyzing a collection of calls at one or more call center stations. The method includes receiving lexical content of a telephone call handled by a call center agent and identifying one or more features of the telephone call based on the received lexical content. The method also includes collectively analyzing the stored features along with the stored features of other telephone calls and reporting results of the analyzing.

Description

CALL ANALYSIS
Reference to Related Application This application relates to and is a continuation-in-part of co-pending U.S. Application No. 09/052,900, titled "INTERACTIVE SEARCHING," which is incorporated by reference.
Background
This invention relates to speech recognition.
Many businesses and organizations provide call centers to handle phone calls with customers. Typically, call centers employ multiple agents to handle technical support calls, customer orders, and so forth. Call centers often provide scripts and other techniques to ensure that calls are handled consistently and in the manner desired by the organization. Some organizations record telephone conversations between agents and customers to monitor customer service quality, for legal purposes, and for other reasons. Sometimes, organizations also record calls within an organization such as one call center agent asking a question of another agent.
Buried within the collection of recorded calls from a call center are customer comments, suggestions, and other information of interest in making decisions regarding marketing, technical support, engineering, call center management, and other issues. In an attempt to harvest information from this direct contact with customers, many centers instruct agents to ask specific questions of customers and to log their responses into a database. Summary
In general, in one aspect, the invention features a method of analyzing a collection of calls at one or more call center stations. The method includes receiving lexical content of a telephone call handled by a call center agent, the lexical content being identified by a speech recognition system and identifying one or more features of the telephone call based on the received lexical content. The method also includes storing the one or more identified features along with one or more identified features of another telephone call, collectively analyzing the stored features of the telephone calls, and reporting results of the analyzing.
Embodiments may include one or more of the following features. The method may include receiving acoustic data signals corresponding to the telephone call, and performing speech recognition on the received acoustic data to determine the lexical content of the call. The method may include receiving descriptive information for a call such as the call duration, call time, caller identification, and agent identification. Identifying features may be performed based on the descriptive information.
One or the features may include a term frequency feature, a readability feature, a script-adherence feature, and/or feature classifying utterances (e.g., classifying an utterance as at least one of the following: a question, an answer, and a hesitation).
The method may further include receiving identification of a speaker of identified lexical content. The identification may be determined. The features may include a feature measuring agent speaking time, a feature measuring caller speaking time.
The analysis may include representing at least some of the calls in a vector space model. The analysis may further include determining clusters of calls in the vector space model, for example, using k-means clustering. The analysis may further include tracking clusters of calls over time (e.g., identifying new clusters and/or identifying changes in a cluster). The analysis may further include using the vector space model to identify calls similar to a call having specified properties, for example, to identify calls similar to a specified call. The analyzing may include receiving an ad-hoc query (e.g., a Boolean query) and ranking calls based on the query. Such a ranking may include determining the term frequency of terms in call and/or determining the term frequency of terms in a corpus of calls and using an inverse document frequency statistic.
The collectively analyzing may include using a natural language processing technique. The method may include storing audio signal data for at least some of the calls for subsequent playback. The collectively analyzing may include identifying call topics handled by call center agents and/or determining the performance of call center agents.
In general, in another aspect, the invention features software disposed on a computer readable medium, for use at a call center having one or more agents handling calls at one or more call center stations. The software includes instructions for causing a processor to receive lexical content of a telephone call handled by a call center agent, the lexical content being identified by a speech recognition system, identify one or more features of the telephone call based on the received lexical content, store the identified features along with the identified features of other telephone calls, collectively analyze the features of telephone calls, and report the analysis.
Other features and advantages of the invention will be apparent from the following description, including the drawings, and the claims.
Brief Description of the Drawings
FIG. 1 is a diagram of a call center that uses speech recognition to identify terms spoken during calls between agents and customers. FIG. 2 is a flowchart of a process for identifying call features and using the identified features to generate reports and to respond to queries.
FIG. 3 is a flowchart of a process for identifying call features.
FIG. 4 is a diagram of a vector space having call features as dimensions.
FIG. 5 is a diagram of clusters in vector space.
FIG. 6 is a flowchart of a process for using a vector space representation of calls to produce reports and respond to queries.
Detailed Description
FIG. 1 shows an example of a call center 100 that enables a team of phone agents to handle calls to and from customers. The center 100 uses speech recognition systems 108a- 108n to automatically "transcribe" agent/customer conversations. Call analysis software 122 analyzes the transcriptions generated by the speech recognition systems to identify different features of each call. For example, the software 122 can identify the topics discussed between an agent and a customer and can gauge how well the agent handled the call. The software 122 can also perform statistical analysis of these features to produce reports identifying trends and anomalies. The system 100 enables call managers to gather important information from each dialog with a customer. For example, by constructing queries and reviewing statistical reports of the calls, a call manager can identify product or documentation weaknesses and agents needing additional training.
Sample Architecture
In greater detail, FIG. 1 shows call center stations 106a-106n (e.g., personal computers in a PBX (Private Branch Exchange)) receiving voice signals from both customer phones 102a-102n and agent headsets 104a-104n. Instead of acting as simple conduits In greater detail, FIG. 1 shows call center stations 106a-106n (e.g., personal computers in a PBX (Private Branch Exchange)) receiving voice signals from both customer phones 102a-102n and agent headsets 104a-104n. Instead of acting as simple conduits between agents and customers, the stations 106a-106n record the acoustic signals of each call, for example, as PC ".wav" sound files. Speech recognition systems 108a-108n, such as NaturallySpeaking™ 4.0 from Dragon Systems™ of Newton, Massachusetts, process the sound files to identify each call's lexical content (e.g., words, phrases, and other vocalizations such as "um" and "er"). When possible, the speech recognition systems 108a-108n use trained speaker models (i.e., models tailored to the speech of a particular speaker) to improve recognition performance. For example, when a system 108a-108n can identify an agent (e.g., from the station used) or a customer (e.g., using caller ID or a product license number), the system 108a-108n may load a speech model previously trained for the identified speaker.
The stations 106a-106n send the acoustic signals 116 and the lexical content 118 of each call 114 to a server 110. The server 110 stores this information in a database 112 for analysis and future retrieval. The server 110 also may receive descriptive information 120 for each call, such as agent comments entered at the station, the time of day of the call, the identification of the agent handling the call, and the identification of the customer (e.g., the customer's name from caller ID or the customer's product license number). The server 110 can request the descriptive information, for example, through an API (application programming interface) provided by the stations 106a-106n or by a centralized call switching system.
As shown, a call manager's computer 124 provides a graphical user interface that enables the manager to construct and submit queries, view the response of the software
122 to such queries, and view other reports generated by the software 122. Another call center may have an architecture substantially different from that of the call center 100 shown in FIG. 1. For example, instead of distributing speech recognition systems 106a-106n over the call center stations 106a-106n, the server 110 could perform some or all of the speech recognition. Additionally, call analysis software 122 need not reside on the call server 110, but may instead reside on the client.
Call Processing
FIG. 2 shows a process 200 for analyzing a collection of calls such as calls collected at the call center shown in FIG. 1. These techniques are not limited to the handling of call center conversations, but instead can be used to analyze recorded telephone conversations regardless of their origin. For example, the techniques can analyze financial conference calls, interviews (conducted, for example, by a remote medical advisor, a market researcher, or a journalist), 911 calls, and lawyer-client conversations.
As shown, the process 200 receives the acoustic signals of a call and the results of speech recognition (e.g., the lexical content). Speech recognition can produce a list of identified terms (e.g., words and/or phrases), when the term was spoken (e.g., start and end time offsets into the sound file), and the speech recognition system's confidence 206 in the system's identification of the term. The system may also list the speaker of each term.
A number of hardware and software techniques can be used to identify a speaker. For example, some call center stations provide one output for an agent's voice and another for a customer's voice. In such cases, identifying the speaker is a simple matter of identifying the output carrying the speech. In other configurations, such as those that only provide a single output with the combined voices of agent and customer, hardware and/or software can separate agent and customer voices. For example, a feed-forward loop can subtract the signal from the agent's headset microphone from the signal of the agent's and customer's voice combined, leaving only the signal of the customer's voice. In other embodiments, the speaker 208 of a term can be determined using software speaker identification techniques. From the acoustic signals and lexical content, the process 200 can identify different call features (step 202). For example, the process 200 can score each call for the presence of any of a list of profane word spoken by the agent and/or customer. A number of other features are described below.
After determining features, the process 200 adds the call features to the corpus (entire collection) of calls previously processed (step 204). Thereafter, the process 200 can receive user queries specifying Boolean or SQL (Structure Query Language) combinations of features (step 206) and can respond to these queries with matches or near matches (step 208). For example, a call manager may look for heated conversations caused by a customer's being on hold too long with an SQL query of "select * from CallFeatures where ((CustomerProfanity > 3) and (HoldDuration > 1:00))." To speed query responses, the process may construct an inverted index (not shown) listing features and the different calls having those features.
Many times ad-hoc queries return either too few or too many calls. Thus, software may use more sophisticated techniques to rank query results. To this end, the software may maintain statistics on the entire collection of calls. For example, the software may maintain the document frequency (df) of terms (e.g., the number of calls including a particular term). A less evenly distributed word (e.g., a term appearing in fewer calls) may be more telling of call content. That is, the word "try" may appear in many calls, but the term "transducer" may appear in a handful of calls. Thus, calls having query terms with lower Rvalues may provide a more telling indication of the call's subject matter and may be ranked higher than other calls listed in response to a query. The software can also track the proximity of terms. That is, some collections of terms have flexible but significant relationships. For example, "knock" and "door" often appear close to one another, but not necessarily one right after the other. The software can track the mean (μ) number of terms separating "door" and "knock" along with a standard deviation (σ). Calls having these terms separated by the mean number of words plus or minus a standard deviation are likely to correspond to a query for those terms and may be ranked more highly in a list of calls provided in response to a query. Thus, a query for "knock door" may return a list of calls where calls having the phrase "knock on the door" may be ranked more highly than "a knock indicates that the hotel maid is at your door".
In addition to Boolean, SQL, and other ad-hoc queries, the process 200 may analyze call features using more sophisticated statistical approaches (step 210). This enables the software to generate reports (step 212) characterizing the distributions of calls and permits even more abstract queries (e.g., "find calls like this one").
FIG. 3 shows a process 300 for identifying different features of a call. As shown, portions of a call may be analyzed to determine whether the portion corresponds to a question, answer, or hesitation (step 302). The number of questions, answers, and/or hesitations spoken by an agent and/or customer can form a score or scores for analysis. Such scores can help call center managers identify agents who may not be fully up to speed on a particular matter. For example, agents needing additional training may exhibit hesitation or ask more questions than other agents. Speech may be categorized using analysis of acoustic signals and/or the corresponding lexical content. For example, analysis of the intonation
(e.g., fundamental frequency) of each utterance can indicate the type of utterance. That is, in
English, questions tend to end with a rising intonation, statements tend to end with falling intonations, and hesitations tend toward a monotone. Analysis of the lexical content of the call may also be used to classify call portions. For example, most questions begin with a limited number of characteristic terms. That is, many questions begin with "are", "why", or "how," while phrases such as "hold on" or vocalizations such as "um" and "er" characterize hesitations.
The process 300 can also determine a score for a call feature that measures the correspondence of the agent's speech with the provided script (step 304). That is, the process 300 can determine for each agent utterance, whether it follows the logical pattern of a previously specified script. For example, the system might determine how closely an agent followed a script, whether the agent repeated questions, backed up, or whether portions of the script were skipped in this call. Sophisticated systems might include scripts that fork and rejoin. The score may be adjusted to be more or less tolerant of deviations from the script.
Since call centers such as technical support lines often receive calls from befuddled consumers, the process 300 may determine a "readability" score for the agent's speech (step 306) to ensure agents do not overwhelm such callers with technical jargon. Typically, readability formulas readability scores based on the measures such as the number of syllables per word, the number of words per sentence, and/or the number of letters per work. For example, the "Kincaid" score can be computed as: {[11.8*(syllables per word)]+[0.39*(words per sentence)]}. Other scores include the Automated Readability Index, the Coleman-Liau score, the Flesch Index, and the Fog Index.
The process 300 may also determine other features such as the total speaking time by the agent and the customer (step 308). Similarly, the process 300 may determine the speaking rate (e.g., syllables per second) (step 310). These features may be used, for example, to identify agents spending too much time on some calls or hurrying through others. The process also may derive features from combinations of other features. For example, a "Bad Call" score may be determined by (Profanity Score / Duration of Call).
The process 300 may also identify features based on the number of occurrences of terms in a call (step 312). For example, the process 300 may count the number of times a product name is spoken during a call.
Call Clustering
Any of the features described above may be the basis of an ad-hoc query or other statistical analysis such as categorization and/or clustering. Categorization sorts calls into different predefined bins based on the features of the calls. For example, call categories can include "Regarding product X", "Simple Broker Purchase or Sale", "Request for literature",
"Machine misconfigured", and "Customer Unhappy." By contrast, clustering does not make assumptions about call categories, but instead lets calls clump into groups by natural divisions in their feature values. Both clustering and categorization can use a "vector space model" to group calls.
FIG. 4 shows a very simple vector space 400 having three-dimensions 402,
404, 406. Each dimension 402, 404, 406 represents a feature of a call. For example, as shown, the x-axis 402 measures the number of times a customer says "software"; the y-axis
406 measures the number of times the customer says "microphone"; and the z-axis 404 measures the number of times a customer says "install." Using these features as coordinate system 400 dimensions, 402, 404, 406, each call, whether ten-minutes or ten-seconds long, can be plotted as a single point (or vector) in the space 400 by merely counting up the number of times the selected words were spoken. For example, point 408 corresponds to a call where a customer said "the new microphone is not as good as the old microphone." Since the word "microphone" was spoken twice and the words "install" and "software" were not spoken at all, the call has coordinates of (0,2,0).
FIG. 4 shows a three-dimensional vector space. Although difficult to imagine, the vector space is not limited to three-dimensions, but can instead have w-dimensions where n is the number of different features of a call. A call manager can control the number of dimensions, for example, by configuring the statistical analysis system to focus on certain features, words, or sets of words (e.g., profanity, product names, and/or words associated with common problems).
In other implementations, the n may be the number of different words in the English language. A variety of techniques can reduce the large number of dimensions without greatly affecting the call's content. For example, stemming reduces the number of dimensions by truncating words to common roots. That is, "laughing", "laughs", and "laughter" may all truncate to "laugh", reducing the number of dimensions by three. A "stop list" of common words such as articles and prepositions can also significantly reduce the number of dimensions representing call content. Additionally, synonym-sets can reduce dimensions by providing a single dimension for terms with similar meanings. For example, "headphones", "headset", or "mic" are all synonyms with "microphone." Thus, a system can eliminate dimensions by counting appearance of "headphones", "headset", "mic" as appearances of "microphone".
The description, thus far, used the number of times a term (e.g., a word or a phrase) was spoken in a call as the value of that term's feature. This measure is known as a term's frequency (tf). The term frequency roughly gauges how salient a word is within a call. The higher the term frequency, the more likely it is that the term is a good description of the document content. Term frequency is usually dampened by a function (e.g., ψ ) since
occurrence indicates a higher importance, but not as important as a strict count may imply. Additionally, the term frequency statistic can reflect the confidences of the speech recognition system for each term to reflect uncertainty in identification during recognition. For example, instead of adding up the number of times a term appears in lexical content, a process can sum the speech recognition systems confidences in each term.
Quantification of term features ("weighting") can be improved using document frequency statistics. For example, /^(inverse document frequency) expressions, combine tf values of a call with ^(document frequency) values. For example, the feature value for a word may be computed using:
Weight = (1 + log (t/word)) log (NumDocs/ WOrd).
Such an expression embodies the notion that a sliding scale exists between term frequency within a document and the term's comparative rareness in a corpus.
Plotting calls in vector space enables quick mathematical comparison of the calls. For example, the angle formed by two "call" vectors is also a good estimate of topical similarity. That is, the smaller the angle the more similar the calls. Alternatively, the geometric distance between vector space points may provide an indication of topical similarity.
These simple quantifications of similarity can ease call retrieval and provide insight into call content. For example, instead of constructing a query, a call manager can request all calls resembling a specified call. In response, analysis software can plot the specified call and rank similar calls based on their distance from the specified call. Alternatively, by providing "seed category" points in the vector space, software can categorize calls based on their proximity to a particular seed. For example, different seeds may correspond to different products.
As shown in FIG. 5, over time, call "points" populate the vector space. By visual examination, these points seem to form groups 500, 502 of related calls. That is, group 500 seems to correspond to calls discussing microphone problems, while group 505 seems to correspond to calls discussing software installation problems. As shown, each group 500, 502 has a "centroid", C, 504, 506. Each centroid 504, 506 is the "center of gravity" of its respective cluster. The centroid 504, 506 may not correspond to a particular call. However, each group 500, 502 also has a medoid, a "prototypical" group member that is closest to the centroid.
A wide variety of clustering algorithms can partition the points into groups 500, 502. For example, the K-means clustering algorithm begins with an initial set of cluster points. Each point is assigned to the nearest cluster center. The algorithm then re-computes cluster centers by re-determining cluster centroids. Each point is then reassigned to the nearest cluster center and cluster centers are recomputed. Iterations can continue as long as each iteration improves some measure of cluster quality (e.g., average distance of cluster points to their cluster centroids).
More generally, clustering algorithms include "bottom-up" algorithms that form partitions by starting with individual points and grouping the most similar ones (e.g., those closest together) and "top-down" algorithms that form partitions by starting with all the points and dividing them into groups. Many clustering algorithms may produce different numbers of clusters for different sets of points, depending on their distribution in the vector space. Tracking the number of clusters over time can provide valuable information to a call manager. For example, dissipation of a "microphone" problem cluster may indicate that a revision to a manual addressed the problem. Similarly, a "software installation" cluster may emerge when upgrades are distributed. The software can monitor the number of points in a cluster over time. When a new cluster appears, the software may automatically notify a manager, for example, by sending e-mail including an "audio bookmark" to the cluster's medoid call.
Though the running example in FIGs. 4 and 5 used terms such as vector space dimensions, any call feature (e.g., one of those shown in FIG. 3) may be used as a hyperdimension axis. For example, in addition to term frequencies, a vector space may include a time-of-day feature. This may show that certain problems prompt calls during the workday while others prompt calls at night.
FIG. 6 shows processes 600, 610 that implement some of the capabilities described above. For example, process 600 may plot each call in vector space based on the respective call features (step 602). The process 600 may, in turn, form clusters or categorize the calls based on their vector space coordinates (step 604). From the clusters and/or categorizations, the process 600 can generate a report (step 606) identifying call grouping properties, size, and development over time. As shown, another process 610 can use the vector space representation of a collection of calls to provide a "query-by-example" capability. For example, the process may receive a description of a point in vector space (step 612), for example, by user specification of a particular call, and may then identify calls similar to the specified call (step 614).
Process 600 may provide a user interface that enables a call center manager to configure call analysis and to prepare and submit queries. For example, the user interface can enable a manager to identify different call categories and characteristics of these categories (e.g., a Boolean expression that is "True" when a call falls in a particular category or a vector space location corresponding to the category). The user interface and analysis software may enable a manager to limit searches to calls belonging to a cluster or category or having a particular feature (e.g., only calls about product X handled by a particular agent). The user interface may also present a ranked list of calls or categories corresponding to a query, generate statistical reports, permit navigation to individual calls, enable users to listen to individual calls, search for keywords within the calls, and customize the set of statistical reports
Embodiments
Though this application described conversations between agents and customers at a call center, the described techniques may be applied to calls of any origin. The techniques are not limited to any particular hardware or software configuration; they may find applicability in any computing or processing environment. The techniques may be implemented in hardware or software, or a combination of the two. Preferably, the techniques are implemented in computer programs executing on programmable computers that each include a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and one or more output devices. Program code is applied to data entered using the input device to perform the functions described and to generate output information. The output information is applied to one or more output devices.
Each program is preferably implemented in a high level procedural or object oriented programming language to communicate with a computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language.
Each such computer program is preferable stored on a storage medium or device (e.g., CD-ROM, hard disk or magnetic diskette) that is readable by a general or special purpose programmable computer for configuring and operating the computer when the storage medium or device is read by the computer to perform the procedures described in this document. The system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner.
Other embodiments are within the scope of the following claims.

Claims

What is claimed is: 1. A method of analyzing a collection of calls at one or more call center stations, the method comprising: receiving lexical content of a telephone call handled by a call center agent, the lexical content being identified by a speech recognition system; identifying one or more features of the telephone call based on the received lexical content; storing the one or more identified features along with one or more identified features of another telephone call; collectively analyzing the stored features of the telephone calls; and reporting results of the analyzing.
2. The method of claim 1, further comprising receiving acoustic data signals corresponding to the telephone call, and performing speech recognition on the received acoustic data to determine the lexical content of the call.
3. The method of claim 1, further comprising receiving descriptive information for a call.
4. The method of claim 3, wherein the descriptive information comprises at least one of the following: call duration, call time, caller identification, and agent identification.
5. The method of claim 3, wherein identifying features comprises identifying features based on the descriptive information.
6. The method of claim 1, wherein lexical content comprises words.
7. The method of claim 1, wherein one of the one or more features comprises at least one term frequency feature.
8. The method of claim 1, wherein one of the one or more features comprises a readability feature.
9. The method of claim 1, wherein one of the one or more features comprises a feature classifying utterances.
10. The method of claim 9, wherein classifying utterances comprises classifying an utterance as at least one of the following: a question, an answer, and a hesitation.
11. The method of claim 1, wherein one of the one or more features comprises a feature representing the agent's adherence to a script.
12. The method of claim 1, further comprising receiving identification of a speaker of identified lexical content.
13. The method of claim 12, further comprising identifying a speaker of identified lexical content.
14. The method of claim 12, wherein one of the one or more features comprises a feature measuring agent speaking time.
15. The method of claim 12, wherein one of the one or more features comprises a feature measuring caller speaking time.
16. The method of claim 1, wherein analysis comprises representing at least some of the calls in a vector space model.
17. The method of claim 16, further comprising determining clusters of calls in the vector space model.
18. The method of claim 16, wherein determining clusters comprises k-means clustering.
19. The method of claim 16, further comprising tracking clusters of calls over time.
20. The method of claim 19, wherein tracking comprises identifying new clusters.
21. The method of claim 19, wherein tracking comprises identifying changes in a cluster.
22. The method of claim 16, further comprising using the vector space model to identify calls similar to a call having specified properties.
23. The method of claim 16, further comprising using the vector space model to identify calls similar to a specified call.
24. The method of claim 1, wherein collectively analyzing comprises receiving an ad-hoc query and ranking calls based on the query.
25. The method of claim 24, wherein the query comprises a boolean query.
26. The method of claim 24, wherein ranking comprises determining the term frequency of terms in call.
27. The method of claim 26, wherein ranking comprises determining the term frequency of terms in a corpus of calls and using an inverse document frequency statistic.
28. The method of claim 1, wherein collectively analyzing comprises analyzing using a natural language processing technique.
29. The method of claim 1 , further comprising storing audio signal data for at least some of the calls.
30. The method of claim 29, wherein reporting comprises providing the audio signal data for playback.
31. The method of claim 1 , wherein collectively analyzing comprises identifying call topics handled by call center agents.
32. The method of claim 1, wherein collectively analyzing comprises determining the performance of call center agents.
33. Software disposed on a computer readable medium, for use at a call center having one or more agents handling calls at one or more call center stations, the software including instructions for causing a processor to: receive lexical content of a telephone call handled by a call center agent, the lexical content being identified by a speech recognition system; identify one or more features of the telephone call based on the received lexical content; store the identified features along with the identified features of other telephone calls; collectively analyze the features of telephone calls; and report the analysis.
PCT/US2001/009131 2000-03-24 2001-03-23 Lexical analysis of telephone conversations with call center agents WO2001074042A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP01918909A EP1273159A2 (en) 2000-03-24 2001-03-23 Lexical analysis of telephone conversations with call center agents
AU2001245927A AU2001245927A1 (en) 2000-03-24 2001-03-23 Lexical analysis of telephone conversations with call center agents

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US53515500A 2000-03-24 2000-03-24
US09/535,155 2000-03-24

Publications (2)

Publication Number Publication Date
WO2001074042A2 true WO2001074042A2 (en) 2001-10-04
WO2001074042A3 WO2001074042A3 (en) 2002-09-19

Family

ID=24133067

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/009131 WO2001074042A2 (en) 2000-03-24 2001-03-23 Lexical analysis of telephone conversations with call center agents

Country Status (3)

Country Link
EP (1) EP1273159A2 (en)
AU (1) AU2001245927A1 (en)
WO (1) WO2001074042A2 (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003009173A2 (en) * 2001-07-18 2003-01-30 Sap Aktiengesellschaft Information retrieval using enhanced document vectors
WO2004042700A1 (en) * 2002-10-31 2004-05-21 Worldcom, Inc. Interactive voice response system utility
WO2004072780A2 (en) 2003-02-05 2004-08-26 Verint Systems, Inc. Method for automatic and semi-automatic classification and clustering of non-deterministic texts
EP1556854A2 (en) * 2002-09-27 2005-07-27 Callminer, Inc. Software for statistical analysis of speech
WO2006124945A1 (en) * 2005-05-18 2006-11-23 Eloyalty Corporation A method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
WO2006124942A1 (en) * 2005-05-18 2006-11-23 Eloyalty Corporation A method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US7864945B2 (en) 2004-11-30 2011-01-04 Aspect Software, Inc. Method for detecting availability of non-traditional agent
US7869586B2 (en) 2007-03-30 2011-01-11 Eloyalty Corporation Method and system for aggregating and analyzing data relating to a plurality of interactions between a customer and a contact center and generating business process analytics
US8023639B2 (en) 2007-03-30 2011-09-20 Mattersight Corporation Method and system determining the complexity of a telephonic communication received by a contact center
US8718262B2 (en) 2007-03-30 2014-05-06 Mattersight Corporation Method and system for automatically routing a telephonic communication base on analytic attributes associated with prior telephonic communication
US8731934B2 (en) 2007-02-15 2014-05-20 Dsi-Iti, Llc System and method for multi-modal audio mining of telephone conversations
US9083801B2 (en) 2013-03-14 2015-07-14 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data
US9225841B2 (en) 2005-05-18 2015-12-29 Mattersight Corporation Method and system for selecting and navigating to call examples for playback or analysis
US9413891B2 (en) 2014-01-08 2016-08-09 Callminer, Inc. Real-time conversational analytics facility
US9923936B2 (en) 2016-04-07 2018-03-20 Global Tel*Link Corporation System and method for third party monitoring of voice and video calls
US10027797B1 (en) 2017-05-10 2018-07-17 Global Tel*Link Corporation Alarm control for inmate call monitoring
US10225396B2 (en) 2017-05-18 2019-03-05 Global Tel*Link Corporation Third party monitoring of a activity within a monitoring platform
US10419611B2 (en) 2007-09-28 2019-09-17 Mattersight Corporation System and methods for determining trends in electronic communications
US10572961B2 (en) 2016-03-15 2020-02-25 Global Tel*Link Corporation Detection and prevention of inmate to inmate message relay
US10740380B2 (en) 2018-05-24 2020-08-11 International Business Machines Corporation Incremental discovery of salient topics during customer interaction
US10860786B2 (en) 2017-06-01 2020-12-08 Global Tel*Link Corporation System and method for analyzing and investigating communication data from a controlled environment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0981237A2 (en) * 1998-08-17 2000-02-23 Siemens Information and Communication Networks, Inc. Apparatus and method for automated voice analysis in ACD silent call monitoring
EP1096767A2 (en) * 1999-10-26 2001-05-02 Rockwell Electronic Commerce Corporation System and method for automatically detecting problematic calls

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0981237A2 (en) * 1998-08-17 2000-02-23 Siemens Information and Communication Networks, Inc. Apparatus and method for automated voice analysis in ACD silent call monitoring
EP1096767A2 (en) * 1999-10-26 2001-05-02 Rockwell Electronic Commerce Corporation System and method for automatically detecting problematic calls

Cited By (63)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003009173A3 (en) * 2001-07-18 2003-12-18 Sap Ag Information retrieval using enhanced document vectors
WO2003009173A2 (en) * 2001-07-18 2003-01-30 Sap Aktiengesellschaft Information retrieval using enhanced document vectors
US8583434B2 (en) 2002-09-27 2013-11-12 Callminer, Inc. Methods for statistical analysis of speech
EP1556854A2 (en) * 2002-09-27 2005-07-27 Callminer, Inc. Software for statistical analysis of speech
EP1556854A4 (en) * 2002-09-27 2007-08-15 Callminer Inc Software for statistical analysis of speech
US7346509B2 (en) 2002-09-27 2008-03-18 Callminer, Inc. Software for statistical analysis of speech
WO2004042700A1 (en) * 2002-10-31 2004-05-21 Worldcom, Inc. Interactive voice response system utility
US8666747B2 (en) 2002-10-31 2014-03-04 Verizon Business Global Llc Providing information regarding interactive voice response sessions
WO2004072780A2 (en) 2003-02-05 2004-08-26 Verint Systems, Inc. Method for automatic and semi-automatic classification and clustering of non-deterministic texts
WO2004072780A3 (en) * 2003-02-05 2004-11-11 Verint Systems Inc Method for automatic and semi-automatic classification and clustering of non-deterministic texts
US7864945B2 (en) 2004-11-30 2011-01-04 Aspect Software, Inc. Method for detecting availability of non-traditional agent
US8094803B2 (en) 2005-05-18 2012-01-10 Mattersight Corporation Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US10104233B2 (en) 2005-05-18 2018-10-16 Mattersight Corporation Coaching portal and methods based on behavioral assessment data
US9571650B2 (en) 2005-05-18 2017-02-14 Mattersight Corporation Method and system for generating a responsive communication based on behavioral assessment data
US9692894B2 (en) 2005-05-18 2017-06-27 Mattersight Corporation Customer satisfaction system and method based on behavioral assessment data
US9432511B2 (en) 2005-05-18 2016-08-30 Mattersight Corporation Method and system of searching for communications for playback or analysis
WO2006124942A1 (en) * 2005-05-18 2006-11-23 Eloyalty Corporation A method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US10129402B1 (en) 2005-05-18 2018-11-13 Mattersight Corporation Customer satisfaction analysis of caller interaction event data system and methods
WO2006124945A1 (en) * 2005-05-18 2006-11-23 Eloyalty Corporation A method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US10021248B2 (en) 2005-05-18 2018-07-10 Mattersight Corporation Method and system for analyzing caller interaction event data
US7995717B2 (en) 2005-05-18 2011-08-09 Mattersight Corporation Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US9225841B2 (en) 2005-05-18 2015-12-29 Mattersight Corporation Method and system for selecting and navigating to call examples for playback or analysis
US9357071B2 (en) 2005-05-18 2016-05-31 Mattersight Corporation Method and system for analyzing a communication by applying a behavioral model thereto
US10120919B2 (en) 2007-02-15 2018-11-06 Global Tel*Link Corporation System and method for multi-modal audio mining of telephone conversations
US8731934B2 (en) 2007-02-15 2014-05-20 Dsi-Iti, Llc System and method for multi-modal audio mining of telephone conversations
US11789966B2 (en) 2007-02-15 2023-10-17 Global Tel*Link Corporation System and method for multi-modal audio mining of telephone conversations
US9552417B2 (en) 2007-02-15 2017-01-24 Global Tel*Link Corp. System and method for multi-modal audio mining of telephone conversations
US10853384B2 (en) 2007-02-15 2020-12-01 Global Tel*Link Corporation System and method for multi-modal audio mining of telephone conversations
US9699307B2 (en) 2007-03-30 2017-07-04 Mattersight Corporation Method and system for automatically routing a telephonic communication
US7869586B2 (en) 2007-03-30 2011-01-11 Eloyalty Corporation Method and system for aggregating and analyzing data relating to a plurality of interactions between a customer and a contact center and generating business process analytics
US8023639B2 (en) 2007-03-30 2011-09-20 Mattersight Corporation Method and system determining the complexity of a telephonic communication received by a contact center
US10129394B2 (en) 2007-03-30 2018-11-13 Mattersight Corporation Telephonic communication routing system based on customer satisfaction
US8718262B2 (en) 2007-03-30 2014-05-06 Mattersight Corporation Method and system for automatically routing a telephonic communication base on analytic attributes associated with prior telephonic communication
US10419611B2 (en) 2007-09-28 2019-09-17 Mattersight Corporation System and methods for determining trends in electronic communications
US10601994B2 (en) 2007-09-28 2020-03-24 Mattersight Corporation Methods and systems for determining and displaying business relevance of telephonic communications between customers and a contact center
US9942400B2 (en) 2013-03-14 2018-04-10 Mattersight Corporation System and methods for analyzing multichannel communications including voice data
US9407768B2 (en) 2013-03-14 2016-08-02 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data
US9191510B2 (en) 2013-03-14 2015-11-17 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data
US9083801B2 (en) 2013-03-14 2015-07-14 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data
US10194029B2 (en) 2013-03-14 2019-01-29 Mattersight Corporation System and methods for analyzing online forum language
US9667788B2 (en) 2013-03-14 2017-05-30 Mattersight Corporation Responsive communication system for analyzed multichannel electronic communication
US10582056B2 (en) 2014-01-08 2020-03-03 Callminer, Inc. Communication channel customer journey
US10992807B2 (en) 2014-01-08 2021-04-27 Callminer, Inc. System and method for searching content using acoustic characteristics
US9413891B2 (en) 2014-01-08 2016-08-09 Callminer, Inc. Real-time conversational analytics facility
US10601992B2 (en) 2014-01-08 2020-03-24 Callminer, Inc. Contact center agent coaching tool
US11277516B2 (en) 2014-01-08 2022-03-15 Callminer, Inc. System and method for AB testing based on communication content
US10313520B2 (en) 2014-01-08 2019-06-04 Callminer, Inc. Real-time compliance monitoring facility
US10645224B2 (en) 2014-01-08 2020-05-05 Callminer, Inc. System and method of categorizing communications
US10572961B2 (en) 2016-03-15 2020-02-25 Global Tel*Link Corporation Detection and prevention of inmate to inmate message relay
US11640644B2 (en) 2016-03-15 2023-05-02 Global Tel* Link Corporation Detection and prevention of inmate to inmate message relay
US11238553B2 (en) 2016-03-15 2022-02-01 Global Tel*Link Corporation Detection and prevention of inmate to inmate message relay
US11271976B2 (en) 2016-04-07 2022-03-08 Global Tel*Link Corporation System and method for third party monitoring of voice and video calls
US9923936B2 (en) 2016-04-07 2018-03-20 Global Tel*Link Corporation System and method for third party monitoring of voice and video calls
US10715565B2 (en) 2016-04-07 2020-07-14 Global Tel*Link Corporation System and method for third party monitoring of voice and video calls
US10277640B2 (en) 2016-04-07 2019-04-30 Global Tel*Link Corporation System and method for third party monitoring of voice and video calls
US10027797B1 (en) 2017-05-10 2018-07-17 Global Tel*Link Corporation Alarm control for inmate call monitoring
US11044361B2 (en) 2017-05-18 2021-06-22 Global Tel*Link Corporation Third party monitoring of activity within a monitoring platform
US10601982B2 (en) 2017-05-18 2020-03-24 Global Tel*Link Corporation Third party monitoring of activity within a monitoring platform
US11563845B2 (en) 2017-05-18 2023-01-24 Global Tel*Link Corporation Third party monitoring of activity within a monitoring platform
US10225396B2 (en) 2017-05-18 2019-03-05 Global Tel*Link Corporation Third party monitoring of a activity within a monitoring platform
US10860786B2 (en) 2017-06-01 2020-12-08 Global Tel*Link Corporation System and method for analyzing and investigating communication data from a controlled environment
US11526658B2 (en) 2017-06-01 2022-12-13 Global Tel*Link Corporation System and method for analyzing and investigating communication data from a controlled environment
US10740380B2 (en) 2018-05-24 2020-08-11 International Business Machines Corporation Incremental discovery of salient topics during customer interaction

Also Published As

Publication number Publication date
WO2001074042A3 (en) 2002-09-19
AU2001245927A1 (en) 2001-10-08
EP1273159A2 (en) 2003-01-08

Similar Documents

Publication Publication Date Title
US20030154072A1 (en) Call analysis
WO2001074042A2 (en) Lexical analysis of telephone conversations with call center agents
US10032454B2 (en) Speaker and call characteristic sensitive open voice search
US10958779B1 (en) Machine learning dataset generation using a natural language processing technique
US11380327B2 (en) Speech communication system and method with human-machine coordination
US9213978B2 (en) System and method for speech trend analytics with objective function and feature constraints
US7346509B2 (en) Software for statistical analysis of speech
Waibel et al. Advances in automatic meeting record creation and access
CA2508946C (en) Method and apparatus for natural language call routing using confidence scores
US7711566B1 (en) Systems and methods for monitoring speech data labelers
US7453992B2 (en) System and method for management of call data using a vector based model and relational data structure
CN109151218A (en) Call voice quality detecting method, device, computer equipment and storage medium
CN101547261B (en) Association apparatus and association method
US20210350384A1 (en) Assistance for customer service agents
US11461863B2 (en) Idea assessment and landscape mapping
US20190318004A1 (en) Intelligent Call Center Agent Assistant
US11805204B2 (en) Artificial intelligence based refinement of automatic control setting in an operator interface using localized transcripts
Gavalda et al. “The Truth is Out There”: Using Advanced Speech Analytics to Learn Why Customers Call Help-line Desks and How Effectively They Are Being Served by the Call Center Agent
Nourtel et al. Evaluating the impact of speaker anonymization on emotion recognition
CA2375589A1 (en) Method and apparatus for determining user satisfaction with automated speech recognition (asr) system and quality control of the asr system
CN116821953A (en) Asset analysis method and device based on metadata
JP2020071690A (en) Pattern recognition model and pattern learning device, generation method for pattern recognition model, faq extraction method using the same and pattern recognition device, and program
Riedhammer Long story short–Global unsupervised models for keyphrase
Durukal Searching a website via spoken questions
Nambiar et al. Helping satisfy multiple objectives during a service desk conversation

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

ENP Entry into the national phase in:

Ref country code: JP

Ref document number: 2001 571636

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 2001918909

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2001918909

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2001918909

Country of ref document: EP