EP1216554A1 - System and method for delivering customized voice audio data on a packet-switched network - Google Patents
System and method for delivering customized voice audio data on a packet-switched networkInfo
- Publication number
- EP1216554A1 EP1216554A1 EP00966851A EP00966851A EP1216554A1 EP 1216554 A1 EP1216554 A1 EP 1216554A1 EP 00966851 A EP00966851 A EP 00966851A EP 00966851 A EP00966851 A EP 00966851A EP 1216554 A1 EP1216554 A1 EP 1216554A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- network
- flow
- packets
- station
- hearing profile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 230000008569 process Effects 0.000 claims abstract description 13
- 238000004891 communication Methods 0.000 claims abstract description 8
- 238000012545 processing Methods 0.000 claims description 31
- 238000001914 filtration Methods 0.000 claims description 3
- 230000004044 response Effects 0.000 claims description 3
- 230000006870 function Effects 0.000 claims description 2
- 230000008901 benefit Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000012074 hearing test Methods 0.000 description 3
- 238000009826 distribution Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000006735 deficit Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/1066—Session management
- H04L65/1083—In-session procedures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/75—Media network packet handling
- H04L65/765—Media network packet handling intermediate
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/2866—Architectures; Arrangements
- H04L67/30—Profiles
- H04L67/306—User profiles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L69/00—Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
- H04L69/30—Definitions, standards or architectural aspects of layered protocol stacks
- H04L69/32—Architecture of open systems interconnection [OSI] 7-layer type protocol stacks, e.g. the interfaces between the data link level and the physical level
- H04L69/322—Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions
- H04L69/329—Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/42391—Systems providing special services or facilities to subscribers where the subscribers are hearing-impaired persons, e.g. telephone devices for the deaf
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R5/00—Stereophonic arrangements
- H04R5/04—Circuit arrangements, e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2205/00—Details of stereophonic arrangements covered by H04R5/00 but not provided for in any of its subgroups
- H04R2205/041—Adaptation of stereophonic signal reproduction for the hearing impaired
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/55—Communication between hearing aids and external devices via a network for data exchange
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
- H04R25/70—Adaptation of deaf aid to hearing loss, e.g. initial electronic fitting
Definitions
- the present invention relates to the distribution and customization of voice audio data products, like telephone signals, via a packet-switched network such as networks based on the Internet Protocol (IP).
- IP Internet Protocol
- the hearing profiles of individuals vary in a number of ways.
- the ability to hear sounds varies with frequency among individuals across the normal audio frequency range.
- the dynamic range varies among individuals so that levels of an audio stimulus that are perceived as soft sounds and levels of an audio stimulus that are perceived as loud sounds differ from person to person.
- Standard hearing tests are designed to produce an audiogram that characterizes such factors as frequency, sensitivity and dynamic range in the hearing profiles of individuals.
- psycho-acoustic factors concerning the manner in which a person perceives combinations of normal sounds affect the ability to hear in ways that can vary from person to person.
- Telephone signals are distributed in the public switched telephone networks (PSTNs) in a form intended for persons having hearing profiles within a normal range.
- PSTNs public switched telephone networks
- Proposals for applying custom processing of telephone signals in the PSTN according to a customer's hearing impairment have been presented. See, for example, U.S. Patent No. 5,388,185 entitled SYSTEM FOR ADAPTIVE PROCESSING OF TELEPHONE VOICE SIGNALS, invented by Terry et al.
- Such proposals have been limited to circuit-switched telephone networks with limitations inherent in such systems.
- Techniques are being developed for the transmission of real time voice over IP based networks, which are sometimes referred to as Voice over Internet Protocol (VoIP) technologies.
- VoIP Voice over Internet Protocol
- One leading VoIP technology is based on ITU-T Recommendation H.323, "Packet-Based Multimedia Communications Systems," from the International Telecommunication Union which describes a set of protocols useful for supporting real time audio and video communication over IP networks.
- the present invention provides customized audio for VoIP sessions, or other packet- switched networks for carrying voice, and systems and methods for producing the customized audio data for such sessions, processed according to a hearing profile of the participants. According to the present invention, the quality of VoIP telephone calls and other voice sessions on packet-switched networks is improved.
- the present invention provides a processor coupled to a packet-switched communication network which processes a flow of packets carrying voice data according to the hearing profiles of the intended recipients.
- the flow of packets in various embodiments is compliant with a standard Internet Protocol.
- the flow of packets is compliant with ITU-T Recommendation H.323.
- the hearing profile data is stored in one embodiment with a registry of users. Resources are included in this embodiment to associate data from the registry characterizing a hearing profile of the user with a flow of packets in the packet-switched network for use and customizing the voice data.
- the invention includes a method for conimunication of voice signals in a packet-switched network that includes detecting a flow of packets carrying the voice signals on the network to a recipient, utilizing data within the packets.
- the method includes obtaining a hearing profile of the recipient from memory on at least one of an intermediate station or an end station in the network.
- the flow of packets is processed according to the hearing profile in a processor on one of an intermediate station and an end station in the network.
- a typical session on a packet-switched network carrying voice includes a first flow from a first participant in the call to the second adjustment in the call, and a second flow from the second participant to the first. The processing of the first and second flows can be positioned in different locales in the network.
- a flow to a particular participant may be processed in a processor within the local area network of the participant.
- Data sent from the other participant in the session is delivered to the processor within the local area network of the participant via the Internet for processing prior to delivery.
- the sending station need not have the hearing profile data available for the intended recipients.
- the processing of hearing profile data is executing a single server within the network which stores the hearing profiles of the recipients of the data.
- the method includes establishing a registry of users in a processor coupled to the network.
- the registry includes parameters indicating whether processing according to a hearing profile of a user is enabled for that user, and other parameters necessary for establishing the session and providing a hearing profile for use within the session.
- a processor detects a flow of packets carrying voice signals on the network to a recipient, the packets being identifiable by parameters within the packets as members of the flow.
- the processor accesses the registry in response to the flow and if the recipient is a registered user, a hearing profile is obtained using parameters available from the registry from memory in the network.
- the flow of packets is processed according to a hearing profile.
- the invention includes maintaining the registry in a gate keeper processor on the network, wherein the gate keeper processor performs functions compliant with ITU-T Recommendation H.323.
- the present invention provides for the customization of voice sessions in packet-switched networks.
- a customization of voice sessions requires the management of user registries and of processing resources necessary for matching individual voice sessions with data relevant to the participants in the sessions.
- the present invention by taking advantage of the power and flexibility of packet-switched networks such as the Internet, makes the delivery of voice sessions customized according to individual hearing profiles practical and cost-effective.
- FIG. 1 illustrates a network serving voice over the Internet protocol with customized processing according to the hearing profile of users according to the present invention.
- Fig. 2 is a simplified flow chart illustrating the processing at a locale in the network of a voice over the Internet Protocol session with user hearing profiles.
- Fig. 3 is a simplified block diagram of an H.323 standard network enhanced with hearing profile processing according to the present invention.
- Fig. 4 illustrates one example entry in a registry of hearing profiles for the system of Fig.
- Fig. 5 is a simplified flow chart of a process for establishing an H.323 call with customized processing according to the present invention.
- Fig. 6 is a diagram of one example of a hearing profile for use in producing customized audio for VoIP sessions.
- Fig. 7 is a diagram of another example of a hearing profile for use in producing customized audio for VoIP sessions.
- Fig. 8 is a diagram of yet another example of a hearing profile for use in producing customized audio for VoIP sessions.
- Fig. 1 provides an illustration of one example network supporting VoIP telephone calls with audio data processed according to the hearing profiles of the participants in the call.
- the example network includes local area network 10 and local area network 11 which are coupled to one another through an Internet protocol network 12.
- a gateway server 13 is coupled to the Internet protocol network 12 and to the public switched telephone network PSTN 16.
- Local area network 10 includes VoIP server/router 14 which couples the local area network 10 to the Internet protocol network 12.
- Local area network 11 includes a VoIP server/router 17 which couples the local area network 11 to the Internet protocol network 12.
- End stations illustrated in the example of Fig. 1 include the telephones 20, 21, 22 coupled to the local area network 10, and the voice enabled personal computer 23 coupled to the local area network 10. Also, the stations coupled to the local area network 11 include telephones 25 and 26, and voice or multimedia enabled personal computers 27 and 28.
- a central office switch 30 connects end station telephones 31 and 32 into the network.
- the intermediate station servers 14 and 17 are coupled to hearing profile databases 15 and 18 to support processing of VoIP sessions according to the hearing profiles of the participants in the call.
- the VoIP server/router 17 associates the hearing profile of the recipient using end station 28 with the incoming packet flow of the session, and processes the audio data using the hearing profile.
- the VoIP server/router 14 coupled to the local area network 10 associates the hearing profile of the recipient using the telephone 22 with the incoming packet flow of the session, and processes the audio data using the hearing profile as it is delivered to the telephone 22.
- the VoIP gateway 13 associates the hearing profile of the recipient at the telephone 32 with incoming packets in the VoIP session, and processes the data using the hearing profile.
- the processed data is then re-formatted for the circuit switched public switched telephone network for delivery to the recipient at the telephone 32.
- the hearing profile databases 15, 18, 19 illustrated in the example of Fig. 1 maintain registries of users of the VoIP service.
- the registries contain the necessary information to support the processing of the packet flows in the VoIP sessions utilizing the hearing profiles of users having entries in the databases.
- the establishment and maintenance of the hearing profile databases in the packet- switched network can be handled using a variety of network architectures, taking full advantage of the packet-switched network flexibility.
- a central server may be used for acquiring, updating, and delivering hearing profiles for use in the VoIP sessions supported by the database.
- distributed servers each maintaining hearing profile data for a limited number of users could be utilized which process only incoming packet flows.
- the hearing profiles may be stored locally on the end stations, and no intermediate station need be involved in the processing of the packet flows according to the hearing profiles.
- a combination of locales in the network may be used to support a particular session, according to the capability of the stations involved in the session.
- a basic process executed at a locale in the packet-switched network according to the present invention for packet flows in a VoIP call is illustrated in Fig. 2.
- the locale in the network either an end station or an intermediate station, detects a VoIP session by filtering the packet flow to detect headers or other control information tagging the flow as a VoIP session (step 50).
- a profile of a recipient at the destination end station is retrieved from memory (step 51).
- the packet flow is processed according to the hearing profile of the recipient (step 52).
- the processed packets are forwarded to the destination end station (step 53).
- FIG. 3 provides a simplified illustration of an H.323 network enhanced according to the present invention.
- an H.323 gatekeeper process 100 is coupled to the Internet 101.
- H.323 compliant end stations 102 and 103 are also coupled to the Internet 101.
- the H.323 compliant gateway 104 is coupled to end stations, such as end station 105 which support other protocol suites for the VoIP session.
- the gate keeper process 100 includes an H.323 registry 106 which is enhanced according to the present invention with a hearing profile database.
- entries in the H.323 registry 106 have a format such as shown in Fig. 4.
- the format includes a user identifier 120, and H.323 call parameters 121 which are developed according to the standard.
- a pointer to a hearing profile, or a hearing profile itself is stored in the field 122.
- the hearing profile processing capability for the user is stored as a parameter 123.
- a few examples of the hearing profile pointer 122 include a URL, a memory address in local memory on the gate keeper, or a memory address in a particular storage device having another format.
- the hearing profile processing capability parameter 123 indicates where the processing is available for applying the hearing profile.
- parameter 124 indicates that digital signal processing resources are available locally on an end station of the user (e.g., end station 102), parameter 125 indicates that digital signal processing resources are available on a gateway 104 through which the user accesses the network, and parameter 126 indicates that digital signal processing resources at the gate keeper 100 are to be used for packet flows destined to the identified user.
- Fig. 5 provides a simplified flow chart of the processing at the gate keeper 100 of Fig. 4 enhanced according to present invention.
- the gate keeper detects an H.323 call set up session (step 150).
- the hearing profile processing capability for the identified end stations is determined using the information in the registry (step 151).
- the hearing profile or hearing profiles involved in the session are forwarded to the processors that are associated with the registered users.
- the hearing profile is simply retrieved from the registry for use at the gate keeper processor itself (step 152).
- the H.323 call set up procedures establish call paths for each direction, taking into account the locale in the network for processing of the audio signals with the hearing profiles of the users (step 153).
- the call set up is completed (step 154).
- Figs. 6, 7 and 8 illustrate alternative formats for the hearing profiles to be stored in the registry 106, or databases 15, 17, 19 in the examples shown in the Figures.
- the hearing profile shown in Fig. 6 includes a customer ID 250, coefficients of transform equations 251, and an identifier of the playback device type and listening environment 252 in which the audio data product is to be played.
- Fig. 6 includes a customer ID 250, coefficients of transform equations 251, and an identifier of the playback device type and listening environment 252 in which the audio data product is to be played.
- the hearing profile includes a customer identifier 260, the audiogram for customer 261, listening condition data 262, and psycho-acoustic parameters 263 which relate to the listening characteristics of the customer.
- the hearing profile shown in Fig. 8 includes a customer identifier 270, and software 271 along with code data structures 272 that in combination provide executable transform code for producing the customized audio data product from the selected audio data product.
- the three examples of hearing profile formats shown in Figs. 6-8 are representative of a large number of hearing profile formats that could be utilized depending on the type of transform processes being executed, the type of audio products being delivered, and other factors related to the architecture of the system.
- the system in various embodiments provides input tools, such as a graphical user interface usable, for example, at a kiosk or accessible via a network, for accepting input data concerning a hearing profile of a customer, either supported by a specialized hearing test device or by other resources for use in the test.
- the input tools are used for creating a machine readable hearing profile, which is stored in a registry of hearing profiles for use in the production of customized audio data products.
- the hearing profile is stored on a portable data storage medium for use by the customer, as described above.
- the system provides tools for accepting input data characterizing the feedback from the customer.
- Processing resources are provided for modifying the machine readable hearing profile in response to the feedback data.
- the modifying of the machine readable hearing profile in one aspect of the invention comprises applying an optimization modeling process to the hearing profile, by which the profiles are improved over time using feedback arising from a variety of customized audio data products and from a variety of listening conditions.
- the related applications incorporated by reference above, include detailed descriptions of technologies for gathering feedback and optimizing hearing profile data. While the present invention is disclosed by reference to the preferred embodiments and examples detailed above, it is to be understood that these examples are intended in an illustrative rather than in a limiting sense. It is contemplated that modifications and combinations will readily occur to those skilled in the art, which modifications and combinations will be within the spirit of the invention and the scope of the appended claims.
Abstract
Customized audio for VoIP sessions, or other packet-switched networks for carrying voice, is processed according to a hearing profile of the participants. A processor (13, 14, 17) coupled to a packet-switched communication network (12) processes a flow of packets carrying voice data according to the hearing profiles of the intended recipients. The flow of packets in various embodiments is compliant with a standard Internet Protocol, including, for example, ITU-T Recommendation H.323. The hearing profile data is stored in one embodiment with a registry of users (15, 18, 19). Resources are included in this embodiment to associate data from the registry characterizing a hearing profile of the user with a flow of packets in the packet-switched network for use in customizing the voice data.
Description
SYSTEM AND METHOD FOR DELIVERING CUSTOMIZED VOICE AUDIO DATA ON A PACKET-SWITCHED NETWORK
BACKGROUND OF THE INVENTION Field of the Invention
The present invention relates to the distribution and customization of voice audio data products, like telephone signals, via a packet-switched network such as networks based on the Internet Protocol (IP).
Description of Related Art
The hearing profiles of individuals vary in a number of ways. The ability to hear sounds varies with frequency among individuals across the normal audio frequency range. Also, the dynamic range varies among individuals so that levels of an audio stimulus that are perceived as soft sounds and levels of an audio stimulus that are perceived as loud sounds differ from person to person. Standard hearing tests are designed to produce an audiogram that characterizes such factors as frequency, sensitivity and dynamic range in the hearing profiles of individuals. There are also other factors that affect a hearing profile. For example, psycho-acoustic factors concerning the manner in which a person perceives combinations of normal sounds affect the ability to hear in ways that can vary from person to person. Also, environmental factors such as the usual listening environment of a person (library, conference room, concert hall) and the equipment on which the sound is produced (loud speakers, ear phones, telephone hand set) are important. In persons wearing hearing aids or using other assistive hearing devices, the type of aid or device affects the hearing profile. The physiology of an impairment suffered by the individual may also be an important factor in the hearing profile.
The hearing profiles of individuals have been applied in the hearing aid field for customizing and fitting hearing aids for individuals. See, for example, U.S. Patent No. 4,731,850 entitled PROGRAMMABLE DIGITAL HEARING AID SYSTEM, invented by Levitt et al.; and U.S. Patent No. 5,848,171 entitled HEARING AID DEVICE INCORPORATING SIGNAL PROCESSING TECHNIQUES, invented by Stockham, Jr. et al. Thus, techniques for processing sound to offset variations in hearing are well known. However, these techniques are unavailable
to persons not using hearing aids. Furthermore, many persons who could benefit from such processing are not in positions to use hearing aids for a variety of reasons.
Telephone signals are distributed in the public switched telephone networks (PSTNs) in a form intended for persons having hearing profiles within a normal range. Proposals for applying custom processing of telephone signals in the PSTN according to a customer's hearing impairment have been presented. See, for example, U.S. Patent No. 5,388,185 entitled SYSTEM FOR ADAPTIVE PROCESSING OF TELEPHONE VOICE SIGNALS, invented by Terry et al. Such proposals have been limited to circuit-switched telephone networks with limitations inherent in such systems. Techniques are being developed for the transmission of real time voice over IP based networks, which are sometimes referred to as Voice over Internet Protocol (VoIP) technologies. One leading VoIP technology is based on ITU-T Recommendation H.323, "Packet-Based Multimedia Communications Systems," from the International Telecommunication Union which describes a set of protocols useful for supporting real time audio and video communication over IP networks.
Accordingly, it is desirable to provide systems and methods to apply techniques for the processing of real time audio data according to hearing profiles for the benefit of individuals who are not wearing hearing assistance devices. Also it is desirable to provide tools to simplify the gathering of information needed to develop hearing profiles of individuals, and to apply the gathered information in the distribution and customization of real time voice signals.
SUMMARY OF THE INVENTION The present invention provides customized audio for VoIP sessions, or other packet- switched networks for carrying voice, and systems and methods for producing the customized audio data for such sessions, processed according to a hearing profile of the participants. According to the present invention, the quality of VoIP telephone calls and other voice sessions on packet-switched networks is improved.
Thus, the present invention provides a processor coupled to a packet-switched communication network which processes a flow of packets carrying voice data according to the hearing profiles of the intended recipients. The flow of packets in various embodiments is
compliant with a standard Internet Protocol. In one embodiment, the flow of packets is compliant with ITU-T Recommendation H.323. The hearing profile data is stored in one embodiment with a registry of users. Resources are included in this embodiment to associate data from the registry characterizing a hearing profile of the user with a flow of packets in the packet-switched network for use and customizing the voice data.
The invention includes a method for conimunication of voice signals in a packet-switched network that includes detecting a flow of packets carrying the voice signals on the network to a recipient, utilizing data within the packets. In addition, the method includes obtaining a hearing profile of the recipient from memory on at least one of an intermediate station or an end station in the network. Finally, the flow of packets is processed according to the hearing profile in a processor on one of an intermediate station and an end station in the network. A typical session on a packet-switched network carrying voice includes a first flow from a first participant in the call to the second adjustment in the call, and a second flow from the second participant to the first. The processing of the first and second flows can be positioned in different locales in the network. For example, a flow to a particular participant may be processed in a processor within the local area network of the participant. Data sent from the other participant in the session is delivered to the processor within the local area network of the participant via the Internet for processing prior to delivery. In this manner, the sending station need not have the hearing profile data available for the intended recipients. In alternative systems, the processing of hearing profile data is executing a single server within the network which stores the hearing profiles of the recipients of the data.
According to another embodiment of the invention, the method includes establishing a registry of users in a processor coupled to the network. The registry includes parameters indicating whether processing according to a hearing profile of a user is enabled for that user, and other parameters necessary for establishing the session and providing a hearing profile for use within the session. Accordingly, a processor detects a flow of packets carrying voice signals on the network to a recipient, the packets being identifiable by parameters within the packets as members of the flow. Next, the processor accesses the registry in response to the flow and if the recipient is a registered user, a hearing profile is obtained using parameters available from the registry from memory in the network. Finally, the flow of packets is processed according to a hearing profile.
In another embodiment, the invention includes maintaining the registry in a gate keeper processor on the network, wherein the gate keeper processor performs functions compliant with ITU-T Recommendation H.323.
Accordingly, the present invention provides for the customization of voice sessions in packet-switched networks. A customization of voice sessions requires the management of user registries and of processing resources necessary for matching individual voice sessions with data relevant to the participants in the sessions. The present invention, by taking advantage of the power and flexibility of packet-switched networks such as the Internet, makes the delivery of voice sessions customized according to individual hearing profiles practical and cost-effective. Other aspects and advantages of the present invention can be seen on review of the drawings, the detailed description, and the claims which follow.
BRIEF DESCRIPTION OF THE FIGURES Fig. 1 illustrates a network serving voice over the Internet protocol with customized processing according to the hearing profile of users according to the present invention.
Fig. 2 is a simplified flow chart illustrating the processing at a locale in the network of a voice over the Internet Protocol session with user hearing profiles.
Fig. 3 is a simplified block diagram of an H.323 standard network enhanced with hearing profile processing according to the present invention. Fig. 4 illustrates one example entry in a registry of hearing profiles for the system of Fig.
3.
Fig. 5 is a simplified flow chart of a process for establishing an H.323 call with customized processing according to the present invention.
Fig. 6 is a diagram of one example of a hearing profile for use in producing customized audio for VoIP sessions.
Fig. 7 is a diagram of another example of a hearing profile for use in producing customized audio for VoIP sessions.
Fig. 8 is a diagram of yet another example of a hearing profile for use in producing customized audio for VoIP sessions.
DETAILED DESCRIPTION
A detailed description of the various embodiments of the present invention is provided with reference to Figs. 1 through 8.
Fig. 1 provides an illustration of one example network supporting VoIP telephone calls with audio data processed according to the hearing profiles of the participants in the call. The example network includes local area network 10 and local area network 11 which are coupled to one another through an Internet protocol network 12. Also, a gateway server 13 is coupled to the Internet protocol network 12 and to the public switched telephone network PSTN 16. Local area network 10 includes VoIP server/router 14 which couples the local area network 10 to the Internet protocol network 12. Local area network 11 includes a VoIP server/router 17 which couples the local area network 11 to the Internet protocol network 12.
End stations illustrated in the example of Fig. 1 include the telephones 20, 21, 22 coupled to the local area network 10, and the voice enabled personal computer 23 coupled to the local area network 10. Also, the stations coupled to the local area network 11 include telephones 25 and 26, and voice or multimedia enabled personal computers 27 and 28. In the public switched telephone network 16, a central office switch 30 connects end station telephones 31 and 32 into the network.
The intermediate station servers 14 and 17 are coupled to hearing profile databases 15 and 18 to support processing of VoIP sessions according to the hearing profiles of the participants in the call. Thus for example, when a call is established between end station 28 on local area network 1 1, and end station 22 on local area network 10, the VoIP server/router 17 associates the hearing profile of the recipient using end station 28 with the incoming packet flow of the session, and processes the audio data using the hearing profile. Likewise, the VoIP server/router 14 coupled to the local area network 10 associates the hearing profile of the recipient using the telephone 22 with the incoming packet flow of the session, and processes the audio data using the hearing profile as it is delivered to the telephone 22. In a similar manner, when a VoIP session is established between an end station, such as end station 25 on local area network 11, and a telephone coupled to the public switched telephone network 16, such as telephone 32, the VoIP gateway 13 associates the hearing profile of the recipient at the telephone 32 with incoming packets in the VoIP session, and processes the data using the hearing profile. The processed data is then re-formatted for the circuit switched public switched telephone network for delivery to the recipient at the telephone 32.
The hearing profile databases 15, 18, 19 illustrated in the example of Fig. 1 maintain registries of users of the VoIP service. The registries contain the necessary information to support the processing of the packet flows in the VoIP sessions utilizing the hearing profiles of users having entries in the databases. The establishment and maintenance of the hearing profile databases in the packet- switched network can be handled using a variety of network architectures, taking full advantage of the packet-switched network flexibility. For example, a central server may be used for acquiring, updating, and delivering hearing profiles for use in the VoIP sessions supported by the database. Alternatively, distributed servers, each maintaining hearing profile data for a limited number of users could be utilized which process only incoming packet flows. In yet another alternative, the hearing profiles may be stored locally on the end stations, and no intermediate station need be involved in the processing of the packet flows according to the hearing profiles. A combination of locales in the network may be used to support a particular session, according to the capability of the stations involved in the session. A basic process executed at a locale in the packet-switched network according to the present invention for packet flows in a VoIP call is illustrated in Fig. 2. The locale in the network, either an end station or an intermediate station, detects a VoIP session by filtering the packet flow to detect headers or other control information tagging the flow as a VoIP session (step 50). Upon detection of a VoIP session, a profile of a recipient at the destination end station is retrieved from memory (step 51). Next, the packet flow is processed according to the hearing profile of the recipient (step 52). Finally, the processed packets are forwarded to the destination end station (step 53).
As mentioned above, one popular suite of protocols developed for packet based multimedia communication systems such as Internet protocol based network systems used for VoIP sessions is known as ITU-T Recommendation H.323. Recommendation H.323 is incorporated by reference as if fully set forth herein. Fig. 3 provides a simplified illustration of an H.323 network enhanced according to the present invention. In the network of Fig. 3, an H.323 gatekeeper process 100 is coupled to the Internet 101. H.323 compliant end stations 102 and 103 are also coupled to the Internet 101. The H.323 compliant gateway 104 is coupled to end stations, such as end station 105 which support other protocol suites for the VoIP session.
The gate keeper process 100 includes an H.323 registry 106 which is enhanced according to the present invention with a hearing profile database. Thus, for example, entries in the H.323 registry 106 have a format such as shown in Fig. 4. The format includes a user identifier 120, and H.323 call parameters 121 which are developed according to the standard. A pointer to a hearing profile, or a hearing profile itself is stored in the field 122. The hearing profile processing capability for the user is stored as a parameter 123. A few examples of the hearing profile pointer 122 include a URL, a memory address in local memory on the gate keeper, or a memory address in a particular storage device having another format. The hearing profile processing capability parameter 123 indicates where the processing is available for applying the hearing profile. Thus, parameter 124 indicates that digital signal processing resources are available locally on an end station of the user (e.g., end station 102), parameter 125 indicates that digital signal processing resources are available on a gateway 104 through which the user accesses the network, and parameter 126 indicates that digital signal processing resources at the gate keeper 100 are to be used for packet flows destined to the identified user. Fig. 5 provides a simplified flow chart of the processing at the gate keeper 100 of Fig. 4 enhanced according to present invention. During call establishment, the gate keeper detects an H.323 call set up session (step 150). The hearing profile processing capability for the identified end stations is determined using the information in the registry (step 151). Next, the hearing profile or hearing profiles involved in the session are forwarded to the processors that are associated with the registered users. Alternatively, the hearing profile is simply retrieved from the registry for use at the gate keeper processor itself (step 152). The H.323 call set up procedures establish call paths for each direction, taking into account the locale in the network for processing of the audio signals with the hearing profiles of the users (step 153). Finally, the call set up is completed (step 154). Figs. 6, 7 and 8 illustrate alternative formats for the hearing profiles to be stored in the registry 106, or databases 15, 17, 19 in the examples shown in the Figures. The hearing profile shown in Fig. 6 includes a customer ID 250, coefficients of transform equations 251, and an identifier of the playback device type and listening environment 252 in which the audio data product is to be played. In Fig. 7, the hearing profile includes a customer identifier 260, the audiogram for customer 261, listening condition data 262, and psycho-acoustic parameters 263 which relate to the listening characteristics of the customer. The hearing profile shown in Fig. 8
includes a customer identifier 270, and software 271 along with code data structures 272 that in combination provide executable transform code for producing the customized audio data product from the selected audio data product. The three examples of hearing profile formats shown in Figs. 6-8 are representative of a large number of hearing profile formats that could be utilized depending on the type of transform processes being executed, the type of audio products being delivered, and other factors related to the architecture of the system.
The system in various embodiments provides input tools, such as a graphical user interface usable, for example, at a kiosk or accessible via a network, for accepting input data concerning a hearing profile of a customer, either supported by a specialized hearing test device or by other resources for use in the test. The input tools are used for creating a machine readable hearing profile, which is stored in a registry of hearing profiles for use in the production of customized audio data products. Alternatively, the hearing profile is stored on a portable data storage medium for use by the customer, as described above. The related applications incorporated by reference above, include detailed descriptions of technologies for conducting hearing tests and for gathering hearing profile data.
Also, in various embodiments of the invention, the system provides tools for accepting input data characterizing the feedback from the customer. Processing resources are provided for modifying the machine readable hearing profile in response to the feedback data. The modifying of the machine readable hearing profile in one aspect of the invention comprises applying an optimization modeling process to the hearing profile, by which the profiles are improved over time using feedback arising from a variety of customized audio data products and from a variety of listening conditions. The related applications, incorporated by reference above, include detailed descriptions of technologies for gathering feedback and optimizing hearing profile data. While the present invention is disclosed by reference to the preferred embodiments and examples detailed above, it is to be understood that these examples are intended in an illustrative rather than in a limiting sense. It is contemplated that modifications and combinations will readily occur to those skilled in the art, which modifications and combinations will be within the spirit of the invention and the scope of the appended claims.
Claims
1. For a packet-switched communication network, a station comprising: a processor coupled to the network which processes a flow of packets carrying voice data according to hearing profiles of intended recipients.
2. The station of claim 1, wherein the flow of packets comprises packets compliant with a standard Internet Protocol.
3. The station of claim 1, wherein the flow of packets comprises packets compliant with ITU-T Recommendation H.323.
4. The station of claim 1, including a registry of users, and including resources to associate data from the registry characterizing a hearing profile of a user in the registry with the flow of packets.
5. A method for communication of voice signals in a packet-switched network, comprising: detecting a flow of packets carrying the voice signals on the network to a recipient, the packets identifiable as members of the flow; obtaining a hearing profile of the recipient from memory on at least one of an intermediate station and an end station in the network; and processing the flow of packets according to the hearing profile in a processor on at least one of an intermediate station and an end station in the network.
6. The method of claim 5, including filtering packets received in the processor to detect members of the flow.
7. The method of claim 5, wherein the flow originates at an originating end station in the network, and the memory is on the originating end station.
8. The method of claim 5, wherein the memory is on an intermediate station in the network.
9. The method of claim 5, wherein the flow is transmitted to a destination end station in the network, and the memory is on the destination end station.
10. A method for communication of voice signals in a packet-switched network, comprising: establishing a registry of users in a processor coupled to the network, the registry including parameters indicating whether processing according to a hearing profile of a user is enabled for the user; detecting a flow of packets carrying the voice signals on the network to a recipient, the packets identifiable as members of the flow; accessing the registry in response to the flow, and if the recipient is a registered user, obtaining a hearing profile of the recipient from memory in the network; and processing the flow of packets according to the hearing profile in a processor in the network.
11. The method of claim 10, including filtering packets in the processor in the network to detect members of the flow.
12. The method of claim 10, wherein the flow originates at an originating end station in the network, and the memory is on the originating end station.
13. The method of claim 10, wherein the memory is on an intermediate station in the network.
14. The method of claim 10, wherein the flow is transmitted to a destination end station in the network, and the memory is on the destination end station.
15. The method of claim 10, including maintaining the registry in a gate keeper processor on the network, and wherein the gate keeper processor performs functions compliant with ITU-T Recommendation H.323.
16. The method of claim 10, wherein the parameters include a pointer to a hearing profile for the registered user.
17. The method of claim 10, wherein the processor is on an intermediate station in the network, and including routing the flow through the intermediate station.
18. The method of claim 10, wherein the processor is on an end station associated with the recipient, and including transmitting a copy of the hearing profile from the memory in the network to the end station.
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US407751 | 1999-09-28 | ||
US09/407,751 US7181297B1 (en) | 1999-09-28 | 1999-09-28 | System and method for delivering customized audio data |
US46403699A | 1999-12-15 | 1999-12-15 | |
US464036 | 1999-12-15 | ||
US49525500A | 2000-01-31 | 2000-01-31 | |
US495255 | 2000-01-31 | ||
PCT/US2000/026304 WO2001024462A1 (en) | 1999-09-28 | 2000-09-25 | System and method for delivering customized voice audio data on a packet-switched network |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1216554A1 true EP1216554A1 (en) | 2002-06-26 |
EP1216554A4 EP1216554A4 (en) | 2005-10-12 |
Family
ID=27410724
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP00966851A Withdrawn EP1216554A4 (en) | 1999-09-28 | 2000-09-25 | System and method for delivering customized voice audio data on a packet-switched network |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1216554A4 (en) |
JP (1) | JP2003515261A (en) |
CN (1) | CN1390408A (en) |
AU (1) | AU7713500A (en) |
WO (1) | WO2001024462A1 (en) |
Families Citing this family (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1552723A4 (en) * | 2002-10-15 | 2010-02-17 | Korea Electronics Telecomm | Apparatus and method for adapting audio signal according to user's preference |
US9378187B2 (en) | 2003-12-11 | 2016-06-28 | International Business Machines Corporation | Creating a presentation document |
US7634412B2 (en) | 2003-12-11 | 2009-12-15 | Nuance Communications, Inc. | Creating a voice response grammar from a user grammar |
US7162692B2 (en) | 2003-12-11 | 2007-01-09 | International Business Machines Corporation | Differential dynamic content delivery |
US8499232B2 (en) | 2004-01-13 | 2013-07-30 | International Business Machines Corporation | Differential dynamic content delivery with a participant alterable session copy of a user profile |
US7567908B2 (en) | 2004-01-13 | 2009-07-28 | International Business Machines Corporation | Differential dynamic content delivery with text display in dependence upon simultaneous speech |
US8001454B2 (en) | 2004-01-13 | 2011-08-16 | International Business Machines Corporation | Differential dynamic content delivery with presentation control instructions |
US7890848B2 (en) | 2004-01-13 | 2011-02-15 | International Business Machines Corporation | Differential dynamic content delivery with alternative content presentation |
US7571380B2 (en) | 2004-01-13 | 2009-08-04 | International Business Machines Corporation | Differential dynamic content delivery with a presenter-alterable session copy of a user profile |
US8954844B2 (en) | 2004-01-13 | 2015-02-10 | Nuance Communications, Inc. | Differential dynamic content delivery with text display in dependence upon sound level |
US7287221B2 (en) | 2004-01-13 | 2007-10-23 | International Business Machines Corporation | Differential dynamic content delivery with text display in dependence upon sound level |
US7430707B2 (en) | 2004-01-13 | 2008-09-30 | International Business Machines Corporation | Differential dynamic content delivery with device controlling action |
US7519683B2 (en) | 2004-04-26 | 2009-04-14 | International Business Machines Corporation | Dynamic media content for collaborators with client locations in dynamic client contexts |
US7831906B2 (en) | 2004-04-26 | 2010-11-09 | International Business Machines Corporation | Virtually bound dynamic media content for collaborators |
US7519659B2 (en) | 2004-04-26 | 2009-04-14 | International Business Machines Corporation | Dynamic media content for collaborators |
US7827239B2 (en) | 2004-04-26 | 2010-11-02 | International Business Machines Corporation | Dynamic media content for collaborators with client environment information in dynamic client contexts |
US7487208B2 (en) | 2004-07-08 | 2009-02-03 | International Business Machines Corporation | Differential dynamic content delivery to alternate display device locations |
US7428698B2 (en) | 2004-07-08 | 2008-09-23 | International Business Machines Corporation | Differential dynamic delivery of content historically likely to be viewed |
US7519904B2 (en) | 2004-07-08 | 2009-04-14 | International Business Machines Corporation | Differential dynamic delivery of content to users not in attendance at a presentation |
US7921362B2 (en) | 2004-07-08 | 2011-04-05 | International Business Machines Corporation | Differential dynamic delivery of presentation previews |
US8185814B2 (en) | 2004-07-08 | 2012-05-22 | International Business Machines Corporation | Differential dynamic delivery of content according to user expressions of interest |
US7487209B2 (en) | 2004-07-13 | 2009-02-03 | International Business Machines Corporation | Delivering dynamic media content for collaborators to purposeful devices |
US7426538B2 (en) | 2004-07-13 | 2008-09-16 | International Business Machines Corporation | Dynamic media content for collaborators with VOIP support for client communications |
US9167087B2 (en) | 2004-07-13 | 2015-10-20 | International Business Machines Corporation | Dynamic media content for collaborators including disparate location representations |
US7475340B2 (en) | 2005-03-24 | 2009-01-06 | International Business Machines Corporation | Differential dynamic content delivery with indications of interest from non-participants |
US7493556B2 (en) | 2005-03-31 | 2009-02-17 | International Business Machines Corporation | Differential dynamic content delivery with a session document recreated in dependence upon an interest of an identified user participant |
US7523388B2 (en) | 2005-03-31 | 2009-04-21 | International Business Machines Corporation | Differential dynamic content delivery with a planned agenda |
US9419843B2 (en) * | 2006-07-06 | 2016-08-16 | Broadcom Corporation | Custom ASIC combining VoIP and security IP |
US9138178B2 (en) | 2010-08-05 | 2015-09-22 | Ace Communications Limited | Method and system for self-managed sound enhancement |
CN104937954B (en) * | 2013-01-09 | 2019-06-28 | 听优企业 | Method and system for the enhancing of Self management sound |
KR20180075642A (en) * | 2015-10-29 | 2018-07-04 | 와이덱스 에이/에스 | System and method for managing customizable configurations of hearing aids |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5388185A (en) * | 1991-09-30 | 1995-02-07 | U S West Advanced Technologies, Inc. | System for adaptive processing of telephone voice signals |
EP0719016A2 (en) * | 1994-12-01 | 1996-06-26 | Sharp Kabushiki Kaisha | Communication equipment |
US5559855A (en) * | 1994-11-29 | 1996-09-24 | Lucent Technologies Inc. | System and method for recognizing and routing telephone calls involving hearing or speech impaired persons |
US5832212A (en) * | 1996-04-19 | 1998-11-03 | International Business Machines Corporation | Censoring browser method and apparatus for internet viewing |
WO1999041880A1 (en) * | 1998-02-12 | 1999-08-19 | Qsound Labs, Inc. | Teleconferencing method and apparatus with three-dimensional sound positioning |
-
2000
- 2000-09-25 AU AU77135/00A patent/AU7713500A/en not_active Abandoned
- 2000-09-25 EP EP00966851A patent/EP1216554A4/en not_active Withdrawn
- 2000-09-25 CN CN00813413.8A patent/CN1390408A/en active Pending
- 2000-09-25 WO PCT/US2000/026304 patent/WO2001024462A1/en not_active Application Discontinuation
- 2000-09-25 JP JP2001527519A patent/JP2003515261A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5388185A (en) * | 1991-09-30 | 1995-02-07 | U S West Advanced Technologies, Inc. | System for adaptive processing of telephone voice signals |
US5559855A (en) * | 1994-11-29 | 1996-09-24 | Lucent Technologies Inc. | System and method for recognizing and routing telephone calls involving hearing or speech impaired persons |
EP0719016A2 (en) * | 1994-12-01 | 1996-06-26 | Sharp Kabushiki Kaisha | Communication equipment |
US5832212A (en) * | 1996-04-19 | 1998-11-03 | International Business Machines Corporation | Censoring browser method and apparatus for internet viewing |
WO1999041880A1 (en) * | 1998-02-12 | 1999-08-19 | Qsound Labs, Inc. | Teleconferencing method and apparatus with three-dimensional sound positioning |
Non-Patent Citations (1)
Title |
---|
See also references of WO0124462A1 * |
Also Published As
Publication number | Publication date |
---|---|
WO2001024462A1 (en) | 2001-04-05 |
CN1390408A (en) | 2003-01-08 |
EP1216554A4 (en) | 2005-10-12 |
JP2003515261A (en) | 2003-04-22 |
AU7713500A (en) | 2001-04-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2001024462A1 (en) | System and method for delivering customized voice audio data on a packet-switched network | |
US7075921B2 (en) | Remote media control for voice over internet telephony and related applications | |
US6707811B2 (en) | Internet telephony for ecommerce | |
US6791974B1 (en) | Universal internet based telephony system that provides ubiquitous access for subscribers from any terminal device | |
CN100448218C (en) | Method for receiving call | |
US9020621B1 (en) | Network based media enhancement function based on an identifier | |
JP2007006047A (en) | Voice ip telephone method and device | |
JP2006512016A (en) | Method and apparatus for realizing call processing in packet telephone network | |
KR20000062885A (en) | Computer network telephony | |
KR20010041478A (en) | Establishing a voice call from a client computer via a bridgeport | |
WO2001045346A2 (en) | A client-server network for managing internet protocol voice packets | |
US20150269953A1 (en) | Audio signal manipulation for speech enhancement before sound reproduction | |
US20140278402A1 (en) | Automatic Channel Selective Transcription Engine | |
US7620158B2 (en) | Video relay system and method | |
WO2014143827A1 (en) | Dynamic personalization of a communication session in heterogeneous environments | |
US20040156493A1 (en) | Method and apparatus for providing a central telephony service for a calling party at the called party telephone | |
US20030076817A1 (en) | Apparatus and method for interfacing internet telephone | |
US6707827B1 (en) | Method and apparatus for optimizing audio response in telephony-over-LAN systems | |
JP4502835B2 (en) | Mobile communication system and mobile phone notification method used therefor | |
TWI240516B (en) | System for automatically selecting voice data transmission and reception system for IP network, method thereof, and IP terminal | |
EP0964567A2 (en) | A programmable telecommunications interface for communication over a data network | |
AU1899201A (en) | Method and system for transmitting communication signals through internet and a public switched telephone network | |
Cisco | E | |
US6603848B1 (en) | Techniques for providing caller name announcement | |
KR20000072754A (en) | Method and system for transmitting communication signals through the internet and a public switched telephone network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20020328 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20050829 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: 7H 04L 29/06 A |
|
18W | Application withdrawn |
Effective date: 20050921 |