EP1216554A4 - System and method for delivering customized voice audio data on a packet-switched network - Google Patents

System and method for delivering customized voice audio data on a packet-switched network

Info

Publication number
EP1216554A4
EP1216554A4 EP00966851A EP00966851A EP1216554A4 EP 1216554 A4 EP1216554 A4 EP 1216554A4 EP 00966851 A EP00966851 A EP 00966851A EP 00966851 A EP00966851 A EP 00966851A EP 1216554 A4 EP1216554 A4 EP 1216554A4
Authority
EP
European Patent Office
Prior art keywords
network
flow
packets
station
hearing profile
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP00966851A
Other languages
German (de)
French (fr)
Other versions
EP1216554A1 (en
Inventor
Vincent Pluvinage
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sound ID Inc
Original Assignee
Sound ID Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/407,751 external-priority patent/US7181297B1/en
Application filed by Sound ID Inc filed Critical Sound ID Inc
Publication of EP1216554A1 publication Critical patent/EP1216554A1/en
Publication of EP1216554A4 publication Critical patent/EP1216554A4/en
Withdrawn legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066Session management
    • H04L65/1083In-session procedures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/765Media network packet handling intermediate
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/2866Architectures; Arrangements
    • H04L67/30Profiles
    • H04L67/306User profiles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/30Definitions, standards or architectural aspects of layered protocol stacks
    • H04L69/32Architecture of open systems interconnection [OSI] 7-layer type protocol stacks, e.g. the interfaces between the data link level and the physical level
    • H04L69/322Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions
    • H04L69/329Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42391Systems providing special services or facilities to subscribers where the subscribers are hearing-impaired persons, e.g. telephone devices for the deaf
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/04Circuit arrangements, e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2205/00Details of stereophonic arrangements covered by H04R5/00 but not provided for in any of its subgroups
    • H04R2205/041Adaptation of stereophonic signal reproduction for the hearing impaired
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/55Communication between hearing aids and external devices via a network for data exchange
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/70Adaptation of deaf aid to hearing loss, e.g. initial electronic fitting

Definitions

  • the present invention relates to the distribution and customization of voice audio data products, like telephone signals, via a packet-switched network such as networks based on the Internet Protocol (IP).
  • IP Internet Protocol
  • the hearing profiles of individuals vary in a number of ways.
  • the ability to hear sounds varies with frequency among individuals across the normal audio frequency range.
  • the dynamic range varies among individuals so that levels of an audio stimulus that are perceived as soft sounds and levels of an audio stimulus that are perceived as loud sounds differ from person to person.
  • Standard hearing tests are designed to produce an audiogram that characterizes such factors as frequency, sensitivity and dynamic range in the hearing profiles of individuals.
  • psycho-acoustic factors concerning the manner in which a person perceives combinations of normal sounds affect the ability to hear in ways that can vary from person to person.
  • Telephone signals are distributed in the public switched telephone networks (PSTNs) in a form intended for persons having hearing profiles within a normal range.
  • PSTNs public switched telephone networks
  • Proposals for applying custom processing of telephone signals in the PSTN according to a customer's hearing impairment have been presented. See, for example, U.S. Patent No. 5,388,185 entitled SYSTEM FOR ADAPTIVE PROCESSING OF TELEPHONE VOICE SIGNALS, invented by Terry et al.
  • Such proposals have been limited to circuit-switched telephone networks with limitations inherent in such systems.
  • Techniques are being developed for the transmission of real time voice over IP based networks, which are sometimes referred to as Voice over Internet Protocol (VoIP) technologies.
  • VoIP Voice over Internet Protocol
  • One leading VoIP technology is based on ITU-T Recommendation H.323, "Packet-Based Multimedia Communications Systems," from the International Telecommunication Union which describes a set of protocols useful for supporting real time audio and video communication over IP networks.
  • the present invention provides customized audio for VoIP sessions, or other packet- switched networks for carrying voice, and systems and methods for producing the customized audio data for such sessions, processed according to a hearing profile of the participants. According to the present invention, the quality of VoIP telephone calls and other voice sessions on packet-switched networks is improved.
  • the present invention provides a processor coupled to a packet-switched communication network which processes a flow of packets carrying voice data according to the hearing profiles of the intended recipients.
  • the flow of packets in various embodiments is compliant with a standard Internet Protocol.
  • the flow of packets is compliant with ITU-T Recommendation H.323.
  • the hearing profile data is stored in one embodiment with a registry of users. Resources are included in this embodiment to associate data from the registry characterizing a hearing profile of the user with a flow of packets in the packet-switched network for use and customizing the voice data.
  • the invention includes a method for conimunication of voice signals in a packet-switched network that includes detecting a flow of packets carrying the voice signals on the network to a recipient, utilizing data within the packets.
  • the method includes obtaining a hearing profile of the recipient from memory on at least one of an intermediate station or an end station in the network.
  • the flow of packets is processed according to the hearing profile in a processor on one of an intermediate station and an end station in the network.
  • a typical session on a packet-switched network carrying voice includes a first flow from a first participant in the call to the second adjustment in the call, and a second flow from the second participant to the first. The processing of the first and second flows can be positioned in different locales in the network.
  • a flow to a particular participant may be processed in a processor within the local area network of the participant.
  • Data sent from the other participant in the session is delivered to the processor within the local area network of the participant via the Internet for processing prior to delivery.
  • the sending station need not have the hearing profile data available for the intended recipients.
  • the processing of hearing profile data is executing a single server within the network which stores the hearing profiles of the recipients of the data.
  • the method includes establishing a registry of users in a processor coupled to the network.
  • the registry includes parameters indicating whether processing according to a hearing profile of a user is enabled for that user, and other parameters necessary for establishing the session and providing a hearing profile for use within the session.
  • a processor detects a flow of packets carrying voice signals on the network to a recipient, the packets being identifiable by parameters within the packets as members of the flow.
  • the processor accesses the registry in response to the flow and if the recipient is a registered user, a hearing profile is obtained using parameters available from the registry from memory in the network.
  • the flow of packets is processed according to a hearing profile.
  • the invention includes maintaining the registry in a gate keeper processor on the network, wherein the gate keeper processor performs functions compliant with ITU-T Recommendation H.323.
  • the present invention provides for the customization of voice sessions in packet-switched networks.
  • a customization of voice sessions requires the management of user registries and of processing resources necessary for matching individual voice sessions with data relevant to the participants in the sessions.
  • the present invention by taking advantage of the power and flexibility of packet-switched networks such as the Internet, makes the delivery of voice sessions customized according to individual hearing profiles practical and cost-effective.
  • FIG. 1 illustrates a network serving voice over the Internet protocol with customized processing according to the hearing profile of users according to the present invention.
  • Fig. 2 is a simplified flow chart illustrating the processing at a locale in the network of a voice over the Internet Protocol session with user hearing profiles.
  • Fig. 3 is a simplified block diagram of an H.323 standard network enhanced with hearing profile processing according to the present invention.
  • Fig. 4 illustrates one example entry in a registry of hearing profiles for the system of Fig.
  • Fig. 5 is a simplified flow chart of a process for establishing an H.323 call with customized processing according to the present invention.
  • Fig. 6 is a diagram of one example of a hearing profile for use in producing customized audio for VoIP sessions.
  • Fig. 7 is a diagram of another example of a hearing profile for use in producing customized audio for VoIP sessions.
  • Fig. 8 is a diagram of yet another example of a hearing profile for use in producing customized audio for VoIP sessions.
  • Fig. 1 provides an illustration of one example network supporting VoIP telephone calls with audio data processed according to the hearing profiles of the participants in the call.
  • the example network includes local area network 10 and local area network 11 which are coupled to one another through an Internet protocol network 12.
  • a gateway server 13 is coupled to the Internet protocol network 12 and to the public switched telephone network PSTN 16.
  • Local area network 10 includes VoIP server/router 14 which couples the local area network 10 to the Internet protocol network 12.
  • Local area network 11 includes a VoIP server/router 17 which couples the local area network 11 to the Internet protocol network 12.
  • End stations illustrated in the example of Fig. 1 include the telephones 20, 21, 22 coupled to the local area network 10, and the voice enabled personal computer 23 coupled to the local area network 10. Also, the stations coupled to the local area network 11 include telephones 25 and 26, and voice or multimedia enabled personal computers 27 and 28.
  • a central office switch 30 connects end station telephones 31 and 32 into the network.
  • the intermediate station servers 14 and 17 are coupled to hearing profile databases 15 and 18 to support processing of VoIP sessions according to the hearing profiles of the participants in the call.
  • the VoIP server/router 17 associates the hearing profile of the recipient using end station 28 with the incoming packet flow of the session, and processes the audio data using the hearing profile.
  • the VoIP server/router 14 coupled to the local area network 10 associates the hearing profile of the recipient using the telephone 22 with the incoming packet flow of the session, and processes the audio data using the hearing profile as it is delivered to the telephone 22.
  • the VoIP gateway 13 associates the hearing profile of the recipient at the telephone 32 with incoming packets in the VoIP session, and processes the data using the hearing profile.
  • the processed data is then re-formatted for the circuit switched public switched telephone network for delivery to the recipient at the telephone 32.
  • the hearing profile databases 15, 18, 19 illustrated in the example of Fig. 1 maintain registries of users of the VoIP service.
  • the registries contain the necessary information to support the processing of the packet flows in the VoIP sessions utilizing the hearing profiles of users having entries in the databases.
  • the establishment and maintenance of the hearing profile databases in the packet- switched network can be handled using a variety of network architectures, taking full advantage of the packet-switched network flexibility.
  • a central server may be used for acquiring, updating, and delivering hearing profiles for use in the VoIP sessions supported by the database.
  • distributed servers each maintaining hearing profile data for a limited number of users could be utilized which process only incoming packet flows.
  • the hearing profiles may be stored locally on the end stations, and no intermediate station need be involved in the processing of the packet flows according to the hearing profiles.
  • a combination of locales in the network may be used to support a particular session, according to the capability of the stations involved in the session.
  • a basic process executed at a locale in the packet-switched network according to the present invention for packet flows in a VoIP call is illustrated in Fig. 2.
  • the locale in the network either an end station or an intermediate station, detects a VoIP session by filtering the packet flow to detect headers or other control information tagging the flow as a VoIP session (step 50).
  • a profile of a recipient at the destination end station is retrieved from memory (step 51).
  • the packet flow is processed according to the hearing profile of the recipient (step 52).
  • the processed packets are forwarded to the destination end station (step 53).
  • FIG. 3 provides a simplified illustration of an H.323 network enhanced according to the present invention.
  • an H.323 gatekeeper process 100 is coupled to the Internet 101.
  • H.323 compliant end stations 102 and 103 are also coupled to the Internet 101.
  • the H.323 compliant gateway 104 is coupled to end stations, such as end station 105 which support other protocol suites for the VoIP session.
  • the gate keeper process 100 includes an H.323 registry 106 which is enhanced according to the present invention with a hearing profile database.
  • entries in the H.323 registry 106 have a format such as shown in Fig. 4.
  • the format includes a user identifier 120, and H.323 call parameters 121 which are developed according to the standard.
  • a pointer to a hearing profile, or a hearing profile itself is stored in the field 122.
  • the hearing profile processing capability for the user is stored as a parameter 123.
  • a few examples of the hearing profile pointer 122 include a URL, a memory address in local memory on the gate keeper, or a memory address in a particular storage device having another format.
  • the hearing profile processing capability parameter 123 indicates where the processing is available for applying the hearing profile.
  • parameter 124 indicates that digital signal processing resources are available locally on an end station of the user (e.g., end station 102), parameter 125 indicates that digital signal processing resources are available on a gateway 104 through which the user accesses the network, and parameter 126 indicates that digital signal processing resources at the gate keeper 100 are to be used for packet flows destined to the identified user.
  • Fig. 5 provides a simplified flow chart of the processing at the gate keeper 100 of Fig. 4 enhanced according to present invention.
  • the gate keeper detects an H.323 call set up session (step 150).
  • the hearing profile processing capability for the identified end stations is determined using the information in the registry (step 151).
  • the hearing profile or hearing profiles involved in the session are forwarded to the processors that are associated with the registered users.
  • the hearing profile is simply retrieved from the registry for use at the gate keeper processor itself (step 152).
  • the H.323 call set up procedures establish call paths for each direction, taking into account the locale in the network for processing of the audio signals with the hearing profiles of the users (step 153).
  • the call set up is completed (step 154).
  • Figs. 6, 7 and 8 illustrate alternative formats for the hearing profiles to be stored in the registry 106, or databases 15, 17, 19 in the examples shown in the Figures.
  • the hearing profile shown in Fig. 6 includes a customer ID 250, coefficients of transform equations 251, and an identifier of the playback device type and listening environment 252 in which the audio data product is to be played.
  • Fig. 6 includes a customer ID 250, coefficients of transform equations 251, and an identifier of the playback device type and listening environment 252 in which the audio data product is to be played.
  • the hearing profile includes a customer identifier 260, the audiogram for customer 261, listening condition data 262, and psycho-acoustic parameters 263 which relate to the listening characteristics of the customer.
  • the hearing profile shown in Fig. 8 includes a customer identifier 270, and software 271 along with code data structures 272 that in combination provide executable transform code for producing the customized audio data product from the selected audio data product.
  • the three examples of hearing profile formats shown in Figs. 6-8 are representative of a large number of hearing profile formats that could be utilized depending on the type of transform processes being executed, the type of audio products being delivered, and other factors related to the architecture of the system.
  • the system in various embodiments provides input tools, such as a graphical user interface usable, for example, at a kiosk or accessible via a network, for accepting input data concerning a hearing profile of a customer, either supported by a specialized hearing test device or by other resources for use in the test.
  • the input tools are used for creating a machine readable hearing profile, which is stored in a registry of hearing profiles for use in the production of customized audio data products.
  • the hearing profile is stored on a portable data storage medium for use by the customer, as described above.
  • the system provides tools for accepting input data characterizing the feedback from the customer.
  • Processing resources are provided for modifying the machine readable hearing profile in response to the feedback data.
  • the modifying of the machine readable hearing profile in one aspect of the invention comprises applying an optimization modeling process to the hearing profile, by which the profiles are improved over time using feedback arising from a variety of customized audio data products and from a variety of listening conditions.
  • the related applications incorporated by reference above, include detailed descriptions of technologies for gathering feedback and optimizing hearing profile data. While the present invention is disclosed by reference to the preferred embodiments and examples detailed above, it is to be understood that these examples are intended in an illustrative rather than in a limiting sense. It is contemplated that modifications and combinations will readily occur to those skilled in the art, which modifications and combinations will be within the spirit of the invention and the scope of the appended claims.

Abstract

Customized audio for VoIP sessions, or other packet-switched networks for carrying voice, is processed according to a hearing profile of the participants. A processor (13, 14, 17) coupled to a packet-switched communication network (12) processes a flow of packets carrying voice data according to the hearing profiles of the intended recipients. The flow of packets in various embodiments is compliant with a standard Internet Protocol, including, for example, ITU-T Recommendation H.323. The hearing profile data is stored in one embodiment with a registry of users (15, 18, 19). Resources are included in this embodiment to associate data from the registry characterizing a hearing profile of the user with a flow of packets in the packet-switched network for use in customizing the voice data.

Description

SYSTEM AND METHOD FOR DELIVERING CUSTOMIZED VOICE AUDIO DATA ON A PACKET-SWITCHED NETWORK
BACKGROUND OF THE INVENTION Field of the Invention
The present invention relates to the distribution and customization of voice audio data products, like telephone signals, via a packet-switched network such as networks based on the Internet Protocol (IP).
Description of Related Art
The hearing profiles of individuals vary in a number of ways. The ability to hear sounds varies with frequency among individuals across the normal audio frequency range. Also, the dynamic range varies among individuals so that levels of an audio stimulus that are perceived as soft sounds and levels of an audio stimulus that are perceived as loud sounds differ from person to person. Standard hearing tests are designed to produce an audiogram that characterizes such factors as frequency, sensitivity and dynamic range in the hearing profiles of individuals. There are also other factors that affect a hearing profile. For example, psycho-acoustic factors concerning the manner in which a person perceives combinations of normal sounds affect the ability to hear in ways that can vary from person to person. Also, environmental factors such as the usual listening environment of a person (library, conference room, concert hall) and the equipment on which the sound is produced (loud speakers, ear phones, telephone hand set) are important. In persons wearing hearing aids or using other assistive hearing devices, the type of aid or device affects the hearing profile. The physiology of an impairment suffered by the individual may also be an important factor in the hearing profile.
The hearing profiles of individuals have been applied in the hearing aid field for customizing and fitting hearing aids for individuals. See, for example, U.S. Patent No. 4,731,850 entitled PROGRAMMABLE DIGITAL HEARING AID SYSTEM, invented by Levitt et al.; and U.S. Patent No. 5,848,171 entitled HEARING AID DEVICE INCORPORATING SIGNAL PROCESSING TECHNIQUES, invented by Stockham, Jr. et al. Thus, techniques for processing sound to offset variations in hearing are well known. However, these techniques are unavailable to persons not using hearing aids. Furthermore, many persons who could benefit from such processing are not in positions to use hearing aids for a variety of reasons.
Telephone signals are distributed in the public switched telephone networks (PSTNs) in a form intended for persons having hearing profiles within a normal range. Proposals for applying custom processing of telephone signals in the PSTN according to a customer's hearing impairment have been presented. See, for example, U.S. Patent No. 5,388,185 entitled SYSTEM FOR ADAPTIVE PROCESSING OF TELEPHONE VOICE SIGNALS, invented by Terry et al. Such proposals have been limited to circuit-switched telephone networks with limitations inherent in such systems. Techniques are being developed for the transmission of real time voice over IP based networks, which are sometimes referred to as Voice over Internet Protocol (VoIP) technologies. One leading VoIP technology is based on ITU-T Recommendation H.323, "Packet-Based Multimedia Communications Systems," from the International Telecommunication Union which describes a set of protocols useful for supporting real time audio and video communication over IP networks.
Accordingly, it is desirable to provide systems and methods to apply techniques for the processing of real time audio data according to hearing profiles for the benefit of individuals who are not wearing hearing assistance devices. Also it is desirable to provide tools to simplify the gathering of information needed to develop hearing profiles of individuals, and to apply the gathered information in the distribution and customization of real time voice signals.
SUMMARY OF THE INVENTION The present invention provides customized audio for VoIP sessions, or other packet- switched networks for carrying voice, and systems and methods for producing the customized audio data for such sessions, processed according to a hearing profile of the participants. According to the present invention, the quality of VoIP telephone calls and other voice sessions on packet-switched networks is improved.
Thus, the present invention provides a processor coupled to a packet-switched communication network which processes a flow of packets carrying voice data according to the hearing profiles of the intended recipients. The flow of packets in various embodiments is compliant with a standard Internet Protocol. In one embodiment, the flow of packets is compliant with ITU-T Recommendation H.323. The hearing profile data is stored in one embodiment with a registry of users. Resources are included in this embodiment to associate data from the registry characterizing a hearing profile of the user with a flow of packets in the packet-switched network for use and customizing the voice data.
The invention includes a method for conimunication of voice signals in a packet-switched network that includes detecting a flow of packets carrying the voice signals on the network to a recipient, utilizing data within the packets. In addition, the method includes obtaining a hearing profile of the recipient from memory on at least one of an intermediate station or an end station in the network. Finally, the flow of packets is processed according to the hearing profile in a processor on one of an intermediate station and an end station in the network. A typical session on a packet-switched network carrying voice includes a first flow from a first participant in the call to the second adjustment in the call, and a second flow from the second participant to the first. The processing of the first and second flows can be positioned in different locales in the network. For example, a flow to a particular participant may be processed in a processor within the local area network of the participant. Data sent from the other participant in the session is delivered to the processor within the local area network of the participant via the Internet for processing prior to delivery. In this manner, the sending station need not have the hearing profile data available for the intended recipients. In alternative systems, the processing of hearing profile data is executing a single server within the network which stores the hearing profiles of the recipients of the data.
According to another embodiment of the invention, the method includes establishing a registry of users in a processor coupled to the network. The registry includes parameters indicating whether processing according to a hearing profile of a user is enabled for that user, and other parameters necessary for establishing the session and providing a hearing profile for use within the session. Accordingly, a processor detects a flow of packets carrying voice signals on the network to a recipient, the packets being identifiable by parameters within the packets as members of the flow. Next, the processor accesses the registry in response to the flow and if the recipient is a registered user, a hearing profile is obtained using parameters available from the registry from memory in the network. Finally, the flow of packets is processed according to a hearing profile. In another embodiment, the invention includes maintaining the registry in a gate keeper processor on the network, wherein the gate keeper processor performs functions compliant with ITU-T Recommendation H.323.
Accordingly, the present invention provides for the customization of voice sessions in packet-switched networks. A customization of voice sessions requires the management of user registries and of processing resources necessary for matching individual voice sessions with data relevant to the participants in the sessions. The present invention, by taking advantage of the power and flexibility of packet-switched networks such as the Internet, makes the delivery of voice sessions customized according to individual hearing profiles practical and cost-effective. Other aspects and advantages of the present invention can be seen on review of the drawings, the detailed description, and the claims which follow.
BRIEF DESCRIPTION OF THE FIGURES Fig. 1 illustrates a network serving voice over the Internet protocol with customized processing according to the hearing profile of users according to the present invention.
Fig. 2 is a simplified flow chart illustrating the processing at a locale in the network of a voice over the Internet Protocol session with user hearing profiles.
Fig. 3 is a simplified block diagram of an H.323 standard network enhanced with hearing profile processing according to the present invention. Fig. 4 illustrates one example entry in a registry of hearing profiles for the system of Fig.
3.
Fig. 5 is a simplified flow chart of a process for establishing an H.323 call with customized processing according to the present invention.
Fig. 6 is a diagram of one example of a hearing profile for use in producing customized audio for VoIP sessions.
Fig. 7 is a diagram of another example of a hearing profile for use in producing customized audio for VoIP sessions.
Fig. 8 is a diagram of yet another example of a hearing profile for use in producing customized audio for VoIP sessions.
DETAILED DESCRIPTION A detailed description of the various embodiments of the present invention is provided with reference to Figs. 1 through 8.
Fig. 1 provides an illustration of one example network supporting VoIP telephone calls with audio data processed according to the hearing profiles of the participants in the call. The example network includes local area network 10 and local area network 11 which are coupled to one another through an Internet protocol network 12. Also, a gateway server 13 is coupled to the Internet protocol network 12 and to the public switched telephone network PSTN 16. Local area network 10 includes VoIP server/router 14 which couples the local area network 10 to the Internet protocol network 12. Local area network 11 includes a VoIP server/router 17 which couples the local area network 11 to the Internet protocol network 12.
End stations illustrated in the example of Fig. 1 include the telephones 20, 21, 22 coupled to the local area network 10, and the voice enabled personal computer 23 coupled to the local area network 10. Also, the stations coupled to the local area network 11 include telephones 25 and 26, and voice or multimedia enabled personal computers 27 and 28. In the public switched telephone network 16, a central office switch 30 connects end station telephones 31 and 32 into the network.
The intermediate station servers 14 and 17 are coupled to hearing profile databases 15 and 18 to support processing of VoIP sessions according to the hearing profiles of the participants in the call. Thus for example, when a call is established between end station 28 on local area network 1 1, and end station 22 on local area network 10, the VoIP server/router 17 associates the hearing profile of the recipient using end station 28 with the incoming packet flow of the session, and processes the audio data using the hearing profile. Likewise, the VoIP server/router 14 coupled to the local area network 10 associates the hearing profile of the recipient using the telephone 22 with the incoming packet flow of the session, and processes the audio data using the hearing profile as it is delivered to the telephone 22. In a similar manner, when a VoIP session is established between an end station, such as end station 25 on local area network 11, and a telephone coupled to the public switched telephone network 16, such as telephone 32, the VoIP gateway 13 associates the hearing profile of the recipient at the telephone 32 with incoming packets in the VoIP session, and processes the data using the hearing profile. The processed data is then re-formatted for the circuit switched public switched telephone network for delivery to the recipient at the telephone 32. The hearing profile databases 15, 18, 19 illustrated in the example of Fig. 1 maintain registries of users of the VoIP service. The registries contain the necessary information to support the processing of the packet flows in the VoIP sessions utilizing the hearing profiles of users having entries in the databases. The establishment and maintenance of the hearing profile databases in the packet- switched network can be handled using a variety of network architectures, taking full advantage of the packet-switched network flexibility. For example, a central server may be used for acquiring, updating, and delivering hearing profiles for use in the VoIP sessions supported by the database. Alternatively, distributed servers, each maintaining hearing profile data for a limited number of users could be utilized which process only incoming packet flows. In yet another alternative, the hearing profiles may be stored locally on the end stations, and no intermediate station need be involved in the processing of the packet flows according to the hearing profiles. A combination of locales in the network may be used to support a particular session, according to the capability of the stations involved in the session. A basic process executed at a locale in the packet-switched network according to the present invention for packet flows in a VoIP call is illustrated in Fig. 2. The locale in the network, either an end station or an intermediate station, detects a VoIP session by filtering the packet flow to detect headers or other control information tagging the flow as a VoIP session (step 50). Upon detection of a VoIP session, a profile of a recipient at the destination end station is retrieved from memory (step 51). Next, the packet flow is processed according to the hearing profile of the recipient (step 52). Finally, the processed packets are forwarded to the destination end station (step 53).
As mentioned above, one popular suite of protocols developed for packet based multimedia communication systems such as Internet protocol based network systems used for VoIP sessions is known as ITU-T Recommendation H.323. Recommendation H.323 is incorporated by reference as if fully set forth herein. Fig. 3 provides a simplified illustration of an H.323 network enhanced according to the present invention. In the network of Fig. 3, an H.323 gatekeeper process 100 is coupled to the Internet 101. H.323 compliant end stations 102 and 103 are also coupled to the Internet 101. The H.323 compliant gateway 104 is coupled to end stations, such as end station 105 which support other protocol suites for the VoIP session. The gate keeper process 100 includes an H.323 registry 106 which is enhanced according to the present invention with a hearing profile database. Thus, for example, entries in the H.323 registry 106 have a format such as shown in Fig. 4. The format includes a user identifier 120, and H.323 call parameters 121 which are developed according to the standard. A pointer to a hearing profile, or a hearing profile itself is stored in the field 122. The hearing profile processing capability for the user is stored as a parameter 123. A few examples of the hearing profile pointer 122 include a URL, a memory address in local memory on the gate keeper, or a memory address in a particular storage device having another format. The hearing profile processing capability parameter 123 indicates where the processing is available for applying the hearing profile. Thus, parameter 124 indicates that digital signal processing resources are available locally on an end station of the user (e.g., end station 102), parameter 125 indicates that digital signal processing resources are available on a gateway 104 through which the user accesses the network, and parameter 126 indicates that digital signal processing resources at the gate keeper 100 are to be used for packet flows destined to the identified user. Fig. 5 provides a simplified flow chart of the processing at the gate keeper 100 of Fig. 4 enhanced according to present invention. During call establishment, the gate keeper detects an H.323 call set up session (step 150). The hearing profile processing capability for the identified end stations is determined using the information in the registry (step 151). Next, the hearing profile or hearing profiles involved in the session are forwarded to the processors that are associated with the registered users. Alternatively, the hearing profile is simply retrieved from the registry for use at the gate keeper processor itself (step 152). The H.323 call set up procedures establish call paths for each direction, taking into account the locale in the network for processing of the audio signals with the hearing profiles of the users (step 153). Finally, the call set up is completed (step 154). Figs. 6, 7 and 8 illustrate alternative formats for the hearing profiles to be stored in the registry 106, or databases 15, 17, 19 in the examples shown in the Figures. The hearing profile shown in Fig. 6 includes a customer ID 250, coefficients of transform equations 251, and an identifier of the playback device type and listening environment 252 in which the audio data product is to be played. In Fig. 7, the hearing profile includes a customer identifier 260, the audiogram for customer 261, listening condition data 262, and psycho-acoustic parameters 263 which relate to the listening characteristics of the customer. The hearing profile shown in Fig. 8 includes a customer identifier 270, and software 271 along with code data structures 272 that in combination provide executable transform code for producing the customized audio data product from the selected audio data product. The three examples of hearing profile formats shown in Figs. 6-8 are representative of a large number of hearing profile formats that could be utilized depending on the type of transform processes being executed, the type of audio products being delivered, and other factors related to the architecture of the system.
The system in various embodiments provides input tools, such as a graphical user interface usable, for example, at a kiosk or accessible via a network, for accepting input data concerning a hearing profile of a customer, either supported by a specialized hearing test device or by other resources for use in the test. The input tools are used for creating a machine readable hearing profile, which is stored in a registry of hearing profiles for use in the production of customized audio data products. Alternatively, the hearing profile is stored on a portable data storage medium for use by the customer, as described above. The related applications incorporated by reference above, include detailed descriptions of technologies for conducting hearing tests and for gathering hearing profile data.
Also, in various embodiments of the invention, the system provides tools for accepting input data characterizing the feedback from the customer. Processing resources are provided for modifying the machine readable hearing profile in response to the feedback data. The modifying of the machine readable hearing profile in one aspect of the invention comprises applying an optimization modeling process to the hearing profile, by which the profiles are improved over time using feedback arising from a variety of customized audio data products and from a variety of listening conditions. The related applications, incorporated by reference above, include detailed descriptions of technologies for gathering feedback and optimizing hearing profile data. While the present invention is disclosed by reference to the preferred embodiments and examples detailed above, it is to be understood that these examples are intended in an illustrative rather than in a limiting sense. It is contemplated that modifications and combinations will readily occur to those skilled in the art, which modifications and combinations will be within the spirit of the invention and the scope of the appended claims.

Claims

What is claimed is:
1. For a packet-switched communication network, a station comprising: a processor coupled to the network which processes a flow of packets carrying voice data according to hearing profiles of intended recipients.
2. The station of claim 1, wherein the flow of packets comprises packets compliant with a standard Internet Protocol.
3. The station of claim 1, wherein the flow of packets comprises packets compliant with ITU-T Recommendation H.323.
4. The station of claim 1, including a registry of users, and including resources to associate data from the registry characterizing a hearing profile of a user in the registry with the flow of packets.
5. A method for communication of voice signals in a packet-switched network, comprising: detecting a flow of packets carrying the voice signals on the network to a recipient, the packets identifiable as members of the flow; obtaining a hearing profile of the recipient from memory on at least one of an intermediate station and an end station in the network; and processing the flow of packets according to the hearing profile in a processor on at least one of an intermediate station and an end station in the network.
6. The method of claim 5, including filtering packets received in the processor to detect members of the flow.
7. The method of claim 5, wherein the flow originates at an originating end station in the network, and the memory is on the originating end station.
8. The method of claim 5, wherein the memory is on an intermediate station in the network.
9. The method of claim 5, wherein the flow is transmitted to a destination end station in the network, and the memory is on the destination end station.
10. A method for communication of voice signals in a packet-switched network, comprising: establishing a registry of users in a processor coupled to the network, the registry including parameters indicating whether processing according to a hearing profile of a user is enabled for the user; detecting a flow of packets carrying the voice signals on the network to a recipient, the packets identifiable as members of the flow; accessing the registry in response to the flow, and if the recipient is a registered user, obtaining a hearing profile of the recipient from memory in the network; and processing the flow of packets according to the hearing profile in a processor in the network.
11. The method of claim 10, including filtering packets in the processor in the network to detect members of the flow.
12. The method of claim 10, wherein the flow originates at an originating end station in the network, and the memory is on the originating end station.
13. The method of claim 10, wherein the memory is on an intermediate station in the network.
14. The method of claim 10, wherein the flow is transmitted to a destination end station in the network, and the memory is on the destination end station.
15. The method of claim 10, including maintaining the registry in a gate keeper processor on the network, and wherein the gate keeper processor performs functions compliant with ITU-T Recommendation H.323.
16. The method of claim 10, wherein the parameters include a pointer to a hearing profile for the registered user.
17. The method of claim 10, wherein the processor is on an intermediate station in the network, and including routing the flow through the intermediate station.
18. The method of claim 10, wherein the processor is on an end station associated with the recipient, and including transmitting a copy of the hearing profile from the memory in the network to the end station.
EP00966851A 1999-09-28 2000-09-25 System and method for delivering customized voice audio data on a packet-switched network Withdrawn EP1216554A4 (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US407751 1999-09-28
US09/407,751 US7181297B1 (en) 1999-09-28 1999-09-28 System and method for delivering customized audio data
US46403699A 1999-12-15 1999-12-15
US464036 1999-12-15
US49525500A 2000-01-31 2000-01-31
US495255 2000-01-31
PCT/US2000/026304 WO2001024462A1 (en) 1999-09-28 2000-09-25 System and method for delivering customized voice audio data on a packet-switched network

Publications (2)

Publication Number Publication Date
EP1216554A1 EP1216554A1 (en) 2002-06-26
EP1216554A4 true EP1216554A4 (en) 2005-10-12

Family

ID=27410724

Family Applications (1)

Application Number Title Priority Date Filing Date
EP00966851A Withdrawn EP1216554A4 (en) 1999-09-28 2000-09-25 System and method for delivering customized voice audio data on a packet-switched network

Country Status (5)

Country Link
EP (1) EP1216554A4 (en)
JP (1) JP2003515261A (en)
CN (1) CN1390408A (en)
AU (1) AU7713500A (en)
WO (1) WO2001024462A1 (en)

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1552723A4 (en) * 2002-10-15 2010-02-17 Korea Electronics Telecomm Apparatus and method for adapting audio signal according to user's preference
US9378187B2 (en) 2003-12-11 2016-06-28 International Business Machines Corporation Creating a presentation document
US7634412B2 (en) 2003-12-11 2009-12-15 Nuance Communications, Inc. Creating a voice response grammar from a user grammar
US7162692B2 (en) 2003-12-11 2007-01-09 International Business Machines Corporation Differential dynamic content delivery
US8499232B2 (en) 2004-01-13 2013-07-30 International Business Machines Corporation Differential dynamic content delivery with a participant alterable session copy of a user profile
US7567908B2 (en) 2004-01-13 2009-07-28 International Business Machines Corporation Differential dynamic content delivery with text display in dependence upon simultaneous speech
US8001454B2 (en) 2004-01-13 2011-08-16 International Business Machines Corporation Differential dynamic content delivery with presentation control instructions
US7890848B2 (en) 2004-01-13 2011-02-15 International Business Machines Corporation Differential dynamic content delivery with alternative content presentation
US7571380B2 (en) 2004-01-13 2009-08-04 International Business Machines Corporation Differential dynamic content delivery with a presenter-alterable session copy of a user profile
US8954844B2 (en) 2004-01-13 2015-02-10 Nuance Communications, Inc. Differential dynamic content delivery with text display in dependence upon sound level
US7287221B2 (en) 2004-01-13 2007-10-23 International Business Machines Corporation Differential dynamic content delivery with text display in dependence upon sound level
US7430707B2 (en) 2004-01-13 2008-09-30 International Business Machines Corporation Differential dynamic content delivery with device controlling action
US7519683B2 (en) 2004-04-26 2009-04-14 International Business Machines Corporation Dynamic media content for collaborators with client locations in dynamic client contexts
US7831906B2 (en) 2004-04-26 2010-11-09 International Business Machines Corporation Virtually bound dynamic media content for collaborators
US7519659B2 (en) 2004-04-26 2009-04-14 International Business Machines Corporation Dynamic media content for collaborators
US7827239B2 (en) 2004-04-26 2010-11-02 International Business Machines Corporation Dynamic media content for collaborators with client environment information in dynamic client contexts
US7487208B2 (en) 2004-07-08 2009-02-03 International Business Machines Corporation Differential dynamic content delivery to alternate display device locations
US7428698B2 (en) 2004-07-08 2008-09-23 International Business Machines Corporation Differential dynamic delivery of content historically likely to be viewed
US7519904B2 (en) 2004-07-08 2009-04-14 International Business Machines Corporation Differential dynamic delivery of content to users not in attendance at a presentation
US7921362B2 (en) 2004-07-08 2011-04-05 International Business Machines Corporation Differential dynamic delivery of presentation previews
US8185814B2 (en) 2004-07-08 2012-05-22 International Business Machines Corporation Differential dynamic delivery of content according to user expressions of interest
US7487209B2 (en) 2004-07-13 2009-02-03 International Business Machines Corporation Delivering dynamic media content for collaborators to purposeful devices
US7426538B2 (en) 2004-07-13 2008-09-16 International Business Machines Corporation Dynamic media content for collaborators with VOIP support for client communications
US9167087B2 (en) 2004-07-13 2015-10-20 International Business Machines Corporation Dynamic media content for collaborators including disparate location representations
US7475340B2 (en) 2005-03-24 2009-01-06 International Business Machines Corporation Differential dynamic content delivery with indications of interest from non-participants
US7493556B2 (en) 2005-03-31 2009-02-17 International Business Machines Corporation Differential dynamic content delivery with a session document recreated in dependence upon an interest of an identified user participant
US7523388B2 (en) 2005-03-31 2009-04-21 International Business Machines Corporation Differential dynamic content delivery with a planned agenda
US9419843B2 (en) * 2006-07-06 2016-08-16 Broadcom Corporation Custom ASIC combining VoIP and security IP
US9138178B2 (en) 2010-08-05 2015-09-22 Ace Communications Limited Method and system for self-managed sound enhancement
CN104937954B (en) * 2013-01-09 2019-06-28 听优企业 Method and system for the enhancing of Self management sound
KR20180075642A (en) * 2015-10-29 2018-07-04 와이덱스 에이/에스 System and method for managing customizable configurations of hearing aids

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5388185A (en) * 1991-09-30 1995-02-07 U S West Advanced Technologies, Inc. System for adaptive processing of telephone voice signals
EP0719016A2 (en) * 1994-12-01 1996-06-26 Sharp Kabushiki Kaisha Communication equipment
US5559855A (en) * 1994-11-29 1996-09-24 Lucent Technologies Inc. System and method for recognizing and routing telephone calls involving hearing or speech impaired persons
US5832212A (en) * 1996-04-19 1998-11-03 International Business Machines Corporation Censoring browser method and apparatus for internet viewing
WO1999041880A1 (en) * 1998-02-12 1999-08-19 Qsound Labs, Inc. Teleconferencing method and apparatus with three-dimensional sound positioning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5388185A (en) * 1991-09-30 1995-02-07 U S West Advanced Technologies, Inc. System for adaptive processing of telephone voice signals
US5559855A (en) * 1994-11-29 1996-09-24 Lucent Technologies Inc. System and method for recognizing and routing telephone calls involving hearing or speech impaired persons
EP0719016A2 (en) * 1994-12-01 1996-06-26 Sharp Kabushiki Kaisha Communication equipment
US5832212A (en) * 1996-04-19 1998-11-03 International Business Machines Corporation Censoring browser method and apparatus for internet viewing
WO1999041880A1 (en) * 1998-02-12 1999-08-19 Qsound Labs, Inc. Teleconferencing method and apparatus with three-dimensional sound positioning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of WO0124462A1 *

Also Published As

Publication number Publication date
WO2001024462A1 (en) 2001-04-05
CN1390408A (en) 2003-01-08
JP2003515261A (en) 2003-04-22
EP1216554A1 (en) 2002-06-26
AU7713500A (en) 2001-04-30

Similar Documents

Publication Publication Date Title
EP1216554A1 (en) System and method for delivering customized voice audio data on a packet-switched network
US7075921B2 (en) Remote media control for voice over internet telephony and related applications
US6707811B2 (en) Internet telephony for ecommerce
US6791974B1 (en) Universal internet based telephony system that provides ubiquitous access for subscribers from any terminal device
CN100448218C (en) Method for receiving call
US9020621B1 (en) Network based media enhancement function based on an identifier
JP2007006047A (en) Voice ip telephone method and device
JP2006512016A (en) Method and apparatus for realizing call processing in packet telephone network
KR20010041478A (en) Establishing a voice call from a client computer via a bridgeport
KR20000062885A (en) Computer network telephony
WO2001045346A2 (en) A client-server network for managing internet protocol voice packets
US20150269953A1 (en) Audio signal manipulation for speech enhancement before sound reproduction
US20140278402A1 (en) Automatic Channel Selective Transcription Engine
US7620158B2 (en) Video relay system and method
WO2014143827A1 (en) Dynamic personalization of a communication session in heterogeneous environments
US20040156493A1 (en) Method and apparatus for providing a central telephony service for a calling party at the called party telephone
US20030076817A1 (en) Apparatus and method for interfacing internet telephone
US6707827B1 (en) Method and apparatus for optimizing audio response in telephony-over-LAN systems
JP4502835B2 (en) Mobile communication system and mobile phone notification method used therefor
TWI240516B (en) System for automatically selecting voice data transmission and reception system for IP network, method thereof, and IP terminal
EP0964567A2 (en) A programmable telecommunications interface for communication over a data network
AU1899201A (en) Method and system for transmitting communication signals through internet and a public switched telephone network
Cisco E
US6603848B1 (en) Techniques for providing caller name announcement
KR20040022738A (en) SMS system of internet visual phone

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20020328

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

A4 Supplementary search report drawn up and despatched

Effective date: 20050829

RIC1 Information provided on ipc code assigned before grant

Ipc: 7H 04L 29/06 A

18W Application withdrawn

Effective date: 20050921