US20110137656A1 - Sound classification system for hearing aids - Google Patents

Sound classification system for hearing aids Download PDF

Info

Publication number
US20110137656A1
US20110137656A1 US12/879,218 US87921810A US2011137656A1 US 20110137656 A1 US20110137656 A1 US 20110137656A1 US 87921810 A US87921810 A US 87921810A US 2011137656 A1 US2011137656 A1 US 2011137656A1
Authority
US
United States
Prior art keywords
sound
classification
classifying
features
hearing aid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/879,218
Inventor
JuanJuan Xiang
Martin Mckinney
Kelly Fitz
Tao Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Starkey Laboratories Inc
Original Assignee
Starkey Laboratories Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Starkey Laboratories Inc filed Critical Starkey Laboratories Inc
Priority to US12/879,218 priority Critical patent/US20110137656A1/en
Assigned to STARKEY LABORATORIES, INC. reassignment STARKEY LABORATORIES, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: XIANG, JUANJUAN, MCKINNEY, MARTIN, ZHANG, TAO, FITZ, KELLY
Publication of US20110137656A1 publication Critical patent/US20110137656A1/en
Assigned to CITIBANK, N.A., AS ADMINISTRATIVE AGENT reassignment CITIBANK, N.A., AS ADMINISTRATIVE AGENT NOTICE OF GRANT OF SECURITY INTEREST IN PATENTS Assignors: STARKEY LABORATORIES, INC.
Priority to US16/741,114 priority patent/US11250878B2/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/50Customised settings for obtaining desired overall acoustical characteristics
    • H04R25/505Customised settings for obtaining desired overall acoustical characteristics using digital signal processing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/50Customised settings for obtaining desired overall acoustical characteristics
    • H04R25/505Customised settings for obtaining desired overall acoustical characteristics using digital signal processing
    • H04R25/507Customised settings for obtaining desired overall acoustical characteristics using digital signal processing implemented by neural network or fuzzy logic

Definitions

  • This document relates generally to hearing aids and more particularly to a sound classification system for hearing aids.
  • Hearing aid users are typically exposed to a variety of listening situations, such as speech, music or noisy environment.
  • the behavior of the instrument for instance the activation of a directional microphone, or the compression/expansion parameters, should adapt to the currently engaged environment. This indicates the need for sound classification algorithms functioning as a front end to the rest of the signal processing scheme housed in the instruments [1].
  • a hearing aid includes a sound classification module to classify environmental sound sensed by a microphone.
  • the sound classification module executes an advanced sound classification algorithm.
  • the hearing aid then processes the sound according to the classification.
  • FIG. 1 is a block diagram illustrating an embodiment of a hearing aid including a processor with a sound classification module.
  • FIG. 2 is a block diagram illustrating an embodiment of a method for operating a hearing aid including an advanced sound classification function.
  • FIG. 3 is a graph showing performance of sound classification including error rate plotted as a function of the number of employed features.
  • FIG. 4 is a graph showing performance of sound classification including computational cost plotted as a function of the number of employed features.
  • FIG. 5 is a graph showing performance of sound classification including error rate plotted as a function of the length of test sequence.
  • FIG. 6 is a graph showing performance of sound classification including error rate plotted as a function of the structures of classification system.
  • Sound classification has been studied under different contexts, such as speech/music discrimination [2, 3], environment sounds classification [4], and content-based audio classification [5, 6]. Compared with these applications, sounds classification in hearing aids is more challenging due to the limited power consumption, the real time operation and the great varieties of sounds encountered in the real life. So far, a couple of simple features and classifier schemes, such as a threshold-based classifier, have been implemented in hearing aids to identify speech, noise and speech in noise [7]. When more kinds of sounds need to be classified, advanced classifiers and features are needed to achieve a satisfactory performance. The present sound classification system provides a hearing aid with such advanced classifiers and features.
  • results of a systemic evaluation of the present sound classification system including the impact of sophisticated features and classifiers on the classification rate, computational cost and classification delay are presented.
  • the classification system which is intended to detect speech, music and several kinds of noises is constructed.
  • the performance of two feature sets including both low level features and Mel-scale Frequency cepstral coefficients (MFCC) are compared by applying Gaussian classifiers, Hidden Markov Model (HMM), and Gaussian Mixture Model (GMM) individually.
  • HMM Hidden Markov Model
  • GMM Gaussian Mixture Model
  • FIG. 1 is a block diagram illustrating an embodiment of a hearing aid 100 .
  • Hearing aid 100 includes a microphone 102 , a processor 104 , and a receiver 106 .
  • Microphone 102 senses an environmental sound.
  • Processor 104 includes a sound classification module that classifies the sensed sound by executing an advanced classification algorithm. Examples of the advanced classification algorithm include HMM and GMM.
  • Processor 104 processes the sound using an outcome of the classification of the sound for specified hearing assistance functions.
  • Receiver 106 transmits the processed sound for perception by the hearing aid user.
  • the sound classification module uses a two-stage environment classification scheme.
  • the signal is first classified as music, speech or non-speech.
  • the non-speech sounds are further characterized as machine noise, wind noise or other sounds.
  • the classification performance and the associated computational cost are evaluated along three dimensions: the choice of classifiers, the choice of feature sets and number of features within each feature set. Each component is further discussed below.
  • the sound classification module includes one of two feature groups, specifically a low level feature set, and MFCCs.
  • the former consists of both temporal and spectral features, such as zero crossing rate, short time energy, spectral centroid, spectral bandwidth, spectral roll-off, spectral flux, high/low energy ratio, etc.
  • the logarithms of these features are included in the set as well.
  • the first 12 coefficients are included in the MFCC set [9].
  • the evaluation of the sound classification module is performed on a database composed of sounds from five classes: music, speech, wind noise, machine noise and others.
  • the music content is taken from a database collected by Scheirer and Slaney [3], which contains 80 15-seconds long audio music samples covering different genres, such as classical music, rock and pop songs, folk music, etc.
  • the remaining samples are recordings made by the authors.
  • the acoustic signals from a variety of auditory scenes are picked up by a microphone located in a BTE hearing aid first and then are stored in a Sony TCD-D8 DAT recorder with a 16-bit resolution and a 48 kHz sampling frequency.
  • the recordings are manually divided according to recording environment and then segmented using a short-term energy detector, followed by manual verification and adjustment of the segment boundaries. The resulting segments are used for training and testing the classification system.
  • the class “speech” includes both clean and noisy speech.
  • the clean speech comprise of speech spoken by different peoples at different reverberation situations, such as a living room or a cafeteria.
  • the noisy speech is generated by mixing randomly selected files from the clean speech class with noise at three levels of SNR: ⁇ 6 dB, 0 dB and 6 dB.
  • the class “machine noise” contains the noise generated by various machineries, such as automobile, vacuum and blender.
  • the class “others” is the most varying category comprising any sounds that are not suitably described by the other three classes, for instance the sounds from water running, foot stepping, etc. The duration of the samples of each class is listed in Table 1.
  • Computational cost is a critical constraint concerning the application of various classification algorithms in hearing aids, mainly due to the limitation on power consumption and real time operation.
  • the cost of a classification algorithm consists of two parts, feature evaluation and classification. The former is related with the length of analysis window and the resolution of the Fourier Frequency Transform (FFT), while the latter is mainly determined by the number of sound classes, the dimension of the employed feature vector. For a GMM and HMM classifier, the number of components and states affect the computational cost as well. At each classification stage, the computational cost is measured in terms of number of operations and evaluated along three dimensions: choice of models, choice of feature sets, and number of selected features, just as in the performance evaluation.
  • FFT Fourier Frequency Transform
  • FIG. 2 is a block diagram illustrating an embodiment of a method 200 for operating a hearing aid including an advanced sound classification function. In one embodiment, method 200 is applied to operate hearing aid 100 .
  • an environmental sound is sensed.
  • the sensed sound is classified by executing an advanced classification algorithm.
  • the sound is processed using an outcome of the classification of the sound for specified hearing assistance functions.
  • FIG. 3 is a graph showing error rates each plotted as a function of the number of employed features.
  • the performance is evaluated over two feature sets (low level and MFCC) and three models (Gaussian, GMM and HMM).
  • the error rate obtained at each iteration of the forward sequential feature selection process is plotted.
  • the various classifiers and feature sets are indicated by line styles and marker styles, respectively.
  • FIG. 4 is a graph showing performance of sound classification including computational costs each plotted as a function of the number of employed features.
  • the performance is evaluated over advanced models (GMM and HMM) and low level.
  • the recommended number of features is indicated in FIG. 4 by a gray patch. It is noteworthy that when combining the computational cost from the two stages, some features are identified as optimal features on both stages but need only be calculated once. Thus the overall cost is less than the direct summation over the two stages.
  • FIG. 3 shows that the increased number of features results both decreased error rates and increased associated computational costs, thus indicating a conflict between performance and computational complexity. It seems that choosing five to seven features is a reasonable comprise between the two factors. Using this number of features, the error rate is about 10% and the computational cost is still manageable. On the other hand, using more than seven features only slightly improves the performance but incurs great computational cost.
  • the impact of the test sequence length on performance when using low level features and advanced models is also examined.
  • the error rates are each plotted as a function of the test sequence length in FIG. 5 .
  • the performance is evaluated over advanced models and low level feature set. As expected, increasing the length of test sequence improves the classification performance.
  • An approximate 20% decrease of error rate is obtained by increasing the test sequence from 128 ms to 256 ms. This benefit is diminished with further increase in the length of the test sequence. The overall pattern of the rate of decrease seems to be consistent across classifiers.
  • FIG. 6 shows error rates each plotted as a function of the structures of classification system. The performance is evaluated over a hierarchy structure and a flat structure. A low level feature set is used with both structures. For each case the low level features are employed in the feature selection process and the lowest error rate are presented. It seems that the flat classification scheme has a slight advantage over the two-stage one, at the expense of the flexibility and computational complexity.

Abstract

A hearing aid includes a sound classification module to classify environmental sound sensed by a microphone. The sound classification module executes an advanced sound classification algorithm. The hearing aid then processes the sound according to the classification.

Description

    CLAIM OF PRIORITY
  • The present application claims the benefit under 35 U.S.C. 119(e) of U.S. Provisional Patent Application Ser. No. 61/241,735, filed Sep. 11, 2009, which is incorporated herein by reference in its entirety
  • TECHNICAL FIELD
  • This document relates generally to hearing aids and more particularly to a sound classification system for hearing aids.
  • BACKGROUND
  • Hearing aid users are typically exposed to a variety of listening situations, such as speech, music or noisy environment. To yield the best listening experience for them, the behavior of the instrument, for instance the activation of a directional microphone, or the compression/expansion parameters, should adapt to the currently engaged environment. This indicates the need for sound classification algorithms functioning as a front end to the rest of the signal processing scheme housed in the instruments [1].
  • SUMMARY
  • A hearing aid includes a sound classification module to classify environmental sound sensed by a microphone. The sound classification module executes an advanced sound classification algorithm. The hearing aid then processes the sound according to the classification.
  • This Summary is an overview of some of the teachings of the present application and not intended to be an exclusive or exhaustive treatment of the present subject matter. Further details about the present subject matter are found in the detailed description and appended claims. The scope of the present invention is defined by the appended claims and their legal equivalents.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram illustrating an embodiment of a hearing aid including a processor with a sound classification module.
  • FIG. 2 is a block diagram illustrating an embodiment of a method for operating a hearing aid including an advanced sound classification function.
  • FIG. 3 is a graph showing performance of sound classification including error rate plotted as a function of the number of employed features.
  • FIG. 4 is a graph showing performance of sound classification including computational cost plotted as a function of the number of employed features.
  • FIG. 5 is a graph showing performance of sound classification including error rate plotted as a function of the length of test sequence.
  • FIG. 6 is a graph showing performance of sound classification including error rate plotted as a function of the structures of classification system.
  • DETAILED DESCRIPTION
  • The following detailed description of the present subject matter refers to subject matter in the accompanying drawings which show, by way of illustration, specific aspects and embodiments in which the present subject matter may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the present subject matter. References to “an”, “one”, or “various” embodiments in this disclosure are not necessarily to the same embodiment, and such references contemplate more than one embodiment. The following detailed description is demonstrative and not to be taken in a limiting sense. The scope of the present subject matter is defined by the appended claims, along with the full scope of legal equivalents to which such claims are entitled.
  • Automatic program switching has been shown to be of great beneficial for hearing aids users. This feature is mediated by a sound classification system, which is traditionally implemented using simple features and heuristic classification schemes, resulting in an unsatisfactory performance in a complicated auditory scenario.
  • Sound classification has been studied under different contexts, such as speech/music discrimination [2, 3], environment sounds classification [4], and content-based audio classification [5, 6]. Compared with these applications, sounds classification in hearing aids is more challenging due to the limited power consumption, the real time operation and the great varieties of sounds encountered in the real life. So far, a couple of simple features and classifier schemes, such as a threshold-based classifier, have been implemented in hearing aids to identify speech, noise and speech in noise [7]. When more kinds of sounds need to be classified, advanced classifiers and features are needed to achieve a satisfactory performance. The present sound classification system provides a hearing aid with such advanced classifiers and features. In this document, results of a systemic evaluation of the present sound classification system, including the impact of sophisticated features and classifiers on the classification rate, computational cost and classification delay are presented. The classification system which is intended to detect speech, music and several kinds of noises is constructed. The performance of two feature sets, including both low level features and Mel-scale Frequency cepstral coefficients (MFCC), are compared by applying Gaussian classifiers, Hidden Markov Model (HMM), and Gaussian Mixture Model (GMM) individually.
  • FIG. 1 is a block diagram illustrating an embodiment of a hearing aid 100. Hearing aid 100 includes a microphone 102, a processor 104, and a receiver 106. Microphone 102 senses an environmental sound. Processor 104 includes a sound classification module that classifies the sensed sound by executing an advanced classification algorithm. Examples of the advanced classification algorithm include HMM and GMM. Processor 104 processes the sound using an outcome of the classification of the sound for specified hearing assistance functions. Receiver 106 transmits the processed sound for perception by the hearing aid user.
  • The sound classification module uses a two-stage environment classification scheme. The signal is first classified as music, speech or non-speech. Then the non-speech sounds are further characterized as machine noise, wind noise or other sounds. At each stage, the classification performance and the associated computational cost are evaluated along three dimensions: the choice of classifiers, the choice of feature sets and number of features within each feature set. Each component is further discussed below.
  • Choosing appropriate features to be implemented in the sound classification module is a domain-specific question. Based on previous work [1-3, 8], the sound classification module includes one of two feature groups, specifically a low level feature set, and MFCCs. The former consists of both temporal and spectral features, such as zero crossing rate, short time energy, spectral centroid, spectral bandwidth, spectral roll-off, spectral flux, high/low energy ratio, etc. The logarithms of these features are included in the set as well. The first 12 coefficients are included in the MFCC set [9]. There are some other features proposed in literature, such as cepstral modulation ratio [10] and several psychoacoustic features [8, 11]. These features are not investigated here either due to their high computational cost or because the calculation of these features is not well defined.
  • Within each set, some features may be redundant or noisy or simply have weak discriminative capability. To identify optimal features, a forward sequential feature selection algorithm is employed [12]. It is noteworthy that the derived feature set is specific to the choice of classifiers, which are discussed in the following section.
  • Many pattern-recognition techniques have been proposed and used in various fields. However, for hearing aids application, it is often beneficial to keep computational cost low. For this purpose, this study focuses on three classification algorithms: a quadratic Gaussian classifier, a GMM with 5 components, and an ergodic HMM with 5 states and 5 components [13]. The feature selection algorithm is performed for each classifier. The training of GMM and HMM is carried out using the expectation-maximization (EM) algorithm and in the test phase the decision is made based on the Viterbi decoder [14]. To examine the robustness of performance for a given combination of classifiers and features, a 4-fold cross-validation testing procedure is employed to determine the average classification rate [13].
  • The evaluation of the sound classification module is performed on a database composed of sounds from five classes: music, speech, wind noise, machine noise and others. The music content is taken from a database collected by Scheirer and Slaney [3], which contains 80 15-seconds long audio music samples covering different genres, such as classical music, rock and pop songs, folk music, etc. The remaining samples are recordings made by the authors. The acoustic signals from a variety of auditory scenes are picked up by a microphone located in a BTE hearing aid first and then are stored in a Sony TCD-D8 DAT recorder with a 16-bit resolution and a 48 kHz sampling frequency. The recordings are manually divided according to recording environment and then segmented using a short-term energy detector, followed by manual verification and adjustment of the segment boundaries. The resulting segments are used for training and testing the classification system.
  • The class “speech” includes both clean and noisy speech. The clean speech comprise of speech spoken by different peoples at different reverberation situations, such as a living room or a cafeteria. The noisy speech is generated by mixing randomly selected files from the clean speech class with noise at three levels of SNR: −6 dB, 0 dB and 6 dB. The class “machine noise” contains the noise generated by various machineries, such as automobile, vacuum and blender. The class “others” is the most varying category comprising any sounds that are not suitably described by the other three classes, for instance the sounds from water running, foot stepping, etc. The duration of the samples of each class is listed in Table 1.
  • TABLE 1
    list of the recorded sound types and their length of durations.
    Sound Machine Wind
    Type Music Speech Noise Noise Others
    Duration
    14 40 73 12 22
  • Computational cost is a critical constraint concerning the application of various classification algorithms in hearing aids, mainly due to the limitation on power consumption and real time operation. The cost of a classification algorithm consists of two parts, feature evaluation and classification. The former is related with the length of analysis window and the resolution of the Fourier Frequency Transform (FFT), while the latter is mainly determined by the number of sound classes, the dimension of the employed feature vector. For a GMM and HMM classifier, the number of components and states affect the computational cost as well. At each classification stage, the computational cost is measured in terms of number of operations and evaluated along three dimensions: choice of models, choice of feature sets, and number of selected features, just as in the performance evaluation.
  • FIG. 2 is a block diagram illustrating an embodiment of a method 200 for operating a hearing aid including an advanced sound classification function. In one embodiment, method 200 is applied to operate hearing aid 100.
  • At 202, an environmental sound is sensed. At 204, the sensed sound is classified by executing an advanced classification algorithm. At 206, the sound is processed using an outcome of the classification of the sound for specified hearing assistance functions.
  • As discussed above, experiments were conducted to systematically assess the impact of more sophisticated classifiers and features on the performances of the sound classification system. The results show that using advanced models, such as HMM or GMM, significantly improve the classification performance. This change does not inevitably lead to a great increase of computational complexity, provided that a suitable number (5 to 7) of low level features are carefully chosen. These findings, thus, indicate that the use of these advanced models is feasible in a hearing aid application.
  • The evaluation performed at each stage is combined to obtain the overall classification rate, which is illustrated in FIG. 3, which is a graph showing error rates each plotted as a function of the number of employed features. The performance is evaluated over two feature sets (low level and MFCC) and three models (Gaussian, GMM and HMM). The error rate obtained at each iteration of the forward sequential feature selection process is plotted. The various classifiers and feature sets are indicated by line styles and marker styles, respectively.
  • Several results are apparent upon examination of FIG. 3. The first is that advanced models perform better on average than a simple one. When ten features are employed, the lowest error rate of the Gaussian classifier is 26%, while using GMM and HMM, the rates drop to around 18% and 12%, respectively. The performance improvement associated with the employment of GMM might be explained by the better fitting between the distribution of feature vectors and the model. And the further improvement of HMM might be related with its exploration on the dynamics of the feature vectors.
  • In terms of the feature set, we observe that there is no significant difference in classification performance between the low level feature set and MFCC, provided that the number of employed features is more than five in both cases. This demonstrates that a carefully chosen low level feature subset has discriminative capability equivalent to that of the MFCC. Considering that the computational cost of a low level features-based classification is typically one magnitude lower than a MFCC-based one, the low level features are in favor when the computational source is tight.
  • The discussion above shows the advantage for using advanced models with low level features. Therefore, the following discussion focuses on the advanced models with low level features. The impact of the number of features employed is examined. The overall computational cost is determined from the two stages, as shown in FIG. 4. FIG. 4 is a graph showing performance of sound classification including computational costs each plotted as a function of the number of employed features. The performance is evaluated over advanced models (GMM and HMM) and low level. The recommended number of features is indicated in FIG. 4 by a gray patch. It is noteworthy that when combining the computational cost from the two stages, some features are identified as optimal features on both stages but need only be calculated once. Thus the overall cost is less than the direct summation over the two stages.
  • A comparison between FIG. 3 and FIG. 4 shows that the increased number of features results both decreased error rates and increased associated computational costs, thus indicating a conflict between performance and computational complexity. It seems that choosing five to seven features is a reasonable comprise between the two factors. Using this number of features, the error rate is about 10% and the computational cost is still manageable. On the other hand, using more than seven features only slightly improves the performance but incurs great computational cost.
  • The impact of the test sequence length on performance when using low level features and advanced models is also examined. The error rates are each plotted as a function of the test sequence length in FIG. 5. The performance is evaluated over advanced models and low level feature set. As expected, increasing the length of test sequence improves the classification performance. An approximate 20% decrease of error rate is obtained by increasing the test sequence from 128 ms to 256 ms. This benefit is diminished with further increase in the length of the test sequence. The overall pattern of the rate of decrease seems to be consistent across classifiers.
  • The recognition rates of advance models using a two-stage classification scheme with the one based on a flat structure where the input sound is directly assigned to one of the five types are compared. FIG. 6 shows error rates each plotted as a function of the structures of classification system. The performance is evaluated over a hierarchy structure and a flat structure. A low level feature set is used with both structures. For each case the low level features are employed in the feature selection process and the lowest error rate are presented. It seems that the flat classification scheme has a slight advantage over the two-stage one, at the expense of the flexibility and computational complexity.
  • A number of experiments are conducted to assess the impact of classifiers, feature sets and number of features on the performances of a classification system, where five sound classes, “speech”, “music”, “machine noise”, “wind noise” and “others”, are distinguished. The results show that compared with a Gaussian classifier, advanced models, such as GMM or HMM, significantly improves the classification performance. The use of the advanced classifiers is not necessarily associated with a great increase of computational complexity, as one may expect. As for the choice of feature set, the performance of low-level-based classification is comparable with MFCC-based classification. Considering that the computational cost of low level features is generally lower than MFCC, the low level feature set should be recommended when the computational resource is limited. In addition, the number of features is suggested as 5 to 7 to balance the performance and computational cost. The classification performance can be further improved by using longer test sequence or a flat classification scheme.
  • This application is intended to cover adaptations or variations of the present subject matter. It is to be understood that the above description is intended to be illustrative, and not restrictive. The scope of the present subject matter should be determined with reference to the appended claims, along with the full scope of legal equivalents to which such claims are entitled,
  • The following documents, which are referred to in the discussion above, are each incorporated by reference in its entirety:
  • REFERENCES
  • [1] Kates, J. M., Classification of background noises for hearing aid applications. The Journal of the Acoustical Society of America, 1995. 97: p. 461.
  • [2] Lavner, Y. and D. Ruinskiy, A Decision-Tree based Algorithm for Speech/Music Classification and Segmentation. EURASIP Journal on Audio, Speech, and Music Processing, 2009. 2009.
  • [3] Scheirer, E. and M. Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. 1997, pp. 1331-1334.
  • [4] Chu, S., S. Narayanan, and C. C. J. Kuo. Environmental sound recognition using MP-based features. in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. 2008, pp. 1-4.
  • [5] Huang, R. and J. H. L. Hansen, Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora. IEEE Transactions on Audio, Speech, and Language Processing, 2006. 14(3): pp. 907-919.
  • [6] Zhang, T. and C. C. J. Kuo, Audio content analysis for online audiovisual data segmentation and classification. IEEE Transactions on Speech and Audio Processing, 2001. 9(4): p. 441.
  • [7] Edwards, B. W., Z. Hou, C. J. Struck, and P. Dharan, Signal-processing algorithms for a new software-based, digital hearing device. Hearing Journal, 1998. 51: pp. 44-54.
  • [8] McKinney, M. F. and J. Breebaart. Features for audio and music classification. in Proceedings of International Conference on Music Information Retrieval 2003, pp. 151-158.
  • [9] Quatieri, T. F., Discrete-time speech signal processing. 2002: Prentice Hall PTR.
  • [10] Martin, R. and A. Nagathil. Cepstral modulation ratio regression (CMRARE) parameters for audio signal analysis and classification. in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. 2009, pp. 321-324.
  • [11] Buchler, M., S. Allegro, S. Launer, and N. Dillier, Sound classification in hearing aids inspired by auditory scene analysis. EURASIP Journal on Applied Signal Processing, 2005. 18: pp. 2991-3002.
  • [12] Blum, A. L. and P. Langley, Selection of relevant features and examples in machine learning. Artificial intelligence, 1997. 97(1-2): pp. 245-271.
  • [13] Duda, R. O., P. E. Hart, and D. G. Stork, Pattern classification. 2001: Wiley New York.
  • [14] Rabiner, L. R., A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 1989. 77(2): pp. 257-286.

Claims (20)

1. A hearing aid, comprising:
a microphone configured to sense an environmental sound; and
a processor including a sound classification module coupled to the microphone and configured to classify the sound by executing an advanced classification algorithm, and processor configured to process the sound using an outcome of the classification of the sound for specified hearing assistance functions.
2. The hearing aid of claim 1, wherein the sound classification nodule is configured to classify the sound by executing an advanced classification algorithm including a Hidden Markov Model (HMM),
3. The hearing aid of claim 1, wherein the sound classification nodule is configured to classify the sound by executing an advanced classification algorithm including a Gaussian Mixture Model (GMM).
4. The hearing aid of claim 1, wherein the sound classification nodule is configured to classify the sound as one of music, speech, and non-speech.
5. The hearing aid of claim 4, wherein the sound classification nodule is configured to further classify the sound as one of machine noise, wind noise, and other sounds in response to the sound being classified as the non-speech.
6. The hearing aid of claim 1, wherein the sound classification module comprises a feature set having an amount of features for sound classification determined based on performance and computational cost of the sound classification.
7. The hearing aid of claim 6, wherein the feature set comprises 5 to 7 features.
8. The hearing aid of claim 7, wherein the feature set comprises low level features.
9. The hearing aid of claim 7, wherein the feature set comprises Mel-scale Frequency cepstral coefficients (MFCC).
10. A method for operating a hearing aid, comprising:
sensing an environmental sound;
classifying the sound by executing an advanced classification algorithm; and
processing the sound using an outcome of the classification of the sound for specified hearing assistance functions.
11. The method of claim 10, wherein classifying the sound comprises classifying the sound by using a Hidden Markov Model (HMM),
12. The method of claim 10, wherein classifying the sound comprises classifying the sound by using a Gaussian Mixture Model (GMM).
13. The method of claim 10, wherein classifying the sound comprises classifying the sound using a two-stage environment classification scheme.
14. The method of claim 13, wherein using the two-stage environment classification scheme comprises classifying the sound as one of music, speech or non-speech.
15. The method of claim 14, wherein using the two-stage environment classification scheme further comprises classifying the sound as one of machine noise, wind noise, and other sounds in response to the sound being classified as the non-speech.
16. The method claim 10, comprising selecting the advanced classification algorithm based on based on performance and computational cost for the classifying the sound.
17. The method of claim 16, further comprising selecting a feature set for the classifying the sound based on the performance and computational cost for the classifying the sound.
18. The method of claim 17, further comprising selecting an amount of features in the feature set based on the performance and computational cost for the classifying the sound.
19. The method of claim 18, wherein selecting the amount of features in the feature set comprises selecting 5 to 7 features.
20. The method of claim 19, wherein selecting the amount of features in the feature set comprises selecting low level features.
US12/879,218 2009-09-11 2010-09-10 Sound classification system for hearing aids Abandoned US20110137656A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US12/879,218 US20110137656A1 (en) 2009-09-11 2010-09-10 Sound classification system for hearing aids
US16/741,114 US11250878B2 (en) 2009-09-11 2020-01-13 Sound classification system for hearing aids

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US24173509P 2009-09-11 2009-09-11
US12/879,218 US20110137656A1 (en) 2009-09-11 2010-09-10 Sound classification system for hearing aids

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US16/741,114 Continuation US11250878B2 (en) 2009-09-11 2020-01-13 Sound classification system for hearing aids

Publications (1)

Publication Number Publication Date
US20110137656A1 true US20110137656A1 (en) 2011-06-09

Family

ID=43827374

Family Applications (2)

Application Number Title Priority Date Filing Date
US12/879,218 Abandoned US20110137656A1 (en) 2009-09-11 2010-09-10 Sound classification system for hearing aids
US16/741,114 Active 2030-10-27 US11250878B2 (en) 2009-09-11 2020-01-13 Sound classification system for hearing aids

Family Applications After (1)

Application Number Title Priority Date Filing Date
US16/741,114 Active 2030-10-27 US11250878B2 (en) 2009-09-11 2020-01-13 Sound classification system for hearing aids

Country Status (3)

Country Link
US (2) US20110137656A1 (en)
EP (1) EP2328363B1 (en)
DK (1) DK2328363T3 (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130070928A1 (en) * 2011-09-21 2013-03-21 Daniel P. W. Ellis Methods, systems, and media for mobile audio event recognition
US8958586B2 (en) 2012-12-21 2015-02-17 Starkey Laboratories, Inc. Sound environment classification by coordinated sensing using hearing assistance devices
US9124981B2 (en) 2012-11-14 2015-09-01 Qualcomm Incorporated Systems and methods for classification of audio environments
US20150269954A1 (en) * 2014-03-21 2015-09-24 Joseph F. Ryan Adaptive microphone sampling rate techniques
US9384272B2 (en) 2011-10-05 2016-07-05 The Trustees Of Columbia University In The City Of New York Methods, systems, and media for identifying similar songs using jumpcodes
US20160240207A1 (en) * 2012-03-21 2016-08-18 Samsung Electronics Co., Ltd. Method and apparatus for encoding and decoding high frequency for bandwidth extension
US9549264B2 (en) 2013-02-15 2017-01-17 Samsung Electronics Co., Ltd. Portable terminal for controlling hearing aid and method therefor
EP3361753A1 (en) 2017-02-09 2018-08-15 Starkey Laboratories, Inc. Hearing device incorporating dynamic microphone attenuation during streaming
WO2019079909A1 (en) * 2017-10-27 2019-05-02 Ecole De Technologie Superieure In-ear nonverbal audio events classification system and method
US10522169B2 (en) 2016-09-23 2019-12-31 Trustees Of The California State University Classification of teaching based upon sound amplitude
US10631101B2 (en) * 2016-06-09 2020-04-21 Cochlear Limited Advanced scene classification for prosthesis
WO2021138647A1 (en) 2020-01-03 2021-07-08 Starkey Laboratories, Inc. Ear-worn electronic device employing acoustic environment adaptation
US11100918B2 (en) * 2018-08-27 2021-08-24 American Family Mutual Insurance Company, S.I. Event sensing system
WO2021242570A1 (en) 2020-05-29 2021-12-02 Starkey Laboratories, Inc. Hearing device with multiple neural networks for sound enhancement
WO2022051032A1 (en) 2020-09-01 2022-03-10 Starkey Laboratories, Inc. Mobile device that provides sound enhancement for hearing device
WO2022146627A1 (en) 2020-12-28 2022-07-07 Starkey Laboratories, Inc. Ear-wearable electronic hearing device incorporating microphone array with enhanced wind noise suppression
US20220360909A1 (en) * 2021-05-10 2022-11-10 Kieran REED Prosthesis automated assistant
CN115334349A (en) * 2022-07-15 2022-11-11 北京达佳互联信息技术有限公司 Audio processing method and device, electronic equipment and storage medium

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982804B (en) * 2011-09-02 2017-05-03 杜比实验室特许公司 Method and system of voice frequency classification
US9763016B2 (en) * 2014-07-31 2017-09-12 Starkey Laboratories, Inc. Automatic directional switching algorithm for hearing aids
CA2990891A1 (en) 2015-06-30 2017-01-05 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forchung E.V. Method and device for associating noises and for analyzing
CN107066365B (en) * 2017-02-20 2021-01-01 创新先进技术有限公司 System abnormity monitoring method and device
US11337011B2 (en) 2017-10-17 2022-05-17 Cochlear Limited Hierarchical environmental classification in a hearing prosthesis
US11722826B2 (en) 2017-10-17 2023-08-08 Cochlear Limited Hierarchical environmental classification in a hearing prosthesis
CN111027453B (en) * 2019-12-06 2022-05-17 西北工业大学 Automatic non-cooperative underwater target identification method based on Gaussian mixture model
EP3996390A1 (en) * 2021-05-20 2022-05-11 Sonova AG Method for selecting a hearing program of a hearing device based on own voice detection

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5727072A (en) * 1995-02-24 1998-03-10 Nynex Science & Technology Use of noise segmentation for noise cancellation
US6067517A (en) * 1996-02-02 2000-05-23 International Business Machines Corporation Transcription of speech data with segments from acoustically dissimilar environments
US6182036B1 (en) * 1999-02-23 2001-01-30 Motorola, Inc. Method of extracting features in a voice recognition system
US6240386B1 (en) * 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
US6570991B1 (en) * 1996-12-18 2003-05-27 Interval Research Corporation Multi-feature speech/music discrimination system
US6708146B1 (en) * 1997-01-03 2004-03-16 Telecommunications Research Laboratories Voiceband signal classifier
US6862359B2 (en) * 2001-12-18 2005-03-01 Gn Resound A/S Hearing prosthesis with automatic classification of the listening environment
US20060136205A1 (en) * 2004-12-21 2006-06-22 Song Jianming J Method of refining statistical pattern recognition models and statistical pattern recognizers
US7082394B2 (en) * 2002-06-25 2006-07-25 Microsoft Corporation Noise-robust feature extraction using multi-layer principal component analysis
US20060182295A1 (en) * 2005-02-11 2006-08-17 Phonak Ag Dynamic hearing assistance system and method therefore
US20080004878A1 (en) * 2006-06-30 2008-01-03 Robert Bosch Corporation Method and apparatus for generating features through logical and functional operations
WO2008028484A1 (en) * 2006-09-05 2008-03-13 Gn Resound A/S A hearing aid with histogram based sound environment classification
US20080144869A1 (en) * 2005-09-01 2008-06-19 Widex A/S Method and apparatus for controlling band split compressors in a hearing aid
US20080279395A1 (en) * 2005-11-11 2008-11-13 Phonak Ag Feedback Compensation in a Sound Processing Device
US20090028367A1 (en) * 2006-04-01 2009-01-29 Widex A/S Hearing aid and method for controlling signal processing in a hearing aid
US20090037175A1 (en) * 2007-08-03 2009-02-05 Microsoft Corporation Confidence measure generation for speech related searching

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK1273205T3 (en) * 2000-04-04 2006-10-09 Gn Resound As A hearing prosthesis with automatic classification of the listening environment
US8494193B2 (en) * 2006-03-14 2013-07-23 Starkey Laboratories, Inc. Environment detection and adaptation in hearing assistance devices
JP5038403B2 (en) * 2007-03-16 2012-10-03 パナソニック株式会社 Speech analysis apparatus, speech analysis method, speech analysis program, and system integrated circuit
US7693806B2 (en) 2007-06-21 2010-04-06 Microsoft Corporation Classification using a cascade approach

Patent Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5727072A (en) * 1995-02-24 1998-03-10 Nynex Science & Technology Use of noise segmentation for noise cancellation
US6067517A (en) * 1996-02-02 2000-05-23 International Business Machines Corporation Transcription of speech data with segments from acoustically dissimilar environments
US6570991B1 (en) * 1996-12-18 2003-05-27 Interval Research Corporation Multi-feature speech/music discrimination system
US6708146B1 (en) * 1997-01-03 2004-03-16 Telecommunications Research Laboratories Voiceband signal classifier
US6240386B1 (en) * 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
US6182036B1 (en) * 1999-02-23 2001-01-30 Motorola, Inc. Method of extracting features in a voice recognition system
US6862359B2 (en) * 2001-12-18 2005-03-01 Gn Resound A/S Hearing prosthesis with automatic classification of the listening environment
US7082394B2 (en) * 2002-06-25 2006-07-25 Microsoft Corporation Noise-robust feature extraction using multi-layer principal component analysis
US20060136205A1 (en) * 2004-12-21 2006-06-22 Song Jianming J Method of refining statistical pattern recognition models and statistical pattern recognizers
US20060182295A1 (en) * 2005-02-11 2006-08-17 Phonak Ag Dynamic hearing assistance system and method therefore
US20080144869A1 (en) * 2005-09-01 2008-06-19 Widex A/S Method and apparatus for controlling band split compressors in a hearing aid
US20080279395A1 (en) * 2005-11-11 2008-11-13 Phonak Ag Feedback Compensation in a Sound Processing Device
US20090028367A1 (en) * 2006-04-01 2009-01-29 Widex A/S Hearing aid and method for controlling signal processing in a hearing aid
US20080004878A1 (en) * 2006-06-30 2008-01-03 Robert Bosch Corporation Method and apparatus for generating features through logical and functional operations
WO2008028484A1 (en) * 2006-09-05 2008-03-13 Gn Resound A/S A hearing aid with histogram based sound environment classification
US20100027820A1 (en) * 2006-09-05 2010-02-04 Gn Resound A/S Hearing aid with histogram based sound environment classification
US20090037175A1 (en) * 2007-08-03 2009-02-05 Microsoft Corporation Confidence measure generation for speech related searching

Non-Patent Citations (11)

* Cited by examiner, † Cited by third party
Title
Alexandre et al. "Automatic sound classification for improving speech intelligibility in hearing aids using a layered structure." Intelligent Data Engineering and Automated Learning-IDEAL 2006, Springer Berlin Heidelberg, September 2006, pp. 306-313. *
Alexandre et al. "Feature selection for sound classification in hearing aids through restricted search driven by genetic algorithms." Audio, Speech, and Language Processing, IEEE Transactions on 15.8 (2007), November 2007, pp. 2249-2256. *
Alexandre et al. "Feature selection for sound classification in hearing aids through restricted search driven by genetic algorithms." Audio, Speech, and Language Processing, IEEE Transactions on 15.8, November 2007, pp. 2249-2256. *
Alexandre et al. "Sound classification in hearing aids by the harmony search algorithm." Music-Inspired Harmony Search Algorithm. Springer Berlin Heidelberg, 2009, pp. 173-188. *
Büchler, Michael, et al. "Sound classification in hearing aids inspired by auditory scene analysis." EURASIP Journal on Advances in Signal Processing 2005.18, December 2005, pp. 1-12. *
B�chler, Michael, et al. "Sound classification in hearing aids inspired by auditory scene analysis." EURASIP Journal on Applied Signal Processing 2005, 2005, pp. 2991-3002. *
Burred et al. "Hierarchical automatic audio signal classification." Journal of the Audio Engineering Society 52.7/8, July 2004, pp. 724-739. *
Cuadra et al. "Reducing the computational cost for sound classification in hearing aids by selecting features via genetic algorithms with restricted search." Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on. IEEE, July 2008, pp. 1320-1327. *
Dong, Rong, et al. "Low-power implementation of an HMM-based sound environment classification algorithm for hearing aid application." Signal Processing Conference, 2007 15th European. IEEE, September 2007, pp. 1635-1638. *
Freeman, et al. "Audio environment classification for hearing aids using artificial neural networks with windowed input." Proceedings of IEEE Symposium on Computational Intelligence in Image and Signal Processing. April 2007, pp. 183-188. *
Kates, James M. "Classification of background noises for hearing-aid applications." The Journal of the Acoustical Society of America vol. 97, 1995, pp. 461-470. *

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130070928A1 (en) * 2011-09-21 2013-03-21 Daniel P. W. Ellis Methods, systems, and media for mobile audio event recognition
US9384272B2 (en) 2011-10-05 2016-07-05 The Trustees Of Columbia University In The City Of New York Methods, systems, and media for identifying similar songs using jumpcodes
US9761238B2 (en) * 2012-03-21 2017-09-12 Samsung Electronics Co., Ltd. Method and apparatus for encoding and decoding high frequency for bandwidth extension
US20160240207A1 (en) * 2012-03-21 2016-08-18 Samsung Electronics Co., Ltd. Method and apparatus for encoding and decoding high frequency for bandwidth extension
US10339948B2 (en) 2012-03-21 2019-07-02 Samsung Electronics Co., Ltd. Method and apparatus for encoding and decoding high frequency for bandwidth extension
US9124981B2 (en) 2012-11-14 2015-09-01 Qualcomm Incorporated Systems and methods for classification of audio environments
US9584930B2 (en) 2012-12-21 2017-02-28 Starkey Laboratories, Inc. Sound environment classification by coordinated sensing using hearing assistance devices
US8958586B2 (en) 2012-12-21 2015-02-17 Starkey Laboratories, Inc. Sound environment classification by coordinated sensing using hearing assistance devices
US9549264B2 (en) 2013-02-15 2017-01-17 Samsung Electronics Co., Ltd. Portable terminal for controlling hearing aid and method therefor
US20150269954A1 (en) * 2014-03-21 2015-09-24 Joseph F. Ryan Adaptive microphone sampling rate techniques
US9406313B2 (en) * 2014-03-21 2016-08-02 Intel Corporation Adaptive microphone sampling rate techniques
US11825268B2 (en) 2016-06-09 2023-11-21 Cochlear Limited Advanced scene classification for prosthesis
US10631101B2 (en) * 2016-06-09 2020-04-21 Cochlear Limited Advanced scene classification for prosthesis
US10522169B2 (en) 2016-09-23 2019-12-31 Trustees Of The California State University Classification of teaching based upon sound amplitude
US11457319B2 (en) 2017-02-09 2022-09-27 Starkey Laboratories, Inc. Hearing device incorporating dynamic microphone attenuation during streaming
US10284969B2 (en) 2017-02-09 2019-05-07 Starkey Laboratories, Inc. Hearing device incorporating dynamic microphone attenuation during streaming
US11109165B2 (en) 2017-02-09 2021-08-31 Starkey Laboratories, Inc. Hearing device incorporating dynamic microphone attenuation during streaming
EP3361753A1 (en) 2017-02-09 2018-08-15 Starkey Laboratories, Inc. Hearing device incorporating dynamic microphone attenuation during streaming
WO2019079909A1 (en) * 2017-10-27 2019-05-02 Ecole De Technologie Superieure In-ear nonverbal audio events classification system and method
US11875782B2 (en) 2018-08-27 2024-01-16 American Family Mutual Insurance Company, S.I. Event sensing system
US11100918B2 (en) * 2018-08-27 2021-08-24 American Family Mutual Insurance Company, S.I. Event sensing system
WO2021138648A1 (en) 2020-01-03 2021-07-08 Starkey Laboratories, Inc. Ear-worn electronic device employing acoustic environment adaptation
WO2021138647A1 (en) 2020-01-03 2021-07-08 Starkey Laboratories, Inc. Ear-worn electronic device employing acoustic environment adaptation
WO2021242570A1 (en) 2020-05-29 2021-12-02 Starkey Laboratories, Inc. Hearing device with multiple neural networks for sound enhancement
WO2022051032A1 (en) 2020-09-01 2022-03-10 Starkey Laboratories, Inc. Mobile device that provides sound enhancement for hearing device
WO2022146627A1 (en) 2020-12-28 2022-07-07 Starkey Laboratories, Inc. Ear-wearable electronic hearing device incorporating microphone array with enhanced wind noise suppression
US20220360909A1 (en) * 2021-05-10 2022-11-10 Kieran REED Prosthesis automated assistant
CN115334349A (en) * 2022-07-15 2022-11-11 北京达佳互联信息技术有限公司 Audio processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
US20200152227A1 (en) 2020-05-14
EP2328363B1 (en) 2016-05-18
EP2328363A1 (en) 2011-06-01
US11250878B2 (en) 2022-02-15
DK2328363T3 (en) 2016-08-22

Similar Documents

Publication Publication Date Title
US11250878B2 (en) Sound classification system for hearing aids
Xiang et al. Evaluation of sound classification algorithms for hearing aid applications
Xu et al. An experimental study on speech enhancement based on deep neural networks
EP1210711B1 (en) Sound source classification
Büchler et al. Sound classification in hearing aids inspired by auditory scene analysis
US20150228277A1 (en) Voiced Sound Pattern Detection
Zelinka et al. Impact of vocal effort variability on automatic speech recognition
Schröder et al. Spectro-temporal Gabor filterbank features for acoustic event detection
US20050192795A1 (en) Identification of the presence of speech in digital audio data
US20110261983A1 (en) Systems and methods for own voice recognition with adaptations for noise robustness
EP2083417B1 (en) Sound processing device and program
Allegro et al. Automatic sound classification inspired by auditory scene analysis
JP5050698B2 (en) Voice processing apparatus and program
CN109997186B (en) Apparatus and method for classifying acoustic environments
Dey et al. Cross-corpora language recognition: A preliminary investigation with Indian languages
Clemins et al. Application of speech recognition to African elephant (Loxodonta Africana) vocalizations
Sose et al. Sound Source Separation Using Neural Network
Tchorz et al. Classification of environmental sounds for future hearing aid applications
Uhle et al. Speech enhancement of movie sound
McKinney et al. Evaluation of sound classification algorithms for hearing aid applications
JP5109050B2 (en) Voice processing apparatus and program
Islam GFCC-based robust gender detection
Kunekar et al. Audio feature extraction: Foreground and Background audio separation using KNN algorithm
Venkatesan et al. Analysis of monaural and binaural statistical properties for the estimation of distance of a target speaker
Wrigley et al. Feature selection for the classification of crosstalk in multi-channel audio

Legal Events

Date Code Title Description
AS Assignment

Owner name: STARKEY LABORATORIES, INC., MINNESOTA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:XIANG, JUANJUAN;MCKINNEY, MARTIN;FITZ, KELLY;AND OTHERS;SIGNING DATES FROM 20100929 TO 20110128;REEL/FRAME:025840/0105

STCV Information on status: appeal procedure

Free format text: ON APPEAL -- AWAITING DECISION BY THE BOARD OF APPEALS

AS Assignment

Owner name: CITIBANK, N.A., AS ADMINISTRATIVE AGENT, TEXAS

Free format text: NOTICE OF GRANT OF SECURITY INTEREST IN PATENTS;ASSIGNOR:STARKEY LABORATORIES, INC.;REEL/FRAME:046944/0689

Effective date: 20180824

STCV Information on status: appeal procedure

Free format text: BOARD OF APPEALS DECISION RENDERED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION