DE60124842D1 - Noise-robust pattern recognition - Google Patents

Noise-robust pattern recognition

Info

Publication number
DE60124842D1
Authority
DE
Germany
Prior art keywords
noise
pattern recognition
training
signal
recognition model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60124842T
Other languages
English (en)
Other versions
DE60124842T2 (de)
Inventor
Li Deng
Xuedong Huang
Michael D Plumpe
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp
Application granted
Publication of DE60124842D1
Publication of DE60124842T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
DE60124842T 2000-10-16 2001-10-10 Noise-robust pattern recognition Expired - Lifetime DE60124842T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/688,950 US6876966B1 (en) 2000-10-16 2000-10-16 Pattern recognition training method and apparatus using inserted noise followed by noise reduction
US688950 2000-10-16

Publications (2)

Publication Number Publication Date
DE60124842D1 DE60124842D1 (de) 2007-01-11
DE60124842T2 DE60124842T2 (de) 2007-04-12

Family

ID=24766456

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60124842T Expired - Lifetime DE60124842T2 (de) 2000-10-16 2001-10-10 Noise-robust pattern recognition

Country Status (5)

Country Link
US (1) US6876966B1 (de)
EP (1) EP1199708B1 (de)
JP (1) JP4195211B2 (de)
AT (1) ATE347161T1 (de)
DE (1) DE60124842T2 (de)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7542961B2 (en) * 2001-05-02 2009-06-02 Victor Gogolak Method and system for analyzing drug adverse effects
US6778994B2 (en) 2001-05-02 2004-08-17 Victor Gogolak Pharmacovigilance database
US7925612B2 (en) * 2001-05-02 2011-04-12 Victor Gogolak Method for graphically depicting drug adverse effect risks
US7461006B2 (en) * 2001-08-29 2008-12-02 Victor Gogolak Method and system for the analysis and association of patient-specific and population-based genomic data with drug safety adverse event data
US7165028B2 (en) * 2001-12-12 2007-01-16 Texas Instruments Incorporated Method of speech recognition resistant to convolutive distortion and additive distortion
US7209881B2 (en) * 2001-12-20 2007-04-24 Matsushita Electric Industrial Co., Ltd. Preparing acoustic models by sufficient statistics and noise-superimposed speech data
US7130776B2 (en) * 2002-03-25 2006-10-31 Lockheed Martin Corporation Method and computer program product for producing a pattern recognition training set
US7117148B2 (en) 2002-04-05 2006-10-03 Microsoft Corporation Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
US7174292B2 (en) 2002-05-20 2007-02-06 Microsoft Corporation Method of determining uncertainty associated with acoustic distortion-based noise reduction
US7107210B2 (en) * 2002-05-20 2006-09-12 Microsoft Corporation Method of noise reduction based on dynamic aspects of speech
US7103540B2 (en) * 2002-05-20 2006-09-05 Microsoft Corporation Method of pattern recognition using noise reduction uncertainty
JP4352790B2 (ja) * 2002-10-31 2009-10-28 セイコーエプソン株式会社 Acoustic model creation method, speech recognition device, and vehicle having a speech recognition device
US7370057B2 (en) * 2002-12-03 2008-05-06 Lockheed Martin Corporation Framework for evaluating data cleansing applications
WO2004104908A1 (en) * 2003-05-21 2004-12-02 Koninklijke Philips Electronics N.V. Method and device for verifying the identity of an object
US8041026B1 (en) 2006-02-07 2011-10-18 Avaya Inc. Event driven noise cancellation
US20070239444A1 (en) * 2006-03-29 2007-10-11 Motorola, Inc. Voice signal perturbation for speech recognition
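JP4245617B2 (ja) * 2006-04-06 2009-03-25 株式会社東芝 Feature correction device, feature correction method, and feature correction program
JP4316583B2 (ja) 2006-04-07 2009-08-19 株式会社東芝 Feature correction device, feature correction method, and feature correction program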
US7840287B2 (en) * 2006-04-13 2010-11-23 Fisher-Rosemount Systems, Inc. Robust process model identification in model based control techniques
US8407160B2 (en) * 2006-11-15 2013-03-26 The Trustees Of Columbia University In The City Of New York Systems, methods, and media for generating sanitized data, sanitizing anomaly detection models, and/or generating sanitized anomaly detection models
US8195453B2 (en) * 2007-09-13 2012-06-05 Qnx Software Systems Limited Distributed intelligibility testing system
WO2009039897A1 (en) 2007-09-26 2009-04-02 Fraunhofer - Gesellschaft Zur Förderung Der Angewandten Forschung E.V. Apparatus and method for extracting an ambient signal in an apparatus and method for obtaining weighting coefficients for extracting an ambient signal and computer program
US8615397B2 (en) * 2008-04-04 2013-12-24 Intuit Inc. Identifying audio content using distorted target patterns
NO328622B1 (no) 2008-06-30 2010-04-06 Tandberg Telecom As Device and method for reducing keyboard noise in conference equipment
JP5150542B2 (ja) * 2009-03-26 2013-02-20 株式会社東芝 Pattern recognition device, pattern recognition method, and program
US11416214B2 (en) 2009-12-23 2022-08-16 Google Llc Multi-modal input on an electronic device
EP4318463A3 (de) 2009-12-23 2024-02-28 Google LLC Multi-modal input on an electronic device
US8660842B2 (en) * 2010-03-09 2014-02-25 Honda Motor Co., Ltd. Enhancing speech recognition using visual information
US8265928B2 (en) 2010-04-14 2012-09-11 Google Inc. Geotagged environmental audio for enhanced speech recognition accuracy
US8468012B2 (en) 2010-05-26 2013-06-18 Google Inc. Acoustic model adaptation using geographic information
US8484023B2 (en) * 2010-09-24 2013-07-09 Nuance Communications, Inc. Sparse representation features for speech recognition
US8352245B1 (en) 2010-12-30 2013-01-08 Google Inc. Adjusting language models
US8296142B2 (en) 2011-01-21 2012-10-23 Google Inc. Speech recognition using dock context
HUP1200018A2 (en) 2012-01-11 2013-07-29 77 Elektronika Mueszeripari Kft Method of training a neural network, as well as a neural network
US8484017B1 (en) 2012-09-10 2013-07-09 Google Inc. Identifying media content
US20140074466A1 (en) 2012-09-10 2014-03-13 Google Inc. Answering questions using environmental context
US9734819B2 (en) 2013-02-21 2017-08-15 Google Technology Holdings LLC Recognizing accented speech
US20140270249A1 (en) 2013-03-12 2014-09-18 Motorola Mobility Llc Method and Apparatus for Estimating Variability of Background Noise for Noise Suppression
US20140278393A1 (en) 2013-03-12 2014-09-18 Motorola Mobility Llc Apparatus and Method for Power Efficient Signal Conditioning for a Voice Recognition System
US9275638B2 (en) 2013-03-12 2016-03-01 Google Technology Holdings LLC Method and apparatus for training a voice recognition model database
US9237225B2 (en) 2013-03-12 2016-01-12 Google Technology Holdings LLC Apparatus with dynamic audio signal pre-conditioning and methods therefor
WO2014182453A2 (en) * 2013-05-06 2014-11-13 Motorola Mobility Llc Method and apparatus for training a voice recognition model database
CN103310789B (zh) * 2013-05-08 2016-04-06 北京大学深圳研究生院 Sound event recognition method based on improved parallel model combination
US9842592B2 (en) 2014-02-12 2017-12-12 Google Inc. Language models using non-linguistic context
US9412365B2 (en) 2014-03-24 2016-08-09 Google Inc. Enhanced maximum entropy models
US9858922B2 (en) 2014-06-23 2018-01-02 Google Inc. Caching speech recognition scores
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
US9299347B1 (en) * 2014-10-22 2016-03-29 Google Inc. Speech recognition using associative mapping
KR102167719B1 (ko) 2014-12-08 2020-10-19 삼성전자주식회사 Method and apparatus for training a language model, and method and apparatus for speech recognition
US9535905B2 (en) * 2014-12-12 2017-01-03 International Business Machines Corporation Statistical process control and analytics for translation supply chain operational management
KR101988222B1 (ko) * 2015-02-12 2019-06-13 한국전자통신연구원 Apparatus and method for large-vocabulary continuous speech recognition
US10134394B2 (en) 2015-03-20 2018-11-20 Google Llc Speech recognition using log-linear model
US9786270B2 (en) 2015-07-09 2017-10-10 Google Inc. Generating acoustic models
KR102494139B1 (ko) * 2015-11-06 2023-01-31 삼성전자주식회사 Apparatus and method for training a neural network, and apparatus and method for speech recognition
US20170148466A1 (en) * 2015-11-25 2017-05-25 Tim Jackson Method and system for reducing background sounds in a noisy environment
CN105448303B (zh) * 2015-11-27 2020-02-04 百度在线网络技术(北京)有限公司 Method and device for processing a speech signal
US10229672B1 (en) 2015-12-31 2019-03-12 Google Llc Training acoustic models using connectionist temporal classification
US9978367B2 (en) 2016-03-16 2018-05-22 Google Llc Determining dialog states for language models
US20180018973A1 (en) 2016-07-15 2018-01-18 Google Inc. Speaker verification
US10832664B2 (en) 2016-08-19 2020-11-10 Google Llc Automated speech recognition using language models that selectively use domain-specific model components
US10311860B2 (en) 2017-02-14 2019-06-04 Google Llc Language model biasing system
US10706840B2 (en) 2017-08-18 2020-07-07 Google Llc Encoder-decoder models for sequence to sequence mapping
JP7019096B2 (ja) 2018-08-30 2022-02-14 ドルビー・インターナショナル・アーベー Method and apparatus for controlling enhancement of low-bitrate coded audio
CN111210810A (zh) * 2019-12-17 2020-05-29 秒针信息技术有限公司 Model training method and device
EP3862782A1 (de) * 2020-02-04 2021-08-11 Infineon Technologies AG Device and method for correcting an input signal
CN111429930B (zh) * 2020-03-16 2023-02-28 云知声智能科技股份有限公司 Noise reduction model processing method and system based on adaptive sampling rate
CN112614484B (zh) * 2020-11-23 2022-05-20 北京百度网讯科技有限公司 Feature information mining method, device, and electronic equipment
CN114190953A (zh) * 2021-12-09 2022-03-18 四川新源生物电子科技有限公司 Training method and system for an EEG signal noise reduction model for EEG acquisition equipment

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE4309985A1 (de) * 1993-03-29 1994-10-06 Sel Alcatel Ag Noise reduction for speech recognition
DE4322372A1 (de) * 1993-07-06 1995-01-12 Sel Alcatel Ag Method and device for speech recognition
US6067517A (en) * 1996-02-02 2000-05-23 International Business Machines Corporation Transcription of speech data with segments from acoustically dissimilar environments
US6026359A (en) * 1996-09-20 2000-02-15 Nippon Telegraph And Telephone Corporation Scheme for model adaptation in pattern recognition based on Taylor expansion
US5950157A (en) * 1997-02-28 1999-09-07 Sri International Method for establishing handset-dependent normalizing models for speaker recognition
US6529872B1 (en) * 2000-04-18 2003-03-04 Matsushita Electric Industrial Co., Ltd. Method for noise adaptation in automatic speech recognition using transformed matrices

Also Published As

Publication number Publication date
US6876966B1 (en) 2005-04-05
JP4195211B2 (ja) 2008-12-10
EP1199708A2 (de) 2002-04-24
ATE347161T1 (de) 2006-12-15
EP1199708B1 (de) 2006-11-29
DE60124842T2 (de) 2007-04-12
JP2002140089A (ja) 2002-05-17
EP1199708A3 (de) 2003-10-15

Similar Documents

Publication Publication Date Title
DE60124842D1 (de) Noise-robust pattern recognition
ATE213086T1 (de) Method and device for speech coding
DE60222739D1 (de) Device and method for generating digital signals that each encode an analog signal value
DE60139877D1 (de) Part recognition data generation method and device, mounting device for electronic parts, and recording medium
DE60137162D1 (de) Device, method, and recording medium for comparing images
DE69807807T2 (de) Method and device for transmitting content information and related supplementary information
DE60143927D1 (de) Method and device for generating compact metadata files for transcoding hints
DE69625341T2 (de) Method and device for data coding and transmission over noisy media
DE60106506D1 (de) Video game device, method for performing actions of a game character in a video game, and corresponding computer-readable recording medium
DE60128270D1 (de) Method and system for generating speaker recognition data, and method and system for speaker recognition
ATE487212T1 (de) Hidden conditional random field models for phonetic classification and speech recognition
DE60138696D1 (de) Method and system for storing a coding pattern
DE50103752D1 (de) Method and transmit circuit for generating a transmit signal
DE69800320T2 (de) Method and device for speaker recognition by verifying spoken information using forced decoding
ATE412941T1 (de) Memory interface protocol for distinguishing status information from read data
ATE319160T1 (de) Method for noise-robust classification in speech coding
DE60235211D1 (de) Method for pre-suppressing noise in an image
DE60227308D1 (de) System, method, and device for determining the boundary of an information element
ATE450033T1 (de) Method for noise suppression
ATE381915T1 (de) Audio information transmission device and associated method
ATE286334T1 (de) Device for classifying complex signals with linear digital modulation
DE60325736D1 (de) Method and device for noise reduction in a sound signal
DE10194477D2 (de) Method for generating soft-bit information from Gray-coded signals
DE60114511D1 (de) Method and device for eliminating interference signals
DE69918793D1 (de) Method and device for sound recording and reproduction with a natural feeling of the sound field

Legal Events

Date Code Title Description
8364 No opposition during term of opposition