DE69722980D1 - Transcription of speech data with segments from acoustically dissimilar environments - Google Patents

Transcription of speech data with segments from acoustically dissimilar environments

Publication number
DE69722980D1
Authority
DE
Germany
Prior art keywords
segments
recording
voice data
different environments
acoustically different
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69722980T
Other languages
English (en)
Other versions
DE69722980T2 (de)
Inventor
Lalit Rai Bahl
Ponani Gopalakrishnan
Ramesh Ambat Gopinath
Stephane Herman Maes
Mukund Padmanabhan
Lazaros Polymenakos
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp
Application granted
Publication of DE69722980D1
Publication of DE69722980T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
DE69722980T 1996-02-02 1997-01-17 Transcription of speech data with segments from acoustically dissimilar environments Expired - Lifetime DE69722980T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/595,722 US6067517A (en) 1996-02-02 1996-02-02 Transcription of speech data with segments from acoustically dissimilar environments
US595722 1996-02-02

Publications (2)

Publication Number Publication Date
DE69722980D1 (de) 2003-07-31
DE69722980T2 (de) 2004-05-19

Family

ID=24384411

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69722980T Expired - Lifetime DE69722980T2 (de) 1996-02-02 1997-01-17 Transcription of speech data with segments from acoustically dissimilar environments

Country Status (3)

Country Link
US (1) US6067517A (de)
EP (1) EP0788090B1 (de)
DE (1) DE69722980T2 (de)

Families Citing this family (74)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6683697B1 (en) * 1991-03-20 2004-01-27 Millenium L.P. Information processing methodology
US5258855A (en) * 1991-03-20 1993-11-02 System X, L. P. Information processing methodology
US5897616A (en) 1997-06-11 1999-04-27 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US6377921B1 (en) * 1998-06-26 2002-04-23 International Business Machines Corporation Identifying mismatches between assumed and actual pronunciations of words
US6260014B1 (en) * 1998-09-14 2001-07-10 International Business Machines Corporation Specific task composite acoustic models
US6324510B1 (en) * 1998-11-06 2001-11-27 Lernout & Hauspie Speech Products N.V. Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains
US7006967B1 (en) * 1999-02-05 2006-02-28 Custom Speech Usa, Inc. System and method for automating transcription services
JP2000259198A (ja) * 1999-03-04 2000-09-22 Sony Corp Pattern recognition apparatus and method, and providing medium
US6577999B1 (en) * 1999-03-08 2003-06-10 International Business Machines Corporation Method and apparatus for intelligently managing multiple pronunciations for a speech recognition vocabulary
US6332122B1 (en) 1999-06-23 2001-12-18 International Business Machines Corporation Transcription system for multiple speakers, using and establishing identification
EP1116219B1 (de) * 1999-07-01 2005-03-16 Koninklijke Philips Electronics N.V. Robust speech processing from noisy speech models
US7689416B1 (en) * 1999-09-29 2010-03-30 Poirier Darrell A System for transferring personalize matter from one computer to another
US7016835B2 (en) * 1999-10-29 2006-03-21 International Business Machines Corporation Speech and signal digitization by using recognition metrics to select from multiple techniques
US6834308B1 (en) * 2000-02-17 2004-12-21 Audible Magic Corporation Method and apparatus for identifying media content presented on a media playing device
US20020055844A1 (en) * 2000-02-25 2002-05-09 L'esperance Lauren Speech user interface for portable personal devices
CA2417926C (en) 2000-07-31 2013-02-12 Eliza Corporation Method of and system for improving accuracy in a speech recognition system
US6990446B1 (en) * 2000-10-10 2006-01-24 Microsoft Corporation Method and apparatus using spectral addition for speaker recognition
US7003455B1 (en) * 2000-10-16 2006-02-21 Microsoft Corporation Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech
US6876966B1 (en) * 2000-10-16 2005-04-05 Microsoft Corporation Pattern recognition training method and apparatus using inserted noise followed by noise reduction
JP3467469B2 (ja) * 2000-10-31 2003-11-17 NEC Electronics Corporation Speech decoding apparatus and recording medium on which a speech decoding program is recorded
US6605768B2 (en) * 2000-12-06 2003-08-12 Matsushita Electric Industrial Co., Ltd. Music-signal compressing/decompressing apparatus
US6826241B2 (en) 2001-02-21 2004-11-30 Motorola, Inc. Apparatus and method for filtering maximum-length-code signals in a spread spectrum communication system
US6985858B2 (en) * 2001-03-20 2006-01-10 Microsoft Corporation Method and apparatus for removing noise from feature vectors
WO2002082271A1 (en) 2001-04-05 2002-10-17 Audible Magic Corporation Copyright detection and protection system and method
US7953219B2 (en) * 2001-07-19 2011-05-31 Nice Systems, Ltd. Method apparatus and system for capturing and analyzing interaction based content
US8972481B2 (en) 2001-07-20 2015-03-03 Audible Magic, Inc. Playlist generation method and apparatus
US6959276B2 (en) * 2001-09-27 2005-10-25 Microsoft Corporation Including the category of environmental noise when processing speech signals
US7117148B2 (en) 2002-04-05 2006-10-03 Microsoft Corporation Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
US20050228661A1 (en) * 2002-05-06 2005-10-13 Josep Prous Blancafort Voice recognition method
US7319959B1 (en) * 2002-05-14 2008-01-15 Audience, Inc. Multi-source phoneme classification for noise-robust automatic speech recognition
US20030220784A1 (en) * 2002-05-24 2003-11-27 International Business Machines Corporation System and method for automated voice message transcription and delivery
US20040006628A1 (en) * 2002-07-03 2004-01-08 Scott Shepard Systems and methods for providing real-time alerting
US20040021765A1 (en) * 2002-07-03 2004-02-05 Francis Kubala Speech recognition system for managing telemeetings
US20040006748A1 (en) * 2002-07-03 2004-01-08 Amit Srivastava Systems and methods for providing online event tracking
US7424427B2 (en) * 2002-10-17 2008-09-09 Verizon Corporate Services Group Inc. Systems and methods for classifying audio into broad phoneme classes
US8335683B2 (en) * 2003-01-23 2012-12-18 Microsoft Corporation System for using statistical classifiers for spoken language understanding
WO2004090870A1 (ja) * 2003-04-04 2004-10-21 Kabushiki Kaisha Toshiba Method and apparatus for encoding or decoding wideband speech
US7596494B2 (en) * 2003-11-26 2009-09-29 Microsoft Corporation Method and apparatus for high resolution speech reconstruction
US7725314B2 (en) * 2004-02-16 2010-05-25 Microsoft Corporation Method and apparatus for constructing a speech filter using estimates of clean speech and noise
DE102004017486A1 (de) * 2004-04-08 2005-10-27 Siemens Ag Method for noise reduction in a speech input signal
US8204884B2 (en) * 2004-07-14 2012-06-19 Nice Systems Ltd. Method, apparatus and system for capturing and analyzing interaction based content
US20060288402A1 (en) * 2005-06-20 2006-12-21 Nokia Corporation Security component for dynamic properties framework
GB2430073A (en) * 2005-09-08 2007-03-14 Univ East Anglia Analysis and transcription of music
CN1949364B (zh) * 2005-10-12 2010-05-05 Industrial Technology Research Institute Pre-stage detection system and method for speech recognition
US20070239444A1 (en) * 2006-03-29 2007-10-11 Motorola, Inc. Voice signal perturbation for speech recognition
KR100883652B1 (ko) * 2006-08-03 2009-02-18 Samsung Electronics Co., Ltd. Method and apparatus for detecting speech segments, and speech recognition system using the same
US7885813B2 (en) * 2006-09-29 2011-02-08 Verint Systems Inc. Systems and methods for analyzing communication sessions
US8006314B2 (en) 2007-07-27 2011-08-23 Audible Magic Corporation System for identifying content of digital data
US8219404B2 (en) * 2007-08-09 2012-07-10 Nice Systems, Ltd. Method and apparatus for recognizing a speaker in lawful interception systems
US8249870B2 (en) * 2008-11-12 2012-08-21 Massachusetts Institute Of Technology Semi-automatic speech transcription
EP2364495B1 (de) * 2008-12-10 2016-10-12 Agnitio S.L. Method for verifying the identity of a speaker and related computer-readable medium and computer
US20110137656A1 (en) * 2009-09-11 2011-06-09 Starkey Laboratories, Inc. Sound classification system for hearing aids
US8554562B2 (en) * 2009-11-15 2013-10-08 Nuance Communications, Inc. Method and system for speaker diarization
US8718290B2 (en) 2010-01-26 2014-05-06 Audience, Inc. Adaptive noise reduction using level cues
US9378754B1 (en) 2010-04-28 2016-06-28 Knowles Electronics, Llc Adaptive spatial classifier for multi-microphone systems
US9564148B2 (en) * 2010-05-18 2017-02-07 Sprint Communications Company L.P. Isolation and modification of audio streams of a mixed signal in a wireless communication device
US8812310B2 (en) * 2010-08-22 2014-08-19 King Saud University Environment recognition of audio input
US8442825B1 (en) 2011-08-16 2013-05-14 The United States Of America As Represented By The Director, National Security Agency Biomimetic voice identifier
KR101482148B1 (ko) * 2011-12-23 2015-01-14 KT Corporation Server for generating group mapping data using personalized pronunciation sequences, speech recognition server, and method
US9026065B2 (en) * 2012-03-21 2015-05-05 Raytheon Company Methods and apparatus for resource sharing for voice and data interlacing
US9081778B2 (en) 2012-09-25 2015-07-14 Audible Magic Corporation Using digital fingerprints to associate data with a work
US9508345B1 (en) 2013-09-24 2016-11-29 Knowles Electronics, Llc Continuous voice sensing
US9953634B1 (en) 2013-12-17 2018-04-24 Knowles Electronics, Llc Passive training for automatic speech recognition
US9437188B1 (en) 2014-03-28 2016-09-06 Knowles Electronics, Llc Buffered reprocessing for multi-microphone automatic speech recognition assist
US9858922B2 (en) 2014-06-23 2018-01-02 Google Inc. Caching speech recognition scores
US9299347B1 (en) 2014-10-22 2016-03-29 Google Inc. Speech recognition using associative mapping
US9786270B2 (en) 2015-07-09 2017-10-10 Google Inc. Generating acoustic models
US10229672B1 (en) 2015-12-31 2019-03-12 Google Llc Training acoustic models using connectionist temporal classification
US20180018973A1 (en) 2016-07-15 2018-01-18 Google Inc. Speaker verification
US10706840B2 (en) 2017-08-18 2020-07-07 Google Llc Encoder-decoder models for sequence to sequence mapping
US10586529B2 (en) * 2017-09-14 2020-03-10 International Business Machines Corporation Processing of speech signal
CN109545229B (zh) * 2019-01-11 2023-04-21 South China University of Technology Speaker recognition method based on feature-space trajectories of speech samples
KR20200140571A (ko) * 2019-06-07 2020-12-16 Samsung Electronics Co., Ltd. Data recognition method and apparatus
CN111883159A (zh) * 2020-08-05 2020-11-03 Longma Zhixin (Zhuhai Hengqin) Technology Co., Ltd. Speech processing method and device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4430726A (en) * 1981-06-18 1984-02-07 Bell Telephone Laboratories, Incorporated Dictation/transcription method and arrangement
EP0559349B1 (de) * 1992-03-02 1999-01-07 AT&T Corp. Training method and apparatus for speech recognition
US5333275A (en) * 1992-06-23 1994-07-26 Wheatley Barbara J System and method for time aligning speech
DE69423838T2 (de) * 1993-09-23 2000-08-03 Xerox Corp Semantic co-occurrence filtering for speech recognition and signal transcription applications
JP2986345B2 (ja) * 1993-10-18 1999-12-06 International Business Machines Corporation Speech recording indexing apparatus and method
US5625748A (en) * 1994-04-18 1997-04-29 Bbn Corporation Topic discriminator using posterior probability or confidence scores

Also Published As

Publication number Publication date
EP0788090A3 (de) 1998-08-19
US6067517A (en) 2000-05-23
EP0788090B1 (de) 2003-06-25
DE69722980T2 (de) 2004-05-19
EP0788090A2 (de) 1997-08-06

Similar Documents

Publication Publication Date Title
DE69722980D1 (de) Transcription of speech data with segments from acoustically dissimilar environments
DE69638102D1 (de) Data recording media
DE69434923D1 (de) Recording medium
DE69717358D1 (de) Information recording medium
DE69739630D1 (de) Information recording medium
DE69429558D1 (de) Audio data processing
DE69834761D1 (de) Magnetic recording medium
DE69535043D1 (de) Playback of recording media
DE69516484T2 (de) Information recording and/or reproducing apparatus
DE69429602D1 (de) Information recording medium
DE69605707D1 (de) Recording medium
DE69420749D1 (de) Playback of recording media
DE69526178D1 (de) Data recording medium
DE69515943D1 (de) Information recording medium
DE69608489T2 (de) Magnetic recording medium
DE69836238D1 (de) Magnetic recording medium
DE19782243T1 (de) Magnetic recording medium
DE69608835T3 (de) Magnetic recording medium
DE69817697D1 (de) Magnetic recording medium
DE69806302D1 (de) Magnetic recording medium
DE69617279T2 (de) Data recording and reproduction
DE69813100T2 (de) Magnetic recording medium
DE69821699D1 (de) Magnetic recording medium
DE69818211D1 (de) Magnetic recording medium
DE69802855T2 (de) Magnetic recording medium

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8320 Willingness to grant licences declared (paragraph 23)
8328 Change in the person/name/address of the agent

Representative's name: DUSCHER, R., DIPL.-PHYS. DR.RER.NAT., PAT.-ANW., 7