WO2001009877A3 - System and method for improving the accuracy of a speech recognition program

Info

Publication number
WO2001009877A3
WO2001009877A3 PCT/US2000/020467 US0020467W WO0109877A3 WO 2001009877 A3 WO2001009877 A3 WO 2001009877A3 US 0020467 W US0020467 W US 0020467W WO 0109877 A3 WO0109877 A3 WO 0109877A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech recognition
recognition program
accuracy
improving
speech
Application number
PCT/US2000/020467
Other languages
French (fr)
Other versions
WO2001009877A9 (en)
WO2001009877A2 (en)
Inventor
Jonathan Kahn
Thomas P Flynn
Charles Qin
Nicholas J Linden
James A Sells
Original Assignee
Custom Speech Usa Inc
Priority date
1999-07-28
Filing date
2000-07-27
Publication date
2004-10-28
Priority claimed from US09/362,255 (US6490558B1)
Priority claimed from US09/625,657 (US6704709B1)
Application filed by Custom Speech Usa Inc
Priority to EP00950784A (EP1509902A4)
Priority to NZ516956A (NZ516956A)
Priority to CA002380433A (CA2380433A1)
Priority to AU63835/00A (AU776890B2)
Publication of WO2001009877A2
Publication of WO2001009877A9
Publication of WO2001009877A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0631 Creating reference templates; Clustering

Abstract

A system and method for quickly improving the accuracy of a speech recognition program. The system is based on a speech recognition program that automatically converts a pre-recorded audio file into written text. The system parses the written text into segments, each of which is corrected by the system and saved in a retrievable manner in association with the computer. The standard speech files are saved towards improving accuracy in speech-to-text conversion by the speech recognition program. The system further includes facilities to repetitively establish an independent instance of the written text from the pre-recorded audio file using the speech recognition program. This independent instance can then be broken into segments, and each segment in said independent instance replaced with the corrected segment associated with it. In this manner, repetitive instruction of a speech recognition program can be facilitated. A system and method for directing pre-recorded audio files to a speech recognition program that does not accept such files is also disclosed. Such a system and method are necessary to use the system and method for quickly improving the accuracy of a speech recognition program with some pre-existing speech recognition programs.
PCT/US2000/020467 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program WO2001009877A2 (en)
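
The segment-and-replace loop described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Recognizer` type, `CorrectionStore`, `training_pass`, and `fake_recognizer` are hypothetical names, the recognizer is a canned stub rather than a real engine, and segments are matched by position only, which assumes every pass over the same audio yields the same segmentation.

```python
from typing import Callable, Dict, List

# Hypothetical stand-in for a speech recognition program: takes an audio file
# path and returns the recognized text already split into segments.
Recognizer = Callable[[str], List[str]]


class CorrectionStore:
    """Keeps the corrected text for each segment, keyed by segment position."""

    def __init__(self) -> None:
        self._corrected: Dict[int, str] = {}

    def save(self, index: int, corrected_text: str) -> None:
        self._corrected[index] = corrected_text

    def apply(self, segments: List[str]) -> List[str]:
        # Replace each segment that has a stored correction; keep the rest.
        return [self._corrected.get(i, seg) for i, seg in enumerate(segments)]


def training_pass(recognize: Recognizer, audio_path: str,
                  store: CorrectionStore) -> List[str]:
    """Produce an independent instance of the text and substitute corrections."""
    fresh_segments = recognize(audio_path)
    return store.apply(fresh_segments)


def fake_recognizer(audio_path: str) -> List[str]:
    # Canned output standing in for a real engine; the path is ignored.
    return ["the quick brown fax", "jumps over", "the lazy dog"]


store = CorrectionStore()
store.save(0, "the quick brown fox")  # correction made after the first pass
result = training_pass(fake_recognizer, "dictation.wav", store)
```

Each call to `training_pass` re-runs recognition on the same recording and swaps in the saved corrections, so the loop can be repeated as many times as the training procedure requires without re-correcting segments by hand.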

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP00950784A EP1509902A4 (en) 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program
NZ516956A NZ516956A (en) 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program
CA002380433A CA2380433A1 (en) 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program
AU63835/00A AU776890B2 (en) 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US09/362,255 1999-07-28
US09/362,255 US6490558B1 (en) 1999-07-28 1999-07-28 System and method for improving the accuracy of a speech recognition program through repetitive training
US09/430,144 1999-10-29
US09/430,144 US6421643B1 (en) 1999-07-28 1999-10-29 Method and apparatus for directing an audio file to a speech recognition program that does not accept such files
US20887800P 2000-06-01 2000-06-01
US60/208,878 2000-06-01
US09/625,657 US6704709B1 (en) 1999-07-28 2000-07-26 System and method for improving the accuracy of a speech recognition program
US09/625,657 2000-07-26

Publications (3)

Publication Number Publication Date
WO2001009877A2 WO2001009877A2 (en) 2001-02-08
WO2001009877A9 WO2001009877A9 (en) 2002-07-11
WO2001009877A3 WO2001009877A3 (en) 2004-10-28

Family

ID=27498742

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/020467 WO2001009877A2 (en) 1999-07-28 2000-07-27 System and method for improving the accuracy of a speech recognition program

Country Status (5)

Country Link
EP (1) EP1509902A4 (en)
AU (1) AU776890B2 (en)
CA (1) CA2380433A1 (en)
NZ (1) NZ516956A (en)
WO (1) WO2001009877A2 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2885247B1 (en) * 2005-04-27 2007-08-31 Marc Bendayan SPEECH RECOGNITION EQUIPMENT.
US8521510B2 (en) * 2006-08-31 2013-08-27 At&T Intellectual Property Ii, L.P. Method and system for providing an automated web transcription service
JP2012189930A (en) 2011-03-14 2012-10-04 Seiko Epson Corp Projector
CN112329926A (en) * 2020-11-30 2021-02-05 珠海采筑电子商务有限公司 Quality improvement method and system for intelligent robot

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4914704A (en) * 1984-10-30 1990-04-03 International Business Machines Corporation Text editor for speech input
US4994966A (en) * 1988-03-31 1991-02-19 Emerson & Stern Associates, Inc. System and method for natural language parsing by initiating processing prior to entry of complete sentences
US5712957A (en) * 1995-09-08 1998-01-27 Carnegie Mellon University Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists
US5883986A (en) * 1995-06-02 1999-03-16 Xerox Corporation Method and system for automatic transcription correction

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2986345B2 (en) * 1993-10-18 1999-12-06 インターナショナル・ビジネス・マシーンズ・コーポレイション Voice recording indexing apparatus and method
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
GB9709341D0 (en) * 1997-05-08 1997-06-25 British Broadcasting Corp Method of and apparatus for editing audio or audio-visual recordings
US6353809B2 (en) * 1997-06-06 2002-03-05 Olympus Optical, Ltd. Speech recognition with text generation from portions of voice data preselected by manual-input commands
US6064957A (en) * 1997-08-15 2000-05-16 General Electric Company Improving speech recognition through text-based linguistic post-processing

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1509902A4 *

Also Published As

Publication number Publication date
AU6383500A (en) 2001-02-19
WO2001009877A9 (en) 2002-07-11
AU776890B2 (en) 2004-09-23
EP1509902A4 (en) 2005-08-17
WO2001009877A2 (en) 2001-02-08
CA2380433A1 (en) 2001-02-08
EP1509902A2 (en) 2005-03-02
NZ516956A (en) 2004-11-26

Similar Documents

Publication Publication Date Title
AP2001002243A0 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction.
WO2004003688A8 (en) A method for comparing a transcribed text file with a previously created file
US7881930B2 (en) ASR-aided transcription with segmented feedback training
EP0899719A3 (en) Method for aligning text with audio signals
AU2002211438A1 (en) Language independent voice-based search system
US6704709B1 (en) System and method for improving the accuracy of a speech recognition program
DE60211197D1 (en) METHOD AND DEVICE FOR THE CONVERSION OF SPOKEN TEXTS AND CORRECTION OF THE RECOGNIZED TEXTS
AU2002214658A1 (en) Speech recognition using word-in-phrase command
AU2003299312A1 (en) Text-to-speech method and system, computer program product therefor
WO2003005258A3 (en) Method of providing an account information and method of and device for transcribing of dictations
EP1050872A3 (en) Method and system for selecting recognized words when correcting recognized speech
EP2453436A3 (en) Automatic language model update
DE69635655D1 (en) SPEAKER-ADAPTED SPEECH RECOGNITION
EP0841655A3 (en) Method and system for buffering recognized words during speech recognition
EP0840288A3 (en) Method and system for editing phrases during continuous speech recognition
WO2004086359A3 (en) System for speech recognition and correction, correction device and method for creating a lexicon of alternatives
WO2002061730A8 (en) Syntax-driven, operator assisted voice recognition system and methods
DE60128816D1 (en) SPEECH RECOGNITION METHOD WITH REPLACEMENT COMMAND
WO2006023631A3 (en) Document transcription system training
EP1556855A4 (en) Method and system for text editing in hand-held electronic device
WO2008042119A3 (en) System and method for integrating voice with a medical device
EP0749109A3 (en) Speech recognition for tonal languages
ATE363120T1 (en) AUDIO DIALOGUE SYSTEM AND VOICE-CONTROLLED BROWSING PROCESS
WO2006040727A3 (en) A system and a method of processing audio data to generate reverberation
AU2002220661A1 (en) Method and device for generating an adapted reference for automatic speech recognition

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 2004-01-01)
WWE WIPO information: entry into national phase

Ref document number: 2380433

Country of ref document: CA

WWE WIPO information: entry into national phase

Ref document number: 2002/00904

Country of ref document: ZA

Ref document number: 516956

Country of ref document: NZ

Ref document number: 200200904

Country of ref document: ZA

Ref document number: IN/PCT/2002/160/KOL

Country of ref document: IN

WWE WIPO information: entry into national phase

Ref document number: 2000950784

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 63835/00

Country of ref document: AU

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

AK Designated states

Kind code of ref document: C2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 1-17, DESCRIPTION, REPLACED BY NEW PAGES 1-17; PAGES 18-27, CLAIMS, REPLACED BY NEW PAGES 18-27; PAGES 1/12-12/12, DRAWINGS, REPLACED BY NEW PAGES 1/12-12/12; PAGES 1-4, SEQUENCE LISTING, REPLACED BY NEW PAGES 1-13; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

NENP Non-entry into the national phase

Ref country code: JP

WWP WIPO information: published in national office

Ref document number: 516956

Country of ref document: NZ

WWG WIPO information: grant in national office

Ref document number: 63835/00

Country of ref document: AU

WWP WIPO information: published in national office

Ref document number: 2000950784

Country of ref document: EP

WWG WIPO information: grant in national office

Ref document number: 516956

Country of ref document: NZ

WWW WIPO information: withdrawn in national office

Ref document number: 2000950784

Country of ref document: EP