WO2004023455A3 - Methods, systems, and programming for performing speech recognition - Google Patents
Methods, systems, and programming for performing speech recognition Download PDFInfo
- Publication number
- WO2004023455A3 WO2004023455A3 PCT/US2002/028590 US0228590W WO2004023455A3 WO 2004023455 A3 WO2004023455 A3 WO 2004023455A3 US 0228590 W US0228590 W US 0228590W WO 2004023455 A3 WO2004023455 A3 WO 2004023455A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech recognition
- recognition
- combination
- speech
- vocabulary
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
Abstract
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2002336458A AU2002336458A1 (en) | 2002-09-06 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
KR1020057003948A KR100996212B1 (en) | 2002-09-06 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
US10/227,653 US7225130B2 (en) | 2001-09-05 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
PCT/US2002/028590 WO2004023455A2 (en) | 2002-09-06 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
JP2004533998A JP2006515073A (en) | 2002-09-06 | 2002-09-06 | Method, system, and programming for performing speech recognition |
EP02773307A EP1604350A4 (en) | 2002-09-06 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
CNA028298519A CN1864204A (en) | 2002-09-06 | 2002-09-06 | Methods, systems and programming for performing speech recognition |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/227,653 US7225130B2 (en) | 2001-09-05 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
PCT/US2002/028590 WO2004023455A2 (en) | 2002-09-06 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2004023455A2 WO2004023455A2 (en) | 2004-03-18 |
WO2004023455A3 true WO2004023455A3 (en) | 2006-03-23 |
Family
ID=32684400
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2002/028590 WO2004023455A2 (en) | 2001-09-05 | 2002-09-06 | Methods, systems, and programming for performing speech recognition |
Country Status (2)
Country | Link |
---|---|
US (1) | US7225130B2 (en) |
WO (1) | WO2004023455A2 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7809574B2 (en) | 2001-09-05 | 2010-10-05 | Voice Signal Technologies Inc. | Word recognition using choice lists |
CN105120195A (en) * | 2015-09-18 | 2015-12-02 | 谷鸿林 | Content recording and reproducing system and method |
Families Citing this family (226)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7406084B2 (en) * | 1997-09-19 | 2008-07-29 | Nokia Siemens Networks Gmbh & Co. Kg | Flexible software architecture for a call processing system |
US7712053B2 (en) | 1998-12-04 | 2010-05-04 | Tegic Communications, Inc. | Explicit character filtering of ambiguous text entry |
US7679534B2 (en) | 1998-12-04 | 2010-03-16 | Tegic Communications, Inc. | Contextual prediction of user words and user actions |
US7319957B2 (en) * | 2004-02-11 | 2008-01-15 | Tegic Communications, Inc. | Handwriting and voice input with automatic correction |
US7881936B2 (en) * | 1998-12-04 | 2011-02-01 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US7720682B2 (en) * | 1998-12-04 | 2010-05-18 | Tegic Communications, Inc. | Method and apparatus utilizing voice input to resolve ambiguous manually entered text input |
US8938688B2 (en) | 1998-12-04 | 2015-01-20 | Nuance Communications, Inc. | Contextual prediction of user words and user actions |
US20100122164A1 (en) * | 1999-12-03 | 2010-05-13 | Tegic Communications, Inc. | Contextual prediction of user words and user actions |
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US7716058B2 (en) * | 2001-09-05 | 2010-05-11 | Voice Signal Technologies, Inc. | Speech recognition using automatic recognition turn off |
WO2004023455A2 (en) * | 2002-09-06 | 2004-03-18 | Voice Signal Technologies, Inc. | Methods, systems, and programming for performing speech recognition |
US7505911B2 (en) * | 2001-09-05 | 2009-03-17 | Roth Daniel L | Combined speech recognition and sound recording |
US7467089B2 (en) * | 2001-09-05 | 2008-12-16 | Roth Daniel L | Combined speech and handwriting recognition |
US7313526B2 (en) | 2001-09-05 | 2007-12-25 | Voice Signal Technologies, Inc. | Speech recognition using selectable recognition modes |
US7577569B2 (en) * | 2001-09-05 | 2009-08-18 | Voice Signal Technologies, Inc. | Combined speech recognition and text-to-speech generation |
US7634403B2 (en) * | 2001-09-05 | 2009-12-15 | Voice Signal Technologies, Inc. | Word recognition using word transformation commands |
US7526431B2 (en) * | 2001-09-05 | 2009-04-28 | Voice Signal Technologies, Inc. | Speech recognition using ambiguous or phone key spelling and/or filtering |
US8583440B2 (en) * | 2002-06-20 | 2013-11-12 | Tegic Communications, Inc. | Apparatus and method for providing visual indication of character ambiguity during text entry |
US20040078792A1 (en) * | 2002-10-21 | 2004-04-22 | Microsoft Corporation | System and method for selectively deactivating auto-deploy functionality of a software input panel |
EP1431958B1 (en) * | 2002-12-16 | 2018-07-18 | Sony Mobile Communications Inc. | Apparatus connectable to or incorporating a device for generating speech, and computer program product therefor |
JP2004226741A (en) * | 2003-01-23 | 2004-08-12 | Nissan Motor Co Ltd | Information providing device |
WO2004077404A1 (en) * | 2003-02-21 | 2004-09-10 | Voice Signal Technologies, Inc. | Method of producing alternate utterance hypotheses using auxilia ry information on close competitors |
US7729913B1 (en) * | 2003-03-18 | 2010-06-01 | A9.Com, Inc. | Generation and selection of voice recognition grammars for conducting database searches |
US7263483B2 (en) * | 2003-04-28 | 2007-08-28 | Dictaphone Corporation | USB dictation device |
JP2004341778A (en) * | 2003-05-15 | 2004-12-02 | Sharp Corp | Electronic mail browsing device and electronic mail editing device |
US20040243415A1 (en) * | 2003-06-02 | 2004-12-02 | International Business Machines Corporation | Architecture for a speech input method editor for handheld portable devices |
WO2005027093A1 (en) * | 2003-09-11 | 2005-03-24 | Voice Signal Technologies, Inc. | Generation of an alternative pronunciation |
US8136050B2 (en) * | 2003-11-21 | 2012-03-13 | Nuance Communications, Inc. | Electronic device and user interface and input method therefor |
JP4012143B2 (en) * | 2003-12-16 | 2007-11-21 | キヤノン株式会社 | Information processing apparatus and data input method |
US20050192802A1 (en) * | 2004-02-11 | 2005-09-01 | Alex Robinson | Handwriting and voice input with automatic correction |
TW200538969A (en) * | 2004-02-11 | 2005-12-01 | America Online Inc | Handwriting and voice input with automatic correction |
US8954325B1 (en) * | 2004-03-22 | 2015-02-10 | Rockstar Consortium Us Lp | Speech recognition in automated information services systems |
US20060041926A1 (en) * | 2004-04-30 | 2006-02-23 | Vulcan Inc. | Voice control of multimedia content |
US8095364B2 (en) * | 2004-06-02 | 2012-01-10 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US20070182595A1 (en) * | 2004-06-04 | 2007-08-09 | Firooz Ghasabian | Systems to enhance data entry in mobile and fixed environment |
DE102004035244A1 (en) * | 2004-07-21 | 2006-02-16 | Givemepower Gmbh | Computer aided design system has a facility to enter drawing related information as audio input |
US7124951B2 (en) * | 2004-07-30 | 2006-10-24 | Symbol Technologies, Inc. | Programmable keypad for sized optimized device |
US7418387B2 (en) * | 2004-11-24 | 2008-08-26 | Microsoft Corporation | Generic spelling mnemonics |
US7966310B2 (en) * | 2004-11-24 | 2011-06-21 | At&T Intellectual Property I, L.P. | Method, system, and software for correcting uniform resource locators |
US20060122837A1 (en) * | 2004-12-08 | 2006-06-08 | Electronics And Telecommunications Research Institute | Voice interface system and speech recognition method |
US8200828B2 (en) | 2005-01-14 | 2012-06-12 | Citrix Systems, Inc. | Systems and methods for single stack shadowing |
US8340130B2 (en) | 2005-01-14 | 2012-12-25 | Citrix Systems, Inc. | Methods and systems for generating playback instructions for rendering of a recorded computer session |
US20060159432A1 (en) | 2005-01-14 | 2006-07-20 | Citrix Systems, Inc. | System and methods for automatic time-warped playback in rendering a recorded computer session |
US8935316B2 (en) | 2005-01-14 | 2015-01-13 | Citrix Systems, Inc. | Methods and systems for in-session playback on a local machine of remotely-stored and real time presentation layer protocol data |
US8145777B2 (en) | 2005-01-14 | 2012-03-27 | Citrix Systems, Inc. | Method and system for real-time seeking during playback of remote presentation protocols |
US7831728B2 (en) | 2005-01-14 | 2010-11-09 | Citrix Systems, Inc. | Methods and systems for real-time seeking during real-time playback of a presentation layer protocol data stream |
US8230096B2 (en) | 2005-01-14 | 2012-07-24 | Citrix Systems, Inc. | Methods and systems for generating playback instructions for playback of a recorded computer session |
US8296441B2 (en) | 2005-01-14 | 2012-10-23 | Citrix Systems, Inc. | Methods and systems for joining a real-time session of presentation layer protocol data |
US9224391B2 (en) * | 2005-02-17 | 2015-12-29 | Nuance Communications, Inc. | Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system |
US7805300B2 (en) | 2005-03-21 | 2010-09-28 | At&T Intellectual Property Ii, L.P. | Apparatus and method for analysis of language model changes |
US20060225037A1 (en) * | 2005-03-30 | 2006-10-05 | Microsoft Corporation | Enabling UI template customization and reuse through parameterization |
US20060224962A1 (en) * | 2005-03-30 | 2006-10-05 | Microsoft Corporation | Context menu navigational method for accessing contextual and product-wide choices via remote control |
US8214754B2 (en) | 2005-04-15 | 2012-07-03 | Microsoft Corporation | Registration of applications and complimentary features for interactive user interfaces |
CN102272827B (en) * | 2005-06-01 | 2013-07-10 | 泰吉克通讯股份有限公司 | Method and apparatus utilizing voice input to resolve ambiguous manually entered text input |
GB2428853A (en) * | 2005-07-22 | 2007-02-07 | Novauris Technologies Ltd | Speech recognition application specific dictionary |
NZ589382A (en) * | 2005-06-16 | 2012-03-30 | Keyless Systems Ltd | Data Entry System |
US20070016420A1 (en) * | 2005-07-07 | 2007-01-18 | International Business Machines Corporation | Dictionary lookup for mobile devices using spelling recognition |
US20070047708A1 (en) * | 2005-08-31 | 2007-03-01 | Motorola, Inc. | Voice call reply using voice recognition and text to speech |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US8191008B2 (en) | 2005-10-03 | 2012-05-29 | Citrix Systems, Inc. | Simulating multi-monitor functionality in a single monitor environment |
US7590536B2 (en) * | 2005-10-07 | 2009-09-15 | Nuance Communications, Inc. | Voice language model adjustment based on user affinity |
US20070100619A1 (en) * | 2005-11-02 | 2007-05-03 | Nokia Corporation | Key usage and text marking in the context of a combined predictive text and speech recognition system |
US8417185B2 (en) | 2005-12-16 | 2013-04-09 | Vocollect, Inc. | Wireless headset and method for robust voice data communication |
US20070150477A1 (en) * | 2005-12-22 | 2007-06-28 | International Business Machines Corporation | Validating a uniform resource locator ('URL') in a document |
US7773767B2 (en) | 2006-02-06 | 2010-08-10 | Vocollect, Inc. | Headset terminal with rear stability strap |
US7885419B2 (en) * | 2006-02-06 | 2011-02-08 | Vocollect, Inc. | Headset terminal with speech functionality |
JP4734155B2 (en) * | 2006-03-24 | 2011-07-27 | 株式会社東芝 | Speech recognition apparatus, speech recognition method, and speech recognition program |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
KR100814641B1 (en) | 2006-10-23 | 2008-03-18 | 성균관대학교산학협력단 | User driven voice service system and method thereof |
CN101589428B (en) * | 2006-12-28 | 2012-07-04 | 三菱电机株式会社 | Vehicle-mounted voice recognition apparatus |
JP4672686B2 (en) * | 2007-02-16 | 2011-04-20 | 株式会社デンソー | Voice recognition device and navigation device |
US8719723B2 (en) * | 2007-03-05 | 2014-05-06 | Microsoft Corporation | Displaying data sensitive targets |
US20090030697A1 (en) * | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Using contextual information for delivering results generated from a speech recognition facility using an unstructured language model |
US20090030687A1 (en) * | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Adapting an unstructured language model speech recognition system based on usage |
US20080221884A1 (en) * | 2007-03-07 | 2008-09-11 | Cerra Joseph P | Mobile environment speech processing facility |
US8886540B2 (en) | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Using speech recognition results based on an unstructured language model in a mobile communication facility application |
US20090030685A1 (en) * | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Using speech recognition results based on an unstructured language model with a navigation system |
US20110054898A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Multiple web-based content search user interface in mobile search application |
US20080221880A1 (en) * | 2007-03-07 | 2008-09-11 | Cerra Joseph P | Mobile music environment speech processing facility |
US20090030688A1 (en) * | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Tagging speech recognition results based on an unstructured language model for use in a mobile communication facility application |
US20110054899A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Command and control utilizing content information in a mobile voice-to-speech application |
US8949266B2 (en) | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Multiple web-based content category searching in mobile search application |
US8949130B2 (en) * | 2007-03-07 | 2015-02-03 | Vlingo Corporation | Internal and external speech recognition use with a mobile communication facility |
US8838457B2 (en) * | 2007-03-07 | 2014-09-16 | Vlingo Corporation | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility |
US20110054896A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Sending a communications header with voice recording to send metadata for use in speech recognition and formatting in mobile dictation application |
US8635243B2 (en) * | 2007-03-07 | 2014-01-21 | Research In Motion Limited | Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application |
US20110054895A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Utilizing user transmitted text to improve language model in mobile dictation application |
US10056077B2 (en) * | 2007-03-07 | 2018-08-21 | Nuance Communications, Inc. | Using speech recognition results based on an unstructured language model with a music system |
US8886545B2 (en) * | 2007-03-07 | 2014-11-11 | Vlingo Corporation | Dealing with switch latency in speech recognition |
US20110054897A1 (en) * | 2007-03-07 | 2011-03-03 | Phillips Michael S | Transmitting signal quality information in mobile dictation application |
US20110060587A1 (en) * | 2007-03-07 | 2011-03-10 | Phillips Michael S | Command and control utilizing ancillary information in a mobile voice-to-speech application |
US8515757B2 (en) | 2007-03-20 | 2013-08-20 | Nuance Communications, Inc. | Indexing digitized speech with words represented in the digitized speech |
US20080312935A1 (en) * | 2007-06-18 | 2008-12-18 | Mau Ii Frederick W | Media device with speech recognition and method for using same |
WO2009027980A1 (en) * | 2007-08-28 | 2009-03-05 | Yissum Research Development Company Of The Hebrew University Of Jerusalem | Method, device and system for speech recognition |
SG152092A1 (en) * | 2007-10-26 | 2009-05-29 | Creative Tech Ltd | Wireless handheld device able to accept text input and methods for inputting text on a wireless handheld device |
JP5024154B2 (en) * | 2008-03-27 | 2012-09-12 | 富士通株式会社 | Association apparatus, association method, and computer program |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US9230222B2 (en) * | 2008-07-23 | 2016-01-05 | The Quantum Group, Inc. | System and method enabling bi-translation for improved prescription accuracy |
USD605629S1 (en) | 2008-09-29 | 2009-12-08 | Vocollect, Inc. | Headset |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US8160287B2 (en) | 2009-05-22 | 2012-04-17 | Vocollect, Inc. | Headset with adjustable headband |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US8775183B2 (en) * | 2009-06-12 | 2014-07-08 | Microsoft Corporation | Application of user-specified transformations to automatic speech recognition results |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US8438659B2 (en) | 2009-11-05 | 2013-05-07 | Vocollect, Inc. | Portable computing device and headset interface |
EP2339576B1 (en) | 2009-12-23 | 2019-08-07 | Google LLC | Multi-modal input on an electronic device |
US11416214B2 (en) | 2009-12-23 | 2022-08-16 | Google Llc | Multi-modal input on an electronic device |
US20110184736A1 (en) * | 2010-01-26 | 2011-07-28 | Benjamin Slotznick | Automated method of recognizing inputted information items and selecting information items |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US20110246194A1 (en) * | 2010-03-30 | 2011-10-06 | Nvoq Incorporated | Indicia to indicate a dictation application is capable of receiving audio |
US8352245B1 (en) | 2010-12-30 | 2013-01-08 | Google Inc. | Adjusting language models |
US8296142B2 (en) | 2011-01-21 | 2012-10-23 | Google Inc. | Speech recognition using dock context |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US9009041B2 (en) * | 2011-07-26 | 2015-04-14 | Nuance Communications, Inc. | Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US8615159B2 (en) | 2011-09-20 | 2013-12-24 | Citrix Systems, Inc. | Methods and systems for cataloging text in a recorded session |
US8255218B1 (en) * | 2011-09-26 | 2012-08-28 | Google Inc. | Directing dictation into input fields |
US9620122B2 (en) * | 2011-12-08 | 2017-04-11 | Lenovo (Singapore) Pte. Ltd | Hybrid speech recognition |
US10019991B2 (en) * | 2012-05-02 | 2018-07-10 | Electronics And Telecommunications Research Institute | Apparatus and method for speech recognition |
US20130298071A1 (en) * | 2012-05-02 | 2013-11-07 | Jonathan WINE | Finger text-entry overlay |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
CN102722525A (en) * | 2012-05-15 | 2012-10-10 | 北京百度网讯科技有限公司 | Methods and systems for establishing language model of address book names and searching voice |
US9721563B2 (en) * | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9400639B2 (en) | 2012-06-22 | 2016-07-26 | Microsoft Technology Licensing, Llc | Generating programs using context-free compositions and probability of determined transformation rules |
US10354650B2 (en) | 2012-06-26 | 2019-07-16 | Google Llc | Recognizing speech with mixed speech recognition models to generate transcriptions |
KR101399777B1 (en) * | 2012-07-27 | 2014-05-27 | 한국과학기술원 | Voice recognition supporting method and system for improving an voice recognition ratio |
KR102150289B1 (en) * | 2012-08-30 | 2020-09-01 | 삼성전자주식회사 | User interface appratus in a user terminal and method therefor |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
KR101330671B1 (en) | 2012-09-28 | 2013-11-15 | 삼성전자주식회사 | Electronic device, server and control methods thereof |
US8543397B1 (en) | 2012-10-11 | 2013-09-24 | Google Inc. | Mobile device voice activation |
US9026428B2 (en) * | 2012-10-15 | 2015-05-05 | Nuance Communications, Inc. | Text/character input system, such as for use with touch screens on mobile phones |
KR20140054643A (en) * | 2012-10-29 | 2014-05-09 | 삼성전자주식회사 | Voice recognition apparatus and voice recogniton method |
US8947220B2 (en) * | 2012-10-31 | 2015-02-03 | GM Global Technology Operations LLC | Speech recognition functionality in a vehicle through an extrinsic device |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
WO2014200728A1 (en) | 2013-06-09 | 2014-12-18 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
WO2014205078A2 (en) * | 2013-06-21 | 2014-12-24 | Larman Mark | Inline audio recording device |
US20160004502A1 (en) * | 2013-07-16 | 2016-01-07 | Cloudcar, Inc. | System and method for correcting speech input |
US9305554B2 (en) * | 2013-07-17 | 2016-04-05 | Samsung Electronics Co., Ltd. | Multi-level speech recognition |
US9842592B2 (en) | 2014-02-12 | 2017-12-12 | Google Inc. | Language models using non-linguistic context |
JP6427755B2 (en) | 2014-02-24 | 2018-11-28 | パナソニックIpマネジメント株式会社 | Data input device, data input method, program, and in-vehicle device |
US9412365B2 (en) | 2014-03-24 | 2016-08-09 | Google Inc. | Enhanced maximum entropy models |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9536521B2 (en) * | 2014-06-30 | 2017-01-03 | Xerox Corporation | Voice recognition |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US10199034B2 (en) | 2014-08-18 | 2019-02-05 | At&T Intellectual Property I, L.P. | System and method for unified normalization in text-to-speech and automatic speech recognition |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
CN104267922B (en) * | 2014-09-16 | 2019-05-31 | 联想(北京)有限公司 | A kind of information processing method and electronic equipment |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US10582046B2 (en) * | 2014-12-30 | 2020-03-03 | Harman International Industries, Incorporated | Voice recognition-based dialing |
CN105869631B (en) * | 2015-01-21 | 2019-08-23 | 上海羽扇智信息科技有限公司 | The method and apparatus of voice prediction |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US10134394B2 (en) | 2015-03-20 | 2018-11-20 | Google Llc | Speech recognition using log-linear model |
CN104822093B (en) | 2015-04-13 | 2017-12-19 | 腾讯科技(北京)有限公司 | Barrage dissemination method and device |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US10048842B2 (en) | 2015-06-15 | 2018-08-14 | Google Llc | Selection biasing |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9978367B2 (en) | 2016-03-16 | 2018-05-22 | Google Llc | Determining dialog states for language models |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US10832664B2 (en) | 2016-08-19 | 2020-11-10 | Google Llc | Automated speech recognition using language models that selectively use domain-specific model components |
US10186263B2 (en) * | 2016-08-30 | 2019-01-22 | Lenovo Enterprise Solutions (Singapore) Pte. Ltd. | Spoken utterance stop event other than pause or cessation in spoken utterances stream |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
CN106371400B (en) * | 2016-10-31 | 2019-02-15 | 成都乐创自动化技术股份有限公司 | Method and device is chosen in a kind of teaching track |
CN106792140A (en) * | 2016-12-12 | 2017-05-31 | 合肥华耀广告传媒有限公司 | A kind of broadcasted TV ads monitoring system based on voice recognition |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US10311860B2 (en) | 2017-02-14 | 2019-06-04 | Google Llc | Language model biasing system |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | Far-field extension for digital assistant services |
US10607606B2 (en) | 2017-06-19 | 2020-03-31 | Lenovo (Singapore) Pte. Ltd. | Systems and methods for execution of digital assistant |
US10503467B2 (en) * | 2017-07-13 | 2019-12-10 | International Business Machines Corporation | User interface sound emanation activity classification |
KR20190138915A (en) | 2018-06-07 | 2019-12-17 | 현대자동차주식회사 | Voice recognition apparatus, vehicle having the same and control method for the vehicle |
CN110874176B (en) * | 2018-08-29 | 2024-03-29 | 斑马智行网络(香港)有限公司 | Interaction method, storage medium, operating system and device |
JP7159756B2 (en) * | 2018-09-27 | 2022-10-25 | 富士通株式会社 | Audio playback interval control method, audio playback interval control program, and information processing device |
CN111027793B (en) * | 2019-03-27 | 2023-06-30 | 广东小天才科技有限公司 | Method and system for determining word mastering degree and electronic equipment |
KR102243325B1 (en) * | 2019-09-11 | 2021-04-22 | 넷마블 주식회사 | Computer programs for providing startup language recognition technology |
CN112040326A (en) * | 2020-08-24 | 2020-12-04 | 深圳创维-Rgb电子有限公司 | Bullet screen control method and system, television and storage medium |
CN113192501B (en) * | 2021-04-12 | 2022-04-22 | 青岛信芯微电子科技股份有限公司 | Instruction word recognition method and device |
CN114173188B (en) * | 2021-10-18 | 2023-06-02 | 深圳追一科技有限公司 | Video generation method, electronic device, storage medium and digital person server |
US11880645B2 (en) | 2022-06-15 | 2024-01-23 | T-Mobile Usa, Inc. | Generating encoded text based on spoken utterances using machine learning systems and methods |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5208897A (en) * | 1990-08-21 | 1993-05-04 | Emerson & Stern Associates, Inc. | Method and apparatus for speech recognition based on subsyllable spellings |
US5632002A (en) * | 1992-12-28 | 1997-05-20 | Kabushiki Kaisha Toshiba | Speech recognition interface system suitable for window systems and speech mail systems |
US5754972A (en) * | 1992-03-06 | 1998-05-19 | Dragon Systems, Inc. | Speech recognition system for languages with compound words |
US5974413A (en) * | 1997-07-03 | 1999-10-26 | Activeword Systems, Inc. | Semantic user interface |
US5983179A (en) * | 1992-11-13 | 1999-11-09 | Dragon Systems, Inc. | Speech recognition system which turns its voice response on for confirmation when it has been turned off without confirmation |
US6266642B1 (en) * | 1999-01-29 | 2001-07-24 | Sony Corporation | Method and portable apparatus for performing spoken language translation |
Family Cites Families (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58211792A (en) | 1982-06-04 | 1983-12-09 | 日産自動車株式会社 | Voice recognition equipment for vehicle |
US4829576A (en) | 1986-10-21 | 1989-05-09 | Dragon Systems, Inc. | Voice recognition system |
US5206897A (en) * | 1991-08-02 | 1993-04-27 | Noel Goudreau | Home incarceration system |
US5734789A (en) | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder |
US6092043A (en) | 1992-11-13 | 2000-07-18 | Dragon Systems, Inc. | Apparatuses and method for training and operating speech recognition systems |
US5903864A (en) | 1995-08-30 | 1999-05-11 | Dragon Systems | Speech recognition |
US5819225A (en) * | 1996-05-30 | 1998-10-06 | International Business Machines Corporation | Display indications of speech processing states in speech recognition system |
US6374219B1 (en) * | 1997-09-19 | 2002-04-16 | Microsoft Corporation | System for using silence in speech recognition |
US5987410A (en) | 1997-11-10 | 1999-11-16 | U.S. Philips Corporation | Method and device for recognizing speech in a spelling mode including word qualifiers |
US6285980B1 (en) | 1998-11-02 | 2001-09-04 | Lucent Technologies Inc. | Context sharing of similarities in context dependent word models |
US6324510B1 (en) | 1998-11-06 | 2001-11-27 | Lernout & Hauspie Speech Products N.V. | Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains |
US6342903B1 (en) * | 1999-02-25 | 2002-01-29 | International Business Machines Corp. | User selectable input devices for speech applications |
US6224383B1 (en) | 1999-03-25 | 2001-05-01 | Planetlingo, Inc. | Method and system for computer assisted natural language instruction with distracters |
US6434523B1 (en) | 1999-04-23 | 2002-08-13 | Nuance Communications | Creating and editing grammars for speech recognition graphically |
US6581033B1 (en) | 1999-10-19 | 2003-06-17 | Microsoft Corporation | System and method for correction of speech recognition mode errors |
US6694296B1 (en) * | 2000-07-20 | 2004-02-17 | Microsoft Corporation | Method and apparatus for the recognition of spelled spoken words |
US6925154B2 (en) * | 2001-05-04 | 2005-08-02 | International Business Machines Corproation | Methods and apparatus for conversational name dialing systems |
US7369997B2 (en) | 2001-08-01 | 2008-05-06 | Microsoft Corporation | Controlling speech recognition functionality in a computing device |
US7634403B2 (en) | 2001-09-05 | 2009-12-15 | Voice Signal Technologies, Inc. | Word recognition using word transformation commands |
US7809574B2 (en) | 2001-09-05 | 2010-10-05 | Voice Signal Technologies Inc. | Word recognition using choice lists |
US7505911B2 (en) | 2001-09-05 | 2009-03-17 | Roth Daniel L | Combined speech recognition and sound recording |
US7526431B2 (en) | 2001-09-05 | 2009-04-28 | Voice Signal Technologies, Inc. | Speech recognition using ambiguous or phone key spelling and/or filtering |
US7467089B2 (en) | 2001-09-05 | 2008-12-16 | Roth Daniel L | Combined speech and handwriting recognition |
US7313526B2 (en) | 2001-09-05 | 2007-12-25 | Voice Signal Technologies, Inc. | Speech recognition using selectable recognition modes |
US7716058B2 (en) | 2001-09-05 | 2010-05-11 | Voice Signal Technologies, Inc. | Speech recognition using automatic recognition turn off |
US7444286B2 (en) | 2001-09-05 | 2008-10-28 | Roth Daniel L | Speech recognition using re-utterance recognition |
WO2004023455A2 (en) | 2002-09-06 | 2004-03-18 | Voice Signal Technologies, Inc. | Methods, systems, and programming for performing speech recognition |
US7577569B2 (en) | 2001-09-05 | 2009-08-18 | Voice Signal Technologies, Inc. | Combined speech recognition and text-to-speech generation |
US7246060B2 (en) | 2001-11-06 | 2007-07-17 | Microsoft Corporation | Natural input recognition system and method using a contextual mapping engine and adaptive user bias |
JP3885002B2 (en) | 2002-06-28 | 2007-02-21 | キヤノン株式会社 | Information processing apparatus and method |
WO2004015543A2 (en) | 2002-08-09 | 2004-02-19 | Recare, Inc. | Method and system for context-sensitive recognition of human input |
-
2002
- 2002-09-06 WO PCT/US2002/028590 patent/WO2004023455A2/en active Application Filing
- 2002-09-06 US US10/227,653 patent/US7225130B2/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5208897A (en) * | 1990-08-21 | 1993-05-04 | Emerson & Stern Associates, Inc. | Method and apparatus for speech recognition based on subsyllable spellings |
US5754972A (en) * | 1992-03-06 | 1998-05-19 | Dragon Systems, Inc. | Speech recognition system for languages with compound words |
US5983179A (en) * | 1992-11-13 | 1999-11-09 | Dragon Systems, Inc. | Speech recognition system which turns its voice response on for confirmation when it has been turned off without confirmation |
US5632002A (en) * | 1992-12-28 | 1997-05-20 | Kabushiki Kaisha Toshiba | Speech recognition interface system suitable for window systems and speech mail systems |
US5974413A (en) * | 1997-07-03 | 1999-10-26 | Activeword Systems, Inc. | Semantic user interface |
US6266642B1 (en) * | 1999-01-29 | 2001-07-24 | Sony Corporation | Method and portable apparatus for performing spoken language translation |
Non-Patent Citations (1)
Title |
---|
See also references of EP1604350A4 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7809574B2 (en) | 2001-09-05 | 2010-10-05 | Voice Signal Technologies Inc. | Word recognition using choice lists |
CN105120195A (en) * | 2015-09-18 | 2015-12-02 | 谷鸿林 | Content recording and reproducing system and method |
CN105120195B (en) * | 2015-09-18 | 2019-04-26 | 谷鸿林 | Content recordal, playback system and method |
Also Published As
Publication number | Publication date |
---|---|
US20040267528A9 (en) | 2004-12-30 |
US20040049388A1 (en) | 2004-03-11 |
US7225130B2 (en) | 2007-05-29 |
WO2004023455A2 (en) | 2004-03-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2004023455A3 (en) | Methods, systems, and programming for performing speech recognition | |
US9092435B2 (en) | System and method for extraction of meta data from a digital media storage device for media selection in a vehicle | |
ES2295025T3 (en) | USER INTERFACE TALKED FOR DEVICES ENABLED THROUGH THE VOICE. | |
US7490039B1 (en) | Text to speech system and method having interactive spelling capabilities | |
Hawley et al. | A speech-controlled environmental control system for people with severe dysarthria | |
Greenberg | On the origins of speech intelligibility in the real world | |
US7624018B2 (en) | Speech recognition using categories and speech prefixing | |
EP1096472A3 (en) | Audio playback of a multi-source written document | |
JP2003022089A (en) | Voice spelling of audio-dedicated interface | |
JP2006146182A (en) | Centralized method and system for clarifying voice command | |
WO2002097590A2 (en) | Language independent and voice operated information management system | |
WO2004036939A1 (en) | Portable digital mobile communication apparatus, method for controlling speech and system | |
WO2004034377A3 (en) | Apparatus, methods and programming for speech synthesis via bit manipulations of compressed data base | |
KR20030044899A (en) | Method and apparatus for a voice controlled foreign language translation device | |
KR101819459B1 (en) | Voice recognition system and apparatus supporting voice recognition error correction | |
US7349844B2 (en) | Minimizing resource consumption for speech recognition processing with dual access buffering | |
US20030216915A1 (en) | Voice command and voice recognition for hand-held devices | |
WO2019239656A1 (en) | Information processing device and information processing method | |
CN110767233A (en) | Voice conversion system and method | |
KR20050015585A (en) | Apparatus And Method for Enhanced Voice Recognition | |
JPS597998A (en) | Continuous voice recognition equipment | |
WO2000046787A3 (en) | System and method for automating transcription services | |
WO2000022609A1 (en) | Speech recognition and control system and telephone | |
CN109218843B (en) | Personalized intelligent voice prompt method based on television equipment | |
US20070118381A1 (en) | Voice control methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2004533998 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020057003948 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2002773307 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 20028298519 Country of ref document: CN |
|
WWP | Wipo information: published in national office |
Ref document number: 2002773307 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1020057003948 Country of ref document: KR |