CN102144209B - 电子设备中的多层次话音反馈 - Google Patents
电子设备中的多层次话音反馈 Download PDFInfo
- Publication number
- CN102144209B CN102144209B CN200980134444.1A CN200980134444A CN102144209B CN 102144209 B CN102144209 B CN 102144209B CN 200980134444 A CN200980134444 A CN 200980134444A CN 102144209 B CN102144209 B CN 102144209B
- Authority
- CN
- China
- Prior art keywords
- queue
- electronic equipment
- elements
- speech
- aloud
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/0482—Interaction with lists of selectable items, e.g. menus
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72469—User interfaces specially adapted for cordless or mobile telephones for operating the device by selecting functions from two or more displayed items, e.g. menus or icons
Abstract
本发明旨在向电子设备的用户提供话音反馈。由于每个电子设备显示画面可包括数个朗读元素(即,为其提供话音反馈的元素),元素可被排序。为此,电子设备可使层次与每个朗读元素的显示相关联。电子设备随后可根据相关联的层次,提供所显示的朗读元素的话音反馈。为了降低设计话音反馈系统中的复杂性,话音反馈特征可被结合到用于向用户显示内容的模型-视图-控制器(MVC)设计中。例如,MVC设计的模型和视图可包括与朗读属性相关联的附加变量。电子设备可利用任何适当的方法,接收每个朗读元素的音频文件,例如包括通过向主设备提供朗读元素的列表和指令主设备的文本到语音引擎产生和提供音频文件。
Description
技术领域
本公开旨在在电子设备中提供多层次话音反馈。
背景技术
许多电子设备提供用户可以使用的大量功能或操作。可用功能或操作的数目常常超过利用电子设备的输入机构可以得到的输入的数目。为了使用户可以使用并未特别依赖特定输入(例如,不与按键序列或按钮按压相关联的输入,诸如可从Apple Inc.获得的iPod上的MENU按钮)的电子设备操作,电子设备可提供具有可选选项的菜单,其中所述选项与电子设备操作相关联。例如,响应于从输入机构(例如,MENU按钮)接收到与菜单相关联的输入,电子设备可在显示器上显示具有可选选项的菜单。
由于菜单一般被显示在电子设备显示器上,因此可能要求用户看着显示器来选择特定选项。有时这并不可取。例如,如果用户希望节约电力(例如,便携式电子设备中的电力),那么要求电子设备显示菜单并移动用户操纵的突出显示区域以提供选择会耗尽电力。作为另一个例子,如果用户在阴暗的环境中,并且显示器不包括背光,那么用户不能区分菜单的显示选项。作为又一个例子,如果用户是盲人,或者视觉受损,那么用户不能观看显示的菜单。
为了克服这个问题,一些系统可以响应于检测到用户的输入或者电池状态的变化,提供音频反馈,如在共同转让的美国专利公布No.2008/0129520、标题为“ELECTRONIC DEVICE WITH ENHANCEDAUDIO FEEDBACK”(代理案号No.P4250US1)中所述,该专利公布在此整体引为参考。在一些情况下,电子设备可提供话音反馈,该话音反馈描述用户可选择的选项或者用户可指令电子设备执行的操作。如果同时显示数个菜单,或者如果显示器包括不同的模块或显示区(例如,数个视图),那么电子设备难以确定为其提供话音反馈的对象或菜单选项,或者对象或菜单选项的顺序。
发明内容
本发明旨在提供用于向用户提供多层次(multi-tiered)话音反馈的系统和方法。特别地,本发明旨在按照预定顺序(例如,基于与每个显示对象相关联的层次),提供数个显示对象(例如,菜单项)的话音反馈。
在一些实施例中,可以提供用于向电子设备的用户提供话音反馈的方法、电子设备和计算机可读介质。电子设备可显示数个元素,并识别所述元素中为其提供话音反馈的至少两个元素。电子设备可确定与每个所识别元素的显示相关联的层次,其中所述层次定义每个所显示元素的相对重要性。电子设备随后可按照所确定层次的顺序,提供所识别元素的话音反馈,例如,以使得首先对最重要的元素提供话音反馈,随后对第二重要的元素提供话音反馈,直到对每个元素提供了话音反馈为止。
在一些实施例中,可以提供为显示内容提供音频反馈的方法、电子设备和计算机可读介质。电子设备可指令显示器显示数个元素,其中朗读属性与所述元素中的至少两个元素相关联。电子设备可确定与所述至少两个元素中的每一个相关联的层次,并产生包括所述至少两个元素的队列。所确定的层次可设定元素在所产生的队列中的顺序。电子设备可指令音频输出端按照队列的顺序,依次朗读每个队列元素,其中音频输出端包括与所述至少两个元素中的每一个相关联的话音反馈。
在一些实施例中,可以提供用于朗读由电子设备显示的元素的文本的方法、电子设备和计算机可读介质。电子设备可显示与朗读属性相关联的数个元素。朗读属性可识别每个元素的要朗读的文本。电子设备可把数个元素显示在数个视图中,其中每个视图与朗读顺序相关联。电子设备可产生包括所述数个元素的队列,其中队列中的元素的顺序是根据每个视图的朗读顺序设定的(例如,以使得具有较高朗读顺序的元素在队列的头部)。电子设备可等待第一超时时间过去,和识别与队列的每个元素相关联的音频文件。在第一超时时间之内,电子设备可修改音频重放,以使语音更易于听到,和在检测到事务的时候,避免电子设备朗读。音频文件可包括每个元素的要朗读的口头朗读属性文本。电子设备可按照队列的顺序,顺序重放所识别的音频文件,并暂停第二超时时间。第二超时时间允许电子设备使音频重放返回到朗读前的配置(例如,音乐重放)。在一些实施例中,电子设备可从主设备接收音频文件,所述主设备根据每个元素的要朗读的朗读属性文本,利用文本到语音引擎产生所述音频文件。
附图说明
根据结合附图进行的下述详细说明,本发明的上述及其它特征、本发明的本质和各种优点将更明显,其中:
图1是按照本发明的一个实施例的电子设备的示意图;
图2是按照本发明的一个实施例的具有对其可应用话音反馈的内容的说明性显示屏幕的示意图;
图3是按照本发明的一个实施例的与图2的显示画面相关联的用于重放的朗读项目的说明性队列的示意图;
图4是按照本发明的一个实施例的在接收到用户对图2的显示画面的选项的选择之后电子设备显示画面的示意图;
图5是按照本发明的一个实施例的与图4的显示画面相关联的用于重放的朗读项目的说明性队列的示意图;
图6是按照本发明的一个实施例的具有不同标记选项的图4的电子设备显示画面的示意图;
图7是按照本发明的一个实施例的与图6的显示画面相关联的用于重放的朗读项目的说明性队列的示意图;
图8是按照本发明的一个实施例的响应于用户选择图6的突出显示菜单选项而提供的电子设备显示画面的示意图;
图9是按照本发明的一个实施例的与图8的显示画面相关联的用于重放的朗读项目的说明性队列的示意图;
图10是按照本发明的一个实施例的说明性“现在播放”显示画面的示意图;
图11是按照本发明的一个实施例的用于现在播放显示画面的朗读项目的说明性队列的示意图;
图12是按照本发明的一个实施例的用于朗读出朗读字符串(speakable string)的说明性状态图;
图13是按照本发明的一个实施例的包括电子设备和主设备的说明性通信系统的示意图;
图14是用于向电子设备提供静态字符串的说明性处理的流程图;和
图15是用于向电子设备提供动态字符串的说明性处理的流程图。
具体实施方式
提供了一种基于与所显示的选项相关联的各层次来提供选择性话音反馈的电子设备。
电子设备可以包括处理器和显示器。电子设备可向用户显示任何适当的信息。例如,显示器可包括标题栏、具有可选选项的菜单、显示与一个或多个选项相关联的信息的信息区、识别可供选择的媒体或文件的信息或者任何其它适当的信息。当用户使用显示器时,电子设备可提供不同显示元素的话音反馈。
每个显示元素可以与不同的属性相关联。在一些实施例中,将为其提供话音反馈的显示元素可以与朗读属性相关联。朗读属性可包括相关联元素的要朗读的文本。另外,作为为显示每个元素而实现的视图的一部分,该元素可以与朗读顺序或层次相关联。当电子设备显示元素(例如,作为视图的一部分)时,电子设备可根据朗读属性和朗读顺序,确定为其提供话音反馈的文本(例如,要朗读的文本)和与每个元素相关联的顺序或层次。电子设备可选择具有最高层次的元素,并为所选择的要素提供话音反馈(例如,朗读)。电子设备随后可相继选择具有次高层次的每个元素,并按照层次顺序,为后续的各元素提供话音反馈(例如,利用其中依据与每个元素相关联的层次来设定元素的顺序的队列)。当电子设备提供话音反馈时,不包括朗读属性或朗读顺序的元素(例如,不为其提供话音反馈的元素)可被电子设备忽略或跳过。
电子设备可利用任何适当的方法,确定在特定时间要朗读出哪个元素。在一些实施例中,电子设备可响应于检测到事务(transaction)(例如,关于什么元素能够被朗读的决定),提供话音反馈。例如,响应于确定显示画面已转变,或者响应于接收到使显示画面改变的用户动作(例如,用户选择某一选项,或者移动突出显示区),电子设备可检测到事务。响应于检测到事务,电子设备可识别更新的显示画面的朗读元素,和与朗读元素(例如,事务内要顺序朗读出的元素)相关联的层次。电子设备随后可根据所识别的更新显示画面的元素,创建将为其提供话音反馈的元素的新队列,并根据新创建的队列提供话音反馈。在一些实施例中,可通过替换现有队列的不朗读的相同或较低层次的各个项目,构成新的队列。朗读的特定元素和朗读各个元素的顺序可随每个事务而变化。
可利用任何适当的方法,产生响应于接收到为所显示的特定元素提供话音反馈的指令而被重放的音频文件。在一些实施例中,为了利用文本-语音(TTS)引擎提供高质量音频,可从与电子设备连接的主设备接收音频文件。如果电子设备的资源有限(例如,由电子设备的便携性所导致的固有存储、处理和电力限制),那么这种方法特别可取。电子设备可向主设备提供列举与将由电子设备朗读出的每个元素相关联的字符串的文件。主设备随后可利用文本到语音引擎,把字符串转换成语音,并把语音的音频文件提供给电子设备。电子设备随后可参考字符串到音频文件的映射以响应于确定将提供所显示元素的话音反馈而提供供重放的恰当音频文件。
图1是按照本发明的一个实施例的电子设备的示意图。电子设备100可包括处理器102、存储装置104、存储器106、输入机构108、音频输出端110、显示器112和通信电路114。在一些实施例中,可以组合或省略一个或多个电子装置组件100(例如,可以组合存储装置104和内存106)。在一些实施例中,电子设备100可包括未被组合或包括在图1中所示的那些组件中的其它组件(例如,电源或总线),或者图1中所示组件的数个实例。为了简单起见,图1中只示出了每种组件中的一个。
处理器102可包括用于控制电子设备100的操作和性能的任何处理电路。例如,处理器102可被用于运行操作系统应用程序、固件应用程序、媒体重放应用程序、媒体编辑应用程序、或者任何其它应用程序。在一些实施例中,处理器可以驱动显示器和处理从用户接口接收的输入。
例如,存储装置104可包括一个或多个存储介质,所述存储介质包括硬盘驱动器、固态驱动器、闪速存储器、诸如ROM之类的永久性存储器、任何其它适当类型的存储组件或者它们的任意组合。例如,存储装置104可以保存媒体数据(例如,音乐和视频文件)、应用程序数据(例如,用于在设备100上实现各种功能)、固件、用户偏好信息数据(例如,媒体重放偏好)、认证信息(例如,与授权用户相关联的数据的库)、生活方式信息数据(例如,食物偏好)、健身信息数据(例如,由健身监测设备获得的信息)、事务信息数据(例如,诸如信用卡信息之类的信息)、无线连接信息数据(例如,使电子设备100能够建立无线连接的信息)、预约信息数据(例如,记录用户订阅的播客或电视节目或其它媒体的信息)、联系信息数据(例如,电话号码和电子邮件地址)、日历信息数据和任何其它适当的数据,或者它们的任意组合。
内存106可包括高速缓冲存储器,诸如RAM之类的半永久性存储器,和/或用于临时保存数据的一种或多种不同类型的存储器。在一些实施例中,内存106还可用于保存用于操作电子设备应用程序的数据,或者可被保存在存储装置104中的任何其它类型的数据。在一些实施例中,内存106和存储装置104可被结合成单个存储介质。
输入机构108可向电子设备的输入/输出电路提供输入。输入机构108可包括任何适当的输入机构,诸如,例如按钮、小键盘、标度盘(dial)、点触轮或者触摸屏。在一些实施例中,电子设备100可包括电容性感测机构,或者多触摸电容性感测机构。在共同拥有的申请日为2004年7月10日、题为“Gestures for Touch Sensitive Input Device”的美国专利申请No.10/902,964和申请日为2005年1月18日、题为“Mode-BasedGraphical User Interfaces for Touch Sensitive Input Device”的美国专利申请No.11/028,590中描述了一些感测机构,这两件专利申请均在此整体引为参考。
音频输出端110可包括内置到电子设备100中的一个或多个扬声器(例如,单声道扬声器或立体声扬声器),或者与音频输出机构耦接的音频连接器(例如,音频插孔或者适当的蓝牙连接)。例如,音频输出端110可利用有线或无线连接,向头戴式送受话器、头戴式受话器或者耳塞提供音频数据。
显示器112可包括用于提供用户可视的显示画面的显示电路(例如,屏幕或投影系统)。例如,显示器112可包括结合在电子设备100中的屏幕(例如,LCD屏幕)。作为另一个例子,显示器112可包括在远离电子设备100的表面上提供内容的显示的可移动显示器或者投影系统(例如,视频投影仪)。在一些实施例中,显示器112可包括编码器/解码器(编解码器),以把数字媒体数据转换成模拟信号。例如,显示器112(或者电子设备100内的其它适当的电路)可包括视频编解码器、音频编解码器或者任何其它适当类型的编解码器。
显示器112还可包括显示驱动器电路、用于驱动显示驱动器的电路或者这两者。在处理器102的指导下,显示器112可显示内容(例如,媒体重放信息,在电子设备上实现的应用程序的应用程序屏幕,关于进行中的通信操作的信息,关于到来的通信请求的信息,或者设备操作屏幕)。
输入机构108、音频输出端110和显示器112中的一个或多个可以与输入/输出电路耦接。输入/输出电路可把模拟信号和其它信号转换成数字数据(以及如果需要的话,对其进行编码/解码)。在一些实施例中,输入/输出电路还可把数字数据转换成任何其它类型的信号,反之亦然。例如,输入/输出电路可接收和转换物理接触输入(例如,来自多触摸屏幕),物理移动(例如,来自鼠标或传感器),模拟音频信号(例如,来自麦克风),或者任何其它输入。数字数据可被提供给处理器102、存储装置104、内存106、或者电子设备100的任何其它组件,或者可从处理器102、存储装置104、内存106、或者电子设备100的任何其它组件接收数字数据。在一些实施例中,在电子设备100中可以包括输入/输出电路的数个实例。
利用任何适当的通信协议,通信电路114可以与其它设备或者与一个或多个服务器通信。电子设备100可包括通信电路的一个或多个实例,以便利用不同的通信网络,同时进行数种通信操作。例如,通信电路可支持Wi-Fi(例如,802.11协议)、以太网、蓝牙TM(它是Bluetooth Sig,Inc.拥有的商标)、射频系统、蜂窝网络(例如,GSM、AMPS、GPRS、CDMA、EV-DQ、EDGE、3GSM、DECT、IS-136/TDMA、iDen、LTE或者任何其它适当的蜂窝网络或协议)、红外、TCP/IP(例如,在每个TCP/IP层中使用的任意协议)、HTTP、BitTorrent、FTP、RTP、RTSP、SSH、IP话音(VOIP)、任何其它通信协议或者它们的任意组合。在一些实施例中,通信电路114可包括在电子设备100和主设备之间提供有线通信链路的一个或多个通信端口。例如,便携式电子设备可包括接纳使便携式电子设备耦接到主计算机的电缆的一个或多个连接器(例如,30针连接器或USB连接器)。利用主计算机上的软件(例如,可从Apple Inc.获得的iTunes),便携式设备可与主计算机通信。
在一些实施例中,电子设备100可包括总线,所述总线提供用于向控制处理器102、存储装置104、内存106、输入/输出电路108、传感器110、和包括在电子设备中的任何其它组件传送数据,从上述组件传送数据,或者在上述组件之间传送数据的数据传送路径。
电子设备可提供任何适当的显示内容——包括,例如,菜单选项或者可供向用户重放的内容——的话音反馈(例如,诸如艺术家姓名、媒体标题或者专辑之类的与媒体相关联的元数据的话音反馈)。图2是按照本发明的一个实施例的具有对其可应用话音反馈的内容的说明性显示屏幕的示意图。显示画面200包括在其上显示内容的数个区域。例如,显示画面200可包括标题栏210、菜单220和附加信息230。标题栏210可包括指示电子设备所使用的模式或应用程序的标题212。例如,标题212可包括iPod(例如,当未选择任何应用程序时最上面的标题)、音乐、视频、照片、播客、其它(extras)和设置。可以得到其它标题,例如,当附属设备与电子设备耦接时(例如,无线电附件或健身附件)。标题栏210还可包括任何其它适当的信息,例如包括电池指示符214。
菜单220可包括数个可选选项222,例如包括用于选择模式或应用程序的选项,或者用于与所选择的特定模式或应用程序相关联的选项。通过把突出显示区224导航到某一选项上,用户可从菜单220中选择选项。当突出显示区在特定选项之上时,用户可提供选择指令(例如,通过按下按钮,或者提供任何其他适当的输入),以选择所述特定选项。附加信息230可包括任何适当的信息,例如包括与由标题212识别的模式或应用程序、一个或多个显示的选项222、由突出显示区224识别的特定选项相关联的信息,或者任何其它适当的信息。
电子设备可利用任何适当的方法,产生显示画面200或者任何其它显示画面。在一些实施例中,可以使用模型-视图-控制器(MVC)架构或设计。模型可包括与供控制器显示的视图联系的任何适当信息(例如,控制器可查询模型,以构成视图,或者修改视图的在运行时与模型的联系)。例如,模型可包括一个或多个字符串或图像。每个视图可被配置成显示(例如,支持)一种或多种类型的元素。视图可把所支持的类型传给get_Property调用,响应于此,模型可把与所支持类型相关联的数据提供给供设备显示的视图。可以组合数个视图,以形成每个显示画面。例如,显示画面200可包括显示画面的每个区域的至少一个视图。
为了使提供所显示内容的话音反馈更容易,电子设备可把话音反馈变量和设置结合到与内容的实际显示相关联的MVC架构中。在一些实施例中,模型可包括另外的朗读属性字段。朗读属性字段可包括为提供话音反馈所需的或者可用于提供话音反馈的任何适当信息。在一些实施例中,朗读属性字段可包括将提供话音反馈的指示(例如,切换设置)。电子设备可利用任何适当的方法,确定要朗读的文本。在一些实施例中,视图或调度系统可查询与视图相关联的类型的属性ID。在一些实施例中,可以替代地或另外地提供根据属性ID(例如,利用散列表)生成的固定大小的ID,以识别为其提供话音反馈的文本。在一些实施例中,朗读属性可以替代地或另外地包括将由电子设备朗读的一串文本,或者到具有要显示在模型中的文本的字段的指针。
电子设备可把层次或重要性包含在MVC架构的任何适当组件中,例如包括作为与每个视图相关联的朗读顺序变量。朗读顺序可提供显示在对应视图中的朗读元素——例如,可被显示的其它视图中的相关其它文本——的重要性的指示。例如,所述指示可包括语音的层次。电子设备可定义任何适当的朗读顺序或层次,例如包括上下文(例如,与菜单标题相关联)、焦点(例如,列表控制,诸如突出显示区位置)、选择(例如,与列表上的某一项目相关联的选项)、属性(例如,媒体的详细说明或歌词)、细节和空闲。根据模型或者显示在视图中的元素,每个视图可以与一个或多个层次或朗读顺序相关联。例如,如果在视图内同时显示菜单选项和相关联的设置(例如,背光选项224和设置226),那么视图可以与数个层次相关联。可替换地,可在不同的视图中提供菜单选项和设置。
如果一个或数个视图被显示成显示画面的一部分,那么电子设备可从模型取回要显示的元素,和显示所述元素的方式。另外,电子设备可从每个模型取回朗读属性,和从每个所显示的视图取回朗读顺序。电子设备可为显示画面的任何适当的朗读元素提供话音反馈。例如,电子设备可提供一个或多个视图的话音反馈。作为另一个例子,电子设备可为特定视图中的一个或多个元素提供话音反馈。在一些实施例中,电子设备可以为特定视图中的处于每个层次的仅仅一个元素提供话音反馈(例如,提供菜单220中的仅仅一个元素的话音反馈,在菜单220中每个选项与特定层次相关联)。
为了按照恰当的顺序提供所显示的朗读元素的话音反馈,电子设备的语音调度器可定义为其提供话音反馈的项目(例如,朗读项目)的队列,其中朗读顺序或层次设定各元素在队列中的顺序。电子设备可朗读所显示元素的任意适当组合。例如,电子设备可以仅仅朗读一个菜单项(例如,用突出显示区识别的菜单项)。作为另一个例子,电子设备可朗读数个菜单项(例如,跟在突出显示的菜单项之后的所有菜单项)。作为又一个例子,电子设备可以朗读所有菜单项。为了确保电子设备首先朗读用突出显示区识别的菜单项,电子设备可以把较高的层次或顺序与对应的菜单项联系起来。本讨论将互换地使用术语“朗读”朗读元素或字符串和“播放”与朗读元素或字符串相关联的“音频文件”来描述提供朗读元素的话音反馈。
在一些实施例中,语音调度器可以只把每个视图的每个层次的一个朗读元素包括在队列中。例如,这可为电子设备提供一种仅仅朗读被突出显示的菜单项的简易机制(例如,通过只把焦点层次分配给“音乐”菜单选项,仅仅朗读“音乐”,而不朗读菜单220中的其它项目)。如果在某一事务内,在给定层次的视图之内数个所显示的项目发生变化,那么语音调度器可以只把最近变化的项目放入队列中。为了提供与在单个事务中的相同朗读顺序相关联的数个项目的话音反馈,电子设备可把所述数个项目显示在与相同的朗读顺序相关联的不同视图中。语音调度器可使用任何适当的方法来提供具有相同层次(例如,下面更详细说明的现在播放显示画面中的空闲层次)的视图的不同元素的话音反馈。例如,语音调度器可遵循一个或多个资源文件中的元素的顺序、以视图的图形位置为基础的顺序、按字母顺序、或者利用任何适当的顺序。
图3是按照本发明的一个实施例的与图2的显示画面相关联的用于重放的朗读项目的说明性队列的示意图。可利用任何适当的方法描述队列300。在图3的例子中,队列300可包括要相继朗读的朗读字符串的列表310。作为视图的一部分,每个朗读字符串可以与在对应列340中识别的朗读层次相关联。利用来自显示画面200(图2)的元素,朗读字符串可包括具有上下文层次342的iPod字符串312和具有焦点层次343的音乐字符串313(例如,用突出显示区识别的菜单项是唯一被朗读的菜单项)。在其中朗读所有菜单项(例如,并不仅仅朗读用突出显示区识别的菜单项)的实现方式中,朗读字符串可包括例如全部具有选择层次(例如,在音乐字符串313的焦点层次之下的层次)的视频字符串、照片字符串、播客字符串、其它字符串、设置字符串、随机播放歌曲字符串和背光字符串。另外,由于背光选项可以和相关联的设置一起被显示,因此队列300还可包括在朗读了背光字符串之后朗读的、与属性层次相关联的On字符串。在其中只朗读突出显示选项的实现方式中,响应于检测到突出显示区已被放在菜单中的背光选项之上,电子设备可把焦点层次分配给背光字符串,并把选择层次分配给On字符串。电子设备可识别与每个朗读字符串相关联的音频文件(例如,利用散列或数据库),并按照由队列300设定的顺序,相继重放每个所识别的音频文件。
当电子设备显示画面上的内容发生变化时,电子设备可修改提供的话音反馈,以反映变化后的显示画面。图4是按照本发明的一个实施例的在接收到用户对图2的显示画面的选项的选择之后电子设备显示画面的示意图。类似于显示画面200(图2),显示画面400包括显示内容的数个区域。例如,显示画面400可包括标题栏410、菜单420和附加信息430。标题栏410可包括指示电子设备所使用的模式或应用程序的标题412。在图4的例子中,标题412可包括音乐,其指示所选择的菜单220(图2)的选项。
菜单420可包括数个可选选项422,例如,包括与所选择的特定模式或应用程序相关联的选项。通过把突出显示区424导航到某一选项之上,用户可以从菜单420中选择该选项。当突出显示区在特定选项之上时,用户可提供选择指令(例如,通过按下按钮,或者提供任何其它适当的输入),以选择所述特定选项。在图4的例子中,选项422可包括Cover Flow(封面流)、播放列表、艺术家、专辑、歌曲、流派、作曲家、有声读物和搜索。附加信息430可包括任何适当的信息,例如包括与由标题412识别的模式或应用程序、一个或多个所显示的选项422、由突出显示区424识别的特定选项相关联的信息,或者任何其它适当的信息。
响应于确定所显示的内容已发生变化(例如,响应检测到事务),语音调度器可更新或修改为显示画面提供话音反馈的朗读项目的队列。例如,语音调度器可确定与改变后的显示画面的每个视图相关联的朗读属性,从而产生所述队列。图5是按照本发明的一个实施例的与图4的显示画面相关联的用于重放的朗读项目的说明性队列的示意图。可以使用任何适当的方法来描述队列500。在图5的例子中,队列500包括要相继朗读的朗读字符串的列表510。作为视图的一部分,每个朗读字符串可与在对应栏540中识别的朗读层次相关联。利用来自显示画面400(图4)的元素,朗读字符串可包括具有上下文层次542的音乐字符串512,和具有焦点层次543的Cover Flow字符串513(例如,用突出显示区识别的菜单选项)。在朗读所有菜单选项的实现方式中,队列500可包括例如全部具有选择层次(例如,在Cover Flow字符串513的焦点层次543之下的层次)的播放列表字符串、艺术家字符串、专辑字符串、歌曲字符串、流派字符串、作曲家、有声读物字符串和搜索字符串。电子设备可识别与每个朗读字符串相关联的音频文件(例如,利用散列或数据库),并按照由队列500设定的顺序,相继重放每个所识别的音频文件。
在一些实施例中,当所显示的内容保持不变,但是用户控制的标记(例如,突出显示区)变化时,电子设备提供的话音反馈会变化。这使用户可以识别响应于在用户移动标记的时候,用户对由标记识别的选项的选择,将执行的动作。图6是按照本发明的一个实施例的具有不同标记选项的图4的电子设备显示画面的示意图。类似于显示画面400(图4),显示画面600包括显示内容的数个区域。例如,显示画面600包括标题栏610、菜单620和附加信息630。标题栏610包括指示电子设备所使用的模式或应用程序的标题612,它可以是和显示画面400相同的模式(例如,音乐)。
菜单620可包括和显示画面400相同的可选选项622。如图6中所示,用户已把突出显示区624导航到艺术家选项(例如,而不是如显示画面400中的Cover Flow选项)之上。所显示的附加信息630可包括任何适当的信息,例如包括与由标题612识别的模式或应用程序、一个或多个所显示的选项622、由突出显示区624识别的特定选项相关联的信息,或者任何其它适当的信息。在图4和6的例子中,所显示的辅助信息可以不同,从而反映突出显示区624的位置。
响应于确定突出显示区的位置已发生变化(例如,响应于检测到事务),语音调度器可更新为显示画面提供话音反馈的朗读项目的队列。例如,语音调度器可确定与改变后的显示画面的每个视图相关联的修改、变更或更新后的朗读属性,以产生所述队列。图7是按照本发明的一个实施例的与图6的显示画面相关联的用于重放的朗读项目的说明性队列的示意图。可利用任何适当的方法描述队列700。在图7的例子中,队列700包括要相继朗读的朗读字符串的列表710。作为视图的一部分,每个朗读字符串与在对应列740中识别的朗读层次相关联。利用来自显示画面600(图6)的元素,朗读字符串可包括具有上下文层次742的音乐字符串712,和具有焦点层次743的艺术家字符串713(例如,用突出显示区识别的菜单选项)。特别地,队列700中的朗读字符串的列表可不同于队列500(图5)中的朗读字符串的列表,以反映突出显示区被下移到艺术家选项。例如,可从队列700中除去将会在队列700之前的队列500中朗读的朗读字符串。电子设备可识别与每个朗读字符串相关联的音频文件(例如,利用散列或数据库),并按照由队列700设置的顺序,相继重放每个所识别的音频文件。在其中提供非突出显示菜单选项的话音反馈的实现方式中,队列700可包括例如全部具有选择层次(例如,在艺术家字符串713的焦点层次743之下的层次)的专辑字符串、歌曲字符串、流派字符串、作曲家、有声读物字符串、搜索字符串、Cover Flow字符串和播放列表字符串。可按照任何适当的方式对其它菜单选项排序,例如包括作为从由突出显示区识别的菜单项开始的重复列表。
响应于检测到事务,电子设备可重放朗读选项音频文件的任意部分。在一些实施例中,如果在用户提供访问显示画面400的指令时电子设备开始重放与显示画面200相关联的音频文件,或者在用户把突出显示区移动到在显示画面600中反映的位置的时候,电子设备开始重放与显示画面400的朗读字符串相关联的音频文件,那么电子设备可根据与音频文件相关联的层次和/或朗读项目的语音调度器队列的修改,有选择地停止重放该音频文件或者继续重放该音频文件。在一些实施例中,语音调度器首先确定更新后的队列,并比较初始队列和更新后的队列。特别地,语音调度器可从队列的开始,确定初始队列和更新队列的保持相同的各个部分,和更新队列中朗读元素的顺序开始发生变化的位置。例如,当语音调度器从队列300移动到队列500时,语音调度器可确定这两个队列并不共有任何共有的朗读字符串,于是从初始位置就不同。作为另一个例子,当语音调度器从队列500移动到队列700时,语音调度器可确定这两个队列共有与上下文层次相关联的朗读字符串,不过从与焦点层次相关联的朗读字符串开始变得不同。
语音调度器还可确定目前对其提供音频的朗读字符串分别在初始队列和更新队列(如果有的话)上的位置。例如,当语音调度器从队列500移动到队列700时,语音调度器可确定为其重放音频文件的朗读字符串是朗读字符串“音乐”(例如,由队列500和700共有的朗读字符串),还是不同的朗读字符串(例如,不为队列500和700共有的朗读字符串)。如果语音调度器确定目前朗读的朗读字符串属于由初始队列和更新队列共有的朗读字符串,那么语音调度器可继续朗读或重放与朗读字符串相关联的音频,随后继续按照由更新队列设定的顺序,重放与更新队列的朗读字符串相关联的音频。例如,如果当用户使显示画面从显示画面400改变成显示画面600时,电子设备正在重放与朗读字符串“音乐”(它具有上下文层次)相关联的音频,那么当电子设备重放完与朗读字符串“音乐”相关联的音频时,电子设备可提供与朗读字符串“艺术家”(与显示画面600相关联的队列中的下一个项目)相关联的音频(例如,而不是与朗读字符串“Cover Flow”相关联的音频,“Cover Flow”是队列中的与显示画面400相关联的下一个朗读字符串)。
如果语音调度器改为确定目前朗读的朗读字符串不属于由初始队列和更新队列共有的朗读字符串的范围,那么电子设备可停止重放与目前朗读的朗读字符串相关联的音频。例如,一旦语音调度器确定目前朗读的语音不在共有的朗读字符串的范围内,电子设备就可停止重放音频。电子设备随后可重新开始重放与更新队列的任意适当的朗读字符串相关联的音频,例如,包括从更新队列中从其开始改变朗读元素的顺序的朗读字符串开始的更新队列的各朗读字符串。例如,如果当用户使电子设备从显示画面400移动到显示画面600时,电子设备目前正在朗读朗读字符串“Cover Flow”,那么电子设备可停止重放与朗读字符串“CoverFlow”相关联的音频(例如,只重放“Cover Flow”的音频),并开始重放与朗读字符串“艺术家”(例如,不同于队列500的队列700的第一个朗读字符串)相关联的音频。在其中朗读所有菜单项的实现方式中,如果当用户使电子设备从显示画面400移动到显示画面600时,电子设备目前正在朗读朗读字符串“流派”,那么电子设备可停止重放与朗读字符串“流派”相关联的音频,并开始重放与朗读字符串“艺术家”相关联的音频。随后当在与显示画面600相关联的队列(例如,队列700)中到达朗读字符串“流派”时,可再次朗读朗读字符串“流派”。因此,如果用户以适当的速度,沿着显示在显示画面400中的选项移动突出显示区,那么电子设备可以仅仅重放显示画面400的每个选项的部分(例如,第一个音节)。
在一些实施例中,电子设备可以提供未由电子设备固件或操作系统静态提供的菜单项的话音反馈。例如,电子设备可提供根据用户(例如,从主设备)提供给电子设备的内容而产生的动态字符串的话音反馈。在一些实施例中,电子设备可提供由用户传送给电子设备的媒体的话音反馈(例如,根据与所传送的媒体相关联的元数据)。图8是按照本发明的一个实施例的响应于用户选择图6的突出显示菜单选项而提供的电子设备显示画面的示意图。类似于显示画面600(图6),显示画面800可包括显示内容的数个区域。例如,显示画面800可包括标题栏810、菜单820和附加信息830。标题栏810可包括指示电子设备使用中的模式或应用程序的标题812(例如,“艺术家”)。
菜单820可包括与“艺术家”模式相关联的任何适当列表,例如,包括电子设备可以得到的媒体(例如,电子设备保存的媒体)的艺术家姓名的列表822。电子设备可利用任何适当的方法收集艺术家姓名,例如包括从与媒体相关联的元数据收集艺术家姓名。所显示的附加信息830可包括任何适当的信息,例如包括与在菜单820中识别的一位或多位艺术家相关联的信息(例如,与可从用突出显示区824识别的艺术家获得的媒体有关的信息),或者用标题612识别的模式或应用程序。
响应于检测到事务(例如,对图6的显示画面600中的艺术家选项的用户选择),语音调度器可更新朗读项目的队列,以反映所显示的动态艺术家姓名。例如,语音调度器可确定与改变后的显示画面的每个视图相关联的修改、变更或更新后的朗读属性,从而产生队列。图9是按照本发明的一个实施例的与图8的显示画面相关联的用于重放的朗读项目的说明性队列的示意图。可以利用任何适当的方法描述队列900。在图9的例子中,队列900包括要相继朗读的朗读字符串的列表910。作为视图的一部分,每个朗读字符串可以与在对应列940中识别的朗读层次相关联。利用来自显示画面800(图8)的元素,朗读字符串可包括具有上下文层次942的艺术家字符串912和具有焦点层次943的共有字符串913(例如,用突出显示区识别的艺术家)。在其中提供非突出显示菜单选项的话音反馈的实现方式中,队列900可包括例如全部具有选择层次(例如,在共有字符串813的焦点层次843之下的层次)的Corrs字符串、Craig David字符串、Creed字符串、D12字符串、Da Brat字符串和Daniel Beddingfield字符串。可按照任何适当的方式对其它艺术家排序,例如包括作为从用突出显示区识别的艺术家开始的重复列表。
在一些实施例中,电子设备可根据媒体重放的状态,有选择地提供话音反馈。例如,当电子设备在重放媒体时,电子设备可不提供特定元素或者在特定模式下的话音反馈。图10是按照本发明的一个实施例的说明性“现在播放”显示画面的示意图。显示画面1000包括标题栏1010、菜单1020和附加信息1030。标题栏1010包括指示电子设备在使用的模式或应用程序的标题1012。例如,标题1012可包括iPod(例如,当没有选择任何应用程序时最上面的标题)、音乐、视频、照片、播客、其它(Extras)、设置和现在播放。标题栏1010还可包括任何其它适当的信息,例如包括电池指示符1014。
菜单1020可包括数个可选选项1022,例如包括用于选择模式或应用程序的选项,或者与所选择的特定模式或应用程序相关联的选项。通过把突出显示区1024导航到某一选项之上,用户可从菜单1020中选择选项。在突出显示区被放在特定选项之上的时候,用户可提供选择指令(例如,通过按下按钮或者提供任何其它适当的输入),以选择所述特定选项。例如,为了查看与目前重放的媒体(例如,目前播放或暂停的媒体)相关联的信息,用户可选择现在播放选项。响应于接收到对现在播放选项的用户选择,电子设备可显示与现在播放的媒体有关的附加信息1030。例如,附加信息1030可包括覆盖在专辑封套上的艺术家1032、标题1034和专辑1036。在一些实施例中,艺术家1032、标题1034和专辑1036中的每一个都可以与相同或不同的视图相关联(例如,通过把相同的层次用于所有附加信息元素来允许附加信息的话音反馈的不同视图)。
响应于接收到对显示画面1000(图10)的现在播放选项的选择,语音调度器可更新朗读项目的队列,从而朗读与现在播放的媒体有关的一个或多个字符串。例如,语音调度器可确定与改变后的显示画面的每个视图相关联的修改、变更或更新后的朗读属性,从而产生队列。图11是按照本发明的一个实施例的用于现在播放显示画面的朗读项目的说明性队列的示意图。可利用任何适当的方法描述队列1100。在图11的例子中,队列1100包括要相继朗读的朗读字符串的列表1110。作为视图的一部分,每个朗读字符串可以与在对应列1140中识别的朗读层次相关联。利用来自显示画面1000(图10)的元素,朗读字符串可包括具有上下文层次1142的iPod字符串1112、具有焦点层次1143的现在播放字符串1113(例如,用突出显示区识别的菜单选项)、具有空闲层次1144的Mika字符串1114、具有空闲层次1145的Grace Kelly字符串1115和具有空闲层次1146的Life in Cartoon Motion字符串1116。
为了确保不会在不适合的时间提供艺术家、标题和专辑的话音反馈,当重放媒体时(例如,未被暂停时),电子设备可以不提供与空闲层次相关联的朗读元素的话音反馈。例如,电子设备可首先确定是否正在重放媒体。响应于确定没有正在重放任何媒体,电子设备可提供队列1100中的所有元素——包括与空闲层次相关联的元素——的话音反馈。如果电子设备代之以确定目前正在重放媒体,那么电子设备可提供队列1100中各个视图的与除空闲层次之外的层次相关联的元素的话音反馈。响应于检测到正在重放媒体,语音调度器可从队列1100中除去与空闲层次相关联的元素,或者代之以跳过队列1100中与空闲层次相关联的元素。电子设备可向任何适当的显示信息分配空闲层次,例如包括向显示在附加信息窗口或区域中的信息(例如,保存在设备上的歌曲或照片的数目)分配空闲层次。
电子设备可利用任何适当的方法,确定在何时要朗读什么字符串。图12是按照本发明的一个实施例的用于朗读出朗读字符串的说明性状态图。状态图1200可包括数种状态和达到这数种状态中的每种状态的数条路径。电子设备可开始于空闲状态1202。例如,当不显示任何内容时,电子设备可保持空闲状态。作为另一个例子,当显示内容,但是所显示的内容与话音反馈无关(例如,显示专辑封面)时,电子设备可保持空闲状态。作为又一个例子,当显示朗读内容,但是朗读内容都已被朗读时,电子设备可保持空闲状态。
当处于空闲状态1202时,电子设备可监测显示画面的事务。电子设备做出的关于什么元素要朗读的任何决定都会导致事务。可利用数种不同的方法开始(和由电子设备检测)事务。例如,响应于接收到用户指令(例如,对导致显示画面改变的可选选项的用户选择),可检测到事务。作为另一个例子,响应于显示画面的转变(例如,例如由于超时或者由于用户移动突出显示区而导致的显示画面变化),可检测到事务。响应于检测到事务,电子设备可进入更新步骤1204。在更新步骤1204,电子设备可更新与提供话音反馈相关联的变量或字段。例如,例如根据可从用于产生事务后显示画面的各视图的一个或多个模型获得的字段,语音调度器可产生供电子设备朗读的各个项目的队列。在更新步骤1204之后,电子设备可进入PreSpeakTimeout状态1206。
在PreSpeakTimeout状态1206,电子设备可暂停第一超时时间。在该超时时间内,电子设备可进行任何适当的操作,例如包括产生要朗读的朗读字符串的队列,识别与朗读字符串相关联的音频文件,和进行准备供重放的音频文件的初始操作,隐藏(duck)或衰减在先音频输出(例如,由于音乐重放而导致的输出),或者进行任何其它适当的操作。例如,电子设备可减小在先的音频反馈(例如,隐藏),以致朗读的字符串更清楚。作为另一个例子,在话音反馈期间,电子设备可暂停媒体的重放(以致用户不会错过任何媒体)。作为又一个例子,电子设备可使用PreSpeakTimeout状态,以确保没有检测到更新的事务(例如,突出显示区的后续移动),以避免部分地朗读文本。电子设备可保持PreSpeakTimeout状态1206任何适当的持续时间,例如包括在0ms-500ms范围中的持续时间(例如,100ms)。一旦与PreSpeakTimeout状态1206相关联的第一超时时间已过去,电子设备就可进入重新开始步骤1208,从而进入朗读状态1210。
在朗读状态1210,电子设备朗读置于在更新步骤1204期间产生的队列中的朗读项目。例如,电子设备可识别与在所产生的队列中的朗读项目相关联的音频文件,并重放所识别的音频文件。当电子设备朗读完由语音调度器产生的话音反馈队列中的第一个项目时,电子设备可确定已提供了恰当的话音反馈,并进入完成步骤1212。在完成步骤1212,语音调度器可从队列中除去朗读过的朗读元素,或者把指针移动到队列中的下一个朗读元素。在一些实施例中,电子设备可代之以刚好在朗读朗读元素之前(例如,当在朗读状态1210下的时候),从队列中除去该朗读元素,以致当电子设备在完成步骤1212之后返回朗读状态1210时,电子设备所识别的第一个朗读元素是要朗读的下一个元素。电子设备可在朗读状态1210和完成步骤1212之间连续移动,直到在更新步骤(例如,更新步骤1204)中产生的队列中的所有朗读项目都已被朗读为止(即,所述队列为空,或者指针已到达队列的结尾),或者直到显示画面被改变并且执行新的更新步骤为止。
响应于当在朗读状态1210下的时候检测到事务(例如,如上所述),电子设备可进入更新步骤1214。在更新步骤1214,电子设备可更新与提供话音反馈相关联的变量或字段,以与由所述事务引起的显示画面相符。例如,语音调度器可在更新后的话音反馈队列中,根据事务之后的显示画面,更新朗读元素和为其提供话音反馈的朗读元素的顺序。在一些实施例中,电子设备还可从队列的第一个朗读元素开始,确定更新队列的与初始的话音反馈队列(例如,在步骤1214之前)匹配的部分,并识别正在为其提供话音反馈的当前朗读元素。如果电子设备确定当前的朗读元素在初始队列和更新队列共有的那部分朗读元素之内,那么电子设备可返回到朗读状态1210,并继续朗读更新队列的下一个朗读元素(例如,利用完成步骤1212和朗读状态1210)。如果电子设备代之以确定当前的朗读元素不在初始队列和更新队列共有的那部分朗读元素之内,那么电子设备可停止朗读当前的朗读元素(例如,停止重放与当前的朗读元素相关联的音频文件),并返回朗读状态1210。当返回到朗读状态1210时,电子设备可提供更新队列的朗读元素的话音反馈,例如,从队列的在所确定的那部分共有朗读元素之后的第一个朗读元素开始。
一旦电子设备已提供了由语音调度器产生的队列中的每个元素的话音反馈(例如,一旦队列为空),电子设备就可进入no_ready_queue步骤1216。在no_ready_queue步骤1216,电子设备可从语音调度器接收朗读项目的队列为空的指示(例如,no_ready_queue变量)。电子设备可从no_ready_queue步骤1216进入PostSpeakTimeout状态1218。在状态1218,电子设备暂停第二超时时间。在该超时时间内,电子设备可进行任何适当的操作,例如包括准备供重放的其它音频,初始化用户选择的操作(例如,响应于检测到对于所显示和朗读的菜单选项之一的选择指令),或者任何其它适当的操作。电子设备可代之以或者另外地从隐藏或衰减模式恢复音频输出(例如,从在PreSpeakTimeout状态1206期间启动的隐藏或衰减模式恢复到重放音频或其它媒体的正常模式)。可替换地,电子设备可重新开始被暂停的媒体的重放。电子设备可保持PostSpeakTimeout状态1218任何适当的持续时间,例如包括在0ms-500ms范围中的持续时间(例如100ms)。一旦与PostSpeakTimeout状态1218相关联的第一超时时间已过去,电子设备就进入重新开始步骤1220,从而返回空闲状态1202。
在一些实施例中,当在PostSpeakTimeout状态1218下时,电子设备可检测到事务(例如,上面所述的事务),并且进入更新步骤1222。更新步骤1222可包括更新步骤1214的一些或全部特征。在更新步骤1222,电子设备可更新与提供话音反馈相关联的变量或字段,以与由所述事务引起的显示画面相符。例如,语音调度器可在更新的话音反馈队列中,根据事务之后的显示画面,更新朗读元素和为其提供话音反馈的朗读元素的顺序。另外,在一些实施例中,电子设备可从队列的第一个朗读元素开始,确定更新队列的与初始话音反馈队列(例如,在步骤1222之前)匹配的部分,并识别正在为其提供话音反馈的当前朗读元素(例如,如上结合更新步骤1214所述)。电子设备随后返回朗读状态1210,并提供更新队列的朗读元素的话音反馈,例如从队列的在所确定的那部分共有朗读元素之后的第一个朗读元素开始。
在一些实施例中,电子设备可能在朗读过程中检测到错误。例如,在play_error步骤1224,电子设备可接收到与朗读状态1210相关联的错误的指示。电子设备可在步骤1224接收到错误的任何适当的指示,例如包括play_error变量。电子设备随后可进入ErrorSpeaking状态1226。在ErrorSpeaking状态1226,电子设备可执行任何适当的操作。例如,电子设备可进行调试操作,或者用于识别错误的来源的其它操作。作为另一个例子,电子设备可收集与错误相关联的信息,以提供给软件的开发人员,供调试或修改之用。如果电子设备完成与ErrorSpeaking状态1226相关联的一个或多个操作,电子设备可进入完成步骤1228,从而返回到朗读状态1210,以继续提供由语音调度器产生的队列中的朗读元素的话音反馈。
可替换地,如果电子设备未能完成与ErrorSpeaking状态1226相关联的所有操作,电子设备可进入重新开始步骤1230,从而返回到朗读状态1210。电子设备可能因任何适当的原因——例如包括未能接收到有效的“完成”消息,接收到取消ErrorSpeaking操作或者返回到朗读状态1210的用户指令,错误超时(例如,100ms),或者任何其它适当的原因,或者根据任何其它适当的条件——不能执行与朗读状态1210相关联的操作。
电子设备可利用任何适当的方法,获得与每个朗读元素相关联的音频文件。在一些实施例中,音频文件可由电子设备本地保存,例如作为设备的固件或软件的一部分。不过,这种方法的固有局限性在于固件通常被全球提供给在语言和口音不同的不同地点销售或使用的所有电子设备。为了确保用恰当的语言或者恰当的口音提供话音反馈,需要使每个设备使用的固件个人化。这会由于需要保存和提供数个版本的固件而导致成本相当高,并且会由于固件或软件提供者可需要管理不同固件或软件对于不同设备的分发而明显更复杂。另外,音频文件的大小(例如,与文本文件相对比)可较大,并且不容许以固件或软件更新的形式提供。
在一些实施例中,电子设备可利用在设备上运行的文本-语音(TTS)引擎,本地产生音频文件。利用这种方法,电子设备可用与设备相关联的语言,把与不同菜单选项相关联的文本字符串提供给设备的TTS引擎,从而产生话音反馈用音频文件。由于可用TTS引擎能够操作的文本字符串中的变化来反映朗读元素存在于其中的显示画面的改变,这种方法便于更容易的固件或软件更新。不过,可从电子设备获得的TTS引擎会限制这种方法。特别地,如果电子设备的资源有限,例如存储器、处理能力或电源电力有限(例如,与便携式电子设备相关联的限制),那么TTS引擎产生的语音的质量会降低。例如,不能获得与方言或口音相关联的语调,或者不支持与特定语言(例如,过度不同于默认语言的语言)相关联的语音。
在一些实施例中,电子设备可代之以或者另外地从电子设备连接到的主设备接收与朗读元素相关联的音频文件。图13是按照本发明的一个实施例的包括电子设备和主设备的说明性通信系统的示意图。通信系统1300包括电子设备1302和通信网络1310,电子设备1302可利用通信网络1310与在通信网络1310内的其它设备进行有线或无线通信。例如,电子设备1302可通过通信网络1310,与主设备1320进行通信操作。尽管通信系统1300可包括数个电子设备1302和主设备1320,不过为了避免使图13过于复杂,图13中只分别表示了一个电子设备和一个主设备。
可以使用可用于建立通信网络的任何适当的电路、设备、系统或者它们的组合(例如,包括通信塔和电信服务器的无线通信基础结构)建立通信网络1310。通信网络1310能够利用任何适当的短程或长程通信协议,提供无线通信。在一些实施例中,例如,通信网络1310可以支持Wi-Fi(例如,802.11协议)、蓝牙(注册商标)、射频系统(例如,1300MHz,2.4GHz和5.6GHz通信系统)、红外、由无线和蜂窝电话机及个人电子邮件设备使用的协议,或者支持电子设备1302和主设备1320之间的无线通信的任何其它协议。例如通过利用电子设备1302和/或主设备1320上的任何适当端口(例如,30针,USB,火线,串行或以太网),通信网络1310可代之以或者另外地能够提供电子设备1302和主设备1320之间的有线通信。
电子设备1302可包括接收媒体或数据的任何适当设备。例如,电子设备1302可包括电子设备100(图1)的一个或多个特征。电子设备1302可利用任何适当的方法,通过通信链路1340与主设备1320耦接。例如,电子设备1302可利用任何适当的无线通信协议通过通信链路1340连接到主设备1320。作为另一个例子,通信链路1340可以是既与电子设备1302耦接,又与媒体提供者1320耦接的有线链路(例如,以太网电缆)。作为又一个例子,通信链路1340可包括有线链路和无线链路的组合(例如,用于与主设备1320无线通信的附属设备可以耦接到电子设备1302)。在一些实施例中,任何适当的连接器、适配器(dongle)或扩展坞可作为通信链路1340的一部分用来耦接电子设备1302和主设备1320。
主设备1320可包括向电子设备1302提供音频文件的任何适当类型的设备。例如,主设备1320可包括计算机(例如,桌上型或膝上型计算机)、服务器(例如,可通过因特网或利用专用通信链路使用的服务器)、信息亭、或者任何其它适当的设备。主设备1320可利用任何适当的方法,提供用于电子设备的朗读元素的音频文件。例如,主设备1320可包括TTS引擎,该TTS引擎可以访问比电子设备1302上本地可用的资源更多的资源。利用更全面的主设备TTS引擎,主设备1320可产生与电子设备的朗读元素的文本字符串相关联的音频文件。主设备TTS引擎使电子设备可以用不同的语言或者用个性化的口音或话音模式(例如,利用名人的话音或者特定地区的口音),来提供话音反馈。TTS引擎可包括通用语音字典,和不同声音的发音规则,以产生所提供文本的音频,并把所产生的音频转换成供电子设备重放的适当格式(例如,AIFF文件)。在一些实施例中,TTS引擎可包括用于进行特定于音乐的处理(例如,用“featuring”代替字符串“feat.”或者“ft.”)的预处理器。在一些实施例中,主设备1320可限制传给电子设备的媒体的数量,以考虑到保存与提供话音反馈相关联的音频文件所需的存储空间(例如,根据保存在电子设备上的媒体文件的预期数目,计算预期为话音反馈音频文件所需要的空间)。
主设备可利用任何适当的方法,识别为其提供音频文件的文本字符串。在一些实施例中,主设备可识别与从主设备传给电子设备的数据相关联的文本字符串,并把所识别的文本字符串提供给TTS引擎,以产生对应的音频文件。例如,这种方法可用于与从主设备传给电子设备的媒体文件(例如,音乐或视频)的元数据(例如,标题、艺术家、专辑、流派或任何其它元数据)相关联的文本字符串。在一些实施例中,电子设备可识别为其向主设备提供音频反馈的特定元数据(例如,电子设备识别标题、艺术家和专辑元数据)。主设备可以利用任何适当的方法来命名音频文件,和把音频文件保存在电子设备中。例如,音频文件名称和存储位置(例如,目录号)可以是对朗读的文本字符串应用散列的结果。
不过,对于不是从主设备传给电子设备的朗读元素(例如,电子设备固件的菜单选项的文本),主设备不知道TTS引擎将为其提供音频文件的文本字符串。在一些实施例中,电子设备可提供文本文件(例如,XML文件),所述文本文件包括与其话音反馈将被提供给主设备的每个静态朗读元素相关联的字符串。电子设备可在任何适当时候,产生具有朗读元素字符串的文本文件。在一些实施例中,每次电子设备启动时,可根据在编译期间从固件或软件源代码提取的数据,产生所述文件。例如,当电子设备编译与显示画面的模型和视图相关联的源代码时,电子设备可识别具有朗读属性的元素(例如,朗读元素),并提取与朗读元素相关联的、要朗读的文本字符串和优先权。在一些实施例中,电子设备可响应于检测到话音反馈语言的变化,产生文本文件,发出反馈话音,或者建立变化。
可用在电子设备启动时产生的数据文件(例如,XML文件),把所提取的文本提供给主设备。这种方法使得能够用固件或软件更新,更容易地改变朗读元素,这是因为编译后的固件或软件代码可包括主设备产生话音反馈用音频文件所需的所提取的朗读元素信息。响应于接收到文本文件,主设备可利用TTS引擎,产生每个朗读元素的音频文件。在一些实施例中,文本文件可包括语言变化的指示,以指令主设备为改变后的文本,或者利用改变后的话音或语言,产生新的音频文件。在共同转让的美国专利公布No.2006/0095848、题为“AUDIO USER INTERFACEFOR COMPUTING DEVICES”(代理卷号No.P3504US1)中更详细地说明了根据接收到的文本文件来产生音频文件的系统和方法,该专利在此整体引为参考。
下面的流程图说明向电子设备提供用于话音反馈的音频文件的说明性处理。图14是用于向电子设备提供静态字符串的说明性处理的流程图。处理1400开始于步骤1402。在步骤1404,电子设备产生列举静态字符串的数据文件。例如,电子设备可从固件提取由电子设备显示的、可为其提供话音反馈的文本的字符串。在步骤1406,电子设备把该文件提供给主设备。例如,电子设备可利用有线或无线通信路径,把文件提供给主设备。
在步骤1408,主设备可把所提供的数据文件的静态字符串转换成音频文件。例如,主设备可利用TTS引擎产生每个静态字符串的音频(例如,产生音频,压缩音频,和把音频转换成可由电子设备重放的文件格式)。在步骤1410,主设备可把所产生的音频传给电子设备。例如,主设备可通过通信路径,把所产生的音频文件传给电子设备。随后在步骤1412,结束处理1400。主设备可把音频文件保存在电子设备上的任意适当位置,例如包括保存在由要朗读的文本串的散列而产生的位置或目录号处。
图15是用于向电子设备提供动态字符串的说明性处理的流程图。处理1500开始于步骤1502。在步骤1504,主设备可识别要传给电子设备的媒体。例如,主设备可取回要传送的媒体(例如,播放列表内的媒体)的列表,以便传送给电子设备。在步骤1506,主设备可识别与所识别的媒体相关联的元数据字符串。例如,主设备可取回要传给电子设备的每个所识别媒体项的、由主设备识别的特定元数据字符串(例如,艺术家、标题和专辑字符串)。
在步骤1508,主设备可把所识别的元数据字符串(例如,动态字符串)转换成音频文件。例如,主设备可利用TTS引擎产生每个动态字符串的音频(例如,产生音频,压缩音频,和把音频转换成可由电子设备重放的文件格式)。在步骤1510,主设备可把所产生的音频传给电子设备。例如,主设备可通过通信路径,把所产生的音频文件传给电子设备。处理1500随后在步骤1512结束。主设备可把音频文件保存在电子设备上的任意适当位置,例如包括保存在由要朗读的文本字符串的散列所产生的位置或目录号处。
本发明的上述实施例是出于举例说明的目的给出的,而不是对本发明的限制,并且本发明仅由下面的权利要求限定。
Claims (15)
1.一种向电子设备的用户提供话音反馈的方法,包括:
显示多个元素;
识别所述多个元素中的为其提供话音反馈的至少两个元素,其中话音反馈层次与所述多个元素中的所述至少两个元素之中的每个元素相关联;
确定与所述多个元素中的所识别的至少两个元素中的每个元素的显示相关联的话音反馈层次;
响应于所述识别和确定,产生包括所述多个元素中的所述所识别的至少两个元素的初始队列;
基于所确定的层次,对所述初始队列中的所识别的元素进行排序;和
按照所确定的层次的顺序,为所述多个元素中的所述所识别的至少两个元素提供话音反馈。
2.按照权利要求1所述的方法,还包括:
取回与所述多个元素中的所述所识别的至少两个元素中的每个元素相关联的音频文件;和
重放取回的音频文件。
3.按照权利要求1所述的方法,还包括:
改变所显示的多个元素中的至少一个元素;和
响应于所述改变,更新所述初始队列的至少一部分。
4.按照权利要求3所述的方法,还包括:
响应于所述改变,重新识别所述多个元素中的为其提供话音反馈的至少两个元素;
重新确定与所述多个元素中的重新识别出的至少两个元素中的每一个元素的显示相关联的层次;和
产生修订队列,所述修订队列包括所述多个元素中的所述重新识别出的至少两个元素。
5.按照权利要求4所述的方法,还包括:
检测在所述改变期间为其提供话音反馈的所识别的元素;
比较所述初始队列和所述修订队列,以识别所述初始队列和所述修订队列的共有部分;
确定检测到的元素不在所述修订队列的与所述初始队列共有的部分中;以及
停止为检测到的元素提供话音反馈。
6.一种朗读电子设备显示的元素的文本的系统,所述系统包括:
用于定义朗读属性与其相关联的多个元素的装置;
用于把所述多个元素显示在多个视图中的装置,其中每个视图与朗读顺序相关联;
用于产生包括所述多个元素的队列的装置,其中所述队列中的多个元素的顺序是根据所述朗读顺序设定的;
用于暂停第一超时时间的装置;
用于识别与所述队列中的所述多个元素中的每个元素相关联的音频文件的装置,其中所述音频文件包括每个元素的要朗读文本;
用于按照所述队列的顺序,依次重放所识别的音频文件的装置;和
用于暂停第二超时时间的装置。
7.按照权利要求6所述的系统,其中所述用于识别与所述队列中的所述多个元素中的每个元素相关联的音频文件的装置还包括:
用于根据所述要朗读文本的散列,取回与所述多个元素中的每个元素相关联的音频文件的装置。
8.按照权利要求6所述的系统,其中主设备利用文本到语音引擎,产生所述音频文件。
9.按照权利要求8所述的系统,还包括:
用于把所述多个元素中的每个元素的要朗读文本提供给所述主设备的装置;和
用于接收通过利用所述文本到语音引擎而产生的音频文件的装置,所述文本到语音引擎被应用于所提供的所述多个元素中的每个元素的所述要朗读文本。
10.按照权利要求6所述的系统,还包括:
用于改变所显示的多个元素中的至少一个元素的装置;和
用于产生修订队列的装置,所述修订队列包括根据与所显示的视图相关联的朗读顺序而排序的改变后的所显示的多个元素。
11.一种向电子设备的用户提供话音反馈的系统,包括:
用于显示多个元素的装置;
用于识别所述多个元素中的为其提供话音反馈的至少两个元素的装置,其中话音反馈层次与所述多个元素中的所述至少两个元素中的每个元素相关联;
用于确定与所述多个元素中的所识别的至少两个元素中的每个元素的显示相关联的话音反馈层次的装置;
用于响应于所述识别和确定,产生包括所述多个元素中的所述所识别的至少两个元素的初始队列的装置;
用于基于所确定的层次,对所述初始队列中的所识别的元素进行排序的装置;和
用于按照所确定的层次的顺序,为所述多个元素中的所述所识别的至少两个元素提供话音反馈的装置。
12.按照权利要求11所述的系统,还包括:
用于取回与所述多个元素中的所述所识别的至少两个元素中的每个元素相关联的音频文件的装置;和
用于重放取回的音频文件的装置。
13.按照权利要求11所述的系统,还包括:
用于改变所显示的多个元素中的至少一个元素的装置;和
用于响应于所述改变,更新所述初始队列的至少一部分的装置。
14.按照权利要求13所述的系统,还包括:
用于响应于所述改变,重新识别所述多个元素中的为其提供话音反馈的至少两个元素的装置;
用于重新确定与所述多个元素中的重新识别出的至少两个元素中的每一个元素的显示相关联的层次的装置;和
用于产生修订队列的装置,所述修订队列包括所述多个元素中的所述重新识别出的至少两个元素。
15.按照权利要求14所述的系统,还包括:
用于检测在所述改变期间为其提供话音反馈的所识别的元素的装置;
用于比较所述初始队列和所述修订队列,以识别所述初始队列和所述修订队列的共有部分的装置;
用于确定检测到的元素不在所述修订队列的与所述初始队列共有的部分中的装置;以及
用于停止为检测到的元素提供话音反馈的装置。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310562133.3A CN103558964B (zh) | 2008-09-05 | 2009-09-01 | 电子设备中的多层次话音反馈 |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/205,780 | 2008-09-05 | ||
US12/205,780 US8768702B2 (en) | 2008-09-05 | 2008-09-05 | Multi-tiered voice feedback in an electronic device |
PCT/US2009/055577 WO2010027953A1 (en) | 2008-09-05 | 2009-09-01 | Multi-tiered voice feedback in an electronic device |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310562133.3A Division CN103558964B (zh) | 2008-09-05 | 2009-09-01 | 电子设备中的多层次话音反馈 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102144209A CN102144209A (zh) | 2011-08-03 |
CN102144209B true CN102144209B (zh) | 2014-05-21 |
Family
ID=41506526
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200980134444.1A Active CN102144209B (zh) | 2008-09-05 | 2009-09-01 | 电子设备中的多层次话音反馈 |
CN201310562133.3A Active CN103558964B (zh) | 2008-09-05 | 2009-09-01 | 电子设备中的多层次话音反馈 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310562133.3A Active CN103558964B (zh) | 2008-09-05 | 2009-09-01 | 电子设备中的多层次话音反馈 |
Country Status (4)
Country | Link |
---|---|
US (2) | US8768702B2 (zh) |
EP (2) | EP3026541B1 (zh) |
CN (2) | CN102144209B (zh) |
WO (1) | WO2010027953A1 (zh) |
Families Citing this family (181)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US8463053B1 (en) | 2008-08-08 | 2013-06-11 | The Research Foundation Of State University Of New York | Enhanced max margin learning on multimodal data mining in a multimedia database |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US10255566B2 (en) | 2011-06-03 | 2019-04-09 | Apple Inc. | Generating and processing task items that represent tasks to perform |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US9324337B2 (en) * | 2009-11-17 | 2016-04-26 | Dolby Laboratories Licensing Corporation | Method and system for dialog enhancement |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US9634855B2 (en) | 2010-05-13 | 2017-04-25 | Alexander Poltorak | Electronic personal interactive device that determines topics of interest using a conversational agent |
US8595012B2 (en) * | 2010-06-29 | 2013-11-26 | Lenovo (Singapore) Pte. Ltd. | Systems and methods for input device audio feedback |
KR101715381B1 (ko) * | 2010-11-25 | 2017-03-10 | 삼성전자 주식회사 | 전자장치 및 그 제어방법 |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US9563410B2 (en) * | 2011-05-25 | 2017-02-07 | Amx Llc | Data-driven menuing system for providing a flexible user interface on an electronic device |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
DE102011079034A1 (de) | 2011-07-12 | 2013-01-17 | Siemens Aktiengesellschaft | Ansteuerung eines technischen Systems |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
DE102012101629A1 (de) * | 2012-02-28 | 2013-08-29 | Deutsche Telekom Ag | Verfahren und Vorrichtung zur einfachen Steuerung von Kommunikationsdiensten im Fahrzeug durch Einsatz von berührungssensitiven Bildschirmen und Touch-Gesten |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9735973B2 (en) * | 2012-03-15 | 2017-08-15 | Vidoyen Inc. | Expert answer platform methods, apparatuses and media |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9251795B2 (en) | 2012-09-04 | 2016-02-02 | Apple Inc. | Adaptive audio codec selection during a communication session |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
EP2954514B1 (en) | 2013-02-07 | 2021-03-31 | Apple Inc. | Voice trigger for a digital assistant |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US9507561B2 (en) * | 2013-03-15 | 2016-11-29 | Verizon Patent And Licensing Inc. | Method and apparatus for facilitating use of touchscreen devices |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
WO2014200728A1 (en) | 2013-06-09 | 2014-12-18 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US10430024B2 (en) * | 2013-11-13 | 2019-10-01 | Microsoft Technology Licensing, Llc | Media item selection using user-specific grammar |
US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
JP2017509969A (ja) * | 2014-02-21 | 2017-04-06 | タレリス・グローバル・エルエルピーTaleris Global LLP | 航空機の空調システムの性能を判定するための方法 |
US9431002B2 (en) | 2014-03-04 | 2016-08-30 | Tribune Digital Ventures, Llc | Real time popularity based audible content aquisition |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9904409B2 (en) * | 2015-04-15 | 2018-02-27 | Samsung Electronics Co., Ltd. | Touch input processing method that adjusts touch sensitivity based on the state of a touch object and electronic device for supporting the same |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10200824B2 (en) | 2015-05-27 | 2019-02-05 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10740384B2 (en) | 2015-09-08 | 2020-08-11 | Apple Inc. | Intelligent automated assistant for media search and playback |
US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10261964B2 (en) | 2016-01-04 | 2019-04-16 | Gracenote, Inc. | Generating and distributing playlists with music and stories having related moods |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
US10419508B1 (en) | 2016-12-21 | 2019-09-17 | Gracenote Digital Ventures, Llc | Saving media for in-automobile playout |
US10019225B1 (en) | 2016-12-21 | 2018-07-10 | Gracenote Digital Ventures, Llc | Audio streaming based on in-automobile detection |
US10565980B1 (en) | 2016-12-21 | 2020-02-18 | Gracenote Digital Ventures, Llc | Audio streaming of text-based articles from newsfeeds |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
US20180234261A1 (en) * | 2017-02-14 | 2018-08-16 | Samsung Electronics Co., Ltd. | Personalized service method and device |
DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | USER INTERFACE FOR CORRECTING RECOGNITION ERRORS |
US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
DK180048B1 (en) | 2017-05-11 | 2020-02-04 | Apple Inc. | MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770428A1 (en) | 2017-05-12 | 2019-02-18 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US10303715B2 (en) | 2017-05-16 | 2019-05-28 | Apple Inc. | Intelligent automated assistant for media exploration |
US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
TWI651714B (zh) * | 2017-12-22 | 2019-02-21 | 隆宸星股份有限公司 | 語音選項選擇系統與方法以及使用其之智慧型機器人 |
US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
CN110312989A (zh) * | 2018-04-28 | 2019-10-08 | 深圳市大疆创新科技有限公司 | 拍摄设备的状态的反馈方法、拍摄设备及便携式电子设备 |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS |
US10496705B1 (en) | 2018-06-03 | 2019-12-03 | Apple Inc. | Accelerated task performance |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
CN113794800B (zh) * | 2018-11-23 | 2022-08-26 | 华为技术有限公司 | 一种语音控制方法及电子设备 |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
DK201970511A1 (en) | 2019-05-31 | 2021-02-15 | Apple Inc | Voice identification in digital assistant systems |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
US11468890B2 (en) | 2019-06-01 | 2022-10-11 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
CN110610720B (zh) * | 2019-09-19 | 2022-02-25 | 北京搜狗科技发展有限公司 | 一种数据处理方法、装置和用于数据处理的装置 |
US11488406B2 (en) | 2019-09-25 | 2022-11-01 | Apple Inc. | Text detection using global geometry estimators |
US11061543B1 (en) | 2020-05-11 | 2021-07-13 | Apple Inc. | Providing relevant data items based on context |
US11043220B1 (en) | 2020-05-11 | 2021-06-22 | Apple Inc. | Digital assistant hardware abstraction |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
US11490204B2 (en) | 2020-07-20 | 2022-11-01 | Apple Inc. | Multi-device audio adjustment coordination |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1937674A (zh) * | 2005-08-27 | 2007-03-28 | 国际商业机器公司 | 用于语音门户服务器的方法和装置 |
Family Cites Families (615)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US2859005A (en) | 1952-11-21 | 1958-11-04 | Bendix Aviat Corp | Monitoring system for aircraft auto pilots |
US3828132A (en) | 1970-10-30 | 1974-08-06 | Bell Telephone Labor Inc | Speech synthesis by concatenation of formant encoded words |
US3704345A (en) | 1971-03-19 | 1972-11-28 | Bell Telephone Labor Inc | Conversion of printed text into synthetic speech |
US3979557A (en) | 1974-07-03 | 1976-09-07 | International Telephone And Telegraph Corporation | Speech processor system for pitch period extraction using prediction filters |
BG24190A1 (en) | 1976-09-08 | 1978-01-10 | Antonov | Method of synthesis of speech and device for effecting same |
JPS597120B2 (ja) | 1978-11-24 | 1984-02-16 | 日本電気株式会社 | 音声分析装置 |
US4310721A (en) * | 1980-01-23 | 1982-01-12 | The United States Of America As Represented By The Secretary Of The Army | Half duplex integral vocoder modem system |
US4348553A (en) | 1980-07-02 | 1982-09-07 | International Business Machines Corporation | Parallel pattern verifier with dynamic time warping |
DE3382796T2 (de) | 1982-06-11 | 1996-03-28 | Mitsubishi Electric Corp | Vorrichtung zur Zwischenbildkodierung. |
US4688195A (en) | 1983-01-28 | 1987-08-18 | Texas Instruments Incorporated | Natural-language interface generating system |
JPS603056A (ja) | 1983-06-21 | 1985-01-09 | Toshiba Corp | 情報整理装置 |
DE3335358A1 (de) | 1983-09-29 | 1985-04-11 | Siemens AG, 1000 Berlin und 8000 München | Verfahren zur bestimmung von sprachspektren fuer die automatische spracherkennung und sprachcodierung |
US5164900A (en) | 1983-11-14 | 1992-11-17 | Colman Bernath | Method and device for phonetically encoding Chinese textual data for data processing entry |
US4726065A (en) * | 1984-01-26 | 1988-02-16 | Horst Froessl | Image manipulation by speech signals |
US4955047A (en) | 1984-03-26 | 1990-09-04 | Dytel Corporation | Automated attendant with direct inward system access |
US4811243A (en) | 1984-04-06 | 1989-03-07 | Racine Marsh V | Computer aided coordinate digitizing system |
US4692941A (en) | 1984-04-10 | 1987-09-08 | First Byte | Real-time text-to-speech conversion system |
US4783807A (en) | 1984-08-27 | 1988-11-08 | John Marley | System and method for sound recognition with feature selection synchronized to voice pitch |
US4718094A (en) * | 1984-11-19 | 1988-01-05 | International Business Machines Corp. | Speech recognition system |
US5165007A (en) | 1985-02-01 | 1992-11-17 | International Business Machines Corporation | Feneme-based Markov models for words |
US4944013A (en) | 1985-04-03 | 1990-07-24 | British Telecommunications Public Limited Company | Multi-pulse speech coder |
US4833712A (en) | 1985-05-29 | 1989-05-23 | International Business Machines Corporation | Automatic generation of simple Markov model stunted baseforms for words in a vocabulary |
US4819271A (en) | 1985-05-29 | 1989-04-04 | International Business Machines Corporation | Constructing Markov model word baseforms from multiple utterances by concatenating model sequences for word segments |
EP0218859A3 (en) | 1985-10-11 | 1989-09-06 | International Business Machines Corporation | Signal processor communication interface |
US4776016A (en) | 1985-11-21 | 1988-10-04 | Position Orientation Systems, Inc. | Voice control system |
JPH0833744B2 (ja) | 1986-01-09 | 1996-03-29 | 株式会社東芝 | 音声合成装置 |
US4724542A (en) * | 1986-01-22 | 1988-02-09 | International Business Machines Corporation | Automatic reference adaptation during dynamic signature verification |
US5759101A (en) | 1986-03-10 | 1998-06-02 | Response Reward Systems L.C. | Central and remote evaluation of responses of participatory broadcast audience with automatic crediting and couponing |
US5128752A (en) | 1986-03-10 | 1992-07-07 | Kohorn H Von | System and method for generating and redeeming tokens |
US5032989A (en) | 1986-03-19 | 1991-07-16 | Realpro, Ltd. | Real estate search and location system and method |
EP0241170B1 (en) | 1986-03-28 | 1992-05-27 | AT&T Corp. | Adaptive speech feature signal generation arrangement |
US4903305A (en) * | 1986-05-12 | 1990-02-20 | Dragon Systems, Inc. | Method for representing word models for use in speech recognition |
WO1988002516A1 (en) | 1986-10-03 | 1988-04-07 | British Telecommunications Public Limited Company | Language translation system |
WO1988002975A1 (en) | 1986-10-16 | 1988-04-21 | Mitsubishi Denki Kabushiki Kaisha | Amplitude-adapted vector quantizer |
US4829576A (en) | 1986-10-21 | 1989-05-09 | Dragon Systems, Inc. | Voice recognition system |
US4852168A (en) | 1986-11-18 | 1989-07-25 | Sprague Richard P | Compression of stored waveforms for artificial speech |
US4727354A (en) * | 1987-01-07 | 1988-02-23 | Unisys Corporation | System for selecting best fit vector code in vector quantization encoding |
US4827520A (en) | 1987-01-16 | 1989-05-02 | Prince Corporation | Voice actuated control system for use in a vehicle |
US4965763A (en) | 1987-03-03 | 1990-10-23 | International Business Machines Corporation | Computer method for automatic extraction of commonly specified information from business correspondence |
US5644727A (en) * | 1987-04-15 | 1997-07-01 | Proprietary Financial Products, Inc. | System for the operation and management of one or more financial accounts through the use of a digital communication and computation system for exchange, investment and borrowing |
CA1295064C (en) | 1987-05-29 | 1992-01-28 | Kuniyoshi Marui | Voice recognition system used in telephone apparatus |
DE3723078A1 (de) | 1987-07-11 | 1989-01-19 | Philips Patentverwaltung | Verfahren zur erkennung von zusammenhaengend gesprochenen woertern |
US4974191A (en) | 1987-07-31 | 1990-11-27 | Syntellect Software Inc. | Adaptive natural language computer interface system |
CA1288516C (en) | 1987-07-31 | 1991-09-03 | Leendert M. Bijnagte | Apparatus and method for communicating textual and image information between a host computer and a remote display terminal |
US5022081A (en) | 1987-10-01 | 1991-06-04 | Sharp Kabushiki Kaisha | Information recognition system |
US4852173A (en) | 1987-10-29 | 1989-07-25 | International Business Machines Corporation | Design and construction of a binary-tree system for language modelling |
US5072452A (en) | 1987-10-30 | 1991-12-10 | International Business Machines Corporation | Automatic determination of labels and Markov word models in a speech recognition system |
DE3876379T2 (de) | 1987-10-30 | 1993-06-09 | Ibm | Automatische bestimmung von kennzeichen und markov-wortmodellen in einem spracherkennungssystem. |
US4914586A (en) | 1987-11-06 | 1990-04-03 | Xerox Corporation | Garbage collector for hypermedia systems |
US4992972A (en) * | 1987-11-18 | 1991-02-12 | International Business Machines Corporation | Flexible context searchable on-line information system with help files and modules for on-line computer system documentation |
US5220657A (en) | 1987-12-02 | 1993-06-15 | Xerox Corporation | Updating local copy of shared data in a collaborative system |
US4984177A (en) | 1988-02-05 | 1991-01-08 | Advanced Products And Technologies, Inc. | Voice language translator |
US5194950A (en) | 1988-02-29 | 1993-03-16 | Mitsubishi Denki Kabushiki Kaisha | Vector quantizer |
US4914590A (en) | 1988-05-18 | 1990-04-03 | Emhart Industries, Inc. | Natural language understanding system |
FR2636163B1 (fr) | 1988-09-02 | 1991-07-05 | Hamon Christian | Procede et dispositif de synthese de la parole par addition-recouvrement de formes d'onde |
US4839853A (en) | 1988-09-15 | 1989-06-13 | Bell Communications Research, Inc. | Computer information retrieval using latent semantic structure |
JPH0293597A (ja) | 1988-09-30 | 1990-04-04 | Nippon I B M Kk | 音声認識装置 |
US4905163A (en) * | 1988-10-03 | 1990-02-27 | Minnesota Mining & Manufacturing Company | Intelligent optical navigator dynamic information presentation and navigation system |
US5282265A (en) * | 1988-10-04 | 1994-01-25 | Canon Kabushiki Kaisha | Knowledge information processing system |
DE3837590A1 (de) | 1988-11-05 | 1990-05-10 | Ant Nachrichtentech | Verfahren zum reduzieren der datenrate von digitalen bilddaten |
DE68913669T2 (de) | 1988-11-23 | 1994-07-21 | Digital Equipment Corp | Namenaussprache durch einen Synthetisator. |
US5027406A (en) | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
US5127055A (en) | 1988-12-30 | 1992-06-30 | Kurzweil Applied Intelligence, Inc. | Speech recognition apparatus & method having dynamic reference pattern adaptation |
US5293448A (en) | 1989-10-02 | 1994-03-08 | Nippon Telegraph And Telephone Corporation | Speech analysis-synthesis method and apparatus therefor |
US5047614A (en) | 1989-01-23 | 1991-09-10 | Bianco James S | Method and apparatus for computer-aided shopping |
SE466029B (sv) * | 1989-03-06 | 1991-12-02 | Ibm Svenska Ab | Anordning och foerfarande foer analys av naturligt spraak i ett datorbaserat informationsbehandlingssystem |
JPH0782544B2 (ja) | 1989-03-24 | 1995-09-06 | インターナショナル・ビジネス・マシーンズ・コーポレーション | マルチテンプレートを用いるdpマツチング方法及び装置 |
US4977598A (en) | 1989-04-13 | 1990-12-11 | Texas Instruments Incorporated | Efficient pruning algorithm for hidden markov model speech recognition |
US5197005A (en) | 1989-05-01 | 1993-03-23 | Intelligent Business Systems | Database retrieval system having a natural language interface |
US5010574A (en) | 1989-06-13 | 1991-04-23 | At&T Bell Laboratories | Vector quantizer search arrangement |
JP2940005B2 (ja) | 1989-07-20 | 1999-08-25 | 日本電気株式会社 | 音声符号化装置 |
US5091945A (en) * | 1989-09-28 | 1992-02-25 | At&T Bell Laboratories | Source dependent channel coding with error protection |
CA2027705C (en) | 1989-10-17 | 1994-02-15 | Masami Akamine | Speech coding system utilizing a recursive computation technique for improvement in processing speed |
US5020112A (en) | 1989-10-31 | 1991-05-28 | At&T Bell Laboratories | Image recognition method using two-dimensional stochastic grammars |
US5220639A (en) | 1989-12-01 | 1993-06-15 | National Science Council | Mandarin speech input method for Chinese computers and a mandarin speech recognition machine |
US5021971A (en) | 1989-12-07 | 1991-06-04 | Unisys Corporation | Reflective binary encoder for vector quantization |
US5179652A (en) * | 1989-12-13 | 1993-01-12 | Anthony I. Rozmanith | Method and apparatus for storing, transmitting and retrieving graphical and tabular data |
CH681573A5 (en) | 1990-02-13 | 1993-04-15 | Astral | Automatic teller arrangement involving bank computers - is operated by user data card carrying personal data, account information and transaction records |
DE69133296T2 (de) | 1990-02-22 | 2004-01-29 | Nec Corp | Sprachcodierer |
US5301109A (en) | 1990-06-11 | 1994-04-05 | Bell Communications Research, Inc. | Computerized cross-language document retrieval using latent semantic indexing |
JP3266246B2 (ja) | 1990-06-15 | 2002-03-18 | インターナシヨナル・ビジネス・マシーンズ・コーポレーシヨン | 自然言語解析装置及び方法並びに自然言語解析用知識ベース構築方法 |
US5202952A (en) | 1990-06-22 | 1993-04-13 | Dragon Systems, Inc. | Large-vocabulary continuous speech prefiltering and processing system |
GB9017600D0 (en) | 1990-08-10 | 1990-09-26 | British Aerospace | An assembly and method for binary tree-searched vector quanisation data compression processing |
US5404295A (en) | 1990-08-16 | 1995-04-04 | Katz; Boris | Method and apparatus for utilizing annotations to facilitate computer retrieval of database material |
US5309359A (en) | 1990-08-16 | 1994-05-03 | Boris Katz | Method and apparatus for generating and utlizing annotations to facilitate computer text retrieval |
US5297170A (en) | 1990-08-21 | 1994-03-22 | Codex Corporation | Lattice and trellis-coded quantization |
US5400434A (en) | 1990-09-04 | 1995-03-21 | Matsushita Electric Industrial Co., Ltd. | Voice source for synthetic speech system |
US5216747A (en) | 1990-09-20 | 1993-06-01 | Digital Voice Systems, Inc. | Voiced/unvoiced estimation of an acoustic signal |
US5128672A (en) | 1990-10-30 | 1992-07-07 | Apple Computer, Inc. | Dynamic predictive keyboard |
US5325298A (en) | 1990-11-07 | 1994-06-28 | Hnc, Inc. | Methods for generating or revising context vectors for a plurality of word stems |
US5317507A (en) | 1990-11-07 | 1994-05-31 | Gallant Stephen I | Method for document retrieval and for word sense disambiguation using neural networks |
US5247579A (en) * | 1990-12-05 | 1993-09-21 | Digital Voice Systems, Inc. | Methods for speech transmission |
US5345536A (en) | 1990-12-21 | 1994-09-06 | Matsushita Electric Industrial Co., Ltd. | Method of speech recognition |
US5127053A (en) | 1990-12-24 | 1992-06-30 | General Electric Company | Low-complexity method for improving the performance of autocorrelation-based pitch detectors |
US5133011A (en) | 1990-12-26 | 1992-07-21 | International Business Machines Corporation | Method and apparatus for linear vocal control of cursor position |
US5268990A (en) | 1991-01-31 | 1993-12-07 | Sri International | Method for recognizing speech using linguistically-motivated hidden Markov models |
GB9105367D0 (en) | 1991-03-13 | 1991-04-24 | Univ Strathclyde | Computerised information-retrieval database systems |
US5303406A (en) | 1991-04-29 | 1994-04-12 | Motorola, Inc. | Noise squelch circuit with adaptive noise shaping |
US5475587A (en) | 1991-06-28 | 1995-12-12 | Digital Equipment Corporation | Method and apparatus for efficient morphological text analysis using a high-level language for compact specification of inflectional paradigms |
US5293452A (en) | 1991-07-01 | 1994-03-08 | Texas Instruments Incorporated | Voice log-in using spoken name input |
US5687077A (en) | 1991-07-31 | 1997-11-11 | Universal Dynamics Limited | Method and apparatus for adaptive control |
US5199077A (en) | 1991-09-19 | 1993-03-30 | Xerox Corporation | Wordspotting for voice editing and indexing |
JP2662120B2 (ja) | 1991-10-01 | 1997-10-08 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 音声認識装置および音声認識用処理ユニット |
US5222146A (en) | 1991-10-23 | 1993-06-22 | International Business Machines Corporation | Speech recognition apparatus having a speech coder outputting acoustic prototype ranks |
KR940002854B1 (ko) | 1991-11-06 | 1994-04-04 | 한국전기통신공사 | 음성 합성시스팀의 음성단편 코딩 및 그의 피치조절 방법과 그의 유성음 합성장치 |
US5386494A (en) * | 1991-12-06 | 1995-01-31 | Apple Computer, Inc. | Method and apparatus for controlling a speech recognition function using a cursor control device |
US5903454A (en) | 1991-12-23 | 1999-05-11 | Hoffberg; Linda Irene | Human-factored interface corporating adaptive pattern recognition based controller apparatus |
US6081750A (en) | 1991-12-23 | 2000-06-27 | Hoffberg; Steven Mark | Ergonomic man-machine interface incorporating adaptive pattern recognition based control system |
US5502790A (en) | 1991-12-24 | 1996-03-26 | Oki Electric Industry Co., Ltd. | Speech recognition method and system using triphones, diphones, and phonemes |
US5349645A (en) | 1991-12-31 | 1994-09-20 | Matsushita Electric Industrial Co., Ltd. | Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches |
US5267345A (en) | 1992-02-10 | 1993-11-30 | International Business Machines Corporation | Speech recognition apparatus which predicts word classes from context and words from word classes |
DE69322894T2 (de) | 1992-03-02 | 1999-07-29 | At & T Corp | Lernverfahren und Gerät zur Spracherkennung |
US6055514A (en) | 1992-03-20 | 2000-04-25 | Wren; Stephen Corey | System for marketing foods and services utilizing computerized centraland remote facilities |
US5317647A (en) | 1992-04-07 | 1994-05-31 | Apple Computer, Inc. | Constrained attribute grammars for syntactic pattern recognition |
US5412804A (en) | 1992-04-30 | 1995-05-02 | Oracle Corporation | Extending the semantics of the outer join operator for un-nesting queries to a data base |
AU672972C (en) * | 1992-05-20 | 2004-06-17 | Industrial Research Limited | Wideband assisted reverberation system |
US5293584A (en) | 1992-05-21 | 1994-03-08 | International Business Machines Corporation | Speech recognition system for natural language translation |
US5390281A (en) | 1992-05-27 | 1995-02-14 | Apple Computer, Inc. | Method and apparatus for deducing user intent and providing computer implemented services |
US5434777A (en) | 1992-05-27 | 1995-07-18 | Apple Computer, Inc. | Method and apparatus for processing natural language |
US5734789A (en) * | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder |
US5333275A (en) | 1992-06-23 | 1994-07-26 | Wheatley Barbara J | System and method for time aligning speech |
US5325297A (en) | 1992-06-25 | 1994-06-28 | System Of Multiple-Colored Images For Internationally Listed Estates, Inc. | Computer implemented method and system for storing and retrieving textual data and compressed image data |
JPH0619965A (ja) | 1992-07-01 | 1994-01-28 | Canon Inc | 自然言語処理装置 |
US5999908A (en) | 1992-08-06 | 1999-12-07 | Abelow; Daniel H. | Customer-based product design module |
US5412806A (en) | 1992-08-20 | 1995-05-02 | Hewlett-Packard Company | Calibration of logical cost formulae for queries in a heterogeneous DBMS using synthetic database |
GB9220404D0 (en) | 1992-08-20 | 1992-11-11 | Nat Security Agency | Method of identifying,retrieving and sorting documents |
US5333236A (en) | 1992-09-10 | 1994-07-26 | International Business Machines Corporation | Speech recognizer having a speech coder for an acoustic match based on context-dependent speech-transition acoustic models |
US5384893A (en) * | 1992-09-23 | 1995-01-24 | Emerson & Stern Associates, Inc. | Method and apparatus for speech synthesis based on prosodic analysis |
FR2696036B1 (fr) | 1992-09-24 | 1994-10-14 | France Telecom | Procédé de mesure de ressemblance entre échantillons sonores et dispositif de mise en Óoeuvre de ce procédé. |
JPH0772840B2 (ja) | 1992-09-29 | 1995-08-02 | 日本アイ・ビー・エム株式会社 | 音声モデルの構成方法、音声認識方法、音声認識装置及び音声モデルの訓練方法 |
US5758313A (en) * | 1992-10-16 | 1998-05-26 | Mobile Information Systems, Inc. | Method and apparatus for tracking vehicle location |
US5455888A (en) | 1992-12-04 | 1995-10-03 | Northern Telecom Limited | Speech bandwidth extension method and apparatus |
US5412756A (en) | 1992-12-22 | 1995-05-02 | Mitsubishi Denki Kabushiki Kaisha | Artificial intelligence software shell for plant operation simulation |
US5384892A (en) * | 1992-12-31 | 1995-01-24 | Apple Computer, Inc. | Dynamic language model for speech recognition |
US5390279A (en) * | 1992-12-31 | 1995-02-14 | Apple Computer, Inc. | Partitioning speech rules by context for speech recognition |
US5613036A (en) | 1992-12-31 | 1997-03-18 | Apple Computer, Inc. | Dynamic categories for a speech recognition system |
US5734791A (en) | 1992-12-31 | 1998-03-31 | Apple Computer, Inc. | Rapid tree-based method for vector quantization |
US6122616A (en) | 1993-01-21 | 2000-09-19 | Apple Computer, Inc. | Method and apparatus for diphone aliasing |
US5864844A (en) * | 1993-02-18 | 1999-01-26 | Apple Computer, Inc. | System and method for enhancing a user interface with a computer based training tool |
CA2091658A1 (en) | 1993-03-15 | 1994-09-16 | Matthew Lennig | Method and apparatus for automation of directory assistance using speech recognition |
US6055531A (en) | 1993-03-24 | 2000-04-25 | Engate Incorporated | Down-line transcription system having context sensitive searching capability |
US5536902A (en) | 1993-04-14 | 1996-07-16 | Yamaha Corporation | Method of and apparatus for analyzing and synthesizing a sound by extracting and controlling a sound parameter |
US5444823A (en) | 1993-04-16 | 1995-08-22 | Compaq Computer Corporation | Intelligent search engine for associated on-line documentation having questionless case-based knowledge base |
US5574823A (en) | 1993-06-23 | 1996-11-12 | Her Majesty The Queen In Right Of Canada As Represented By The Minister Of Communications | Frequency selective harmonic coding |
US5515475A (en) | 1993-06-24 | 1996-05-07 | Northern Telecom Limited | Speech recognition method using a two-pass search |
JPH0756933A (ja) | 1993-06-24 | 1995-03-03 | Xerox Corp | 文書検索方法 |
JP3685812B2 (ja) | 1993-06-29 | 2005-08-24 | ソニー株式会社 | 音声信号送受信装置 |
US5794207A (en) | 1996-09-04 | 1998-08-11 | Walker Asset Management Limited Partnership | Method and apparatus for a cryptographically assisted commercial network system designed to facilitate buyer-driven conditional purchase offers |
WO1995002221A1 (en) | 1993-07-07 | 1995-01-19 | Inference Corporation | Case-based organizing and querying of a database |
US5495604A (en) * | 1993-08-25 | 1996-02-27 | Asymetrix Corporation | Method and apparatus for the modeling and query of database structures using natural language-like constructs |
US5619694A (en) | 1993-08-26 | 1997-04-08 | Nec Corporation | Case database storage/retrieval system |
US5940811A (en) | 1993-08-27 | 1999-08-17 | Affinity Technology Group, Inc. | Closed loop financial transaction method and apparatus |
US5377258A (en) * | 1993-08-30 | 1994-12-27 | National Medical Research Council | Method and apparatus for an automated and interactive behavioral guidance system |
US5873056A (en) * | 1993-10-12 | 1999-02-16 | The Syracuse University | Natural language processing system for semantic vector representation which accounts for lexical ambiguity |
US5578808A (en) | 1993-12-22 | 1996-11-26 | Datamark Services, Inc. | Data card that can be used for transactions involving separate card issuers |
CA2179523A1 (en) | 1993-12-23 | 1995-06-29 | David A. Boulton | Method and apparatus for implementing user feedback |
US5621859A (en) | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
US5584024A (en) | 1994-03-24 | 1996-12-10 | Software Ag | Interactive database query system and method for prohibiting the selection of semantically incorrect query parameters |
US5642519A (en) | 1994-04-29 | 1997-06-24 | Sun Microsystems, Inc. | Speech interpreter with a unified grammer compiler |
KR100250509B1 (ko) | 1994-05-25 | 2000-04-01 | 슈즈이 다께오 | 가변 전송속도 데이터 전송장치 |
US5493677A (en) * | 1994-06-08 | 1996-02-20 | Systems Research & Applications Corporation | Generation, archiving, and retrieval of digital images with evoked suggestion-set captions and natural language interface |
US5675819A (en) | 1994-06-16 | 1997-10-07 | Xerox Corporation | Document information retrieval using global word co-occurrence patterns |
JPH0869470A (ja) | 1994-06-21 | 1996-03-12 | Canon Inc | 自然言語処理装置及びその方法 |
US5948040A (en) | 1994-06-24 | 1999-09-07 | Delorme Publishing Co. | Travel reservation information and planning system |
US5682539A (en) | 1994-09-29 | 1997-10-28 | Conrad; Donovan | Anticipated meaning natural language interface |
US5715468A (en) * | 1994-09-30 | 1998-02-03 | Budzinski; Robert Lucius | Memory system for storing and retrieving experience and knowledge with natural language |
GB2293667B (en) | 1994-09-30 | 1998-05-27 | Intermation Limited | Database management system |
US5845255A (en) | 1994-10-28 | 1998-12-01 | Advanced Health Med-E-Systems Corporation | Prescription management system |
US5577241A (en) | 1994-12-07 | 1996-11-19 | Excite, Inc. | Information retrieval system and method with implementation extensible query architecture |
US5748974A (en) | 1994-12-13 | 1998-05-05 | International Business Machines Corporation | Multimodal natural language interface for cross-application tasks |
US5794050A (en) | 1995-01-04 | 1998-08-11 | Intelligent Text Processing, Inc. | Natural language understanding system |
CA2683230C (en) | 1995-02-13 | 2013-08-27 | Intertrust Technologies Corporation | Systems and methods for secure transaction management and electronic rights protection |
US5701400A (en) | 1995-03-08 | 1997-12-23 | Amado; Carlos Armando | Method and apparatus for applying if-then-else rules to data sets in a relational data base and generating from the results of application of said rules a database of diagnostics linked to said data sets to aid executive analysis of financial data |
US5749081A (en) | 1995-04-06 | 1998-05-05 | Firefly Network, Inc. | System and method for recommending items to a user |
US5642464A (en) | 1995-05-03 | 1997-06-24 | Northern Telecom Limited | Methods and apparatus for noise conditioning in digital speech compression systems using linear predictive coding |
US5664055A (en) | 1995-06-07 | 1997-09-02 | Lucent Technologies Inc. | CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity |
US6496182B1 (en) * | 1995-06-07 | 2002-12-17 | Microsoft Corporation | Method and system for providing touch-sensitive screens for the visually impaired |
US5710886A (en) * | 1995-06-16 | 1998-01-20 | Sellectsoft, L.C. | Electric couponing method and apparatus |
JP3284832B2 (ja) | 1995-06-22 | 2002-05-20 | セイコーエプソン株式会社 | 音声認識対話処理方法および音声認識対話装置 |
US6038533A (en) | 1995-07-07 | 2000-03-14 | Lucent Technologies Inc. | System and method for selecting training text |
US6026388A (en) * | 1995-08-16 | 2000-02-15 | Textwise, Llc | User interface and other enhancements for natural language information retrieval system and method |
JP3697748B2 (ja) | 1995-08-21 | 2005-09-21 | セイコーエプソン株式会社 | 端末、音声認識装置 |
US5712957A (en) * | 1995-09-08 | 1998-01-27 | Carnegie Mellon University | Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists |
US6173261B1 (en) * | 1998-09-30 | 2001-01-09 | At&T Corp | Grammar fragment acquisition using syntactic and semantic clustering |
US5790978A (en) | 1995-09-15 | 1998-08-04 | Lucent Technologies, Inc. | System and method for determining pitch contours |
US5737734A (en) | 1995-09-15 | 1998-04-07 | Infonautics Corporation | Query word relevance adjustment in a search of an information retrieval system |
US5884323A (en) | 1995-10-13 | 1999-03-16 | 3Com Corporation | Extendible method and apparatus for synchronizing files on two different computer systems |
US5799276A (en) | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
US5794237A (en) | 1995-11-13 | 1998-08-11 | International Business Machines Corporation | System and method for improving problem source identification in computer systems employing relevance feedback and statistical source ranking |
US5706442A (en) * | 1995-12-20 | 1998-01-06 | Block Financial Corporation | System for on-line financial services using distributed objects |
AU1836297A (en) | 1996-01-17 | 1997-08-11 | Personal Agents, Inc. | Intelligent agents for electronic commerce |
US6119101A (en) | 1996-01-17 | 2000-09-12 | Personal Agents, Inc. | Intelligent agents for electronic commerce |
US6125356A (en) | 1996-01-18 | 2000-09-26 | Rosefaire Development, Ltd. | Portable sales presentation system with selective scripted seller prompts |
US5987404A (en) | 1996-01-29 | 1999-11-16 | International Business Machines Corporation | Statistical natural language understanding using hidden clumpings |
US5729694A (en) | 1996-02-06 | 1998-03-17 | The Regents Of The University Of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
US6076088A (en) | 1996-02-09 | 2000-06-13 | Paik; Woojin | Information extraction system and method using concept relation concept (CRC) triples |
US5835893A (en) | 1996-02-15 | 1998-11-10 | Atr Interpreting Telecommunications Research Labs | Class-based word clustering for speech recognition using a three-level balanced hierarchical similarity |
US5901287A (en) | 1996-04-01 | 1999-05-04 | The Sabre Group Inc. | Information aggregation and synthesization system |
US5867799A (en) * | 1996-04-04 | 1999-02-02 | Lang; Andrew K. | Information system and method for filtering a massive flow of information entities to meet user information classification needs |
US5987140A (en) | 1996-04-26 | 1999-11-16 | Verifone, Inc. | System, method and article of manufacture for secure network electronic payment and credit collection |
US5963924A (en) | 1996-04-26 | 1999-10-05 | Verifone, Inc. | System, method and article of manufacture for the use of payment instrument holders and payment instruments in network electronic commerce |
US5913193A (en) | 1996-04-30 | 1999-06-15 | Microsoft Corporation | Method and system of runtime acoustic unit selection for speech synthesis |
US5857184A (en) * | 1996-05-03 | 1999-01-05 | Walden Media, Inc. | Language and method for creating, organizing, and retrieving data from a database |
US5828999A (en) | 1996-05-06 | 1998-10-27 | Apple Computer, Inc. | Method and system for deriving a large-span semantic language model for large-vocabulary recognition systems |
FR2748342B1 (fr) * | 1996-05-06 | 1998-07-17 | France Telecom | Procede et dispositif de filtrage par egalisation d'un signal de parole, mettant en oeuvre un modele statistique de ce signal |
US5826261A (en) | 1996-05-10 | 1998-10-20 | Spencer; Graham | System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query |
US6366883B1 (en) | 1996-05-15 | 2002-04-02 | Atr Interpreting Telecommunications | Concatenation of speech segments by use of a speech synthesizer |
US5727950A (en) | 1996-05-22 | 1998-03-17 | Netsage Corporation | Agent based instruction system and method |
US5966533A (en) * | 1996-06-11 | 1999-10-12 | Excite, Inc. | Method and system for dynamically synthesizing a computer program by differentially resolving atoms based on user context data |
US5915249A (en) | 1996-06-14 | 1999-06-22 | Excite, Inc. | System and method for accelerated query evaluation of very large full-text databases |
US5987132A (en) | 1996-06-17 | 1999-11-16 | Verifone, Inc. | System, method and article of manufacture for conditionally accepting a payment method utilizing an extensible, flexible architecture |
US5825881A (en) | 1996-06-28 | 1998-10-20 | Allsoft Distributing Inc. | Public network merchandising system |
US6070147A (en) | 1996-07-02 | 2000-05-30 | Tecmark Services, Inc. | Customer identification and marketing analysis systems |
WO1998003927A2 (en) | 1996-07-22 | 1998-01-29 | Cyva Research Corp | Personal information security and exchange tool |
EP0829811A1 (en) | 1996-09-11 | 1998-03-18 | Nippon Telegraph And Telephone Corporation | Method and system for information retrieval |
US6181935B1 (en) | 1996-09-27 | 2001-01-30 | Software.Com, Inc. | Mobility extended telephone application programming interface and method of use |
US5794182A (en) | 1996-09-30 | 1998-08-11 | Apple Computer, Inc. | Linear predictive speech encoding systems with efficient combination pitch coefficients computation |
US5721827A (en) * | 1996-10-02 | 1998-02-24 | James Logan | System for electrically distributing personalized information |
US5913203A (en) | 1996-10-03 | 1999-06-15 | Jaesent Inc. | System and method for pseudo cash transactions |
US5930769A (en) | 1996-10-07 | 1999-07-27 | Rose; Andrea | System and method for fashion shopping |
US5873064A (en) * | 1996-11-08 | 1999-02-16 | International Business Machines Corporation | Multi-action voice macro method |
US5836771A (en) | 1996-12-02 | 1998-11-17 | Ho; Chi Fai | Learning method and system based on questioning |
US6665639B2 (en) * | 1996-12-06 | 2003-12-16 | Sensory, Inc. | Speech recognition in consumer electronic products |
US6078914A (en) | 1996-12-09 | 2000-06-20 | Open Text Corporation | Natural language meta-search system and method |
US5839106A (en) | 1996-12-17 | 1998-11-17 | Apple Computer, Inc. | Large-vocabulary speech recognition using an integrated syntactic and semantic statistical language model |
US5966126A (en) | 1996-12-23 | 1999-10-12 | Szabo; Andrew J. | Graphic user interface for database system |
US5932869A (en) * | 1996-12-27 | 1999-08-03 | Graphic Technology, Inc. | Promotional system with magnetic stripe and visual thermo-reversible print surfaced medium |
JP3579204B2 (ja) | 1997-01-17 | 2004-10-20 | 富士通株式会社 | 文書要約装置およびその方法 |
US5941944A (en) | 1997-03-03 | 1999-08-24 | Microsoft Corporation | Method for providing a substitute for a requested inaccessible object by identifying substantially similar objects using weights corresponding to object features |
US6076051A (en) | 1997-03-07 | 2000-06-13 | Microsoft Corporation | Information retrieval utilizing semantic representation of text |
US5930801A (en) | 1997-03-07 | 1999-07-27 | Xerox Corporation | Shared-data environment in which each file has independent security properties |
WO1998041956A1 (en) | 1997-03-20 | 1998-09-24 | Schlumberger Technologies, Inc. | System and method of transactional taxation using secure stored data devices |
US5822743A (en) | 1997-04-08 | 1998-10-13 | 1215627 Ontario Inc. | Knowledge-based information retrieval system |
US5970474A (en) | 1997-04-24 | 1999-10-19 | Sears, Roebuck And Co. | Registry information system for shoppers |
US5895464A (en) | 1997-04-30 | 1999-04-20 | Eastman Kodak Company | Computer program product and a method for using natural language for the description, search and retrieval of multi-media objects |
WO1999001834A1 (en) | 1997-07-02 | 1999-01-14 | Coueignoux, Philippe, J., M. | System and method for the secure discovery, exploitation and publication of information |
US5860063A (en) * | 1997-07-11 | 1999-01-12 | At&T Corp | Automated meaningful phrase clustering |
US5933822A (en) | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US5974146A (en) | 1997-07-30 | 1999-10-26 | Huntington Bancshares Incorporated | Real time bank-centric universal payment system |
US6016476A (en) | 1997-08-11 | 2000-01-18 | International Business Machines Corporation | Portable information and transaction processing system and method utilizing biometric authorization and digital certificate security |
US5895466A (en) | 1997-08-19 | 1999-04-20 | At&T Corp | Automated natural language understanding customer service system |
US6081774A (en) | 1997-08-22 | 2000-06-27 | Novell, Inc. | Natural language information retrieval system and method |
US6404876B1 (en) | 1997-09-25 | 2002-06-11 | Gte Intelligent Network Services Incorporated | System and method for voice activated dialing and routing under open access network control |
US6023684A (en) * | 1997-10-01 | 2000-02-08 | Security First Technologies, Inc. | Three tier financial transaction system with cache memory |
DE69712485T2 (de) | 1997-10-23 | 2002-12-12 | Sony Int Europe Gmbh | Sprachschnittstelle für ein Hausnetzwerk |
US6108627A (en) | 1997-10-31 | 2000-08-22 | Nortel Networks Corporation | Automatic transcription tool |
US5943670A (en) | 1997-11-21 | 1999-08-24 | International Business Machines Corporation | System and method for categorizing objects in combined categories |
US5960422A (en) | 1997-11-26 | 1999-09-28 | International Business Machines Corporation | System and method for optimized source selection in an information retrieval system |
US6026375A (en) * | 1997-12-05 | 2000-02-15 | Nortel Networks Corporation | Method and apparatus for processing orders from customers in a mobile environment |
US6064960A (en) | 1997-12-18 | 2000-05-16 | Apple Computer, Inc. | Method and apparatus for improved duration modeling of phonemes |
US6094649A (en) | 1997-12-22 | 2000-07-25 | Partnet, Inc. | Keyword searches of structured databases |
US6173287B1 (en) | 1998-03-11 | 2001-01-09 | Digital Equipment Corporation | Technique for ranking multimedia annotations of interest |
US6195641B1 (en) * | 1998-03-27 | 2001-02-27 | International Business Machines Corp. | Network universal spoken language vocabulary |
US6026393A (en) * | 1998-03-31 | 2000-02-15 | Casebank Technologies Inc. | Configuration knowledge as an aid to case retrieval |
US6233559B1 (en) | 1998-04-01 | 2001-05-15 | Motorola, Inc. | Speech control of multiple applications using applets |
US6173279B1 (en) * | 1998-04-09 | 2001-01-09 | At&T Corp. | Method of using a natural language interface to retrieve information from one or more data resources |
US6088731A (en) | 1998-04-24 | 2000-07-11 | Associative Computing, Inc. | Intelligent assistant for use with a local computer and with the internet |
WO1999056227A1 (en) | 1998-04-27 | 1999-11-04 | British Telecommunications Public Limited Company | Database access tool |
US6029132A (en) * | 1998-04-30 | 2000-02-22 | Matsushita Electric Industrial Co. | Method for letter-to-sound in text-to-speech synthesis |
US6016471A (en) * | 1998-04-29 | 2000-01-18 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
US6285786B1 (en) | 1998-04-30 | 2001-09-04 | Motorola, Inc. | Text recognizer and method using non-cumulative character scoring in a forward search |
US6144938A (en) | 1998-05-01 | 2000-11-07 | Sun Microsystems, Inc. | Voice user interface with personality |
US6778970B2 (en) | 1998-05-28 | 2004-08-17 | Lawrence Au | Topological methods to organize semantic network data flows for conversational applications |
US20070094223A1 (en) | 1998-05-28 | 2007-04-26 | Lawrence Au | Method and system for using contextual meaning in voice to text conversion |
US7711672B2 (en) | 1998-05-28 | 2010-05-04 | Lawrence Au | Semantic network methods to disambiguate natural language meaning |
US6144958A (en) | 1998-07-15 | 2000-11-07 | Amazon.Com, Inc. | System and method for correcting spelling errors in search queries |
US6105865A (en) | 1998-07-17 | 2000-08-22 | Hardesty; Laurence Daniel | Financial transaction system with retirement saving benefit |
US6499013B1 (en) | 1998-09-09 | 2002-12-24 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
US6434524B1 (en) | 1998-09-09 | 2002-08-13 | One Voice Technologies, Inc. | Object interactive user interface using speech recognition and natural language processing |
US6792082B1 (en) | 1998-09-11 | 2004-09-14 | Comverse Ltd. | Voice mail system with personal assistant provisioning |
US6266637B1 (en) | 1998-09-11 | 2001-07-24 | International Business Machines Corporation | Phrase splicing and variable substitution using a trainable speech synthesizer |
DE29825146U1 (de) | 1998-09-11 | 2005-08-18 | Püllen, Rainer | Teilnehmereinheit und System für einen Multimediadienst |
US6317831B1 (en) | 1998-09-21 | 2001-11-13 | Openwave Systems Inc. | Method and apparatus for establishing a secure connection over a one-way data path |
EP1133734A4 (en) | 1998-10-02 | 2005-12-14 | Ibm | INTERACTIVE NAVIGATOR AND INTERACTIVE SYSTEMS |
US6275824B1 (en) | 1998-10-02 | 2001-08-14 | Ncr Corporation | System and method for managing data privacy in a database management system |
GB9821969D0 (en) | 1998-10-08 | 1998-12-02 | Canon Kk | Apparatus and method for processing natural language |
US6928614B1 (en) | 1998-10-13 | 2005-08-09 | Visteon Global Technologies, Inc. | Mobile office with speech recognition |
US6453292B2 (en) | 1998-10-28 | 2002-09-17 | International Business Machines Corporation | Command boundary identifier for conversational natural language |
US6208971B1 (en) | 1998-10-30 | 2001-03-27 | Apple Computer, Inc. | Method and apparatus for command recognition using data-driven semantic inference |
US6321092B1 (en) | 1998-11-03 | 2001-11-20 | Signal Soft Corporation | Multiple input data management for wireless location-based applications |
US6446076B1 (en) | 1998-11-12 | 2002-09-03 | Accenture Llp. | Voice interactive web-based agent system responsive to a user location for prioritizing and formatting information |
WO2000030069A2 (en) | 1998-11-13 | 2000-05-25 | Lernout & Hauspie Speech Products N.V. | Speech synthesis using concatenation of speech waveforms |
US6606599B2 (en) | 1998-12-23 | 2003-08-12 | Interactive Speech Technologies, Llc | Method for integrating computing processes with an interface controlled by voice actuated grammars |
US6246981B1 (en) | 1998-11-25 | 2001-06-12 | International Business Machines Corporation | Natural language task-oriented dialog manager and method |
US7082397B2 (en) | 1998-12-01 | 2006-07-25 | Nuance Communications, Inc. | System for and method of creating and browsing a voice web |
US6260024B1 (en) | 1998-12-02 | 2001-07-10 | Gary Shkedy | Method and apparatus for facilitating buyer-driven purchase orders on a commercial network system |
US7881936B2 (en) | 1998-12-04 | 2011-02-01 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US7319957B2 (en) | 2004-02-11 | 2008-01-15 | Tegic Communications, Inc. | Handwriting and voice input with automatic correction |
US6317707B1 (en) | 1998-12-07 | 2001-11-13 | At&T Corp. | Automatic clustering of tokens from a corpus for grammar acquisition |
US6308149B1 (en) | 1998-12-16 | 2001-10-23 | Xerox Corporation | Grouping words with equivalent substrings by automatic clustering based on suffix relationships |
US6523172B1 (en) * | 1998-12-17 | 2003-02-18 | Evolutionary Technologies International, Inc. | Parser translator system and method |
US6460029B1 (en) | 1998-12-23 | 2002-10-01 | Microsoft Corporation | System for improving search text |
US6742021B1 (en) | 1999-01-05 | 2004-05-25 | Sri International, Inc. | Navigating network-based electronic information using spoken input with multimodal error feedback |
US7036128B1 (en) | 1999-01-05 | 2006-04-25 | Sri International Offices | Using a community of distributed electronic agents to support a highly mobile, ambient computing environment |
US6523061B1 (en) * | 1999-01-05 | 2003-02-18 | Sri International, Inc. | System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system |
US6851115B1 (en) * | 1999-01-05 | 2005-02-01 | Sri International | Software-based architecture for communication and cooperation among distributed electronic agents |
US6757718B1 (en) | 1999-01-05 | 2004-06-29 | Sri International | Mobile navigation of network-based electronic information using spoken input |
US6513063B1 (en) * | 1999-01-05 | 2003-01-28 | Sri International | Accessing network-based electronic information through scripted online interfaces using spoken input |
US7152070B1 (en) | 1999-01-08 | 2006-12-19 | The Regents Of The University Of California | System and method for integrating and accessing multiple data sources within a data warehouse architecture |
US6505183B1 (en) * | 1999-02-04 | 2003-01-07 | Authoria, Inc. | Human resource knowledge modeling and delivery system |
US6317718B1 (en) | 1999-02-26 | 2001-11-13 | Accenture Properties (2) B.V. | System, method and article of manufacture for location-based filtering for shopping agent in the physical world |
GB9904662D0 (en) | 1999-03-01 | 1999-04-21 | Canon Kk | Natural language search method and apparatus |
US6356905B1 (en) | 1999-03-05 | 2002-03-12 | Accenture Llp | System, method and article of manufacture for mobile communication utilizing an interface support framework |
US6928404B1 (en) | 1999-03-17 | 2005-08-09 | International Business Machines Corporation | System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies |
US6584464B1 (en) | 1999-03-19 | 2003-06-24 | Ask Jeeves, Inc. | Grammar template query system |
WO2000058942A2 (en) | 1999-03-26 | 2000-10-05 | Koninklijke Philips Electronics N.V. | Client-server speech recognition |
US6356854B1 (en) | 1999-04-05 | 2002-03-12 | Delphi Technologies, Inc. | Holographic object position and type sensing system and method |
US6631346B1 (en) | 1999-04-07 | 2003-10-07 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for natural language parsing using multiple passes and tags |
WO2000060435A2 (en) | 1999-04-07 | 2000-10-12 | Rensselaer Polytechnic Institute | System and method for accessing personal information |
US6647260B2 (en) | 1999-04-09 | 2003-11-11 | Openwave Systems Inc. | Method and system facilitating web based provisioning of two-way mobile communications devices |
US6924828B1 (en) | 1999-04-27 | 2005-08-02 | Surfnotes | Method and apparatus for improved information representation |
US6697780B1 (en) * | 1999-04-30 | 2004-02-24 | At&T Corp. | Method and apparatus for rapid acoustic unit selection from a large speech corpus |
WO2000073936A1 (en) | 1999-05-28 | 2000-12-07 | Sehda, Inc. | Phrase-based dialogue modeling with particular application to creating recognition grammars for voice-controlled user interfaces |
US20020032564A1 (en) | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
US6931384B1 (en) | 1999-06-04 | 2005-08-16 | Microsoft Corporation | System and method providing utility-based decision making about clarification dialog given communicative uncertainty |
US6598039B1 (en) | 1999-06-08 | 2003-07-22 | Albert-Inc. S.A. | Natural language interface for searching database |
US8065155B1 (en) | 1999-06-10 | 2011-11-22 | Gazdzinski Robert F | Adaptive advertising apparatus and methods |
US6615175B1 (en) | 1999-06-10 | 2003-09-02 | Robert F. Gazdzinski | “Smart” elevator system and method |
US7711565B1 (en) | 1999-06-10 | 2010-05-04 | Gazdzinski Robert F | “Smart” elevator system and method |
US7093693B1 (en) | 1999-06-10 | 2006-08-22 | Gazdzinski Robert F | Elevator access control system and method |
US6711585B1 (en) | 1999-06-15 | 2004-03-23 | Kanisa Inc. | System and method for implementing a knowledge management system |
US6401065B1 (en) | 1999-06-17 | 2002-06-04 | International Business Machines Corporation | Intelligent keyboard interface with use of human language processing |
JP3361291B2 (ja) | 1999-07-23 | 2003-01-07 | コナミ株式会社 | 音声合成方法、音声合成装置及び音声合成プログラムを記録したコンピュータ読み取り可能な媒体 |
US6421672B1 (en) | 1999-07-27 | 2002-07-16 | Verizon Services Corp. | Apparatus for and method of disambiguation of directory listing searches utilizing multiple selectable secondary search keys |
EP1079387A3 (en) | 1999-08-26 | 2003-07-09 | Matsushita Electric Industrial Co., Ltd. | Mechanism for storing information about recorded television broadcasts |
US6912499B1 (en) | 1999-08-31 | 2005-06-28 | Nortel Networks Limited | Method and apparatus for training a multilingual speech model set |
US6697824B1 (en) * | 1999-08-31 | 2004-02-24 | Accenture Llp | Relationship management in an E-commerce application framework |
US6601234B1 (en) | 1999-08-31 | 2003-07-29 | Accenture Llp | Attribute dictionary in a business logic services environment |
US7127403B1 (en) | 1999-09-13 | 2006-10-24 | Microstrategy, Inc. | System and method for personalizing an interactive voice broadcast of a voice service based on particulars of a request |
US6601026B2 (en) | 1999-09-17 | 2003-07-29 | Discern Communications, Inc. | Information retrieval by natural language querying |
US6625583B1 (en) | 1999-10-06 | 2003-09-23 | Goldman, Sachs & Co. | Handheld trading system interface |
US6505175B1 (en) * | 1999-10-06 | 2003-01-07 | Goldman, Sachs & Co. | Order centric tracking system |
US7020685B1 (en) | 1999-10-08 | 2006-03-28 | Openwave Systems Inc. | Method and apparatus for providing internet content to SMS-based wireless devices |
AU8030300A (en) | 1999-10-19 | 2001-04-30 | Sony Electronics Inc. | Natural language interface control system |
US6807574B1 (en) * | 1999-10-22 | 2004-10-19 | Tellme Networks, Inc. | Method and apparatus for content personalization over a telephone interface |
JP2001125896A (ja) | 1999-10-26 | 2001-05-11 | Victor Co Of Japan Ltd | 自然言語対話システム |
US7310600B1 (en) | 1999-10-28 | 2007-12-18 | Canon Kabushiki Kaisha | Language recognition using a similarity measure |
US7392185B2 (en) * | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US6633846B1 (en) | 1999-11-12 | 2003-10-14 | Phoenix Solutions, Inc. | Distributed realtime speech recognition system |
US7050977B1 (en) | 1999-11-12 | 2006-05-23 | Phoenix Solutions, Inc. | Speech-enabled server for internet website and method |
US6615172B1 (en) | 1999-11-12 | 2003-09-02 | Phoenix Solutions, Inc. | Intelligent query engine for processing voice based queries |
US9076448B2 (en) | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US7725307B2 (en) | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US6665640B1 (en) | 1999-11-12 | 2003-12-16 | Phoenix Solutions, Inc. | Interactive speech based learning/training system formulating search queries based on natural language parsing of recognized user queries |
US6532446B1 (en) | 1999-11-24 | 2003-03-11 | Openwave Systems Inc. | Server based speech recognition user interface for wireless devices |
US6526382B1 (en) * | 1999-12-07 | 2003-02-25 | Comverse, Inc. | Language-oriented user interfaces for voice activated services |
US6526395B1 (en) * | 1999-12-31 | 2003-02-25 | Intel Corporation | Application of personality models and interaction with synthetic characters in a computing system |
US6556983B1 (en) | 2000-01-12 | 2003-04-29 | Microsoft Corporation | Methods and apparatus for finding semantic information, such as usage logs, similar to a query using a pattern lattice data space |
US6546388B1 (en) | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US6701294B1 (en) | 2000-01-19 | 2004-03-02 | Lucent Technologies, Inc. | User interface for translating natural language inquiries into database queries and data presentations |
US6829603B1 (en) | 2000-02-02 | 2004-12-07 | International Business Machines Corp. | System, method and program product for interactive natural dialog |
US6895558B1 (en) | 2000-02-11 | 2005-05-17 | Microsoft Corporation | Multi-access mode electronic personal assistant |
US6640098B1 (en) | 2000-02-14 | 2003-10-28 | Action Engine Corporation | System for obtaining service-related information for local interactive wireless devices |
AU2001243277A1 (en) * | 2000-02-25 | 2001-09-03 | Synquiry Technologies, Ltd. | Conceptual factoring and unification of graphs representing semantic models |
US6449620B1 (en) | 2000-03-02 | 2002-09-10 | Nimble Technology, Inc. | Method and apparatus for generating information pages using semi-structured data stored in a structured manner |
US6895380B2 (en) | 2000-03-02 | 2005-05-17 | Electro Standards Laboratories | Voice actuation with contextual learning for intelligent machine control |
US6757362B1 (en) | 2000-03-06 | 2004-06-29 | Avaya Technology Corp. | Personal virtual assistant |
US6466654B1 (en) | 2000-03-06 | 2002-10-15 | Avaya Technology Corp. | Personal virtual assistant with semantic tagging |
EP1275042A2 (en) | 2000-03-06 | 2003-01-15 | Kanisa Inc. | A system and method for providing an intelligent multi-step dialog with a user |
US6477488B1 (en) | 2000-03-10 | 2002-11-05 | Apple Computer, Inc. | Method for dynamic context scope selection in hybrid n-gram+LSA language modeling |
US6615220B1 (en) | 2000-03-14 | 2003-09-02 | Oracle International Corporation | Method and mechanism for data consolidation |
US6510417B1 (en) * | 2000-03-21 | 2003-01-21 | America Online, Inc. | System and method for voice access to internet-based information |
GB2366009B (en) | 2000-03-22 | 2004-07-21 | Canon Kk | Natural language machine interface |
JP3728172B2 (ja) | 2000-03-31 | 2005-12-21 | キヤノン株式会社 | 音声合成方法および装置 |
US7177798B2 (en) | 2000-04-07 | 2007-02-13 | Rensselaer Polytechnic Institute | Natural language interface using constrained intermediate dictionary of results |
US6810379B1 (en) | 2000-04-24 | 2004-10-26 | Sensory, Inc. | Client/server architecture for text-to-speech synthesis |
US6691111B2 (en) * | 2000-06-30 | 2004-02-10 | Research In Motion Limited | System and method for implementing a natural language user interface |
US6684187B1 (en) * | 2000-06-30 | 2004-01-27 | At&T Corp. | Method and system for preselection of suitable units for concatenative speech |
US6505158B1 (en) * | 2000-07-05 | 2003-01-07 | At&T Corp. | Synthesis-based pre-selection of suitable units for concatenative speech |
JP3949356B2 (ja) | 2000-07-12 | 2007-07-25 | 三菱電機株式会社 | 音声対話システム |
US7139709B2 (en) | 2000-07-20 | 2006-11-21 | Microsoft Corporation | Middleware layer between speech related applications and engines |
JP2002041276A (ja) | 2000-07-24 | 2002-02-08 | Sony Corp | 対話型操作支援システム及び対話型操作支援方法、並びに記憶媒体 |
US20060143007A1 (en) | 2000-07-24 | 2006-06-29 | Koh V E | User interaction with voice information services |
US7092928B1 (en) | 2000-07-31 | 2006-08-15 | Quantum Leap Research, Inc. | Intelligent portal engine |
US6778951B1 (en) | 2000-08-09 | 2004-08-17 | Concerto Software, Inc. | Information retrieval method with natural language interface |
US6766320B1 (en) | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
DE10042944C2 (de) | 2000-08-31 | 2003-03-13 | Siemens Ag | Graphem-Phonem-Konvertierung |
DE60127274T2 (de) | 2000-09-15 | 2007-12-20 | Lernout & Hauspie Speech Products N.V. | Schnelle wellenformsynchronisation für die verkettung und zeitskalenmodifikation von sprachsignalen |
US7216080B2 (en) | 2000-09-29 | 2007-05-08 | Mindfabric Holdings Llc | Natural-language voice-activated personal assistant |
US6832194B1 (en) | 2000-10-26 | 2004-12-14 | Sensory, Incorporated | Audio recognition peripheral system |
US7027974B1 (en) | 2000-10-27 | 2006-04-11 | Science Applications International Corporation | Ontology-based parser for natural language processing |
US7006969B2 (en) | 2000-11-02 | 2006-02-28 | At&T Corp. | System and method of pattern recognition in very high-dimensional space |
US7139245B2 (en) * | 2000-11-17 | 2006-11-21 | Infineon Technologies North America Corp. | Priority handling of voice over data in a voice-over-internet protocol processor |
WO2002050816A1 (en) | 2000-12-18 | 2002-06-27 | Koninklijke Philips Electronics N.V. | Store speech, select vocabulary to recognize word |
US6937986B2 (en) | 2000-12-28 | 2005-08-30 | Comverse, Inc. | Automatic dynamic speech recognition vocabulary based on external sources of information |
WO2002054239A2 (en) * | 2000-12-29 | 2002-07-11 | General Electric Company | Method and system for identifying repeatedly malfunctioning equipment |
US7257537B2 (en) * | 2001-01-12 | 2007-08-14 | International Business Machines Corporation | Method and apparatus for performing dialog management in a computer conversational interface |
US6964023B2 (en) | 2001-02-05 | 2005-11-08 | International Business Machines Corporation | System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input |
US7290039B1 (en) | 2001-02-27 | 2007-10-30 | Microsoft Corporation | Intent based processing |
US6721728B2 (en) | 2001-03-02 | 2004-04-13 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for discovering phrases in a database |
WO2002073451A2 (en) | 2001-03-13 | 2002-09-19 | Intelligate Ltd. | Dynamic natural language understanding |
US6996531B2 (en) * | 2001-03-30 | 2006-02-07 | Comverse Ltd. | Automated database assistance using a telephone for a speech based or text based multimedia communication mode |
GB0110326D0 (en) | 2001-04-27 | 2001-06-20 | Ibm | Method and apparatus for interoperation between legacy software and screen reader programs |
US6654740B2 (en) | 2001-05-08 | 2003-11-25 | Sunflare Co., Ltd. | Probabilistic information retrieval based on differential latent semantic space |
US7085722B2 (en) | 2001-05-14 | 2006-08-01 | Sony Computer Entertainment America Inc. | System and method for menu-driven voice control of characters in a game environment |
US6944594B2 (en) | 2001-05-30 | 2005-09-13 | Bellsouth Intellectual Property Corporation | Multi-context conversational environment system and method |
US20020194003A1 (en) | 2001-06-05 | 2002-12-19 | Mozer Todd F. | Client-server security system and method |
US20020198714A1 (en) | 2001-06-26 | 2002-12-26 | Guojun Zhou | Statistical spoken dialog system |
US7139722B2 (en) | 2001-06-27 | 2006-11-21 | Bellsouth Intellectual Property Corporation | Location and time sensitive wireless calendaring |
US6604059B2 (en) | 2001-07-10 | 2003-08-05 | Koninklijke Philips Electronics N.V. | Predictive calendar |
US7987151B2 (en) | 2001-08-10 | 2011-07-26 | General Dynamics Advanced Info Systems, Inc. | Apparatus and method for problem solving using intelligent agents |
US20030036909A1 (en) * | 2001-08-17 | 2003-02-20 | Yoshinaga Kato | Methods and devices for operating the multi-function peripherals |
US6813491B1 (en) | 2001-08-31 | 2004-11-02 | Openwave Systems Inc. | Method and apparatus for adapting settings of wireless communication devices in accordance with user proximity |
US7403938B2 (en) | 2001-09-24 | 2008-07-22 | Iac Search & Media, Inc. | Natural language query processing |
US6985865B1 (en) * | 2001-09-26 | 2006-01-10 | Sprint Spectrum L.P. | Method and system for enhanced response to voice commands in a voice command platform |
US20050196732A1 (en) | 2001-09-26 | 2005-09-08 | Scientific Learning Corporation | Method and apparatus for automated training of language learning skills |
US6650735B2 (en) | 2001-09-27 | 2003-11-18 | Microsoft Corporation | Integrated voice access to a variety of personal information services |
US7324947B2 (en) | 2001-10-03 | 2008-01-29 | Promptu Systems Corporation | Global speech user interface |
US7167832B2 (en) | 2001-10-15 | 2007-01-23 | At&T Corp. | Method for dialog management |
GB2381409B (en) | 2001-10-27 | 2004-04-28 | Hewlett Packard Ltd | Asynchronous access to synchronous voice services |
NO316480B1 (no) | 2001-11-15 | 2004-01-26 | Forinnova As | Fremgangsmåte og system for tekstuell granskning og oppdagelse |
US20030101054A1 (en) | 2001-11-27 | 2003-05-29 | Ncc, Llc | Integrated system and method for electronic speech recognition and transcription |
JP2003163745A (ja) * | 2001-11-28 | 2003-06-06 | Matsushita Electric Ind Co Ltd | 電話機、対話応答装置、対話応答端末および対話応答システム |
TW541517B (en) | 2001-12-25 | 2003-07-11 | Univ Nat Cheng Kung | Speech recognition system |
US6895257B2 (en) | 2002-02-18 | 2005-05-17 | Matsushita Electric Industrial Co., Ltd. | Personalized agent for portable devices and cellular phone |
US7197460B1 (en) | 2002-04-23 | 2007-03-27 | At&T Corp. | System for handling frequently asked questions in a natural language dialog service |
US6847966B1 (en) * | 2002-04-24 | 2005-01-25 | Engenium Corporation | Method and system for optimally searching a document database using a representative semantic space |
US7546382B2 (en) | 2002-05-28 | 2009-06-09 | International Business Machines Corporation | Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US6999066B2 (en) * | 2002-06-24 | 2006-02-14 | Xerox Corporation | System for audible feedback for touch screen displays |
US7260529B1 (en) | 2002-06-25 | 2007-08-21 | Lengen Nicholas D | Command insertion system and method for voice recognition applications |
US7299033B2 (en) | 2002-06-28 | 2007-11-20 | Openwave Systems Inc. | Domain-based management of distribution of digital content from multiple suppliers to multiple wireless services subscribers |
US7233790B2 (en) | 2002-06-28 | 2007-06-19 | Openwave Systems, Inc. | Device capability based discovery, packaging and provisioning of content for wireless mobile devices |
US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
US7467087B1 (en) | 2002-10-10 | 2008-12-16 | Gillick Laurence S | Training and using pronunciation guessers in speech recognition |
WO2004049306A1 (en) | 2002-11-22 | 2004-06-10 | Roy Rosser | Autonomous response engine |
WO2004053836A1 (en) | 2002-12-10 | 2004-06-24 | Kirusa, Inc. | Techniques for disambiguating speech input using multimodal interfaces |
US7386449B2 (en) | 2002-12-11 | 2008-06-10 | Voice Enabling Systems Technology Inc. | Knowledge-based flexible natural speech dialogue system |
US8661112B2 (en) * | 2002-12-20 | 2014-02-25 | Nuance Communications, Inc. | Customized interactive voice response menus |
US7956766B2 (en) | 2003-01-06 | 2011-06-07 | Panasonic Corporation | Apparatus operating system |
US7529671B2 (en) | 2003-03-04 | 2009-05-05 | Microsoft Corporation | Block synchronous decoding |
US6980949B2 (en) | 2003-03-14 | 2005-12-27 | Sonum Technologies, Inc. | Natural language processor |
US7496498B2 (en) | 2003-03-24 | 2009-02-24 | Microsoft Corporation | Front-end architecture for a multi-lingual text-to-speech system |
US7421393B1 (en) | 2004-03-01 | 2008-09-02 | At&T Corp. | System for developing a dialog manager using modular spoken-dialog components |
US20050045373A1 (en) * | 2003-05-27 | 2005-03-03 | Joseph Born | Portable media device with audio prompt menu |
US7200559B2 (en) | 2003-05-29 | 2007-04-03 | Microsoft Corporation | Semantic object synchronous understanding implemented with speech application language tags |
US7720683B1 (en) | 2003-06-13 | 2010-05-18 | Sensory, Inc. | Method and apparatus of specifying and performing speech recognition operations |
US7757173B2 (en) * | 2003-07-18 | 2010-07-13 | Apple Inc. | Voice menu system |
JP4551635B2 (ja) | 2003-07-31 | 2010-09-29 | ソニー株式会社 | パイプライン処理システムおよび情報処理装置 |
US7475010B2 (en) * | 2003-09-03 | 2009-01-06 | Lingospot, Inc. | Adaptive and scalable method for resolving natural language ambiguities |
US7418392B1 (en) | 2003-09-25 | 2008-08-26 | Sensory, Inc. | System and method for controlling the operation of a device by voice commands |
US7155706B2 (en) | 2003-10-24 | 2006-12-26 | Microsoft Corporation | Administrative tool environment |
US7412385B2 (en) | 2003-11-12 | 2008-08-12 | Microsoft Corporation | System for identifying paraphrases using machine translation |
US7584092B2 (en) | 2004-11-15 | 2009-09-01 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7447630B2 (en) | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
CN1890708B (zh) | 2003-12-05 | 2011-12-07 | 株式会社建伍 | 音频设备控制装置、音频设备控制方法及程序 |
ES2312851T3 (es) | 2003-12-16 | 2009-03-01 | Loquendo Spa | Procedimiento y sistema texto a voz y el programa informatico asociado. |
US7427024B1 (en) | 2003-12-17 | 2008-09-23 | Gazdzinski Mark J | Chattel management apparatus and methods |
US7552055B2 (en) | 2004-01-10 | 2009-06-23 | Microsoft Corporation | Dialog component re-use in recognition systems |
EP1704558B8 (en) | 2004-01-16 | 2011-09-21 | Nuance Communications, Inc. | Corpus-based speech synthesis based on segment recombination |
US20050165607A1 (en) | 2004-01-22 | 2005-07-28 | At&T Corp. | System and method to disambiguate and clarify user intention in a spoken dialog system |
ATE415684T1 (de) | 2004-01-29 | 2008-12-15 | Harman Becker Automotive Sys | Verfahren und system zur sprachdialogschnittstelle |
KR100462292B1 (ko) | 2004-02-26 | 2004-12-17 | 엔에이치엔(주) | 중요도 정보를 반영한 검색 결과 리스트 제공 방법 및 그시스템 |
US7693715B2 (en) | 2004-03-10 | 2010-04-06 | Microsoft Corporation | Generating large units of graphonemes with mutual information criterion for letter to sound conversion |
US7409337B1 (en) | 2004-03-30 | 2008-08-05 | Microsoft Corporation | Natural language processing interface |
US7496512B2 (en) | 2004-04-13 | 2009-02-24 | Microsoft Corporation | Refining of segmental boundaries in speech waveforms using contextual-dependent models |
US8095364B2 (en) * | 2004-06-02 | 2012-01-10 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US7720674B2 (en) | 2004-06-29 | 2010-05-18 | Sap Ag | Systems and methods for processing natural language queries |
TWI252049B (en) * | 2004-07-23 | 2006-03-21 | Inventec Corp | Sound control system and method |
US7936861B2 (en) * | 2004-07-23 | 2011-05-03 | At&T Intellectual Property I, L.P. | Announcement system and method of use |
US7725318B2 (en) | 2004-07-30 | 2010-05-25 | Nice Systems Inc. | System and method for improving the accuracy of audio searching |
US7853574B2 (en) | 2004-08-26 | 2010-12-14 | International Business Machines Corporation | Method of generating a context-inferenced search query and of sorting a result of the query |
KR20060022001A (ko) | 2004-09-06 | 2006-03-09 | 현대모비스 주식회사 | 카오디오 버튼 설치구조 |
US7716056B2 (en) | 2004-09-27 | 2010-05-11 | Robert Bosch Corporation | Method and system for interactive conversational dialogue for cognitively overloaded device users |
US8107401B2 (en) * | 2004-09-30 | 2012-01-31 | Avaya Inc. | Method and apparatus for providing a virtual assistant to a communication participant |
US7735012B2 (en) * | 2004-11-04 | 2010-06-08 | Apple Inc. | Audio user interface for computing devices |
US7546235B2 (en) | 2004-11-15 | 2009-06-09 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7552046B2 (en) | 2004-11-15 | 2009-06-23 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7702500B2 (en) | 2004-11-24 | 2010-04-20 | Blaedow Karen R | Method and apparatus for determining the meaning of natural language |
CN1609859A (zh) | 2004-11-26 | 2005-04-27 | 孙斌 | 搜索结果聚类的方法 |
US7376645B2 (en) | 2004-11-29 | 2008-05-20 | The Intellection Group, Inc. | Multimodal natural language query system and architecture for processing voice and proximity-based queries |
US20060122834A1 (en) | 2004-12-03 | 2006-06-08 | Bennett Ian M | Emotion detection device & method for use in distributed systems |
US8214214B2 (en) | 2004-12-03 | 2012-07-03 | Phoenix Solutions, Inc. | Emotion detection device and method for use in distributed systems |
US7636657B2 (en) | 2004-12-09 | 2009-12-22 | Microsoft Corporation | Method and apparatus for automatic grammar generation from data entries |
US20060143576A1 (en) | 2004-12-23 | 2006-06-29 | Gupta Anurag K | Method and system for resolving cross-modal references in user inputs |
US7873654B2 (en) * | 2005-01-24 | 2011-01-18 | The Intellection Group, Inc. | Multimodal natural language query system for processing and analyzing voice and proximity-based queries |
US7508373B2 (en) | 2005-01-28 | 2009-03-24 | Microsoft Corporation | Form factor and input method for language input |
GB0502259D0 (en) | 2005-02-03 | 2005-03-09 | British Telecomm | Document searching tool and method |
US7676026B1 (en) | 2005-03-08 | 2010-03-09 | Baxtech Asia Pte Ltd | Desktop telephony system |
US7925525B2 (en) | 2005-03-25 | 2011-04-12 | Microsoft Corporation | Smart reminders |
WO2006129967A1 (en) | 2005-05-30 | 2006-12-07 | Daumsoft, Inc. | Conversation system and method using conversational agent |
US8041570B2 (en) | 2005-05-31 | 2011-10-18 | Robert Bosch Corporation | Dialogue management using scripts |
US8024195B2 (en) | 2005-06-27 | 2011-09-20 | Sensory, Inc. | Systems and methods of performing speech recognition using historical information |
US7826945B2 (en) | 2005-07-01 | 2010-11-02 | You Zhang | Automobile speech-recognition interface |
US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US20070067309A1 (en) | 2005-08-05 | 2007-03-22 | Realnetworks, Inc. | System and method for updating profiles |
US7620549B2 (en) * | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
WO2007027989A2 (en) | 2005-08-31 | 2007-03-08 | Voicebox Technologies, Inc. | Dynamic speech sharpening |
US8265939B2 (en) | 2005-08-31 | 2012-09-11 | Nuance Communications, Inc. | Hierarchical methods and apparatus for extracting user intent from spoken utterances |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
JP4908094B2 (ja) | 2005-09-30 | 2012-04-04 | 株式会社リコー | 情報処理システム、情報処理方法及び情報処理プログラム |
US7930168B2 (en) | 2005-10-04 | 2011-04-19 | Robert Bosch Gmbh | Natural language processing of disfluent sentences |
US8620667B2 (en) | 2005-10-17 | 2013-12-31 | Microsoft Corporation | Flexible speech-activated command and control |
US7707032B2 (en) | 2005-10-20 | 2010-04-27 | National Cheng Kung University | Method and system for matching speech data |
US20070106674A1 (en) | 2005-11-10 | 2007-05-10 | Purusharth Agrawal | Field sales process facilitation systems and methods |
US20070112572A1 (en) * | 2005-11-15 | 2007-05-17 | Fail Keith W | Method and apparatus for assisting vision impaired individuals with selecting items from a list |
US7822749B2 (en) | 2005-11-28 | 2010-10-26 | Commvault Systems, Inc. | Systems and methods for classifying and transferring information in a storage network |
KR20070057496A (ko) | 2005-12-02 | 2007-06-07 | 삼성전자주식회사 | 액정 표시 장치 |
KR100810500B1 (ko) | 2005-12-08 | 2008-03-07 | 한국전자통신연구원 | 대화형 음성 인터페이스 시스템에서의 사용자 편의성증대 방법 |
US20070211071A1 (en) | 2005-12-20 | 2007-09-13 | Benjamin Slotznick | Method and apparatus for interacting with a visually displayed document on a screen reader |
DE102005061365A1 (de) | 2005-12-21 | 2007-06-28 | Siemens Ag | Verfahren zur Ansteuerung zumindest einer ersten und zweiten Hintergrundapplikation über ein universelles Sprachdialogsystem |
US7996228B2 (en) | 2005-12-22 | 2011-08-09 | Microsoft Corporation | Voice initiated network operations |
US7599918B2 (en) | 2005-12-29 | 2009-10-06 | Microsoft Corporation | Dynamic search with implicit user intention mining |
JP2007183864A (ja) | 2006-01-10 | 2007-07-19 | Fujitsu Ltd | ファイル検索方法及びそのシステム |
US20070174188A1 (en) | 2006-01-25 | 2007-07-26 | Fish Robert D | Electronic marketplace that facilitates transactions between consolidated buyers and/or sellers |
IL174107A0 (en) * | 2006-02-01 | 2006-08-01 | Grois Dan | Method and system for advertising by means of a search engine over a data network |
US8209063B2 (en) * | 2006-02-13 | 2012-06-26 | Research In Motion Limited | Navigation tool with audible feedback on a handheld communication device |
KR100764174B1 (ko) | 2006-03-03 | 2007-10-08 | 삼성전자주식회사 | 음성 대화 서비스 장치 및 방법 |
US7752152B2 (en) | 2006-03-17 | 2010-07-06 | Microsoft Corporation | Using predictive user models for language modeling on a personal device with user behavior models based on statistical modeling |
JP4734155B2 (ja) | 2006-03-24 | 2011-07-27 | 株式会社東芝 | 音声認識装置、音声認識方法および音声認識プログラム |
US7707027B2 (en) | 2006-04-13 | 2010-04-27 | Nuance Communications, Inc. | Identification and rejection of meaningless input during natural language classification |
US8423347B2 (en) | 2006-06-06 | 2013-04-16 | Microsoft Corporation | Natural language personal information management |
US7483894B2 (en) * | 2006-06-07 | 2009-01-27 | Platformation Technologies, Inc | Methods and apparatus for entity search |
US20100257160A1 (en) | 2006-06-07 | 2010-10-07 | Yu Cao | Methods & apparatus for searching with awareness of different types of information |
US7523108B2 (en) | 2006-06-07 | 2009-04-21 | Platformation, Inc. | Methods and apparatus for searching with awareness of geography and languages |
KR100776800B1 (ko) | 2006-06-16 | 2007-11-19 | 한국전자통신연구원 | 지능형 가제트를 이용한 맞춤형 서비스 제공 방법 및시스템 |
KR20080001227A (ko) | 2006-06-29 | 2008-01-03 | 엘지.필립스 엘시디 주식회사 | 백라이트 유닛의 램프 고정장치 |
US7548895B2 (en) | 2006-06-30 | 2009-06-16 | Microsoft Corporation | Communication-prompted user assistance |
TWI312103B (en) * | 2006-07-17 | 2009-07-11 | Asia Optical Co Inc | Image pickup systems and methods |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8073681B2 (en) * | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
US20080114604A1 (en) * | 2006-11-15 | 2008-05-15 | Motorola, Inc. | Method and system for a user interface using higher order commands |
US20080129520A1 (en) | 2006-12-01 | 2008-06-05 | Apple Computer, Inc. | Electronic device with enhanced audio feedback |
EP2099652B1 (en) * | 2007-01-05 | 2016-11-16 | Visteon Global Technologies, Inc. | System and method for customized audio prompting |
WO2008085742A2 (en) | 2007-01-07 | 2008-07-17 | Apple Inc. | Portable multifunction device, method and graphical user interface for interacting with user input elements in displayed content |
KR100883657B1 (ko) | 2007-01-26 | 2009-02-18 | 삼성전자주식회사 | 음성 인식 기반의 음악 검색 방법 및 장치 |
US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
US7822608B2 (en) | 2007-02-27 | 2010-10-26 | Nuance Communications, Inc. | Disambiguating a speech recognition grammar in a multimodal application |
US20080221880A1 (en) | 2007-03-07 | 2008-09-11 | Cerra Joseph P | Mobile music environment speech processing facility |
US7801729B2 (en) | 2007-03-13 | 2010-09-21 | Sensory, Inc. | Using multiple attributes to create a voice search playlist |
US8219406B2 (en) | 2007-03-15 | 2012-07-10 | Microsoft Corporation | Speech-centric multimodal user interface design in mobile technology |
US7809610B2 (en) | 2007-04-09 | 2010-10-05 | Platformation, Inc. | Methods and apparatus for freshness and completeness of information |
US7983915B2 (en) | 2007-04-30 | 2011-07-19 | Sonic Foundry, Inc. | Audio content search engine |
US8055708B2 (en) | 2007-06-01 | 2011-11-08 | Microsoft Corporation | Multimedia spaces |
US8204238B2 (en) | 2007-06-08 | 2012-06-19 | Sensory, Inc | Systems and methods of sonic communication |
KR20080109322A (ko) | 2007-06-12 | 2008-12-17 | 엘지전자 주식회사 | 사용자의 직관적 의도 파악에 따른 서비스 제공 방법 및장치 |
US8190627B2 (en) * | 2007-06-28 | 2012-05-29 | Microsoft Corporation | Machine assisted query formulation |
US8019606B2 (en) * | 2007-06-29 | 2011-09-13 | Microsoft Corporation | Identification and selection of a software application via speech |
JP2009036999A (ja) | 2007-08-01 | 2009-02-19 | Infocom Corp | コンピュータによる対話方法、対話システム、コンピュータプログラムおよびコンピュータに読み取り可能な記憶媒体 |
KR101359715B1 (ko) | 2007-08-24 | 2014-02-10 | 삼성전자주식회사 | 모바일 음성 웹 제공 방법 및 장치 |
US8190359B2 (en) | 2007-08-31 | 2012-05-29 | Proxpro, Inc. | Situation-aware personal information management for a mobile device |
US20090058823A1 (en) | 2007-09-04 | 2009-03-05 | Apple Inc. | Virtual Keyboards in Multi-Language Environment |
US8838760B2 (en) | 2007-09-14 | 2014-09-16 | Ricoh Co., Ltd. | Workflow-enabled provider |
KR100920267B1 (ko) | 2007-09-17 | 2009-10-05 | 한국전자통신연구원 | 음성 대화 분석 시스템 및 그 방법 |
US8706476B2 (en) | 2007-09-18 | 2014-04-22 | Ariadne Genomics, Inc. | Natural language processing method by analyzing primitive sentences, logical clauses, clause types and verbal blocks |
US8165886B1 (en) | 2007-10-04 | 2012-04-24 | Great Northern Research LLC | Speech interface system and method for control and interaction with applications on a computing system |
US8036901B2 (en) | 2007-10-05 | 2011-10-11 | Sensory, Incorporated | Systems and methods of performing speech recognition using sensory inputs of human position |
US20090112677A1 (en) | 2007-10-24 | 2009-04-30 | Rhett Randolph L | Method for automatically developing suggested optimal work schedules from unsorted group and individual task lists |
US7840447B2 (en) | 2007-10-30 | 2010-11-23 | Leonard Kleinrock | Pricing and auctioning of bundled items among multiple sellers and buyers |
US7983997B2 (en) | 2007-11-02 | 2011-07-19 | Florida Institute For Human And Machine Cognition, Inc. | Interactive complex task teaching system that allows for natural language input, recognizes a user's intent, and automatically performs tasks in document object model (DOM) nodes |
CN101179754A (zh) * | 2007-11-08 | 2008-05-14 | 深圳市戴文科技有限公司 | 一种交互式业务的实现方法及移动终端 |
US8112280B2 (en) | 2007-11-19 | 2012-02-07 | Sensory, Inc. | Systems and methods of performing speech recognition with barge-in for use in a bluetooth system |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
US8095680B2 (en) | 2007-12-20 | 2012-01-10 | Telefonaktiebolaget Lm Ericsson (Publ) | Real-time network transport protocol interface method and apparatus |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
CN101188644A (zh) * | 2007-12-26 | 2008-05-28 | 中国工商银行股份有限公司 | 银行语音服务方法与系统 |
US8219407B1 (en) | 2007-12-27 | 2012-07-10 | Great Northern Research, LLC | Method for processing the output of a speech recognizer |
US20090187950A1 (en) * | 2008-01-18 | 2009-07-23 | At&T Knowledge Ventures, L.P. | Audible menu system |
KR101334066B1 (ko) | 2008-02-11 | 2013-11-29 | 이점식 | 진화하는 사이버 로봇 시스템 및 그 제공 방법 |
US8099289B2 (en) | 2008-02-13 | 2012-01-17 | Sensory, Inc. | Voice interface and search for electronic devices including bluetooth headsets and remote systems |
CN101246020B (zh) * | 2008-03-14 | 2011-05-25 | 深圳市凯立德科技股份有限公司 | 语音播报装置、使用了此装置的导航系统及其采用的方法 |
US8958848B2 (en) | 2008-04-08 | 2015-02-17 | Lg Electronics Inc. | Mobile terminal and menu control method thereof |
US8666824B2 (en) | 2008-04-23 | 2014-03-04 | Dell Products L.P. | Digital media content location and purchasing system |
US8285344B2 (en) | 2008-05-21 | 2012-10-09 | DP Technlogies, Inc. | Method and apparatus for adjusting audio for a user environment |
US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8694355B2 (en) | 2008-05-30 | 2014-04-08 | Sri International | Method and apparatus for automated assistance with task management |
US8423288B2 (en) | 2009-11-30 | 2013-04-16 | Apple Inc. | Dynamic alerts for calendar events |
US8166019B1 (en) | 2008-07-21 | 2012-04-24 | Sprint Communications Company L.P. | Providing suggested actions in response to textual communications |
US8898568B2 (en) * | 2008-09-09 | 2014-11-25 | Apple Inc. | Audio user interface |
KR101005074B1 (ko) | 2008-09-18 | 2010-12-30 | 주식회사 수현테크 | 합성수지제 파이프 연결 고정구 |
US9200913B2 (en) | 2008-10-07 | 2015-12-01 | Telecommunication Systems, Inc. | User interface for predictive traffic |
US8140328B2 (en) | 2008-12-01 | 2012-03-20 | At&T Intellectual Property I, L.P. | User intention based on N-best list of recognition hypotheses for utterances in a dialog |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US8805823B2 (en) | 2009-04-14 | 2014-08-12 | Sri International | Content processing systems and methods |
US8606735B2 (en) | 2009-04-30 | 2013-12-10 | Samsung Electronics Co., Ltd. | Apparatus and method for predicting user's intention based on multimodal information |
KR101581883B1 (ko) | 2009-04-30 | 2016-01-11 | 삼성전자주식회사 | 모션 정보를 이용하는 음성 검출 장치 및 방법 |
US10540976B2 (en) | 2009-06-05 | 2020-01-21 | Apple Inc. | Contextual voice commands |
US10255566B2 (en) | 2011-06-03 | 2019-04-09 | Apple Inc. | Generating and processing task items that represent tasks to perform |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
KR101562792B1 (ko) | 2009-06-10 | 2015-10-23 | 삼성전자주식회사 | 목표 예측 인터페이스 제공 장치 및 그 방법 |
US8527278B2 (en) | 2009-06-29 | 2013-09-03 | Abraham Ben David | Intelligent home automation |
US20110047072A1 (en) | 2009-08-07 | 2011-02-24 | Visa U.S.A. Inc. | Systems and Methods for Propensity Analysis and Validation |
US8768313B2 (en) | 2009-08-17 | 2014-07-01 | Digimarc Corporation | Methods and systems for image or audio recognition processing |
US9213558B2 (en) | 2009-09-02 | 2015-12-15 | Sri International | Method and apparatus for tailoring the output of an intelligent automated assistant to a user |
US8321527B2 (en) | 2009-09-10 | 2012-11-27 | Tribal Brands | System and method for tracking user location and associated activity and responsively providing mobile device updates |
KR20110036385A (ko) | 2009-10-01 | 2011-04-07 | 삼성전자주식회사 | 사용자 의도 분석 장치 및 방법 |
US20110099507A1 (en) * | 2009-10-28 | 2011-04-28 | Google Inc. | Displaying a collection of interactive elements that trigger actions directed to an item |
US9197736B2 (en) | 2009-12-31 | 2015-11-24 | Digimarc Corporation | Intuitive computing methods and systems |
US20120137367A1 (en) | 2009-11-06 | 2012-05-31 | Cataphora, Inc. | Continuous anomaly detection based on behavior modeling and heterogeneous information analysis |
US9171541B2 (en) | 2009-11-10 | 2015-10-27 | Voicebox Technologies Corporation | System and method for hybrid processing in a natural language voice services environment |
US9502025B2 (en) | 2009-11-10 | 2016-11-22 | Voicebox Technologies Corporation | System and method for providing a natural language content dedication service |
US8712759B2 (en) | 2009-11-13 | 2014-04-29 | Clausal Computing Oy | Specializing disambiguation of a natural language expression |
KR101960835B1 (ko) | 2009-11-24 | 2019-03-21 | 삼성전자주식회사 | 대화 로봇을 이용한 일정 관리 시스템 및 그 방법 |
US8396888B2 (en) * | 2009-12-04 | 2013-03-12 | Google Inc. | Location-based searching using a search area that corresponds to a geographical location of a computing device |
KR101622111B1 (ko) | 2009-12-11 | 2016-05-18 | 삼성전자 주식회사 | 대화 시스템 및 그의 대화 방법 |
US20110161309A1 (en) | 2009-12-29 | 2011-06-30 | Lx1 Technology Limited | Method Of Sorting The Result Set Of A Search Engine |
US8494852B2 (en) * | 2010-01-05 | 2013-07-23 | Google Inc. | Word-level correction of speech input |
US8381107B2 (en) * | 2010-01-13 | 2013-02-19 | Apple Inc. | Adaptive audio feedback system and method |
US8334842B2 (en) | 2010-01-15 | 2012-12-18 | Microsoft Corporation | Recognizing user intent in motion capture system |
US8626511B2 (en) | 2010-01-22 | 2014-01-07 | Google Inc. | Multi-dimensional disambiguation of voice commands |
US20110218855A1 (en) | 2010-03-03 | 2011-09-08 | Platformation, Inc. | Offering Promotions Based on Query Analysis |
KR101369810B1 (ko) | 2010-04-09 | 2014-03-05 | 이초강 | 로봇을 위한 경험적 상황인식 방법을 실행하는 프로그램을 기록한 컴퓨터 판독가능한 기록 매체. |
US8265928B2 (en) * | 2010-04-14 | 2012-09-11 | Google Inc. | Geotagged environmental audio for enhanced speech recognition accuracy |
US20110279368A1 (en) | 2010-05-12 | 2011-11-17 | Microsoft Corporation | Inferring user intent to engage a motion capture system |
US8694313B2 (en) * | 2010-05-19 | 2014-04-08 | Google Inc. | Disambiguation of contact information using historical data |
US8522283B2 (en) | 2010-05-20 | 2013-08-27 | Google Inc. | Television remote control data transfer |
US8468012B2 (en) | 2010-05-26 | 2013-06-18 | Google Inc. | Acoustic model adaptation using geographic information |
US20110306426A1 (en) | 2010-06-10 | 2011-12-15 | Microsoft Corporation | Activity Participation Based On User Intent |
US8234111B2 (en) * | 2010-06-14 | 2012-07-31 | Google Inc. | Speech and noise models for speech recognition |
US8411874B2 (en) * | 2010-06-30 | 2013-04-02 | Google Inc. | Removing noise from audio |
US8775156B2 (en) | 2010-08-05 | 2014-07-08 | Google Inc. | Translating languages in response to device motion |
US8359020B2 (en) | 2010-08-06 | 2013-01-22 | Google Inc. | Automatically monitoring for voice input based on context |
US8473289B2 (en) | 2010-08-06 | 2013-06-25 | Google Inc. | Disambiguating input based on context |
JP2014520297A (ja) | 2011-04-25 | 2014-08-21 | ベベオ,インク. | 高度な個人的時刻表アシスタントのためのシステムよび方法 |
-
2008
- 2008-09-05 US US12/205,780 patent/US8768702B2/en active Active
-
2009
- 2009-09-01 CN CN200980134444.1A patent/CN102144209B/zh active Active
- 2009-09-01 CN CN201310562133.3A patent/CN103558964B/zh active Active
- 2009-09-01 WO PCT/US2009/055577 patent/WO2010027953A1/en active Application Filing
- 2009-09-01 EP EP16150079.8A patent/EP3026541B1/en active Active
- 2009-09-01 EP EP09792124.1A patent/EP2329348B1/en active Active
-
2013
- 2013-12-26 US US14/141,254 patent/US9691383B2/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1937674A (zh) * | 2005-08-27 | 2007-03-28 | 国际商业机器公司 | 用于语音门户服务器的方法和装置 |
Also Published As
Publication number | Publication date |
---|---|
EP3026541B1 (en) | 2019-11-13 |
US20100063818A1 (en) | 2010-03-11 |
US8768702B2 (en) | 2014-07-01 |
US9691383B2 (en) | 2017-06-27 |
CN102144209A (zh) | 2011-08-03 |
CN103558964B (zh) | 2018-03-23 |
CN103558964A (zh) | 2014-02-05 |
EP2329348B1 (en) | 2016-01-06 |
WO2010027953A1 (en) | 2010-03-11 |
US20140108017A1 (en) | 2014-04-17 |
EP3026541A1 (en) | 2016-06-01 |
EP2329348A1 (en) | 2011-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102144209B (zh) | 电子设备中的多层次话音反馈 | |
US11900017B2 (en) | Optimizing display engagement in action automation | |
KR102331049B1 (ko) | 통신 개시를 위한 사용자 신호 레버리징 | |
CN107209781B (zh) | 使用自然语言的上下文搜索 | |
JP7065740B2 (ja) | アプリケーション機能情報表示方法、装置、及び端末装置 | |
KR102022318B1 (ko) | 음성 인식을 사용하여 사용자 기능을 수행하는 방법 및 장치 | |
CN106462380B (zh) | 用于对于语音命令提供提示的系统和方法 | |
CN107210033A (zh) | 基于众包来更新用于数字个人助理的语言理解分类器模型 | |
US11749276B2 (en) | Voice assistant-enabled web application or web page | |
CN103399847B (zh) | 用于管理计算环境语言的应用语言库 | |
CN106104528A (zh) | 用于屏幕上项目选择和消歧的基于模型的方法 | |
CN104685470A (zh) | 用于从模板生成用户界面的设备和方法 | |
CN107111492A (zh) | 跨诸设备缩放数字个人助理代理 | |
CN107967055A (zh) | 一种人机交互方法、终端及计算机可读介质 | |
CN104282302A (zh) | 用于识别语音和文本的设备和方法 | |
CN102763159A (zh) | 话音输入的处理 | |
CN101794208A (zh) | 用于无显示器的电子设备的音频用户接口 | |
CN105359121A (zh) | 使用接收数据的应用远程操作 | |
US20140040735A1 (en) | Method for providing voice guidance function and an electronic device thereof | |
JP2014049140A (ja) | 使用者デバイスで入力文字を利用した知能型サービス提供方法及び装置 | |
CN105556937A (zh) | 移动终端及其控制方法 | |
CN108287918A (zh) | 基于应用页面的音乐播放方法、装置、存储介质和电子设备 | |
CN104461348A (zh) | 信息选取方法及装置 | |
TWI703492B (zh) | 控制使用者介面之方法、程式及裝置 | |
US8769169B2 (en) | Assistive buffer usage techniques |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1159805 Country of ref document: HK |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1159805 Country of ref document: HK |