WO1995024681A1 - Speech-controlled input device with logic standard interface - Google Patents
Speech-controlled input device with logic standard interface Download PDFInfo
- Publication number
- WO1995024681A1 WO1995024681A1 PCT/SE1995/000234 SE9500234W WO9524681A1 WO 1995024681 A1 WO1995024681 A1 WO 1995024681A1 SE 9500234 W SE9500234 W SE 9500234W WO 9524681 A1 WO9524681 A1 WO 9524681A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech
- code
- standard
- computer
- sid
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/03—Arrangements for converting the position or the displacement of a member into a coded form
- G06F3/033—Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
- G06F3/038—Control and interface arrangements therefor, e.g. drivers or device-embedded control circuitry
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
Definitions
- This invention appertains to a device, hereinafter called SID, that speech-controls the input of data, to an operative unit, for example, a computer.
- SID a device
- the SID has been designed on speech-recognition technology principles and can be speech-dependent or speech-independent.
- the invention is chiefly intended for the simple and easy operation of a computer without the need to use hands.
- the SID is a device for speech-control of indata along with a logic standard interface, i.e., a device that generates standard codes or signals. It completely does away with the use of a keyboard and/or a control unit.
- the SID is a system driver in order to function as a standard interface between applications and external units, for example, a microphone. That is, all outdata to the world outside, such as other applications, is a code according to a standard code system.
- the keyboard has both character keys, e.g., a, b; control and special keys, e.g., ESC, TAB,
- INSERT HOME
- function keys e.g., Fl, F2 and key combinations with ALT-, CTRL- or
- the keyboard's drive routines is the standard interface for communication between keyboard and the outside world (applications)
- a keyboard is sufficient in order to input text and control all computer programs including menu-controlled programs in the computer environment without the need of having to adapt either the keyboard, drive routines or the applications.
- This is precisely what the SID is, a general input device. And it is for this reason that the SID uses standard codes for its outdata.
- the SID uses, for example, the code system according to ANSI standards, then, from a function point of view, the SID operates exactly like a keyboard and its drive routines, but instead of depressing keys in a physical keyboard, we speak into a microphone.
- the SID differs radically from the above-mentioned USP.
- the SID can only be compared to a keyboard or mouse and their drive routines, but not with an application program, for example, the word processor "Microsoft
- USP is a "Text editor for speech input” word processor, that is, a separate application program that employs speech recognition technology to input text into its own application,
- the text that is fed through speech input into the "Text editor” is not any standard code, but a so-called “token and string format” (USP: fig. 3, fig. 4), which the "Text editor” designer has defined just for this particular application.
- this word processor has no other interface for other applications than its "Text editor", so it cannot input text or operate any other application than itself.
- the SID On the other hand, is a drive routine and uses standard codes that follow a standard code system in order to be able to replace a keyboard completely. If, for example, according to the ANSI standards, code "97" is in-coded instead of "a”, and code "00;59” instead of "Fl” as outdata in order that SID can be a standard interface for all standard applications in the computer environment.
- the SID sends as its outdata to an operating unit, not only the standard code for the writeable characters but also the standard code for one or a combination of several of all the keyboard keys, according to those to be found in a standard code system.
- an operating unit not only the standard code for the writeable characters but also the standard code for one or a combination of several of all the keyboard keys, according to those to be found in a standard code system.
- these can be:
- Control and special keys e.g.: ESC, TAB, HOME, INSERT; CTRL + character key, e.g. : CTRL+A, CTRL+B; ALT + character key, e.g.: ALT+A, ALT+B; SHIFT + key, e.g.: SHIFT+TAB; Function keys: Fl, F2; also one or a combination of several of these keys, e.g.: CTRL+F1, ALT+F2, and so on.
- CTRL + character key e.g. : CTRL+A, CTRL+B
- ALT + character key e.g.: ALT+A, ALT+B
- SHIFT + key e.g.: SHIFT+TAB
- Function keys Fl, F2; also one or a combination of several of these keys, e.g.: CTRL+F1, ALT+F2, and so on.
- the SID In contrast to the USP, which inputs text only to its own word processor, the SID is a general input device for controlling of and text input to other applications.
- A.2 SID activates menus in menu-controlled programs in order to roll up the "blinds” and operate the different commands in the sub-menus and conversational displays by sending out standard codes for the ALT+character key combinations.
- ALT+A combination has standard code "00;30", which activates, e.g., the "Archive” menu in the menu row in any standard application whatsoever that has an underlined A, "A” in its menu row. This is exactly the same as depressing the ALT-key and the A-key simultaneously on an ordinary computer keyboard.
- the SID With the help of CTRL+characters or other key, the SID briefly activates the command, that is, it activates the command without needing to roll up the blinds, and it also executes the defined functions for these codes in different programs.
- the SID also sends the code for the control and special keys such as ESC, HOME, Function keys and the code for the other key combinations according to the standardized code system.
- A.5 The SID directs the cursor on the display by using Escape sequences according to ANSI standards.
- the USP can do none of these whatsoever, apart from the ability to write only writeable characters, i.e., some of the ASCII characters, via its own defined code format.
- the SID is, moreover, a device that, among other things: is easy and quick to install and start operating, is easy to learn and not difficult to use, can be very simply translated into other languages, has "on-board” unlimited vocabulary, and so on.
- the SID not only can generate the ASCII-code for digits and letters, but also generate the code for other characters, for example, the control characters that are used for control and communication purposes, graphics, foreign and mathematical symbols. This feature automatically provides the system with an on-board "unlimited” vocabulary. Moreover, due to the fact that a set-up of the expanded ASCII-code (Extended ASCII Code), also contains character codes for foreign characters, the SID has an on-board unlimited "multi-language" vocabulary.
- the invention is a simple, easy-to-use, flexible product from the viewpoint of the user, since it eliminates or reduces the above-mentioned disadvantages and the problems with word-based speech-recognition products.
- This invention also has a number of properties non-existent in other systems, properties that pave the way for new possibilities, areas of use and product ideas.
- the means necessary to achieve such a device are components required to introduce voice signals, e.g., a microphone, an audio card, system driver, i.e., SID's program, a microcomputer for the physically independent input device and a communications medium.
- voice signals e.g., a microphone, an audio card, system driver, i.e., SID's program, a microcomputer for the physically independent input device and a communications medium.
- Figure 1 shows a general layout.
- Figures 2 shows a physically independent, speech-controlled input device and 3 show a logic such.
- Figure 4 shows where the application program and the SID program are situated in a data system.
- SID's program which is a system driver, is a standard interface between the application program and external unit, e.g., a microphone.
- the SID can be designed either with speech-dependent or speech-independent recognition, depending on the application in which the SID is to be used. If user identification and unit limitation use is desired, a speech-dependent recognition SID should be used, but if, e.g., a multi-user system is desired, then a speech-independent recognition unit is to be preferred.
- the SID is made up of hardware and software, chiefly in the following order: a device for introducing voice signals in the system, e.g., a microphone 1, a audio card 2, SID's program 3, a microcomputer 4, an arrangement for a physically independent unit 5, and also communications medium 6, 7, partly between the arrangement for the introduction of voice signals 1, and the unit 5/8, and partly between the unit and the device(s) 9, to be controlled.
- a device for introducing voice signals in the system e.g., a microphone 1, a audio card 2, SID's program 3, a microcomputer 4, an arrangement for a physically independent unit 5, and also communications medium 6, 7, partly between the arrangement for the introduction of voice signals 1, and the unit 5/8, and partly between the unit and the device(s) 9, to be controlled.
- the SID should be regarded as a "black box" 10, into which the voice signal, which represents, for example, a letter, a digit, or other keys and key combinations, is inputted and there compared to signals already stored in the "box", from which comes a standard code according to whichever standard system is used, e.g., ANSI, ASCII, ISO, EBEDIC, teletext, digit tones, etc.
- a standard code e.g., ANSI, ASCII, ISO, EBEDIC, teletext, digit tones, etc.
- the user starts, for example, by "training the computer” for the "vocabulary" intended for the computer program. That is to say, an installation- /set up program requests the user to speak into the microphone as per instructions. The user inputs the name of several or all the keys and their standard combinations, one after the other, by speaking them into the microphone. The uttered characters are digitized by a audio card.
- the SID program stores these vocal expressions of these names in a register, in which each item is related to the standard code for each of these keys or standard key combinations. After the training session has been completed, the SID is ready for use. Now, when the user says, e.g., a character into the computer, this character is digitized and this is then compared with the characters already stored in the SID-program's register. When the character is recognized, the standard code for the corresponding character is fetched and sent further for processing of other programs or units.
- the program functions irrespective who the user is, as long as he/she speaks the same (human) language for which the program is pre-programmed.
- the SID-program does not need to be seen on the display, just as in the same way the keyboard's drive routines are not shown on it.
- the SID can be variously configured for different purposes means that it can be applied to a computer or other apparatuses that have a keyboard and/or use a standard code system, in order to replace completely or be a complement to the keyboard. This, therefore, makes it possible to operate, for example, the computer and all its programs with only the speech.
- the SID can be used for writeable characters, that is, letters, digits and other characters, also for non-writeable characters, that is, control and special characters.
- the SID can be used in order to replace function keys, replace buttons on a control unit, replace commands or special command sequences in a program.
- the SID can be used to provide alias for certain characters for special occasions, e.g., in connection with computer games, and also for calling up files or for its own defined commands.
- the SID can be applied to a printer or an electronic typewriter in order to create a speech-controlled typewriter.
- the SID can be applied to a fax-machine in order to create a speech-controlled fax-machine that vocally faxes messages without the use of paper.
- the SID can be applied to a code-lock or authorized control system with personal code.
- the SID can be applied to a telephone or to a code-lock system for telephone numbering by generating, for example, digit tones instead of ASCII-code.
- the SID can also be used in applications in connection with interactive telecommunication and data communication, speech-controlled toys, computer games, and so on.
- the SID can be used to replace completely or be a complement to a control unit.
- the cursor is steered on a display by indicating the position you wish it to have.
- an XY- co-ordinate or X- and Y- co-ordinates can be used to move the cursor to the desired position.
- the chief function of the SID is to replace a keyboard and/or a control unit. This makes it an ideal handicap aid.
- the SID can be designed as an independent unit 5, with physical interfaces 12, 13. Its connect-up 13 could then be plugged into the standard input socket 9 on the equipment to be controlled or used.
- the communication medium between the SID and other equipment 7, also between the speech input and the equipment for speech-input 6, may vary, and will therefore require different kinds of cables or other medium, such as lighting, modulated radio, etc.
- the SID can also be a logic unit 8. In other words, it can be a part of, for example, a computer system and make use of its hardware to operate and control the system, the programs 14, or other equipment 9 connected to the system.
- Speech-controlled computers can be configured and applied for speech-operating a number of apparatuses and equipment, or in connection with the production of new products. Examples of these are: Speech-controlled computers; speech-controlled typewriters; speech-controlled, paperless fax machines; speech-controlled, paperless mobile fax machines; speech-controlled code-lock system or authorized control with personal code; speech-controlled remote control; speech-controlled telephone, applications within telecommunication and data communication; speech-controlled toys; speech-controlled computer games, etc.
- the SID can have many different areas of use, such as a handicap aid that makes it easy to control and operate a computer, different programs and the equipment that is connected to a computer. With the SID a handicapped person will be able to work or communicate with the world outside. It can be used to improve the working environment where it could prevent, e.g., working ailments that arise from overuse of a computer mouse and keyboard. The SID can be used to improve data security, that is, to reduce the risks of incorrect input in connection with, for example, registration or word processing.
- Productivity can also be increased since it is quicker to speak than to write; greater flexibility is achieved through the elimination of the need for a keyboard and hands since these will be replaced by the voice and, e.g., the microphone for such things as stock handling and mobile units can be more compact, take up less space because the need for a keyboard, etc., has been eliminated.
- SID is not limited just to the units, applications, areas of use described above. It also has varied potential within the framework for the basic idea behind it.
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP95912536A EP0749596A1 (en) | 1994-03-08 | 1995-03-06 | Speech-controlled input device with logic standard interface |
AU19650/95A AU1965095A (en) | 1994-03-08 | 1995-03-06 | Speech-controlled input device with logic standard interface |
JP7523395A JPH09510307A (en) | 1994-03-08 | 1995-03-06 | Voice controlled input device with logical standard interface |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9400768A SE502013C2 (en) | 1994-03-08 | 1994-03-08 | Speech-controlled input device with standard logical interface |
SE9400768-9 | 1994-03-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1995024681A1 true WO1995024681A1 (en) | 1995-09-14 |
Family
ID=20393192
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/SE1995/000234 WO1995024681A1 (en) | 1994-03-08 | 1995-03-06 | Speech-controlled input device with logic standard interface |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP0749596A1 (en) |
JP (1) | JPH09510307A (en) |
AU (1) | AU1965095A (en) |
SE (1) | SE502013C2 (en) |
WO (1) | WO1995024681A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19938535A1 (en) * | 1999-08-13 | 2001-02-15 | Christoph Bueltemann | Conversion of spoken commands into a form for input commands to a computer system |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4914704A (en) * | 1984-10-30 | 1990-04-03 | International Business Machines Corporation | Text editor for speech input |
US5133011A (en) * | 1990-12-26 | 1992-07-21 | International Business Machines Corporation | Method and apparatus for linear vocal control of cursor position |
-
1994
- 1994-03-08 SE SE9400768A patent/SE502013C2/en not_active IP Right Cessation
-
1995
- 1995-03-06 JP JP7523395A patent/JPH09510307A/en not_active Ceased
- 1995-03-06 WO PCT/SE1995/000234 patent/WO1995024681A1/en not_active Application Discontinuation
- 1995-03-06 AU AU19650/95A patent/AU1965095A/en not_active Abandoned
- 1995-03-06 EP EP95912536A patent/EP0749596A1/en not_active Withdrawn
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4914704A (en) * | 1984-10-30 | 1990-04-03 | International Business Machines Corporation | Text editor for speech input |
US5133011A (en) * | 1990-12-26 | 1992-07-21 | International Business Machines Corporation | Method and apparatus for linear vocal control of cursor position |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19938535A1 (en) * | 1999-08-13 | 2001-02-15 | Christoph Bueltemann | Conversion of spoken commands into a form for input commands to a computer system |
Also Published As
Publication number | Publication date |
---|---|
EP0749596A1 (en) | 1996-12-27 |
JPH09510307A (en) | 1997-10-14 |
SE9400768L (en) | 1995-07-17 |
SE9400768D0 (en) | 1994-03-08 |
SE502013C2 (en) | 1995-07-17 |
AU1965095A (en) | 1995-09-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7260529B1 (en) | Command insertion system and method for voice recognition applications | |
US5812977A (en) | Voice control computer interface enabling implementation of common subroutines | |
Cohen et al. | The role of voice input for human-machine communication. | |
US5893064A (en) | Speech recognition method and apparatus with voice commands and associated keystrokes | |
EP1654631A2 (en) | Screen reader having concurrent communication of non-textual information | |
KR20050103196A (en) | Device and method for voicing phonemes, and keyboard for use in such a device | |
KR100917552B1 (en) | Method and system for improving the fidelity of a dialog system | |
WO2003098599A1 (en) | Voice command and voice recognition for hand-held devices | |
CN101137979A (en) | Phrase constructor for translator | |
US5113481A (en) | Augmentive communications system and method | |
EP0749596A1 (en) | Speech-controlled input device with logic standard interface | |
US20090306978A1 (en) | Method and system for encoding languages | |
CN111009234B (en) | Voice conversion method, device and equipment | |
JP3411198B2 (en) | Interpreting apparatus and method, and medium storing interpreting apparatus control program | |
Matsuda et al. | Finger Braille teaching system for people who communicate with deafblind people | |
JPH03132797A (en) | Voice recognition device | |
JP2004170466A (en) | Voice recognition method and electronic device | |
US20180341337A1 (en) | One-handed typing system for eyes-free operation using a numerical key unit | |
JPS5979337A (en) | Input device | |
JPH0863185A (en) | Speech recognition device | |
Hayes | Expanding the horizons of natural language interfaces | |
Rudžionis et al. | Control of computer and electric devices by voice | |
WO2005122130B1 (en) | User created interactive interface | |
JPH02238494A (en) | Voice synthesizing device | |
Carlson et al. | A multi-language, portable text-to-speech system for the disabled |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA CN JP RU US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
ENP | Entry into the national phase |
Ref country code: US Ref document number: 1996 702719 Date of ref document: 19960906 Kind code of ref document: A Format of ref document f/p: F |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1995912536 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1995912536 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: CA |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 1995912536 Country of ref document: EP |