US7707035B2 - Autonomous integrated headset and sound processing system for tactical applications - Google Patents

Autonomous integrated headset and sound processing system for tactical applications Download PDF

Info

Publication number
US7707035B2
US7707035B2 US11/251,350 US25135005A US7707035B2 US 7707035 B2 US7707035 B2 US 7707035B2 US 25135005 A US25135005 A US 25135005A US 7707035 B2 US7707035 B2 US 7707035B2
Authority
US
United States
Prior art keywords
sound
speech
user
headset
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US11/251,350
Other versions
US20070100637A1 (en
Inventor
Timothy S. McCune
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Integrated Wave Technologies Inc
Original Assignee
Integrated Wave Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Integrated Wave Technologies Inc filed Critical Integrated Wave Technologies Inc
Priority to US11/251,350 priority Critical patent/US7707035B2/en
Assigned to INTEGRATED WAVE TECHNOLOGIES, INC. reassignment INTEGRATED WAVE TECHNOLOGIES, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MCCUNE, TIMOTHY S.
Publication of US20070100637A1 publication Critical patent/US20070100637A1/en
Application granted granted Critical
Publication of US7707035B2 publication Critical patent/US7707035B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Definitions

  • This invention relates generally to sound processing and analysis systems, and more particularly the invention relates to an autonomous integrated headset communications, data processing, and sound analysis system suitable for military tactical use.
  • the present invention is directed to such a system.
  • a sound processing system and headset support a user at performing communication and control functions autonomously and separately from a connection to a network or computer, as well as providing the user with a speech, speech recognition and sound analysis input capability to a network, computer and/or other headset users and two-way audio communication.
  • the system is also capable of functioning as a sound analysis sensor/node connected to a network.
  • the headset first of all has a form factor capable of insulating the user's ears from direct exposure to ambient sound and supporting integral hardware and with cables and connectors as required to interface with target designation systems, radio transmitters, and other communications equipment.
  • the system includes:
  • a boom or other microphone for the purpose of user speech input for voice-to-voice translation, communications/control by speech recognition and communications over radio or other channels;
  • a filtering/amplification system comprising hardware/software sound filter elements and an analog/digital converter to convert sound around the user and sounds spoken by the user into computer input;
  • speakers on the outside of the headset to play sound for other persons, including translated phrases that are part of the one-way and two-way voice-to-voice translation function.
  • the system further includes a light sensor to provide specific frequency and intensity data; and helmet-mounted cameras.
  • the headset includes a miniature data processing system to integrate system components, perform sound/speech recognition, execute user commands/control input; provide feedback to the user based upon system operations or input provided through a communication channel.
  • a sound analysis/speech recognition software system analyzes speech and non-speech sounds and takes programmed action based upon this analysis.
  • Software/hardware recognizes English or other language speech spoken by the users for the purpose of one-way or two-way voice-to-voice translation, and software/hardware to recognize English or other language speech spoken to the user for the purpose of two-way voice-to-voice translation.
  • software/hardware is provided to recognize English or other language speech for the purpose of word spotting to detect and identify elements of conversation around the user; and to recognize vehicle or other sounds for the purpose of identifying the sources of the sounds to the user.
  • FIG. 1 illustrates a headset with integrated functions in accordance with the invention.
  • FIG. 2 illustrates the integrated functions in a preferred embodiment of the invention.
  • FIG. 3 illustrates components of the sound processing system of FIGS. 1 , 2 .
  • FIG. 4 illustrates one-way voice-to-voice translator functions of FIG. 2 .
  • FIG. 5 illustrates a flowchart of ambient noise processing functions of FIG. 2 .
  • FIG. 6 illustrates a flowchart of ambient noise identification functions of FIG. 2 .
  • FIG. 7 illustrates a flowchart of voice control commands/control function of FIG. 2 .
  • FIG. 8 illustrates a target identification and designation system of FIG. 2 .
  • FIG. 9 illustrates a two-way voice-to-voice translator function of FIG. 2 .
  • FIG. 10 illustrates a question/response architecture for two-way translator of FIG. 2 .
  • FIG. 11 illustrates a word spotting function of FIG. 2 .
  • FIG. 12 illustrates a light identification function of FIG. 2 .
  • FIG. 1 illustrates a headset shown generally at 10 positioned on a user's ears to provide insulation from direct exposure to ambient sound.
  • Microphone 12 provides for user speech input, while speakers in the headset provide sound to the user from a filtering/amplification system. Speakers 14 outside of the headset provide sound for others including translated phrases in the one-way and two-way voice-to-voice translation system.
  • a light sensor 16 provides specific frequency and intensity data, and external microphones and sound sensors 18 pick up sounds around the user. Cables 20 provide interface with other devices including target designation systems, radio transmitters, and other communication devices.
  • a central processing unit (CPU— FIG. 3 ) in the headset processes sound, light, and other input and recognizes speech inputs from users and persons with whom the user is interacting.
  • CPU central processing unit
  • FIG. 2 illustrates integrated functions of the sound processing system.
  • Ambient sound at 22 is provided through ambient sound filter and level adjustment system 24 to the user.
  • an ambient non-speech sound identifier 26 notifies a user if a sound is identified in a database, such as the sound of aircraft or other vehicles.
  • a word spotter utterance identifier for non-interview speech at 28 notifies the user if an utterance is identified in a database. The sound is sorted by frequency range and then by fundamental characteristics to determine whether it is speech or non-speech, and then further identified based on frequency and other characteristics.
  • functional unit 30 passes utterances from the subject to a noise filter and interview subject speech recognizer 32 to translate foreign language utterances into English, which are then applied to the user.
  • This function is based on a system in continued developed by IWT. Words and phrases within the target domains are characterized by gathering sample utterances made of those words/phrases in the target languages by native speakers. Those samples are processed into template packages that allow for fast analysis.
  • the user can respond to any of these function units or otherwise provide voice communication through an external network at 34 , a voice command for one-way translator 36 , a two-way translator and voice command of output to the interview subject at 38 , a voice command/control of non-translation functions, including C4SRI inputs and targeting information at 40 . Additionally, voice command of helmet-mounted camera or cameras is provided at 42 . The user can receive audio instructions for language, mission and task instruction at 44 . Integration of these multiple functions provides new user capabilities.
  • FIG. 3 is a functional block diagram showing more details of system components for the ambient sounds and user's voice input of the system of FIG. 2 .
  • a central processing unit 50 receives inputs from directional ambient microphones 22 and user's voice input microphone 52 , and after processing the input sounds, provides outputs to built in speakers for external sound projection 54 , target identification and designation system 56 , interconnection with solider communication system 58 , vest mounted sound projection box 60 , and the user headset to play processed sounds at 62 .
  • the headset integrates exterior microphones in addition to the traditional small speakers in each ear enclosure.
  • FIG. 4 is block diagram of one-way voice-to-voice translator function 36 of FIG. 2 .
  • the translation function is initiated by user voice command with the system operating by providing the user with domains of input and output words and phrases.
  • the user activates the system function by voice command and then issues foreign language output phrases by issuing voice commands through the boom microphone.
  • the user boom microphone 30 provides sound inputs to sound filter and speech recognition unit 32 which recognizes user utterances and then provides outputs to the headset speakers and/or auxiliary speakers at 64 and to sound output from auxiliary speakers 66 .
  • Translation is initiated by user voice command.
  • the system operates by providing the user with domains of input and output words and phrases.
  • the user activates the system function by voice command and then issues foreign language output phrases by issuing voice commands through the boom microphone.
  • Ambient noise conditions for the overall system operation require the use of advanced sound filtering.
  • FIG. 5 is a block diagram of the ambient noise input system function.
  • directional ambient sound microphones 68 provide inputs to hardware and software filters to remove noise and non-significant features at 70 .
  • the filtered output is then transmitted to the user and to central processing for analysis at 72 .
  • the system receives audio inputs through one or more microphones set to receive ambient sound. Non-significant noise and features are removed through hardware and software filters.
  • FIG. 6 is a flow diagram of the ambient noise identification function 26 of FIG. 2 .
  • the directional ambient sound microphones 68 provide inputs to the central processing unit 50 which then provides audio cues to the user at 62 .
  • the system receives ambient sound input from the headset's built in directional microphones, and this sound it filtered, digitized and re-filtered before being sent to the central processing function.
  • the central processing function then implements sound recognition algorithms to analyze and identify ambient sound. As sounds are identified, the central processor command specific audio cues are played through the headset so the user will be informed of the identification.
  • FIG. 7 illustrates the voice control command/control function shown at 40 in FIG. 2 .
  • the user's input microphone 52 directs the voice commands to central processing unit 50 , which then issues command/control signals to the headset system components at 74 , and to external system components at 76 .
  • a command verification audio cue is sent to the user through the headset at 78 .
  • This system function provides the user with voice command capability for varied requirements. The user engages and disengages the various functions in the system by voice command, and the user can issue voice commands that send a signal over the command/control network for expanded functions, such as to provide audio status of dates of mission information.
  • FIG. 8 is a functional block diagram of target identification and designation function, which provides for more effective target acquisition by allowing the user to input targeting information in an eyes-free, hands-free manner. This can include identifying weapons systems near the user through ambient sound analysis.
  • the combined capabilities of target identification, integration with target designation devices, and voice command of targeting provide the user with a more effective basis for providing information on these targets to supporting forces.
  • the user can identify and select a target, either using his own senses alone or assisted by the system's sound identification capability.
  • the user then enters critical data using voice commands.
  • a target designator/illuminator such as a laser designator, is engaged by voice or hand command. Location and identification is then sent to command/control network to support engagement of other forces.
  • FIG. 9 is a functional block diagram of two-way voice-to-voice translator function shown at 38 in FIG. 2 .
  • the two-way translator function is initiated by user voice command, and once this function is activated the user can then use voice commands to begin two-way translation sequences as shown.
  • the user instructs interview subjects to elicit specific responses to meet information requirements. Further, the system filters ambient noise to receive foreign language interview subject responses. The system then identifies within total sounds received, utterances that match word/phrases with in the expected response domain. Identified responses are then translated into English and played for the user. Through this structure, the user is able to control and direct a useful two-way conversation with the interview subject.
  • FIG. 10 illustrates question/response architecture for two-way translation and shows the process for conducting controlled response two-way voice-to-voice translation.
  • FIG. 11 is a functional diagram of the word spotting function.
  • the system receives ambient sound input from the headset's built in directional microphones. This sound is filtered, digitized and re-filtered before being sent to the central processing function of the system.
  • the central processing function then implements word spotting recognition algorithms to analyze and identify ambient sound. As target words are identified, the central processor will command specific audio cues to be played through the headset so that the user will be informed of the identification.
  • FIG. 12 illustrates system components of the word processing function, including light sensors.
  • the system receives ambient light input from the headset's built-in sensors 80 , and the central processing function implements identification algorithms to analyze and identify ambient light.
  • the central processor prepares specific audio cues to be played through the headset so that user will be informed of the identification.
  • the user can select from these audio cues of identified light to be played to him as they occur, or he can select that this information be played to him only if he requests an update by voice command.
  • the system detects a light signature that indicates an explosion occurred. The system correlates this with sound received from the explosion to calculate the distance and inform the user by audio cue of the explosion and all calculated information. If the system is able to calculate direction in addition to distance, this information is provided as well.
  • the central processor will characterize the sound by frequency and fundamental characteristics. Sounds deemed to be possible speech utterances will be converted to template form and compared with relevant utterance templates stored in the system database. If a match is made by the processor, the information will be relayed to the user through the use of audio cues. Again, the central processor receives inputs indicating a relevant sound or light had been detected and analyze these inputs. Based on this analysis, the headset's internal speakers will play informational cues to the user and in some instances ask the user to provide inputs related to further analysis/action.
  • the headset provides speech recognition, noise filtering, and sound analysis and identification.
  • One-way and two-way voice-to-voice translation is facilitated along with voice command of computer controlled functions and voice command of helmet mounted cameras.
  • Computer generated audio cues are provided to the user, and optical and other sensors can be provided for situational awareness.
  • the headset can be connected to a network, computer and/or wireless links with other headsets as well as providing the user with speech, speech recognition, and sound analysis input.

Abstract

A sound processing system including a user headset for use in tactical military operations provides integrated sound and speech analysis including sound filtering and amplification, sound analysis and speech recognition for analyzing speech and non-speech sounds and taking programmed actions based on the analysis, recognizing language of speech for purposes of one-way and two-way voice translation, word spotting to detect and identify elements of conversation, and non-speech recognition and identification. The headset includes housings with form factors for insulating a user's ear from direct exposure to ambient sounds with at least one microphone for receiving sound around the user, and a microphone for receiving user speech. The user headset can further include interconnections for connecting the headset with out systems outside of the headset, including target designation systems, communication networks, and radio transmitters.

Description

BACKGROUND OF THE INVENTION
This invention relates generally to sound processing and analysis systems, and more particularly the invention relates to an autonomous integrated headset communications, data processing, and sound analysis system suitable for military tactical use.
In military operations, participants are required to provide communication and control functions in environments having many external sounds and light stimuli and with other participants who may speak different languages. The level of success in implementing these functions depends in large part on the availability and ease of use of a supporting light, data, and sound processing system.
The present invention is directed to such a system.
SUMMARY OF THE INVENTION
In accordance with the invention, a sound processing system and headset support a user at performing communication and control functions autonomously and separately from a connection to a network or computer, as well as providing the user with a speech, speech recognition and sound analysis input capability to a network, computer and/or other headset users and two-way audio communication.
All system capabilities, with the exception of use of communications channels/networks, work on a stand alone basis and exist in the absence of external communications and/or computing resources. The system is also capable of functioning as a sound analysis sensor/node connected to a network.
In a preferred embodiment, the headset first of all has a form factor capable of insulating the user's ears from direct exposure to ambient sound and supporting integral hardware and with cables and connectors as required to interface with target designation systems, radio transmitters, and other communications equipment.
In a preferred embodiment, the system includes:
an outwardly aimed microphone for the purposes of hearing sound around the user;
a boom or other microphone for the purpose of user speech input for voice-to-voice translation, communications/control by speech recognition and communications over radio or other channels;
a filtering/amplification system comprising hardware/software sound filter elements and an analog/digital converter to convert sound around the user and sounds spoken by the user into computer input;
speakers in the headset to play for the user sound produced by the filtering/amplification system, as well as sound produced by the integrated computer: and
speakers on the outside of the headset to play sound for other persons, including translated phrases that are part of the one-way and two-way voice-to-voice translation function.
In a preferred embodiment, the system further includes a light sensor to provide specific frequency and intensity data; and helmet-mounted cameras.
In another preferred embodiment, the headset includes a miniature data processing system to integrate system components, perform sound/speech recognition, execute user commands/control input; provide feedback to the user based upon system operations or input provided through a communication channel. Further, a sound analysis/speech recognition software system analyzes speech and non-speech sounds and takes programmed action based upon this analysis. Software/hardware recognizes English or other language speech spoken by the users for the purpose of one-way or two-way voice-to-voice translation, and software/hardware to recognize English or other language speech spoken to the user for the purpose of two-way voice-to-voice translation. Additionally, software/hardware is provided to recognize English or other language speech for the purpose of word spotting to detect and identify elements of conversation around the user; and to recognize vehicle or other sounds for the purpose of identifying the sources of the sounds to the user.
The invention and objects and features thereof will be more readily apparent from the following detailed description and appended claims when taken with the drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 illustrates a headset with integrated functions in accordance with the invention.
FIG. 2 illustrates the integrated functions in a preferred embodiment of the invention.
FIG. 3 illustrates components of the sound processing system of FIGS. 1, 2.
FIG. 4 illustrates one-way voice-to-voice translator functions of FIG. 2.
FIG. 5 illustrates a flowchart of ambient noise processing functions of FIG. 2.
FIG. 6 illustrates a flowchart of ambient noise identification functions of FIG. 2.
FIG. 7 illustrates a flowchart of voice control commands/control function of FIG. 2.
FIG. 8 illustrates a target identification and designation system of FIG. 2.
FIG. 9 illustrates a two-way voice-to-voice translator function of FIG. 2.
FIG. 10 illustrates a question/response architecture for two-way translator of FIG. 2.
FIG. 11 illustrates a word spotting function of FIG. 2.
FIG. 12 illustrates a light identification function of FIG. 2.
DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS
FIG. 1 illustrates a headset shown generally at 10 positioned on a user's ears to provide insulation from direct exposure to ambient sound. Microphone 12 provides for user speech input, while speakers in the headset provide sound to the user from a filtering/amplification system. Speakers 14 outside of the headset provide sound for others including translated phrases in the one-way and two-way voice-to-voice translation system. A light sensor 16 provides specific frequency and intensity data, and external microphones and sound sensors 18 pick up sounds around the user. Cables 20 provide interface with other devices including target designation systems, radio transmitters, and other communication devices. A central processing unit (CPU—FIG. 3) in the headset processes sound, light, and other input and recognizes speech inputs from users and persons with whom the user is interacting.
FIG. 2 illustrates integrated functions of the sound processing system. Ambient sound at 22 is provided through ambient sound filter and level adjustment system 24 to the user. Also, an ambient non-speech sound identifier 26 notifies a user if a sound is identified in a database, such as the sound of aircraft or other vehicles. A word spotter utterance identifier for non-interview speech at 28 notifies the user if an utterance is identified in a database. The sound is sorted by frequency range and then by fundamental characteristics to determine whether it is speech or non-speech, and then further identified based on frequency and other characteristics.
When a subject is to be interviewed, functional unit 30 passes utterances from the subject to a noise filter and interview subject speech recognizer 32 to translate foreign language utterances into English, which are then applied to the user. This function is based on a system in continued developed by IWT. Words and phrases within the target domains are characterized by gathering sample utterances made of those words/phrases in the target languages by native speakers. Those samples are processed into template packages that allow for fast analysis.
The user can respond to any of these function units or otherwise provide voice communication through an external network at 34, a voice command for one-way translator 36, a two-way translator and voice command of output to the interview subject at 38, a voice command/control of non-translation functions, including C4SRI inputs and targeting information at 40. Additionally, voice command of helmet-mounted camera or cameras is provided at 42. The user can receive audio instructions for language, mission and task instruction at 44. Integration of these multiple functions provides new user capabilities.
FIG. 3 is a functional block diagram showing more details of system components for the ambient sounds and user's voice input of the system of FIG. 2. A central processing unit 50 receives inputs from directional ambient microphones 22 and user's voice input microphone 52, and after processing the input sounds, provides outputs to built in speakers for external sound projection 54, target identification and designation system 56, interconnection with solider communication system 58, vest mounted sound projection box 60, and the user headset to play processed sounds at 62. Thus, the headset integrates exterior microphones in addition to the traditional small speakers in each ear enclosure.
FIG. 4 is block diagram of one-way voice-to-voice translator function 36 of FIG. 2. The translation function is initiated by user voice command with the system operating by providing the user with domains of input and output words and phrases. The user activates the system function by voice command and then issues foreign language output phrases by issuing voice commands through the boom microphone. Here, the user boom microphone 30 provides sound inputs to sound filter and speech recognition unit 32 which recognizes user utterances and then provides outputs to the headset speakers and/or auxiliary speakers at 64 and to sound output from auxiliary speakers 66. Translation is initiated by user voice command. The system operates by providing the user with domains of input and output words and phrases. The user activates the system function by voice command and then issues foreign language output phrases by issuing voice commands through the boom microphone. Ambient noise conditions for the overall system operation require the use of advanced sound filtering.
FIG. 5 is a block diagram of the ambient noise input system function. Here, directional ambient sound microphones 68 provide inputs to hardware and software filters to remove noise and non-significant features at 70. The filtered output is then transmitted to the user and to central processing for analysis at 72. The system receives audio inputs through one or more microphones set to receive ambient sound. Non-significant noise and features are removed through hardware and software filters.
FIG. 6 is a flow diagram of the ambient noise identification function 26 of FIG. 2. The directional ambient sound microphones 68 provide inputs to the central processing unit 50 which then provides audio cues to the user at 62. In operation, the system receives ambient sound input from the headset's built in directional microphones, and this sound it filtered, digitized and re-filtered before being sent to the central processing function. The central processing function then implements sound recognition algorithms to analyze and identify ambient sound. As sounds are identified, the central processor command specific audio cues are played through the headset so the user will be informed of the identification.
FIG. 7 illustrates the voice control command/control function shown at 40 in FIG. 2. The user's input microphone 52 directs the voice commands to central processing unit 50, which then issues command/control signals to the headset system components at 74, and to external system components at 76. A command verification audio cue is sent to the user through the headset at 78. This system function provides the user with voice command capability for varied requirements. The user engages and disengages the various functions in the system by voice command, and the user can issue voice commands that send a signal over the command/control network for expanded functions, such as to provide audio status of dates of mission information.
FIG. 8 is a functional block diagram of target identification and designation function, which provides for more effective target acquisition by allowing the user to input targeting information in an eyes-free, hands-free manner. This can include identifying weapons systems near the user through ambient sound analysis. The combined capabilities of target identification, integration with target designation devices, and voice command of targeting provide the user with a more effective basis for providing information on these targets to supporting forces. The user can identify and select a target, either using his own senses alone or assisted by the system's sound identification capability. The user then enters critical data using voice commands. A target designator/illuminator, such as a laser designator, is engaged by voice or hand command. Location and identification is then sent to command/control network to support engagement of other forces.
FIG. 9 is a functional block diagram of two-way voice-to-voice translator function shown at 38 in FIG. 2. The two-way translator function is initiated by user voice command, and once this function is activated the user can then use voice commands to begin two-way translation sequences as shown. The user instructs interview subjects to elicit specific responses to meet information requirements. Further, the system filters ambient noise to receive foreign language interview subject responses. The system then identifies within total sounds received, utterances that match word/phrases with in the expected response domain. Identified responses are then translated into English and played for the user. Through this structure, the user is able to control and direct a useful two-way conversation with the interview subject.
FIG. 10 illustrates question/response architecture for two-way translation and shows the process for conducting controlled response two-way voice-to-voice translation.
FIG. 11 is a functional diagram of the word spotting function. Here, the system receives ambient sound input from the headset's built in directional microphones. This sound is filtered, digitized and re-filtered before being sent to the central processing function of the system. The central processing function then implements word spotting recognition algorithms to analyze and identify ambient sound. As target words are identified, the central processor will command specific audio cues to be played through the headset so that the user will be informed of the identification.
FIG. 12 illustrates system components of the word processing function, including light sensors. The system receives ambient light input from the headset's built-in sensors 80, and the central processing function implements identification algorithms to analyze and identify ambient light. As target words are identified, the central processor prepares specific audio cues to be played through the headset so that user will be informed of the identification. The user can select from these audio cues of identified light to be played to him as they occur, or he can select that this information be played to him only if he requests an update by voice command. For example, in operation the system detects a light signature that indicates an explosion occurred. The system correlates this with sound received from the explosion to calculate the distance and inform the user by audio cue of the explosion and all calculated information. If the system is able to calculate direction in addition to distance, this information is provided as well.
In the instance of an audio input being received, the central processor will characterize the sound by frequency and fundamental characteristics. Sounds deemed to be possible speech utterances will be converted to template form and compared with relevant utterance templates stored in the system database. If a match is made by the processor, the information will be relayed to the user through the use of audio cues. Again, the central processor receives inputs indicating a relevant sound or light had been detected and analyze these inputs. Based on this analysis, the headset's internal speakers will play informational cues to the user and in some instances ask the user to provide inputs related to further analysis/action.
There has been described an autonomous integrated headset communications, data processing, and sounds analysis system for tactical applications. The headset provides speech recognition, noise filtering, and sound analysis and identification. One-way and two-way voice-to-voice translation is facilitated along with voice command of computer controlled functions and voice command of helmet mounted cameras. Computer generated audio cues are provided to the user, and optical and other sensors can be provided for situational awareness. In addition to autonomous operation, the headset can be connected to a network, computer and/or wireless links with other headsets as well as providing the user with speech, speech recognition, and sound analysis input.
While the invention has been described with reference to a preferred embodiment, the description is illustrative of the invention and is not to be construed as limiting the invention. Various modifications and applications may occur to those skilled in the art without departing from the true spirit and scope of the invention as defined by the appended claims.

Claims (5)

1. A sound processing system for tactical military operations comprising:
a) a headset including housings with form factors for insulating user ears from direct exposure to ambient sound,
b) at least one microphone for receiving sounds around a user,
c) a microphone for receiving user speech,
d) a processing unit for processing sound from said microphones, the processing unit providing sound filtering and amplification, sound analysis and speech recognition for analyzing speech and non-speech sounds and taking programmed actions based on the analysis, recognizing language of speech for purposes of one-way and autonomous two-way voice-to-voice translation, word spotting to detect and identify elements of communication, and non-speech recognition and identification,
e) speakers in the housings to play for a user sound produced by the processing units,
f) at least one speaker on the outside of the headset to play sound for other persons including translated phrases in a first language of speech as part of one-way and two-way voice-to-voice communication, wherein the first language of speech is the same language of speech received by a microphone that receives sounds around a user and recognized by the processing unit,
g) a light sensor for providing to the processing unit frequency and intensity of any impinging light,
h) a variably positionable headset mounted camera, the processing unit providing instructions to the camera in response to voice commands from the user, and
i) interconnections for connecting the user headset with other systems outside of the headset, wherein the other systems include target designation systems, communication networks, and radio transmitters.
2. The sound processing system of claim 1 wherein the processing unit further provides non-speech sound recognition including vehicle noise recognition.
3. A sound processing system for tactical military operations comprising:
a) a headset including housings with form factors for insulating user ears from direct exposure to ambient sound,
b) at least one microphone for receiving sounds around a user,
c) a microphone for receiving user speech,
d) a processing unit for processing sound from said microphones, the processing unit providing sound filtering and amplification, sound analysis and speech recognition for analyzing speech and non-speech sounds and taking programmed actions based on the analysis, recognizing language of speech for purposes of one-way and autonomous two-way voice-to-voice translation, word spotting to detect and identify elements of communication, and non-speech recognition and identification,
e) speakers in the housings to play for a user sound produced by the processing unit,
f) at least one speaker on the outside of the headset to play sound for other persons including translated phrases in a first language of speech as part of one-way and two-way voice-to-voice communication, wherein the first language of speech is the same language of speech received by a microphone that receives sounds around a user and recognized by the processing unit,
g) a light sensor for providing to the processing unit frequency and intensity of any impinging light, and
h) a system to correlate information received by the light sensor with information received by the microphones.
4. The sound processing system of claim 3 and further including:
i) a system to calculate the distance between the user and a source of the sound and light signals received by the system by correlating the sound and light information.
5. The sound processing system of claim 3 and further including:
i) a system to calculate the direction of a source of the sound and light signals received by the system with respect to the user by correlating the sound and light information.
US11/251,350 2005-10-13 2005-10-13 Autonomous integrated headset and sound processing system for tactical applications Active 2028-08-23 US7707035B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/251,350 US7707035B2 (en) 2005-10-13 2005-10-13 Autonomous integrated headset and sound processing system for tactical applications

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US11/251,350 US7707035B2 (en) 2005-10-13 2005-10-13 Autonomous integrated headset and sound processing system for tactical applications

Publications (2)

Publication Number Publication Date
US20070100637A1 US20070100637A1 (en) 2007-05-03
US7707035B2 true US7707035B2 (en) 2010-04-27

Family

ID=37997644

Family Applications (1)

Application Number Title Priority Date Filing Date
US11/251,350 Active 2028-08-23 US7707035B2 (en) 2005-10-13 2005-10-13 Autonomous integrated headset and sound processing system for tactical applications

Country Status (1)

Country Link
US (1) US7707035B2 (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090144061A1 (en) * 2007-11-29 2009-06-04 Plantronics, Inc. Systems and methods for generating verbal feedback messages in head-worn electronic devices
US20090262205A1 (en) * 2008-04-21 2009-10-22 Dana Stephen Smith Voice activated headset imaging system
US20090281809A1 (en) * 2008-05-09 2009-11-12 Plantronics, Inc. Headset Wearer Identity Authentication With Voice Print Or Speech Recognition
US20100235161A1 (en) * 2009-03-11 2010-09-16 Samsung Electronics Co., Ltd. Simultaneous interpretation system
US20110161076A1 (en) * 2009-12-31 2011-06-30 Davis Bruce L Intuitive Computing Methods and Systems
US20110238405A1 (en) * 2007-09-28 2011-09-29 Joel Pedre A translation method and a device, and a headset forming part of said device
US20120004913A1 (en) * 2010-07-01 2012-01-05 Samsung Electronics Co., Ltd. Method and apparatus for controlling operation of portable terminal using microphone
US8099286B1 (en) * 2008-05-12 2012-01-17 Rockwell Collins, Inc. System and method for providing situational awareness enhancement for low bit rate vocoders
US20150039288A1 (en) * 2010-09-21 2015-02-05 Joel Pedre Integrated oral translator with incorporated speaker recognition
US20190251176A1 (en) * 2018-02-09 2019-08-15 Google Llc Concurrent Reception Of Multiple User Speech Input For Translation
US11049094B2 (en) 2014-02-11 2021-06-29 Digimarc Corporation Methods and arrangements for device to device communication
US11086593B2 (en) * 2016-08-26 2021-08-10 Bragi GmbH Voice assistant for wireless earpieces
US11153472B2 (en) 2005-10-17 2021-10-19 Cutting Edge Vision, LLC Automatic upload of pictures from a camera

Families Citing this family (71)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EA011361B1 (en) * 2004-09-07 2009-02-27 Сенсир Пти Лтд. Apparatus and method for sound enhancement
EP2033489B1 (en) 2006-06-14 2015-10-28 Personics Holdings, LLC. Earguard monitoring system
US11683643B2 (en) 2007-05-04 2023-06-20 Staton Techiya Llc Method and device for in ear canal echo suppression
US11856375B2 (en) 2007-05-04 2023-12-26 Staton Techiya Llc Method and device for in-ear echo suppression
US8239185B2 (en) * 2008-01-17 2012-08-07 Geacom, Inc. Method and system for situational language translation
US8600761B2 (en) * 2008-09-09 2013-12-03 The Boeing Company Hands-free and non-visually occluding object information interaction system
US8600067B2 (en) 2008-09-19 2013-12-03 Personics Holdings Inc. Acoustic sealing analysis system
US9952664B2 (en) 2014-01-21 2018-04-24 Osterhout Group, Inc. Eye imaging in head worn computing
US20150205111A1 (en) 2014-01-21 2015-07-23 Osterhout Group, Inc. Optical configurations for head worn computing
US9715112B2 (en) 2014-01-21 2017-07-25 Osterhout Group, Inc. Suppression of stray light in head worn computing
US9229233B2 (en) 2014-02-11 2016-01-05 Osterhout Group, Inc. Micro Doppler presentations in head worn computing
US9400390B2 (en) 2014-01-24 2016-07-26 Osterhout Group, Inc. Peripheral lighting for head worn computing
US9298007B2 (en) 2014-01-21 2016-03-29 Osterhout Group, Inc. Eye imaging in head worn computing
US9965681B2 (en) 2008-12-16 2018-05-08 Osterhout Group, Inc. Eye imaging in head worn computing
US20100250231A1 (en) * 2009-03-07 2010-09-30 Voice Muffler Corporation Mouthpiece with sound reducer to enhance language translation
US8473277B2 (en) * 2010-08-05 2013-06-25 David Lynton Jephcott Translation station
US9779088B2 (en) 2010-08-05 2017-10-03 David Lynton Jephcott Translation station
US20130238311A1 (en) * 2013-04-21 2013-09-12 Sierra JY Lou Method and Implementation of Providing a Communication User Terminal with Adapting Language Translation
US9626963B2 (en) * 2013-04-30 2017-04-18 Paypal, Inc. System and method of improving speech recognition using context
US10043534B2 (en) 2013-12-23 2018-08-07 Staton Techiya, Llc Method and device for spectral expansion for an audio signal
US10649220B2 (en) 2014-06-09 2020-05-12 Mentor Acquisition One, Llc Content presentation in head worn computing
US11227294B2 (en) 2014-04-03 2022-01-18 Mentor Acquisition One, Llc Sight information collection in head worn computing
US9299194B2 (en) 2014-02-14 2016-03-29 Osterhout Group, Inc. Secure sharing in head worn computing
US9594246B2 (en) 2014-01-21 2017-03-14 Osterhout Group, Inc. See-through computer display systems
US9746686B2 (en) 2014-05-19 2017-08-29 Osterhout Group, Inc. Content position calibration in head worn computing
US9448409B2 (en) 2014-11-26 2016-09-20 Osterhout Group, Inc. See-through computer display systems
US20160019715A1 (en) 2014-07-15 2016-01-21 Osterhout Group, Inc. Content presentation in head worn computing
US9810906B2 (en) 2014-06-17 2017-11-07 Osterhout Group, Inc. External user interface for head worn computing
US9529195B2 (en) 2014-01-21 2016-12-27 Osterhout Group, Inc. See-through computer display systems
US11103122B2 (en) 2014-07-15 2021-08-31 Mentor Acquisition One, Llc Content presentation in head worn computing
US9841599B2 (en) 2014-06-05 2017-12-12 Osterhout Group, Inc. Optical configurations for head-worn see-through displays
US10191279B2 (en) 2014-03-17 2019-01-29 Osterhout Group, Inc. Eye imaging in head worn computing
US9671613B2 (en) 2014-09-26 2017-06-06 Osterhout Group, Inc. See-through computer display systems
US9829707B2 (en) 2014-08-12 2017-11-28 Osterhout Group, Inc. Measuring content brightness in head worn computing
US9939934B2 (en) 2014-01-17 2018-04-10 Osterhout Group, Inc. External user interface for head worn computing
US20150277118A1 (en) 2014-03-28 2015-10-01 Osterhout Group, Inc. Sensor dependent content position in head worn computing
US9575321B2 (en) 2014-06-09 2017-02-21 Osterhout Group, Inc. Content presentation in head worn computing
US10254856B2 (en) 2014-01-17 2019-04-09 Osterhout Group, Inc. External user interface for head worn computing
US10684687B2 (en) 2014-12-03 2020-06-16 Mentor Acquisition One, Llc See-through computer display systems
US11892644B2 (en) 2014-01-21 2024-02-06 Mentor Acquisition One, Llc See-through computer display systems
US9836122B2 (en) 2014-01-21 2017-12-05 Osterhout Group, Inc. Eye glint imaging in see-through computer display systems
US11669163B2 (en) 2014-01-21 2023-06-06 Mentor Acquisition One, Llc Eye glint imaging in see-through computer display systems
US9523856B2 (en) 2014-01-21 2016-12-20 Osterhout Group, Inc. See-through computer display systems
US11737666B2 (en) 2014-01-21 2023-08-29 Mentor Acquisition One, Llc Eye imaging in head worn computing
US20150205135A1 (en) 2014-01-21 2015-07-23 Osterhout Group, Inc. See-through computer display systems
US9811159B2 (en) 2014-01-21 2017-11-07 Osterhout Group, Inc. Eye imaging in head worn computing
US9766463B2 (en) 2014-01-21 2017-09-19 Osterhout Group, Inc. See-through computer display systems
US9753288B2 (en) 2014-01-21 2017-09-05 Osterhout Group, Inc. See-through computer display systems
US9494800B2 (en) 2014-01-21 2016-11-15 Osterhout Group, Inc. See-through computer display systems
US11487110B2 (en) 2014-01-21 2022-11-01 Mentor Acquisition One, Llc Eye imaging in head worn computing
US9538915B2 (en) 2014-01-21 2017-01-10 Osterhout Group, Inc. Eye imaging in head worn computing
US9651784B2 (en) 2014-01-21 2017-05-16 Osterhout Group, Inc. See-through computer display systems
US9846308B2 (en) 2014-01-24 2017-12-19 Osterhout Group, Inc. Haptic systems for head-worn computers
US20150241963A1 (en) 2014-02-11 2015-08-27 Osterhout Group, Inc. Eye imaging in head worn computing
US9401540B2 (en) 2014-02-11 2016-07-26 Osterhout Group, Inc. Spatial location presentation in head worn computing
US20160187651A1 (en) 2014-03-28 2016-06-30 Osterhout Group, Inc. Safety for a vehicle operator with an hmd
US9672210B2 (en) * 2014-04-25 2017-06-06 Osterhout Group, Inc. Language translation with head-worn computing
US9651787B2 (en) 2014-04-25 2017-05-16 Osterhout Group, Inc. Speaker assembly for headworn computer
US10853589B2 (en) 2014-04-25 2020-12-01 Mentor Acquisition One, Llc Language translation with head-worn computing
US9423842B2 (en) 2014-09-18 2016-08-23 Osterhout Group, Inc. Thermal management for head-worn computer
US10663740B2 (en) 2014-06-09 2020-05-26 Mentor Acquisition One, Llc Content presentation in head worn computing
US10163453B2 (en) 2014-10-24 2018-12-25 Staton Techiya, Llc Robust voice activity detector system for use with an earphone
US9684172B2 (en) 2014-12-03 2017-06-20 Osterhout Group, Inc. Head worn computer display systems
USD743963S1 (en) 2014-12-22 2015-11-24 Osterhout Group, Inc. Air mouse
USD751552S1 (en) 2014-12-31 2016-03-15 Osterhout Group, Inc. Computer glasses
USD753114S1 (en) 2015-01-05 2016-04-05 Osterhout Group, Inc. Air mouse
US20160239985A1 (en) 2015-02-17 2016-08-18 Osterhout Group, Inc. See-through computer display systems
US10616693B2 (en) 2016-01-22 2020-04-07 Staton Techiya Llc System and method for efficiency among devices
US10690936B2 (en) 2016-08-29 2020-06-23 Mentor Acquisition One, Llc Adjustable nose bridge assembly for headworn computer
USD864959S1 (en) 2017-01-04 2019-10-29 Mentor Acquisition One, Llc Computer glasses
US10951994B2 (en) 2018-04-04 2021-03-16 Staton Techiya, Llc Method to acquire preferred dynamic range function for speech enhancement

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5818381A (en) * 1994-06-24 1998-10-06 Roscoe C. Williams Limited Electronic viewing aid
US5995936A (en) * 1997-02-04 1999-11-30 Brais; Louis Report generation system and method for capturing prose, audio, and video by voice command and automatically linking sound and image to formatted text locations
US6457024B1 (en) * 1991-07-18 2002-09-24 Lee Felsentein Wearable hypermedium system
US20030115059A1 (en) * 2001-12-17 2003-06-19 Neville Jayaratne Real time translator and method of performing real time translation of a plurality of spoken languages
US20050071166A1 (en) * 2003-09-29 2005-03-31 International Business Machines Corporation Apparatus for the collection of data for performing automatic speech recognition
US20060153394A1 (en) * 2005-01-10 2006-07-13 Nigel Beasley Headset audio bypass apparatus and method
US20060167687A1 (en) * 2005-01-21 2006-07-27 Lawrence Kates Management and assistance system for the deaf
US20080004872A1 (en) * 2004-09-07 2008-01-03 Sensear Pty Ltd, An Australian Company Apparatus and Method for Sound Enhancement

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6457024B1 (en) * 1991-07-18 2002-09-24 Lee Felsentein Wearable hypermedium system
US5818381A (en) * 1994-06-24 1998-10-06 Roscoe C. Williams Limited Electronic viewing aid
US5995936A (en) * 1997-02-04 1999-11-30 Brais; Louis Report generation system and method for capturing prose, audio, and video by voice command and automatically linking sound and image to formatted text locations
US20030115059A1 (en) * 2001-12-17 2003-06-19 Neville Jayaratne Real time translator and method of performing real time translation of a plurality of spoken languages
US20050071166A1 (en) * 2003-09-29 2005-03-31 International Business Machines Corporation Apparatus for the collection of data for performing automatic speech recognition
US20080004872A1 (en) * 2004-09-07 2008-01-03 Sensear Pty Ltd, An Australian Company Apparatus and Method for Sound Enhancement
US20060153394A1 (en) * 2005-01-10 2006-07-13 Nigel Beasley Headset audio bypass apparatus and method
US20060167687A1 (en) * 2005-01-21 2006-07-27 Lawrence Kates Management and assistance system for the deaf

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11818458B2 (en) 2005-10-17 2023-11-14 Cutting Edge Vision, LLC Camera touchpad
US11153472B2 (en) 2005-10-17 2021-10-19 Cutting Edge Vision, LLC Automatic upload of pictures from a camera
US20110238405A1 (en) * 2007-09-28 2011-09-29 Joel Pedre A translation method and a device, and a headset forming part of said device
US8311798B2 (en) * 2007-09-28 2012-11-13 Joel Pedre Translation method and a device, and a headset forming part of said device
US20090144061A1 (en) * 2007-11-29 2009-06-04 Plantronics, Inc. Systems and methods for generating verbal feedback messages in head-worn electronic devices
US20090262205A1 (en) * 2008-04-21 2009-10-22 Dana Stephen Smith Voice activated headset imaging system
US8315876B2 (en) * 2008-05-09 2012-11-20 Plantronics, Inc. Headset wearer identity authentication with voice print or speech recognition
US20090281809A1 (en) * 2008-05-09 2009-11-12 Plantronics, Inc. Headset Wearer Identity Authentication With Voice Print Or Speech Recognition
US8099286B1 (en) * 2008-05-12 2012-01-17 Rockwell Collins, Inc. System and method for providing situational awareness enhancement for low bit rate vocoders
US8527258B2 (en) * 2009-03-11 2013-09-03 Samsung Electronics Co., Ltd. Simultaneous interpretation system
US20100235161A1 (en) * 2009-03-11 2010-09-16 Samsung Electronics Co., Ltd. Simultaneous interpretation system
US10785365B2 (en) 2009-10-28 2020-09-22 Digimarc Corporation Intuitive computing methods and systems
US11715473B2 (en) 2009-10-28 2023-08-01 Digimarc Corporation Intuitive computing methods and systems
US20110161076A1 (en) * 2009-12-31 2011-06-30 Davis Bruce L Intuitive Computing Methods and Systems
US9197736B2 (en) * 2009-12-31 2015-11-24 Digimarc Corporation Intuitive computing methods and systems
US20120004913A1 (en) * 2010-07-01 2012-01-05 Samsung Electronics Co., Ltd. Method and apparatus for controlling operation of portable terminal using microphone
US20150039288A1 (en) * 2010-09-21 2015-02-05 Joel Pedre Integrated oral translator with incorporated speaker recognition
US11049094B2 (en) 2014-02-11 2021-06-29 Digimarc Corporation Methods and arrangements for device to device communication
US11573763B2 (en) 2016-08-26 2023-02-07 Bragi GmbH Voice assistant for wireless earpieces
US11086593B2 (en) * 2016-08-26 2021-08-10 Bragi GmbH Voice assistant for wireless earpieces
US11861266B2 (en) 2016-08-26 2024-01-02 Bragi GmbH Voice assistant for wireless earpieces
US11138390B2 (en) * 2018-02-09 2021-10-05 Google Llc Concurrent reception of multiple user speech input for translation
US20190251176A1 (en) * 2018-02-09 2019-08-15 Google Llc Concurrent Reception Of Multiple User Speech Input For Translation

Also Published As

Publication number Publication date
US20070100637A1 (en) 2007-05-03

Similar Documents

Publication Publication Date Title
US7707035B2 (en) Autonomous integrated headset and sound processing system for tactical applications
CN110503969B (en) Audio data processing method and device and storage medium
EP2538404B1 (en) Voice data transferring device, terminal device, voice data transferring method, and voice recognition system
EP1503368B1 (en) Head mounted multi-sensory audio input system
JP2018049143A (en) Voice acquisition system and voice acquisition method
EP3002753A1 (en) Speech enhancement method and apparatus for same
WO2017191249A1 (en) Speech enhancement and audio event detection for an environment with non-stationary noise
US20050033571A1 (en) Head mounted multi-sensory audio input system
JP2007500367A (en) Voice recognition method and communication device
WO2007138503A1 (en) Method of driving a speech recognition system
WO2020230926A1 (en) Voice synthesis apparatus for evaluating quality of synthesized voice by using artificial intelligence, and operating method therefor
WO2020210050A1 (en) Automated control of noise reduction or noise masking
WO2020226213A1 (en) Artificial intelligence device for providing voice recognition function and method for operating artificial intelligence device
KR20210148057A (en) Method for recognizing voice and apparatus used therefor
JP2007248534A (en) Speech recognition device, frequency spectrum acquiring device and speech recognition method
JP2009178783A (en) Communication robot and its control method
CN114731389A (en) Remote operation auxiliary system
US11776557B2 (en) Automatic interpretation server and method thereof
KR102557092B1 (en) Automatic interpretation and translation and dialogue assistance system using transparent display
KR20200028767A (en) Language refinement apparatus based on emotion analysis of speaker
KR101906549B1 (en) A wearable device for hearing impairment person
KR102170684B1 (en) System for interpreting and translating using smart clothes
CN112420046A (en) Multi-person conference method, system and device suitable for hearing-impaired people to participate
JP7055327B2 (en) Conversation collection device, conversation collection system and conversation collection method
KR20200002710A (en) Processing method based on voice recognition for aircraft

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTEGRATED WAVE TECHNOLOGIES, INC.,CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MCCUNE, TIMOTHY S.;REEL/FRAME:017102/0896

Effective date: 20051012

Owner name: INTEGRATED WAVE TECHNOLOGIES, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MCCUNE, TIMOTHY S.;REEL/FRAME:017102/0896

Effective date: 20051012

STCF Information on status: patent grant

Free format text: PATENTED CASE

FPAY Fee payment

Year of fee payment: 4

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2552)

Year of fee payment: 8

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2553); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY

Year of fee payment: 12