WO2011127457A1 - System and method of smart audio logging for mobile devices - Google Patents
System and method of smart audio logging for mobile devices Download PDFInfo
- Publication number
- WO2011127457A1 WO2011127457A1 PCT/US2011/031859 US2011031859W WO2011127457A1 WO 2011127457 A1 WO2011127457 A1 WO 2011127457A1 US 2011031859 W US2011031859 W US 2011031859W WO 2011127457 A1 WO2011127457 A1 WO 2011127457A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- processing
- information
- digital audio
- audio signal
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
- H04M1/72433—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
- G11B2020/10537—Audio or video recording
- G11B2020/10546—Audio or video recording specifically adapted for audio data
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/60—Substation equipment, e.g. for use by subscribers including speech amplifiers
- H04M1/6008—Substation equipment, e.g. for use by subscribers including speech amplifiers in the transmitter circuit
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/64—Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
- H04M1/65—Recording arrangements for recording a message from the calling party
- H04M1/656—Recording arrangements for recording a message from the calling party for recording conversations
Definitions
- Audio logging enables recording of all or some portions of an audio signal of interest which are typically picked up by one or more microphones in one or more mobile devices. Audio logging is sometimes referred to as audio recording or audio memo interchangeably.
- This document also describes a method of processing a digital audio signal for a mobile device.
- This method includes receiving acoustic signal by at least one microphone; transforming the received acoustic signal into an electrical signal; sampling the electrical signal based on a sampling frequency and a data width for each sampled data to obtain the digital audio signal; storing the digital audio signal into a buffer; extracting at least one auditory context information from the digital audio signal; in response to automatically detecting a start event indicator, performing an audio logging for the digital audio signal; and in response to automatically detecting an end event indicator, ending the audio logging.
- This detecting the start or end event indicators may be based at least in part on non-auditory information such as scheduling information or calendaring information.
- This document also describes an apparatus, a combination of means, and a computer-readable medium relating to this method.
- FIG. 6 is a diagram illustrating examples of context information S600.
- FIG. 9B is another embodiment of the generation mechanism of a single-level start event indicator.
- FIG. 15 is a flowchart of an embodiment of the Audio Logging Processor 230 during passive audio monitoring state SI .
- FIG. 23 is a flowchart of an embodiment of the Audio Logging Processor 230 during active audio logging state S5.
- FIG. 29 is a diagram of a second embodiment of multiple microphones ON and OFF control.
- FIG. 38 is a diagram of an embodiment of compression coding format selection in which the compression coding format selection or lack thereof may be dynamically configured according to context information S600.
- any disclosure of an operation of an apparatus having a particular feature is also expressly intended to disclose a method having an analogous feature (and vice versa), and any disclosure of an operation of an apparatus according to a particular configuration is also expressly intended to disclose a method according to an analogous configuration (and vice versa).
- the term "context” (or "audio context”) is used to indicate a component of an audio or speech and conveys information from the ambient environment of the speaker, and the term “noise” is used to indicate any other artifact in the audio or speech signal.
- the end event indicator may be configured to be based on non-occurrence of auditory event during predetermined period of time.
- the detection of the start event indicator and the end event indicator may include the steps of selecting at least one particular context information out of at least one auditory context information; comparing the selected context information with at least one pre-determined thresholds, and determining if the start or end event indicators have been detected based on the comparison.
- FIG. 4 is an example diagram of an embodiment of Input Processing Unit 250.
- the Input Processing Unit 250 processes various types of inputs and generates the Input Signal S220 which may be selectively transferred through Multiplexer (Mux) 410 to the Audio Logging Processor 230.
- the inputs may include, but not limited to, user's voice or key commands, the signal from non-acoustic sensors such as a camera, timer, GPS, proximity sensor, Gyro, ambient sensor, accelerometer, and so on.
- the inputs may be transmitted from another at least one smart audio logging systems.
- Control Signal Handler 550 handles various control signals that may be applied to peripheral units of the smart audio logging system. Two examples of the control signals, A/D Converter Control S215 and Microphone Unit Control S205, are disclosed in FIG. 5 for exemplary purposes.
- Start Event Manager 570 may be configured to handle, detect, or generate a start event indicator.
- the start event indicator is a flag or signal indicating that smart audio logging may be ready to start. It may be desirable to use the start event indicator for the Audio Logging Processor 230 to switch its internal state if its operation is based on a state machine. It should be obvious for one skilled in the art that the start event indicator is a conceptual flag or signal for the understanding of operation of the Audio Logging Processor 230.
- General Audio Signal Processor 595 is a multi-purpose module for handling all other fundamental audio and speech signal processing methods not explicitly presented in the present application but still necessary for successful implementation.
- these signal processing methods may include but not limited to time-to-frequency or frequency-to-time conversions; miscellaneous filtering; signal gain adjustment; or dynamic range control.
- each module disclosed separately in FIG. 5 is provided only for illustration purposes of the functional description of the Audio Logging Processor 230.
- some modules can be combined into a single module or some modules can be even further divided up into smaller modules in real-life implementation of the system.
- all of the modules disclosed in FIG. 5 may be integrated as a single module.
- Noise Classifier 840 may be configured to classify the characteristics of background noise of the Audio Input S270. For example, it may identify the background noise as "Stationary vs. Non-stationary,” “Street noise,” “Air plane noise,” or combination thereof. It may classify the background noise based on severity level of it such as “Severe” or “Medium.” The Noise Classifier 840 may be configured to classify the input in a single state processing or multi-stage processing.
- Combinatorial Logic 900 may be configured to generate the Start Event Indicator S910 based on certain combination mechanisms of the internal triggering signals. For example, combinatorial logic may be configured to generate the Start Event Indicator S910 according to OR operation or AND operation of the internal triggering signals from the Auditory Activity Detector 510, the Aux Signal Analyzer 530, or the Input Signal Handler 540. In another embodiment, it may be configured to generate the Start Event Indicator S910 when one or more internal triggering signals have been set or triggered.
- FIG. 11 is a diagram of a first exemplary embodiment illustrating internal states of Audio Logging Processor 230 and transition thereof for the multi-level start event indicator system.
- the default state at the start-up of the smart audio logging may be the Passive Audio Monitoring State SI during which the mobile device comprising smart audio logging feature is substantially equivalent to typical idle mode state.
- the Passive Audio Monitoring State SI it is critical to minimize the power consumption because statistically the mobile device stays in this state for most of time. Therefore, most of modules of the smart audio logging system, except a few modules required to detect the activity of the Audio Input S270, may be configured to remain in sleep state or in any other power-saving modes.
- such a few exceptional modules may include the Audio Capturing Unit 215, the Buffer 220, or the Auditory Activity Detector 510. In one embodiment, these modules may be configured to be on constantly or may be configured to wake up intermittently.
- modules of the smart audio logging system may be configured to remain in sleep state or in any other power- saving modes.
- the few exceptional modules may include the Audio Capturing Unit 215, the Buffer 220, or the Auditory Activity Detector 510. In one embodiment, these modules may be configured to be on constantly or may be configured to wake up intermittently.
- FIG. 19 is an example diagram of a context identification embodiment at the Audio Logging Processor 230 during the Active Audio Monitoring State S2. It shows that the context identification process, which is performed by the Context Identifier 560 at every T6 interval, may be configured to start asynchronously to T 4 interval. T6 interval may be determined in consideration of the size of the Buffer 220 and the tradeoff between power consumption and the accuracy of the decision. Too much frequent context identification process, or too small T6 interval, may result in increased power consumption whereas too often context identification process, or too big T 6 interval, may result in the accuracy degradation of context information S600.
- the contents of the Audio Input S270 may change over time, for example, from conversational speech to music or music plus speech and vice versa. It may be desirable to use a higher resolution of sampling frequency or data width for music content and lower resolution of sampling frequency or data width for mainly speech signal.
- the resolution may be configured to be different according to the characteristic of speech content.
- the system may be configured to use a different resolution for business communication compared to a personal conversation between friends.
- the blocks 2410, 2415, 2420 for dynamic setting of the configurations of A/D converter and dynamic selection of memory location according to the context information S600 may be re-positioned in different order in between thereof or as opposed to other blocks in the flowchart within the scope of general principle disclosed herein.
- the Audio Logging Processor 230 may be configured to determine 2440 if enhancement of the Audio Input S270 signal is desirable or in such case what types of enhancement processing may be desirable before the processed signal is stored in the selected memory. The determination may be based on the context information S600 or pre-configured automatically by the system or manually by the user.
- Such enhancement processing may include acoustic echo cancellation (AEC), receiving voice enhancement (RVE), active noise cancellation (ANC), noise suppression (NS), acoustic gain control (AGC), acoustic volume control (AVC), or acoustic dynamic range control (ADRC).
- AEC acoustic echo cancellation
- RVE receiving voice enhancement
- ANC active noise cancellation
- NS noise suppression
- AVC acoustic gain control
- AVC acoustic volume control
- ADRC acoustic dynamic range control
- the aggressiveness of signal enhancement may be based on the content of the Audio Input S270 or the context information S600.
- the Audio Logging Processor 230 may be configured to determine 2445 if compression of the Audio Input S270 signal is desirable or in such case what types of compression processing may be desirable before the processed signal is stored in the selected memory location. The determination may be based on the context information S600 or pre-configured automatically by the system or manually by the user. For example, the system may select to use compression before audio logging starts based on the expected duration of audio logging preferably based on the calendaring information. The selection of a compression method such as speech coding or audio coding may be dynamically configured based upon the content of the Audio Input S270 or the context information S600.
- FIG. 30 is a diagram of an embodiment of active microphone number control according to the present application in which active number of microphone can be dynamically controlled according to context information S600.
- the maximum number of available microphones is assumed as three and is also the maximum number of microphone that can be turned on during the Passive Audio Monitoring State SI, the Active Audio Monitoring State S2, or the Audio Monitoring State S4.
- the selection of different number of microphones may still be within the scope of the present disclosure.
- a microphone may be configured to be turned on periodically so it can monitor auditory event of environment. Therefore during these states, the active number of microphone may change preferably between zero and one.
- the number of microphones may increase with the quality of the Audio Input S270, for example according to the signal- to-ratio (SNR) of the Audio Input S270, degrades below a certain threshold.
- SNR signal- to-ratio
- the storage of audio logging may be configured to be changed dynamically between local storage and remote storage during the actual audio logging process or after the completion of audio logging.
- FIG. 31 shows an embodiment of storage location selection in which the selection may be controlled according to predefined context information S600 priority. This selection may be performed before the start of audio logging or after the completion of audio logging.
- the context information S600 may be pre-configured to have a different level of priority.
- FIG. 34 is a diagram of an embodiment of stage-by-stage power up of blocks within the smart audio logging system in which number of active blocks and total power consumption thereof may be controlled dynamically according to each state.
- one or more number of microphones may be configured to wake up periodically in order to receive the Audio Input S270.
- the system may be configured to wake up a portion of system and thereby the number of active blocks, or interchangeably the number of power-up blocks, of the system increased to Nl in FIG. 34.
- the Active Audio Monitoring State S2 one or more additional blocks may be configured to wake up in addition to Nl, which makes the total number of active blocks as N2 during the periods that one or more microphones are active 3420.
- the A/D converter setting during the Passive Audio Monitoring State SI and the Active Audio Monitoring State S2 stages may be configured to have the same resolution.
- A/D converter setting during the Active Audio Monitoring State S2 and the Active Audio Logging State S3 stage may be configured to have the same resolution.
- a user may be inside the subway station waiting for his or her train to arrive when the smart audio logging system might be in the Audio Logging State S3, S5, actively logging the Audio Input S270.
- the noise level often times exceeded a certain threshold beyond which normal conversational speech is hard to understand.
- the smart audio logging system may reconfigure audio signal enhancement settings accordingly.
- the audio signal enhancement setting change may be followed by or preceded by the active number of microphone.
- the change of compression mode may be initiated by the change of the content classification, which is subset of the context information S600, from “Music” to “Speech” or “Speech” to “Music.” It may be desirable to use a higher bitrate for "Music” content whereas it may be desirable to use a lower bitrate for "Speech” content in which the bandwidth of the signal to be encoded is typically much narrower than typical "Music” content. Alternatively, it may be initiated by the available memory size in local storage or the quality of channel between a mobile device and remote server.
- Each of the methods disclosed herein may also be tangibly embodied (for example, in one or more computer-readable media as listed above) as one or more sets of instructions readable and/or executable by a machine including an array of logic elements (e.g., a processor, microprocessor, microcontroller, or other finite state machine).
- a machine including an array of logic elements (e.g., a processor, microprocessor, microcontroller, or other finite state machine).
Abstract
Description
Claims
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ES11717375.7T ES2574680T3 (en) | 2010-04-08 | 2011-04-08 | Smart audio recording system and procedure for mobile devices |
KR1020127029257A KR101498347B1 (en) | 2010-04-08 | 2011-04-08 | System and method of smart audio logging for mobile devices |
EP18179847.1A EP3438975B1 (en) | 2010-04-08 | 2011-04-08 | System and method of smart audio logging for mobile devices |
KR1020147006752A KR101523181B1 (en) | 2010-04-08 | 2011-04-08 | System and method of smart audio logging for mobile devices |
JP2013504014A JP2013527490A (en) | 2010-04-08 | 2011-04-08 | Smart audio logging system and method for mobile devices |
EP11717375.7A EP2556652B1 (en) | 2010-04-08 | 2011-04-08 | System and method of smart audio logging for mobile devices |
EP21171952.1A EP3917123B1 (en) | 2010-04-08 | 2011-04-08 | System and method of smart audio logging for mobile devices |
CN201180025888.9A CN102907077B (en) | 2010-04-08 | 2011-04-08 | For the system and method for the intelligent audio record of mobile device |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US32217610P | 2010-04-08 | 2010-04-08 | |
US61/322,176 | 2010-04-08 | ||
US13/076,242 | 2011-03-30 | ||
US13/076,242 US9112989B2 (en) | 2010-04-08 | 2011-03-30 | System and method of smart audio logging for mobile devices |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2011127457A1 true WO2011127457A1 (en) | 2011-10-13 |
Family
ID=44227871
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2011/031859 WO2011127457A1 (en) | 2010-04-08 | 2011-04-08 | System and method of smart audio logging for mobile devices |
Country Status (12)
Country | Link |
---|---|
US (3) | US9112989B2 (en) |
EP (4) | EP3035655B1 (en) |
JP (3) | JP2013527490A (en) |
KR (2) | KR101498347B1 (en) |
CN (2) | CN105357371B (en) |
DK (1) | DK3035655T3 (en) |
ES (4) | ES2574680T3 (en) |
HU (3) | HUE038690T2 (en) |
PL (1) | PL3035655T3 (en) |
PT (1) | PT3035655T (en) |
SI (1) | SI3035655T1 (en) |
WO (1) | WO2011127457A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013040414A1 (en) * | 2011-09-16 | 2013-03-21 | Qualcomm Incorporated | Mobile device context information using speech detection |
US9838810B2 (en) | 2012-02-27 | 2017-12-05 | Qualcomm Technologies International, Ltd. | Low power audio detection |
US10580428B2 (en) | 2014-08-18 | 2020-03-03 | Sony Corporation | Audio noise estimation and filtering |
CN113485551A (en) * | 2014-05-31 | 2021-10-08 | 苹果公司 | Message user interface for capture and transmission of media and location content |
US11417334B2 (en) * | 2019-11-27 | 2022-08-16 | Realtek Semiconductor Corp. | Dynamic speech recognition method and apparatus therefor |
Families Citing this family (112)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013043393A1 (en) | 2011-09-23 | 2013-03-28 | Digimarc Corporation | Context-based smartphone sensor logic |
US9992745B2 (en) | 2011-11-01 | 2018-06-05 | Qualcomm Incorporated | Extraction and analysis of buffered audio data using multiple codec rates each greater than a low-power processor rate |
KR20220002750A (en) * | 2011-12-07 | 2022-01-06 | 퀄컴 인코포레이티드 | Low power integrated circuit to analyze a digitized audio stream |
US9922646B1 (en) | 2012-09-21 | 2018-03-20 | Amazon Technologies, Inc. | Identifying a location of a voice-input device |
CN103811013B (en) * | 2012-11-07 | 2017-05-03 | 中国移动通信集团公司 | Noise suppression method, device thereof, electronic equipment and communication processing method |
US9311640B2 (en) | 2014-02-11 | 2016-04-12 | Digimarc Corporation | Methods and arrangements for smartphone payments and transactions |
US9275625B2 (en) | 2013-03-06 | 2016-03-01 | Qualcomm Incorporated | Content based noise suppression |
US9076459B2 (en) | 2013-03-12 | 2015-07-07 | Intermec Ip, Corp. | Apparatus and method to classify sound to detect speech |
US10255930B2 (en) * | 2013-06-28 | 2019-04-09 | Harman International Industries, Incorporated | Wireless control of linked devices |
US20150031416A1 (en) | 2013-07-23 | 2015-01-29 | Motorola Mobility Llc | Method and Device For Command Phrase Validation |
CN103841244A (en) * | 2013-12-03 | 2014-06-04 | 华为技术有限公司 | Terminal and recording method thereof |
US9449602B2 (en) * | 2013-12-03 | 2016-09-20 | Google Inc. | Dual uplink pre-processing paths for machine and human listening |
JP6478006B2 (en) * | 2013-12-16 | 2019-03-06 | パナソニックIpマネジメント株式会社 | Wireless communication apparatus, wireless communication system, and data processing method |
US9646607B2 (en) * | 2014-03-10 | 2017-05-09 | Dell Products, L.P. | Managing wake-on-voice buffer quality based on system boot profiling |
US9508359B2 (en) * | 2014-06-19 | 2016-11-29 | Yang Gao | Acoustic echo preprocessing for speech enhancement |
CN105637895B (en) * | 2014-07-10 | 2019-03-26 | 奥林巴斯株式会社 | The control method of recording device and recording device |
US9307317B2 (en) | 2014-08-29 | 2016-04-05 | Coban Technologies, Inc. | Wireless programmable microphone apparatus and system for integrated surveillance system devices |
US9225527B1 (en) | 2014-08-29 | 2015-12-29 | Coban Technologies, Inc. | Hidden plug-in storage drive for data integrity |
FI126923B (en) * | 2014-09-26 | 2017-08-15 | Genelec Oy | Method and apparatus for detecting a digital audio signal |
US20160125891A1 (en) * | 2014-10-31 | 2016-05-05 | Intel Corporation | Environment-based complexity reduction for audio processing |
US20160140978A1 (en) * | 2014-11-18 | 2016-05-19 | Qualcomm Incorporated | Customizable Local Media Mixing And Stream Selection In Group Communications |
US10271126B2 (en) * | 2015-01-26 | 2019-04-23 | Shenzhen Grandsun Electronic Co., Ltd. | Earphone noise reduction method and apparatus |
AU2016228113B2 (en) * | 2015-03-03 | 2017-09-28 | Openlive Australia Limited | A system, content editing server, audio recording slave device and content editing interface for distributed live performance scheduled audio recording, cloud-based audio content editing and online content distribution of audio track and associated metadata |
US9916836B2 (en) * | 2015-03-23 | 2018-03-13 | Microsoft Technology Licensing, Llc | Replacing an encoded audio output signal |
US10715468B2 (en) * | 2015-03-27 | 2020-07-14 | Intel Corporation | Facilitating tracking of targets and generating and communicating of messages at computing devices |
US20170069309A1 (en) * | 2015-09-03 | 2017-03-09 | Google Inc. | Enhanced speech endpointing |
US10186276B2 (en) | 2015-09-25 | 2019-01-22 | Qualcomm Incorporated | Adaptive noise suppression for super wideband music |
WO2017069310A1 (en) * | 2015-10-23 | 2017-04-27 | 삼성전자 주식회사 | Electronic device and control method therefor |
US10678828B2 (en) | 2016-01-03 | 2020-06-09 | Gracenote, Inc. | Model-based media classification service using sensed media noise characteristics |
US10165171B2 (en) | 2016-01-22 | 2018-12-25 | Coban Technologies, Inc. | Systems, apparatuses, and methods for controlling audiovisual apparatuses |
WO2017142112A1 (en) * | 2016-02-19 | 2017-08-24 | 주식회사 트리니티랩 | Audible frequency band audio signal reception method for low power |
US9947316B2 (en) | 2016-02-22 | 2018-04-17 | Sonos, Inc. | Voice control of a media playback system |
US9811314B2 (en) | 2016-02-22 | 2017-11-07 | Sonos, Inc. | Metadata exchange involving a networked playback system and a networked microphone system |
US10095470B2 (en) | 2016-02-22 | 2018-10-09 | Sonos, Inc. | Audio response playback |
US10743101B2 (en) | 2016-02-22 | 2020-08-11 | Sonos, Inc. | Content mixing |
US10264030B2 (en) | 2016-02-22 | 2019-04-16 | Sonos, Inc. | Networked microphone device control |
US9965247B2 (en) | 2016-02-22 | 2018-05-08 | Sonos, Inc. | Voice controlled media playback system based on user profile |
CN105788611A (en) * | 2016-02-25 | 2016-07-20 | 成都普创通信技术股份有限公司 | Audio quality online monitoring system |
US10789840B2 (en) | 2016-05-09 | 2020-09-29 | Coban Technologies, Inc. | Systems, apparatuses and methods for detecting driving behavior and triggering actions based on detected driving behavior |
US10370102B2 (en) | 2016-05-09 | 2019-08-06 | Coban Technologies, Inc. | Systems, apparatuses and methods for unmanned aerial vehicle |
US10152858B2 (en) | 2016-05-09 | 2018-12-11 | Coban Technologies, Inc. | Systems, apparatuses and methods for triggering actions based on data capture and characterization |
US9978390B2 (en) | 2016-06-09 | 2018-05-22 | Sonos, Inc. | Dynamic player selection for audio signal processing |
US20170372697A1 (en) * | 2016-06-22 | 2017-12-28 | Elwha Llc | Systems and methods for rule-based user control of audio rendering |
US10152969B2 (en) | 2016-07-15 | 2018-12-11 | Sonos, Inc. | Voice detection by multiple devices |
US10134399B2 (en) | 2016-07-15 | 2018-11-20 | Sonos, Inc. | Contextualization of voice inputs |
US10115400B2 (en) | 2016-08-05 | 2018-10-30 | Sonos, Inc. | Multiple voice services |
US9942678B1 (en) | 2016-09-27 | 2018-04-10 | Sonos, Inc. | Audio playback settings for voice interaction |
US10176809B1 (en) * | 2016-09-29 | 2019-01-08 | Amazon Technologies, Inc. | Customized compression and decompression of audio data |
US9743204B1 (en) | 2016-09-30 | 2017-08-22 | Sonos, Inc. | Multi-orientation playback device microphones |
US10181323B2 (en) | 2016-10-19 | 2019-01-15 | Sonos, Inc. | Arbitration-based voice recognition |
US10248613B2 (en) * | 2017-01-10 | 2019-04-02 | Qualcomm Incorporated | Data bus activation in an electronic device |
KR102580418B1 (en) * | 2017-02-07 | 2023-09-20 | 삼성에스디에스 주식회사 | Acoustic echo cancelling apparatus and method |
US11183181B2 (en) | 2017-03-27 | 2021-11-23 | Sonos, Inc. | Systems and methods of multiple voice services |
CN107343105B (en) * | 2017-07-21 | 2020-09-22 | 维沃移动通信有限公司 | Audio data processing method and mobile terminal |
US10475449B2 (en) | 2017-08-07 | 2019-11-12 | Sonos, Inc. | Wake-word detection suppression |
EP3664291A4 (en) | 2017-08-18 | 2020-08-19 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. | Audio signal adjustment method and device, storage medium, and terminal |
US10048930B1 (en) | 2017-09-08 | 2018-08-14 | Sonos, Inc. | Dynamic computation of system response volume |
US10446165B2 (en) | 2017-09-27 | 2019-10-15 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
US10482868B2 (en) | 2017-09-28 | 2019-11-19 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
US10621981B2 (en) | 2017-09-28 | 2020-04-14 | Sonos, Inc. | Tone interference cancellation |
US10051366B1 (en) | 2017-09-28 | 2018-08-14 | Sonos, Inc. | Three-dimensional beam forming with a microphone array |
US10466962B2 (en) | 2017-09-29 | 2019-11-05 | Sonos, Inc. | Media playback system with voice assistance |
US10614831B2 (en) * | 2017-10-12 | 2020-04-07 | Qualcomm Incorporated | Audio activity tracking and summaries |
US10880650B2 (en) | 2017-12-10 | 2020-12-29 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
US10818290B2 (en) | 2017-12-11 | 2020-10-27 | Sonos, Inc. | Home graph |
JP2019110447A (en) * | 2017-12-19 | 2019-07-04 | オンキヨー株式会社 | Electronic device, control method of electronic device, and control program of electronic device |
US11343614B2 (en) | 2018-01-31 | 2022-05-24 | Sonos, Inc. | Device designation of playback and network microphone device arrangements |
US11175880B2 (en) | 2018-05-10 | 2021-11-16 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
US10847178B2 (en) | 2018-05-18 | 2020-11-24 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection |
US10959029B2 (en) | 2018-05-25 | 2021-03-23 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
US10681460B2 (en) | 2018-06-28 | 2020-06-09 | Sonos, Inc. | Systems and methods for associating playback devices with voice assistant services |
US11100918B2 (en) | 2018-08-27 | 2021-08-24 | American Family Mutual Insurance Company, S.I. | Event sensing system |
US10461710B1 (en) | 2018-08-28 | 2019-10-29 | Sonos, Inc. | Media playback system with maximum volume setting |
US11076035B2 (en) | 2018-08-28 | 2021-07-27 | Sonos, Inc. | Do not disturb feature for audio notifications |
US10878811B2 (en) | 2018-09-14 | 2020-12-29 | Sonos, Inc. | Networked devices, systems, and methods for intelligently deactivating wake-word engines |
US10587430B1 (en) | 2018-09-14 | 2020-03-10 | Sonos, Inc. | Networked devices, systems, and methods for associating playback devices based on sound codes |
US11024331B2 (en) | 2018-09-21 | 2021-06-01 | Sonos, Inc. | Voice detection optimization using sound metadata |
US10811015B2 (en) | 2018-09-25 | 2020-10-20 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
US11100923B2 (en) | 2018-09-28 | 2021-08-24 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
US10692518B2 (en) | 2018-09-29 | 2020-06-23 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
EP3641286B1 (en) * | 2018-10-15 | 2021-01-13 | i2x GmbH | Call recording system for automatically storing a call candidate and call recording method |
US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
EP3654249A1 (en) | 2018-11-15 | 2020-05-20 | Snips | Dilated convolutions and gating for efficient keyword spotting |
US11183183B2 (en) | 2018-12-07 | 2021-11-23 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
US11132989B2 (en) | 2018-12-13 | 2021-09-28 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
US10602268B1 (en) | 2018-12-20 | 2020-03-24 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
CN111383663A (en) * | 2018-12-29 | 2020-07-07 | 北京嘀嘀无限科技发展有限公司 | Recording control method, device, user terminal and storage medium |
US10867604B2 (en) | 2019-02-08 | 2020-12-15 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
US11315556B2 (en) | 2019-02-08 | 2022-04-26 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing by transmitting sound data associated with a wake word to an appropriate device for identification |
US11120794B2 (en) | 2019-05-03 | 2021-09-14 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
US11241616B1 (en) * | 2019-05-17 | 2022-02-08 | Amazon Technologies, Inc. | Techniques for conserving power on a device |
US11200894B2 (en) | 2019-06-12 | 2021-12-14 | Sonos, Inc. | Network microphone device with command keyword eventing |
US10586540B1 (en) | 2019-06-12 | 2020-03-10 | Sonos, Inc. | Network microphone device with command keyword conditioning |
US11361756B2 (en) | 2019-06-12 | 2022-06-14 | Sonos, Inc. | Conditional wake word eventing based on environment |
CN110246501B (en) * | 2019-07-02 | 2022-02-01 | 思必驰科技股份有限公司 | Voice recognition method and system for conference recording |
US11138969B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US11138975B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US10871943B1 (en) | 2019-07-31 | 2020-12-22 | Sonos, Inc. | Noise classification for event detection |
KR20210042520A (en) * | 2019-10-10 | 2021-04-20 | 삼성전자주식회사 | An electronic apparatus and Method for controlling the electronic apparatus thereof |
US11189286B2 (en) | 2019-10-22 | 2021-11-30 | Sonos, Inc. | VAS toggle based on device orientation |
US11200900B2 (en) | 2019-12-20 | 2021-12-14 | Sonos, Inc. | Offline voice control |
US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
US11556307B2 (en) | 2020-01-31 | 2023-01-17 | Sonos, Inc. | Local voice data processing |
US11308958B2 (en) | 2020-02-07 | 2022-04-19 | Sonos, Inc. | Localized wakeword verification |
US11727919B2 (en) | 2020-05-20 | 2023-08-15 | Sonos, Inc. | Memory allocation for keyword spotting engines |
US11308962B2 (en) | 2020-05-20 | 2022-04-19 | Sonos, Inc. | Input detection windowing |
US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
US11698771B2 (en) | 2020-08-25 | 2023-07-11 | Sonos, Inc. | Vocal guidance engines for playback devices |
RU2766273C1 (en) * | 2020-09-24 | 2022-02-10 | Акционерное общество "Лаборатория Касперского" | System and method of detecting an unwanted call |
CN112508388B (en) * | 2020-12-02 | 2022-08-19 | 唐旸 | Method and system for inputting product quality detection data, server side and storage medium |
US11551700B2 (en) | 2021-01-25 | 2023-01-10 | Sonos, Inc. | Systems and methods for power-efficient keyword detection |
US11581007B2 (en) | 2021-04-27 | 2023-02-14 | Kyndryl, Inc. | Preventing audio delay-induced miscommunication in audio/video conferences |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004057892A1 (en) * | 2002-12-20 | 2004-07-08 | Nokia Corporation | Method and device for organizing user provided information with meta-information |
US20060149547A1 (en) * | 2005-01-06 | 2006-07-06 | Fuji Photo Film Co., Ltd. | Recording apparatus and voice recorder program |
US20070033030A1 (en) * | 2005-07-19 | 2007-02-08 | Oded Gottesman | Techniques for measurement, adaptation, and setup of an audio communication system |
US20080201142A1 (en) * | 2007-02-15 | 2008-08-21 | Motorola, Inc. | Method and apparatus for automication creation of an interactive log based on real-time content |
US20090177476A1 (en) * | 2007-12-21 | 2009-07-09 | May Darrell | Method, system and mobile device for registering voice data with calendar events |
US20090190769A1 (en) * | 2008-01-29 | 2009-07-30 | Qualcomm Incorporated | Sound quality by intelligently selecting between signals from a plurality of microphones |
US20100081487A1 (en) * | 2008-09-30 | 2010-04-01 | Apple Inc. | Multiple microphone switching and configuration |
Family Cites Families (59)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4704696A (en) * | 1984-01-26 | 1987-11-03 | Texas Instruments Incorporated | Method and apparatus for voice control of a computer |
US4780906A (en) * | 1984-02-17 | 1988-10-25 | Texas Instruments Incorporated | Speaker-independent word recognition method and system based upon zero-crossing rate and energy measurement of analog speech signal |
JPS63260345A (en) | 1987-04-17 | 1988-10-27 | Matsushita Electric Ind Co Ltd | Automatic voice recorder |
JPH04108246A (en) * | 1990-08-29 | 1992-04-09 | Oki Electric Ind Co Ltd | Hand-free telephone set |
JP3167385B2 (en) * | 1991-10-28 | 2001-05-21 | 日本電信電話株式会社 | Audio signal transmission method |
US5749072A (en) * | 1994-06-03 | 1998-05-05 | Motorola Inc. | Communications device responsive to spoken commands and methods of using same |
US5614914A (en) * | 1994-09-06 | 1997-03-25 | Interdigital Technology Corporation | Wireless telephone distribution system with time and space diversity transmission for determining receiver location |
JP3133632B2 (en) | 1994-12-29 | 2001-02-13 | 三洋電機株式会社 | Long time recording device |
JP3513320B2 (en) | 1996-04-15 | 2004-03-31 | シャープ株式会社 | Answering machine |
JPH10161698A (en) | 1996-11-28 | 1998-06-19 | Saitama Nippon Denki Kk | Answer phone and voice recording method |
JPH11187156A (en) | 1997-12-18 | 1999-07-09 | Brother Ind Ltd | Communication equipment |
US6549587B1 (en) * | 1999-09-20 | 2003-04-15 | Broadcom Corporation | Voice and data exchange over a packet based network with timing recovery |
JP2001022386A (en) | 1999-07-06 | 2001-01-26 | Sanyo Electric Co Ltd | Sound recording/reproducing device and automatic answering telephone |
US6959274B1 (en) * | 1999-09-22 | 2005-10-25 | Mindspeed Technologies, Inc. | Fixed rate speech compression system and method |
JP3429237B2 (en) | 1999-11-29 | 2003-07-22 | 船井電機株式会社 | Communication terminal device |
JP2002057749A (en) * | 2000-08-09 | 2002-02-22 | Denso Corp | Portable communication equipment |
US7231531B2 (en) * | 2001-03-16 | 2007-06-12 | Dualcor Technologies, Inc. | Personal electronics device with a dual core processor |
JP2002324290A (en) * | 2001-04-25 | 2002-11-08 | Yazaki Corp | Emergency information system |
JP2003198716A (en) | 2001-12-26 | 2003-07-11 | Hitachi Kokusai Electric Inc | Mobile phone |
US7224981B2 (en) | 2002-06-20 | 2007-05-29 | Intel Corporation | Speech recognition of mobile devices |
US7392183B2 (en) | 2002-12-27 | 2008-06-24 | Intel Corporation | Schedule event context for speech recognition |
JP2005221565A (en) | 2004-02-03 | 2005-08-18 | Nec Saitama Ltd | Voice data file storing method and sound-recording processor |
US20060020486A1 (en) * | 2004-04-02 | 2006-01-26 | Kurzweil Raymond C | Machine and method to assist user in selecting clothing |
KR100640893B1 (en) * | 2004-09-07 | 2006-11-02 | 엘지전자 주식회사 | Baseband modem and mobile terminal for voice recognition |
JP4686160B2 (en) | 2004-10-04 | 2011-05-18 | 沖コンサルティングソリューションズ株式会社 | Conversation recording apparatus and conversation recording method |
ES2675734T3 (en) * | 2005-04-07 | 2018-07-12 | Orange | Synchronization procedure between a speech recognition processing operation and an activation action of said processing |
JP2007140063A (en) | 2005-11-17 | 2007-06-07 | Olympus Imaging Corp | Device for sound recording and reproducing |
US7856283B2 (en) * | 2005-12-13 | 2010-12-21 | Sigmatel, Inc. | Digital microphone interface, audio codec and methods for use therewith |
KR100785076B1 (en) * | 2006-06-15 | 2007-12-12 | 삼성전자주식회사 | Method for detecting real time event of sport moving picture and apparatus thereof |
US20080005067A1 (en) * | 2006-06-28 | 2008-01-03 | Microsoft Corporation | Context-based search, retrieval, and awareness |
GB0619825D0 (en) * | 2006-10-06 | 2006-11-15 | Craven Peter G | Microphone array |
JP4979343B2 (en) | 2006-10-27 | 2012-07-18 | 三建設備工業株式会社 | Humidity control system for inside and outside air |
US8652040B2 (en) * | 2006-12-19 | 2014-02-18 | Valencell, Inc. | Telemetric apparatus for health and environmental monitoring |
JP2008165097A (en) | 2006-12-29 | 2008-07-17 | Mariko Kawashima | Voice recording device and voice data analyzing device for use targeting prevention against ill-treatment |
US8140325B2 (en) * | 2007-01-04 | 2012-03-20 | International Business Machines Corporation | Systems and methods for intelligent control of microphones for speech recognition applications |
US20080192906A1 (en) * | 2007-02-14 | 2008-08-14 | Winbond Electronics Corporation | Method and system for message management for audio storage devices |
US8977255B2 (en) * | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US8229134B2 (en) * | 2007-05-24 | 2012-07-24 | University Of Maryland | Audio camera using microphone arrays for real time capture of audio images and method for jointly processing the audio images with video images |
JP4909854B2 (en) * | 2007-09-27 | 2012-04-04 | 株式会社東芝 | Electronic device and display processing method |
US7962525B2 (en) | 2007-11-05 | 2011-06-14 | Microsoft Corporation | Automated capture of information generated at meetings |
US20090204243A1 (en) * | 2008-01-09 | 2009-08-13 | 8 Figure, Llc | Method and apparatus for creating customized text-to-speech podcasts and videos incorporating associated media |
US8554550B2 (en) * | 2008-01-28 | 2013-10-08 | Qualcomm Incorporated | Systems, methods, and apparatus for context processing using multi resolution analysis |
US8099289B2 (en) * | 2008-02-13 | 2012-01-17 | Sensory, Inc. | Voice interface and search for electronic devices including bluetooth headsets and remote systems |
CN101594410A (en) | 2008-05-27 | 2009-12-02 | 北京爱国者存储科技有限责任公司 | The method of automatic telephone recording of electronic recording equipment |
US8805348B2 (en) | 2008-07-30 | 2014-08-12 | Qualcomm Incorporated | Diary synchronization for smart phone applications |
CN201278556Y (en) | 2008-08-22 | 2009-07-22 | 深圳市中深瑞泰科技有限公司 | CDMA mobile phone with automatic response and recording function |
US8488799B2 (en) | 2008-09-11 | 2013-07-16 | Personics Holdings Inc. | Method and system for sound monitoring over a network |
US20110173235A1 (en) * | 2008-09-15 | 2011-07-14 | Aman James A | Session automated recording together with rules based indexing, analysis and expression of content |
GB0817950D0 (en) * | 2008-10-01 | 2008-11-05 | Univ Southampton | Apparatus and method for sound reproduction |
US8676904B2 (en) * | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
WO2010054373A2 (en) * | 2008-11-10 | 2010-05-14 | Google Inc. | Multisensory speech detection |
CN101404680A (en) | 2008-11-12 | 2009-04-08 | 深圳市杰特电信控股有限公司 | Method for inserting and playing media fragment in electronic document |
CN101478717A (en) | 2009-01-19 | 2009-07-08 | 深圳市同洲电子股份有限公司 | Call recording method, system and mobile communication terminal |
US8862252B2 (en) * | 2009-01-30 | 2014-10-14 | Apple Inc. | Audio user interface for displayless electronic device |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US7930436B1 (en) * | 2009-03-09 | 2011-04-19 | Znosko Dmitry Y | System and method for dynamically adjusting data compression parameters |
WO2011011438A2 (en) * | 2009-07-22 | 2011-01-27 | Dolby Laboratories Licensing Corporation | System and method for automatic selection of audio configuration settings |
US9167409B2 (en) * | 2010-02-19 | 2015-10-20 | Telefonaktiebolaget L M Ericsson (Publ) | Music control signal dependent activation of a voice activity detector |
US9679257B2 (en) * | 2010-07-01 | 2017-06-13 | Nokia Technologies Oy | Method and apparatus for adapting a context model at least partially based upon a context-related search criterion |
-
2011
- 2011-03-30 US US13/076,242 patent/US9112989B2/en active Active
- 2011-04-08 ES ES11717375.7T patent/ES2574680T3/en active Active
- 2011-04-08 ES ES15198125.5T patent/ES2688371T3/en active Active
- 2011-04-08 ES ES21171952T patent/ES2963099T3/en active Active
- 2011-04-08 HU HUE15198125A patent/HUE038690T2/en unknown
- 2011-04-08 PT PT15198125T patent/PT3035655T/en unknown
- 2011-04-08 KR KR1020127029257A patent/KR101498347B1/en active IP Right Grant
- 2011-04-08 HU HUE18179847A patent/HUE055010T2/en unknown
- 2011-04-08 ES ES18179847T patent/ES2877325T3/en active Active
- 2011-04-08 PL PL15198125T patent/PL3035655T3/en unknown
- 2011-04-08 DK DK15198125.5T patent/DK3035655T3/en active
- 2011-04-08 EP EP15198125.5A patent/EP3035655B1/en active Active
- 2011-04-08 EP EP21171952.1A patent/EP3917123B1/en active Active
- 2011-04-08 WO PCT/US2011/031859 patent/WO2011127457A1/en active Application Filing
- 2011-04-08 CN CN201510645020.9A patent/CN105357371B/en active Active
- 2011-04-08 EP EP11717375.7A patent/EP2556652B1/en active Active
- 2011-04-08 EP EP18179847.1A patent/EP3438975B1/en active Active
- 2011-04-08 JP JP2013504014A patent/JP2013527490A/en active Pending
- 2011-04-08 CN CN201180025888.9A patent/CN102907077B/en active Active
- 2011-04-08 HU HUE11717375A patent/HUE028665T2/en unknown
- 2011-04-08 KR KR1020147006752A patent/KR101523181B1/en active IP Right Grant
- 2011-04-08 SI SI201131527T patent/SI3035655T1/en unknown
-
2014
- 2014-05-07 JP JP2014096211A patent/JP2014195275A/en active Pending
-
2015
- 2015-07-17 US US14/802,088 patent/US20150325267A1/en not_active Abandoned
-
2016
- 2016-05-06 JP JP2016093278A patent/JP6689664B2/en active Active
-
2021
- 2021-05-11 US US17/317,702 patent/US20210264947A1/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004057892A1 (en) * | 2002-12-20 | 2004-07-08 | Nokia Corporation | Method and device for organizing user provided information with meta-information |
US20060149547A1 (en) * | 2005-01-06 | 2006-07-06 | Fuji Photo Film Co., Ltd. | Recording apparatus and voice recorder program |
US20070033030A1 (en) * | 2005-07-19 | 2007-02-08 | Oded Gottesman | Techniques for measurement, adaptation, and setup of an audio communication system |
US20080201142A1 (en) * | 2007-02-15 | 2008-08-21 | Motorola, Inc. | Method and apparatus for automication creation of an interactive log based on real-time content |
US20090177476A1 (en) * | 2007-12-21 | 2009-07-09 | May Darrell | Method, system and mobile device for registering voice data with calendar events |
US20090190769A1 (en) * | 2008-01-29 | 2009-07-30 | Qualcomm Incorporated | Sound quality by intelligently selecting between signals from a plurality of microphones |
US20100081487A1 (en) * | 2008-09-30 | 2010-04-01 | Apple Inc. | Multiple microphone switching and configuration |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013040414A1 (en) * | 2011-09-16 | 2013-03-21 | Qualcomm Incorporated | Mobile device context information using speech detection |
US9838810B2 (en) | 2012-02-27 | 2017-12-05 | Qualcomm Technologies International, Ltd. | Low power audio detection |
CN113485551A (en) * | 2014-05-31 | 2021-10-08 | 苹果公司 | Message user interface for capture and transmission of media and location content |
US10580428B2 (en) | 2014-08-18 | 2020-03-03 | Sony Corporation | Audio noise estimation and filtering |
US11417334B2 (en) * | 2019-11-27 | 2022-08-16 | Realtek Semiconductor Corp. | Dynamic speech recognition method and apparatus therefor |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210264947A1 (en) | System and method of determining auditory context information | |
KR101622493B1 (en) | Extraction and analysis of audio feature data | |
US8600743B2 (en) | Noise profile determination for voice-related feature | |
EP3138097B1 (en) | Voice profile management | |
US11551707B2 (en) | Speech processing method, information device, and computer program product | |
KR20200109830A (en) | A computer-readable recording medium on which an automatic speech recognition program is recorded | |
KR20200109827A (en) | Automatic speech recognition method for reducing the waiting time |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201180025888.9 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11717375 Country of ref document: EP Kind code of ref document: A1 |
|
DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101) | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 8571/CHENP/2012 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2013504014 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011717375 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 20127029257 Country of ref document: KR Kind code of ref document: A |