CA2517751A1 - Operating method for voice activity detection/silence suppression system - Google Patents

Operating method for voice activity detection/silence suppression system

Info

Publication number
CA2517751A1
CA2517751A1 CA002517751A CA2517751A CA2517751A1 CA 2517751 A1 CA2517751 A1 CA 2517751A1 CA 002517751 A CA002517751 A CA 002517751A CA 2517751 A CA2517751 A CA 2517751A CA 2517751 A1 CA2517751 A1 CA 2517751A1
Authority
CA
Canada
Prior art keywords
operating method
voice activity
activity detection
suppression system
silence suppression
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002517751A
Other languages
French (fr)
Other versions
CA2517751C (en)
Inventor
Bing Chen
James H. James
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
AT&T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2004-09-16
Filing date
2005-08-31
Publication date
2006-03-16
Application filed by AT&T Corp
Publication of CA2517751A1
Application granted
Publication of CA2517751C
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012 Comfort noise or silence coding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/45 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of analysis window
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/84 Detection of presence or absence of voice signals for discriminating voice from noise
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals
    • G10L2025/935 Mixed voiced class; Transitions

Abstract

A VAD/SS system is connected to a channel of a transmission pipe. The channel provides a pathway for the transmission of energy. A method for operating a VAD/SS system includes detecting the energy on the channel, and activating or suppressing activation of the VAD/SS system depending upon the nature of the energy detected on the channel.
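
The control flow described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the "nature" of the energy is judged, so everything below is an assumption made for illustration: the dB energy measure, the `classify_energy` rule with its `ceiling_db` threshold, the "allow"/"inhibit" labels, and the `VadSsController` class are hypothetical names, not elements taken from the patent.

```python
import math
from typing import Callable, Sequence


def frame_energy_db(samples: Sequence[float]) -> float:
    """Mean-square energy of one frame in dB relative to full scale (1.0)."""
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    return 10.0 * math.log10(mean_square) if mean_square > 0 else float("-inf")


def classify_energy(level_db: float, ceiling_db: float = -40.0) -> str:
    """Hypothetical stand-in for judging the 'nature' of the channel energy.

    Assumption: a single level threshold decides the outcome. The source
    abstract does not specify the criterion; swap in any other rule here.
    """
    return "allow" if level_db <= ceiling_db else "inhibit"


class VadSsController:
    """Activates or suppresses activation of a VAD/SS system per frame."""

    def __init__(self, classify: Callable[[float], str] = classify_energy):
        self.classify = classify
        self.vad_ss_active = False

    def update(self, samples: Sequence[float]) -> bool:
        # Step 1: detect the energy on the channel.
        level_db = frame_energy_db(samples)
        # Step 2: activate, or suppress activation, depending on its nature.
        self.vad_ss_active = self.classify(level_db) == "allow"
        return self.vad_ss_active


if __name__ == "__main__":
    controller = VadSsController()
    for name, frame in (("low-level", [0.001] * 160), ("high-level", [0.5] * 160)):
        print(name, "-> VAD/SS active:", controller.update(frame))
```

Passing the classifier in as a callable keeps the decision rule separate from the measure-then-decide loop, which is the only part the abstract actually describes.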

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/942,518 US7917356B2 (en) 2004-09-16 2004-09-16 Operating method for voice activity detection/silence suppression system
US10/942,518 2004-09-16

Publications (2)

Publication Number Publication Date
CA2517751A1 (en) 2006-03-16
CA2517751C (en) 2009-10-06

Family

ID=36087444

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002517751A Expired - Fee Related CA2517751C (en) 2004-09-16 2005-08-31 Operating method for voice activity detection/silence suppression system

Country Status (2)

Country Link
US (8) US7917356B2 (en)
CA (1) CA2517751C (en)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7272552B1 (en) 2002-12-27 2007-09-18 At&T Corp. Voice activity detection and silence suppression in a packet network
US7230955B1 (en) * 2002-12-27 2007-06-12 At & T Corp. System and method for improved use of voice activity detection
US8311814B2 (en) * 2006-09-19 2012-11-13 Avaya Inc. Efficient voice activity detector to detect fixed power signals
US8489396B2 (en) * 2007-07-25 2013-07-16 Qnx Software Systems Limited Noise reduction with integrated tonal noise reduction
KR101444099B1 (en) * 2007-11-13 2014-09-26 삼성전자주식회사 Method and apparatus for detecting voice activity
JP5293329B2 (en) * 2009-03-26 2013-09-18 富士通株式会社 Audio signal evaluation program, audio signal evaluation apparatus, and audio signal evaluation method
CN102044242B (en) 2009-10-15 2012-01-25 华为技术有限公司 Method, device and electronic equipment for voice activation detection
JP5874344B2 (en) * 2010-11-24 2016-03-02 株式会社Jvcケンウッド Voice determination device, voice determination method, and voice determination program
HUE053127T2 (en) 2010-12-24 2021-06-28 Huawei Tech Co Ltd Method and apparatus for adaptively detecting a voice activity in an input audio signal
US9064503B2 (en) 2012-03-23 2015-06-23 Dolby Laboratories Licensing Corporation Hierarchical active voice detection
US20130282373A1 (en) * 2012-04-23 2013-10-24 Qualcomm Incorporated Systems and methods for audio signal processing
CN104143326B (en) * 2013-12-03 2016-11-02 腾讯科技(深圳)有限公司 A kind of voice command identification method and device
KR102301880B1 (en) 2014-10-14 2021-09-14 삼성전자 주식회사 Electronic apparatus and method for spoken dialog thereof
WO2016103809A1 (en) * 2014-12-25 2016-06-30 ソニー株式会社 Information processing device, information processing method, and program
US10057939B2 (en) 2015-06-16 2018-08-21 Apple Inc. Managing packet-switched voice communications during silence intervals
US11138987B2 (en) 2016-04-04 2021-10-05 Honeywell International Inc. System and method to distinguish sources in a multiple audio source environment
US10257839B2 (en) 2017-03-20 2019-04-09 At&T Intellectual Property I, L.P. Facilitating communication of radio resource quality to a mobile application
CN108122552B (en) * 2017-12-15 2021-10-15 上海智臻智能网络科技股份有限公司 Voice emotion recognition method and device
CN108447506A (en) * 2018-03-06 2018-08-24 深圳市沃特沃德股份有限公司 Method of speech processing and voice processing apparatus
US10332543B1 (en) 2018-03-12 2019-06-25 Cypress Semiconductor Corporation Systems and methods for capturing noise for pattern recognition processing
CN110473542B (en) * 2019-09-06 2022-04-15 北京安云世纪科技有限公司 Awakening method and device for voice instruction execution function and electronic equipment
CN111354378B (en) * 2020-02-12 2020-11-24 北京声智科技有限公司 Voice endpoint detection method, device, equipment and computer storage medium

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4696039A (en) * 1983-10-13 1987-09-22 Texas Instruments Incorporated Speech analysis/synthesis system with silence suppression
US5960389A (en) * 1996-11-15 1999-09-28 Nokia Mobile Phones Limited Methods for generating comfort noise during discontinuous transmission
JP2856185B2 (en) * 1997-01-21 1999-02-10 日本電気株式会社 Audio coding / decoding system
US5867574A (en) * 1997-05-19 1999-02-02 Lucent Technologies Inc. Voice activity detection system and method
US20010014857A1 (en) 1998-08-14 2001-08-16 Zifei Peter Wang A voice activity detector for packet voice network
US6621812B1 (en) * 1998-11-10 2003-09-16 Cisco Technology, Inc. Method and apparatus for mapping voice activity detection to a scheduled access media
US7933295B2 (en) 1999-04-13 2011-04-26 Broadcom Corporation Cable modem with voice processing capability
US6549587B1 (en) * 1999-09-20 2003-04-15 Broadcom Corporation Voice and data exchange over a packet based network with timing recovery
US7023868B2 (en) 1999-04-13 2006-04-04 Broadcom Corporation Voice gateway with downstream voice synchronization
US7203164B2 (en) 1999-10-27 2007-04-10 Broadcom Corporation Voice architecture for transmission over a shared, contention based medium
US6993007B2 (en) 1999-10-27 2006-01-31 Broadcom Corporation System and method for suppressing silence in voice traffic over an asynchronous communication medium
US6526139B1 (en) * 1999-11-03 2003-02-25 Tellabs Operations, Inc. Consolidated noise injection in a voice processing system
WO2001086914A2 (en) 2000-05-08 2001-11-15 Broadcom Corporation System and method for supporting multiple voice channels
US7003093B2 (en) 2000-09-08 2006-02-21 Intel Corporation Tone detection for integrated telecommunications processing
US6738358B2 (en) 2000-09-09 2004-05-18 Intel Corporation Network echo canceller for integrated telecommunications processing
US20020116186A1 (en) 2000-09-09 2002-08-22 Adam Strauss Voice activity detector for integrated telecommunications processing
US6889187B2 (en) * 2000-12-28 2005-05-03 Nortel Networks Limited Method and apparatus for improved voice activity detection in a packet voice network
US20020110152A1 (en) 2001-02-14 2002-08-15 Silvain Schaffer Synchronizing encoder - decoder operation in a communication network
US7171357B2 (en) 2001-03-21 2007-01-30 Avaya Technology Corp. Voice-activity detection using energy ratios and periodicity
US20030120484A1 (en) 2001-06-12 2003-06-26 David Wong Method and system for generating colored comfort noise in the absence of silence insertion description packets
US20040002856A1 (en) * 2002-03-08 2004-01-01 Udaya Bhaskar Multi-rate frequency domain interpolative speech CODEC system
GB2388001A (en) 2002-04-26 2003-10-29 Mitel Knowledge Corp Compensating for beamformer steering delay during handsfree speech recognition
US20030212550A1 (en) 2002-05-10 2003-11-13 Ubale Anil W. Method, apparatus, and system for improving speech quality of voice-over-packets (VOP) systems
US7072828B2 (en) * 2002-05-13 2006-07-04 Avaya Technology Corp. Apparatus and method for improved voice activity detection
US7272552B1 (en) * 2002-12-27 2007-09-18 At&T Corp. Voice activity detection and silence suppression in a packet network
CA2420129A1 (en) * 2003-02-17 2004-08-17 Catena Networks, Canada, Inc. A method for robustly detecting voice activity
US7281051B2 (en) * 2003-06-30 2007-10-09 Nortel Networks Limited Apparatus, method, and computer program for managing resources in a communication system

Also Published As

Publication number Publication date
US20140195228A1 (en) 2014-07-10
US8346543B2 (en) 2013-01-01
US20130103395A1 (en) 2013-04-25
US20140039886A1 (en) 2014-02-06
US20160078885A1 (en) 2016-03-17
US20150187372A1 (en) 2015-07-02
US20060069551A1 (en) 2006-03-30
US8577674B2 (en) 2013-11-05
US8909519B2 (en) 2014-12-09
US20150073782A1 (en) 2015-03-12
US9412396B2 (en) 2016-08-09
US7917356B2 (en) 2011-03-29
US8700390B2 (en) 2014-04-15
US9009034B2 (en) 2015-04-14
US9224405B2 (en) 2015-12-29
US20110196675A1 (en) 2011-08-11
CA2517751C (en) 2009-10-06

Similar Documents

Publication Publication Date Title
CA2517751A1 (en) Operating method for voice activity detection/silence suppression system
AU2003263733A1 (en) Voice activity detection (vad) devices and methods for use with noise suppression systems
WO2006104576A3 (en) Adaptive voice mode extension for a voice activity detector
WO2009024784A3 (en) Ultrasound detectors
WO2008016949A3 (en) Voice and text communication system, method and apparatus
BRPI0518669A2 (en) hearing protection
FR2898209B1 (en) Method for denoising an audio signal
WO2005081686A3 (en) Sonar system and process
CA2352017A1 (en) Method and apparatus for locating a talker
AU7111000A (en) System, method, and article of manufacture for detecting emotion in voice signals by utilizing statistics for voice signal parameters
WO2006012550A3 (en) Monitoring system for concrete pilings and method of installation
BRPI0500587A (en) Method and Equipment for Multisensory Speech Amplification on a Mobile Device
WO2006130668A3 (en) Person monitoring
AU3893700A (en) Noise suppression using external voice activity detection
WO2010056963A3 (en) Training/coaching system for a voice-enabled work environment
AU2003281449A1 (en) System and method for robustly detecting voice and dtx modes
BRPI0509404A (en) fluid ejection device and method
WO2004087563A3 (en) Barrier layers for microelectromechanical systems
WO2008014448A3 (en) Vascular access device non-adhering membranes
EP0944036A4 (en) Method and device for detecting voice sections, and speech velocity conversion method and device utilizing said method and device
WO2007001821A3 (en) Multi-sensory speech enhancement using a speech-state model
WO2007001768A3 (en) Multi-sensory speech enhancement using a clean speech prior
WO2003048711A3 (en) Speech detection system in an audio signal in noisy surrounding
WO2001001389A3 (en) Voice recognition method and device
EP1903557A3 (en) An efficient voice activity detector to detect fixed power signals

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed

Effective date: 2017-08-31