WO2005060337A3 - Automatic extraction of musical portions of an audio stream - Google Patents

Automatic extraction of musical portions of an audio stream

Info

Publication number
WO2005060337A3
Authority
WO
WIPO (PCT)
Prior art keywords
audio stream
music
frames
automatic extraction
smoothed
Prior art date
Application number
PCT/IB2004/004085
Other languages
French (fr)
Other versions
WO2005060337A2 (en)
Inventor
Ole Kirkeby
Jyri Huopaniemi
Timo Sorsa
Original Assignee
Nokia Corp
Nokia Inc
Ole Kirkeby
Jyri Huopaniemi
Timo Sorsa
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Corp, Nokia Inc, Ole Kirkeby, Jyri Huopaniemi, Timo Sorsa
Priority to DE602004016380T (DE602004016380D1)
Priority to EP04801373A (EP1692799B1)
Publication of WO2005060337A2
Publication of WO2005060337A3

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04H BROADCAST COMMUNICATION
    • H04H60/00 Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/56 Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
    • H04H60/58 Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 of audio
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00 Details of electrophonic musical instruments
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02 Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031 Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • G11B27/034 Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10 Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19 Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28 Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04H BROADCAST COMMUNICATION
    • H04H60/00 Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/27 Arrangements for recording or accumulating broadcast information or broadcast-related information
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04H BROADCAST COMMUNICATION
    • H04H60/00 Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/35 Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H60/37 Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying segments of broadcast information, e.g. scenes or extracting programme ID
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04H BROADCAST COMMUNICATION
    • H04H60/00 Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/35 Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H60/48 Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for recognising items expressed in broadcast information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00 Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/046 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for differentiation between music and non-music signals, based on the identification of musical parameters, e.g. based on tempo detection

Abstract

Music and non-music portions in an audio stream are identified. The audio stream is digitized and segmented into frames. Selected frames are passed through a filter bank which includes filters having bandwidths approximately proportional to their center frequencies. The spectral flux for each selected frame is calculated and smoothed. Frames having a smoothed spectral flux below a threshold value are associated with music, and frames having a smoothed spectral flux above a threshold value are associated with non-music.
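
The steps named in the abstract (framing, a filter bank whose bandwidths scale with center frequency, spectral flux, smoothing, thresholding) can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything specific in it is an assumption made for illustration: the octave-wide DFT bands standing in for the filter bank, the normalized absolute-difference flux, the trailing moving average, the frame length of 64, the threshold of 0.5, and all function names. It also processes every frame rather than a selected subset.

```python
import cmath
import math


def split_frames(samples, frame_len):
    """Segment the digitized stream into non-overlapping frames (trailing partial frame dropped)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]


def octave_band_energies(frame):
    """Stand-in filter bank: octave-wide bands, so bandwidth grows with center frequency."""
    n = len(frame)
    # Naive DFT power for bins 0 .. n/2 - 1 (slow, but dependency-free).
    power = [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                     for t, x in enumerate(frame))) ** 2
             for k in range(n // 2)]
    energies, lo = [], 1  # skip the DC bin
    while lo < len(power):
        hi = min(2 * lo, len(power))
        energies.append(sum(power[lo:hi]))
        lo = hi
    total = sum(energies)
    # Normalize so the flux reflects spectral shape rather than loudness.
    return [e / total if total > 0 else 0.0 for e in energies]


def spectral_flux(bands):
    """Sum of absolute band-energy changes from the previous frame (0.0 for the first frame)."""
    flux = [0.0]
    for prev, cur in zip(bands, bands[1:]):
        flux.append(sum(abs(c - p) for p, c in zip(prev, cur)))
    return flux


def smooth(values, width):
    """Trailing moving average over up to `width` values."""
    out = []
    for i in range(len(values)):
        window = values[max(0, i - width + 1):i + 1]
        out.append(sum(window) / len(window))
    return out


def classify(samples, frame_len=64, width=3, threshold=0.5):
    """Label each frame 'music' (smoothed flux below threshold) or 'non-music'."""
    bands = [octave_band_energies(f) for f in split_frames(samples, frame_len)]
    smoothed = smooth(spectral_flux(bands), width)
    return ["music" if s < threshold else "non-music" for s in smoothed]


def tone(dft_bin, frame_len=64):
    """One frame of a sine that sits exactly on a DFT bin (synthetic test input)."""
    return [math.sin(2 * math.pi * dft_bin * t / frame_len) for t in range(frame_len)]


# Eight frames of a steady tone, then eight frames that jump between low and high bins.
steady = [s for _ in range(8) for s in tone(4)]
jumpy = [s for i in range(8) for s in tone(1 if i % 2 == 0 else 20)]
labels = classify(steady + jumpy)
print(labels)
```

On this synthetic input the steady tone keeps the same band profile from frame to frame, so its smoothed flux stays near zero, while the alternating tones move all energy between bands on every frame. Real audio would need a proper filter bank, a tuned threshold, and probably a longer smoothing window.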
PCT/IB2004/004085 2003-12-12 2004-12-08 Automatic extraction of musical portions of an audio stream WO2005060337A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
DE602004016380T DE602004016380D1 (en) 2003-12-12 2004-12-08 AUTOMATIC EXTRACTION OF MUSIC PARTS OF AN AUDIO STREAM
EP04801373A EP1692799B1 (en) 2003-12-12 2004-12-08 Automatic extraction of musical portions of an audio stream

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/733,257 2003-12-12
US10/733,257 US7179980B2 (en) 2003-12-12 2003-12-12 Automatic extraction of musical portions of an audio stream

Publications (2)

Publication Number Publication Date
WO2005060337A2 WO2005060337A2 (en) 2005-07-07
WO2005060337A3 WO2005060337A3 (en) 2006-09-21

Family

ID=34653055

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2004/004085 WO2005060337A2 (en) 2003-12-12 2004-12-08 Automatic extraction of musical portions of an audio stream

Country Status (7)

Country Link
US (1) US7179980B2 (en)
EP (1) EP1692799B1 (en)
KR (1) KR100840745B1 (en)
CN (1) CN1977306A (en)
AT (1) ATE407419T1 (en)
DE (1) DE602004016380D1 (en)
WO (1) WO2005060337A2 (en)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8139793B2 (en) 2003-08-27 2012-03-20 Sony Computer Entertainment Inc. Methods and apparatus for capturing audio signals based on a visual image
US8233642B2 (en) 2003-08-27 2012-07-31 Sony Computer Entertainment Inc. Methods and apparatuses for capturing an audio signal based on a location of the signal
US8160269B2 (en) * 2003-08-27 2012-04-17 Sony Computer Entertainment Inc. Methods and apparatuses for adjusting a listening area for capturing sounds
US8666524B2 (en) * 2003-01-02 2014-03-04 Catch Media, Inc. Portable music player and transmitter
US7130623B2 (en) * 2003-04-17 2006-10-31 Nokia Corporation Remote broadcast recording
KR100782830B1 (en) 2006-01-02 2007-12-06 삼성전자주식회사 Broadcasting receiver and method for storing digital multimedia broadcasting audio data
JP4665836B2 (en) * 2006-05-31 2011-04-06 日本ビクター株式会社 Music classification device, music classification method, and music classification program
JP2008026662A (en) * 2006-07-21 2008-02-07 Sony Corp Data recording device, method, and program
JP2008076776A (en) * 2006-09-21 2008-04-03 Sony Corp Data recording device, data recording method, and data recording program
KR100832360B1 (en) * 2006-09-25 2008-05-26 삼성전자주식회사 Method for controlling equalizer in digital media player and system thereof
JP2008241850A (en) * 2007-03-26 2008-10-09 Sanyo Electric Co Ltd Recording or reproducing device
US20090062943A1 (en) * 2007-08-27 2009-03-05 Sony Computer Entertainment Inc. Methods and apparatus for automatically controlling the sound level based on the content
US20090163239A1 (en) * 2007-12-21 2009-06-25 Nokia Corporation Method, apparatus and computer program product for generating media content by recording broadcast transmissions
JP2009192725A (en) * 2008-02-13 2009-08-27 Sanyo Electric Co Ltd Music piece recording device
ES2895268T3 (en) * 2008-03-20 2022-02-18 Fraunhofer Ges Forschung Apparatus and method for modifying a parameterized representation
KR101599875B1 (en) * 2008-04-17 2016-03-14 삼성전자주식회사 Method and apparatus for multimedia encoding based on attribute of multimedia content, method and apparatus for multimedia decoding based on attributes of multimedia content
KR20090110244A (en) * 2008-04-17 2009-10-21 삼성전자주식회사 Method for encoding/decoding audio signals using audio semantic information and apparatus thereof
KR20090110242A (en) * 2008-04-17 2009-10-21 삼성전자주식회사 Method and apparatus for processing audio signal
US20100057472A1 (en) * 2008-08-26 2010-03-04 Hanks Zeng Method and system for frequency compensation in an audio codec
US8712771B2 (en) * 2009-07-02 2014-04-29 Alon Konchitsky Automated difference recognition between speaking sounds and music
DE112009005215T8 (en) * 2009-08-04 2013-01-03 Nokia Corp. Method and apparatus for audio signal classification
CN102044244B (en) * 2009-10-15 2011-11-16 华为技术有限公司 Signal classifying method and device
US8716584B1 (en) * 2010-11-01 2014-05-06 James W. Wieder Using recognition-segments to find and play a composition containing sound
US9117426B2 (en) 2010-11-01 2015-08-25 James W. Wieder Using sound-segments in a multi-dimensional ordering to find and act-upon a composition
US9153217B2 (en) 2010-11-01 2015-10-06 James W. Wieder Simultaneously playing sound-segments to find and act-upon a composition
US10194239B2 (en) * 2012-11-06 2019-01-29 Nokia Technologies Oy Multi-resolution audio signals
CN103974143B (en) * 2014-05-20 2017-11-07 北京速能数码网络技术有限公司 A kind of method and apparatus for generating media data
US9686382B2 (en) * 2014-08-04 2017-06-20 Honeywell International Inc. Double decoder system for decoding overlapping aircraft surveillance signals
KR102255152B1 (en) * 2014-11-18 2021-05-24 삼성전자주식회사 Contents processing device and method for transmitting segments of variable size and computer-readable recording medium
US10715868B2 (en) * 2016-06-07 2020-07-14 Maxell, Ltd. Broadcast receiving apparatus
GB2551807B (en) * 2016-06-30 2022-07-13 Lifescore Ltd Apparatus and methods to generate music
KR20190109661A (en) 2018-03-08 2019-09-26 한국전자통신연구원 Method for generating data for learning emotion in video, method for determining emotion in video, and apparatus using the methods
CN109658951B (en) * 2019-01-08 2021-03-26 北京雷石天地电子技术有限公司 Mixed signal detection method and system
WO2020153736A1 (en) 2019-01-23 2020-07-30 Samsung Electronics Co., Ltd. Method and device for speech recognition
CN110176262B (en) * 2019-04-23 2023-12-26 上海协言科学技术服务有限公司 Method for digital conversion of steel wire recording
EP3888084A4 (en) 2019-05-16 2022-01-05 Samsung Electronics Co., Ltd. Method and device for providing voice recognition service
US11575952B2 (en) * 2021-04-12 2023-02-07 Arris Enterprises Llc Digital rights management while streaming to display array

Citations (2)

Publication number Priority date Publication date Assignee Title
US5343251A (en) * 1993-05-13 1994-08-30 Pareto Partners, Inc. Method and apparatus for classifying patterns of television programs and commercials based on discerning of broadcast audio and video signals
US20030171936A1 (en) * 2002-02-21 2003-09-11 Sall Mikhael A. Method of segmenting an audio stream

Family Cites Families (14)

Publication number Priority date Publication date Assignee Title
JP3353381B2 (en) * 1993-04-23 2002-12-03 ソニー株式会社 Recording and playback device
US7058376B2 (en) * 1999-01-27 2006-06-06 Logan James D Radio receiving, recording and playback system
US6570991B1 (en) * 1996-12-18 2003-05-27 Interval Research Corporation Multi-feature speech/music discrimination system
US5739451A (en) * 1996-12-27 1998-04-14 Franklin Electronic Publishers, Incorporated Hand held electronic music encyclopedia with text and note structure search
US6819863B2 (en) * 1998-01-13 2004-11-16 Koninklijke Philips Electronics N.V. System and method for locating program boundaries and commercial boundaries using audio categories
US6185527B1 (en) * 1999-01-19 2001-02-06 International Business Machines Corporation System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval
US7062442B2 (en) 2001-02-23 2006-06-13 Popcatcher Ab Method and arrangement for search and recording of media signals
EP2261892B1 (en) * 2001-04-13 2020-09-16 Dolby Laboratories Licensing Corporation High quality time-scaling and pitch-scaling of audio signals
US7711123B2 (en) * 2001-04-13 2010-05-04 Dolby Laboratories Licensing Corporation Segmenting audio signals into auditory events
US6785645B2 (en) * 2001-11-29 2004-08-31 Microsoft Corporation Real-time speech and music classifier
KR100880480B1 (en) * 2002-02-21 2009-01-28 엘지전자 주식회사 Method and system for real-time music/speech discrimination in digital audio signals
US7386357B2 (en) * 2002-09-30 2008-06-10 Hewlett-Packard Development Company, L.P. System and method for generating an audio thumbnail of an audio track
JP2006508390A (en) * 2002-11-28 2006-03-09 エイジェンシー フォー サイエンス, テクノロジー アンド リサーチ Digital audio data summarization method and apparatus, and computer program product
US8521529B2 (en) * 2004-10-18 2013-08-27 Creative Technology Ltd Method for segmenting audio signals

Also Published As

Publication number Publication date
KR20060086420A (en) 2006-07-31
DE602004016380D1 (en) 2008-10-16
EP1692799A4 (en) 2007-06-13
EP1692799A2 (en) 2006-08-23
US20050126369A1 (en) 2005-06-16
EP1692799B1 (en) 2008-09-03
ATE407419T1 (en) 2008-09-15
US7179980B2 (en) 2007-02-20
CN1977306A (en) 2007-06-06
WO2005060337A2 (en) 2005-07-07
KR100840745B1 (en) 2008-06-23

Similar Documents

Publication Publication Date Title
WO2005060337A3 (en) Automatic extraction of musical portions of an audio stream
WO2005124101A3 (en) Method and system for producing gas and liquid in a subterranean well
CN102034482B (en) Apparatus of voice bandspreading and method of same
CN106878866A (en) Acoustic signal processing method, device and terminal
WO2006019555A3 (en) Music detection with low-complexity pitch correlation algorithm
DE602006005684D1 (en) Model-based improvement of speech signals
ZA200606215B (en) Method and device for speech enhancement in the presence of background noise
WO2006130226A3 (en) Audio codec post-filter
CN104781862B (en) Real-time traffic is detected
WO2001020965A3 (en) Method for determining a current acoustic environment, use of said method and a hearing-aid
WO2007035183A3 (en) Method, system, and program product for measuring audio video synchronization independent of speaker characteristics
EA201290082A1 (en) METHOD OF IDENTIFICATION OF PHONOGRAMMING OF ARBITRARY ORAL SPEECH BASED ON THE FORMANT ALIGNMENT
JP2012037582A (en) Signal processing apparatus and method, and program
CN104916288B (en) The method and device of the prominent processing of voice in a kind of audio
AU2003285721A1 (en) Method and arrangement for filter bank based signal processing
DE60118800D1 (en) UNIFIED FILTER BANK FOR CODING AUDIO SIGNALS
DK1225810T3 (en) Procedure for the treatment of chronic venous insufficiency using a leaf extract from red wine leaves
JP2000295699A5 (en)
Sofianos et al. Towards effective singing voice extraction from stereophonic recordings
WO2002084998A1 (en) Contour-emphasizing device
CN205773347U (en) A kind of air compressor machine nitrogen gas generating device
Yoshida et al. The Extremal Sampling Technique
GB2378911A (en) Multi-stage filter assembly for gaseous, moist media
WO2005071835A3 (en) Dynamic filter
Lyon Auditory effects for ASR

Legal Events

Date Code Title Description
WWE WIPO information: entry into national phase (Ref document number: 200480036748.1; Country of ref document: CN)
AK Designated states (Kind code of ref document: A2; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW)
AL Designated countries for regional patents (Kind code of ref document: A2; Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG)
121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase (Ref document number: 2004801373; Country of ref document: EP)
WWE WIPO information: entry into national phase (Ref document number: 1020067009165; Country of ref document: KR)
WWP WIPO information: published in national office (Ref document number: 1020067009165; Country of ref document: KR)
WWP WIPO information: published in national office (Ref document number: 2004801373; Country of ref document: EP)
WWG WIPO information: grant in national office (Ref document number: 2004801373; Country of ref document: EP)