EP2439735A1 - Method and Apparatus for generating reference phase patterns - Google Patents

Method and Apparatus for generating reference phase patterns Download PDF

Info

Publication number
EP2439735A1
EP2439735A1 EP10306091A EP10306091A EP2439735A1 EP 2439735 A1 EP2439735 A1 EP 2439735A1 EP 10306091 A EP10306091 A EP 10306091A EP 10306091 A EP10306091 A EP 10306091A EP 2439735 A1 EP2439735 A1 EP 2439735A1
Authority
EP
European Patent Office
Prior art keywords
blocks
frequency domain
audio signal
windowing
subblocks
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP10306091A
Other languages
German (de)
French (fr)
Inventor
Xiao-ming CHEN
Michael Arnold
Peter Georg Baum
Ulrich Gries
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Priority to EP10306091A priority Critical patent/EP2439735A1/en
Publication of EP2439735A1 publication Critical patent/EP2439735A1/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring

Definitions

  • the invention relates to a method and to an apparatus for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal.
  • watermark embedding occurs in the phase domain by modifying phases of the host signal spectrum according to phase reference patterns. Specifically, the host audio signal is split into blocks of length L B and each block is used for embedding a watermark symbol. The watermark message is represented by a sequence of symbols. For embedding, each block can be further partitioned into N SB overlapping subblocks of L SB samples length with an overlap length of L SB /2.
  • phase of the weighted subblock spectrum are modified according to generated reference phases and according to results from a psycho-acoustic analysis of the host audio signal.
  • reference phases for each watermark message symbol are generated by the following steps:
  • Step B3) mimics the windowing-overlap process that is commonly used in the analysis stage of digital audio signal processing.
  • a windowing function is selected such that an ideal reconstruction is guaranteed following ideal analysis (windowing-overlap) and synthesis (windowing-overlap-add) processes.
  • generated reference angles in step B4) are employed to modify host audio spectrum phases after the host signal has undergone the windowing-overlap processing.
  • phase generation processing is that many IDFTs/DFTs are required, which imposes constraints on the achievable speed of hardware/software implementations, and the deployment of related audio watermarking schemes is therefore limited for real-time applications.
  • IDFTs/DFTs are required, which imposes constraints on the achievable speed of hardware/software implementations, and the deployment of related audio watermarking schemes is therefore limited for real-time applications.
  • the audio content is watermarked with a label that identifies the customer.
  • a copyright owner will license multiple different works to the customer.
  • different keys for individual copyrighted works are used. Therefore efficient reference phase generation is a necessary prerequisite for deploying an audio watermarking scheme in such application scenario.
  • a problem to be solved by the invention is to provide more efficient generation of the reference phase patterns. This problem is solved by the method disclosed in claim 1. An apparatus that utilises this method is disclosed in claim 6.
  • the efficient generation of random reference phase patterns is carried out by approximating the inverse transform of the phase reference patterns, and the corresponding forward transform and the windowing and windowing-overlapping effects using two different short-length frequency-domain filters for even and odd blocks or subblocks.
  • the computational complexity for generating the reference phase pattern blocks or subblocks for frequency domain phase modulation of the audio signal blocks or subblocks is significantly reduced, which in turn reduces the complexity of both watermark embedder at sender side and watermark detector at receiver side.
  • the inventive reference phase generation fulfils critical real-time requirements and enables a wide deployment of corresponding audio watermarking processings.
  • the flexibility of controlling the generated phases is maintained, e.g. in case structured phases are used instead of random ones.
  • the inventive method is suited for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal, said method including the steps:
  • the inventive apparatus is suited for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal, said apparatus including:
  • an audio signal block a is partitioned into N SB overlapped (50% overlapping) subblocks of length L SB denoted as ⁇ a m , 0 ⁇ m ⁇ N SB ⁇ . Accordingly, the even subblocks are non-overlapping and each odd subblock can be obtained by its two adjacent even subblocks. Specifically, the (2 i +1)-th (i.e.
  • the phases of spectra of windowed subblocks a w,m serve as reference phases for the watermark embedding, followed by DFT for the weighted subblocks to get reference spectra.
  • ⁇ A m [ k ], 0 ⁇ k ⁇ L SB -1 ⁇ and ⁇ W [ k ], 0 ⁇ k ⁇ L SB -1 ⁇ shall denote DFTs for the m-th subblock a m and for the windowing function, respectively.
  • the first term in Eq.(4) is the contribution from two spectral lines of the same index within two adjacent subblocks and the second term is the contribution from other spectral lines.
  • the reference spectrum for an odd subblock is obtained by inserting Eq.(4) into Eq.(2). Therefore, for the evaluation of reference spectra, two frequency-domain filters ⁇ F [ k ] ⁇ and ⁇ W [ k ] ⁇ are involved. In case they can be well-approximated by a few filter taps, an efficient evaluation of reference spectra is feasible.
  • the spectrum of the windowing function can also be approximated by a few dominant components.
  • the second term in Eq.(9) can be interpreted as the spectrum difference that passes through two filters, which can be regarded as representing in total a single filter:
  • L SB ⁇ q - L w L w - 1 k + q 2 ⁇ A 2 ⁇ i ⁇ ⁇ k + q > + A 2 ⁇ i + 2 ⁇ ⁇ k + q > ⁇ W ⁇ q > + 1
  • L SB 2 ⁇ q - L g L g - 1 k + q ⁇ A 2 ⁇ i ⁇ ⁇ k + q > - A 2 ⁇ i + 2 ⁇ ⁇ k + q > ⁇ G ⁇ q >
  • L g L w + L 1 .
  • Eq.(8) and Eq.(10) provide an efficient evaluation of the reference spectra, whereby only two frequency-domain filters ⁇ W [ ⁇ q >] ⁇ and ⁇ G [ ⁇ q >] ⁇ with a few dominant taps are required.
  • the corresponding filter coefficients can be pre-calculated, and the approximation accuracy can be controlled by filter length parameters L w and L g . That is, the trade-off between robustness and computational complexity can be adjusted by choosing different L w and L g values.
  • the original processing employing FFTs/IFFTs requires on average 62 real-value multiplications and 62 real-value additions for the evaluation of each DFT coefficient of the reference spectra.
  • a received audio signal AS is sampled in an acquisition or receiving section step or stage 51, and thereafter passes through a segmentation, windowing and DFT step or stage 52 to a phase modulation step or stage 53, in which the phases are modulated within the frequency domain using references patterns or phases from a block 56.
  • a corresponding IDFT, windowing and overlap-add step or stage 54 is passed, which outputs the watermarked audio signal WAS.
  • the phase value modification can be controlled according to results from a psycho-acoustic analysis of the audio signal AS.
  • a secret key SK is used to generate pseudo-random phases in a step or stage 55, which controls a reference pattern or phase generation step or stage 56 that operates using the above-described inventive processing.
  • stage 56 does not include IDFT, windowing, DFT and overlap+add, but instead approximates these functions by a first frequency domain filtering by W for even subblocks and a different second frequency domain filtering by W and G for odd subblocks, thereby generating approximated versions of reference phase patterns for the audio signal subblocks.
  • step 60 nSymbols, nSubBlocks and the secret key SK are input.
  • running variable 'j' is set '0' and in step 612 running variable 'i' is set '0'.
  • step 62 (which is included in step/stage 55 in Fig. 5 ) a random phase is generated for subblocks 2i and 2i+2.
  • step 63 the spectra are constructed from the phases, whereby the magnitude equals '1'.
  • step 64 the WOL subblock 2i+1 and the WIN subblocks 2i and 2i+2 are calculated. Steps 63 and 64 are included in step/stage 56 in Fig. 5 .
  • step 65 'i' is incremented by '1'.
  • the running variable 'j' is incremented by '1' in step 67. If 'j' has reached value (nSymbols-1) in step 68, nSubBlocks reference spectra for nSymbols are output in step 69.
  • the order of even and odd subblocks is exchanged.
  • a combination of frequency-domain convolution and IDFT-windowing-DFT can be employed for the phase generation.
  • Eq.(8) is used for generating the reference spectra for an even subblock Eq.(8).
  • Eq.(10) is realised as a frequency-domain convolution
  • IDFT-windowing-DFT a time-domain subblock corresponding to ⁇ A 2i [ k ] - A 2i+2 [ k ] ⁇ is obtained via IDFT, and is thereafter weighted by a time-domain filter corresponding to ⁇ G [ k ] ⁇ .
  • the reference spectrum of the odd subblock is obtained by a DFT of the weighted subblock.
  • Such alternative processing typically has a higher computational complexity than the inventive processing described before but a lower complexity than the processing described in the background section, because the IDFT-windowing-DFT is applied to odd blocks only.

Abstract

For real-time audio watermarking, wherein the watermark embedding occurs in the frequency domain by modifying phases of spectra of the audio signal according to phase reference patterns, low-complexity watermark embedding is desired. Depending on a secret key SK, random reference phases for each watermark message symbol are generated, wherein the corresponding reference phase patterns are generated by approximating the inverse transform of the phase reference patterns and the corresponding transform and the windowing and windowing-overlapping effects, using two different short-length frequency-domain filters for even and odd blocks or subblocks. Thereby the computational complexity for generating the reference phase pattern blocks or subblocks is significantly reduced.

Description

  • The invention relates to a method and to an apparatus for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal.
  • Background
  • For real-time applications of audio watermarking in the context of home networking and for efficient implementation, e.g. in set-top boxes, low-complexity watermark embedding and/or detecting are desired, if robustness and perceptual transparency are not sacrificed significantly.
  • In WO 2007/031423 A1 and in M.Arnold, P.G.Baum, W.Voessing, "A phase modulation audio watermarking technique", 11th Information Hiding Workshop, pages 102-116, 2009, watermark embedding occurs in the phase domain by modifying phases of the host signal spectrum according to phase reference patterns. Specifically, the host audio signal is split into blocks of length LB and each block is used for embedding a watermark symbol. The watermark message is represented by a sequence of symbols. For embedding, each block can be further partitioned into NSB overlapping subblocks of LSB samples length with an overlap length of LSB /2. Following weighting of each subblock by a windowing function, phases of the weighted subblock spectrum are modified according to generated reference phases and according to results from a psycho-acoustic analysis of the host audio signal. Depending on a secret key SK (e.g. as seed of a random generator), reference phases for each watermark message symbol are generated by the following steps:
    • B1) Randomly generate independently phase vectors Θ i of
      length LSB , which are uniformly distributed over [-π,π), where 0 i N SB - 1 2 .
      Figure imgb0001
    • B2) Generate time-domain subblocks via inverse discrete Fourier transform (IDFT) of a normalised spectrum:
      • a' i =IDFT{e jθi }, where
        Figure imgb0002
        . Concatenate time-domain subblocks to get an LB -length signal a.
    • B3) Divide block a into NSB overlapped subblocks of length LSB with an overlap length of LSB /2, multiply each subblock with a windowing function, and perform a DFT for weighted subblocks to get reference spectra.
    • B4) Extract angles of reference spectra that serve as reference angles for the data embedding.
  • Generating phase vectors at the beginning enables the flexibility of controlling the phase vectors in case structured phase vectors rather than randomly generated ones will be employed for the embedding. Step B3) mimics the windowing-overlap process that is commonly used in the analysis stage of digital audio signal processing. A windowing function is selected such that an ideal reconstruction is guaranteed following ideal analysis (windowing-overlap) and synthesis (windowing-overlap-add) processes. Finally, generated reference angles in step B4) are employed to modify host audio spectrum phases after the host signal has undergone the windowing-overlap processing.
  • Invention
  • One possible problem of such phase generation processing is that many IDFTs/DFTs are required, which imposes constraints on the achievable speed of hardware/software implementations, and the deployment of related audio watermarking schemes is therefore limited for real-time applications. For example in a home network where the AV material is sent to the customer, in a set-top box the audio content is watermarked with a label that identifies the customer. In a general setting a copyright owner will license multiple different works to the customer. In order to avoid the serious security drawback of using only one secret key for all AV material, different keys for individual copyrighted works are used. Therefore efficient reference phase generation is a necessary prerequisite for deploying an audio watermarking scheme in such application scenario.
  • A problem to be solved by the invention is to provide more efficient generation of the reference phase patterns. This problem is solved by the method disclosed in claim 1. An apparatus that utilises this method is disclosed in claim 6.
  • The efficient generation of random reference phase patterns is carried out by approximating the inverse transform of the phase reference patterns, and the corresponding forward transform and the windowing and windowing-overlapping effects using two different short-length frequency-domain filters for even and odd blocks or subblocks. Thereby the computational complexity for generating the reference phase pattern blocks or subblocks for frequency domain phase modulation of the audio signal blocks or subblocks is significantly reduced, which in turn reduces the complexity of both watermark embedder at sender side and watermark detector at receiver side.
  • The inventive reference phase generation fulfils critical real-time requirements and enables a wide deployment of corresponding audio watermarking processings. Advantageously, the flexibility of controlling the generated phases is maintained, e.g. in case structured phases are used instead of random ones.
  • In principle, the inventive method is suited for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal, said method including the steps:
    • based on at least one secret key, generating blocks with random or pseudo-random phases for said watermarking for frequency domain blocks of said audio signal,
    • approximating - for said blocks with random or pseudo-random phases - an inverse transform and windowing and a corresponding transform and overlap+add by a first frequency domain filtering for even ones of said blocks and a different second frequency domain filtering for odd ones of said blocks, thereby generating approximated versions of said reference phase patterns for said blocks;
    • using said approximated versions for said phase modulation.
  • In principle the inventive apparatus is suited for generating reference phase patterns to be used for controlling watermarking by frequency domain phase modulation of spectra of an audio signal, said apparatus including:
    • means being adapted for generating - based on at least one secret key - blocks with random or pseudo-random phases for said watermarking for frequency domain blocks of said audio signal;
    • means being adapted for approximating - for said blocks with random or pseudo-random phases - an inverse transform and windowing and a corresponding transform and overlap+add by a first frequency domain filtering for even ones of said blocks and a different second frequency domain filtering for odd ones of said blocks, thereby generating approximated versions of said reference phase patterns for said blocks, wherein said approximated versions are to be used for said phase modulation.
  • Advantageous additional embodiments of the invention are disclosed in the respective dependent claims.
  • Drawings
  • Exemplary embodiments of the invention are described with reference to the accompanying drawings, which show in:
  • Fig. 1
    known reference spectra generation;
    Fig. 2
    magnitude distribution of F[k'-k];
    Fig. 3
    DFT spectrum of windowing function w1 [n] for LSB = 1024;
    Fig. 4
    principle of inventive reference spectra generation;
    Fig. 5
    block diagram for the inventive watermark embedder;
    Fig. 6
    inventive phase generation procedure.
    Exemplary embodiments
  • According to the invention, for the watermark embedding and for watermark detection reference spectra are directly evaluated, which approximates above steps B2) and B3). To elaborate this, the original reference phase generation is investigated in detail. In the sequel, a subblock with an even/odd subscript is referred to as an even/odd subblock, respectively.
  • Regarding above step B3), an audio signal block a is partitioned into NSB overlapped (50% overlapping) subblocks of length LSB denoted as {a m , 0 ≤m< NSB }. Accordingly, the even subblocks are non-overlapping and each odd subblock can be obtained by its two adjacent even subblocks. Specifically, the (2i+1)-th (i.e. odd) subblock is constructed from the upper half of subblock a 2i and the lower half of subblock a 2i+2 as follows: a 2 i + 1 n = a 2 i n + L SB / 2 if 0 n < L SB / 2 a 2 i + 1 n = a 2 i + 2 n - L SB / 2 if L SB / 2 n < L SB
    Figure imgb0003
  • The even subblocks are obtained according to step B2) as a 2i =IDFT{e jθi }. Next, the even and odd subblocks are weighted by a windowing function, denoted as {w[n], 0 ≤n< LSB }, and a windowed subblock is obtained as a w,m [n] = a m [n]w[n], 0 ≤n< LSB , 0 ≤m< NSB . Finally, the phases of spectra of windowed subblocks a w,m serve as reference phases for the watermark embedding, followed by DFT for the weighted subblocks to get reference spectra.
  • This known reference spectra generation processing is depicted in Fig. 1.
  • Reference spectra evaluation
  • How to directly evaluate reference spectra, i.e. DFTs of {a w,m , 0 ≤m< NSB }? For DFT, multiplication in time-domain corresponds to circular convolution in frequency-domain. Therefore reference spectra can be obtained via circular convolution of the spectra for subblocks before windowing and of the spectrum of the windowing function.
  • {Am [k], 0 ≤k< LSB -1} and {W[k], 0 ≤k< LSB -1} shall denote DFTs for the m-th subblock a m and for the windowing function, respectively. The reference spectrum for the m-th subblock is obtained by A w , m k = 1 L SB = 0 L SB - 1 A m W < k - > ,
    Figure imgb0004

    where the term '<...>' denotes a modulo operation that is defined by < n > = n - n L SB L SB ,
    Figure imgb0005
    which maps any integer number to
    the interval [0,LSB ), and where
    Figure imgb0006
    denotes the greatest integer number not larger than x.
  • Therefore the reference spectrum for an even subblock is A 2 i k = e j θ i k , A w , 2 i k = 1 L SB = 0 L SB - 1 e j θ i W < k - >
    Figure imgb0007
  • For an odd subblock, the subblock spectrum is calculated according to Eq.(1) by A 2 i + 1 k = - 1 k 2 A 2 i k + A 2 i + 2 k + 1 L SB = 0 L SB - 1 F - 1 - 1 A 2 i - A 2 i + 2 ,
    Figure imgb0008

    where F - k = 1 - - 1 - k 1 - e j 2 π L SB - k
    Figure imgb0009
    if k'≠k and F[0]=0.
  • The first term in Eq.(4) is the contribution from two spectral lines of the same index within two adjacent subblocks and the second term is the contribution from other spectral lines. The reference spectrum for an odd subblock is obtained by inserting Eq.(4) into Eq.(2). Therefore, for the evaluation of reference spectra, two frequency-domain filters {F[k]} and {W[k]} are involved. In case they can be well-approximated by a few filter taps, an efficient evaluation of reference spectra is feasible.
  • Efficient reference phase generation by filter approximation Approximation for odd subblock spectrum
  • The coefficient F[k'-k] in Eq.(4) has the following properties: F - k = F < - k > , F - k = F k -
    Figure imgb0010
  • For a typical subblock length, there are only a few dominant central terms for F[k'-k] with respect to the magnitude, as shown in Fig. 2 for LSB = 1024. Assuming that there are 2L 1 central dominant terms, i.e. for 0 <|k'-k|≤ L 1, the Eq.(4) spectrum of an odd subblock can be approximated by setting q = k'-k: A 2 i + 1 k - 1 k 2 A 2 i k + A 2 i + 2 k + 1 L SB q = - L 1 L 1 F q - 1 k + q A 2 i < k + q > - A 2 i + 2 < k + q >
    Figure imgb0011
  • Approximation for windowing function spectrum
  • Typically, the spectrum of the windowing function can also be approximated by a few dominant components. Examples of such a windowing function are: w 1 n = sin π n L SB , w 2 n = sin π 2 n + 1 2 L SB , for 0 n < L SB ,
    Figure imgb0012
    w 3 n = sin π n + 1 L SB + 2 for 0 n < L SB / 2 ; w 3 n = w 3 L SB - 1 - n for L SB / 2 n < L SB
    Figure imgb0013
  • All three windowing functions have similar spectrum magnitudes. For example, the spectrum of w 1[n] is illustrated in Fig. 3 for LSB = 1024, and there are only few dominant central terms.
  • Assuming that the windowing function spectrum can be well-approximated by 2Lw +1 components, the reference spectrum in Eq.(2) is then approximated by: A w , m k 1 L SB q = - L w L w A m < k + q > W < q >
    Figure imgb0014
  • Approximation for even subblock spectrum
  • For even subblocks, the reference spectra are approximated by: A w , 2 i k 1 L SB q = - L w L w e j θ i < k + q > W < q >
    Figure imgb0015
  • For odd subblocks, by taking the approximation of Eq.(6) into account, the reference spectra can be approximated by: A w , 2 i + 1 k 1 L SB q = - L w L w - 1 k + q 2 A 2 i < k + q > + A 2 i + 2 < k + q > W < q > + 1 L SB 2 q = - L w L w W < q > p = - L 1 L 1 F p - 1 k + p + q A 2 i < k + p + q > - A 2 i + 2 < k + p + q >
    Figure imgb0016
  • The second term in Eq.(9) can be interpreted as the spectrum difference that passes through two filters, which can be regarded as representing in total a single filter: A w , 2 i + 1 k 1 L SB q = - L w L w - 1 k + q 2 A 2 i < k + q > + A 2 i + 2 < k + q > W < q > + 1 L SB 2 q = - L g L g - 1 k + q A 2 i < k + q > - A 2 i + 2 < k + q > G < q >
    Figure imgb0017

    where Lg = Lw +L1 .
  • This filter G[<q>] is defined for -Lg qLg as G < q > = p = - L 1 L 1 F p W < q - p >
    Figure imgb0018
  • Efficient reference phase generation
  • Advantageously, Eq.(8) and Eq.(10) provide an efficient evaluation of the reference spectra, whereby only two frequency-domain filters {W[<q>]} and {G[<q>]} with a few dominant taps are required. The corresponding filter coefficients can be pre-calculated, and the approximation accuracy can be controlled by filter length parameters Lw and Lg . That is, the trade-off between robustness and computational complexity can be adjusted by choosing different Lw and Lg values.
  • This efficient generation of reference spectra is illustrated in Fig. 4, where WIN{...} and WOL{...} denote operations defined by Eq.(8) and Eq.(10), respectively.
  • I.e, instead of using IDFTs/DFTs for evaluating the reference spectra, simple frequency-domain filtering is employed for direct evaluation of the reference spectra, resulting in significant complexity reduction. Since reference phase generation is an inherent step or stage to be carried out in both audio watermark embedder and detector, the complexity of embedder/detector is also reduced correspondingly.
  • For the case where LSB = 1024, NSB = 31 and w 3[n] are used as windowing function, experiments have shown that Lw = 2 and Lg = 3 are already sufficient to get a robustness against a wide range of signal processing attacks that is very similar to that of the original processing.
  • With respect to computational complexity, the original processing employing FFTs/IFFTs requires on average 62 real-value multiplications and 62 real-value additions for the evaluation of each DFT coefficient of the reference spectra.
  • In contrast, the inventive processing (with Lw = 2 and Lg = 3) requires on average only 9 real multiplications and 25 real additions for each DFT coefficient.
  • In the block diagram of the inventive watermark embedder in Fig. 5, a received audio signal AS is sampled in an acquisition or receiving section step or stage 51, and thereafter passes through a segmentation, windowing and DFT step or stage 52 to a phase modulation step or stage 53, in which the phases are modulated within the frequency domain using references patterns or phases from a block 56. Following the phase modulation, a corresponding IDFT, windowing and overlap-add step or stage 54 is passed, which outputs the watermarked audio signal WAS.
  • In a current block or subblock, the phase value modification can be controlled according to results from a psycho-acoustic analysis of the audio signal AS.
  • A secret key SK is used to generate pseudo-random phases in a step or stage 55, which controls a reference pattern or phase generation step or stage 56 that operates using the above-described inventive processing. In contrast to the prior art, stage 56 does not include IDFT, windowing, DFT and overlap+add, but instead approximates these functions by a first frequency domain filtering by W for even subblocks and a different second frequency domain filtering by W and G for odd subblocks, thereby generating approximated versions of reference phase patterns for the audio signal subblocks.
  • The inventive phase generation processing is depicted in Fig. 6. In step 60, nSymbols, nSubBlocks and the secret key SK are input. In step 611 running variable 'j' is set '0' and in step 612 running variable 'i' is set '0'. A first outer loop L1 runs from j = 0 to j = (nSymbols-1). A second inner loop L2 runs from i = 0 to 2i+2 = (nSubBlocks-1). In loop L2, in step 62 (which is included in step/stage 55 in Fig. 5) a random phase is generated for subblocks 2i and 2i+2. In step 63 the spectra are constructed from the phases, whereby the magnitude equals '1'. In step 64 the WOL subblock 2i+1 and the WIN subblocks 2i and 2i+2 are calculated. Steps 63 and 64 are included in step/stage 56 in Fig. 5. In step 65, 'i' is incremented by '1'. In the outer loop L2, following checking step 66, the running variable 'j' is incremented by '1' in step 67. If 'j' has reached value (nSymbols-1) in step 68, nSubBlocks reference spectra for nSymbols are output in step 69.
  • In an alternative embodiment, the order of even and odd subblocks is exchanged.
  • As a further alternative embodiment, a combination of frequency-domain convolution and IDFT-windowing-DFT can be employed for the phase generation. Specifically, for generating the reference spectra for an even subblock Eq.(8) is used. For an odd subblock the first term of Eq.(10) is realised as a frequency-domain convolution, while the second term is realised by IDFT-windowing-DFT. That is, a time-domain subblock corresponding to {A2i [k] - A2i+2 [k]} is obtained via IDFT, and is thereafter weighted by a time-domain filter corresponding to {G[k]}. The reference spectrum of the odd subblock is obtained by a DFT of the weighted subblock.
  • Such alternative processing typically has a higher computational complexity than the inventive processing described before but a lower complexity than the processing described in the background section, because the IDFT-windowing-DFT is applied to odd blocks only.

Claims (11)

  1. Method for generating (56; 63, 64) reference phase patterns to be used for controlling watermarking by frequency domain phase modulation (53) of spectra of an audio signal (AS), said method including the steps:
    - based on at least one secret key (SK), generating (55; 62) blocks with random or pseudo-random phases for said watermarking for frequency domain blocks of said audio signal (AS),
    characterised by:
    - approximating (56; 63, 64) - for said blocks with random or pseudo-random phases - an inverse transform and windowing and a corresponding transform and overlap+add by a first frequency domain filtering (WIN) for even ones of said blocks and a different second frequency domain filtering (WOL) for odd ones of said blocks, thereby generating approximated versions of said reference phase patterns for said blocks;
    - using said approximated versions for said phase modulation (53).
  2. Method according to claim 1, wherein said first frequency domain filtering (WIN) is A w , 2 i k 1 L SB q = - L w L w e j θ i < k + q > W < q >
    Figure imgb0019

    and said second frequency domain filtering (WOL) is A w , 2 i + 1 k 1 L SB q = - L w L w - 1 k + q 2 A 2 i < k + q > + A 2 i + 2 < k + q > W < q > + 1 L SB 2 q = - L g L g - 1 k + q A 2 i < k + q > - A 2 i + 2 < k + q > G < q > ,
    Figure imgb0020

    wherein i is the running number of a block and k is the position within a block and LSB is the length of said blocks, which blocks can be subblocks of blocks,
    and wherein Lw defines the length 2Lw +1 of central components used from a frequency domain filter W representing the windowing function spectrum,
    and wherein the term '<...>' denotes a modulo operation that is defined by < n > = n - n L SB L SB
    Figure imgb0021
    where
    Figure imgb0022
    denotes the greatest integer number not larger than x,
    and wherein Lg = Lw +L1 and L1 defines the length 2L1 of central components used from the function F - k = 1 - - 1 - k 1 - e j 2 π L SB - k
    Figure imgb0023
    if k'≠k and F[0]=0, 0 ≤k,k'< LSB -1,
    and wherein G < q > = p = - L 1 L 1 F p W < q - p > .
    Figure imgb0024
  3. Method according to claim 1 or 2, wherein within a current audio signal block in the frequency domain the phase of the audio signal is modified adaptively using psycho-acoustic calculations.
  4. Method according to claim 2 or 3, wherein in case there are 31 subblocks per audio signal block and LSB = 1024, and w 3 n = sin π n + 1 L SB + 2 for 0 n < L SB / 2 ; w 3 n = w 3 L SB - 1 - n for L SB / 2 n < L SB
    Figure imgb0025

    is used as windowing function, Lw = 2 and Lg = 3.
  5. Method according to claim 1, wherein the second term of said second frequency domain filtering is replaced by the known IDFT-windowing-DFT processing.
  6. Apparatus for generating (56; 63, 64) reference phase patterns to be used for controlling watermarking by frequency domain phase modulation (53) of spectra of an audio signal (AS), said apparatus including:
    - means (55) being adapted for generating - based on at least one secret key (SK) - blocks with random or pseudo-random phases for said watermarking for frequency domain blocks of said audio signal (AS);
    - means (56) being adapted for approximating - for said blocks with random or pseudo-random phases - an inverse transform and windowing and a corresponding transform and overlap+add by a first frequency domain filtering (WIN) for even ones of said blocks and a different second frequency domain filtering (WOL) for odd ones of said blocks, thereby generating approximated versions of said reference phase patterns for said blocks, wherein said approximated versions are to be used for said phase modulation (53).
  7. Apparatus according to claim 6, wherein said first frequency domain filtering (WIN) is A w , 2 i k 1 L SB q = - L w L w e j θ i < k + q > W < q >
    Figure imgb0026

    and said second frequency domain filtering (WOL) is A w , 2 i + 1 k 1 L SB q = - l w L w - 1 k + q 2 A 2 i < k + q > + A 2 i + 2 < k + q > W < q >
    Figure imgb0027
    1 L SB 2 q = - L g L g - 1 k + q A 2 i < k + q > - A 2 i + 2 < k + q > G < q > ,
    Figure imgb0028

    wherein i is the running number of a block and k is the position within a block and LSB is the length of said blocks, which blocks can be subblocks of blocks,
    and wherein Lw defines the length 2Lw +1 of central components used from a frequency domain filter W representing the windowing function spectrum,
    and wherein the term '<...>' denotes a modulo operation that is defined by <n> = n - n L SB L SB
    Figure imgb0029
    where
    Figure imgb0030
    denotes the greatest integer number not larger than x,
    and wherein Lg = Lw +L1 and L1 defines the length 2L1 of central components used from the function
    F - k = 1 - - 1 - k 1 - e j 2 π L SB - k
    Figure imgb0031
    if k'≠k and F[0]=0, 0 ≤k,k'< LSB -1,
    and wherein G < q > = p = - L 1 L 1 F p W < q - p > .
    Figure imgb0032
  8. Apparatus according to claim 6 or 7, wherein within a current audio signal block in the frequency domain the phase of the audio signal is modified adaptively using psycho-acoustic calculations.
  9. Apparatus according to claims 7 or 8, wherein in case there are 31 subblocks per audio signal block and LSB = 1024, and w 3 n = sin π n + 1 L SB + 2 for 0 n < L SB / 2 ;
    Figure imgb0033
    w 3[n]=w 3[LSB -1-n] for LSB /2 ≤n< LSB is used as windowing function, Lw = 2 and Lg = 3.
  10. Digital audio signal that was watermarked according to the method of one of claims 1 to 5.
  11. Storage medium that contains or stores, or has recorded on it, a digital audio signal according to claim 10.
EP10306091A 2010-10-06 2010-10-06 Method and Apparatus for generating reference phase patterns Withdrawn EP2439735A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP10306091A EP2439735A1 (en) 2010-10-06 2010-10-06 Method and Apparatus for generating reference phase patterns

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP10306091A EP2439735A1 (en) 2010-10-06 2010-10-06 Method and Apparatus for generating reference phase patterns

Publications (1)

Publication Number Publication Date
EP2439735A1 true EP2439735A1 (en) 2012-04-11

Family

ID=43607892

Family Applications (1)

Application Number Title Priority Date Filing Date
EP10306091A Withdrawn EP2439735A1 (en) 2010-10-06 2010-10-06 Method and Apparatus for generating reference phase patterns

Country Status (1)

Country Link
EP (1) EP2439735A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014062332A1 (en) * 2012-10-16 2014-04-24 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermark detection and extraction
US9596521B2 (en) 2014-03-13 2017-03-14 Verance Corporation Interactive content acquisition using embedded codes
US9639911B2 (en) 2014-08-20 2017-05-02 Verance Corporation Watermark detection using a multiplicity of predicted patterns
US9769543B2 (en) 2014-11-25 2017-09-19 Verance Corporation Enhanced metadata and content delivery using watermarks
US9942602B2 (en) 2014-11-25 2018-04-10 Verance Corporation Watermark detection and metadata delivery associated with a primary content
CN108417199A (en) * 2013-01-18 2018-08-17 株式会社东芝 Audio watermark information detection device and audio watermark information detection method
US10277959B2 (en) 2014-12-18 2019-04-30 Verance Corporation Service signaling recovery for multimedia content using embedded watermarks
US10504200B2 (en) 2014-03-13 2019-12-10 Verance Corporation Metadata acquisition using embedded watermarks
US11722741B2 (en) 2021-02-08 2023-08-08 Verance Corporation System and method for tracking content timeline in the presence of playback rate changes

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007031423A1 (en) 2005-09-16 2007-03-22 Thomson Licensing Blind watermarking of audio signals by using phase modifications

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007031423A1 (en) 2005-09-16 2007-03-22 Thomson Licensing Blind watermarking of audio signals by using phase modifications

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
FERRIS T L J ET AL: "Frequency domain method for windowing in Fourier analysis", ELECTRONICS LETTERS, vol. 28, no. 15, 16 July 1992 (1992-07-16), pages 1440, XP002626131, ISSN: 0013-5194, DOI: 10.1049/el:19920916 *
M.ARNOLD; P.G.BAUM; W.VOESSING: "A phase modulation audio watermarking technique", LLTH INFORMATION HIDING WORKSHOP, 2009, pages 102 - 116
MICHAEL ARNOLD ET AL: "A Phase Modulation Audio Watermarking Technique", 8 June 2009, INFORMATION HIDING, SPRINGER BERLIN HEIDELBERG, BERLIN, HEIDELBERG, PAGE(S) 102 - 116, ISBN: 978-3-642-04430-4, XP019129276 *

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9368123B2 (en) 2012-10-16 2016-06-14 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermark detection and extraction
WO2014062332A1 (en) * 2012-10-16 2014-04-24 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermark detection and extraction
CN108417199B (en) * 2013-01-18 2022-11-22 株式会社东芝 Audio watermark information detection device and audio watermark information detection method
CN108417199A (en) * 2013-01-18 2018-08-17 株式会社东芝 Audio watermark information detection device and audio watermark information detection method
US10110971B2 (en) 2014-03-13 2018-10-23 Verance Corporation Interactive content acquisition using embedded codes
US9596521B2 (en) 2014-03-13 2017-03-14 Verance Corporation Interactive content acquisition using embedded codes
US9681203B2 (en) 2014-03-13 2017-06-13 Verance Corporation Interactive content acquisition using embedded codes
US10504200B2 (en) 2014-03-13 2019-12-10 Verance Corporation Metadata acquisition using embedded watermarks
US10499120B2 (en) 2014-03-13 2019-12-03 Verance Corporation Interactive content acquisition using embedded codes
US9854331B2 (en) 2014-03-13 2017-12-26 Verance Corporation Interactive content acquisition using embedded codes
US9854332B2 (en) 2014-03-13 2017-12-26 Verance Corporation Interactive content acquisition using embedded codes
US10445848B2 (en) 2014-08-20 2019-10-15 Verance Corporation Content management based on dither-like watermark embedding
US10354354B2 (en) 2014-08-20 2019-07-16 Verance Corporation Content synchronization using watermark timecodes
US9805434B2 (en) 2014-08-20 2017-10-31 Verance Corporation Content management based on dither-like watermark embedding
US9639911B2 (en) 2014-08-20 2017-05-02 Verance Corporation Watermark detection using a multiplicity of predicted patterns
US10178443B2 (en) 2014-11-25 2019-01-08 Verance Corporation Enhanced metadata and content delivery using watermarks
US9942602B2 (en) 2014-11-25 2018-04-10 Verance Corporation Watermark detection and metadata delivery associated with a primary content
US9769543B2 (en) 2014-11-25 2017-09-19 Verance Corporation Enhanced metadata and content delivery using watermarks
US10277959B2 (en) 2014-12-18 2019-04-30 Verance Corporation Service signaling recovery for multimedia content using embedded watermarks
US11722741B2 (en) 2021-02-08 2023-08-08 Verance Corporation System and method for tracking content timeline in the presence of playback rate changes

Similar Documents

Publication Publication Date Title
EP2439735A1 (en) Method and Apparatus for generating reference phase patterns
EP2293294B1 (en) Device and method for manipulating an audio signal having a transient event
CN102741921B (en) Improved subband block based harmonic transposition
CN102985970B (en) Improved magnitude response and temporal alignment in phase vocoder based bandwidth extension for audio signals
Wang et al. EMD and psychoacoustic model based watermarking for audio
Lang et al. Audio watermark attacks: from single to profile attacks
Pourhashemi et al. Audio watermarking based on synergy between Lucas regular sequence and Fast Fourier Transform
Unoki et al. Robust, blindly-detectable, and semi-reversible technique of audio watermarking based on cochlear delay characteristics
Islam et al. Robust dual domain twofold encrypted image-in-audio watermarking based on SVD
Moorer A note on the implementation of audio processing by short-term fourier transform
Hemis et al. Adjustable audio watermarking algorithm based on DWPT and psychoacoustic modeling
Hemis et al. Digital watermarking in audio for copyright protection
Nouri et al. Conceptual authentication speech hashing base upon hypotrochoid graph
Zhang Audio dual watermarking scheme for copyright protection and content authentication
JP5649084B2 (en) Apparatus and method for processing transient audio events in an audio signal when changing playback speed or pitch
Singh et al. Multiplicative watermarking of audio in DFT magnitude
Salcic Audio watermarking
Lin et al. Audio watermarking techniques
Subhashini et al. Robust audio watermarking for monitoring and information embedding
Kumar et al. Increased watermark-to-host correlation of uniform random phase watermarks in audio signals
Nouri et al. Analysis of a novel audio hash function based upon stationary wavelet transform
Naqash et al. Robust Audio Watermarking Based on Iterative Filtering
Nishimura Reversible and robust audio watermarking based on spread spectrum and amplitude expansion
Safitri et al. Audio Watermarking Combined with Compressive Sampling Based on QIM and DST-QR Techniques
Garlapati et al. A symbol based watermarking approach for spread spectrum audio watermarking methods

Legal Events

Date Code Title Description
AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20121012