US20110125508A1 - Data embedding system - Google Patents

Data embedding system Download PDF

Info

Publication number
US20110125508A1
US20110125508A1 US12/994,716 US99471609A US2011125508A1 US 20110125508 A1 US20110125508 A1 US 20110125508A1 US 99471609 A US99471609 A US 99471609A US 2011125508 A1 US2011125508 A1 US 2011125508A1
Authority
US
United States
Prior art keywords
echo
audio signal
echoes
delay
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/994,716
Inventor
Peter Kelly
Michael Raymond Reynolds
Christopher John Joseph Sutton
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intrasonics SARL
Original Assignee
Intrasonics SARL
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/GB2008/001820 external-priority patent/WO2008145994A1/en
Priority claimed from GB0814041A external-priority patent/GB2462588A/en
Application filed by Intrasonics SARL filed Critical Intrasonics SARL
Assigned to INTRASONICS S.A.R.L. reassignment INTRASONICS S.A.R.L. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: REYNOLDS, MICHAEL REYMOND, KELLY, PETER, Sutton, Christopher John Joseph
Publication of US20110125508A1 publication Critical patent/US20110125508A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H20/00Arrangements for broadcast or for distribution combined with broadcast
    • H04H20/28Arrangements for simultaneous broadcast of plural pieces of information
    • H04H20/30Arrangements for simultaneous broadcast of plural pieces of information by a single channel
    • H04H20/31Arrangements for simultaneous broadcast of plural pieces of information by a single channel using in-band signals, e.g. subsonic or cue signal

Definitions

  • the present invention relates to a system for embedding data in an audio signal and to its subsequent recovery, which can be used for watermarking, data communications, audience surveying etc.
  • the invention has particular relevance, to a system for hiding data in an audio signal by adding echoes to the audio signal and to a system for recovering the hidden data by detecting the added echoes.
  • U.S. Pat. No. 5,893,067 discloses a technique for hiding data within an audio signal for transmission to a remote receiver.
  • the data is hidden in the audio signal by adding an artificial echo to the audio signal and varying the amplitude and/or delay of the echo in accordance with the data to be hidden.
  • the present invention aims to provide an alternative data hiding technique.
  • One embodiment of the invention at least alleviates the above problem by applying echoes of opposite polarity to represent each data value.
  • the present invention provides a method of embedding a data value in an audio signal, the method comprising: generating an echo of at least a portion of the received audio signal; and embedding the data value in the audio signal by combining the received audio signal with the generated echo; wherein the data value is embedded in the audio by varying the polarity of the echo that is combined with the audio signal in dependence upon the data value.
  • the inventors have found that using polarity modulation to embed the data in the audio signal can make the recovery of the embedded data easier in the receiver, especially in the presence of natural echoes caused, by for example the acoustics of the room.
  • the polarity modulation can be achieved by varying the echo that is generated and/or by varying the way in which the echo is combined with the audio signal.
  • the generating step generates a first echo of at least a portion of the received audio signal and a second echo of at least a portion of the received audio signal, the first and second echoes having first and second polarities respectively, which polarities vary in dependence upon the data value; and wherein the embedding step embeds the data value in the audio signal by combining the received audio signal with the generated first and second echoes.
  • Each of the echoes may be generated by repeating at least a part of said audio signal.
  • the first echo may be generated by repeating a first portion of the audio signal and the second echo may be generated by repeating a second portion of said audio signal.
  • the first and second echoes may be generated by repeating substantially the same first portion of the audio signal.
  • the or each echo may be generated by passing the stream of audio samples through a delay line.
  • third and fourth echoes may be generated, the third echo having the same polarity as said second echo and the fourth echo having the same polarity as said first echo.
  • the third and fourth echoes may be generated by repeating substantially the same second portion of the audio signal which is different to the first portion repeated by the first and second echoes.
  • the second portion of the audio signal may be adjacent to the first portion.
  • the generating step may generate the third and fourth echoes by inverting the polarity of a gain factor applied to the echoes before being combined with the audio signal.
  • the first echo may be combined with the audio signal at a first delay relative to the first portion of the audio signal; the second echo may be combined with the audio signal at a second delay relative to said first echo; the third echo may be combined with said audio signal at a third delay relative to said second portion of the audio signal; and the fourth echo may be combined with the audio signal at a fourth delay relative to the third echo.
  • the first delay may be equal to said third delay and/or the second delay may be equal to said fourth delay.
  • the delays and the amplitudes of the echoes are independent of the data value.
  • the first and third delays are between 0.5 ms and 100 ms and more preferably between 5 ms and 40 ms; and the second and fourth delays delayed (relative to the first and third echoes respectively) by between 0.125 ms and 3 ms and more preferably between 0.25 ms and 1 ms, as these delays are similar to those of natural echoes and so are less noticeable to users.
  • the or each echo has an amplitude that is less than the amplitude of said audio signal.
  • the or each echo is faded in and out to reduce obtrusiveness of the echoes to a listener.
  • the first and second portions of the audio signal should be long enough for the receiver to be able to detect the presence of the echoes but not too long as to overly reduce the data rate that can be communicated.
  • the inventors have found that echoes having durations of between 20 ms and 500 ms provides a reasonable data rate whilst keeping to a minimum data transmission errors when transmission occurs over an acoustic link. If transmission is over an electrical link, then shorter echoes may be used.
  • the echoes may be combined with the audio signal by adding and/or subtracting the echoes to/from the audio signal.
  • the polarity of each echo can therefore be controlled by controlling the way in which each echo is combined with the audio signal.
  • This aspect of the invention also provides a computer implementable instructions product comprising computer implementable instructions for causing a programmable computer device to carry out the method described above.
  • This aspect also provides an apparatus for embedding a data value in an audio signal, the apparatus comprising: an echo generator operable to generate an echo of at least a portion of the received audio signal; and a combiner operable to combine the received audio signal with the generated first and second echoes to embed the data value in the audio signal; wherein the echo generator and/or the combiner are arranged so that the data value is embedded in the audio by varying the polarity of the echo that is combined with the audio signal in dependence upon the data value.
  • the present invention provides a method of recovering a data value embedded in an audio signal, the method comprising: receiving an input signal having the audio signal and an echo of at least part of the audio signal whose polarity depends upon said data value; processing the received input signal to determine the polarity of the echo; and recovering the data value from the determined polarity.
  • the input signal may comprise a first echo of at least a portion of the audio signal and a second echo of at least a portion of the audio signal, the first and second echoes having first and second polarities respectively, which polarities vary in dependence upon the data value; and wherein the processing step processes the input signal to combine the first and second echoes and to determine the polarity of the combined echoes and wherein the recovering step recovers the data value from the determined polarity of the combined echoes.
  • the processing step processes the input signal to determine a first autocorrelation measure that depends upon the first echo and a second autocorrelation measure that depends upon the second echo and combines the echoes by differencing the first and second autocorrelation measures and determines the polarity of the combined echoes by determining the polarity of the result of the differencing step.
  • the first echo may be of a first portion of the audio signal and the second echo may be of a second portion of the audio signal.
  • the first and second echoes may be repeats of substantially the same portion of the audio signal.
  • the input signal comprises first, second, third and fourth echoes, the first and fourth echoes having the same polarity and the second and third echoes having the same polarity which is opposite to the polarity of the first and fourth echoes, wherein the processing step processes the input signal to combine the first to fourth echoes and to determine the polarity of the combined echoes and wherein the recovering step recovers the data value from the determined polarity of the combined echoes.
  • the processing step may process the input signal to determine a first autocorrelation measure that depends upon the first echo, a second autocorrelation measure that depends upon the second echo, a third autocorrelation measure that depends upon the third echo and a fourth autocorrelation measure that depends upon the fourth echo and combines the echoes by differencing the autocorrelation measures and determines the polarity of the combined echoes by determining the polarity of a result of the differencing step.
  • the differencing step may perform a first difference of the first and third autocorrelation measures, a second difference of the second and fourth autocorrelation measures, a third difference of the result of said first difference and the result of the second difference and wherein the polarity of the combined echoes may be determined from the polarity of a result of the third difference.
  • the first and second echoes may be repeats of substantially the same first portion of the audio signal and the third and fourth echoes may be repeats of substantially the same second portion of the audio signal.
  • the first and third echoes may be repeats of substantially the same first portion of the audio signal and the second and fourth echoes may be repeats of substantially the same second portion of the audio signal.
  • the or each echo is faded in and out to reduce obtrusiveness of the echoes to a listener.
  • the polarity of the echo may be determined when the amplitude of the echo is at or near a maximum.
  • the first echo may be delayed relative to said first portion of the audio signal by a first delay; the second echo may be delayed relative to the first echo by a second delay; the third echo may be delayed relative to the second portion of the audio signal by a third delay; and the fourth echo may be delayed relative to the third echo by a fourth delay.
  • the first delay may be equal to the third delay and/or the second delay may be equal to said fourth delay.
  • a computer implementable instructions product comprising computer implementable instructions for causing a programmable computer device to carry out the above method.
  • This aspect also provides an apparatus for recovering a data value embedded in an audio signal, the apparatus comprising: an input for receiving an input signal having the audio signal and an echo of at least part of the audio signal whose polarity depends upon said data value; a processor operable to process the input signal to determine the polarity of the echo; and a data regenerator operable to recover the data value from the determined polarity.
  • FIG. 1 is a block diagram illustrating the main components of a transmitter and receiver used in an exemplary embodiment
  • FIG. 2 a is an impulse plot illustrating the echoes that are added to an audio signal to encode a binary “one”
  • FIG. 2 b is an impulse plot illustrating the echoes that are added to an audio signal to encode a binary “zero”
  • FIG. 3 a is an impulse plot illustrating the presence of artificial echoes for a binary “one” after Manchester encoding and illustrating natural echoes;
  • FIG. 3 b is an impulse plot illustrating the presence of artificial echoes for a binary “zero” after Manchester encoding and illustrating natural echoes;
  • FIG. 4 is a block diagram illustrating in more detail the encoding performed in the transmitter shown in FIG. 1 ;
  • FIG. 5 is a block diagram illustrating the main components of an echo generation and shaping module forming part of the transmitter shown in FIG. 1 ;
  • FIG. 6 a illustrates a shaping and modulation function that is applied to the echoes prior to being combined with the audio signal when a binary “one” is to be transmitted;
  • FIG. 6 b illustrates a shaping and modulation function that is applied to the echoes prior to being combined with the audio signal when a binary “zero” is to be transmitted;
  • FIG. 6 c illustrates the way in which the shaping and modulation function varies when two successive binary “ones” are to be transmitted
  • FIG. 6 d illustrates the shaping and modulation function that is applied when a binary “zero” is transmitted after a binary “one”;
  • FIG. 7 illustrates the processing performed in the receiver shown in FIG. 1 for recovering the hidden data from the received audio signal
  • FIG. 8 a is an autocorrelation plot for a typical audio signal without artificial echoes
  • FIG. 8 b is an autocorrelation plot for the audio signal with artificial echoes during a first half of a bit symbol
  • FIG. 8 c is an autocorrelation plot for the audio signal with artificial echoes during the second half of the bit symbol
  • FIG. 8 d is a plot obtained by subtracting the autocorrelation plot shown in FIG. 8 c from the autocorrelation plot shown in FIG. 8 b;
  • FIG. 9 is a block diagram illustrating an alternative form of receiver used to receive and recover the hidden data embedded in the audio signal
  • FIG. 10 is a plot illustrating the way in which an FEC error count varies during a synchronisation process used to find the hidden data message within the input signal.
  • FIGS. 11 a and 11 b illustrate the processing performed respectively by an FEC encoder and an FEC decoder in one embodiment.
  • FIG. 1 is a block diagram illustrating a transmitter and receiver system according to one embodiment in which a transmitter 1 transmits data hidden within an acoustic signal 3 to a remote receiver 5 .
  • the transmitter 1 may form part of a television or radio distribution network and the receiver may be a portable device such as a cellular telephone handset that is capable of detecting the acoustic signal 3 output by the transmitter 1 .
  • the transmitter 1 includes a forward error and correction (FEC) encoder module 7 , which receives and encodes the input data to be transmitted to the remote receiver 5 .
  • the encoded message data output from the FEC encoding module 7 is then passed to an echo generation and shaping module 9 , which also receives an audio signal in which the encoded message data is to be hidden.
  • the echo generation and shaping module 9 then hides the message data into the audio by generating echoes of the audio which depend upon the message data to be transmitted.
  • the generated echoes are then combined with the original audio signal in a combiner module 11 and the resulting modified audio signal is then passed to a gain control module 13 for appropriate gain control.
  • the audio signal is then converted from a digital signal to an analogue signal by the digital to analogue converter 15 and it is then amplified by a driver module 17 for driving a loudspeaker 19 which generates the acoustic signal 3 having the data hidden therein.
  • the polarity of the echoes is varied in order to encode the data to be transmitted.
  • the inventors have found that this polarity modulation can be more robust in the presence of natural echoes and periodicities in the audio signal. This is particularly the case when each data value is represented by two echoes of the same magnitude but having different lags and opposite polarities.
  • the polarities of the echoes representing each message bit are reversed to distinguish between a binary zero and a binary one. This is illustrated by the impulse plots illustrated in FIG. 2 .
  • FIG. 2 a is an impulse plot illustrating the component signals that are present when a binary one is to be transmitted and FIG.
  • FIG. 2 b is an impulse plot illustrating the component signals present when a binary zero is to be transmitted.
  • the component signals include an initial impulse 21 representing the original audio signal followed by two lower amplitude impulses 23 - 1 and 23 - 2 representing the two echoes of the original signal component 21 which are added to the audio signal.
  • FIGS. 2 a and 2 b when a binary one is to be transmitted, a positive echo 23 - 1 is transmitted first followed by a negative echo 23 - 2 ; and when transmitting a binary zero a negative echo 23 - 1 is transmitted first followed by a positive echo 23 - 2 .
  • the first echo is added with a lag of approximately ten milliseconds and the second echo is added 0.25 milliseconds after the first echo. This is the same regardless of whether a binary one or a binary zero is to be transmitted.
  • the echoes that are added have lower amplitudes compared with the amplitude of the original audio signal. In particular, in this embodiment, the amplitude of the echoes is approximately one third that of the original audio signal.
  • FIG. 1 also illustrates the main components of the receiver 5 .
  • the receiver includes a microphone 31 for detecting the acoustic signal 3 and for converting it into a corresponding electrical signal which is then filtered and amplified by filter and amplification circuitry 33 .
  • the output from the filter amplification circuitry 33 is then digitised by an analogue to digital converter 35 and the digital samples are then passed to an echo detector 37 .
  • the echo detector 37 then processes the digital samples to identify the polarities of the echoes in the received signal.
  • This information is then passed through a data recovery module 39 which processes the echo information to recover the encoded message data.
  • This message data is then decoded by a decoder 41 to recover the original data that was input to the FEC decoding module of the transmitter 1 .
  • the echo detector 37 detects the echoes from the received signal by calculating the auto-correlation of the received signal at specified delays.
  • natural echoes e.g. room echoes
  • the message data is also Manchester encoded so that a message data value of “1” is transmitted as a “1”, followed by a “0” (or vice versa), whilst a message data value of “0” is transmitted as a “0” followed by a “1”.
  • this Manchester encoding is performed by the echo generation and shaping module 9 .
  • the reason that the Manchester encoding can help to distinguish the artificial echoes from the natural echoes is that the natural echoes will be stable over the two half symbol periods. Therefore, by subtracting the autocorrelations in the second half of the symbol from autocorrelations in the first half of the symbol (or vice versa), the effect of the natural echoes and periodicities will cancel, whilst the autocorrelation peaks caused by the artificial echoes will add constructively. Similarly, the reason for using two echoes in each half symbol period is to distinguish the artificial echoes from periodicities in the original track. Typically, the autocorrelation of the original track will not change significantly between these two lags (i.e. between 10 ms and 10.25 ms). Therefore, by differencing the autocorrelations at the two lags, the effect of the periodicities is reduced and the autocorrelation peaks caused by the two echoes add constructively.
  • FIGS. 3 a and 3 b are impulse plots showing the two half symbols and the artificial echoes 23 that are added within each half symbol period to represent a binary “1” and a binary “0” respectively.
  • FIGS. 3 a and 3 b also illustrate natural echoes 25 - 1 and 25 - 2 which do not change from one half period to the next. Therefore, by subtracting the echoes in one half of the symbol period from the corresponding echoes (i.e. those with the same lag or delay) in the other half of the symbol period, the effect of the natural echoes and periodicities will cancel, whilst the artificial echoes will add constructively, thereby making it easier to detect the hidden data.
  • FIG. 4 is a block diagram illustrating the main components of the FEC encoder module 7 used in this embodiment.
  • the first encoding module is a Reed-Solomon encoder module 51 which uses a shortened ( 13 , 6 ) block code to represent the input data.
  • the data output from the Reed-Solomon encoder 51 is then passed to a convolutional encoder 53 which performs convolutional encoding on the data.
  • the data bits output from the convolutional encoder 53 are then interleaved with each other by a data interleaving module 55 to protect against errors occurring in bursts.
  • a synchronisation data adder module 57 adds a sequence of synchronisation bits that will help the receiver 5 lock on to the encoded data within the received acoustic signal 3 .
  • the output from the synchronisation data adder module 57 represents the message data which is then passed to the echo generation and shaping module 9 shown in FIG. 1 .
  • FIG. 5 is a block diagram illustrating the main components of the echo generation and shaping module 9 and the combiner module 11 shown in FIG. 1 .
  • the input audio signal is represented by the sequence of audio samples a(n) which are applied to a 10 millisecond delay unit 61 and to the adder 63 (corresponding to the combiner 11 shown in FIG. 1 ).
  • the 10 millisecond delay unit 61 delays the input sample a(n) by 10 milliseconds which it then outputs to a 0.25 millisecond delay unit 65 and to a subtractor 67 .
  • the 0.25 millisecond delay unit 65 delays the audio sample output from the 10 millisecond delay unit 61 by a further 0.25 milliseconds which it then outputs to the subtractor 67 .
  • the subtractor 67 subtracts the 10.25 millisecond delayed sample from the 10 millisecond delayed sample outputting the result to a multiplier 69 .
  • the delay units and the subtractor operate each time a new audio sample a(n) arrives.
  • the audio sample frequency is one of 8 kHz, 32 kHz, 44.1 kHz or 48 kHz.
  • the 10 millisecond delay unit 61 will generate the two echoes 23 - 1 and 23 - 2 illustrated in FIG. 2 .
  • the echoes that have been generated do not depend on the data to be transmitted. As will be explained below, this dependency is achieved by multiplying the echoes in the multiplier 69 with a modulation function g(n) that is output by a lookup table 71 which is addressed by lookup table address logic 73 in response to the current message data value.
  • the lookup table output g(n) changes the polarity of the echoes in dependence upon the message data so that the echoes with the modulated polarities can then be added back to the original audio signal by the adder 63 to generated the echo-modulated audio output signal.
  • the lookup table output g(n) is gradually increased and decreased so that the echoes are effectively faded in and out.
  • FIG. 6 a is a plot illustrating the way in which the lookup table output g(n) varies over one symbol period, when the bit value of the message data is a binary “1”.
  • the symbol period is 100 ms.
  • the function g(n) increases from zero to a maximum value and then decreases back to zero at the end of the first half of the symbol period.
  • the function g(n) is negative and increases in magnitude to a maximum negative value and then decreases back to zero.
  • the gradual increasing and decreasing of the lookup table output g(n) is achieved by using a sinusoidal function. Therefore, during the first half of the symbol, the combined echoes output from the subtractor 67 will be multiplied by a positive value and so their polarity will not be changed when they are multiplied by g(n) in the multiplier 69 .
  • the lookup table output g(n) is negative and therefore, the polarities of the echoes output from the subtractor 67 will be reversed when the echoes are multiplied by g(n) in the multiplier 69 .
  • the artificial echoes 23 that are generated and added to the audio signal have an amplitude which is approximately a third that of the audio signal.
  • the amplitude of the echoes is controlled by the output of the lookup table g(n).
  • the peak amplitude of the lookup table output g(n) is a third, which means that the maximum amplitude of the echoes which are added to the audio signal will be a third of the amplitude of the original audio signal.
  • the lookup table output g(n) is inverted compared with when the message data has a binary value of “1”. Therefore, during the first half symbol period, the polarity of the echoes output from the subtractor 67 will be reversed when they are multiplied by g(n) in the multiplier 69 and during the second half of the symbol period the polarities of the echoes output by the subtractor 67 will not be inverted when they are multiplied by g(n) in the multiplier 69 .
  • FIG. 6 c illustrates the lookup table output g(n) over two symbol periods when the message data to be transmitted is a binary “1” followed by another binary “1”.
  • the lookup table output g(n) is a simple repeat of the output illustrated in FIG. 6 a .
  • the lookup table output g(n) over the two symbol periods will be the inverse of that shown in FIG. 6 c.
  • the function shown in FIG. 6 d is used instead of using a lookup table output function obtained by concatenating the functions shown in FIG. 6 a and FIG. 6 b .
  • the lookup table output g(n) reaches its peak negative value in the first symbol period, it remains at that value until the peak would have occurred in the second symbol period before decreasing in magnitude back to zero.
  • the lookup table output g(n) over the two symbol periods will be the inverse of that shown in FIG. 6 d .
  • the inventors have found that not returning to the zero level in this way reduces the obtrusiveness of the echo modulation scheme that is used. This is because the human ear is more sensitive to changing echoes than to constant echoes.
  • the lookup table address logic 73 is responsible for analysing the successive bits of the message data and then to look up the appropriate part of the lookup table 71 so that the appropriate output function g(n) is applied to the multiplier 69 .
  • FIG. 7 is a part schematic and part block diagram illustrating the processing performed by the echo detector 37 .
  • FIG. 7 illustrates 100 milliseconds of an input signal 61 at the input of the echo detector 37 .
  • the input signal 61 is illustrated schematically as a continuous signal for ease of understanding but it will be a sampled and digitised waveform.
  • the echo detector 37 includes two sliding windows 63 - 1 and 63 - 2 which extract adjacent segments of the input audio signal 61 - 1 and 61 - 2 , each of length 50 milliseconds. Therefore, the two windows 63 extract portions of the input acoustic signal 61 which correspond to the above-described half symbol periods. As shown in FIG. 7 , the extracted portion 61 - 1 of the input acoustic signal is input to a first autocorrelation unit 65 - 1 and the extracted portion 61 - 2 of the input audio signal is input to a second autocorrelation unit 65 - 2 .
  • Both autocorrelation units 65 operate to determine the autocorrelation of the corresponding portion 61 - 1 or 61 - 2 of the input acoustic signal at 10 millisecond and 10.25 millisecond lags.
  • the determined autocorrelation values at lags 10.25 from autocorrelation units 65 - 1 and 65 - 2 are then input to a subtractor 67 , that subtracts the autocorrelation value obtained from window j from the autocorrelation value obtained from window i (or vice versa).
  • the result of this subtraction is then supplied to another subtractor 69 .
  • the autocorrelation value at lag 10 milliseconds from window i and the autocorrelation value at lag 10 milliseconds from window j are output from the autocorrelation units 65 to the subtractor 71 , that subtracts the autocorrelation value obtained from window j from the autocorrelation value obtained from window i (or vice versa) and feeds the result to the subtractor 69 .
  • the subtractor 69 then subtracts the output from subtractor 67 from the output from subtractor 71 (or vice versa). Therefore, the output from the subtractor 69 is represented by the following equation:
  • FIG. 8 a shows an autocorrelation plot 81 obtained from a typical audio signal without any artificial echoes.
  • the autocorrelation plot 81 has a peak at zero lag.
  • the autocorrelation plot 81 does not tail off towards zero until about 15 milliseconds after the initial peak and exhibits local peaks and troughs in between.
  • Peak 82 illustrates such a local peak that may occur as a result of a natural echo being added to the audio signal.
  • FIG. 8 b illustrates an autocorrelation plot 83 for the same audio signal after a positive echo has been added at a lag of 10 milliseconds and a negative echo has been added at a lag of 12 milliseconds (rather than at 10.25 ms so that the two echoes can be seen more clearly).
  • the autocorrelation plot 83 includes a peak 85 at 10 milliseconds and a peak 87 at 12 milliseconds. However, the peak 85 is masked somewhat by the earlier peak 82 caused by a natural echo.
  • FIG. 8 c illustrates the autocorrelation plot 89 for the audio signal after the echoes have been added in the second half of the symbol period.
  • the autocorrelation plot 89 includes a negative peak 91 at 10 milliseconds and a positive peak 93 at 12 milliseconds.
  • FIG. 8 d illustrates the autocorrelation plot that is obtained by subtracting the autocorrelation plot shown in FIG. 8 c from the autocorrelation plot shown in FIG. 8 b .
  • the common peaks in the autocorrelation plots shown in FIGS. 8 b and 8 c have been removed, whilst the complementary peaks 85 and 91 ; and 87 and 93 have added together to create the combined peaks 95 and 97 respectively.
  • the echo detector 37 does not calculate the autocorrelation of the input signal over all lags. It only calculates the autocorrelation values at the lags where the artificial echoes have been added.
  • the plots shown in FIG. 8 show the autocorrelation values over lags from 0 to 15 milliseconds. These plots therefore help to illustrate the effect of natural echoes and periodicities in the audio signal which can mask the artificial echoes that are added to encode the data.
  • the receiver 5 knows the duration of each half symbol period. This defines the length of the windows 63 - 1 and 63 - 2 used in the echo detector 37 .
  • the echo detector 37 initially will not be synchronised with the transmitted data. In other words, the echo detector 37 does not know where each symbol period begins and ends or where the start of the message is located. Therefore, in this embodiment, the echo detector 37 performs the above analysis as each new sample is received from the analogue to the digital converter 35 .
  • the output from the subtractor 69 is then analysed by the data recovery module 39 to determine the most likely symbol boundaries.
  • the data recovery module determines the location of the start of the message by finding the synchronisation bits that were added by the synchronisation data adder 57 . At this point, the data recovery unit 39 can start to recover the whole message from the polarity of the autocorrelation values output from the subtractor 69 .
  • the echo detector 37 will typically determine the autocorrelation measurements in the middle of each half symbol period, when the echo is expected to be at its peak amplitude and the data recovery module 39 will determine the bit value from the polarity of the output from the subtractor 69 .
  • the echo detector 37 may also take measurements just before and just after the middle of each half symbol period, to allow the data recovery module 39 to track the synchronisation.
  • the message data recovered by the data recovery module 39 is then input to the FEC decoding module 41 where the message data is decoded (using the inverse processing of the FEC encoder 7 ) to obtain the original input data that was input to the encoder 7 of the transmitter 1 .
  • the data was hidden within an audio signal by employing a number of echoes whose polarity varied with the data value to be transmitted. These echoes were added to the original audio signal after appropriate delays. As those skilled in the art will appreciate, the echoes may be added before the original audio signal (pre-echoes), before and after the original audio signal or only after the original audio signal.
  • synchronisation bits were added to the data that was transmitted so that the decoder can identify the boundaries of each symbol period and the start and end of each message.
  • the use of such synchronisation bits significantly increases the overall message length that has to be transmitted (in some cases by as much as 25%).
  • the matching is not perfect which can reduce the chances of a successful synchronisation.
  • the inventors have realised, however, that the synchronisation bits are not required.
  • the FEC decoding module 41 will have higher error rates when the echo detector 37 is not properly synchronised with the incoming data compared with its error rate when the echo detector is synchronised with the incoming data. Therefore, in the embodiment illustrated in FIG. 9 , the error output generated by the FEC decoding module 41 is used to control the synchronisation of the receiver to the incoming data.
  • the echo detector 37 receives a block of samples corresponding to one or more symbol(s) and determines the optimum time within that block of samples to detect the echoes within the symbols. Multiple symbols may be required when Manchester encoding is used as a Manchester encoded “one” looks the same as a Manchester encoded “zero” with a time shift. Therefore, it may be necessary to consider a number of symbols to allow the symbol boundaries to be identified.
  • the actual determination of the optimum time within the block of samples to detect the echoes may be determined by passing the block of samples through a matched filter (loaded with the expected signal pattern for one symbol period) and the time within the symbol when the absolute output (averaged over a number of successive symbols) is at a maximum is deemed to be the best time to sample the symbols. For example, if there are N samples per symbol, and the block of samples has M symbols, then the following values are calculated:
  • x(i) is absolute output of the matched filter for sample i.
  • the largest average value thus determined identifies the best time to detect the echoeswithin the incoming signal during each symbol.
  • the echo detector 37 uses the determined optimum time to detect echoes in that symbol and in the previous N ⁇ 1 symbols of the input signal (where N is the number of symbols in the transmitted message).
  • the data recovery module 39 determines, from the detected echoes, bit value(s) for each symbol and outputs the string of bits corresponding to the possible message to the FEC decoding module 41 .
  • the FEC decoding module 41 then performs the inverse processing of the FEC encoder 7 to regenerate a candidate input data codeword, which is stored in the buffer 93 .
  • the FEC decoding module 41 also outputs an error count indicating how many errors are identified in the candidate codeword, which it passes to a controller 91 .
  • the controller 91 compares the error count with a threshold value and if it is greater than the threshold, then the controller 91 flushes the candidate codeword from the buffer 93 . The above process is then repeated for the next received symbol in the input signal, until the controller 91 determines that the error count is below the threshold. When it is, the controller 91 instructs the FEC decoding module 41 to accept the candidate codeword, which it then outputs for further use in the receiver 5 .
  • the echo detector 37 , the data recovery module 39 and the FEC decoding module 41 all operate on a window of the input signal corresponding to the length of the transmitted message, which window is slid over the input signal until a point is found where the FEC error count is below a defined threshold—indicating the identification of the full message within the input signal.
  • FIG. 10 is a plot illustrating the way in which the FEC decoding module's error count 99 is expected to change as the window 101 is slid over an input signal 103 containing a data message 105 , with the minimum appearing at symbol S N , when the window 101 is aligned with the data message 105 in the input signal 103 .
  • the threshold (Th) level is then set to reduce the possibility that false minimums in the FEC error output count are considered as possible codewords, so that (in the ideal situation) only when the receiver 5 is properly synchronised (aligned) to the message data, will the FEC decoding module's error count reduce below the threshold in the manner illustrated in FIG. 10 .
  • the FEC encoding/decoding that is used is designed to keep the error rate of the FEC decoding module 41 high except when the window 101 is aligned with the message data 105 in the input signal 103 .
  • the inventors have found that this simple thresholding technique is sufficient to identify the location of the message data in the input signal 103 .
  • further consideration can be made, varying the possible positions of the start and end of the message and looking for the positions that give the minimum FEC error count.
  • the above technique is useful for finding a single message in the input signal.
  • the synchronisation timing determined for the first data message may be used to identify the synchronisation timing for the next data message.
  • the FEC encoder 7 often uses cyclic codewords (for example when using Reed Solomon block encoding) which means that a one bit shift in the codeword can also be a valid codeword. This is problematic because it can result in false detections of a codeword (a so called false positive) in the input signal 105 .
  • This problem can be overcome by reordering the bits of the codeword in the FEC encoder 7 in some deterministic manner (for example in a pseudo random manner), and using the inverse reordering in the FEC decoder 41 .
  • the processing that may be performed by the FEC encoder 7 and by the FEC decoder 41 in such an embodiment is illustrated in FIGS.
  • the FEC encoder 7 performs a cyclic encoding of the data (in this case Reed Solomon encoding 111 ), followed by a pseudo random reordering 113 of the data. The reordered data is then convolutionally encoded 115 and then interleaved 117 as before.
  • the FEC decoding module 41 initially de-interleaves 121 the data and performs convolutional decoding 123 . The FEC decoding module 41 then reverses 123 the pseudo random data reordering performed by the FEC encoder 7 and then performs the Reed Solomon decoding 125 .
  • each data value was represented by four echoes—two echoes in each of two half symbol periods.
  • each data value may be represented by any number of echoes in any number of sub-symbol periods.
  • each data value may be represented by a single echo in each half symbol period.
  • the echoes in each half symbol period would preferably be of opposite polarity so that the same differencing technique can be used to reduce the effects of natural echoes.
  • the inventors have found that in some cases using two echoes of opposite polarity in each half symbol period can result in some frequency components within the original audio signal adding constructively with the echoes and some frequency components within the original audio signal adding destructively with the echoes. If a single artificial echo is added, then such distortions are less evident making the hidden data less noticeable to users in the acoustic sound that is heard.
  • each data value by one or more echoes in different sub-symbol periods, means that the echoes in each sub-symbol period will be a repetition of a different portion of the audio signal. If there is only one symbol period, then each data value will be represented by echoes of the same (or substantially the same) portion of the audio signal.
  • each data value was represented by a positive and a negative echo in a first half symbol period and by a positive and a negative echo in the second half symbol period.
  • the positive and negative echoes in the first half symbol period allowed the receiver to reduce the effects of periodicities in the original audio signal which effect the autocorrelation measurements.
  • the use of complementary echoes in adjacent half symbol periods allows the receiver to reduce the effect of natural echoes within the received audio signal, which might otherwise mask the artificial echoes added to represent the data.
  • neither or only one of these techniques may be used.
  • each data value was represented by echoes within two adjacent half symbol periods.
  • these two half symbol periods do not have to be immediately adjacent to each other and a gap may be provided between the two periods if required.
  • the echoes in each half symbol period were of exactly the same portion of the audio signal. As those skilled in the art will appreciate, this is not essential.
  • the echoes in each half symbol period may be of slightly different portions of the audio signal. For example, one echo may miss out some of the audio samples of the audio signal.
  • the audio signal may include different channels (for example left and right channels for a stereo signal) and one echo may be formed from a repetition of the left channel and the other may be formed from a repetition of the right channel. With modern multi channel surround sound audio the repetitions can be of any of these channels.
  • the echoes generated within the transmitter were added to the original audio signal.
  • the generated echoes may be combined with the original audio signal in other ways.
  • the echoes may be subtracted from the audio signal.
  • the same result can be achieved by changing the way in which the echoes are combined with the audio signal. For example, one echo may be added to the original audio signal whilst the next echo may be subtracted from the audio signal.
  • the lookup table stored values for g(n) corresponding to one or two bits of the message data (as illustrated in FIG. 6 ). As those skilled in the art will appreciate, this is not essential.
  • the lookup table could simply store a function which increased in value and then decreased in value. Additional circuitry could then be provided to convert the polarity of this output as appropriate for the two half symbol periods. In this way, the function stored in the lookup table would only control the fading in and out of the echo and the additional circuitry would control the polarity of the echo as required.
  • the Manchester encoding was performed by the echo generation and shaping module. As those skilled in the art will appreciate, this Manchester encoding, if performed, may be performed within the FEC encoding module.
  • the techniques described above for hiding data within the audio may be done in advance of the transmission of the acoustic signal or it may be done in real time. Even in the case where the data is to be embedded within an audio signal in real time, some of the processing can be done in advance. For example, the FEC encoding may be performed on the data in advance so that only the echo generation and echo shaping is performed in real time.
  • the data rate of the encoded data is preferably kept between one and twenty symbols per second. This corresponds to a symbol period of between 50 ms and 1 second.
  • a long symbol period is beneficial because the added echoes will span across spoken words within the audio, making it easier to hide the data echoes within the audio.
  • a larger symbol period also reduces audibility of the echoes. This is because humans are more sensitive to changing echoes than they are to static or fixed echoes. Therefore, by having a longer symbol period, the rate of change of the echoes is lower making the presence of the echoes less noticeable to a user.
  • the data rate of the data added to the audio signal in the transmitter was constant and was known by the receiver. This knowledge reduces the complexity of the receiver circuitry for locking on to the data within the received signal. However, it is not essential to the invention and more complex circuitry may be provided in the receiver to allow the receiver to try different data rates until the actual data rate is determined. Similarly, the receiver may use other techniques to synchronise itself with the transmitted data so that it knows where the symbol boundaries are in advance of receiving the data.
  • FEC encoding techniques were used to allow the receiver to be able to correct errors in the received data.
  • encoding techniques are not essential to the invention. However, they are preferred, as they help to correct errors that occur in the transmission process over the acoustic link.
  • the peak amplitudes of the echoes were all the same and were independent of the data value being transmitted. As those skilled in the art will appreciate, the peak amplitudes of the echoes may also be varied with data to be transmitted if desired.
  • the echoes in each half symbol period were at the same delays relative to the original audio signal. As those skilled in the art will appreciate, this is not essential. There may be some variation in the actual delay values used within each half symbol period.
  • each echo within each sub-symbol period was generated by delaying the first echo by a further delay value.
  • each echo within each sub-symbol period may be independently generated from the original audio signal using an appropriate delay line.
  • the encoded data may be used as a watermark to protect the original audio signal.
  • the embedded data may be used to control the receiver so that it can respond in synchronism with the audio signal.
  • the decoder can be programmed to perform some action a defined time after receiving the codeword. The time delay may be programmed into the decoder by any means and may even be defined by data in the received codewords. When used to perform such synchronisation, shorter symbol periods are preferred as shorter symbol periods allows for better temporal resolution and hence more accurate synchronisation.
  • the data may be used for interactive gaming applications, audience surveying, e-commerce systems, toys and the like. The reader is referred to the Applicant's earlier International application WO02/45273 which describes a number of uses for this type of data hiding system.
  • the receiver performed autocorrelation measurements on the input audio signal in order to identify the locations of the echoes.
  • other techniques can be used to identify the echoes. Some of these other techniques are described in the Applicant's earlier PCT application PCT/GB2008/001820 and in U.S. Pat. No. 5,893,067, the contents of which are incorporated herein by reference.
  • the techniques involve some form of autocorrelation of the original audio signal or of parameters obtained from the audio signal (eg LPC parameters, cepstrum parameters etc).
  • a best fit approach could be used in which an expected audio signal (with different echo polarities) is fitted to the actual signal until a match is found and the polarity of the echoes thus determined.
  • a single transmitter was provided together with a receiver.
  • multiple transmitters and/or multiple receivers may be provided.
  • the components of the transmitter may be distributed among a number of different entities.
  • the encoding and data hiding part of the transmitter may be provided within a head end of a television distribution system or a user's set top box and the loudspeaker 19 may be a speaker of the user's television set.
  • the echoes were directly derived from the original audio signal.
  • the echo may not include all frequency components of the audio signal.
  • one or more of the echoes may be generated from a portion of the audio signal after it has been filtered to remove certain frequencies. This may be beneficial where it is found, for example, that there is additional noise in the low frequency part of the echoes but not in the higher frequency part.
  • the received signals would also be filtered to remove the lower frequency components (for example frequencies below about 500 Hz) so that only the higher frequency components (those above the lower frequency components) of the audio signal and the echoes would be present in the signals being analysed.
  • the received signal may be passed through a filter that simply reduces the level of the lower frequency components in the received signal compared with the higher frequency components. This will have the effect of reducing the relevance of the noisy low frequency part of the received signal in the subsequent decoding process.
  • the echoes may be low pass filtered to remove the higher frequencies.
  • the division of the audio signal into separate frequency bands can also be used to carry data on multiple channels. For example, if the frequency band is divided into a high frequency part and a low frequency part, then one channel may be provided by adding echoes to the high frequency part and another channel may be provided by adding different echoes to the low frequency part.
  • the use of multiple channels in this way allows frequency or temporal diversity if the data carried in the two channels is the same; or allows for an increased data transfer rate if each channel carries different data.
  • Multiple channels can also be provided where the audio signal also contains multiple channels (used to drive multiple speakers). In this case, one or more data channels may be provided in the audio signal for each audio channel.
  • data was hidden within an audio signal by adding echoes to the audio signal.
  • the incoming audio may already contain hidden data in the form of such echoes.
  • the encoder could decode the existing hidden data from the received audio signal and then use the decoded data to clean the audio signal to remove the artificial echoes defining this hidden data. The encoder could then add new echoes to the thus cleaned audio signal to hide the new data in the audio signal. In this way, the original hidden data will not interfere with the new hidden data.
  • the echoes were obtained by delaying digital samples of the audio signal.
  • the echoes may be generated in the analogue domain, using suitable analogue delay lines and analogue circuits to perform the echo shaping and polarity modulation.
  • the audio signal with the embedded data was transmitted to a receiver over an acoustic link.
  • the audio signal may be transmitted to the receiver over an electrical wire or wireless link.
  • the data rates that are used may be higher, due to lower noise levels.
  • one data bit was transmitted within each symbol period.
  • multiple bits may be transmitted within each symbol period. For example a second pair of echoes may be added at lags of 20 ms and 20.25 ms within each half symbol period to encode a second bit; a third pair of echoes may be added at lags of 30 ms and 30.25 ms within each half symbol period to encode a third bit etc.
  • Each echo could then be faded in and out during each half symbol period and polarity modulated in accordance with the bit value as before.
  • the fading in and out of the echoes for the different bits may be the same or it may be different for the different bits.
  • the polarity modulation of the different echoes will of course depend on the different bit values to be transmitted in the symbol period.
  • the echoes for the different bits within the same half symbol period are faded in and out at different times of the half symbol period, so that the different echoes reach their peak amplitudes at different times within the half symbol period. In this way, when the echo for one bit is at its peak amplitude (or when all the echoes for one bit are at their peak amplitudes—if there are multiple echoes representing each bit in each half symbol period), the echoes for the other bits will not be at their peaks.
  • the inventors have found that the above described data hiding techniques do not work as well during portions of the audio that include single tones or multiple harmonic tones, such as would be found in some sections of music. This is because the hidden data becomes more obtrusive to the listener in these circumstances and if the tones are being used as part of an automatic setup procedure they can cause the procedure to fail. Therefore, in one embodiment, the inventors propose to include (within the encoder) a detector that detects the level of tonality or other characteristic of the audio signal and, if it is highly tonal, that switches off the echo addition circuitry.
  • the encoder may fade the echoes out during periods of high tonality and then fade them back in during periods of low tonality. In this way, the data is only added to the audio signal when the audio signal is not highly tonal in nature.
  • Various techniques may be used for making this detection.
  • One technique for determining the level of tonality of an audio signal is described in the applicant's earlier PCT application WO02/45286, the contents of which are incorporated herein by reference. Another technique can be found in Davis P (1995) “A tutorial on MPEG/Audio Compression”, IEEE Multimedia Magazine, 2(2), pp. 60-74.
  • the system may be arranged to adapt the amplitude of the added echoes depending on the detected characteristic of the audio signal.
  • the encoder may instead or in addition vary the data rate or the symbol period in order to reduce the obtrusiveness of the hidden data during periods when the audio signal is highly tonal.
  • a sequence of messages may be transmitted. These messages may be the same or they may be different. In either case, each message may be transmitted after a preceding message has been transmitted. Alternatively, the end of one message may be overlapped with the start of the next message in a predefined way (so that the receiver can regenerate each message. This arrangement can increase the time diversity of the transmitted messages making them less susceptible to certain types of noise or data loss.
  • the data from the different messages may be interleaved in a known manner and transmitted as a single data stream to the receiver. The receiver would then regenerate each message by de-interleaving the bits in the data stream using knowledge of how the messages were originally interleaved.
  • Convolutional Coding is used as part of the forward error correction (FEC) encoder.
  • FEC forward error correction
  • data encoded in this way generally is decoded using a Viterbi decoder, which operates by constructing a trellis of state probabilities and branch metrics.
  • the transmitted data is often terminated with a number of zeros to force the encoder back to the zero state.
  • This allows the decoder to start decoding from a known state, however, it requires extra symbols to be transmitted over the channel.
  • An alternative technique is to ensure that the trellis start and end states are identical. This technique is referred to as tail biting and has the advantage of not requiring any extra symbols to be transmitted. Tail biting is used in many communications standards and, if desired, may be used in the embodiments described above.
  • the decoder does not work as well when the message consists of predominantly ‘zero’ bits (or conversely predominately ‘one’ bits), since under the encoding scheme an ‘all zeros’ codeword segment looks the same as a time-shifted ‘all ones’ codeword segment.
  • a particular example is the ‘all zeros’ message, which results in an ‘all zeros’ codeword after Reed Solomon encoding.
  • the encoding works best when there are approximately equal numbers of ones and zeros in the codeword, evenly distributed throughout the codeword. This can be achieved for the disclosed system by inverting the Reed Solomon parity bits. This has the effect of changing the all zeroes codeword to a mixture of zeroes and ones.
  • processing modules and circuits may be provided as hardware circuits or as software modules running within memory of a general purpose processor.
  • the software may be provided on a storage medium such as a CD-ROM or it may be downloaded into an appropriate programmable device on a carrier signal over a computer network, such as the Internet.
  • the software may be provided in compiled form, partially compiled form or in un-compiled form.

Abstract

A data hiding system is described for hiding data within an audio signal. The system can be used for watermarking, data communications, audience surveying etc. The system hides data in an audio signal by adding artificial echoes whose polarity varies with the data to be hidden. In one embodiment, each data value is represented by a positive and a negative echo having different delays. A receiver can then remove the effects of natural echoes and/or periodicities in the audio signal by differencing measurements obtained at the different delays.

Description

  • The present invention relates to a system for embedding data in an audio signal and to its subsequent recovery, which can be used for watermarking, data communications, audience surveying etc. The invention has particular relevance, to a system for hiding data in an audio signal by adding echoes to the audio signal and to a system for recovering the hidden data by detecting the added echoes.
  • U.S. Pat. No. 5,893,067 discloses a technique for hiding data within an audio signal for transmission to a remote receiver. The data is hidden in the audio signal by adding an artificial echo to the audio signal and varying the amplitude and/or delay of the echo in accordance with the data to be hidden.
  • A problem with the data hiding technique described in U.S. Pat. No. 5,893,067 is that in most situations, natural echoes can mask the artificial echoes making it difficult for the receiver to be able to identify the artificial echoes and hence recover the hidden data.
  • The present invention aims to provide an alternative data hiding technique. One embodiment of the invention at least alleviates the above problem by applying echoes of opposite polarity to represent each data value.
  • According to one aspect, the present invention provides a method of embedding a data value in an audio signal, the method comprising: generating an echo of at least a portion of the received audio signal; and embedding the data value in the audio signal by combining the received audio signal with the generated echo; wherein the data value is embedded in the audio by varying the polarity of the echo that is combined with the audio signal in dependence upon the data value. The inventors have found that using polarity modulation to embed the data in the audio signal can make the recovery of the embedded data easier in the receiver, especially in the presence of natural echoes caused, by for example the acoustics of the room. The polarity modulation can be achieved by varying the echo that is generated and/or by varying the way in which the echo is combined with the audio signal.
  • In one embodiment, the generating step generates a first echo of at least a portion of the received audio signal and a second echo of at least a portion of the received audio signal, the first and second echoes having first and second polarities respectively, which polarities vary in dependence upon the data value; and wherein the embedding step embeds the data value in the audio signal by combining the received audio signal with the generated first and second echoes.
  • Each of the echoes may be generated by repeating at least a part of said audio signal. The first echo may be generated by repeating a first portion of the audio signal and the second echo may be generated by repeating a second portion of said audio signal. Alternatively, the first and second echoes may be generated by repeating substantially the same first portion of the audio signal. Where the audio signal is received as a stream of samples, the or each echo may be generated by passing the stream of audio samples through a delay line.
  • In one embodiment, third and fourth echoes may be generated, the third echo having the same polarity as said second echo and the fourth echo having the same polarity as said first echo. In this case, the third and fourth echoes may be generated by repeating substantially the same second portion of the audio signal which is different to the first portion repeated by the first and second echoes. The second portion of the audio signal may be adjacent to the first portion. The generating step may generate the third and fourth echoes by inverting the polarity of a gain factor applied to the echoes before being combined with the audio signal.
  • The first echo may be combined with the audio signal at a first delay relative to the first portion of the audio signal; the second echo may be combined with the audio signal at a second delay relative to said first echo; the third echo may be combined with said audio signal at a third delay relative to said second portion of the audio signal; and the fourth echo may be combined with the audio signal at a fourth delay relative to the third echo. The first delay may be equal to said third delay and/or the second delay may be equal to said fourth delay. In one embodiment, the delays and the amplitudes of the echoes are independent of the data value.
  • Preferably the first and third delays are between 0.5 ms and 100 ms and more preferably between 5 ms and 40 ms; and the second and fourth delays delayed (relative to the first and third echoes respectively) by between 0.125 ms and 3 ms and more preferably between 0.25 ms and 1 ms, as these delays are similar to those of natural echoes and so are less noticeable to users. In one embodiment, the or each echo has an amplitude that is less than the amplitude of said audio signal. Preferably the or each echo is faded in and out to reduce obtrusiveness of the echoes to a listener.
  • The first and second portions of the audio signal should be long enough for the receiver to be able to detect the presence of the echoes but not too long as to overly reduce the data rate that can be communicated. The inventors have found that echoes having durations of between 20 ms and 500 ms provides a reasonable data rate whilst keeping to a minimum data transmission errors when transmission occurs over an acoustic link. If transmission is over an electrical link, then shorter echoes may be used.
  • The echoes may be combined with the audio signal by adding and/or subtracting the echoes to/from the audio signal. The polarity of each echo can therefore be controlled by controlling the way in which each echo is combined with the audio signal.
  • This aspect of the invention also provides a computer implementable instructions product comprising computer implementable instructions for causing a programmable computer device to carry out the method described above.
  • This aspect also provides an apparatus for embedding a data value in an audio signal, the apparatus comprising: an echo generator operable to generate an echo of at least a portion of the received audio signal; and a combiner operable to combine the received audio signal with the generated first and second echoes to embed the data value in the audio signal; wherein the echo generator and/or the combiner are arranged so that the data value is embedded in the audio by varying the polarity of the echo that is combined with the audio signal in dependence upon the data value.
  • According to another aspect, the present invention provides a method of recovering a data value embedded in an audio signal, the method comprising: receiving an input signal having the audio signal and an echo of at least part of the audio signal whose polarity depends upon said data value; processing the received input signal to determine the polarity of the echo; and recovering the data value from the determined polarity.
  • The input signal may comprise a first echo of at least a portion of the audio signal and a second echo of at least a portion of the audio signal, the first and second echoes having first and second polarities respectively, which polarities vary in dependence upon the data value; and wherein the processing step processes the input signal to combine the first and second echoes and to determine the polarity of the combined echoes and wherein the recovering step recovers the data value from the determined polarity of the combined echoes.
  • In one embodiment the processing step processes the input signal to determine a first autocorrelation measure that depends upon the first echo and a second autocorrelation measure that depends upon the second echo and combines the echoes by differencing the first and second autocorrelation measures and determines the polarity of the combined echoes by determining the polarity of the result of the differencing step.
  • The first echo may be of a first portion of the audio signal and the second echo may be of a second portion of the audio signal. Alternatively the first and second echoes may be repeats of substantially the same portion of the audio signal.
  • In one embodiment, the input signal comprises first, second, third and fourth echoes, the first and fourth echoes having the same polarity and the second and third echoes having the same polarity which is opposite to the polarity of the first and fourth echoes, wherein the processing step processes the input signal to combine the first to fourth echoes and to determine the polarity of the combined echoes and wherein the recovering step recovers the data value from the determined polarity of the combined echoes.
  • In this embodiment, the processing step may process the input signal to determine a first autocorrelation measure that depends upon the first echo, a second autocorrelation measure that depends upon the second echo, a third autocorrelation measure that depends upon the third echo and a fourth autocorrelation measure that depends upon the fourth echo and combines the echoes by differencing the autocorrelation measures and determines the polarity of the combined echoes by determining the polarity of a result of the differencing step.
  • The differencing step may perform a first difference of the first and third autocorrelation measures, a second difference of the second and fourth autocorrelation measures, a third difference of the result of said first difference and the result of the second difference and wherein the polarity of the combined echoes may be determined from the polarity of a result of the third difference.
  • The first and second echoes may be repeats of substantially the same first portion of the audio signal and the third and fourth echoes may be repeats of substantially the same second portion of the audio signal. Alternatively, the first and third echoes may be repeats of substantially the same first portion of the audio signal and the second and fourth echoes may be repeats of substantially the same second portion of the audio signal. In one embodiment, the or each echo is faded in and out to reduce obtrusiveness of the echoes to a listener. In this case, the polarity of the echo may be determined when the amplitude of the echo is at or near a maximum.
  • The first echo may be delayed relative to said first portion of the audio signal by a first delay; the second echo may be delayed relative to the first echo by a second delay; the third echo may be delayed relative to the second portion of the audio signal by a third delay; and the fourth echo may be delayed relative to the third echo by a fourth delay. The first delay may be equal to the third delay and/or the second delay may be equal to said fourth delay.
  • According to this aspect, a computer implementable instructions product is also provided comprising computer implementable instructions for causing a programmable computer device to carry out the above method.
  • This aspect also provides an apparatus for recovering a data value embedded in an audio signal, the apparatus comprising: an input for receiving an input signal having the audio signal and an echo of at least part of the audio signal whose polarity depends upon said data value; a processor operable to process the input signal to determine the polarity of the echo; and a data regenerator operable to recover the data value from the determined polarity.
  • These and other aspects of the invention will become apparent to those skilled in the art from the following detailed description of exemplary embodiments, which are described with reference to the following drawings in which:
  • FIG. 1 is a block diagram illustrating the main components of a transmitter and receiver used in an exemplary embodiment;
  • FIG. 2 a is an impulse plot illustrating the echoes that are added to an audio signal to encode a binary “one”;
  • FIG. 2 b is an impulse plot illustrating the echoes that are added to an audio signal to encode a binary “zero”;
  • FIG. 3 a is an impulse plot illustrating the presence of artificial echoes for a binary “one” after Manchester encoding and illustrating natural echoes;
  • FIG. 3 b is an impulse plot illustrating the presence of artificial echoes for a binary “zero” after Manchester encoding and illustrating natural echoes;
  • FIG. 4 is a block diagram illustrating in more detail the encoding performed in the transmitter shown in FIG. 1;
  • FIG. 5 is a block diagram illustrating the main components of an echo generation and shaping module forming part of the transmitter shown in FIG. 1;
  • FIG. 6 a illustrates a shaping and modulation function that is applied to the echoes prior to being combined with the audio signal when a binary “one” is to be transmitted;
  • FIG. 6 b illustrates a shaping and modulation function that is applied to the echoes prior to being combined with the audio signal when a binary “zero” is to be transmitted;
  • FIG. 6 c illustrates the way in which the shaping and modulation function varies when two successive binary “ones” are to be transmitted;
  • FIG. 6 d illustrates the shaping and modulation function that is applied when a binary “zero” is transmitted after a binary “one”;
  • FIG. 7 illustrates the processing performed in the receiver shown in FIG. 1 for recovering the hidden data from the received audio signal;
  • FIG. 8 a is an autocorrelation plot for a typical audio signal without artificial echoes;
  • FIG. 8 b is an autocorrelation plot for the audio signal with artificial echoes during a first half of a bit symbol;
  • FIG. 8 c is an autocorrelation plot for the audio signal with artificial echoes during the second half of the bit symbol;
  • FIG. 8 d is a plot obtained by subtracting the autocorrelation plot shown in FIG. 8 c from the autocorrelation plot shown in FIG. 8 b;
  • FIG. 9 is a block diagram illustrating an alternative form of receiver used to receive and recover the hidden data embedded in the audio signal;
  • FIG. 10 is a plot illustrating the way in which an FEC error count varies during a synchronisation process used to find the hidden data message within the input signal; and
  • FIGS. 11 a and 11 b illustrate the processing performed respectively by an FEC encoder and an FEC decoder in one embodiment.
  • OVERVIEW
  • FIG. 1 is a block diagram illustrating a transmitter and receiver system according to one embodiment in which a transmitter 1 transmits data hidden within an acoustic signal 3 to a remote receiver 5. The transmitter 1 may form part of a television or radio distribution network and the receiver may be a portable device such as a cellular telephone handset that is capable of detecting the acoustic signal 3 output by the transmitter 1.
  • The Transmitter
  • As shown in FIG. 1, the transmitter 1 includes a forward error and correction (FEC) encoder module 7, which receives and encodes the input data to be transmitted to the remote receiver 5. The encoded message data output from the FEC encoding module 7 is then passed to an echo generation and shaping module 9, which also receives an audio signal in which the encoded message data is to be hidden. The echo generation and shaping module 9 then hides the message data into the audio by generating echoes of the audio which depend upon the message data to be transmitted. The generated echoes are then combined with the original audio signal in a combiner module 11 and the resulting modified audio signal is then passed to a gain control module 13 for appropriate gain control. The audio signal is then converted from a digital signal to an analogue signal by the digital to analogue converter 15 and it is then amplified by a driver module 17 for driving a loudspeaker 19 which generates the acoustic signal 3 having the data hidden therein.
  • As will be described in more detail below, in this embodiment, the polarity of the echoes (as opposed to their lag and/or amplitude) is varied in order to encode the data to be transmitted. The inventors have found that this polarity modulation can be more robust in the presence of natural echoes and periodicities in the audio signal. This is particularly the case when each data value is represented by two echoes of the same magnitude but having different lags and opposite polarities. The polarities of the echoes representing each message bit are reversed to distinguish between a binary zero and a binary one. This is illustrated by the impulse plots illustrated in FIG. 2. In particular, FIG. 2 a is an impulse plot illustrating the component signals that are present when a binary one is to be transmitted and FIG. 2 b is an impulse plot illustrating the component signals present when a binary zero is to be transmitted. As shown in FIG. 2 a, the component signals include an initial impulse 21 representing the original audio signal followed by two lower amplitude impulses 23-1 and 23-2 representing the two echoes of the original signal component 21 which are added to the audio signal. As can be seen by comparing FIGS. 2 a and 2 b, when a binary one is to be transmitted, a positive echo 23-1 is transmitted first followed by a negative echo 23-2; and when transmitting a binary zero a negative echo 23-1 is transmitted first followed by a positive echo 23-2. Although this could be reversed if desired.
  • As shown in FIG. 2, in this embodiment, the first echo is added with a lag of approximately ten milliseconds and the second echo is added 0.25 milliseconds after the first echo. This is the same regardless of whether a binary one or a binary zero is to be transmitted. Additionally, as represented in FIG. 2, in this embodiment, the echoes that are added have lower amplitudes compared with the amplitude of the original audio signal. In particular, in this embodiment, the amplitude of the echoes is approximately one third that of the original audio signal.
  • The Receiver
  • FIG. 1 also illustrates the main components of the receiver 5. As shown, the receiver includes a microphone 31 for detecting the acoustic signal 3 and for converting it into a corresponding electrical signal which is then filtered and amplified by filter and amplification circuitry 33. The output from the filter amplification circuitry 33 is then digitised by an analogue to digital converter 35 and the digital samples are then passed to an echo detector 37. The echo detector 37 then processes the digital samples to identify the polarities of the echoes in the received signal. This information is then passed through a data recovery module 39 which processes the echo information to recover the encoded message data. This message data is then decoded by a decoder 41 to recover the original data that was input to the FEC decoding module of the transmitter 1.
  • Manchester Encoding
  • As will be explained in more detail below, the echo detector 37 detects the echoes from the received signal by calculating the auto-correlation of the received signal at specified delays. However, natural echoes (e.g. room echoes) will also contribute to the autocorrelation values thus calculated as will periodicities of the original audio track. In order to distinguish the artificial echoes representing the encoded data from these natural echoes, the message data is also Manchester encoded so that a message data value of “1” is transmitted as a “1”, followed by a “0” (or vice versa), whilst a message data value of “0” is transmitted as a “0” followed by a “1”. In this embodiment, this Manchester encoding is performed by the echo generation and shaping module 9. Therefore, when a message bit value of “0” is to be transmitted, for the first half of the symbol, the first echo 23-1 is of positive polarity and the second echo 23-2 is of negative polarity, whilst for the second half of the symbol, the first echo 23-1 is of negative polarity and the second echo 23-2 is of positive polarity. To transmit a message bit value of “0”, all polarities are reversed, as summarised in the table given below.
  • first half of symbol second half of symbol
    data polarity of polarity of polarity of polarity of
    value first echo second echo first echo second echo
    0 Positive negative negative positive
    1 Negative positive positive negative
  • The reason that the Manchester encoding can help to distinguish the artificial echoes from the natural echoes is that the natural echoes will be stable over the two half symbol periods. Therefore, by subtracting the autocorrelations in the second half of the symbol from autocorrelations in the first half of the symbol (or vice versa), the effect of the natural echoes and periodicities will cancel, whilst the autocorrelation peaks caused by the artificial echoes will add constructively. Similarly, the reason for using two echoes in each half symbol period is to distinguish the artificial echoes from periodicities in the original track. Typically, the autocorrelation of the original track will not change significantly between these two lags (i.e. between 10 ms and 10.25 ms). Therefore, by differencing the autocorrelations at the two lags, the effect of the periodicities is reduced and the autocorrelation peaks caused by the two echoes add constructively.
  • FIGS. 3 a and 3 b are impulse plots showing the two half symbols and the artificial echoes 23 that are added within each half symbol period to represent a binary “1” and a binary “0” respectively. FIGS. 3 a and 3 b also illustrate natural echoes 25-1 and 25-2 which do not change from one half period to the next. Therefore, by subtracting the echoes in one half of the symbol period from the corresponding echoes (i.e. those with the same lag or delay) in the other half of the symbol period, the effect of the natural echoes and periodicities will cancel, whilst the artificial echoes will add constructively, thereby making it easier to detect the hidden data.
  • The above description provides an overview of the encoding and decoding techniques used in the present embodiment. A more detailed description will now be given of the main components of the transmitter 1 and receiver 5 to carry out the encoding and decoding processes described above.
  • FEC Encoder
  • FIG. 4 is a block diagram illustrating the main components of the FEC encoder module 7 used in this embodiment. As shown, the first encoding module is a Reed-Solomon encoder module 51 which uses a shortened (13, 6) block code to represent the input data. The data output from the Reed-Solomon encoder 51 is then passed to a convolutional encoder 53 which performs convolutional encoding on the data. The data bits output from the convolutional encoder 53 are then interleaved with each other by a data interleaving module 55 to protect against errors occurring in bursts. Finally, a synchronisation data adder module 57 adds a sequence of synchronisation bits that will help the receiver 5 lock on to the encoded data within the received acoustic signal 3. The output from the synchronisation data adder module 57 represents the message data which is then passed to the echo generation and shaping module 9 shown in FIG. 1.
  • Echo Generation and Shaping
  • FIG. 5 is a block diagram illustrating the main components of the echo generation and shaping module 9 and the combiner module 11 shown in FIG. 1. The input audio signal is represented by the sequence of audio samples a(n) which are applied to a 10 millisecond delay unit 61 and to the adder 63 (corresponding to the combiner 11 shown in FIG. 1). The 10 millisecond delay unit 61 delays the input sample a(n) by 10 milliseconds which it then outputs to a 0.25 millisecond delay unit 65 and to a subtractor 67. The 0.25 millisecond delay unit 65 delays the audio sample output from the 10 millisecond delay unit 61 by a further 0.25 milliseconds which it then outputs to the subtractor 67. The subtractor 67 subtracts the 10.25 millisecond delayed sample from the 10 millisecond delayed sample outputting the result to a multiplier 69. The delay units and the subtractor operate each time a new audio sample a(n) arrives. In this embodiment, the audio sample frequency is one of 8 kHz, 32 kHz, 44.1 kHz or 48 kHz.
  • Therefore, as those skilled in the art will appreciate, the 10 millisecond delay unit 61, the 0.25 millisecond delay unit 65 and the subtractor 67 will generate the two echoes 23-1 and 23-2 illustrated in FIG. 2. At this stage, however, the echoes that have been generated do not depend on the data to be transmitted. As will be explained below, this dependency is achieved by multiplying the echoes in the multiplier 69 with a modulation function g(n) that is output by a lookup table 71 which is addressed by lookup table address logic 73 in response to the current message data value. In particular, the lookup table output g(n) changes the polarity of the echoes in dependence upon the message data so that the echoes with the modulated polarities can then be added back to the original audio signal by the adder 63 to generated the echo-modulated audio output signal.
  • Lookup Table Output g(n)
  • The inventors have found that abrupt changes in the echoes that are added can make the echoes more obtrusive to users in the vicinity of the loudspeaker 19. Therefore, the lookup table output g(n) is gradually increased and decreased so that the echoes are effectively faded in and out.
  • Additionally, in this embodiment, the lookup table output g(n) also performs the above described Manchester encoding of the message data. The way in which this is achieved will now be explained with reference to FIG. 6. In particular, FIG. 6 a is a plot illustrating the way in which the lookup table output g(n) varies over one symbol period, when the bit value of the message data is a binary “1”. In this embodiment, the symbol period is 100 ms. As shown, during the first half of the symbol period, the function g(n) increases from zero to a maximum value and then decreases back to zero at the end of the first half of the symbol period. During the second half of the symbol period, the function g(n) is negative and increases in magnitude to a maximum negative value and then decreases back to zero. As can be seen from FIG. 6 a, in this embodiment, the gradual increasing and decreasing of the lookup table output g(n) is achieved by using a sinusoidal function. Therefore, during the first half of the symbol, the combined echoes output from the subtractor 67 will be multiplied by a positive value and so their polarity will not be changed when they are multiplied by g(n) in the multiplier 69. On the other hand, during the second half of the symbol period the lookup table output g(n) is negative and therefore, the polarities of the echoes output from the subtractor 67 will be reversed when the echoes are multiplied by g(n) in the multiplier 69.
  • As mentioned above, the artificial echoes 23 that are generated and added to the audio signal have an amplitude which is approximately a third that of the audio signal. In this embodiment, the amplitude of the echoes is controlled by the output of the lookup table g(n). As shown in FIG. 6 a, the peak amplitude of the lookup table output g(n) is a third, which means that the maximum amplitude of the echoes which are added to the audio signal will be a third of the amplitude of the original audio signal.
  • As shown in FIG. 6 b, when the message data is a binary value “0” the lookup table output g(n) is inverted compared with when the message data has a binary value of “1”. Therefore, during the first half symbol period, the polarity of the echoes output from the subtractor 67 will be reversed when they are multiplied by g(n) in the multiplier 69 and during the second half of the symbol period the polarities of the echoes output by the subtractor 67 will not be inverted when they are multiplied by g(n) in the multiplier 69.
  • FIG. 6 c illustrates the lookup table output g(n) over two symbol periods when the message data to be transmitted is a binary “1” followed by another binary “1”. As shown in FIG. 6 c, in this case, the lookup table output g(n) is a simple repeat of the output illustrated in FIG. 6 a. Similarly, if successive values of the message data are binary “0's” then the lookup table output g(n) over the two symbol periods will be the inverse of that shown in FIG. 6 c.
  • However, If the message data transitions from a binary “1” to a binary “0”, then instead of using a lookup table output function obtained by concatenating the functions shown in FIG. 6 a and FIG. 6 b, the function shown in FIG. 6 d is used instead. As can be seen in FIG. 6 d, when the lookup table output g(n) reaches its peak negative value in the first symbol period, it remains at that value until the peak would have occurred in the second symbol period before decreasing in magnitude back to zero. Similarly, when successive bits of the message data transition from a binary “0” to a binary “1”, the lookup table output g(n) over the two symbol periods will be the inverse of that shown in FIG. 6 d. The inventors have found that not returning to the zero level in this way reduces the obtrusiveness of the echo modulation scheme that is used. This is because the human ear is more sensitive to changing echoes than to constant echoes.
  • As those skilled in the art will appreciate, the lookup table address logic 73 is responsible for analysing the successive bits of the message data and then to look up the appropriate part of the lookup table 71 so that the appropriate output function g(n) is applied to the multiplier 69.
  • Echo Detector
  • FIG. 7 is a part schematic and part block diagram illustrating the processing performed by the echo detector 37. In particular, FIG. 7 illustrates 100 milliseconds of an input signal 61 at the input of the echo detector 37. As those skilled in the art will appreciate, the input signal 61 is illustrated schematically as a continuous signal for ease of understanding but it will be a sampled and digitised waveform.
  • As illustrated by window i and window j, the echo detector 37 includes two sliding windows 63-1 and 63-2 which extract adjacent segments of the input audio signal 61-1 and 61-2, each of length 50 milliseconds. Therefore, the two windows 63 extract portions of the input acoustic signal 61 which correspond to the above-described half symbol periods. As shown in FIG. 7, the extracted portion 61-1 of the input acoustic signal is input to a first autocorrelation unit 65-1 and the extracted portion 61-2 of the input audio signal is input to a second autocorrelation unit 65-2. Both autocorrelation units 65 operate to determine the autocorrelation of the corresponding portion 61-1 or 61-2 of the input acoustic signal at 10 millisecond and 10.25 millisecond lags. The determined autocorrelation values at lags 10.25 from autocorrelation units 65-1 and 65-2 are then input to a subtractor 67, that subtracts the autocorrelation value obtained from window j from the autocorrelation value obtained from window i (or vice versa). The result of this subtraction is then supplied to another subtractor 69. Similarly, the autocorrelation value at lag 10 milliseconds from window i and the autocorrelation value at lag 10 milliseconds from window j are output from the autocorrelation units 65 to the subtractor 71, that subtracts the autocorrelation value obtained from window j from the autocorrelation value obtained from window i (or vice versa) and feeds the result to the subtractor 69. The subtractor 69 then subtracts the output from subtractor 67 from the output from subtractor 71 (or vice versa). Therefore, the output from the subtractor 69 is represented by the following equation:

  • (Ai(10)−Aj(10))−(Ai(10.25)−Aj(10.25))
  • As mentioned above, subtracting the autocorrelation values of one half symbol period from the corresponding autocorrelation values of the other half symbol period can reduce the effect of natural echoes in the input acoustic signal 61. This is because natural echoes are unlikely to change from one half symbol period to the next and so their effect will be constant in the autocorrelations that are calculated. Consequently, performing this subtraction will remove this common effect. Likewise, subtracting the autocorrelation values obtained from each half symbol period will reduce the effect of periodicities in the original audio signal. This is because in the 0.25 ms delay between the first echo and the second echo in the half symbol period, the effect of the periodicities on the autocorrelations will be approximately constant and so this subtraction will remove this common effect. This will now be described in more detail with reference to FIG. 8.
  • FIG. 8 a shows an autocorrelation plot 81 obtained from a typical audio signal without any artificial echoes. As shown, the autocorrelation plot 81 has a peak at zero lag. However, because of periodicities in the audio signal and because of natural echoes, the autocorrelation plot 81 does not tail off towards zero until about 15 milliseconds after the initial peak and exhibits local peaks and troughs in between. Peak 82 illustrates such a local peak that may occur as a result of a natural echo being added to the audio signal.
  • FIG. 8 b illustrates an autocorrelation plot 83 for the same audio signal after a positive echo has been added at a lag of 10 milliseconds and a negative echo has been added at a lag of 12 milliseconds (rather than at 10.25 ms so that the two echoes can be seen more clearly). As shown in FIG. 8 b, as a result of the artificial echoes, the autocorrelation plot 83 includes a peak 85 at 10 milliseconds and a peak 87 at 12 milliseconds. However, the peak 85 is masked somewhat by the earlier peak 82 caused by a natural echo.
  • FIG. 8 c illustrates the autocorrelation plot 89 for the audio signal after the echoes have been added in the second half of the symbol period. As shown, the autocorrelation plot 89 includes a negative peak 91 at 10 milliseconds and a positive peak 93 at 12 milliseconds.
  • Finally, FIG. 8 d illustrates the autocorrelation plot that is obtained by subtracting the autocorrelation plot shown in FIG. 8 c from the autocorrelation plot shown in FIG. 8 b. As can be seen, the common peaks in the autocorrelation plots shown in FIGS. 8 b and 8 c have been removed, whilst the complementary peaks 85 and 91; and 87 and 93 have added together to create the combined peaks 95 and 97 respectively. As those skilled in the art will appreciate, it is therefore much easier to detect the peaks 95 and 97 because their values are much greater than the autocorrelation values at other lags. This effect is further enhanced by subtracting the autocorrelation value at 12 milliseconds from the autocorrelation value at 10 milliseconds. This will effectively add the two peaks 95 and 97 together to provide an even larger peak, which can then be detected by suitable thresholding. The value of the corresponding data value can then be determined from the polarity of the combined peak.
  • As those skilled in the art will appreciate, in this embodiment, the echo detector 37 does not calculate the autocorrelation of the input signal over all lags. It only calculates the autocorrelation values at the lags where the artificial echoes have been added. The plots shown in FIG. 8 show the autocorrelation values over lags from 0 to 15 milliseconds. These plots therefore help to illustrate the effect of natural echoes and periodicities in the audio signal which can mask the artificial echoes that are added to encode the data.
  • Synchronisation
  • In this embodiment, the receiver 5 knows the duration of each half symbol period. This defines the length of the windows 63-1 and 63-2 used in the echo detector 37. However, the echo detector 37 initially will not be synchronised with the transmitted data. In other words, the echo detector 37 does not know where each symbol period begins and ends or where the start of the message is located. Therefore, in this embodiment, the echo detector 37 performs the above analysis as each new sample is received from the analogue to the digital converter 35. The output from the subtractor 69 is then analysed by the data recovery module 39 to determine the most likely symbol boundaries. The data recovery module then determines the location of the start of the message by finding the synchronisation bits that were added by the synchronisation data adder 57. At this point, the data recovery unit 39 can start to recover the whole message from the polarity of the autocorrelation values output from the subtractor 69.
  • Once synchronisation has been achieved, the echo detector 37 will typically determine the autocorrelation measurements in the middle of each half symbol period, when the echo is expected to be at its peak amplitude and the data recovery module 39 will determine the bit value from the polarity of the output from the subtractor 69. The echo detector 37 may also take measurements just before and just after the middle of each half symbol period, to allow the data recovery module 39 to track the synchronisation.
  • The message data recovered by the data recovery module 39 is then input to the FEC decoding module 41 where the message data is decoded (using the inverse processing of the FEC encoder 7) to obtain the original input data that was input to the encoder 7 of the transmitter 1.
  • Modifications and Alternatives
  • In the above embodiments, the data was hidden within an audio signal by employing a number of echoes whose polarity varied with the data value to be transmitted. These echoes were added to the original audio signal after appropriate delays. As those skilled in the art will appreciate, the echoes may be added before the original audio signal (pre-echoes), before and after the original audio signal or only after the original audio signal.
  • In the above embodiment, synchronisation bits were added to the data that was transmitted so that the decoder can identify the boundaries of each symbol period and the start and end of each message. The use of such synchronisation bits significantly increases the overall message length that has to be transmitted (in some cases by as much as 25%). Additionally, as the decoding of each bit is subject to noise, the matching is not perfect which can reduce the chances of a successful synchronisation. The inventors have realised, however, that the synchronisation bits are not required. In particular, the inventors have realised that the FEC decoding module 41 will have higher error rates when the echo detector 37 is not properly synchronised with the incoming data compared with its error rate when the echo detector is synchronised with the incoming data. Therefore, in the embodiment illustrated in FIG. 9, the error output generated by the FEC decoding module 41 is used to control the synchronisation of the receiver to the incoming data.
  • More specifically, in this embodiment, the echo detector 37 receives a block of samples corresponding to one or more symbol(s) and determines the optimum time within that block of samples to detect the echoes within the symbols. Multiple symbols may be required when Manchester encoding is used as a Manchester encoded “one” looks the same as a Manchester encoded “zero” with a time shift. Therefore, it may be necessary to consider a number of symbols to allow the symbol boundaries to be identified. The actual determination of the optimum time within the block of samples to detect the echoes may be determined by passing the block of samples through a matched filter (loaded with the expected signal pattern for one symbol period) and the time within the symbol when the absolute output (averaged over a number of successive symbols) is at a maximum is deemed to be the best time to sample the symbols. For example, if there are N samples per symbol, and the block of samples has M symbols, then the following values are calculated:

  • average(0)=1/M*(x(0)+x(N)+x(2N)+ . . . )

  • average(1)=1/M*(x(1)+x(N+1)+x(2N+1)+ . . . )

  • . . .

  • average(N−1)=1/M*(x(N−1)+x(2N−1)+x(3N−1)+ . . . )
  • where x(i) is absolute output of the matched filter for sample i. The largest average value thus determined identifies the best time to detect the echoeswithin the incoming signal during each symbol.
  • The echo detector 37 then uses the determined optimum time to detect echoes in that symbol and in the previous N−1 symbols of the input signal (where N is the number of symbols in the transmitted message). The data recovery module 39 then determines, from the detected echoes, bit value(s) for each symbol and outputs the string of bits corresponding to the possible message to the FEC decoding module 41. The FEC decoding module 41 then performs the inverse processing of the FEC encoder 7 to regenerate a candidate input data codeword, which is stored in the buffer 93. The FEC decoding module 41 also outputs an error count indicating how many errors are identified in the candidate codeword, which it passes to a controller 91. In response, the controller 91 compares the error count with a threshold value and if it is greater than the threshold, then the controller 91 flushes the candidate codeword from the buffer 93. The above process is then repeated for the next received symbol in the input signal, until the controller 91 determines that the error count is below the threshold. When it is, the controller 91 instructs the FEC decoding module 41 to accept the candidate codeword, which it then outputs for further use in the receiver 5. In effect, therefore, the echo detector 37, the data recovery module 39 and the FEC decoding module 41 all operate on a window of the input signal corresponding to the length of the transmitted message, which window is slid over the input signal until a point is found where the FEC error count is below a defined threshold—indicating the identification of the full message within the input signal.
  • FIG. 10 is a plot illustrating the way in which the FEC decoding module's error count 99 is expected to change as the window 101 is slid over an input signal 103 containing a data message 105, with the minimum appearing at symbol SN, when the window 101 is aligned with the data message 105 in the input signal 103. The threshold (Th) level is then set to reduce the possibility that false minimums in the FEC error output count are considered as possible codewords, so that (in the ideal situation) only when the receiver 5 is properly synchronised (aligned) to the message data, will the FEC decoding module's error count reduce below the threshold in the manner illustrated in FIG. 10. Ideally, in this embodiment, the FEC encoding/decoding that is used is designed to keep the error rate of the FEC decoding module 41 high except when the window 101 is aligned with the message data 105 in the input signal 103. The inventors have found that this simple thresholding technique is sufficient to identify the location of the message data in the input signal 103. However, if more accurate detection is required, then further consideration can be made, varying the possible positions of the start and end of the message and looking for the positions that give the minimum FEC error count.
  • The above technique is useful for finding a single message in the input signal. Clearly, if a sequence of such data messages is transmitted, then the synchronisation timing determined for the first data message may be used to identify the synchronisation timing for the next data message.
  • One problem identified by the inventors with the synchronisation approach discussed above is that the FEC encoder 7 often uses cyclic codewords (for example when using Reed Solomon block encoding) which means that a one bit shift in the codeword can also be a valid codeword. This is problematic because it can result in false detections of a codeword (a so called false positive) in the input signal 105. This problem can be overcome by reordering the bits of the codeword in the FEC encoder 7 in some deterministic manner (for example in a pseudo random manner), and using the inverse reordering in the FEC decoder 41. The processing that may be performed by the FEC encoder 7 and by the FEC decoder 41 in such an embodiment is illustrated in FIGS. 11 a and 11 b respectively. As shown, the FEC encoder 7 performs a cyclic encoding of the data (in this case Reed Solomon encoding 111), followed by a pseudo random reordering 113 of the data. The reordered data is then convolutionally encoded 115 and then interleaved 117 as before. Similarly, the FEC decoding module 41 initially de-interleaves 121 the data and performs convolutional decoding 123. The FEC decoding module 41 then reverses 123 the pseudo random data reordering performed by the FEC encoder 7 and then performs the Reed Solomon decoding 125. As those skilled in the art will appreciate, by performing this reordering of the data in this way, if there is a bit shift in the message data output by the data recovery module 39, then it is far less likely to result in a valid codeword and so the FEC error rate output is unlikely to trigger the false identification of a data message.
  • In the above embodiments, each data value was represented by four echoes—two echoes in each of two half symbol periods. As those skilled in the art will appreciate, each data value may be represented by any number of echoes in any number of sub-symbol periods. For example, instead of having two echoes within each half symbol period, each data value may be represented by a single echo in each half symbol period. In this case, the echoes in each half symbol period would preferably be of opposite polarity so that the same differencing technique can be used to reduce the effects of natural echoes. Indeed, the inventors have found that in some cases using two echoes of opposite polarity in each half symbol period can result in some frequency components within the original audio signal adding constructively with the echoes and some frequency components within the original audio signal adding destructively with the echoes. If a single artificial echo is added, then such distortions are less evident making the hidden data less noticeable to users in the acoustic sound that is heard.
  • As those skilled in the art will appreciate, representing each data value by one or more echoes in different sub-symbol periods, means that the echoes in each sub-symbol period will be a repetition of a different portion of the audio signal. If there is only one symbol period, then each data value will be represented by echoes of the same (or substantially the same) portion of the audio signal.
  • In the above embodiments, each data value was represented by a positive and a negative echo in a first half symbol period and by a positive and a negative echo in the second half symbol period. The positive and negative echoes in the first half symbol period allowed the receiver to reduce the effects of periodicities in the original audio signal which effect the autocorrelation measurements. The use of complementary echoes in adjacent half symbol periods allows the receiver to reduce the effect of natural echoes within the received audio signal, which might otherwise mask the artificial echoes added to represent the data. As those skilled in the art will appreciate, in other embodiments, neither or only one of these techniques may be used.
  • In the above embodiment, each data value was represented by echoes within two adjacent half symbol periods. As those skilled in the art will appreciate, these two half symbol periods do not have to be immediately adjacent to each other and a gap may be provided between the two periods if required.
  • In the above embodiment, the echoes in each half symbol period were of exactly the same portion of the audio signal. As those skilled in the art will appreciate, this is not essential. The echoes in each half symbol period may be of slightly different portions of the audio signal. For example, one echo may miss out some of the audio samples of the audio signal. Alternatively, the audio signal may include different channels (for example left and right channels for a stereo signal) and one echo may be formed from a repetition of the left channel and the other may be formed from a repetition of the right channel. With modern multi channel surround sound audio the repetitions can be of any of these channels.
  • In the above embodiment, the echoes generated within the transmitter were added to the original audio signal. As those skilled in the art will appreciate, the generated echoes may be combined with the original audio signal in other ways. For example, the echoes may be subtracted from the audio signal. Similarly, instead of inverting the echoes to be added to the audio (by controlling the polarity of the function g(n)), the same result can be achieved by changing the way in which the echoes are combined with the audio signal. For example, one echo may be added to the original audio signal whilst the next echo may be subtracted from the audio signal.
  • In the above embodiment, the lookup table stored values for g(n) corresponding to one or two bits of the message data (as illustrated in FIG. 6). As those skilled in the art will appreciate, this is not essential. For example, the lookup table could simply store a function which increased in value and then decreased in value. Additional circuitry could then be provided to convert the polarity of this output as appropriate for the two half symbol periods. In this way, the function stored in the lookup table would only control the fading in and out of the echo and the additional circuitry would control the polarity of the echo as required.
  • In the above embodiment, the Manchester encoding was performed by the echo generation and shaping module. As those skilled in the art will appreciate, this Manchester encoding, if performed, may be performed within the FEC encoding module.
  • As those skilled in the art will appreciate, the techniques described above for hiding data within the audio may be done in advance of the transmission of the acoustic signal or it may be done in real time. Even in the case where the data is to be embedded within an audio signal in real time, some of the processing can be done in advance. For example, the FEC encoding may be performed on the data in advance so that only the echo generation and echo shaping is performed in real time.
  • In the above embodiments, specific examples have been given of sample rates for the audio signal and symbol rates for the data that is hidden within the audio signal. As those skilled in the art will appreciate, these rates are not intended to be limiting and they may be varied as required. However, in order to keep the obtrusiveness of the added echoes to a minimum, the data rate of the encoded data is preferably kept between one and twenty symbols per second. This corresponds to a symbol period of between 50 ms and 1 second. In some embodiments, a long symbol period is beneficial because the added echoes will span across spoken words within the audio, making it easier to hide the data echoes within the audio. A larger symbol period also reduces audibility of the echoes. This is because humans are more sensitive to changing echoes than they are to static or fixed echoes. Therefore, by having a longer symbol period, the rate of change of the echoes is lower making the presence of the echoes less noticeable to a user.
  • In the above embodiment, the data rate of the data added to the audio signal in the transmitter was constant and was known by the receiver. This knowledge reduces the complexity of the receiver circuitry for locking on to the data within the received signal. However, it is not essential to the invention and more complex circuitry may be provided in the receiver to allow the receiver to try different data rates until the actual data rate is determined. Similarly, the receiver may use other techniques to synchronise itself with the transmitted data so that it knows where the symbol boundaries are in advance of receiving the data.
  • In the above embodiment, FEC encoding techniques were used to allow the receiver to be able to correct errors in the received data. As those skilled in the art will appreciate, such encoding techniques are not essential to the invention. However, they are preferred, as they help to correct errors that occur in the transmission process over the acoustic link.
  • In the above embodiments, the peak amplitudes of the echoes were all the same and were independent of the data value being transmitted. As those skilled in the art will appreciate, the peak amplitudes of the echoes may also be varied with data to be transmitted if desired.
  • In the above embodiment, the echoes in each half symbol period were at the same delays relative to the original audio signal. As those skilled in the art will appreciate, this is not essential. There may be some variation in the actual delay values used within each half symbol period.
  • In the above embodiment, the second echo within each half symbol period was generated by delaying the first echo by a further delay value. In an alternative embodiment, each echo within each sub-symbol period may be independently generated from the original audio signal using an appropriate delay line.
  • As those skilled in the art will appreciate, various uses can be made of the above communication system. For example, the encoded data may be used as a watermark to protect the original audio signal. Alternatively, the embedded data may be used to control the receiver so that it can respond in synchronism with the audio signal. In particular, the decoder can be programmed to perform some action a defined time after receiving the codeword. The time delay may be programmed into the decoder by any means and may even be defined by data in the received codewords. When used to perform such synchronisation, shorter symbol periods are preferred as shorter symbol periods allows for better temporal resolution and hence more accurate synchronisation. The data may be used for interactive gaming applications, audience surveying, e-commerce systems, toys and the like. The reader is referred to the Applicant's earlier International application WO02/45273 which describes a number of uses for this type of data hiding system.
  • In the above embodiment, the receiver performed autocorrelation measurements on the input audio signal in order to identify the locations of the echoes. As those skilled in the art will appreciate, other techniques can be used to identify the echoes. Some of these other techniques are described in the Applicant's earlier PCT application PCT/GB2008/001820 and in U.S. Pat. No. 5,893,067, the contents of which are incorporated herein by reference. Typically, although not necessarily, the techniques involve some form of autocorrelation of the original audio signal or of parameters obtained from the audio signal (eg LPC parameters, cepstrum parameters etc). As an alternative, a best fit approach could be used in which an expected audio signal (with different echo polarities) is fitted to the actual signal until a match is found and the polarity of the echoes thus determined.
  • In the embodiment described above, a single transmitter was provided together with a receiver. As those skilled in the art will appreciate, multiple transmitters and/or multiple receivers may be provided. Further, the components of the transmitter may be distributed among a number of different entities. For example, the encoding and data hiding part of the transmitter may be provided within a head end of a television distribution system or a user's set top box and the loudspeaker 19 may be a speaker of the user's television set.
  • In the above embodiments, the echoes were directly derived from the original audio signal. In alternative embodiments, the echo may not include all frequency components of the audio signal. For example, one or more of the echoes may be generated from a portion of the audio signal after it has been filtered to remove certain frequencies. This may be beneficial where it is found, for example, that there is additional noise in the low frequency part of the echoes but not in the higher frequency part. In this case, the received signals would also be filtered to remove the lower frequency components (for example frequencies below about 500 Hz) so that only the higher frequency components (those above the lower frequency components) of the audio signal and the echoes would be present in the signals being analysed. Alternatively, in this case, the received signal may be passed through a filter that simply reduces the level of the lower frequency components in the received signal compared with the higher frequency components. This will have the effect of reducing the relevance of the noisy low frequency part of the received signal in the subsequent decoding process. Similarly, if it turns out that the added echoes introduce a noticeable distortion in the higher frequencies of the composite audio signal, then the echoes (or the signals from which they are derived) may be low pass filtered to remove the higher frequencies.
  • The division of the audio signal into separate frequency bands can also be used to carry data on multiple channels. For example, if the frequency band is divided into a high frequency part and a low frequency part, then one channel may be provided by adding echoes to the high frequency part and another channel may be provided by adding different echoes to the low frequency part. The use of multiple channels in this way allows frequency or temporal diversity if the data carried in the two channels is the same; or allows for an increased data transfer rate if each channel carries different data. Multiple channels can also be provided where the audio signal also contains multiple channels (used to drive multiple speakers). In this case, one or more data channels may be provided in the audio signal for each audio channel.
  • In the above embodiment, data was hidden within an audio signal by adding echoes to the audio signal. In some situations, the incoming audio may already contain hidden data in the form of such echoes. In this case, the encoder could decode the existing hidden data from the received audio signal and then use the decoded data to clean the audio signal to remove the artificial echoes defining this hidden data. The encoder could then add new echoes to the thus cleaned audio signal to hide the new data in the audio signal. In this way, the original hidden data will not interfere with the new hidden data.
  • In the above embodiment, the echoes were obtained by delaying digital samples of the audio signal. As those skilled in the art will appreciate, the echoes may be generated in the analogue domain, using suitable analogue delay lines and analogue circuits to perform the echo shaping and polarity modulation.
  • In the above embodiments, the audio signal with the embedded data was transmitted to a receiver over an acoustic link. In an alternative embodiment, the audio signal may be transmitted to the receiver over an electrical wire or wireless link. In such an embodiment, the data rates that are used may be higher, due to lower noise levels.
  • In the above embodiment, one data bit was transmitted within each symbol period. In an alternative embodiment, multiple bits may be transmitted within each symbol period. For example a second pair of echoes may be added at lags of 20 ms and 20.25 ms within each half symbol period to encode a second bit; a third pair of echoes may be added at lags of 30 ms and 30.25 ms within each half symbol period to encode a third bit etc. Each echo could then be faded in and out during each half symbol period and polarity modulated in accordance with the bit value as before. The fading in and out of the echoes for the different bits may be the same or it may be different for the different bits. The polarity modulation of the different echoes will of course depend on the different bit values to be transmitted in the symbol period. In a preferred embodiment, the echoes for the different bits within the same half symbol period are faded in and out at different times of the half symbol period, so that the different echoes reach their peak amplitudes at different times within the half symbol period. In this way, when the echo for one bit is at its peak amplitude (or when all the echoes for one bit are at their peak amplitudes—if there are multiple echoes representing each bit in each half symbol period), the echoes for the other bits will not be at their peaks. Doing this and sampling the different echoes when they are expected to be at their peak amplitudes, will reduce the interference between the echoes for the different bits within the same half symbol period. It also reduces constructive interference of the echoes that may render the added echoes more noticeable to a listener. Looking at this another way, this is the same as having multiple parallel data messages, each encoded as per the embodiments described above, but with their respective symbol periods offset in time from each other so that the echoes for the different messages peak at different times—thereby reducing interference between the messages if the echoes are all sampled around the time when they are each at their maximum amplitudes. This technique will increase the bit rate of data transmission between the transmitter and receiver. The additional bits may be of the same message or they may be bits of different messages.
  • The inventors have found that the above described data hiding techniques do not work as well during portions of the audio that include single tones or multiple harmonic tones, such as would be found in some sections of music. This is because the hidden data becomes more obtrusive to the listener in these circumstances and if the tones are being used as part of an automatic setup procedure they can cause the procedure to fail. Therefore, in one embodiment, the inventors propose to include (within the encoder) a detector that detects the level of tonality or other characteristic of the audio signal and, if it is highly tonal, that switches off the echo addition circuitry. Alternatively, as this switching off of the echoes may itself be noticeable to the user, the encoder may fade the echoes out during periods of high tonality and then fade them back in during periods of low tonality. In this way, the data is only added to the audio signal when the audio signal is not highly tonal in nature. Various techniques may be used for making this detection. One technique for determining the level of tonality of an audio signal (although for a different purpose) is described in the applicant's earlier PCT application WO02/45286, the contents of which are incorporated herein by reference. Another technique can be found in Davis P (1995) “A Tutorial on MPEG/Audio Compression”, IEEE Multimedia Magazine, 2(2), pp. 60-74. Instead of switching off the echo addition circuitry, the system may be arranged to adapt the amplitude of the added echoes depending on the detected characteristic of the audio signal. Alternatively, instead of varying the amplitudes of the echoes in this way, the encoder may instead or in addition vary the data rate or the symbol period in order to reduce the obtrusiveness of the hidden data during periods when the audio signal is highly tonal.
  • An embodiment was described above in which a single message was encoded and transmitted to a remote receiver as a number of echoes within an audio signal. In some applications, a sequence of messages may be transmitted. These messages may be the same or they may be different. In either case, each message may be transmitted after a preceding message has been transmitted. Alternatively, the end of one message may be overlapped with the start of the next message in a predefined way (so that the receiver can regenerate each message. This arrangement can increase the time diversity of the transmitted messages making them less susceptible to certain types of noise or data loss. In a further alternative, the data from the different messages may be interleaved in a known manner and transmitted as a single data stream to the receiver. The receiver would then regenerate each message by de-interleaving the bits in the data stream using knowledge of how the messages were originally interleaved.
  • As discussed above, Convolutional Coding is used as part of the forward error correction (FEC) encoder. As is well known to those skilled in the art, data encoded in this way generally is decoded using a Viterbi decoder, which operates by constructing a trellis of state probabilities and branch metrics. The transmitted data is often terminated with a number of zeros to force the encoder back to the zero state. This allows the decoder to start decoding from a known state, however, it requires extra symbols to be transmitted over the channel. An alternative technique is to ensure that the trellis start and end states are identical. This technique is referred to as tail biting and has the advantage of not requiring any extra symbols to be transmitted. Tail biting is used in many communications standards and, if desired, may be used in the embodiments described above.
  • The description above has described the operation of a system for hiding data as echoes within an audio signal. The systems described used time domain techniques to generate and add the echoes and to detect the echoes in the received signal. As those skilled in the art will appreciate, equivalent processing can be performed in the frequency domain to achieve the same or similar results.
  • The inventors have found that in some instances, the decoder does not work as well when the message consists of predominantly ‘zero’ bits (or conversely predominately ‘one’ bits), since under the encoding scheme an ‘all zeros’ codeword segment looks the same as a time-shifted ‘all ones’ codeword segment. A particular example is the ‘all zeros’ message, which results in an ‘all zeros’ codeword after Reed Solomon encoding. The encoding works best when there are approximately equal numbers of ones and zeros in the codeword, evenly distributed throughout the codeword. This can be achieved for the disclosed system by inverting the Reed Solomon parity bits. This has the effect of changing the all zeroes codeword to a mixture of zeroes and ones. This can also be achieved by altering the initial state of the feedback shift register used within the Reed Solomon encoder which is used to generate the parity bits. This gives more flexibility in setting the ratio of ones to zeroes in the codeword. Subsequent interleaving distributes these inverted parity bits throughout the codeword. As those skilled in the art of error detection and correction will appreciate, these approaches to balancing the distribution of ones and zeroes applies to any of the many FEC schemes implemented using feedback shift registers (or Galois field arithmetic) of which Reed Solomon is an example.
  • In the above embodiments, a number of processing modules and circuits have been described. As those skilled in the art will appreciate, these processing modules and circuits may be provided as hardware circuits or as software modules running within memory of a general purpose processor. In this case, the software may be provided on a storage medium such as a CD-ROM or it may be downloaded into an appropriate programmable device on a carrier signal over a computer network, such as the Internet. The software may be provided in compiled form, partially compiled form or in un-compiled form.

Claims (23)

1-48. (canceled)
49. A method of embedding a plurality of data values in an audio signal, the method comprising:
receiving the audio signal;
generating an echo of at least a portion of the received audio signal; and
embedding the plurality of data values in the audio signal by combining the received audio signal with multiple versions of the generated echo at different delays, wherein each delay is associated with a respective data value.
50. A method according to claim 49, wherein each data value is represented by first and second echoes having different delays and opposite polarities and wherein each data value is represented by third and fourth echoes, the third echo having the same polarity, when combined with said audio signal, as said second echo and the fourth echo having the same polarity, when combined with said audio signal, as said first echo, wherein the first and second echoes are generated by repeating substantially a first portion of the received audio signal and wherein the third and fourth echoes are generated by repeating substantially a second portion of the audio signal which is different to the first portion repeated by the first and second echoes.
51. A method according to claim 50, wherein said second portion is adjacent the first portion.
52. A method according to claim 51, wherein said first echo is combined with said audio signal at a first delay relative to said first portion of the audio signal; said second echo is combined with the audio signal at a second delay relative to said first echo; said third echo is combined with said audio signal at a third delay relative to said second portion of the audio signal; and said fourth echo is combined with the audio signal at a fourth delay relative to said third echo and preferably wherein said first delay is equal to said third delay and/or said second delay is equal to said fourth delay.
53. A method according to claim 49, wherein said delays are between 0.5 ms and 100 ms.
54. A method according to claim 49, wherein said echo of at least a portion of the audio signal has a duration of between 20 ms and 500 ms.
55. A method according to claim 49, wherein the polarity of an echo at a given delay is modulated in dependence upon the data value associated with the delay.
56. A method according to claim 49, wherein said audio signal is received as a stream of samples and wherein each echo is generated by passing the stream of audio samples through a respective delay line.
57. A method according to claim 49, wherein said combining step combines the or each echo with the audio signal by adding and/or subtracting each echo to/from the audio signal in dependence upon the data value.
58. A method according to claim 49, wherein the or each echo associated with each data value is faded in and out at different times within a symbol period so that different echoes reach their peak values at different times within the symbol period.
59. A computer executable instructions product comprising computer implementable instructions for causing a programmable computer device to carry out the method of claim 49.
60. An apparatus for embedding a plurality of data values in an audio signal, the apparatus comprising:
an input for receiving the audio signal;
an echo generator for generating an echo of at least a portion of the received audio signal; and
a combiner for embedding the plurality of data values in the audio signal by combining the received audio signal with multiple versions of the generated echo at different delays, wherein each delay is associated with a respective data value.
61. A method of recovering a plurality of data values embedded in an audio signal, the method comprising:
receiving an input signal having the audio signal and multiple versions of an echo of at least a portion of the audio signal at different delays, wherein each delay is associated with a respective data value;
processing the received input signal to detect said echoes at the delays; and
recovering the plurality of data values from the detected echoes.
62. A method according to claim 61, wherein each data value is represented by first and second echoes having different delays and opposite polarities, wherein each data value is represented by third and fourth echoes, the third echo having the same polarity as said second echo and the fourth echo having the same polarity as said first echo, wherein the first and second echoes are a repeat of substantially a first portion of the received audio signal, wherein the third and fourth echoes are a repeat of substantially a second portion of the audio signal which is different to the first portion repeated by the first and second echoes and wherein said processing processes the received input signal to detect the echoes representing each data value.
63. A method according to claim 62, wherein said second portion is adjacent the first portion.
64. A method according to claim 62, wherein said first echo is at a first delay relative to said first portion of the audio signal; said second echo is at a second delay relative to said first echo; said third echo is at a third delay relative to said second portion of the audio signal; and said fourth echo is at a fourth delay relative to said third echo and preferably wherein said first delay is equal to said third delay and/or said second delay is equal to said fourth delay.
65. A method according to claim 61, wherein said delays are between 0.5 ms and 100 ms.
66. A method according to claim 61, wherein each echo has a duration of between 20 ms and 500 ms.
67. A method according to claim 61, wherein the polarity of an echo at a given delay depends upon the data value associated with the delay and wherein said processing detects the polarity of the echoes and determines said plurality of data values from the detected echo polarities.
68. A method according to claim 61, wherein the or each echo associated with each data value is faded in and out at different times within a symbol period so that different echoes reach their peak values at different times within the symbol period and wherein the processing detects the echoes at a timing when an echo is expected to be at its peak amplitude.
69. A computer executable instructions product comprising computer implementable instructions for causing a programmable computer device to carry out the method of claim 61.
70. An apparatus for recovering a plurality of data values embedded in an audio signal, the apparatus comprising:
an input for receiving an input signal having the audio signal and multiple versions of an echo of at least a portion of the audio signal at different delays, wherein each delay is associated with a respective data value; and
a processor for processing the received input signal to detect said echoes at the delays and for recovering the plurality of data values from the detected echoes.
US12/994,716 2008-05-29 2009-05-29 Data embedding system Abandoned US20110125508A1 (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
GBPCT/GB2008/001820 2008-05-29
PCT/GB2008/001820 WO2008145994A1 (en) 2007-05-29 2008-05-29 Recovery of hidden data embedded in an audio signal
GB0814041A GB2462588A (en) 2008-04-29 2008-07-31 Data embedding system
GB0814041.0 2008-07-31
GB0821841.4A GB2460306B (en) 2008-05-29 2008-11-28 Data embedding system
GB0821841.4 2008-11-28
PCT/GB2009/001354 WO2009144470A1 (en) 2008-05-29 2009-05-29 Data embedding system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2009/001354 A-371-Of-International WO2009144470A1 (en) 2008-05-29 2009-05-29 Data embedding system

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/232,190 Division US8560913B2 (en) 2008-05-29 2011-09-14 Data embedding system

Publications (1)

Publication Number Publication Date
US20110125508A1 true US20110125508A1 (en) 2011-05-26

Family

ID=39768060

Family Applications (2)

Application Number Title Priority Date Filing Date
US12/994,716 Abandoned US20110125508A1 (en) 2008-05-29 2009-05-29 Data embedding system
US13/232,190 Active 2029-09-29 US8560913B2 (en) 2008-05-29 2011-09-14 Data embedding system

Family Applications After (1)

Application Number Title Priority Date Filing Date
US13/232,190 Active 2029-09-29 US8560913B2 (en) 2008-05-29 2011-09-14 Data embedding system

Country Status (11)

Country Link
US (2) US20110125508A1 (en)
EP (3) EP2325839A1 (en)
JP (1) JP2011523091A (en)
CN (2) CN102047324A (en)
BR (1) BRPI0913228B1 (en)
DK (1) DK2631904T3 (en)
ES (1) ES2545058T3 (en)
GB (1) GB2460306B (en)
MX (1) MX2010013076A (en)
PL (1) PL2631904T3 (en)
WO (1) WO2009144470A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130151241A1 (en) * 2011-12-07 2013-06-13 Samsung Electronics Co., Ltd. Method of embedding digital information into audio signal machine-readable storage medium and communication terminal
US20130226586A1 (en) * 2012-02-24 2013-08-29 Pantech Co., Ltd. Device, system, and method for authenticating user using voice recognition
US20150023402A1 (en) * 2012-10-25 2015-01-22 Clip Interactive, Llc Method and system for sub-audible signaling
US20160286312A1 (en) * 2013-11-13 2016-09-29 Om Audio, Llc Signature tuning filters
US9626977B2 (en) * 2015-07-24 2017-04-18 Tls Corp. Inserting watermarks into audio signals that have speech-like properties
CN107395292A (en) * 2017-07-05 2017-11-24 厦门声戎科技有限公司 A kind of Information Hiding Techniques communication means based on marine organisms signal analysis
US10230778B2 (en) 2013-03-05 2019-03-12 Clip Interactive, Llc Apparatus, system, and method for integrating content and content services
US20230030369A1 (en) * 2021-07-27 2023-02-02 Acer Incorporated Processing method of sound watermark and sound watermark generating apparatus
US11599915B1 (en) 2011-10-25 2023-03-07 Auddia Inc. Apparatus, system, and method for audio based browser cookies

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2460306B (en) 2008-05-29 2013-02-13 Intrasonics Sarl Data embedding system
US8582781B2 (en) * 2009-01-20 2013-11-12 Koplar Interactive Systems International, L.L.C. Echo modulation methods and systems
US10705701B2 (en) 2009-03-16 2020-07-07 Apple Inc. Device, method, and graphical user interface for moving a current position in content at a variable scrubbing rate
CN101847409B (en) * 2010-03-25 2012-01-25 北京邮电大学 Voice integrity protection method based on digital fingerprint
JP5601665B2 (en) * 2010-07-22 2014-10-08 Kddi株式会社 Audio digital watermark embedding device, detection device, and program
JP5554658B2 (en) * 2010-08-06 2014-07-23 Kddi株式会社 Audio digital watermark embedding apparatus and program
US10706096B2 (en) 2011-08-18 2020-07-07 Apple Inc. Management of local and remote media items
US9002322B2 (en) 2011-09-29 2015-04-07 Apple Inc. Authentication with secondary approver
JP5364141B2 (en) * 2011-10-28 2013-12-11 楽天株式会社 Portable terminal, store terminal, transmission method, reception method, payment system, payment method, program, and computer-readable storage medium
GB201206564D0 (en) 2012-04-13 2012-05-30 Intrasonics Sarl Event engine synchronisation
WO2014143776A2 (en) 2013-03-15 2014-09-18 Bodhi Technology Ventures Llc Providing remote interactions with host device using a wireless device
KR101929372B1 (en) 2014-05-30 2018-12-17 애플 인크. Transition from use of one device to another
JP5887446B1 (en) 2014-07-29 2016-03-16 ヤマハ株式会社 Information management system, information management method and program
JP5871088B1 (en) 2014-07-29 2016-03-01 ヤマハ株式会社 Terminal device, information providing system, information providing method, and program
US10339293B2 (en) 2014-08-15 2019-07-02 Apple Inc. Authenticated device used to unlock another device
JP6484958B2 (en) 2014-08-26 2019-03-20 ヤマハ株式会社 Acoustic processing apparatus, acoustic processing method, and program
CN110072131A (en) 2014-09-02 2019-07-30 苹果公司 Music user interface
TWI556226B (en) * 2014-09-26 2016-11-01 威盛電子股份有限公司 Synthesis method of audio files and synthesis system of audio files using same
DK179186B1 (en) 2016-05-19 2018-01-15 Apple Inc REMOTE AUTHORIZATION TO CONTINUE WITH AN ACTION
DK201670622A1 (en) 2016-06-12 2018-02-12 Apple Inc User interfaces for transactions
GB2556023B (en) 2016-08-15 2022-02-09 Intrasonics Sarl Audio matching
US20180061875A1 (en) * 2016-08-30 2018-03-01 Stmicroelectronics (Crolles 2) Sas Vertical transfer gate transistor and active cmos image sensor pixel including a vertical transfer gate transistor
US10241796B2 (en) * 2017-02-13 2019-03-26 Yong-Kyu Jung Compiler-assisted lookahead (CAL) memory system apparatus for microprocessors
US10992795B2 (en) 2017-05-16 2021-04-27 Apple Inc. Methods and interfaces for home media control
US11431836B2 (en) 2017-05-02 2022-08-30 Apple Inc. Methods and interfaces for initiating media playback
US10928980B2 (en) 2017-05-12 2021-02-23 Apple Inc. User interfaces for playing and managing audio items
CN111343060B (en) 2017-05-16 2022-02-11 苹果公司 Method and interface for home media control
US20220279063A1 (en) 2017-05-16 2022-09-01 Apple Inc. Methods and interfaces for home media control
WO2020174289A2 (en) 2019-02-27 2020-09-03 Louisiana-Pacific Corporation Fire-resistant manufactured-wood based siding
JP6998338B2 (en) * 2019-03-28 2022-01-18 Toa株式会社 Acoustic signal formers, acoustic receivers, and acoustic systems
US10904029B2 (en) 2019-05-31 2021-01-26 Apple Inc. User interfaces for managing controllable external devices
US11010121B2 (en) 2019-05-31 2021-05-18 Apple Inc. User interfaces for audio media control
EP4134811A1 (en) 2019-05-31 2023-02-15 Apple Inc. User interfaces for audio media control
DK201970533A1 (en) 2019-05-31 2021-02-15 Apple Inc Methods and user interfaces for sharing audio
US11079913B1 (en) 2020-05-11 2021-08-03 Apple Inc. User interface for status indicators
US11392291B2 (en) 2020-09-25 2022-07-19 Apple Inc. Methods and interfaces for media control with dynamic feedback
US11847378B2 (en) 2021-06-06 2023-12-19 Apple Inc. User interfaces for audio routing

Citations (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US657379A (en) * 1900-05-16 1900-09-04 Edward C Bakken Stock for holding cattle while dehorning.
US2660662A (en) * 1947-10-24 1953-11-24 Nielsen A C Co Search signal apparatus for determining the listening habits of wave signal receiver users
US3651471A (en) * 1970-03-02 1972-03-21 Nielsen A C Co Data storage and transmission system
US3732536A (en) * 1970-09-18 1973-05-08 Gte Sylvania Inc Passive object detection apparatus including ambient noise compensation
US3742463A (en) * 1970-03-02 1973-06-26 Nielsen A C Co Data storage and transmission system
US3845391A (en) * 1969-07-08 1974-10-29 Audicom Corp Communication including submerged identification signal
US4025851A (en) * 1975-11-28 1977-05-24 A.C. Nielsen Company Automatic monitor for programs broadcast
US4237449A (en) * 1978-06-16 1980-12-02 Zibell J Scott Signalling device for hard of hearing persons
US4425642A (en) * 1982-01-08 1984-01-10 Applied Spectrum Technologies, Inc. Simultaneous transmission of two information signals within a band-limited communications channel
US4514725A (en) * 1982-12-20 1985-04-30 Bristley Barbara E Window shade mounted alarm system
US4718106A (en) * 1986-05-12 1988-01-05 Weinblatt Lee S Survey of radio audience
US4750034A (en) * 1987-01-21 1988-06-07 Cloeck En Moedigh Bioscoopreclame B.V. Apparatus for monitoring the replay of audio/video information carriers
US4807031A (en) * 1987-10-20 1989-02-21 Interactive Systems, Incorporated Interactive video method and apparatus
US4840602A (en) * 1987-02-06 1989-06-20 Coleco Industries, Inc. Talking doll responsive to external signal
US4846693A (en) * 1987-01-08 1989-07-11 Smith Engineering Video based instructional and entertainment system using animated figure
US4923428A (en) * 1988-05-05 1990-05-08 Cal R & D, Inc. Interactive talking toy
US4945412A (en) * 1988-06-14 1990-07-31 Kramer Robert A Method of and system for identification and verification of broadcasting television and radio program segments
US5085610A (en) * 1991-05-16 1992-02-04 Mattel, Inc. Dual sound toy train set
US5090936A (en) * 1988-07-30 1992-02-25 Takara Co., Ltd. Movable decoration
US5108341A (en) * 1986-05-28 1992-04-28 View-Master Ideal Group, Inc. Toy which moves in synchronization with an audio source
US5113437A (en) * 1988-10-25 1992-05-12 Thorn Emi Plc Signal identification system
US5136613A (en) * 1990-09-28 1992-08-04 Dumestre Iii Alex C Spread Spectrum telemetry
US5191615A (en) * 1990-01-17 1993-03-02 The Drummer Group Interrelational audio kinetic entertainment system
US5301167A (en) * 1992-08-05 1994-04-05 Northeastern University Apparatus for improved underwater acoustic telemetry utilizing phase coherent communications
US5475798A (en) * 1992-01-06 1995-12-12 Handlos, L.L.C. Speech-to-text translator
US5648789A (en) * 1991-10-02 1997-07-15 National Captioning Institute, Inc. Method and apparatus for closed captioning at a performance
US5893067A (en) * 1996-05-31 1999-04-06 Massachusetts Institute Of Technology Method and apparatus for echo data hiding in audio signals
US5940135A (en) * 1997-05-19 1999-08-17 Aris Technologies, Inc. Apparatus and method for encoding and decoding information in analog signals
US6263505B1 (en) * 1997-03-21 2001-07-17 United States Of America System and method for supplying supplemental information for video programs
US20010025241A1 (en) * 2000-03-06 2001-09-27 Lange Jeffrey K. Method and system for providing automated captioning for AV signals
US20020078359A1 (en) * 2000-12-18 2002-06-20 Jong Won Seok Apparatus for embedding and detecting watermark and method thereof
US6463413B1 (en) * 1999-04-20 2002-10-08 Matsushita Electrical Industrial Co., Ltd. Speech recognition training for small hardware devices
US6674861B1 (en) * 1998-12-29 2004-01-06 Kent Ridge Digital Labs Digital audio watermarking using content-adaptive, multiple echo hopping
US20040169581A1 (en) * 2000-02-16 2004-09-02 Verance Corporation Remote control signaling using audio watermarks
US20050240768A1 (en) * 2002-06-03 2005-10-27 Koninklijke Philips Electronics N.V. Re-embedding of watermarks in multimedia signals
US7013301B2 (en) * 2003-09-23 2006-03-14 Predixis Corporation Audio fingerprinting system and method
US20070036357A1 (en) * 2003-09-22 2007-02-15 Koninklijke Philips Electronics N.V. Watermarking of multimedia signals
US20080027734A1 (en) * 2006-07-26 2008-01-31 Nec (China) Co. Ltd. Media program identification method and apparatus based on audio watermarking
US20080049971A1 (en) * 2000-03-24 2008-02-28 Ramos Daniel O Systems and methods for processing content objects
US20090235079A1 (en) * 2005-06-02 2009-09-17 Peter Georg Baum Method and apparatus for watermarking an audio or video signal with watermark data using a spread spectrum

Family Cites Families (121)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3229405C2 (en) 1982-08-06 1984-08-30 Werner 8000 München Janz Device for testing the functionality of remote control transmitters
GB2135536A (en) 1982-12-24 1984-08-30 Wobbot International Limited Sound responsive lighting system and devices incorporating same
GB8314468D0 (en) 1983-05-25 1983-06-29 Agb Research Plc Television monitoring
AU3290084A (en) 1983-09-16 1985-03-21 Audicom Corp. Encoding of transmitted program material
FR2568432B1 (en) 1984-07-30 1990-06-29 Baranoff Rossine Dimitri METHOD AND DEVICE FOR TRANSMITTING CODED INFORMATION OVER THE WIRE OVERLAY ON A TRADITIONAL FREQUENCY MODULATED TRANSMISSION
GB8609524D0 (en) 1986-04-18 1986-05-21 British Broadcasting Corp Video receivers & recorders
GB2196167B (en) 1986-10-01 1991-01-02 Emi Plc Thorn Apparatus for marking a recorded signal
FR2626731B3 (en) 1988-01-28 1990-08-03 Informatique Realite SELF-CONTAINED ELECTRONIC DEVICE FOR ALLOWING PARTICIPATION IN A RADIO OR TELEVISION TRANSMISSION
US5499265A (en) 1989-08-07 1996-03-12 Omnipoint Data Company, Incorporated Spread spectrum correlator
US5446756A (en) 1990-03-19 1995-08-29 Celsat America, Inc. Integrated cellular communications system
JP2648527B2 (en) 1990-07-27 1997-09-03 パイオニア株式会社 Exhibit explanation system
FI88237C (en) 1991-05-24 1993-04-13 Nokia Mobile Phones Ltd Programming the functions of a mobile phone
US5382970A (en) 1991-07-19 1995-01-17 Kiefl; John B. Television viewer monitoring system including portable data meter for each viewer
JPH05145515A (en) 1991-11-19 1993-06-11 Canon Inc Spread spectrum communication equipment
US5319735A (en) 1991-12-17 1994-06-07 Bolt Beranek And Newman Inc. Embedded signalling
US5314336A (en) 1992-02-07 1994-05-24 Mark Diamond Toy and method providing audio output representative of message optically sensed by the toy
US5353352A (en) 1992-04-10 1994-10-04 Ericsson Ge Mobile Communications Inc. Multiple access coding for radio communications
JP2561232B2 (en) 1992-08-31 1996-12-04 双葉電子工業株式会社 Spectrum spread receiver and spectrum spread transmitter / receiver using this device
JPH06133367A (en) 1992-09-23 1994-05-13 Walt Disney Co:The Method and apparatus for remote synchronization of audio, illumination, animation and special effect
AU678806B2 (en) 1992-11-16 1997-06-12 Arbitron Inc. Method and apparatus for encoding/decoding broadcast or recorded segments and monitoring audience exposure thereto
US5570349A (en) 1994-06-07 1996-10-29 Stanford Telecommunications, Inc. Wireless direct sequence spread spectrum digital cellular telephone system
US5630203A (en) 1993-01-12 1997-05-13 Weinblatt; Lee S. Technique for surveying a radio or a television audience
US5412620A (en) 1993-03-24 1995-05-02 Micrilor, Inc. Hydroacoustic communications system robust to multipath
US5442343A (en) 1993-06-21 1995-08-15 International Business Machines Corporation Ultrasonic shelf label method and apparatus
US5481294A (en) 1993-10-27 1996-01-02 A. C. Nielsen Company Audience measurement system utilizing ancillary codes and passive signatures
US5436941A (en) 1993-11-01 1995-07-25 Omnipoint Corporation Spread spectrum spectral density techniques
US5574773A (en) 1994-02-22 1996-11-12 Qualcomm Incorporated Method and apparatus of providing audio feedback over a digital channel
US5457807A (en) 1994-03-21 1995-10-10 Weinblatt; Lee S. Technique for surveying a radio or a television audience
US5450490A (en) 1994-03-31 1995-09-12 The Arbitron Company Apparatus and methods for including codes in audio signals and decoding
US5519779A (en) 1994-08-05 1996-05-21 Motorola, Inc. Method and apparatus for inserting signaling in a communication system
JPH07336460A (en) 1994-06-03 1995-12-22 Hitachi Ltd Data communication equipment
US5555258A (en) 1994-06-17 1996-09-10 P. Stuckey McIntosh Home personal communication system
CA2129925A1 (en) 1994-08-11 1996-02-12 Hendrik Adolf Eldert Zwaneveld Audio synchronization of subtitles
US5539705A (en) 1994-10-27 1996-07-23 Martin Marietta Energy Systems, Inc. Ultrasonic speech translator and communications system
US6021432A (en) 1994-10-31 2000-02-01 Lucent Technologies Inc. System for processing broadcast stream comprises a human-perceptible broadcast program embedded with a plurality of human-imperceptible sets of information
EP0710022A3 (en) 1994-10-31 1998-08-26 AT&T Corp. System and method for encoding digital information in a television signal
EP0713335A2 (en) 1994-11-15 1996-05-22 AT&T Corp. System and method for wireless capture of encoded data transmitted with a television, video or audio signal and subsequent initiation of a transaction using such data
US5828956A (en) 1994-12-30 1998-10-27 Sony Electronics, Inc. Programmable cellular telephone and system
US5774452A (en) 1995-03-14 1998-06-30 Aris Technologies, Inc. Apparatus and method for encoding and decoding information in audio signals
US5978413A (en) 1995-08-28 1999-11-02 Bender; Paul E. Method and system for processing a plurality of multiple access transmissions
US5937000A (en) 1995-09-06 1999-08-10 Solana Technology Development Corporation Method and apparatus for embedding auxiliary data in a primary data signal
US5822360A (en) 1995-09-06 1998-10-13 Solana Technology Development Corporation Method and apparatus for transporting auxiliary data in audio signals
EP0769611A1 (en) 1995-09-22 1997-04-23 Bernd Scheffel Apparatus for intermittently atomizing and injecting fuel
CA2184949C (en) 1995-09-28 2000-05-30 Ingemar J. Cox Secure spread spectrum watermarking for multimedia data
US5752880A (en) 1995-11-20 1998-05-19 Creator Ltd. Interactive doll
US5719937A (en) 1995-12-06 1998-02-17 Solana Technology Develpment Corporation Multi-media copy management system
US5687191A (en) 1995-12-06 1997-11-11 Solana Technology Development Corporation Post-compression hidden data transport
EP0779759B1 (en) 1995-12-11 2005-10-26 Openwave Systems Inc. A method and architecture for an interactive two-way data communication network
US6035177A (en) 1996-02-26 2000-03-07 Donald W. Moses Simultaneous transmission of ancillary and audio signals by means of perceptual coding
WO1997033391A1 (en) 1996-03-07 1997-09-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Coding process for inserting an inaudible data signal into an audio signal, decoding process, coder and decoder
US5828325A (en) 1996-04-03 1998-10-27 Aris Technologies, Inc. Apparatus and method for encoding and decoding information in analog signals
EP0852441B1 (en) 1996-06-20 2010-08-25 International Business Machines Corporation Data hiding method
US5918223A (en) 1996-07-22 1999-06-29 Muscle Fish Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information
US5960398A (en) 1996-07-31 1999-09-28 Wictor Company Of Japan, Ltd. Copyright information embedding apparatus
US6061793A (en) 1996-08-30 2000-05-09 Regents Of The University Of Minnesota Method and apparatus for embedding data, including watermarks, in human perceptible sounds
US5848155A (en) 1996-09-04 1998-12-08 Nec Research Institute, Inc. Spread spectrum watermark for embedded signalling
US7607147B1 (en) 1996-12-11 2009-10-20 The Nielsen Company (Us), Llc Interactive service device metering systems
GB9700854D0 (en) 1997-01-16 1997-03-05 Scient Generics Ltd Sub-audible acoustic data transmission mechanism
JP3690043B2 (en) 1997-03-03 2005-08-31 ソニー株式会社 Audio information transmission apparatus and method, and audio information recording apparatus
CA2225060A1 (en) 1997-04-09 1998-10-09 Peter Suilun Fong Interactive talking dolls
US6125172A (en) 1997-04-18 2000-09-26 Lucent Technologies, Inc. Apparatus and method for initiating a transaction having acoustic data receiver that filters human voice
GB2326572A (en) 1997-06-19 1998-12-23 Softsound Limited Low bit rate audio coder and decoder
IL121642A0 (en) 1997-08-27 1998-02-08 Creator Ltd Interactive talking toy
US5945932A (en) 1997-10-30 1999-08-31 Audiotrack Corporation Technique for embedding a code in an audio signal and for detecting the embedded code
US6467089B1 (en) 1997-12-23 2002-10-15 Nielsen Media Research, Inc. Audience measurement system incorporating a mobile handset
JP3673664B2 (en) 1998-01-30 2005-07-20 キヤノン株式会社 Data processing apparatus, data processing method, and storage medium
AUPP170298A0 (en) 1998-02-06 1998-03-05 Pracas, Victor Manuel Electronic interactive puppet
US6389055B1 (en) 1998-03-30 2002-05-14 Lucent Technologies, Inc. Integrating digital data with perceptible signals
AU3656999A (en) * 1998-05-12 1999-11-29 Solana Technology Development Corporation Digital hidden data transport (dhdt)
WO2000021203A1 (en) 1998-10-02 2000-04-13 Comsense Technologies, Ltd. A method to use acoustic signals for computer communications
US6607136B1 (en) 1998-09-16 2003-08-19 Beepcard Inc. Physical presence digital authentication system
FI982363A (en) 1998-10-30 2000-05-01 Nokia Mobile Phones Ltd A method and system for limiting the operation of a radio device in a particular area
TW484267B (en) 1998-11-05 2002-04-21 Sony Corp Additional information transmission method, additional information transmission system, information signal output apparatus, information signal processing apparatus information signal recording apparatus and information signal recording medium
JP2000152217A (en) 1998-11-09 2000-05-30 Toshiba Corp Video acquisition limit system, video acquisition permission reject signal transmitter and video acquisition limit device
US6370666B1 (en) 1998-12-02 2002-04-09 Agere Systems Guardian Corp. Tuning scheme for error-corrected broadcast programs
US6876623B1 (en) 1998-12-02 2005-04-05 Agere Systems Inc. Tuning scheme for code division multiplex broadcasting system
US6512919B2 (en) 1998-12-14 2003-01-28 Fujitsu Limited Electronic shopping system utilizing a program downloadable wireless videophone
GB9828101D0 (en) 1998-12-21 1999-02-17 Roke Manor Research Acoustic marketing device
US6442283B1 (en) 1999-01-11 2002-08-27 Digimarc Corporation Multimedia data embedding
JP2000207170A (en) 1999-01-14 2000-07-28 Sony Corp Device and method for processing information
GB9902235D0 (en) 1999-02-01 1999-03-24 Emuse Corp Interactive system
JP3465616B2 (en) 1999-02-12 2003-11-10 株式会社デンソー Data distribution system and information distribution center
JP4024957B2 (en) 1999-03-12 2007-12-19 シャープ株式会社 Communication apparatus and communication system
US6765950B1 (en) 1999-04-01 2004-07-20 Custom One Design, Inc. Method for spread spectrum communication of supplemental information
JP2000308130A (en) 1999-04-16 2000-11-02 Casio Comput Co Ltd Communication system
US6674993B1 (en) 1999-04-30 2004-01-06 Microvision, Inc. Method and system for identifying data locations associated with real world observations
US6298322B1 (en) 1999-05-06 2001-10-02 Eric Lindemann Encoding and synthesis of tonal audio signals using dominant sinusoids and a vector-quantized residual tonal signal
US7031271B1 (en) 1999-05-19 2006-04-18 Motorola, Inc. Method of and apparatus for activating a spread-spectrum radiotelephone
US6442518B1 (en) 1999-07-14 2002-08-27 Compaq Information Technologies Group, L.P. Method for refining time alignments of closed captions
GB9917985D0 (en) 1999-07-30 1999-09-29 Scient Generics Ltd Acoustic communication system
GB2345779B (en) 1999-10-12 2004-08-11 Roke Manor Research Apparatus for controlling a remote item
AU2881300A (en) 1999-10-27 2001-05-08 Nielsen Media Research, Inc. System and method for encoding an audio signal for use in broadcast program identification systems, by adding inaudible codes to the audio signal
US20010030710A1 (en) 1999-12-22 2001-10-18 Werner William B. System and method for associating subtitle data with cinematic material
US6438117B1 (en) 2000-01-07 2002-08-20 Qualcomm Incorporated Base station synchronization for handover in a hybrid GSM/CDMA network
US6773344B1 (en) 2000-03-16 2004-08-10 Creator Ltd. Methods and apparatus for integration of interactive toys with interactive television and cellular communication systems
CN100592788C (en) 2000-04-14 2010-02-24 日本电信电话株式会社 Method, system, and apparatus for acquiring information concerning broadcast information
US6708214B1 (en) 2000-04-21 2004-03-16 Openwave Systems Inc. Hypermedia identifier input mode for a mobile communication device
EP1158800A1 (en) 2000-05-18 2001-11-28 Deutsche Thomson-Brandt Gmbh Method and receiver for providing audio translation data on demand
US6990453B2 (en) 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
US6782253B1 (en) 2000-08-10 2004-08-24 Koninklijke Philips Electronics N.V. Mobile micro portal
US20020069263A1 (en) 2000-10-13 2002-06-06 Mark Sears Wireless java technology
US6892175B1 (en) 2000-11-02 2005-05-10 International Business Machines Corporation Spread spectrum signaling for speech watermarking
ATE411658T1 (en) 2000-11-30 2008-10-15 Intrasonics Ltd APPARATUS AND SYSTEM FOR USING A DATA SIGNAL INTEGRATED IN AN ACOUSTIC SIGNAL
AU2211102A (en) 2000-11-30 2002-06-11 Scient Generics Ltd Acoustic communication system
JP2005500736A (en) 2001-08-14 2005-01-06 セントラル リサーチ ラボラトリーズ リミティド A system that provides access to information related to broadcast signals
US6862355B2 (en) 2001-09-07 2005-03-01 Arbitron Inc. Message reconstruction from partial detection
WO2003026153A1 (en) 2001-09-20 2003-03-27 Exo-Brain, Inc. Input-output device with universal phone port
KR100430566B1 (en) * 2001-11-02 2004-05-10 한국전자통신연구원 Method and Apparatus of Echo Signal Injecting in Audio Water-Marking using Echo Signal
EP1459459B1 (en) 2001-12-05 2017-05-31 Disney Enterprises, Inc. System and method of wirelessly triggering portable devices
US7308486B2 (en) 2001-12-06 2007-12-11 Accenture Global Services Gmbh Mobile guide communications system
JP3554825B2 (en) 2002-03-11 2004-08-18 東北大学長 Digital watermark system
AU2003282763A1 (en) 2002-10-15 2004-05-04 Verance Corporation Media monitoring, management and information system
US6845360B2 (en) 2002-11-22 2005-01-18 Arbitron Inc. Encoding multiple messages in audio data and detecting same
KR200337174Y1 (en) * 2003-08-25 2004-01-07 남준노 Mould assembly for constructing the tunnel type water conduit
US7616776B2 (en) * 2005-04-26 2009-11-10 Verance Corproation Methods and apparatus for enhancing the robustness of watermark extraction from digital host content
EP1542227A1 (en) 2003-12-11 2005-06-15 Deutsche Thomson-Brandt Gmbh Method and apparatus for transmitting watermark data bits using a spread spectrum, and for regaining watermark data bits embedded in a spread spectrum
KR101158717B1 (en) 2004-06-08 2012-06-22 코닌클리케 필립스 일렉트로닉스 엔.브이. Coding reverberant sound signals
KR100644627B1 (en) * 2004-09-14 2006-11-10 삼성전자주식회사 Method for encoding a sound field control information and method for processing therefor
GB0710211D0 (en) * 2007-05-29 2007-07-11 Intrasonics Ltd AMR Spectrography
GB2460306B (en) 2008-05-29 2013-02-13 Intrasonics Sarl Data embedding system
JP5252578B2 (en) 2009-08-31 2013-07-31 学校法人東北学院 Underwater detection device and fish species discrimination method

Patent Citations (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US657379A (en) * 1900-05-16 1900-09-04 Edward C Bakken Stock for holding cattle while dehorning.
US2660662A (en) * 1947-10-24 1953-11-24 Nielsen A C Co Search signal apparatus for determining the listening habits of wave signal receiver users
US3845391A (en) * 1969-07-08 1974-10-29 Audicom Corp Communication including submerged identification signal
US3651471A (en) * 1970-03-02 1972-03-21 Nielsen A C Co Data storage and transmission system
US3742463A (en) * 1970-03-02 1973-06-26 Nielsen A C Co Data storage and transmission system
US3732536A (en) * 1970-09-18 1973-05-08 Gte Sylvania Inc Passive object detection apparatus including ambient noise compensation
US4025851A (en) * 1975-11-28 1977-05-24 A.C. Nielsen Company Automatic monitor for programs broadcast
US4237449A (en) * 1978-06-16 1980-12-02 Zibell J Scott Signalling device for hard of hearing persons
US4425642A (en) * 1982-01-08 1984-01-10 Applied Spectrum Technologies, Inc. Simultaneous transmission of two information signals within a band-limited communications channel
US4514725A (en) * 1982-12-20 1985-04-30 Bristley Barbara E Window shade mounted alarm system
US4718106A (en) * 1986-05-12 1988-01-05 Weinblatt Lee S Survey of radio audience
US5108341A (en) * 1986-05-28 1992-04-28 View-Master Ideal Group, Inc. Toy which moves in synchronization with an audio source
US4846693A (en) * 1987-01-08 1989-07-11 Smith Engineering Video based instructional and entertainment system using animated figure
US4750034A (en) * 1987-01-21 1988-06-07 Cloeck En Moedigh Bioscoopreclame B.V. Apparatus for monitoring the replay of audio/video information carriers
US4840602A (en) * 1987-02-06 1989-06-20 Coleco Industries, Inc. Talking doll responsive to external signal
US4807031A (en) * 1987-10-20 1989-02-21 Interactive Systems, Incorporated Interactive video method and apparatus
US4923428A (en) * 1988-05-05 1990-05-08 Cal R & D, Inc. Interactive talking toy
US4945412A (en) * 1988-06-14 1990-07-31 Kramer Robert A Method of and system for identification and verification of broadcasting television and radio program segments
US5090936A (en) * 1988-07-30 1992-02-25 Takara Co., Ltd. Movable decoration
US5113437A (en) * 1988-10-25 1992-05-12 Thorn Emi Plc Signal identification system
US5191615A (en) * 1990-01-17 1993-03-02 The Drummer Group Interrelational audio kinetic entertainment system
US5136613A (en) * 1990-09-28 1992-08-04 Dumestre Iii Alex C Spread Spectrum telemetry
US5085610A (en) * 1991-05-16 1992-02-04 Mattel, Inc. Dual sound toy train set
US5648789A (en) * 1991-10-02 1997-07-15 National Captioning Institute, Inc. Method and apparatus for closed captioning at a performance
US5475798A (en) * 1992-01-06 1995-12-12 Handlos, L.L.C. Speech-to-text translator
US5301167A (en) * 1992-08-05 1994-04-05 Northeastern University Apparatus for improved underwater acoustic telemetry utilizing phase coherent communications
US5893067A (en) * 1996-05-31 1999-04-06 Massachusetts Institute Of Technology Method and apparatus for echo data hiding in audio signals
US6263505B1 (en) * 1997-03-21 2001-07-17 United States Of America System and method for supplying supplemental information for video programs
US5940135A (en) * 1997-05-19 1999-08-17 Aris Technologies, Inc. Apparatus and method for encoding and decoding information in analog signals
US6674861B1 (en) * 1998-12-29 2004-01-06 Kent Ridge Digital Labs Digital audio watermarking using content-adaptive, multiple echo hopping
US6463413B1 (en) * 1999-04-20 2002-10-08 Matsushita Electrical Industrial Co., Ltd. Speech recognition training for small hardware devices
US20040169581A1 (en) * 2000-02-16 2004-09-02 Verance Corporation Remote control signaling using audio watermarks
US20010025241A1 (en) * 2000-03-06 2001-09-27 Lange Jeffrey K. Method and system for providing automated captioning for AV signals
US20080049971A1 (en) * 2000-03-24 2008-02-28 Ramos Daniel O Systems and methods for processing content objects
US20020078359A1 (en) * 2000-12-18 2002-06-20 Jong Won Seok Apparatus for embedding and detecting watermark and method thereof
US20050240768A1 (en) * 2002-06-03 2005-10-27 Koninklijke Philips Electronics N.V. Re-embedding of watermarks in multimedia signals
US20070036357A1 (en) * 2003-09-22 2007-02-15 Koninklijke Philips Electronics N.V. Watermarking of multimedia signals
US7013301B2 (en) * 2003-09-23 2006-03-14 Predixis Corporation Audio fingerprinting system and method
US20090235079A1 (en) * 2005-06-02 2009-09-17 Peter Georg Baum Method and apparatus for watermarking an audio or video signal with watermark data using a spread spectrum
US20080027734A1 (en) * 2006-07-26 2008-01-31 Nec (China) Co. Ltd. Media program identification method and apparatus based on audio watermarking

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11830043B2 (en) 2011-10-25 2023-11-28 Auddia Inc. Apparatus, system, and method for audio based browser cookies
US11599915B1 (en) 2011-10-25 2023-03-07 Auddia Inc. Apparatus, system, and method for audio based browser cookies
KR101969316B1 (en) 2011-12-07 2019-04-17 삼성전자주식회사 Method of embedding digital information into audio signal, machine-readable storage medium and communication terminal
KR20130064028A (en) * 2011-12-07 2013-06-17 삼성전자주식회사 Method of embedding digital information into audio signal, machine-readable storage medium and communication terminal
US8972246B2 (en) * 2011-12-07 2015-03-03 Samsung Electronics Co., Ltd. Method of embedding digital information into audio signal machine-readable storage medium and communication terminal
US20130151241A1 (en) * 2011-12-07 2013-06-13 Samsung Electronics Co., Ltd. Method of embedding digital information into audio signal machine-readable storage medium and communication terminal
US20130226586A1 (en) * 2012-02-24 2013-08-29 Pantech Co., Ltd. Device, system, and method for authenticating user using voice recognition
US20150023402A1 (en) * 2012-10-25 2015-01-22 Clip Interactive, Llc Method and system for sub-audible signaling
US9391727B2 (en) * 2012-10-25 2016-07-12 Clip Interactive, Llc Method and system for sub-audible signaling
US10230778B2 (en) 2013-03-05 2019-03-12 Clip Interactive, Llc Apparatus, system, and method for integrating content and content services
US10623856B2 (en) 2013-11-13 2020-04-14 Om Audio, Llc Signature tuning filters
US10375476B2 (en) * 2013-11-13 2019-08-06 Om Audio, Llc Signature tuning filters
US20160286312A1 (en) * 2013-11-13 2016-09-29 Om Audio, Llc Signature tuning filters
US9626977B2 (en) * 2015-07-24 2017-04-18 Tls Corp. Inserting watermarks into audio signals that have speech-like properties
CN107395292A (en) * 2017-07-05 2017-11-24 厦门声戎科技有限公司 A kind of Information Hiding Techniques communication means based on marine organisms signal analysis
US20230030369A1 (en) * 2021-07-27 2023-02-02 Acer Incorporated Processing method of sound watermark and sound watermark generating apparatus

Also Published As

Publication number Publication date
BRPI0913228A8 (en) 2016-07-05
GB2460306A (en) 2009-12-02
CN102047324A (en) 2011-05-04
EP2301018A1 (en) 2011-03-30
JP2011523091A (en) 2011-08-04
GB2460306B (en) 2013-02-13
BRPI0913228A2 (en) 2016-01-19
CN102881290B (en) 2015-06-10
BRPI0913228B1 (en) 2020-09-15
DK2631904T3 (en) 2015-09-28
US20120004920A1 (en) 2012-01-05
EP2631904A1 (en) 2013-08-28
EP2325839A1 (en) 2011-05-25
EP2631904B1 (en) 2015-07-01
CN102881290A (en) 2013-01-16
GB0821841D0 (en) 2009-01-07
MX2010013076A (en) 2011-08-08
PL2631904T3 (en) 2015-12-31
ES2545058T3 (en) 2015-09-08
US8560913B2 (en) 2013-10-15
WO2009144470A1 (en) 2009-12-03

Similar Documents

Publication Publication Date Title
US8560913B2 (en) Data embedding system
KR101699548B1 (en) Encoder, decoder and method for encoding and decoding
JP4098773B2 (en) Receiving apparatus and receiving method
JP3238448B2 (en) Signal transmission equipment
MX2013003113A (en) Watermark generator, watermark decoder, method for providing a watermarked signal based on discrete valued data and method for providing discrete valued data in dependence on a watermarked signal.
JP2007517441A5 (en)
US6658112B1 (en) Voice decoder and method for detecting channel errors using spectral energy evolution
JP3154580B2 (en) Digital transmission equipment
US6782046B1 (en) Decision-directed adaptation for coded modulation
WO2004098142A1 (en) Multiuser detection aided multiple access differential m-ary coding applications
KR101078198B1 (en) Method for early decoding of control channel information
US7376171B2 (en) Multiuser detection aided multiple access differential M-ary coding applications
JP2000004171A (en) Mobile communication method
JP6313577B2 (en) OFDM transmitter for wireless microphone and transmission / reception system
RU2295198C1 (en) Code cyclic synchronization method
KR20010042372A (en) detection of code vectors in single frequency, multiple transmitter networks
WO2010060734A2 (en) Signal reception
JP3943447B2 (en) Data transmitting apparatus, data receiving apparatus, data encoding method, and data decoding method,
GB2462588A (en) Data embedding system
Chou et al. Audio data hiding with application to surround sound
US20110144985A1 (en) Method for Transporting Low-Bit Rate Information
JP2001268170A (en) Receiver and receiving control method for digital telephone system
JPH05276122A (en) Voice decoder
JPH0744197A (en) Speech decoding device
JP2001296898A (en) Voice encoding system using hamming distance and voice transmission system

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTRASONICS S.A.R.L., LUXEMBOURG

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KELLY, PETER;REYNOLDS, MICHAEL REYMOND;SUTTON, CHRISTOPHER JOHN JOSEPH;SIGNING DATES FROM 20110107 TO 20110202;REEL/FRAME:025778/0520

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION