WO2005099252A1 - Audio level control - Google Patents

Publication number
WO2005099252A1
Authority
WO
WIPO (PCT)
Prior art keywords
channel
level
sound
acl
channels
Prior art date
Application number
PCT/IB2005/051080
Other languages
French (fr)
Inventor
Mark J. W. Mertens
Ronaldus M. Aarts
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
Application filed by Koninklijke Philips Electronics N.V.
Priority to JP2007506889A (patent JP4913038B2)
Priority to US10/599,630 (patent US8600077B2)
Priority to EP05718606.6A (patent EP1736001B2)
Priority to KR1020067020871A (patent KR101249239B1)
Publication of WO2005099252A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/44 Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/60 Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
    • H04N5/607 Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals for more than one sound signal, e.g. stereo, multilanguages
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508 Management of client data or end-user data
    • H04N21/4532 Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462 Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4622 Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H04N21/4852 End-user interface for client configuration for modifying audio parameters, e.g. switching between mono and stereo
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84 Generation or processing of descriptive data, e.g. content descriptors
    • H04N21/8405 Generation or processing of descriptive data, e.g. content descriptors represented by keywords
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431 Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312 Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • H04N21/4314 Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for fitting data in a restricted space on the screen, e.g. EPG data in a rectangular grid
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/44 Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/445 Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
    • H04N5/45 Picture in picture, e.g. displaying simultaneously another television channel in a region of the screen

Definitions

  • the present invention relates to controlling multiple audio levels. More in particular, the present invention relates to a device for controlling the sound levels of a group of audio channels which can be rendered simultaneously.
  • a television set may, for example, be able to provide a "split-screen" arrangement in which the television screen is divided into two or more sections, each section displaying a different video channel.
  • the corresponding audio channels may be rendered using different loudspeakers.
  • the sound level of these audio channels must be controlled in such a way that the viewer is able to listen to one or more channels, changing the sound level of the channels when a commercial break starts or when a particularly interesting topic or video item is announced.
  • United States Patent US 6,590,618 discloses a method and apparatus for changing a channel or varying a volume (sound) level of a television receiver having both a normal screen mode function and a multiple screen mode function.
  • a remote control unit has a separate set of sound level keys for each of the multiple screens. Although the screen which would be shown in single screen mode is labeled "main picture", the sound levels associated with the multiple screens are independently controlled by the user. When the user wants to listen more closely to one of the multiple channels, (s)he has to increase the sound level of that channel and/or reduce the sound level of the at least one other channel manually, using the separate control keys. It will be clear that this is impractical.
  • the present invention provides a device for controlling the sound levels of a group of audio channels comprising a main channel and at least one auxiliary channel which can be rendered simultaneously, the device comprising: - user controlled selection means for selecting the main channel, and - automatic level adjustment means for adjusting the sound level of the at least one auxiliary channel relative to the main channel.
  • the device of the present invention comprises user controlled level adjustment means, hereinafter called first level adjustment means, for adjusting the sound level of the main channel.
  • as the second level adjustment means are arranged for adjusting the auxiliary channel(s), the user has to control only a single channel, the main channel, in order to obtain a suitable overall sound level. This reduces both the amount of effort required by the user and the number of required keys on the (remote) control unit.
  • the respective sound levels of the channels may be weighted and/or mutually adjusted using various suitable techniques. In this way the interference of the various audio channels as experienced by the user may be significantly reduced.
  • it is noted that the user controlled (first) level adjustment means mentioned above are not essential and that embodiments of the device of the present invention can be envisaged in which the sound level of the main channel is fixed, or is controlled by level control means external to said device.
  • the (second) level adjustment means for adjusting the sound level of the auxiliary channel(s) are specifically referred to as being automatic
  • the user controlled (first) level control means may in certain embodiments also provide automatic level adjustment in addition to user adjustment.
  • the audio channels mentioned above may be part of communication channels containing audio (sound), video (moving images), pictures (still images), text, and/or other content items.
  • the present invention is particularly suitable for television (combined video and audio channels) but is not so limited and may also be applied in systems providing audio only.
  • the user controlled selection means allow a user to select one rendered (for example shown and/or played) channel as the main channel; all other rendered channels are designated auxiliary channels. The user will typically select as the main channel the channel which (s)he finds the most interesting to listen to.
  • the selection means are arranged for selecting successive available channels in response to user input. This allows the user to step through a succession of available channels using only a single (hardware or software) button or key.
  • buttons could be provided, one for each channel.
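  • by way of non-limiting illustration, stepping through the rendered channels with a single button could be sketched as follows in Python. The function names next_main_channel and designate, and the channel labels, are assumptions made for this sketch only and are not part of the described device.

```python
def next_main_channel(rendered, current_main):
    """Return the rendered channel that follows current_main, wrapping around.

    Models a single "select" key: each press designates the next
    rendered channel as the main channel.
    """
    index = rendered.index(current_main)
    return rendered[(index + 1) % len(rendered)]


def designate(rendered, main):
    """Label the selected channel as main and all other rendered channels as auxiliary."""
    return {ch: ("main" if ch == main else "auxiliary") for ch in rendered}


if __name__ == "__main__":
    channels = ["Ch1", "Ch2", "Ch3"]      # hypothetical channel labels
    main = "Ch1"
    main = next_main_channel(channels, main)  # one key press
    print(designate(channels, main))
```

  • with one button per channel instead, the stepping function would not be needed and designate could be called directly with the channel belonging to the pressed button.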
  • the first level adjustment means may be controlled by conventional sound level adjustment elements such as "volume up” and “volume down” buttons on a (remote) control unit.
  • the second level adjustment means are automatic in that they do not necessarily require user control but adjust the level(s) of the auxiliary channel(s) in response to, for example, changes in the sound level of the main channel or an auxiliary channel. Although a virtually infinite number of different level(s) of the auxiliary channel(s) could be provided, it is preferred that the second level adjustment means provides a plurality of pre-set relative sound levels. In this way it is possible to quickly and conveniently step through a number of levels.
  • pre-set levels may differ in absolute and/or relative terms, where relative is multiplicative with respect to the level of the main channel.
  • the said plurality of pre-set levels may be per channel and/or per user. In the latter case, each user may be provided with an individual series of pre-set levels.
  • the pre-set levels are preferably factory-set, but in an advantageous embodiment the pre-set relative sound levels may be altered by the user. This allows the sound levels of the auxiliary channels to be adapted to the user's preferences and/or hearing.
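  • a minimal Python sketch of such pre-set relative levels is given below. It assumes multiplicative (relative) presets, one series per user, which the user may alter. The class name AuxiliaryPresets, its methods, the user names and the ratio values 0.2, 0.3 and 0.4 are hypothetical and chosen for illustration only.

```python
class AuxiliaryPresets:
    """A series of pre-set auxiliary levels, relative to the main channel level."""

    def __init__(self, ratios=(0.2, 0.3, 0.4)):  # hypothetical "factory" values
        self.ratios = list(ratios)
        self.index = 0

    def step(self):
        """Advance to the next pre-set ratio, wrapping around at the end."""
        self.index = (self.index + 1) % len(self.ratios)
        return self.ratios[self.index]

    def alter(self, index, ratio):
        """Let the user overwrite one pre-set ratio."""
        self.ratios[index] = ratio

    def auxiliary_level(self, main_level):
        """Relative preset: auxiliary level = ratio * main level."""
        return main_level * self.ratios[self.index]


# One individual series of presets per user (user names are placeholders).
presets_per_user = {"user_a": AuxiliaryPresets(), "user_b": AuxiliaryPresets((0.1, 0.5))}

if __name__ == "__main__":
    presets = presets_per_user["user_a"]
    print(presets.auxiliary_level(10.0))
    presets.step()
    print(presets.auxiliary_level(10.0))
```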
  • the second level adjustment means are arranged for adapting the respective sound levels to the content of each associated audio channel and/or to the sound source.
  • a detector could be provided for detecting sound characteristics of the audio content, and/or changes in associated video content, for example by motion or color analysis. Such detectors are known per se; an exemplary speech detector is disclosed in United States Patent US 5,878,391.
  • the level adjustment means are arranged for adapting the respective sound levels to user preferences regarding the content of the channels. That is, user preferences with respect to content (movies, news items, commercials, etc.) may be stored and used to choose desired sound levels when such content is rendered. It is possible to extract information from the channels indicating the type of music being provided by the channel, whereby the second level adjustment means can automatically set the corresponding level(s). Such information could be provided by the channels as meta-data, that is data describing the content of the channels, or could be derived from the content itself, using a suitable detector as mentioned above.
  • the second level adjustment means are arranged for adapting the respective sound levels to the signal characteristics of each associated audio channel. That is, the second level adjustment means of this embodiment are responsive to the signal characteristics and adjust the signal level(s) accordingly.
  • the second level adjustment means are arranged for speech detection. More in particular, the second level adjustment means may further be arranged for formant detection, prosody detection and/or keyword detection. This allows intelligent software to change the sound level when a news item or movie begins, for example.
  • the device of the present invention preferably takes the signal characteristics of all channels into account, including the main channel, and adjusts the level(s) of the auxiliary channel(s) in response thereto.
  • the level adjustment means are arranged for temporarily adjusting the sound level of a channel in response to the content and/or signal characteristics of at least one channel. That is, the sound level may be raised for a duration of approximately one second or several seconds to alert the user to a particular content item, for example an announcement containing a certain key word, or a particular type of signal, such as speech. It is preferred that the raised sound level gradually reverts to its original state. In some embodiments both the increase and the decrease of the sound level are gradual. While the sound level of the channel containing the content item of interest may be temporarily raised, the sound levels of the other channel(s) being rendered may be temporarily lowered during the same time duration so as to make the content item concerned more audible.
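  • the temporary adjustment described above can be sketched, under stated assumptions, as a time-dependent gain envelope in Python. The function names envelope, alert_gain and duck_gain, the linear ramps, and all default durations and factors are assumptions of this sketch; the description only states that the level is raised for about one or several seconds and preferably reverts gradually.

```python
def envelope(t, attack=0.2, hold=1.0, release=1.0):
    """Return a value between 0 and 1 describing the alert at time t (seconds).

    The value rises linearly during `attack`, stays at 1 during `hold`,
    and falls back linearly to 0 during `release`.
    """
    if t < 0:
        return 0.0
    if t < attack:
        return t / attack
    if t < attack + hold:
        return 1.0
    end = attack + hold + release
    if t < end:
        return (end - t) / release
    return 0.0


def alert_gain(t, boost=2.0):
    """Gain for the channel carrying the item of interest (temporarily raised)."""
    return 1.0 + (boost - 1.0) * envelope(t)


def duck_gain(t, depth=0.5):
    """Gain for the other rendered channels (temporarily lowered)."""
    return 1.0 - (1.0 - depth) * envelope(t)


if __name__ == "__main__":
    for t in (0.0, 0.1, 0.5, 1.7, 3.0):
        print(t, alert_gain(t), duck_gain(t))
```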
  • the level(s) may be adjusted by merely proportionally adjusting the volume, for example by multiplying the audio signal with a gain factor which can be larger or smaller than 1.
  • the second level adjustment means may be arranged for clipping and/or filtering audio signals contained in the channels, preferably using "intelligent" clipping and/or filtering techniques.
  • the audio signal level(s) may be compressed and/or limited (clipping) or may be adjusted in dependence of the particular frequencies of the signal (filtering). It will be understood that these techniques may be combined to achieve any desired level adjustment.
  • the various audio channels may be rendered by a single, common transducer, such as a loudspeaker.
  • the main channel and the at least one auxiliary channel are rendered by different transducers. This allows a spatial separation of the audio channels, thus making them easier to distinguish.
  • the main channel is rendered by a transducer which is centrally located with respect to the audio system of which it is part. This allows the main channel to be heard clearly and distinctly, in particular when the auxiliary channels are rendered by non-centrally located transducers, for example transducers located to the side(s) of an apparatus.
  • a particularly flexible embodiment of the device of the present invention is further provided with transducer selecting means for selecting one or more transducers which render the main channel and the auxiliary channel(s) respectively.
  • the present invention further provides a remote control unit for use with the device as defined above, the unit comprising selection interface components, such as buttons, for selecting the main channel.
  • the remote control unit of the present invention may advantageously further comprise a first sound level interface component, such as a toggle stick, for setting a ratio of sound levels of rendered channels.
  • the remote control unit may further comprise second sound level interface components, such as knobs, for manually adjusting the sound levels of rendered channels.
  • the present invention additionally provides an audio system, preferably an audio-visual system, comprising a device as defined above.
  • an audio system may suitably be constituted by a television set, a music center, or a home entertainment system (which may include a personal computer).
  • a remote control unit for use with such an audio system is discussed above.
  • the present invention also provides a method of controlling the sound levels of a group of audio channels comprising a main channel and at least one auxiliary channel which can be rendered simultaneously, the method comprising the steps of: - selecting, under user control, the main channel, and - automatically adjusting the sound level of the at least one auxiliary channel relative to the main channel.
  • the method of the present invention further comprises the step of adjusting, under user control, the sound level of the main channel, although this step is not essential and may be omitted in some embodiments.
  • the present invention further provides a computer program product for carrying out the method defined above.
  • Fig. 1 schematically shows a first embodiment of a device according to the present invention.
  • Fig. 2 schematically shows a second embodiment of a device according to the present invention.
  • Fig. 3 schematically shows a first embodiment of a level adjustment unit for use in the device of Figs. 1 and 2.
  • Fig. 4 schematically shows a second embodiment of a level adjustment unit for use in the device of Figs. 1 and 2.
  • Fig. 5 schematically shows a third embodiment of a level adjustment unit according to the present invention.
  • Fig. 6 schematically shows a remote control unit for use with the device of the present invention.
  • Fig. 7 schematically shows a home cinema system containing a device of the present invention.
  • the device 1 shown merely by way of non-limiting example in Fig. 1 comprises a first level adjustment unit 11, a second level adjustment unit 12 and a third level adjustment unit 13 arranged in parallel.
  • the three channels Ch1, Ch2, and Ch3 are coupled to the inputs of the level adjustment units 11, 12 and 13 via a switching unit 14.
  • the outputs of the level adjustment units 11, 12 and 13 are, in the exemplary embodiment shown, coupled to a signal addition unit 15 which, in turn, is coupled to a transducer (loudspeaker) 2 for rendering the audio signals of the three channels.
  • the channels Ch1, Ch2, and Ch3 may, for example, be constituted by multimedia channels containing both audio and video (sub-)channels.
  • the audio channels contain audio signals which are associated with respective video channels containing video signals that are to be rendered simultaneously.
  • the channels Ch1, Ch2, and Ch3 may comprise one or more radio channels.
  • the channels Ch1, Ch2, and Ch3 may be transmitted via radio, cable, telephone lines, or other communication means.
  • the switching unit 14, which is controlled by a selection signal Sel, connects one of the channels Ch1, Ch2, and Ch3 to each of the level adjustment units 11, 12 and 13.
  • the selection signal Sel, which is typically initiated by a user, selects one of the channels Ch1, Ch2, and Ch3.
  • the selected channel is fed to the first level adjustment unit 11, while the remaining channels, labeled first auxiliary channel AC1 and second auxiliary channel AC2, are fed to the second level adjustment unit 12 and the third level adjustment unit 13 respectively.
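  • the routing performed by the switching unit 14 can be illustrated by the following Python sketch. The function name route and the dictionary keys are hypothetical; the order in which the remaining channels become AC1 and AC2 is an assumption, as the description does not specify it.

```python
def route(channels, selected):
    """Map input channels to level adjustment units, as switching unit 14 does.

    The selected channel goes to unit 11 (main channel MC); the remaining
    channels go to units 12 and 13 (auxiliary channels AC1 and AC2).
    Assumption: remaining channels keep their original order.
    """
    if selected not in channels:
        raise ValueError(f"unknown channel: {selected}")
    auxiliaries = [ch for ch in channels if ch != selected]
    units = ["unit 12 (AC1)", "unit 13 (AC2)"]
    routing = {"unit 11 (MC)": selected}
    routing.update(zip(units, auxiliaries))
    return routing


if __name__ == "__main__":
    print(route(["Ch1", "Ch2", "Ch3"], "Ch2"))
```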
  • an additional selection signal may be used to select the rendered channels out of the available channels.
  • the adjustment units 11, 12 and 13 receive control signals for adjusting the signal levels.
  • the first level adjustment unit 11 receives a user control signal UC
  • the second and third adjustment units 12 and 13 receive control signals that are (identical to or derived from) the output signal of the first adjustment unit 11.
  • This output signal is the adjusted main channel (MC) signal.
  • the user control signal UC typically comprises a numerical value (or an equivalent signal) representing a gain setting, for example a value ranging from 1 to 20.
  • the main channel MC can be adjusted under user control, while the auxiliary channels AC1 and AC2 are adjusted in dependence of the main channel. In this way, a change in the sound level of the main channel MC may automatically result in changes in the sound levels of the auxiliary channels.
  • although the main channel may be exclusively user controlled, an embodiment can be envisaged in which the sound level of the main channel can be automatically adjusted in response to the sound level in the auxiliary channels, and/or in response to changes (for example the sound level, the sound type and/or the content) in the main channel itself.
  • the sound level of the main channel may therefore be adjusted either upwards (sound level increase) or downwards (sound level decrease) and need not be fixed or solely determined by the user. Accordingly, the main channel may also be primarily user controlled.
  • the level adjustment units 11, 12 and 13 will later be explained in more detail with reference to Fig. 3.
  • the selection signal Sel and the user (level) control signal UC may originate from a (remote) control unit which is typically present in a television set or similar apparatus.
  • a remote control unit, an example of which is schematically shown in Fig. 6, comprises means for selecting the main channel.
  • in the embodiment of Fig. 2, the device 1 of the present invention is designed for two channels. Each channel Ch1, Ch2 is directly coupled to a respective level adjustment unit 11, 12.
  • a selection unit 16 is provided which is controlled by the selection signal Sel.
  • in a first mode, the selection unit 16 feeds the user control signal UC to the first level adjustment unit 11 and feeds the output signal of the first level adjustment unit 11 as a control signal to the second level adjustment unit 12.
  • in a second mode, the selection unit 16 feeds the user control signal UC to the second level adjustment unit 12 and feeds the output signal of the second level adjustment unit 12 as a control signal to the first level adjustment unit 11.
  • the selection unit 16 switches between these two modes under control of the selection signal Sel.
  • each level adjustment unit 11, 12 is connected to an individual transducer (loudspeaker) 2, 3.
  • individual loudspeakers may also be employed in the embodiment of Fig. 1, in which case the signal addition unit 15 will be deleted.
  • a signal addition unit (15 in Fig. 1) may be used in the embodiment of Fig. 2.
  • An exemplary embodiment of a level adjustment unit is schematically depicted in Fig. 3.
  • the level adjustment unit 10 (which may correspond to any of the level adjustment units 11, 12 and 13 discussed above) comprises a controlled amplifier 17 and a level control unit 18.
  • the level control unit 18 receives the signal of the channel concerned and passes it on to the controlled amplifier 17 while measuring one or more characteristics of the signal, such as its level (amplitude).
  • the level control unit 18 also receives a control signal Cntl, for example from the selection unit 16 shown in Fig. 2, or from another level adjustment unit.
  • This control signal may for example be the user control signal UC produced by the user, the audio output signal of one of the other level adjustment units, the (preferably delayed) output signal of the level adjustment unit itself, and/or a combination thereof.
  • the level control unit 18 schematically illustrated in Fig. 3 may comprise suitable processing means for processing the control signal and the channel signal so as to produce a suitable amplifier control signal for the amplifier 17.
  • These processing means may advantageously comprise a microprocessor and an associated memory.
  • the memory may be used to store, among other things, pre-set and user adjusted sound levels, pre-set and user adjusted sound ratios, user preferences regarding content, and/or other information.
  • Various processing techniques may be used.
  • the sound levels of the auxiliary channels may be set to a certain percentage of the main channel, for example 20%, 30% or 40% (these percentages may also be calculated on the basis of the total sound level produced by all channels together, in that case the main channel may, for example, be allocated 80% and the auxiliary channels 20% of the total sound volume). These percentages may be pre-set in the factory and may be based upon statistical user listening tests. Such tests may indicate which percentages yield a good intelligibility and/or a suitable channel balance.
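  • both ways of applying such percentages are sketched below in Python. The function names are hypothetical, and the equal division of the auxiliary share over the auxiliary channels is an assumption of this sketch, not something the description states.

```python
def aux_levels_from_main(main_level, percent, n_aux):
    """Set every auxiliary level to a percentage of the main channel level."""
    return [main_level * percent / 100.0 for _ in range(n_aux)]


def split_total(total_level, main_share=0.8, n_aux=2):
    """Allocate shares of the total sound level.

    The main channel receives main_share of the total; the remainder is
    divided equally over the auxiliary channels (an assumption).
    """
    main = total_level * main_share
    each_aux = (total_level - main) / n_aux
    return main, [each_aux] * n_aux


if __name__ == "__main__":
    print(aux_levels_from_main(10.0, 30, 2))
    print(split_total(100.0))
```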
  • the respective levels may be based upon a calculation of the total signal volume or signal power.
  • the signal power of the main channel may be calculated for a certain (typically short) time period using well-known techniques such as integrating the square of the amplitude over said time period. The same calculation is carried out for the auxiliary channel(s). If a certain target ratio of the signal powers (or volumes: the integral of the absolute value of the signal) is given, the adjustment is carried out in such a way that the actual ratio becomes (approximately) equal to the target ratio.
  • the level control unit 18 may therefore be provided with a division unit for dividing the sound (signal) levels of the level control units and determining a percentage (ratio).
  • the level control unit 18 may further be provided with a comparison unit for comparing the calculated ratio with a predetermined (that is, pre-set or previously altered) ratio and deriving a compensation signal from any deviation.
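  • the power calculation, division and comparison described above might be sketched as follows in Python. The discrete sums approximate the integrals over a short time window; the function names and the sample spacing dt are assumptions. Because signal power scales with the square of the gain, the compensation gain is taken here as the square root of the ratio deviation, which is a design choice of this sketch rather than something stated in the description.

```python
import math


def signal_power(samples, dt=1.0):
    """Approximate the integral of the squared amplitude over the window."""
    return sum(s * s for s in samples) * dt


def signal_volume(samples, dt=1.0):
    """Approximate the integral of the absolute value of the signal."""
    return sum(abs(s) for s in samples) * dt


def compensation_gain(main, aux, target_ratio, dt=1.0):
    """Gain to apply to the auxiliary signal so that
    power(aux) / power(main) becomes approximately target_ratio."""
    p_main = signal_power(main, dt)
    p_aux = signal_power(aux, dt)
    if p_main == 0.0 or p_aux == 0.0:
        return 1.0  # nothing to compare against; leave the level unchanged
    actual_ratio = p_aux / p_main           # division unit
    deviation = target_ratio / actual_ratio  # comparison unit
    return math.sqrt(deviation)


if __name__ == "__main__":
    main = [1.0, -1.0, 1.0, -1.0]
    aux = [2.0, -2.0, 2.0, -2.0]
    gain = compensation_gain(main, aux, target_ratio=0.25)
    adjusted = [gain * s for s in aux]
    print(gain, signal_power(adjusted) / signal_power(main))
```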
  • the levels depend on the channel content and/or on the channel signal characteristics. With regard to content, different sound levels may be assigned to, for example, speech and music. A user will typically want to hear what is being said on the main channel and may accept a certain level of background music of an auxiliary channel. Conversely, when music is rendered on both the main channel and the auxiliary channel(s) and an auxiliary channel changes to speech, the user will typically want this auxiliary channel to be rendered louder in order to be able to hear and understand the speech.
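  • the content dependent behaviour described above can be expressed as a simple rule, sketched here in Python. The function name auxiliary_gain, the content labels and the gain values 0.3 and 0.8 are hypothetical; how the content type is detected is outside the scope of this sketch.

```python
def auxiliary_gain(main_type, aux_type, background=0.3, raised=0.8):
    """Choose a gain for an auxiliary channel from the detected content types.

    - Speech on the main channel with music on the auxiliary channel:
      the music is kept at a background level.
    - Music on the main channel and an auxiliary channel changing to speech:
      the auxiliary channel is rendered louder so the speech can be understood.
    All other combinations fall back to the background level (an assumption).
    """
    if main_type == "music" and aux_type == "speech":
        return raised
    return background


if __name__ == "__main__":
    print(auxiliary_gain("speech", "music"))
    print(auxiliary_gain("music", "music"))
    print(auxiliary_gain("music", "speech"))
```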
  • the present invention may advantageously provide automatic content type detection which may distinguish between, for example speech, music, noise and silence.
  • different types of music may be distinguished, for example classical music, hard rock, jazz, blues, etc.
  • Determining the audio content of a channel can be carried out in various ways.
  • Information (so called meta-information or meta-data) on the content may be available, for example the RDS (Radio Data System) information which may be broadcast together with radio signals and typically indicates the broadcasting station, the artist, and other information.
  • meta-data may also be transmitted via other communication channels, such as the Internet. If this information includes the type of music, it can be used to determine the type of the audio content and adapt the levels of the channels accordingly.
  • other examples of such meta-data are EPG (Electronic Program Guide) information and/or the so-called ID3 tag of MP3.
  • an indication of the audio content could be achieved using audio analysis such as speech detection and/or speech analysis.
  • Speech analysis could, furthermore, involve prosody analysis and/or key word recognition, so that the device of the present invention may adapt the channel levels to user preferences.
  • Other ways of determining content could be based upon the analysis of video or still images associated with the audio content, for example in the case of television.
  • level adjustment could be carried out on the basis of signal characteristics.
  • Signal analysis involving, for example, average signal amplitude and dominant frequencies (spectral analysis) can assist in automatically choosing a suitable level adjustment.
  • signal analysis may also assist in determining the content of the channels.
  • gain adjustment, that is, the signal of the channel is multiplied by a suitable gain factor (typically smaller than 1), resulting in the desired sound level.
  • the present invention is not so limited. More in particular, the sound levels of the various channels may be reduced or adjusted using other techniques, such as clipping, compression and filtering.
  • the clipping technique which is known per se, involves limiting the signal amplitude to a certain threshold level. Although this technique may introduce some signal distortion, it is very simple and effective.
  • any signal distortion may be significantly decreased by "soft clipping", that is, clipping in which the signal amplitude above the threshold value is (proportionally) reduced by multiplying the signal with a factor instead of "cutting off".
  • Another suitable technique which is known per se is filtering, which allows the signal amplitude to be reduced in dependence of the frequency. Using filtering, specific frequency ranges of the audio channels can be selectively reduced, instead of, or in addition to, adjusting the overall level of the channel. In this way it is possible to reduce the sound level in accordance with the sensitivity of the human ear: certain frequency ranges which cause more perceptual interference of simultaneously rendered sound (audio channels) could be reduced more than other frequency ranges.
  • the sound level as experienced by the human ear is not only determined by the actual signal power of the sound but also by psychological factors. This phenomenon can advantageously be used to provide "intelligent" sound level processing, as in the exemplary embodiment of Fig. 4.
  • the sound level adjustment unit 10 of Fig. 4 which may correspond to any of the units 11, 12 and 13 of Figs.
  • the level adjustment unit 10 receives type information from a type information unit (TPY) 130.
  • the first input signal of the unit 10 is the audio signal of an auxiliary channel AC1 or AC2, the perceptive sound level of which is to be reduced (it is noted that reducing the sound level of an auxiliary channel is substantially equivalent to increasing the sound level of the main channel as in both cases the relative sound level of the auxiliary channel is decreased).
  • the sound signal of the auxiliary channel AC1 or AC2 is temporally analyzed by the temporal analysis unit 102: the history of the signal is determined for a certain time period or "time slice", for example ranging from t1 to t2.
  • the temporal analysis unit 102 is arranged for activating the other components of the unit 10 only for certain time periods. If there is a peak in the sound level (as schematically depicted in Fig.
  • the audio signal may be compressed in its entirety, or for certain time slices only, by the compressor 104, thus imposing a compression relationship on the input levels to obtain the desired output levels.
  • the compressor may optionally be controlled by the output signal of parameter setting unit 124, which in turn is derived from the control signal Cntl.
  • the compressor 104 may use any suitable compression technique.
  • K increases as Sj increases, thus compressing high amplitude signals more than low amplitude signals. Compression techniques are well known in the art and the particular compression technique used is not essential to the present invention.
  • the audio signal may be filtered, for example using the multi-band splitter 106 and the relatively simple multi-band filter 112 which is arranged for filtering the signal per frequency band.
  • the multi-band filter 112 may be provided with amplifiers 113 for each frequency band.
  • the filter characteristics of the multi-band filter 112 may be fixed, however, they can also be dependent on type information provided by a type determining unit 130 arranged for determining the type of the audio signal, for example pop music or classical music. In this way, the bass of pop music may be reduced by adjusting the gain of the respective amplifier 113 for the low frequency bands.
  • a Fourier transform may be calculated by the first Fourier transform unit 108, followed by fixed or adaptive frequency domain filtering by the frequency domain filter 110, which is, in the embodiment shown, a signal dependent filter.
  • the filter 110 may be adapted in response to an adaptation signal which is derived from the control signal Cntl (see also Fig. 3), which may be the (level adjusted) audio signal of the main channel, as discussed above with reference to Figs. 1 and 2.
  • This adaptation signal is derived from the control signal Cntl using the second temporal analysis unit 120, the second fast Fourier transform unit 122 and the parameter setting unit 124.
  • the filter 110 may for example suppress the (higher) amplitudes of audio frequency components in the channel AC1 or AC2 in a typical frequency band or in a frequency band which is actually determined by measurements.
  • the output signal of the filter 110 is fed to an (optional) amplifier 115 which serves to compensate for any decrease in the (average) signal level caused by the filter 110.
  • the output signal of the amplifier 115 (or, if the amplifier 115 is not present, the output signal of the filter 110) is fed to the switch 118 to be selectively coupled to the output of the unit 10. It is noted that it may be advantageous to use a critical band filter 116 that has a filtering characteristic modeled in accordance with the human auditory system.
  • the audio signals may, for example, be split up in accordance with the well-known critical band theory, and the audio levels of the auxiliary channels (and/or main channel) are changed in dependence on their (mutual) interferences in each critical band.
  • the exemplary embodiment of a level adjustment module 15 schematically shown in Fig. 5 comprises an intelligibility improvement unit (INTIMP) 152, an envelope unit 154, a topic detector (TOPDET) 156, a prosody analyzer (PROS) 158 and a keyword detector (KEYWD) 160.
  • the keyword detector 160 receives relevant keywords from a keyword database (KEYDAT) 190 which may be external to the unit 10.
  • the level adjustment unit 15 may represent any of the level adjustment units 11, 12 and 13 shown in Figs. 1 and 2.
  • the unit 15 of Fig. 5 does not have a control input (Cntl in Fig. 3).
  • the unit 15 of Fig. 5 as shown is therefore suitable for embodiments of the device according to the present invention in which user control (UC in Fig. 1) is not required or is achieved by other means.
  • the control signal Cntl could be fed to the intelligibility improvement unit 152.
  • the control signal Cntl could be fed to the topic detector 156, the prosody analyzer 158 and a keyword detector 160, instead of or in addition to the sound signals of the channels MC, AC1 and/or AC2.
  • the unit 15 is also suitable as an additional level control unit, arranged in series with a unit 10 of Fig. 3 or 4, for improving the intelligibility of speech before or after adjusting its level.
  • the main channel MC and the auxiliary channels AC1 and AC2 may be level adjusted by the unit 15, resulting in level adjusted channels MC, AC1' and AC2'.
  • the audio signal contained in the channel (MC, AC1, AC2) of interest is improved by the intelligibility improvement module 152 by increasing the amplitudes of formants (as schematically illustrated in Fig. 5) or by other signal processing techniques known in the art.
  • An envelope unit 154 then adjusts the envelope E of the improved audio signal.
  • the envelope unit 154 is preferably arranged for temporarily changing (increasing or decreasing) the envelope of the audio signal.
  • the envelope unit 154 is provided with a controlled amplifier or equivalent means for adjusting the gain of the signal.
  • a preferred embodiment of the envelope unit 154 is arranged for increasing the sound level at certain moments of interest. These moments may be detected by a topic detector 156 which may be arranged to analyze the audio and video content of the channel and detect certain features, for example pauses in speech, pauses in motion, end of a video insertion, and reappearance of a central character (such as a news reader) by face detection and similar techniques.
  • a prosody analyzer 158 is provided for analyzing the prosody in the speech and enhancing the prosody by sending a suitable prosody enhancement signal to envelope unit 154. Keywords may also be detected using the keyword detector 160 and the associated keyword database 190.
  • the keyword database 190 is updated using, for example, EPG (Electronic Programming Guide) information summarizing the topics of television programs, and/or monitoring the interests of a user. If the rendering of a channel is suitably delayed, detected keywords may be rendered louder, thus alerting the user to these words.
  • the remote control unit 6 operates a television set and/or a home video system.
  • Channel change button (key) 63 allows the channels being rendered to be changed by the user. In a typical embodiment, depressing the channel change button 63 repeatedly will cause the displayed channels to "rotate", that is, to be displayed in succession. Depressing the selection button (key) 62 selects the current channel (NL1 in the example shown) as the main channel.
  • This selection of the main channel will generate a selection signal (Sel in Figs. 1 and 2).
  • a keypad (not shown) may be provided to enter a channel number of a first channel to be rendered; depressing the channel selection button 62 will select this rendered channel as the main channel and generate the selection signal, after which any further channel number entered in the keypad will display the (first) auxiliary channel (SBS6 in the example shown).
  • the manner in which two or more channels out of a plurality of channels are chosen is not essential to the present invention.
  • the sound level ratio of the main channel and the auxiliary channel(s) may be adjusted by the user.
  • the toggle stick 61, which essentially is a switch that can be moved from a central neutral position to either a left active position or a right active position, is arranged in such a way that the user can step through a number of ratios.
  • the (factory set or programmed) ratio of the sound levels is 70/30 (that is, main channel 70% of total sound level, auxiliary channel 30%)
  • moving the toggle stick to the left once may change the ratio into 80/20, and doing this twice may result in a ratio of 90/10.
  • moving the toggle stick to the right once may change the ratio from 70/30 into 60/40.
  • the remote control unit may be arranged such that the ratio cannot exceed a threshold value, for example 50/50.
  • the remote control unit and/or the level adjustment units of the present invention may advantageously be designed such that activating the toggle stick 61, or any equivalent sound level interface component, causes a temporary balance adjustment which lasts for a duration of, for example, approximately one second or several seconds, after which the sound levels revert to their previous values.
  • using the toggle stick 61 may provide an alternative way of selecting the main channel, producing a selection signal Sel when the ratio reaches 40/60, for example.
  • the toggle stick 61 is a useful but optional feature of the remote control unit 6. Each newly selected ratio may be displayed, for example on a screen of the remote control unit or on the screen of an associated television set.
  • an aural indication could be provided, for example an audible signal produced by a signal generator or a speech generator.
  • An alternative way of adjusting the ratio of the rendered channels is provided by (optional) auxiliary channel adjustment knobs 64 and 65. Rotating each of these knobs causes the level of the respective auxiliary channel to be adjusted.
  • This manual adjustment is in addition to the automatic adjustment provided by the present invention. Embodiments can be envisaged in which the automatic adjustment overrides the manual adjustment or vice versa.
  • the ratio adjustment assembly 66 which comprises four buttons 67. These buttons may serve to manually adjust sound levels (and/or sound level ratios) in the respective channels and to choose the channel to be adjusted.
  • the main channel selection button of a remote control unit is typically distinct from the usual channel selection buttons of a remote control which merely serve to select a channel to be rendered.
  • the main channel selection button (or its equivalent) determines which channel of the channels being rendered simultaneously is to be the main channel, that is the channel under direct user control, in contrast to the auxiliary channels the levels of which are automatically controlled relative to the main channel.
  • instead of using a remote control unit, other controls are possible.
  • the device of the present invention could, for example, be provided with a speech command interpreter.
  • the level adjusted audio channels can be rendered using a single, common transducer (such as a loudspeaker) or set of transducers reproducing summed signals, or using individual transducers or sets of transducers for one or more channels.
  • the main channel is preferably rendered using a separate transducer or set of transducers (it will be understood that a set of transducers may comprise, for example, a woofer and a tweeter, or other combinations of loudspeakers, resonators and/or other transducers).
  • the main channel is rendered using a centrally located transducer (or set of transducers), while the auxiliary channel(s) is/are rendered using laterally located transducers (or sets of transducers).
  • a television set 9 is provided with a centrally positioned loudspeaker 2 for rendering the main channel and four laterally positioned loudspeakers 3, 3', 4 and 4' for rendering the auxiliary channel(s).
  • the television set 9 is further provided with a device (1 in Figs. 1 and 2) according to the present invention.
  • the television set has a screen 8 that is divided in two parts which are schematically indicated I and II. Each part is assigned a channel comprising both audio and video.
  • the central loudspeaker 2 renders the sound of the main channel which is displayed in, for example, screen section I, while the lateral loudspeakers 3 and 4 render the sound of the auxiliary channel, the video of which is in this example rendered by screen section II.
  • the television set 9 shown in Fig. 7 is part of a home cinema system which further comprises a set-top box 7 and stand-alone loudspeaker units 3' and 4'.
  • the present invention is based upon the insight that the sound levels of several audio channels which can be rendered simultaneously should be controlled interdependently: adjusting the sound level of one channel may require the adjustment of one or more other channels.
  • the present invention benefits from the further insight that user control of multiple channels is facilitated if the user has to control the sound level of a single, main channel only, all other channels being controlled automatically in dependence of the main channel.
  • the term computer program product should be understood to include any physical realization, e.g. an article of manufacture, of a collection of commands enabling a processor -generic or special purpose-, after a series of loading steps to get the commands into the processor, to execute any of the characteristic functions of an invention.
  • the computer program product may be realized as program code, processor adapted code derived from this program code, or any intermediate translation of this program code, on a carrier such as e.g.
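Merely by way of non-limiting illustration, the toggle-stick ratio stepping described in the list above (a default ratio of 70/30, steps to 80/20 and 90/10 when moved left, a step to 60/40 when moved right, and a threshold of 50/50) may be sketched as follows. The class name `RatioControl`, the step size of 10 and the upper limit of 90 are assumptions made for this sketch and are not taken from the original text.

```python
class RatioControl:
    """Steps the main/auxiliary sound level ratio, as with toggle stick 61."""

    STEP = 10  # assumed step size, in percent of the total sound level

    def __init__(self, main_share=70, min_main=50, max_main=90):
        self.main_share = main_share  # default 70/30 ratio
        self.min_main = min_main      # threshold: ratio cannot pass 50/50
        self.max_main = max_main      # assumed upper limit (90/10)

    def ratio(self):
        """Return (main, auxiliary) shares in percent."""
        return (self.main_share, 100 - self.main_share)

    def left(self):
        """Toggle left: favour the main channel by one step."""
        self.main_share = min(self.max_main, self.main_share + self.STEP)
        return self.ratio()

    def right(self):
        """Toggle right: favour the auxiliary channel by one step."""
        self.main_share = max(self.min_main, self.main_share - self.STEP)
        return self.ratio()
```

In this sketch, further movements of the toggle stick beyond either limit simply leave the ratio unchanged.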

Abstract

A device (1) is arranged for controlling the sound levels of a group of audio channels including a user selected main channel (MC) and at least one auxiliary channel (AC1; AC2). The audio channels can be rendered simultaneously. The device comprises automatic level adjustment means (12, 13) for adjusting the sound level of the at least one auxiliary channel relative to the main channel. The level adjustment means (12, 13) may be arranged for adapting the respective sound levels to the content or signal characteristics of each associated audio channel.

Description

Audio level control
The present invention relates to controlling multiple audio levels. More in particular, the present invention relates to a device for controlling the sound levels of a group of audio channels which can be rendered simultaneously. In modern communication devices, such as television sets, it is often possible to render two or more audio channels simultaneously. A television set may, for example, be able to provide a "split-screen" arrangement in which the television screen is divided into two or more sections, each section displaying a different video channel. The corresponding audio channels may be rendered using different loudspeakers. The sound level of these audio channels must be controlled in such a way that the viewer is able to listen to one or more channels, changing the sound level of the channels when a commercial break starts or when a particularly interesting topic or video item is announced. United States Patent US 6,590,618 discloses a method and apparatus for changing a channel or varying a volume (sound) level of a television receiver having both a normal screen mode function and a multiple screen mode function. A remote control unit has a separate set of sound level keys for each of the multiple screens. Although the screen which would be shown in single screen mode is labeled "main picture", the sound levels associated with the multiple screens are independently controlled by the user. When the user wants to listen more closely to one of the multiple channels, (s)he has to both increase the sound level of that channel and/or reduce the sound level of the at least one other channel manually, using the separate control keys. It will be clear that this is impractical.
It is an object of the present invention to overcome these and other problems of the Prior Art and to provide a device for controlling the sound levels of a group of audio channels which is easier to use and facilitates the user control task. It is another object of the present invention to provide an audio system comprising such a device for controlling the sound levels of a group of audio channels. Accordingly, the present invention provides a device for controlling the sound levels of a group of audio channels comprising a main channel and at least one auxiliary channel which can be rendered simultaneously, the device comprising:
- user controlled selection means for selecting the main channel, and
- automatic level adjustment means for adjusting the sound level of the at least one auxiliary channel relative to the main channel.
By providing, in accordance with the present invention, automatic level adjustment means for adjusting the sound level of the at least one auxiliary channel relative to the sound level of the main channel, there is no need for the user to control the sound level of the auxiliary channel(s). In typical embodiments, the device of the present invention comprises user controlled level adjustment means, hereinafter called first level adjustment means, for adjusting the sound level of the main channel. As the automatic level adjustment means, hereinafter called second level adjustment means, are arranged for adjusting the auxiliary channel(s), the user has to control only a single channel, the main channel, in order to obtain a suitable overall sound level. This reduces both the amount of effort required by the user and the number of required keys on the (remote) control unit. In addition, the respective sound levels of the channels may be weighted and/or mutually adjusted using various suitable techniques. In this way the interference of the various audio channels as experienced by the user may be significantly reduced. It is noted that the user controlled (first) level adjustment means mentioned above are not essential and that embodiments of the device of the present invention can be envisaged in which the sound level of the main channel is fixed, or is controlled by level control means external to said device. Although the (second) level adjustment means for adjusting the sound level of the auxiliary channel(s) are specifically referred to as being automatic, the user controlled (first) level control means may in certain embodiments also provide automatic level adjustment in addition to user adjustment. It is further noted that the audio channels mentioned above may be part of communication channels containing audio (sound), video (moving images), pictures (still images), text, and/or other content items.
The present invention is particularly suitable for television (combined video and audio channels) but is not so limited and may also be applied in systems providing audio only. The user controlled selection means allow a user to select one rendered (for example shown and/or played) channel as the main channel; all other rendered channels are designated auxiliary channels. The user will typically select as the main channel the channel which (s)he finds the most interesting to listen to. In a preferred embodiment the selection means are arranged for selecting successive available channels in response to user input. This allows the user to step through a succession of available channels using only a single (hardware or software) button or key. Alternatively, or additionally, a plurality of buttons could be provided, one for each channel. The first level adjustment means may be controlled by conventional sound level adjustment elements such as "volume up" and "volume down" buttons on a (remote) control unit. The second level adjustment means are automatic in that they do not necessarily require user control but adjust the level(s) of the auxiliary channel(s) in response to, for example, changes in the sound level of the main channel or an auxiliary channel. Although a virtually infinite number of different level(s) of the auxiliary channel(s) could be provided, it is preferred that the second level adjustment means provide a plurality of pre-set relative sound levels. In this way it is possible to quickly and conveniently step through a number of levels. These pre-set levels may differ in absolute and/or relative terms, where relative is multiplicative with respect to the level of the main channel. The said plurality of pre-set levels may be per channel and/or per user. In the latter case, each user may be provided with an individual series of pre-set levels.
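Merely by way of non-limiting illustration, stepping through a plurality of pre-set relative sound levels, where relative is multiplicative with respect to the level of the main channel, may be sketched as follows. The pre-set values and the names `PRESETS` and `AuxiliaryLevel` are assumptions made for this sketch.

```python
from dataclasses import dataclass

# Hypothetical pre-set relative levels: auxiliary level as a fraction of the
# main channel level. The values are illustrative, not taken from the text.
PRESETS = (0.1, 0.25, 0.5)


@dataclass
class AuxiliaryLevel:
    presets: tuple = PRESETS
    index: int = 1  # start at the middle pre-set

    def step(self) -> float:
        """Advance to the next pre-set, wrapping around at the end."""
        self.index = (self.index + 1) % len(self.presets)
        return self.presets[self.index]

    def level(self, main_level: float) -> float:
        """Relative pre-set: multiplicative with respect to the main level."""
        return main_level * self.presets[self.index]
```

A per-user or per-channel series of pre-set levels could be obtained in this sketch by constructing one `AuxiliaryLevel` instance per user or per channel, each with its own `presets` tuple.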
The pre-set levels are preferably factory-set, but in an advantageous embodiment the pre-set relative sound levels may be altered by the user. This allows the sound levels of the auxiliary channels to be adapted to the user's preferences and/or hearing. In a particularly advantageous embodiment, the second level adjustment means are arranged for adapting the respective sound levels to the content of each associated audio channel and/or to the sound source. That is, different levels or sets of levels may be applied, depending on whether the channel renders music, speech, or other audio content, and whether the sound source is cable, antenna, VCR (Video Cassette Recorder), DVD (Digital Video Disc) player, or any other source. Speech could be played louder than music, or vice versa, and sound originating from a cable source could be amplified while sound originating from a VCR could be attenuated. A detector could be provided for detecting sound characteristics of the audio content, and/or changes in associated video content, for example by motion or color analysis. Such detectors are known per se; an exemplary speech detector is disclosed in United States Patent US 5,878,391. In particularly advantageous further embodiments, different levels or sets of levels are used for different types of music, such as classical, pop, folk, and hard rock. In a particularly advantageous embodiment of the device of the present invention, the level adjustment means are arranged for adapting the respective sound levels to user preferences regarding the content of the channels. That is, user preferences with respect to content (movies, news items, commercials, etc.) may be stored and used to choose desired sound levels when such content is rendered. It is possible to extract information from the channels indicating the type of music being provided by the channel, whereby the second level adjustment means can automatically set the corresponding level(s).
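Merely by way of non-limiting illustration, choosing an auxiliary channel level in dependence on the content type, the sound source and stored user preferences may be sketched as a simple table lookup. The gain values, the table names and the function `auxiliary_gain` are assumptions made for this sketch; the content type is assumed to be supplied by a detector or by meta-data.

```python
# Hypothetical gain tables; the values are illustrative only.
CONTENT_GAIN = {"speech": 1.0, "music": 0.5, "noise": 0.2, "silence": 0.0}
SOURCE_GAIN = {"cable": 1.2, "antenna": 1.0, "vcr": 0.8, "dvd": 1.0}


def auxiliary_gain(content, source, user_pref=None):
    """Return a gain factor for an auxiliary channel.

    content   -- detected or meta-data content type, e.g. "speech"
    source    -- sound source, e.g. "cable" or "vcr"
    user_pref -- optional dict of stored per-content user multipliers
    """
    # Unknown content or source types fall back to neutral assumed defaults.
    gain = CONTENT_GAIN.get(content, 0.5) * SOURCE_GAIN.get(source, 1.0)
    if user_pref:
        gain *= user_pref.get(content, 1.0)
    return gain
```

Here speech is rendered louder than music and a cable source is amplified while a VCR source is attenuated; the opposite choices would be equally consistent with the text.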
Such information could be provided by the channels as meta-data, that is data describing the content of the channels, or could be derived from the content itself, using a suitable detector as mentioned above. In a particularly advantageous embodiment, the second level adjustment means are arranged for adapting the respective sound levels to the signal characteristics of each associated audio channel. That is, the second level adjustment means of this embodiment are responsive to the signal characteristics and adjust the signal level(s) accordingly. In a particularly advantageous embodiment, for example, the second level adjustment means are arranged for speech detection. More in particular, the second level adjustment means may further be arranged for formant detection, prosody detection and/or keyword detection. This allows intelligent software to change the sound level when a news item or movie begins, for example. It is noted that the device of the present invention preferably takes the signal characteristics of all channels into account, including the main channel, and adjusts the level(s) of the auxiliary channel(s) in response thereto. In a particularly advantageous embodiment, the level adjustment means are arranged for temporarily adjusting the sound level of a channel in response to the content and/or signal characteristics of at least one channel. That is, the sound level may be raised for a duration of approximately one second or several seconds to alert the user to a particular content item, for example an announcement containing a certain key word, or a particular type of signal, such as speech. It is preferred that the raised sound level gradually reverts to its original state. In some embodiments both the increase and the decrease of the sound level are gradual. 
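Merely by way of non-limiting illustration, a temporary raise of the sound level which gradually reverts to its original state may be sketched as a time-dependent gain multiplier. The function name `temporary_boost`, the linear release and the default durations and boost factor are assumptions made for this sketch.

```python
def temporary_boost(t, start, hold=1.0, release=1.0, boost=2.0):
    """Gain multiplier at time t (seconds) for a boost triggered at `start`.

    The level is held at `boost` for `hold` seconds and then reverts
    linearly to 1.0 over `release` seconds.
    """
    dt = t - start
    if dt < 0.0:
        return 1.0                       # before the content item of interest
    if dt <= hold:
        return boost                     # raised level
    if dt < hold + release:
        frac = (dt - hold) / release     # 0.0 at end of hold, 1.0 at the end
        return boost + (1.0 - boost) * frac
    return 1.0                           # original level restored
```

Calling the same function with a `boost` value smaller than 1 yields a temporary lowering, which in this sketch could be applied to the other rendered channels during the same time duration.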
While the sound level of the channel containing the content item of interest may be temporarily raised, the sound levels of the other channel(s) being rendered may be temporarily lowered during the same time duration so as to make the content item concerned more audible. In any of the embodiments mentioned above, the level(s) may be adjusted by merely proportionally adjusting the volume, for example by multiplying the audio signal with a gain factor which can be larger or smaller than 1. Additionally, or alternatively, the second level adjustment means may be arranged for clipping and/or filtering audio signals contained in the channels, preferably using "intelligent" clipping and/or filtering techniques. The audio signal level(s) may be compressed and/or limited (clipping) or may be adjusted in dependence of the particular frequencies of the signal (filtering). It will be understood that these techniques may be combined to achieve any desired level adjustment. The various audio channels may be rendered by a single, common transducer, such as a loudspeaker. It is preferred, however, that the main channel and the at least one auxiliary channel are rendered by different transducers. This allows a spatial separation of the audio channels, thus making them easier to distinguish. In a preferred embodiment, the main channel is rendered by a transducer which is centrally located with respect to the audio system of which it is part. This allows the main channel to be heard clearly and distinctly, in particular when the auxiliary channels are rendered by non-centrally located transducers, for example transducers located to the side(s) of an apparatus. A particularly flexible embodiment of the device of the present invention is further provided with transducer selecting means for selecting one or more transducers which render the main channel and the auxiliary channel(s) respectively. 
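Merely by way of non-limiting illustration, the proportional gain adjustment and the clipping techniques mentioned above may be sketched as follows. The function names are assumptions made for this sketch, and `soft_clip` shows only one possible reading of "soft clipping", in which the part of the amplitude above the threshold is multiplied by a factor smaller than 1 instead of being cut off.

```python
def apply_gain(samples, gain):
    """Multiply the audio signal by a gain factor (larger or smaller than 1)."""
    return [gain * s for s in samples]


def hard_clip(samples, threshold):
    """Limit the signal amplitude to +/- threshold."""
    return [max(-threshold, min(threshold, s)) for s in samples]


def soft_clip(samples, threshold, factor=0.5):
    """Reduce, rather than cut off, the amplitude above the threshold."""
    out = []
    for s in samples:
        excess = abs(s) - threshold
        if excess > 0.0:
            magnitude = threshold + factor * excess
            s = magnitude if s > 0 else -magnitude
        out.append(s)
    return out
```

These functions may be chained, for example `soft_clip(apply_gain(x, 0.5), 1.0)`, which corresponds to the combination of techniques referred to in the text.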
It will be clear to those skilled in the art that the above features of the device according to the present invention may be present in isolation or in combination. More in particular, any of the features discussed above may be provided in combination with one or more of the other features. The present invention further provides a remote control unit for use with the device as defined above, the unit comprising selection interface components, such as buttons, for selecting the main channel. The remote control unit of the present invention may advantageously further comprise a first sound level interface component, such as a toggle stick, for setting a ratio of sound levels of rendered channels. Alternatively, or additionally, the remote control unit may further comprise second sound level interface components, such as knobs, for manually adjusting the sound levels of rendered channels. The present invention additionally provides an audio system, preferably an audio-visual system, comprising a device as defined above. Such an audio system may suitably be constituted by a television set, a music center, or a home entertainment system (which may include a personal computer). A remote control unit for use with such an audio system is discussed above. The present invention also provides a method of controlling the sound levels of a group of audio channels comprising a main channel and at least one auxiliary channel which can be rendered simultaneously, the method comprising the steps of:
- selecting, under user control, the main channel, and
- automatically adjusting the sound level of the at least one auxiliary channel relative to the main channel.
Typically, the method of the present invention further comprises the step of adjusting, under user control, the sound level of the main channel, although this step is not essential and may be omitted in some embodiments.
The present invention further provides a computer program product for carrying out the method defined above.
The present invention will further be explained below with reference to exemplary embodiments illustrated in the accompanying drawings, in which:
Fig. 1 schematically shows a first embodiment of a device according to the present invention.
Fig. 2 schematically shows a second embodiment of a device according to the present invention.
Fig. 3 schematically shows a first embodiment of a level adjustment unit for use in the device of Figs. 1 and 2.
Fig. 4 schematically shows a second embodiment of a level adjustment unit for use in the device of Figs. 1 and 2.
Fig. 5 schematically shows a third embodiment of a level adjustment unit according to the present invention.
Fig. 6 schematically shows a remote control unit for use with the device of the present invention.
Fig. 7 schematically shows a home cinema system containing a device of the present invention.
The device 1 shown merely by way of non-limiting example in Fig. 1 comprises a first level adjustment unit 11, a second level adjustment unit 12 and a third level adjustment unit 13 arranged in parallel. The three channels Ch1, Ch2, and Ch3 are coupled to the inputs of the level adjustment units 11, 12 and 13 via a switching unit 14. The outputs of the level adjustment units 11, 12 and 13 are, in the exemplary embodiment shown, coupled to a signal addition unit 15 which, in turn, is coupled to a transducer (loudspeaker) 2 for rendering the audio signals of the three channels. The channels Ch1, Ch2, and Ch3 may, for example, be constituted by multimedia channels containing both audio and video (sub-)channels. The audio channels contain audio signals which are associated with respective video channels containing video signals that are to be rendered simultaneously. Alternatively, or additionally, the channels Ch1, Ch2, and Ch3 may comprise one or more radio channels. The channels Ch1, Ch2, and Ch3 may be transmitted via radio, cable, telephone lines, or other communication means.
The switching unit 14, which is controlled by a selection signal Sel, connects one of the channels Ch1, Ch2, and Ch3 to each of the level adjustment units 11, 12 and 13. The selection signal Sel, which is typically initiated by a user, selects one of the channels Ch1, Ch2, and Ch3. The selected channel, labeled main channel MC, is fed to the first level adjustment unit 11, while the remaining channels, labeled first auxiliary channel AC1 and second auxiliary channel AC2, are fed to the second level adjustment unit 12 and the third level adjustment unit 13 respectively. Instead of the three channels shown, four or more channels may be present, even when only three adjustment units are provided. In some embodiments, therefore, the number of channels may exceed the number of adjustment units. In such embodiments, an additional selection signal may be used to select the rendered channels out of the available channels. The adjustment units 11, 12 and 13 receive control signals for adjusting the signal levels. The first level adjustment unit 11 receives a user control signal UC, while the second and third adjustment units 12 and 13 receive control signals that are (identical to or derived from) the output signal of the first adjustment unit 11. This output signal is the adjusted main channel (MC) signal. The user control signal UC typically comprises a numerical value (or an equivalent signal) representing a gain setting, for example a value ranging from 1 to 20. As can be seen, the main channel MC can be adjusted under user control, while the auxiliary channels AC1 and AC2 are adjusted in dependence of the main channel. In this way, a change in the sound level of the main channel MC may automatically result in changes in the sound levels of the auxiliary channels.
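By way of non-limiting illustration, the role assignment performed by the switching unit 14 may be sketched in Python as follows. The function names `assign_roles` and `auxiliary_levels`, and the parameter `aux_fraction`, are hypothetical and do not appear in the description; the sketch merely assumes that the selected channel becomes MC, that the remaining channels become AC1, AC2, ... in their original order, and that each auxiliary level is a fixed fraction of the main level.

```python
def assign_roles(channel_names, sel):
    """Map the selected channel to MC and the remaining channels to AC1, AC2, ...

    channel_names: list of available channel labels, e.g. ["Ch1", "Ch2", "Ch3"].
    sel: the label chosen by the selection signal Sel.
    """
    if sel not in channel_names:
        raise ValueError(f"unknown channel: {sel}")
    roles = {"MC": sel}
    # Remaining channels keep their original order (an assumption of this sketch).
    remaining = [name for name in channel_names if name != sel]
    for index, name in enumerate(remaining, start=1):
        roles[f"AC{index}"] = name
    return roles


def auxiliary_levels(main_level, aux_fraction, aux_count):
    """Derive each auxiliary level from the main level (hypothetical fixed fraction)."""
    return [main_level * aux_fraction] * aux_count


roles = assign_roles(["Ch1", "Ch2", "Ch3"], "Ch2")
levels = auxiliary_levels(10.0, 0.5, 2)
```

In this sketch a change of `main_level` automatically changes every auxiliary level, which mirrors the dependence of AC1 and AC2 on the adjusted main channel.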
Although the main channel may be exclusively user controlled, an embodiment can be envisaged in which the sound level of the main channel can be automatically adjusted in response to the sound level in the auxiliary channels, and/or in response to changes (for example the sound level, the sound type and/or the content) in the main channel itself. The sound level of the main channel may therefore be adjusted either upwards (sound level increase) or downwards (sound level decrease) and need not be fixed or solely determined by the user. Accordingly, the main channel may also be primarily user controlled. The level adjustment units 11, 12 and 13 will later be explained in more detail with reference to Fig. 3. The selection signal Sel and the user (level) control signal UC may originate from a (remote) control unit which is typically present in a television set or similar apparatus. In accordance with the present invention, such a remote control unit, an example of which is schematically shown in Fig. 6, comprises means for selecting the main channel. In the exemplary embodiment of Fig. 2, the device 1 of the present invention is designed for two channels. Each channel Ch1, Ch2 is directly coupled to a respective level adjustment unit 11, 12. In contrast to Fig. 1, therefore, none of the level adjustment units shown is specifically dedicated to the main channel MC and either level adjustment unit can be the first level adjustment unit 11 or the second level adjustment unit 12. For this purpose, a selection unit 16 is provided which is controlled by the selection signal Sel. In a first mode, the selection unit 16 feeds the user control signal UC to the first level adjustment unit 11 and feeds the output signal of the first level adjustment unit 11 as a control signal to the second level adjustment unit 12.
In a second mode, the selection unit 16 feeds the user control signal UC to the second level adjustment unit 12 and feeds the output signal of the second level adjustment unit 12 as a control signal to the first level adjustment unit 11. The selection unit 16 switches between these two modes under control of the user control signal UC. As shown in Fig. 2, each level adjustment unit 11, 12 is connected to an individual transducer (loudspeaker) 2, 3. It will be understood that individual loudspeakers may also be employed in the embodiment of Fig. 1, in which case the signal addition unit 15 will be deleted. It is also possible to use one transducer (or set of transducers) for the main channel MC and another transducer (or set of transducers) for the combined auxiliary channels. Alternatively, a signal addition unit (15 in Fig. 1) may be used in the embodiment of Fig. 2. An exemplary embodiment of a level adjustment unit is schematically depicted in Fig. 3. The level adjustment unit 10 (which may correspond to any of the level adjustment units 11, 12 and 13 discussed above) comprises a controlled amplifier 17 and a level control unit 18. The level control unit 18 receives the signal of the channel concerned and passes it on to the controlled amplifier 17 while measuring one or more characteristics of the signal, such as its level (amplitude). The level control unit 18 also receives a control signal Cntl, for example from the selection unit 16 shown in Fig. 2, or from another level adjustment unit. This control signal may for example be the user control signal UC produced by the user, the audio output signal of one of the other level adjustment units, the (preferably delayed) output signal of the level adjustment unit itself, and/or a combination thereof. The level control unit 18 schematically illustrated in Fig. 
3 may comprise suitable processing means for processing the control signal and the channel signal so as to produce a suitable amplifier control signal for the amplifier 17. These processing means may advantageously comprise a microprocessor and an associated memory. The memory may be used to store, among other things, pre-set and user adjusted sound levels, pre-set and user adjusted sound ratios, user preferences regarding content, and/or other information. Various processing techniques may be used. The sound levels of the auxiliary channels may be set to a certain percentage of the main channel, for example 20%, 30% or 40% (these percentages may also be calculated on the basis of the total sound level produced by all channels together, in that case the main channel may, for example, be allocated 80% and the auxiliary channels 20% of the total sound volume). These percentages may be pre-set in the factory and may be based upon statistical user listening tests. Such tests may indicate which percentages yield a good intelligibility and/or a suitable channel balance. The respective levels may be based upon a calculation of the total signal volume or signal power. For example, the signal power of the main channel may be calculated for a certain (typically short) time period using well-known techniques such as integrating the square of the amplitude over said time period. The same calculation is carried out for the auxiliary channel(s). If a certain target ratio of the signal powers (or volumes: the integral of the absolute value of the signal) is given, the adjustment is carried out in such a way that the actual ratio becomes (approximately) equal to the target ratio. The level control unit 18 may therefore be provided with a division unit for dividing the sound (signal) levels of the level control units and determining a percentage (ratio). 
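By way of non-limiting illustration, the power-ratio adjustment described above may be sketched in Python as follows. The function names `signal_power` and `auxiliary_gain` are hypothetical, the signals are assumed to be short lists of samples, and the integral of the squared amplitude is approximated here by the mean of the squared samples (an assumption, so that time slices of different length can be compared).

```python
import math


def signal_power(samples):
    """Approximate signal power as the mean of the squared sample amplitudes."""
    return sum(s * s for s in samples) / len(samples)


def auxiliary_gain(main, aux, target_ratio):
    """Return the amplitude gain for the auxiliary channel such that
    power(aux * gain) / power(main) equals target_ratio.
    """
    p_main = signal_power(main)
    p_aux = signal_power(aux)
    if p_aux == 0.0:
        # A silent auxiliary channel needs no adjustment.
        return 1.0
    # Power scales with the square of the amplitude gain, hence the square root.
    return math.sqrt(target_ratio * p_main / p_aux)


main = [1.0, -1.0, 1.0, -1.0]
aux = [2.0, -2.0, 2.0, -2.0]
gain = auxiliary_gain(main, aux, target_ratio=0.25)
adjusted_aux = [gain * s for s in aux]
```

The comparison unit mentioned in the description would, in such a sketch, correspond to recomputing the actual ratio after adjustment and correcting any remaining deviation from the target ratio.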
The level control unit 18 may further be provided with a comparison unit for comparing the calculated ratio with a predetermined (that is, pre-set or previously altered) ratio and deriving a compensation signal from any deviation. In a particularly advantageous embodiment, the levels depend on the channel content and/or on the channel signal characteristics. With regard to content, different sound levels may be assigned to, for example, speech and music. A user will typically want to hear what is being said on the main channel and may accept a certain level of background music of an auxiliary channel. Conversely, when music is rendered on both the main channel and the auxiliary channel(s) and an auxiliary channel changes to speech, the user will typically want this auxiliary channel to be rendered louder in order to be able to hear and understand the speech. Accordingly, the present invention may advantageously provide automatic content type detection which may distinguish between, for example, speech, music, noise and silence. In addition, different types of music may be distinguished, for example classical music, hard rock, jazz, blues, etc. Determining the audio content of a channel can be carried out in various ways. Information (so called meta-information or meta-data) on the content may be available, for example the RDS (Radio Data System) information which may be broadcast together with radio signals and typically indicates the broadcasting station, the artist, and other information. Such meta-data may also be transmitted via other communication channels, such as the Internet. If this information includes the type of music, it can be used to determine the type of the audio content and adapt the levels of the channels accordingly. Other suitable information that may be used is EPG (Electronic Program Guide) information and/or the so-called ID3 tag of MP3.
Alternatively, or additionally, an indication of the audio content could be achieved using audio analysis such as speech detection and/or speech analysis. Speech analysis could, furthermore, involve prosody analysis and/or key word recognition, so that the device of the present invention may adapt the channel levels to user preferences. Other ways of determining content could be based upon the analysis of video or still images associated with the audio content, for example in the case of television. Instead of, or in addition to (amplitude) level adjustment based upon content, level adjustment could be carried out on the basis of signal characteristics. Signal analysis involving, for example, average signal amplitude and dominant frequencies (spectral analysis) can assist in automatically choosing a suitable level adjustment. It will be understood that signal analysis may also assist in determining the content of the channels. In the above discussion it has been assumed that the sound level adjustment of the various channels involved gain adjustment, that is, the signal of the channel is multiplied by a suitable gain factor (typically smaller than 1), resulting in the desired sound level. Although this is a very suitable technique, the present invention is not so limited. More in particular, the sound levels of the various channels may be reduced or adjusted using other techniques, such as clipping, compression and filtering. The clipping technique, which is known per se, involves limiting the signal amplitude to a certain threshold level. Although this technique may introduce some signal distortion, it is very simple and effective. Any signal distortion may be significantly decreased by "soft clipping", that is, clipping in which the signal amplitude above the threshold value is (proportionally) reduced by multiplying the signal by a factor instead of "cutting off".
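By way of non-limiting illustration, the difference between clipping and "soft clipping" may be sketched in Python as follows. The function names `hard_clip` and `soft_clip` and the parameter `factor` are hypothetical. The sketch assumes one possible reading of the description, namely that only the part of the amplitude exceeding the threshold is multiplied by a factor between 0 and 1; other soft clipping curves are equally conceivable.

```python
import math


def hard_clip(sample, threshold):
    """Limit the amplitude to +/- threshold by cutting off the excess."""
    return max(-threshold, min(threshold, sample))


def soft_clip(sample, threshold, factor):
    """Scale only the part of the amplitude above the threshold by `factor`.

    factor = 0 reduces to hard clipping; factor = 1 leaves the signal unchanged.
    """
    magnitude = abs(sample)
    if magnitude <= threshold:
        return sample
    reduced = threshold + (magnitude - threshold) * factor
    # Restore the sign of the original sample.
    return math.copysign(reduced, sample)


signal = [0.5, 1.5, -2.0]
hard = [hard_clip(s, 1.0) for s in signal]
soft = [soft_clip(s, 1.0, 0.5) for s in signal]
```

Because the soft clipped signal retains a (reduced) slope above the threshold, peaks are attenuated without the flat tops that cause most of the audible distortion of hard clipping.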
Another suitable technique which is known per se is filtering, which allows the signal amplitude to be reduced in dependence of the frequency. Using filtering, specific frequency ranges of the audio channels can be selectively reduced, instead of, or in addition to, adjusting the overall level of the channel. In this way it is possible to reduce the sound level in accordance with the sensitivity of the human ear: certain frequency ranges which cause more perceptual interference of simultaneously rendered sound (audio channels) could be reduced more than other frequency ranges. The sound level as experienced by the human ear is not only determined by the actual signal power of the sound but also by psychological factors. This phenomenon can advantageously be used to provide "intelligent" sound level processing, as in the exemplary embodiment of Fig. 4. The sound level adjustment unit 10 of Fig. 4, which may correspond to any of the units 11, 12 and 13 of Figs. 1 and 2, comprises in the exemplary embodiment shown a first temporal analysis (TEMP AN) module 102, a compressor (COMP) 104, a multi-band splitter (MULTIBAN) 106, a first fast Fourier transform (FFT) unit 108, a signal dependent filter (SDF) 110, a multi-band filter (MBFIL) 112, an (optional) controlled amplifier 115, a critical band filter (CRITBF) 116, a switch 118, a second temporal analysis module (TEMP AN) 120, a second fast Fourier transform (FFT) unit 122, and a parameter setting (PARS) unit 124. The level adjustment unit 10 receives type information from a type information unit (TPY) 130. The first input signal of the unit 10 is the audio signal of an auxiliary channel AC1 or AC2, the perceptive sound level of which is to be reduced (it is noted that reducing the sound level of an auxiliary channel is substantially equivalent to increasing the sound level of the main channel as in both cases the relative sound level of the auxiliary channel is decreased).
The sound signal of the auxiliary channel AC1 or AC2 is temporally analyzed by the temporal analysis unit 102: the history of the signal is determined for a certain time period or "time slice", for example ranging from t1 to t2. The temporal analysis unit 102 is arranged for activating the other components of the unit 10 only for certain time periods. If there is a peak in the sound level (as schematically depicted in Fig. 4) between t1 and t2, it may be advantageous to process the audio signal only for this time slice, leaving the remainder of the signal level unchanged. This may typically occur during a commercial break in a television program, in which case so-called commercial detection by other means may help in automatically selecting the lowest of a set of user ratio preferences, for example a ratio of 100/0 (that is, main channel 100%, auxiliary channel 0%). Subsequently, the audio signal may be compressed in its entirety, or for certain time slices only, by the compressor 104, thus imposing a compression relationship on the input levels to obtain the desired output levels. The compressor may optionally be controlled by the output signal of parameter setting unit 124, which in turn is derived from the control signal Cntl. The compressor 104 may use any suitable compression technique. In the example shown, a compression function is used which may be defined mathematically as: $S_o = S_i / K(S_i) + B(S_i)$, where $S_i$ is the input signal of the compressor 104 (in the present example, the audio signal of an auxiliary channel AC1 or AC2), $S_o$ is the output signal of the compressor 104, $K$ is a scaling factor which may depend on the input signal $S_i$ and $B$ is an addition factor which may also depend on the input signal $S_i$. Typically, $K$ increases as $S_i$ increases, thus compressing high amplitude signals more than low amplitude signals. Compression techniques are well known in the art and the particular compression technique used is not essential to the present invention.
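By way of non-limiting illustration, a compression function of the form "output equals input divided by K plus B" may be sketched in Python as follows. The description does not specify K or B; the particular choices below (K growing linearly with the absolute input amplitude, B equal to zero) and the names `compress`, `default_k` and `default_b` are assumptions made purely for the purpose of this example.

```python
def default_k(s_i):
    """Hypothetical scaling factor: grows with the input amplitude."""
    return 1.0 + abs(s_i)


def default_b(s_i):
    """Hypothetical addition factor: zero in this sketch."""
    return 0.0


def compress(s_i, k=default_k, b=default_b):
    """Apply S_o = S_i / K(S_i) + B(S_i) to a single sample."""
    return s_i / k(s_i) + b(s_i)


inputs = [0.0, 1.0, 3.0, -1.0]
outputs = [compress(s) for s in inputs]
```

With these assumed factors, an input of 1.0 is halved while an input of 3.0 is reduced to a quarter of its value, so that high amplitude samples are compressed more strongly than low amplitude samples, as stated in the description.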
After being compressed, the audio signal may be filtered, for example using the multi-band splitter 106 and the relatively simple multi-band filter 112 which is arranged for filtering the signal per frequency band. The multi-band filter 112 may be provided with amplifiers 113 for each frequency band. The filter characteristics of the multi-band filter 112 may be fixed, however, they can also be dependent on type information provided by a type determining unit 130 arranged for determining the type of the audio signal, for example pop music or classical music. In this way, the bass of pop music may be reduced by adjusting the gain of the respective amplifier 113 for the low frequency bands. Additionally, or alternatively (as indicated by the switch 118), a Fourier transform may be calculated by the first Fourier transform unit 108, followed by fixed or adaptive frequency domain filtering by the frequency domain filter 110, which is, in the embodiment shown, a signal dependent filter. The filter 110 may be adapted in response to an adaptation signal which is derived from the control signal Cntl (see also Fig. 3), which may be the (level adjusted) audio signal of the main channel, as discussed above with reference to Figs. 1 and 2. This adaptation signal is derived from the control signal Cntl using the second temporal analysis unit 120, the second fast Fourier transform unit 122 and the parameter setting unit 124. The filter 110 may for example suppress the (higher) amplitudes of audio frequency components in the channel AC1 or AC2 in a typical frequency band or in a frequency band which is actually determined by measurements. The output signal of the filter 110 is fed to an (optional) amplifier 115 which serves to compensate for any decrease in the (average) signal level caused by the filter 110. 
The output signal of the amplifier 115 (or, if the amplifier 115 is not present, the output signal of the filter 110) is fed to the switch 118 to be selectively coupled to the output of the unit 10. It is noted that it may be advantageous to use a critical band filter 116 that has a filtering characteristic modeled in accordance with the human auditory system. The audio signals may, for example, be split up in accordance with the well-known critical band theory, and the audio levels of the auxiliary channels (and/or main channel) are changed in dependence on their (mutual) interferences in each critical band. The exemplary embodiment of a level adjustment module 15 schematically shown in Fig. 5 comprises an intelligibility improvement unit (INTIMP) 152, an envelope unit 154, a topic detector (TOPDET) 156, a prosody analyzer (PROS) 158 and a keyword detector (KEYWD) 160. The keyword detector 160 receives relevant keywords from a keyword database (KEYDAT) 190 which may be external to the unit 10. The level adjustment unit 15 may represent any of the level adjustment units 11, 12 and 13 shown in Figs. 1 and 2. However, in contrast to the level adjustment units 10 of Figs. 3 and 4, the unit 15 of Fig. 5 does not have a control input (Cntl in Fig. 3). The unit 15 of Fig. 5 as shown is therefore suitable for embodiments of the device according to the present invention in which user control (UC in Fig. 1) is not required or is achieved by other means. However, the control signal Cntl (see Fig. 3) could be fed to the intelligibility improvement unit 152. Alternatively, or additionally, the control signal Cntl could be fed to the topic detector 156, the prosody analyzer 158 and the keyword detector 160, instead of or in addition to the sound signals of the channels MC, AC1 and/or AC2. The unit 15 is also suitable as an additional level control unit, arranged in series with a unit 10 of Fig.
3 or 4, for improving the intelligibility of speech before or after adjusting its level. As illustrated in Fig. 5, the main channel MC and the auxiliary channels AC1 and AC2 (if present) may be level adjusted by the unit 15, resulting in level adjusted channels MC', AC1' and AC2'. The audio signal contained in the channel (MC, AC1, AC2) of interest is improved by the intelligibility improvement module 152 by increasing the amplitudes of formants (as schematically illustrated in Fig. 5) or by other signal processing techniques known in the art. An envelope unit 154 then adjusts the envelope E of the improved audio signal. More in particular, the envelope unit 154 is preferably arranged for temporarily changing (increasing or decreasing) the envelope of the audio signal. To this end, the envelope unit 154 is provided with a controlled amplifier or equivalent means for adjusting the gain of the signal. A preferred embodiment of the envelope unit 154 is arranged for increasing the sound level at certain moments of interest. These moments may be detected by a topic detector 156 which may be arranged to analyze the audio and video content of the channel and detect certain features, for example pauses in speech, pauses in motion, end of a video insertion, and reappearance of a central character (such as a news reader) by face detection and similar techniques. A prosody analyzer 158 is provided for analyzing the prosody in the speech and enhancing the prosody by sending a suitable prosody enhancement signal to envelope unit 154. Keywords may also be detected using the keyword detector 160 and the associated keyword database 190. The keyword database 190 is updated using, for example, EPG (Electronic Program Guide) information summarizing the topics of television programs, and/or monitoring the interests of a user. If the rendering of a channel is suitably delayed, detected keywords may be rendered louder, thus alerting the user to these words.
The exemplary remote control unit 6 shown in Fig. 6 comprises a toggle stick 61, a channel selection button 62, a channel change button 63, (optional) auxiliary channel adjustment knobs 64 and 65, a ratio adjustment assembly 66 comprising four buttons 67, and a screen 68 on which a channel overview 69 is displayed. In the example shown, the remote control unit 6 operates a television set and/or a home video system. Channel change button (key) 63 allows the channels being rendered to be changed by the user. In a typical embodiment, depressing the channel change button 63 repeatedly will cause the displayed channels to "rotate", that is, to be displayed in succession. Depressing the selection button (key) 62 selects the current channel (NL1 in the example shown) as the main channel. This selection of the main channel will generate a selection signal (Sel in Figs. 1 and 2). Alternatively, a keypad (not shown) may be provided to enter a channel number of a first channel to be rendered. Depressing the channel selection button 62 will then select this rendered channel as the main channel and generate the selection signal, after which any further channel number entered in the keypad will display the (first) auxiliary channel (SBS6 in the example shown). It will be understood that the manner in which two or more channels out of a plurality of channels are chosen is not essential to the present invention. In the embodiment shown, the sound level ratio of the main channel and the auxiliary channel(s) may be adjusted by the user. To this end, the toggle stick 61, which essentially is a switch that can be moved from a central neutral position to either a left active position or a right active position, is arranged in such a way that the user can step through a number of ratios.
Assuming an initial situation in which the (factory set or programmed) ratio of the sound levels is 70/30 (that is, main channel 70% of total sound level, auxiliary channel 30%), moving the toggle stick to the left once may change the ratio into 80/20, and doing this twice may result in a ratio of 90/10. Similarly, moving the toggle stick to the right once may change the ratio from 70/30 into 60/40. The remote control unit may be arranged such that the ratio cannot exceed a threshold value, for example 50/50. The remote control unit and/or the level adjustment units of the present invention may advantageously be designed such that activating the toggle stick 61, or any equivalent sound level interface component, causes a temporary balance adjustment which lasts for a duration of, for example, approximately one second or several seconds, after which the sound levels revert to their previous values. In some embodiments using the toggle stick 61 may provide an alternative way of selecting the main channel, producing a selection signal Sel when the ratio reaches 40/60, for example. Those skilled in the art will understand that various alternative or complementary arrangements are possible and that the toggle stick 61 is a useful but optional feature of the remote control unit 6. Each newly selected ratio may be displayed, for example on a screen of the remote control unit or on the screen of an associated television set. Alternatively, an aural indication could be provided, for example an audible signal produced by a signal generator or a speech generator. An alternative way of adjusting the ratio of the rendered channels is provided by (optional) auxiliary channel adjustment knobs 64 and 65. Rotating each of these knobs causes the level of the respective auxiliary channel to be adjusted. This manual adjustment is in addition to the automatic adjustment provided by the present invention. 
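By way of non-limiting illustration, the stepping through ratios by means of the toggle stick 61 may be sketched in Python as follows. The class name `RatioControl`, its method names, the step size of 10 percentage points and the upper limit of 100/0 are assumptions chosen to reproduce the example values given above (70/30, 80/20, 90/10, 60/40 and the 50/50 threshold).

```python
class RatioControl:
    """Tracks the main/auxiliary sound level ratio as percentages of the total."""

    def __init__(self, main_share=70, step=10, min_main=50, max_main=100):
        self.main_share = main_share
        self.step = step
        self.min_main = min_main  # threshold: the ratio cannot go below 50/50
        self.max_main = max_main  # assumed upper limit of 100/0

    def toggle_left(self):
        """Increase the main channel share by one step, up to the upper limit."""
        self.main_share = min(self.max_main, self.main_share + self.step)
        return self.ratio()

    def toggle_right(self):
        """Decrease the main channel share by one step, down to the threshold."""
        self.main_share = max(self.min_main, self.main_share - self.step)
        return self.ratio()

    def ratio(self):
        """Return (main percentage, auxiliary percentage)."""
        return (self.main_share, 100 - self.main_share)


control = RatioControl()
after_left_once = control.toggle_left()
after_left_twice = control.toggle_left()

other = RatioControl()
after_right_once = other.toggle_right()
after_right_twice = other.toggle_right()
after_right_thrice = other.toggle_right()
```

In this sketch a third movement to the right has no further effect, because the ratio is held at the 50/50 threshold.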
Embodiments can be envisaged in which the automatic adjustment overrides the manual adjustment or vice versa. A further optional feature of the remote control unit 6 is the ratio adjustment assembly 66 which comprises four buttons 67. These buttons may serve to manually adjust sound levels (and/or sound level ratios) in the respective channels and to choose the channel to be adjusted. It will be understood that the main channel selection button of a remote control unit according to the present invention is typically distinct from the usual channel selection buttons of a remote control which merely serve to select a channel to be rendered. The main channel selection button (or its equivalent) determines which channel of the channels being rendered simultaneously is to be the main channel, that is, the channel under direct user control, in contrast to the auxiliary channels, the levels of which are automatically controlled relative to the main channel. It will further be understood that instead of using a remote control unit, other controls are possible. The device of the present invention could, for example, be provided with a speech command interpreter. As discussed above, the level adjusted audio channels can be rendered using a single, common transducer (such as a loudspeaker) or set of transducers reproducing summed signals, or using individual transducers or sets of transducers for one or more channels. The main channel is preferably rendered using a separate transducer or set of transducers (it will be understood that a set of transducers may comprise, for example, a woofer and a tweeter, or other combinations of loudspeakers, resonators and/or other transducers). In a particularly advantageous embodiment, the main channel is rendered using a centrally located transducer (or set of transducers), while the auxiliary channel(s) is/are rendered using laterally located transducers (or sets of transducers). Such an arrangement is schematically shown in Fig.
7, where a television set 9 is provided with a centrally positioned loudspeaker 2 for rendering the main channel and four laterally positioned loudspeakers 3, 3', 4 and 4' for rendering the auxiliary channel(s). The television set 9 is further provided with a device (1 in Figs. 1 and 2) according to the present invention. In the example shown, the television set has a screen 8 that is divided into two parts which are schematically indicated I and II. Each part is assigned a channel comprising both audio and video. As mentioned above, it is preferred that the central loudspeaker 2 renders the sound of the main channel which is displayed in, for example, screen section I, while the lateral loudspeakers 3 and 4 render the sound of the auxiliary channel, the video of which is in this example rendered by screen section II. The television set 9 shown in Fig. 7 is part of a home cinema system which further comprises a set-top box 7 and stand-alone loudspeaker units 3' and 4'. The present invention is based upon the insight that the sound levels of several audio channels which can be rendered simultaneously should be controlled interdependently: adjusting the sound level of one channel may require the adjustment of one or more other channels. The present invention benefits from the further insight that user control of multiple channels is facilitated if the user has to control the sound level of a single, main channel only, all other channels being controlled automatically in dependence of the main channel. The term computer program product should be understood to include any physical realization, e.g. an article of manufacture, of a collection of commands enabling a processor (generic or special purpose), after a series of loading steps to get the commands into the processor, to execute any of the characteristic functions of an invention.
In particular, the computer program product may be realized as program code, processor adapted code derived from this program code, or any intermediate translation of this program code, on a carrier such as a disk or other plug-in component, present in a memory, temporarily present on a network connection (wired or wireless), or as program code on paper. Apart from program code, invention characteristic data required for the program may also be embodied as a computer program product. It is noted that any terms used in this document should not be construed so as to limit the scope of the present invention. In particular, the words "comprise(s)" and "comprising" are not meant to exclude any elements not specifically stated. Single (circuit) elements may be substituted with multiple (circuit) elements or with their equivalents. It will be understood by those skilled in the art that the present invention is not limited to the embodiments illustrated above and that many modifications and additions may be made without departing from the scope of the invention as defined in the appended claims.

Claims

1. A device (1) for controlling the sound levels of a group of audio channels comprising a main channel (MC) and at least one auxiliary channel (AC1) which can be rendered simultaneously, the device comprising: - user controlled selection means (14, 16) for selecting the main channel, and - automatic level adjustment means (12, 13) for adjusting the sound level of the at least one auxiliary channel relative to the main channel.
2. The device according to claim 1, wherein the selection means (14, 16) are arranged for selecting successive available channels in response to user input (Sel).
3. The device according to claim 1 or 2, wherein the level adjustment means (12, 13) are arranged for providing pre-set relative sound levels.
4. The device according to claim 3, further arranged for altering the pre-set relative sound levels by the user.
5. The device according to claim 1, wherein the level adjustment means (12, 13) are arranged for adapting the respective sound levels to the content of each associated audio channel (AC1, AC2).
6. The device according to claim 5, further arranged for adapting the respective sound levels to user preferences regarding the content.
7. The device according to claim 1, wherein the level adjustment means (12, 13) are arranged for adapting the respective sound levels to the signal characteristics of each associated audio channel (AC1, AC2).
8. The device according to claim 7, wherein the level adjustment means (12, 13) are arranged for speech detection.
9. The device according to claim 8, wherein the level adjustment means (12, 13) are further arranged for speech analysis.
10. The device according to claims 5 and 7, wherein the level adjustment means (12, 13) are arranged for temporarily adjusting the sound level of a channel (MC, AC1, AC2) in response to the content and/or signal characteristics of at least one channel (MC; AC1).
11. The device according to claim 1, wherein the level adjustment means (12, 13) are arranged for gradually adjusting the sound level.
12. The device according to claim 1, wherein the level adjustment means (12, 13) are arranged for clipping, compressing and/or filtering audio signals contained in the channels.
13. The device according to claim 1, wherein the main channel (MC) and the at least one auxiliary channel (AC1) are rendered by different transducers (2, 3, 4).
14. The device of claim 13, further provided with transducer selecting means for selecting a transducer (2, 3, 4) which renders the main channel (MC) and/or the at least one auxiliary channel (AC1, AC2).
15. A level adjustment means (11, 12, 13) for use in the device according to claim 1.
16. A remote control unit (6) for use with the device according to claim 1, comprising selection interface components (62, 63), such as buttons, for selecting the main channel (MC).
17. The remote control unit according to claim 16, further comprising a first sound level interface component (61), such as a toggle stick, for setting a ratio of sound levels of rendered channels (MC, AC1, AC2).
18. The remote control unit according to claim 16 or 17, further comprising second sound level interface components (64, 65), such as knobs, for manually adjusting the sound levels of rendered channels (MC, AC1, AC2).
19. An audio system comprising a device (1) according to claim 1.
20. A home entertainment system comprising a device (1) according to claim 1.
21. A television system (9) comprising a device (1) according to claim 1.
22. The system according to claim 19, 20 or 21, wherein the main channel (MC) is rendered by a transducer (AC1) which is located centrally relative to the system.
23. A method of controlling the sound levels of a group of audio channels comprising a main channel (MC) and at least one auxiliary channel (AC1) which can be rendered simultaneously, the method comprising the steps of: - selecting, under user control, the main channel, and - automatically adjusting the sound level of the at least one auxiliary channel (AC1) relative to the main channel.
24. The method according to claim 23, wherein the sound level of the at least one auxiliary channel (AC1; AC2) is set using a plurality of pre-set relative sound levels.
25. The method according to claim 23, wherein the respective sound levels of the at least one auxiliary channel (AC1; AC2) are adapted to the content of each associated audio channel.
26. The method according to claim 23, wherein the respective sound levels of the at least one auxiliary channel (AC1; AC2) are adapted to the signal characteristics of each associated audio channel.
27. The method according to claim 25 or 26, wherein speech detection is used.
28. The method according to claim 27, wherein formant detection, prosody detection and/or keyword detection is used.
29. A computer program product for carrying out the method according to claim 22.
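
For illustration only, the two steps of the method of claim 23 (user selection of the main channel, automatic adjustment of the auxiliary channels relative to it), together with the pre-set relative level of claim 24 and the gradual adjustment of claim 11, might be sketched as follows. This is a minimal sketch and not the claimed implementation: the function names, the default offset of -12 dB and the step size are assumptions that do not appear in the claims.

```python
def relative_gains(channels, main, aux_offset_db=-12.0):
    """Return a linear gain per channel.

    The user-selected main channel keeps unity gain; every other
    (auxiliary) channel is attenuated by a pre-set relative level.
    The -12 dB default is an assumed value, not taken from the claims.
    """
    if main not in channels:
        raise ValueError(f"unknown main channel: {main!r}")
    aux_gain = 10.0 ** (aux_offset_db / 20.0)  # dB to linear amplitude
    return {ch: 1.0 if ch == main else aux_gain for ch in channels}


def step_toward(current, target, max_step=0.05):
    """Move a gain toward its target by at most max_step per call,
    so that a level change is applied gradually rather than abruptly."""
    delta = target - current
    if abs(delta) <= max_step:
        return target
    return current + (max_step if delta > 0 else -max_step)


if __name__ == "__main__":
    channels = ["MC", "AC1", "AC2"]
    # The user promotes AC1 to main channel; the others become auxiliary.
    targets = relative_gains(channels, main="AC1")
    gains = {ch: 1.0 for ch in channels}
    for _ in range(20):
        gains = {ch: step_toward(gains[ch], targets[ch]) for ch in channels}
    print(gains)
```

Calling step_toward once per audio block is one possible way to obtain a gradual transition; a real device could equally use a time constant or a dB-domain ramp.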
PCT/IB2005/051080 2004-04-08 2005-03-31 Audio level control WO2005099252A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2007506889A JP4913038B2 (en) 2004-04-08 2005-03-31 Audio level control
US10/599,630 US8600077B2 (en) 2004-04-08 2005-03-31 Audio level control
EP05718606.6A EP1736001B2 (en) 2004-04-08 2005-03-31 Audio level control
KR1020067020871A KR101249239B1 (en) 2004-04-08 2005-03-31 Audio level control

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP04101456.4 2004-04-08
EP04101456 2004-04-08

Publications (1)

Publication Number Publication Date
WO2005099252A1 (en) 2005-10-20

Family

ID=34962758

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2005/051080 WO2005099252A1 (en) 2004-04-08 2005-03-31 Audio level control

Country Status (6)

Country Link
US (1) US8600077B2 (en)
EP (1) EP1736001B2 (en)
JP (1) JP4913038B2 (en)
KR (1) KR101249239B1 (en)
CN (1) CN100518269C (en)
WO (1) WO2005099252A1 (en)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8139793B2 (en) 2003-08-27 2012-03-20 Sony Computer Entertainment Inc. Methods and apparatus for capturing audio signals based on a visual image
US8233642B2 (en) 2003-08-27 2012-07-31 Sony Computer Entertainment Inc. Methods and apparatuses for capturing an audio signal based on a location of the signal
US8160269B2 (en) 2003-08-27 2012-04-17 Sony Computer Entertainment Inc. Methods and apparatuses for adjusting a listening area for capturing sounds
KR100640477B1 (en) * 2005-06-29 2006-10-30 삼성전자주식회사 Method and apparatus for outputting audio signal dependent on digital multimedia broadcasting channel
US20090062943A1 (en) * 2007-08-27 2009-03-05 Sony Computer Entertainment Inc. Methods and apparatus for automatically controlling the sound level based on the content
PA8847501A1 (en) * 2008-11-03 2010-06-28 Telefonica Sa METHOD AND REAL-TIME IDENTIFICATION SYSTEM OF AN AUDIOVISUAL AD IN A DATA FLOW
EP2392072A4 (en) * 2009-02-02 2014-09-03 Hewlett Packard Development Co Method of leveling a plurality of audio signals
JP2010244602A (en) * 2009-04-03 2010-10-28 Sony Corp Signal processing device, method, and program
US8434006B2 (en) * 2009-07-31 2013-04-30 Echostar Technologies L.L.C. Systems and methods for adjusting volume of combined audio channels
JP5389214B2 (en) * 2012-03-30 2014-01-15 株式会社東芝 Volume control device
US10027303B2 (en) * 2012-11-13 2018-07-17 Snell Advanced Media Limited Management of broadcast audio loudness
US9571054B2 (en) * 2013-02-28 2017-02-14 Rovi Guides, Inc. Systems and methods for dynamically adjusting volume based on media content
US9385678B2 (en) 2013-05-03 2016-07-05 Honda Motor Co., Ltd. Methods and systems for controlling volume
WO2015165076A1 (en) 2014-04-30 2015-11-05 Motorola Solutions, Inc. Method and apparatus for discriminating between voice signals
US9525392B2 (en) * 2015-01-21 2016-12-20 Apple Inc. System and method for dynamically adapting playback device volume on an electronic device
US10091581B2 (en) * 2015-07-30 2018-10-02 Roku, Inc. Audio preferences for media content players
US9947332B2 (en) 2015-12-11 2018-04-17 Ibiquity Digital Corporation Method and apparatus for automatic audio alignment in a hybrid radio system
US9755598B2 (en) 2015-12-18 2017-09-05 Ibiquity Digital Corporation Method and apparatus for level control in blending an audio signal in an in-band on-channel radio system
US10177729B1 (en) * 2018-02-19 2019-01-08 Ibiquity Digital Corporation Auto level in digital radio systems
KR20210066282A (en) * 2019-11-28 2021-06-07 삼성전자주식회사 Display apparatus and control method for the same
US11381209B2 (en) 2020-03-12 2022-07-05 Gaudio Lab, Inc. Audio signal processing method and apparatus for controlling loudness level and dynamic range

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5718110A (en) * 1980-07-09 1982-01-29 Arupain Kk Equalizer device
JP2656306B2 (en) * 1988-07-05 1997-09-24 株式会社東芝 Telephone
JP2630651B2 (en) * 1989-07-26 1997-07-16 ヤマハ株式会社 Fader device
JP3057719B2 (en) * 1990-06-22 2000-07-04 ソニー株式会社 Volume control circuit
JPH04114576A (en) 1990-09-04 1992-04-15 Sony Corp Sound output circuit for electronic unit provided with screen synthesis function
JPH04251325A (en) * 1991-01-09 1992-09-07 Nec Corp System for controlling sound volume of multiwindow system
JPH0519729A (en) * 1991-07-12 1993-01-29 Hitachi Ltd Image device and its sound volume control method
US5430826A (en) * 1992-10-13 1995-07-04 Harris Corporation Voice-activated switch
JP3293240B2 (en) * 1993-05-18 2002-06-17 ヤマハ株式会社 Digital signal processor
GB2284968A (en) 1993-12-18 1995-06-21 Ibm Audio conferencing system
JP2836508B2 (en) * 1994-12-09 1998-12-14 日本電気株式会社 Electronic conference terminal
US5692058A (en) 1995-03-02 1997-11-25 Eggers; Philip E. Dual audio program system
JP3329361B2 (en) * 1995-04-24 2002-09-30 ソニー株式会社 Video display audio output device
JP3393480B2 (en) * 1995-04-28 2003-04-07 ソニー株式会社 Image display audio output device
JPH09307833A (en) * 1996-05-17 1997-11-28 Sony Corp Audio controller for video equipment and audio control method
US6026168A (en) 1997-11-14 2000-02-15 Microtek Lab, Inc. Methods and apparatus for automatically synchronizing and regulating volume in audio component systems
US6396549B1 (en) * 1997-11-19 2002-05-28 Harold J. Weber Remote controller for a multi-device television receiving system providing channel number auto-completion, presettable audio hush level and base channel auto-reaffirm
US6311155B1 (en) * 2000-02-04 2001-10-30 Hearing Enhancement Company Llc Use of voice-to-remaining audio (VRA) in consumer applications
JPH11355691A (en) 1998-06-03 1999-12-24 Toshiba Corp Volume controller of two picture television receiver
US6442278B1 (en) 1999-06-15 2002-08-27 Hearing Enhancement Company, Llc Voice-to-remaining audio (VRA) interactive center channel downmix
JP2001043062A (en) * 1999-07-27 2001-02-16 Nec Corp Personal computer, volume control method thereof, and recording medium
US6965676B1 (en) * 1999-10-19 2005-11-15 Texas Instruments Incorporated Volume-responsive loudness compensation circuits, systems, and methods
JP2003518831A (en) 1999-12-22 2003-06-10 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Multiple window display system
US7373650B1 (en) * 2000-02-01 2008-05-13 Scientific-Atlanta, Inc. Apparatuses and methods to enable the simultaneous viewing of multiple television channels and electronic program guide content
JP2002165152A (en) * 2000-11-28 2002-06-07 Matsushita Electric Ind Co Ltd Volume control apparatus
JP4051607B2 (en) * 2002-03-06 2008-02-27 船井電機株式会社 Television receiver
CN1267814C (en) * 2002-11-14 2006-08-02 三星电子株式会社 Electronic apparatus and control method thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5878391A (en) * 1993-07-26 1999-03-02 U.S. Philips Corporation Device for indicating a probability that a received signal is a speech signal
JPH0851580A (en) * 1994-08-08 1996-02-20 Fujitsu General Ltd Audio circuit for screen division display device
JP2000069391A (en) * 1998-08-17 2000-03-03 Toshiba Corp Multi-screen receiver
US6590618B1 (en) * 1998-09-14 2003-07-08 Samsung Electronics Co., Ltd. Method and apparatus for changing the channel or varying the volume level in a television receiver having a double screen mode function
EP1035732A1 (en) * 1998-09-24 2000-09-13 Fourie Inc. Apparatus and method for presenting sound and image
JP2001125695A (en) * 1999-10-28 2001-05-11 Matsushita Electric Ind Co Ltd Window managing device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
PATENT ABSTRACTS OF JAPAN vol. 1996, no. 06 28 June 1996 (1996-06-28) *
PATENT ABSTRACTS OF JAPAN vol. 2000, no. 06 22 September 2000 (2000-09-22) *
PATENT ABSTRACTS OF JAPAN vol. 2000, no. 22 9 March 2001 (2001-03-09) *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011044153A1 (en) * 2009-10-09 2011-04-14 Dolby Laboratories Licensing Corporation Automatic generation of metadata for audio dominance effects
US9552845B2 (en) 2009-10-09 2017-01-24 Dolby Laboratories Licensing Corporation Automatic generation of metadata for audio dominance effects
US9462382B2 (en) 2012-03-12 2016-10-04 Jaguar Land Rover Limited Audio system

Also Published As

Publication number Publication date
CN1947417A (en) 2007-04-11
KR20070020440A (en) 2007-02-21
JP4913038B2 (en) 2012-04-11
CN100518269C (en) 2009-07-22
US8600077B2 (en) 2013-12-03
JP2007533191A (en) 2007-11-15
EP1736001B1 (en) 2019-01-09
US20070177743A1 (en) 2007-08-02
EP1736001B2 (en) 2021-09-29
EP1736001A1 (en) 2006-12-27
KR101249239B1 (en) 2013-04-16

Similar Documents

Publication Publication Date Title
EP1736001B1 (en) Audio level control
US6552753B1 (en) Method and apparatus for maintaining uniform sound volume for televisions and other systems
US5065432A (en) Sound effect system
US6195438B1 (en) Method and apparatus for leveling and equalizing the audio output of an audio or audio-visual system
US7567898B2 (en) Regulation of volume of voice in conjunction with background sound
KR100604016B1 (en) An image display device for having function of controlling sound level and method of controlling the same
EP3108672A2 (en) Content-aware audio modes
KR101558199B1 (en) Digital sound mixing apparatus having control function through GUI
JPH1195759A (en) Automatic timbre correction method and apparatus therefor
KR20140090469A (en) Method for operating an apparatus for displaying image
CN114902560A (en) Apparatus and method for automatic volume control with ambient noise compensation
JP2010258776A (en) Sound signal processing apparatus
KR100970724B1 (en) Sound reproducing apparatus based on auditory characteristic and television system thereof
KR100688650B1 (en) Method and apparatus for processing sound of an image display device
KR20050062201A (en) Automatic image and automatic sound set up methode according to program genre of broadcast receiver
KR100203308B1 (en) Audio signal setting level integration regulating and reproducing apparatus for a television
KR920007139Y1 (en) Circuit choosing automatic sound by using equalizer in tv
WO2006059539A1 (en) Broadcast program reception device, broadcast program reception signal processing device, method, and program
JP2013121096A (en) Voice regulator and digital broadcast receiver
KR970003041B1 (en) Sound automatic converting circuit and method therefor
KR20060134492A (en) Method and apparatus for controlling sound of (an) image display device
JPH04188971A (en) Sound quality setting device
JPH05236389A (en) Tv receiver
KR20040008928A (en) Television with sound selecting function
KR19990085424A (en) Sound mode automatic setting device and method

Legal Events

Date Code Title Description
AK Designated states. Kind code of ref document: A1. Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW
AL Designated countries for regional patents. Kind code of ref document: A1. Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase. Ref document number: 2005718606. Country of ref document: EP
WWE Wipo information: entry into national phase. Ref document number: 2007506889. Country of ref document: JP
WWE Wipo information: entry into national phase. Ref document number: 10599630. Country of ref document: US. Ref document number: 2007177743. Country of ref document: US. Ref document number: 1020067020871. Country of ref document: KR
WWE Wipo information: entry into national phase. Ref document number: 200580012152.2. Country of ref document: CN
NENP Non-entry into the national phase. Ref country code: DE
WWW Wipo information: withdrawn in national office. Ref document number: DE
WWE Wipo information: entry into national phase. Ref document number: 4057/CHENP/2006. Country of ref document: IN
WWP Wipo information: published in national office. Ref document number: 2005718606. Country of ref document: EP
WWP Wipo information: published in national office. Ref document number: 1020067020871. Country of ref document: KR
WWP Wipo information: published in national office. Ref document number: 10599630. Country of ref document: US