WO2007125470A2 - Method of determining a characteristic fragment of a recording and system for playing back a recording - Google Patents

Method of determining a characteristic fragment of a recording and system for playing back a recording Download PDF

Info

Publication number
WO2007125470A2
WO2007125470A2 PCT/IB2007/051489 IB2007051489W WO2007125470A2 WO 2007125470 A2 WO2007125470 A2 WO 2007125470A2 IB 2007051489 W IB2007051489 W IB 2007051489W WO 2007125470 A2 WO2007125470 A2 WO 2007125470A2
Authority
WO
WIPO (PCT)
Prior art keywords
recording
recordings
reproduction
location
determined
Prior art date
Application number
PCT/IB2007/051489
Other languages
French (fr)
Other versions
WO2007125470A3 (en
Inventor
Clemens C. WÜST
Paulus M. H. M. A. Gorissen
Wilhelmus P. A. J. Michiels
Norbert C. Esser
Paul F. Hoogendijk
Petrus A. C. J. Van Loon
Original Assignee
Pace Plc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pace Plc filed Critical Pace Plc
Publication of WO2007125470A2 publication Critical patent/WO2007125470A2/en
Publication of WO2007125470A3 publication Critical patent/WO2007125470A3/en

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • G11B27/105Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording

Definitions

  • the invention relates to a method of determining characteristic fragments of recordings.
  • the invention also relates to a system for playing back a recording, including a control system for controlling the system for playing back the recording so as to reproduce perceptibly at least one signal corresponding to at least part of the recording.
  • the invention also relates to a portable media player, comprising a storage apparatus for storing a plurality of sets of data corresponding to recordings.
  • the invention also relates to a computer program.
  • US2003/012339 describes a sound reproducing apparatus including a sound storing unit, a sound critical points retrieving unit, a sound property unit, a sound reproducing unit and a reproduction control unit.
  • the reproduction control unit refers to the property data of the sound piece being object of reproduction, recorded in the storing unit, and obtains a critical time point property. Then, the reproduction control unit indicates the critical point time to the sound reproducing unit as the time for start of reproduction, thereby starting reproduction from the critical point.
  • the reproduction control unit acquires the next critical point time according to the same procedure so as to control the sound reproducing unit, and starts reproduction at that time.
  • a sound critical points retrieving apparatus comprises a sound features extracting unit, which extracts predetermined sound features from a sound signal; a cost function calculating unit, which calculates a cost function indicating the likelihood of sound critical points from the sound features which have been extracted by the sound features extracting unit; a peak detecting unit, which detects peak times and peak values in a case where a value of the cost function calculated by the cost function calculating unit indicates a maximum value; and a selecting unit which selects sound critical points having likelihood values of a predetermined value or more from the peak times and peak values detected by the peak detecting unit.
  • a problem of the known system is that not all prominent signal features are indicative of the boundaries of a readily recognisable part of the song.
  • the method is provided for determining a characteristic fragment of a recording, the method comprising steps of - controlling a playback of the recording so as to cause to reproduce perceptibly at least one signal corresponding to at least part of the recording, - receiving a command inputted by a user during the playback, wherein the controller is configured to carry out an operation relating to the recording in response to the command, the controller being configured to determine a location, in a pre-determined relation to a point reached during the perceptible reproduction of at least part of the recording at a time of receipt of the command, as location of the characteristic fragment of the recording.
  • Recordings include audio, visual and audiovisual material with a time base, recorded in digital or analogue form, optionally in a compressed digital representation. At least upon further processing, the information in the recording is suitable for generating one or more input signals for generating a perceptible reproduction of the material over a time interval corresponding to the length of the recording.
  • the method is based on the insight that during reproduction of an as yet unknown recording, a user will provide an input to carry out an operation in relation to the recording, the moment the user has recognised the current recording. Such recognition is most likely to occur at around the time that the reproduction of the current recording has reached the characteristic fragment.
  • the step of perceptibly reproducing at least part of a recording is carried out as part of a step of perceptibly reproducing at least respective parts of a series of recordings in succession according to a pre-determined scheme, and wherein reproduction according to the pre-determined scheme is terminated in response to the input signal generated by the user.
  • the effect is that the time of receipt of the input signal is more likely to correspond to recognition of a characteristic part of the recording. This is so, because the recordings are reproduced in succession according to a pre-determined scheme, rather than in response to selection commands from the user.
  • a form of "browsing" is implemented, whereby the user is provided with a succession of reproductions of (parts of) recordings, without having immediate influence over which one will follow next.
  • the user perceives a targeted one of the recordings, he or she can provide an input signal. This will occur upon recognition of the targeted recording.
  • an order of succession of the series of recordings is determined on the basis of statistics, maintained for the recordings in the series and indicative of respective frequencies of perceptible reproduction of the recordings.
  • the operation relating to the recording of which at least a part is being reproduced includes perceptibly reproducing more than a pre-determined part of that recording if the input signal generated by the user is of a first type.
  • This provides an alternative to a menu-driven facility for selecting a recording for complete reproduction.
  • the user Upon recognition of the recording, preferably only after perceiving part of the recording, the user can select it with an input signal that need not convey much more information than a binary selection. It is particularly suitable for implementation in small form- factor media players, where the ability to dispense with an elaborate graphical user interface for selecting a recording is useful.
  • the operation relating to the recording of which at least a part is being reproduced includes terminating perceptible reproduction of any part of that recording and commencing perceptible reproduction of at least a part of a next recording in a series of recordings.
  • the effect is that a user can go through a series of recordings in search for a targeted recording at relatively high speed. As soon as he or she has recognised a recording as not being the targeted one, the next one can be tried.
  • an interval ending at a point within a pre-determined distance to the point reached during perceptible reproduction of at least part of the recording at a time of receipt of the input signal and preceding that point during perceptible reproduction of the recording is determined as the characteristic fragment of the current recording.
  • An embodiment includes storing data for determining at least the location of the characteristic fragment in association with the current recording upon determining at least the location of the characteristic fragment.
  • the characteristic fragment as an annotation to the complete recording, to be used for selecting the complete recording for perceptible reproduction.
  • At least one step of perceptibly reproducing at least part of a recording includes selecting at least a point in the recording from which to start perceptible reproduction of a part thereof in dependence on data stored in association with the recording.
  • a device for determining a characteristic fragment of a recording, the device comprising a controller for controlling a playback of the recording so as to cause to reproduce perceptibly at least one signal corresponding to at least part of the recording, an interface for receiving a command inputted by a user during the playback, wherein the controller is configured to carry out an operation relating to the recording in response to the command, the controller being configured to determine a location, in a pre-determined relation to a point reached during the perceptible reproduction of at least part of the recording at a time of receipt of the command, as location of the characteristic fragment of the recording.
  • a device may be implemented as a server remotely located from a portable media player of the user.
  • a system for playing back a recording comprises a reproduction apparatus for perceptibly reproducing at least one signal from a recording, a control system for controlling the system for playing back recordings so as to reproduce perceptibly at least one signal corresponding to at least part of a current recording, an interface for receiving a command from a user and providing a signal in response to the command from the user to the control system, wherein the control system is configured to carry out an operation relating to the current recording in response to the input signal, characterised in that the control system is configured to determine a location in a pre-determined relation to a point reached during perceptible reproduction of at least part of the recording at a time of receipt of the input signal as location of a characteristic fragment of the current recording.
  • system is configured to carry out a method according to the invention.
  • a portable media player comprising a storage apparatus for storing a plurality of sets of data corresponding to recordings, and a system for playing back recordings according to the invention.
  • the portable media player can operate with relatively low processing power and at relatively high speed to determine a characteristic fragment.
  • the portable media player can be kept relatively simple.
  • a computer program including a set of instructions capable, when incorporated in a machine-readable medium, of causing a system having information processing capabilities to perform a method according to the invention.
  • Fig. 1 is a very schematic view of some components of a portable media player
  • Fig. 2 is a schematic simplified view of the exterior of such a media player
  • Fig. 3 is a flow chart illustrating some steps in a method of determining characteristic fragments of recordings.
  • a portable media player 1 (Fig. 2) comprises a system 2 (Fig. 1) for perceptible reproduction of recordings, in the present example audio tracks.
  • the example of a system for reproducing audio tracks will be used throughout the present description in the understanding that the principles underlying the method of determining characteristic fragments of audio tracks can also be applied to determined characteristic fragments of other types of recordings. These include video sequences (either synchronised with accompanying audio tracks or silent), still images (to determine a characteristic region), etc.
  • the system 2 comprises a microprocessor 3 for executing instructions enabling the microprocessor 3 to control the operation of the system 2.
  • Such instructions can be stored in a first flash memory device 4, connected to the microprocessor 3 via a bus 5.
  • a second flash memory device 6 is provided for storing audio files representing recordings of audio tracks.
  • a Random Access Memory (RAM) 7 and Digital Signal Processor (DSP) 8 are connected to the second flash memory device 6 via a second bus 9.
  • the DSP 8 operates in conjunction with a decoder chip 10 to decompresses and decode the audio information in the files so as perceptibly to reproduce the audio recording on an output of the system 2.
  • the decoder chip 10 is connected to an output amplifier 11 delivering an analogue output signal to, for example speakers (not shown) or headphones (not shown).
  • USB controller 12 and USB connector 13 enable the loading of new audio files, for example from a computer.
  • USB Universal Serial Bus
  • other variants of the system 2 are obtainable using other types of physical links to a personal computer or server.
  • the microprocessor 3 receives input signals from an input device 14, comprising a number of keys enabling a user to generate an input signal.
  • a display unit 15 is provided.
  • the use of characteristic fragments allows a user to locate a desired audio track for reproduction without having to use an elaborate graphical user interface with menus.
  • the display unit 15 could in this respect be dispensed with.
  • the method of locating a desired audio track can be implemented with only two types of input signal.
  • three types of input signal can be generated by the user.
  • a first key 16 for generating a signal to the microprocessor 3 to skip back one track a second key 17 for generating a signal representative of selection of a track
  • the microprocessor 3 controls the system to carry out a method as illustrated in Fig. 3.
  • a first step 19 a current one of a series of audio tracks is selected.
  • the audio tracks in the series are selected in a particular order of succession.
  • the order is determined by the order of storage in the second flash memory device 6. In another embodiment, the order is determined by referring to metadata, stored in association with the audio files and indicating the frequency of complete reproduction of the respective audio tracks. This indication may be in the form of an absolute count or a value relative to the total number of reproductions of the audio files in the collection.
  • a second step 20 the audio file selected in the first step 19 is read.
  • metadata associated with the audio track is read for use in a subsequent step 21.
  • this step 21 it is determined whether data identifying at least the location of a characteristic fragment within the audio track has been stored with the audio track.
  • Each recording representing an audio track has a certain progression, so that it is possible to relate a location to the length of the audio track (e.g. half-way through).
  • a location can be related to the starting point at normal reproduction speed (e.g. ten seconds into the track, etc.).
  • only a particular point within the audio track is identified in the data stored in association with it. This could be the starting point or end point of a characteristic fragment.
  • the length of the characteristic fragment is a default value, stored, for example in the first flash memory device 4.
  • the interval corresponding to the characteristic fragment is identified in terms of its length and location, e.g. by including information representative of the starting point and end point of the characteristic fragment.
  • Perceptible reproduction involving decompression and decoding of the audio information in the audio file, commences (step 22) at the starting point that has been determined.
  • the length of the fragment that is reproduced is pre-determined. Upon reaching the end of the fragment without receiving any input signal, the first step 19 is repeated to select the next audio track in the order of succession. In another variant, the current audio track is reproduced until the end. If no input signal is received, the next audio track is selected.
  • a break-off point is determined.
  • the break-off point is determined as the point in the audio track reached during reproduction of the audio track in its recorded order of progression that coincides with receipt of the input signal generated by the user using one of the keys 16-18.
  • a location of the characteristic fragment is determined. The location of the characteristic fragment is determined in relation to the break-off point.
  • the characteristic fragment is determined to correspond to an interval directly preceding the break-off point.
  • the length of the interval is a default value.
  • the characteristic fragment corresponds to an interval within which the break-off point lies.
  • the length of the interval is a value stored in the first flash memory device 4 and part of the user settings. That is to say that the user may change the default value.
  • the current audio track is selected for complete reproduction (step 24).
  • the complete reproduction continues until the end of the audio track.
  • the complete reproduction recommences from the start of the audio track and continues until the end.
  • step 25 data for determining at least its location, and optionally its length, are stored (step 25) in association with the audio file including the current audio track.
  • the step 22 of commencing reproduction of a characteristic fragment can take account of the data identifying what the user considers to be the characteristic fragment of the audio track.
  • At least a location, and optionally a length, of a characteristic fragment is also determined - in the manner described above - when the input signal generated by the user is of a second type.
  • the input signal of the second type is generated by pressing one of the first and third keys.
  • the system 2 then immediately terminates reproduction of the current recording. It then autonomously carries out the first step 19 to select a next (or preceding) recording in accordance with a pre-determined scheme defining the order of succession of the series of audio tracks.
  • the order of succession is the same as used when no input signal is received, which situation has been described above already.
  • the data identifying at least the location of a characteristic fragment - determined in relation to the break-off point - are also stored in association with the audio file including the current audio track (step 26).
  • the method illustrated in Fig. 3 is an effective way of determining at least the location of characteristic fragments of a user's collection of audio tracks, adapted to the user's idea of what constitutes a characteristic fragment. Using such characteristic fragments, an effective way of aurally browsing the collection of audio tracks is made possible.
  • the interface thus provided is suited to portable media players having a minimum of input devices and even fewer visual display capabilities.
  • the methods and systems outlined herein are suitable also for incorporation into a personal computer, cellular telephone handset, personal digital assistant or other type of data processing arrangement.
  • the facility for selecting a file by reproducing characteristic fragments may also operate in a mode without simultaneous determination of break-off points and characteristic fragment location in relation thereto.
  • the input signals used to implement the method of Fig. 3 can alternatively be generated by means of a jog dial, a touch pad or other type of capacitive sensor, a remote control device, etc.
  • the signals can also be generated by processing speech input provided by the user and captured by means of a microphone (not shown). Actions conditional upon receipt of a particular input signal can also be carried out in the absence of an input signal for a pre-determined period of time.

Abstract

A method of determining characteristic fragments of recordings, includes perceptibly reproducing at least part of a current recording, and receiving an input signal generated by a user, in response to which an operation relating to the current recording is carried out, wherein the time of receipt of the input signal is coincident with a point in time at which a break-off point in the current recording has been reached during reproduction of at least part of the current recording. A location in a pre-determined relation to the break-off point is determined as location of a characteristic fragment of the current recording.

Description

Method of determining a characteristic fragment of a recording and system for playing back a recording
The invention relates to a method of determining characteristic fragments of recordings.
The invention also relates to a system for playing back a recording, including a control system for controlling the system for playing back the recording so as to reproduce perceptibly at least one signal corresponding to at least part of the recording.
The invention also relates to a portable media player, comprising a storage apparatus for storing a plurality of sets of data corresponding to recordings.
The invention also relates to a computer program.
US2003/012339 describes a sound reproducing apparatus including a sound storing unit, a sound critical points retrieving unit, a sound property unit, a sound reproducing unit and a reproduction control unit. When a user presses a critical point skip button, the reproduction control unit refers to the property data of the sound piece being object of reproduction, recorded in the storing unit, and obtains a critical time point property. Then, the reproduction control unit indicates the critical point time to the sound reproducing unit as the time for start of reproduction, thereby starting reproduction from the critical point. When the user presses the critical point skip button again during reproduction, the reproduction control unit acquires the next critical point time according to the same procedure so as to control the sound reproducing unit, and starts reproduction at that time. A sound critical points retrieving apparatus comprises a sound features extracting unit, which extracts predetermined sound features from a sound signal; a cost function calculating unit, which calculates a cost function indicating the likelihood of sound critical points from the sound features which have been extracted by the sound features extracting unit; a peak detecting unit, which detects peak times and peak values in a case where a value of the cost function calculated by the cost function calculating unit indicates a maximum value; and a selecting unit which selects sound critical points having likelihood values of a predetermined value or more from the peak times and peak values detected by the peak detecting unit. A problem of the known system is that not all prominent signal features are indicative of the boundaries of a readily recognisable part of the song.
It is desirable to provide a readily implemented method, device, system, portable media player and computer which are better able to take account of the fact that which part of a recording is considered to be characteristic is very personal.
The method is provided for determining a characteristic fragment of a recording, the method comprising steps of - controlling a playback of the recording so as to cause to reproduce perceptibly at least one signal corresponding to at least part of the recording, - receiving a command inputted by a user during the playback, wherein the controller is configured to carry out an operation relating to the recording in response to the command, the controller being configured to determine a location, in a pre-determined relation to a point reached during the perceptible reproduction of at least part of the recording at a time of receipt of the command, as location of the characteristic fragment of the recording.
Recordings include audio, visual and audiovisual material with a time base, recorded in digital or analogue form, optionally in a compressed digital representation. At least upon further processing, the information in the recording is suitable for generating one or more input signals for generating a perceptible reproduction of the material over a time interval corresponding to the length of the recording.
The method is based on the insight that during reproduction of an as yet unknown recording, a user will provide an input to carry out an operation in relation to the recording, the moment the user has recognised the current recording. Such recognition is most likely to occur at around the time that the reproduction of the current recording has reached the characteristic fragment.
In an embodiment, wherein the step of perceptibly reproducing at least part of a recording is carried out as part of a step of perceptibly reproducing at least respective parts of a series of recordings in succession according to a pre-determined scheme, and wherein reproduction according to the pre-determined scheme is terminated in response to the input signal generated by the user. The effect is that the time of receipt of the input signal is more likely to correspond to recognition of a characteristic part of the recording. This is so, because the recordings are reproduced in succession according to a pre-determined scheme, rather than in response to selection commands from the user. Thus, a form of "browsing" is implemented, whereby the user is provided with a succession of reproductions of (parts of) recordings, without having immediate influence over which one will follow next. When the user perceives a targeted one of the recordings, he or she can provide an input signal. This will occur upon recognition of the targeted recording.
In a variant, an order of succession of the series of recordings is determined on the basis of statistics, maintained for the recordings in the series and indicative of respective frequencies of perceptible reproduction of the recordings.
An effect is that a user looking for a particular recording in relation to which to have an action performed, is able to find it relatively quickly. Unnecessary reproductions of other recordings are avoided as much as possible. In an embodiment, the operation relating to the recording of which at least a part is being reproduced includes perceptibly reproducing more than a pre-determined part of that recording if the input signal generated by the user is of a first type.
This provides an alternative to a menu-driven facility for selecting a recording for complete reproduction. Upon recognition of the recording, preferably only after perceiving part of the recording, the user can select it with an input signal that need not convey much more information than a binary selection. It is particularly suitable for implementation in small form- factor media players, where the ability to dispense with an elaborate graphical user interface for selecting a recording is useful.
In an embodiment, the operation relating to the recording of which at least a part is being reproduced includes terminating perceptible reproduction of any part of that recording and commencing perceptible reproduction of at least a part of a next recording in a series of recordings.
The effect is that a user can go through a series of recordings in search for a targeted recording at relatively high speed. As soon as he or she has recognised a recording as not being the targeted one, the next one can be tried.
In an embodiment, an interval ending at a point within a pre-determined distance to the point reached during perceptible reproduction of at least part of the recording at a time of receipt of the input signal and preceding that point during perceptible reproduction of the recording is determined as the characteristic fragment of the current recording.
This takes into account that the point in time at which the user is most likely to provide the input signal in recognition of a characteristic part of the recording is just after having been presented with the characteristic part.
An embodiment includes storing data for determining at least the location of the characteristic fragment in association with the current recording upon determining at least the location of the characteristic fragment.
Thus, it is possible to provide the characteristic fragment as an annotation to the complete recording, to be used for selecting the complete recording for perceptible reproduction.
In an embodiment, at least one step of perceptibly reproducing at least part of a recording includes selecting at least a point in the recording from which to start perceptible reproduction of a part thereof in dependence on data stored in association with the recording. Thus, it is possible to make use of annotation of the recording to provide a perceptible reproduction of only a part thereof, the part being deemed a priori most likely to contain the characteristic fragment. The effect is that the user will generally provide the input signal sooner. This makes the method faster.
A device is provided for determining a characteristic fragment of a recording, the device comprising a controller for controlling a playback of the recording so as to cause to reproduce perceptibly at least one signal corresponding to at least part of the recording, an interface for receiving a command inputted by a user during the playback, wherein the controller is configured to carry out an operation relating to the recording in response to the command, the controller being configured to determine a location, in a pre-determined relation to a point reached during the perceptible reproduction of at least part of the recording at a time of receipt of the command, as location of the characteristic fragment of the recording. Such a device may be implemented as a server remotely located from a portable media player of the user.
According to another aspect, a system for playing back a recording is provided. The system comprises a reproduction apparatus for perceptibly reproducing at least one signal from a recording, a control system for controlling the system for playing back recordings so as to reproduce perceptibly at least one signal corresponding to at least part of a current recording, an interface for receiving a command from a user and providing a signal in response to the command from the user to the control system, wherein the control system is configured to carry out an operation relating to the current recording in response to the input signal, characterised in that the control system is configured to determine a location in a pre-determined relation to a point reached during perceptible reproduction of at least part of the recording at a time of receipt of the input signal as location of a characteristic fragment of the current recording.
In an embodiment, the system is configured to carry out a method according to the invention.
According to another aspect of the invention, there is provided a portable media player, comprising a storage apparatus for storing a plurality of sets of data corresponding to recordings, and a system for playing back recordings according to the invention.
An effect is that the portable media player can operate with relatively low processing power and at relatively high speed to determine a characteristic fragment. The portable media player can be kept relatively simple.
According to another aspect of the invention, there is provided a computer program including a set of instructions capable, when incorporated in a machine-readable medium, of causing a system having information processing capabilities to perform a method according to the invention.
The invention will be explained in further detail with reference to the drawings, in which:
Fig. 1 is a very schematic view of some components of a portable media player; Fig. 2 is a schematic simplified view of the exterior of such a media player; and
Fig. 3 is a flow chart illustrating some steps in a method of determining characteristic fragments of recordings.
A portable media player 1 (Fig. 2) comprises a system 2 (Fig. 1) for perceptible reproduction of recordings, in the present example audio tracks. The example of a system for reproducing audio tracks will be used throughout the present description in the understanding that the principles underlying the method of determining characteristic fragments of audio tracks can also be applied to determined characteristic fragments of other types of recordings. These include video sequences (either synchronised with accompanying audio tracks or silent), still images (to determine a characteristic region), etc.
The system 2 comprises a microprocessor 3 for executing instructions enabling the microprocessor 3 to control the operation of the system 2. Such instructions can be stored in a first flash memory device 4, connected to the microprocessor 3 via a bus 5. A second flash memory device 6 is provided for storing audio files representing recordings of audio tracks. A Random Access Memory (RAM) 7 and Digital Signal Processor (DSP) 8 are connected to the second flash memory device 6 via a second bus 9. The DSP 8 operates in conjunction with a decoder chip 10 to decompresses and decode the audio information in the files so as perceptibly to reproduce the audio recording on an output of the system 2. To this end, the decoder chip 10 is connected to an output amplifier 11 delivering an analogue output signal to, for example speakers (not shown) or headphones (not shown).
A Universal Serial Bus (USB) controller 12 and USB connector 13 enable the loading of new audio files, for example from a computer. Of course, other variants of the system 2 are obtainable using other types of physical links to a personal computer or server.
The microprocessor 3 receives input signals from an input device 14, comprising a number of keys enabling a user to generate an input signal. In the illustrated embodiment, a display unit 15 is provided. However, the use of characteristic fragments allows a user to locate a desired audio track for reproduction without having to use an elaborate graphical user interface with menus. Thus, the display unit 15 could in this respect be dispensed with.
The method of locating a desired audio track can be implemented with only two types of input signal. In the present example, three types of input signal can be generated by the user. To this end, there are provided (Fig. 2) a first key 16 for generating a signal to the microprocessor 3 to skip back one track, a second key 17 for generating a signal representative of selection of a track, and a third key 18 for generating an input signal indicating to the microprocessor 3 to skip forward one track. The microprocessor 3 controls the system to carry out a method as illustrated in Fig. 3. In a first step 19, a current one of a series of audio tracks is selected. The audio tracks in the series are selected in a particular order of succession. In one embodiment, the order is determined by the order of storage in the second flash memory device 6. In another embodiment, the order is determined by referring to metadata, stored in association with the audio files and indicating the frequency of complete reproduction of the respective audio tracks. This indication may be in the form of an absolute count or a value relative to the total number of reproductions of the audio files in the collection.
In a second step 20, the audio file selected in the first step 19 is read. In particular, metadata associated with the audio track is read for use in a subsequent step 21. In this step 21, it is determined whether data identifying at least the location of a characteristic fragment within the audio track has been stored with the audio track. Each recording representing an audio track has a certain progression, so that it is possible to relate a location to the length of the audio track (e.g. half-way through). In any case, a location can be related to the starting point at normal reproduction speed (e.g. ten seconds into the track, etc.). In one variant, only a particular point within the audio track is identified in the data stored in association with it. This could be the starting point or end point of a characteristic fragment. In such an embodiment, the length of the characteristic fragment is a default value, stored, for example in the first flash memory device 4. In another variant, the interval corresponding to the characteristic fragment is identified in terms of its length and location, e.g. by including information representative of the starting point and end point of the characteristic fragment.
If no information identifying the location of the characteristic fragment is found, then a default value is chosen as the starting point for reproduction. In most embodiments, this starting point will correspond to the starting point of the audio track, although other default starting points are possible. Perceptible reproduction, involving decompression and decoding of the audio information in the audio file, commences (step 22) at the starting point that has been determined.
In one variant, the length of the fragment that is reproduced is pre-determined. Upon reaching the end of the fragment without receiving any input signal, the first step 19 is repeated to select the next audio track in the order of succession. In another variant, the current audio track is reproduced until the end. If no input signal is received, the next audio track is selected.
In each variant, if an input signal is received during reproduction of the current audio track, a break-off point is determined. The break-off point is determined as the point in the audio track reached during reproduction of the audio track in its recorded order of progression that coincides with receipt of the input signal generated by the user using one of the keys 16-18. Having determined the break-off point, a location of the characteristic fragment is determined. The location of the characteristic fragment is determined in relation to the break-off point.
In one embodiment, the characteristic fragment is determined to correspond to an interval directly preceding the break-off point. The length of the interval is a default value. In other embodiments, the characteristic fragment corresponds to an interval within which the break-off point lies. In yet other embodiments, the length of the interval is a value stored in the first flash memory device 4 and part of the user settings. That is to say that the user may change the default value.
If the input signal generated by the user is of a first type, generated by pressing the second key 17, then the current audio track is selected for complete reproduction (step 24). In one variant, the complete reproduction continues until the end of the audio track. In another variant, the complete reproduction recommences from the start of the audio track and continues until the end.
When the characteristic fragment's location has been determined, data for determining at least its location, and optionally its length, are stored (step 25) in association with the audio file including the current audio track. Thus, when the method illustrated in Fig. 3 is carried out a second time, the step 22 of commencing reproduction of a characteristic fragment can take account of the data identifying what the user considers to be the characteristic fragment of the audio track.
At least a location, and optionally a length, of a characteristic fragment is also determined - in the manner described above - when the input signal generated by the user is of a second type. The input signal of the second type is generated by pressing one of the first and third keys. The system 2 then immediately terminates reproduction of the current recording. It then autonomously carries out the first step 19 to select a next (or preceding) recording in accordance with a pre-determined scheme defining the order of succession of the series of audio tracks. The order of succession is the same as used when no input signal is received, which situation has been described above already.
Since an input signal of the second type is also a relatively good indication that the current audio track has been recognised, the data identifying at least the location of a characteristic fragment - determined in relation to the break-off point - are also stored in association with the audio file including the current audio track (step 26).
Thus, the method illustrated in Fig. 3 is an effective way of determining at least the location of characteristic fragments of a user's collection of audio tracks, adapted to the user's idea of what constitutes a characteristic fragment. Using such characteristic fragments, an effective way of aurally browsing the collection of audio tracks is made possible. The interface thus provided is suited to portable media players having a minimum of input devices and even fewer visual display capabilities.
It should be noted that the above-mentioned embodiments illustrate, rather than limit, the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps other than those listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
For example, the methods and systems outlined herein are suitable also for incorporation into a personal computer, cellular telephone handset, personal digital assistant or other type of data processing arrangement. The facility for selecting a file by reproducing characteristic fragments may also operate in a mode without simultaneous determination of break-off points and characteristic fragment location in relation thereto. The input signals used to implement the method of Fig. 3 can alternatively be generated by means of a jog dial, a touch pad or other type of capacitive sensor, a remote control device, etc. The signals can also be generated by processing speech input provided by the user and captured by means of a microphone (not shown). Actions conditional upon receipt of a particular input signal can also be carried out in the absence of an input signal for a pre-determined period of time.

Claims

CLAIMS:
1. Method of determining characteristic fragments of recordings, including perceptibly reproducing at least part of a recording, and receiving an input signal generated by a user, in response to which an operation relating to the recording being reproduced is carried out, and determining a location in a pre-determined or flexible relation to a point reached during perceptible reproduction of at least part of the recording at a time of receipt of the input signal as location of a characteristic fragment of the recording.
2. Method according to claim 1, wherein the step (22) of perceptibly reproducing at least part of a recording is carried out as part of a step of perceptibly reproducing at least respective parts of a series of recordings in succession according to a pre-determined scheme, and wherein reproduction according to the pre-determined scheme is terminated in response to the input signal generated by the user.
3. Method according to claim 2, wherein an order of succession of the series of recordings is determined on the basis of statistics, maintained for the recordings in the series and indicative of respective frequencies of perceptible reproduction of the recordings.
4. Method according to any one of claims 1-3, wherein the operation relating to the recording of which at least a part is being reproduced includes perceptibly reproducing more than a pre-determined part of that recording if the input signal generated by the user is of a first type.
5. Method according to any one of claims 1-4, wherein the operation relating to the recording of which at least a part is being reproduced includes terminating perceptible reproduction of any part of that recording and commencing perceptible reproduction of at least a part of a next recording in a series of recordings.
6. Method according to any one of claims 1-5, wherein an interval ending at a point within a pre-determined distance to the point reached during perceptible reproduction of at least part of the recording at a time of receipt of the input signal and preceding that point during perceptible reproduction of the recording is determined as the characteristic fragment of the current recording.
7. Method according to any one of claims 1-6, including storing data for determining at least the location of the characteristic fragment in association with the current recording upon determining at least the location of the characteristic fragment.
8. Method according to any one of claims 1-7, wherein at least one step of perceptibly reproducing at least part of a recording includes selecting at least a point in the recording from which to start perceptible reproduction of a part thereof in dependence on data stored in association with the recording.
9. A device for determining a characteristic fragment of a recording, the device comprising a controller (3,4) for controlling a playback of the recording so as to cause to reproduce perceptibly at least one signal corresponding to at least part of the recording, an interface (14;16-18) for receiving a command inputted by a user during the playback, wherein the controller (3,4) is configured to carry out an operation relating to the recording in response to the command, the controller (3,4) being configured to determine a location, in a pre- determined relation to a point reached during the perceptible reproduction of at least part of the recording at a time of receipt of the command, as location of the characteristic fragment of the recording.
10. System for playing back recordings, including a reproduction apparatus (7,8,10,11) for perceptibly reproducing at least one signal from a recording, a control system (3,4) for controlling the system for playing back recordings so as to reproduce perceptibly at least one signal corresponding to at least part of a current recording, an interface (14;16-18) for receiving a command from a user and providing a signal in response to the command from the user to the control system (3,4), wherein the control system (3,4) is configured to carry out an operation relating to the current recording in response to the input signal, the control system (3,4) being configured to determine a location in a predetermined relation to a point reached during perceptible reproduction of at least part of the recording at a time of receipt of the input signal as location of a characteristic fragment of the current recording.
11. System according to claim 10, configured to carry out a method according to any one of claims 1-8.
12. Portable media player, comprising a storage apparatus (5) for storing a plurality of sets of data corresponding to recordings, and a system (2) for playing back recordings according to claim 10 or 11.
13. Computer program including a set of instructions capable, when incorporated in a machine-readable medium, of causing a system having information processing capabilities to perform a method according to any one of claims 1-8.
PCT/IB2007/051489 2006-05-02 2007-04-24 Method of determining a characteristic fragment of a recording and system for playing back a recording WO2007125470A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP06113355.9 2006-05-02
EP06113355 2006-05-02

Publications (2)

Publication Number Publication Date
WO2007125470A2 true WO2007125470A2 (en) 2007-11-08
WO2007125470A3 WO2007125470A3 (en) 2008-11-13

Family

ID=38655894

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2007/051489 WO2007125470A2 (en) 2006-05-02 2007-04-24 Method of determining a characteristic fragment of a recording and system for playing back a recording

Country Status (1)

Country Link
WO (1) WO2007125470A2 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1126463A2 (en) * 2000-02-19 2001-08-22 Lg Electronics Inc. Method for recording and reproducing audio data to/from a rewritable recording medium
WO2001082302A1 (en) * 2000-04-27 2001-11-01 Samsung Electronics Co., Ltd. Method and apparatus for recording and reproducing audio highlight portion and recording medium for storing additional information for the same
US20030012339A1 (en) * 2001-07-12 2003-01-16 Medtronic Ave, Inc X-ray catheter with miniature emitter and focusing cup
US20050219963A1 (en) * 2001-07-11 2005-10-06 Sony Corporation Editing apparatus for optical reproducing device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1126463A2 (en) * 2000-02-19 2001-08-22 Lg Electronics Inc. Method for recording and reproducing audio data to/from a rewritable recording medium
WO2001082302A1 (en) * 2000-04-27 2001-11-01 Samsung Electronics Co., Ltd. Method and apparatus for recording and reproducing audio highlight portion and recording medium for storing additional information for the same
US20050219963A1 (en) * 2001-07-11 2005-10-06 Sony Corporation Editing apparatus for optical reproducing device
US20030012339A1 (en) * 2001-07-12 2003-01-16 Medtronic Ave, Inc X-ray catheter with miniature emitter and focusing cup

Also Published As

Publication number Publication date
WO2007125470A3 (en) 2008-11-13

Similar Documents

Publication Publication Date Title
KR101156827B1 (en) Creation of playlists using audio identification
US10110964B2 (en) Multimedia system and method of performing a playback by means of a multimedia system
US20070265720A1 (en) Content marking method, content playback apparatus, content playback method, and storage medium
CN101794208A (en) The audio user interface that is used for the electronic equipment of displayless
JP2009543497A (en) Apparatus and method for playing audio-video content
CN105210364A (en) Dynamic audio perspective change during video playback
US9384752B2 (en) Audio device and storage medium
US8925029B2 (en) Recording/transferring program, recording/transferring apparatus, and recording/transferring method
KR100609171B1 (en) Apparatus and method music play control using function keys of earphone
JP2007220212A (en) Reproduction controller, reproduction control program, and content reproduction device
US20050016364A1 (en) Information playback apparatus, information playback method, and computer readable medium therefor
US20090164901A1 (en) Electronic device and method for quickly playing through media files
WO2007125470A2 (en) Method of determining a characteristic fragment of a recording and system for playing back a recording
JP4934990B2 (en) Audio signal recording / playback device
US10282161B2 (en) Digitizing analog audio data
WO2020038290A1 (en) Bookmark display method and device
KR20140038791A (en) The method of playing multimedia data and the multimedia playing apparatus thereof
JP2005038541A (en) Device and method for recording/reproducing contents data
US20050117415A1 (en) Method and apparatus for replaying a recording medium
JP2006236446A (en) Content acquiring apparatus, method and program therefor, and recording medium in which the program is recorded
JPWO2006095847A1 (en) CONTENT RECORDING DEVICE, METHOD THEREOF, PROGRAM THEREOF, AND RECORDING MEDIUM CONTAINING THE PROGRAM
JP2007073139A (en) Recording/reproducing device and control method
JP3899716B2 (en) Playback device
JP2012120128A (en) Playback system and playback method
JP5812531B2 (en) Function execution device, function execution method, and program

Legal Events

Date Code Title Description
NENP Non-entry into the national phase in:

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07735614

Country of ref document: EP

Kind code of ref document: A2