US20050147174A1 - Encoding apparatus and method of same and decoding apparatus and method of same - Google Patents

Encoding apparatus and method of same and decoding apparatus and method of same Download PDF

Info

Publication number
US20050147174A1
US20050147174A1 US11/052,190 US5219005A US2005147174A1 US 20050147174 A1 US20050147174 A1 US 20050147174A1 US 5219005 A US5219005 A US 5219005A US 2005147174 A1 US2005147174 A1 US 2005147174A1
Authority
US
United States
Prior art keywords
processor
processing
decoding
processors
encoding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/052,190
Inventor
Eiji Iwata
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US11/052,190 priority Critical patent/US20050147174A1/en
Publication of US20050147174A1 publication Critical patent/US20050147174A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/436Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding

Definitions

  • the present invention relates to an encoding apparatus for transforming data such as video data and audio data, for example, the MPEG method (high quality moving picture encoding system by Moving Picture Coding Experts Group), to a bit stream composed of variable length data, and to a decoding apparatus of the same, more particularly relates to an encoding apparatus and a decoding apparatus for carrying out encoding and decoding at a high speed by parallel processing and methods of the same.
  • MPEG method high quality moving picture encoding system by Moving Picture Coding Experts Group
  • MPEG MPEG1 and MPEG2
  • MPEG1 and MPEG2 the standard encoding and decoding system of images currently in general used.
  • FIG. 1 is a view of the structure of image data in the MPEG method.
  • the image data of the MPEG method is comprised in a hierarchical structure.
  • the hierarchy is, in order from the top, a video sequence (hereinafter simply referred to as a “sequence”), groups of pictures (GOP), pictures, slices, macroblocks, and blocks.
  • sequence a video sequence
  • GOP groups of pictures
  • pictures slices, macroblocks, and blocks.
  • the image data is sequentially encoded based on this hierarchical structure so as to be transformed to a bit stream.
  • FIG. 2 The structure of a bit stream of MPEG encoded data is shown in FIG. 2 .
  • each picture has j number of slices, and each slice has i number of macroblocks.
  • each level of data other than the blocks in the hierarchy shown in FIG. 1 has a header in which an encoding mode etc. are stored. Accordingly, when describing the structure of a bit stream from the headers of the video sequence, it becomes a sequence header (SEQH) 151 , a GOP header (GOPH) 152 , a picture header (PH) 153 , a slice header (SH) 154 , a macroblock header (MH) 155 , compressed data (MB 0 ) 156 of a macroblock 0 , a macroblock header (MH) 157 , and compressed data (MB 1 ) 158 of a macroblock 1 .
  • SEQH sequence header
  • GOP header GOPH
  • PH picture header
  • SH slice header
  • MH macroblock header
  • MH macroblock header
  • the size of the compressed data of a macroblock contained in a bit stream is of a variable length and differs depending on the nature of the image etc.
  • this bit stream is sequentially decoded and the image is reconstructed based on the hierarchical structure of FIG. 14 .
  • FIG. 3 is a block diagram of the configuration of a general processing unit for carrying out MPEG encoding.
  • An encoding apparatus 160 shown in FIG. 3 has a motion vector detection unit (ME) 161 , a subtractor 162 , a Fourier discrete cosine transform (FDCT) unit 163 , a quantization unit 164 , a variable length coding unit (VLC) 165 , an inverse quantization unit (IQ) 166 , an inverse discrete cosine transform (IDCT) unit 167 , an adder 168 , a motion compensation unit (MC) 169 , and an encode control unit 170 .
  • ME motion vector detection unit
  • FDCT Fourier discrete cosine transform
  • VLC variable length coding unit
  • IQ inverse quantization unit
  • IDCT inverse discrete cosine transform
  • MC motion compensation unit
  • the motion compensation prediction is carried out in units of macroblocks at the motion vector detection unit 161 , a predicted error is detected at the subtractor 162 , DCT is carried out with respect to the predicted error at the discrete cosine transform unit 163 , and thereby a DCT coefficient is found.
  • the encoded picture is an I (Intra-coded) picture
  • the pixel value is input to the discrete cosine transform unit 163 as it is, DCT is carried out, and thereby the DCT coefficient is found.
  • the found DCT coefficient is quantized at the quantization unit 164 and subjected to variable length coding together with the motion vector or encoding mode information at the variable length coding unit 165 , whereby an encoded bit stream is generated. Further, the quantized data generated at the quantization unit 164 is inversely quantized at the inverse quantization unit 166 , subjected to IDCT at the inverse discrete cosine transform unit 167 to be restored to an original predicted error, and added to a reference image at the adder 168 , whereby a reference image is generated at the motion compensation unit 169 .
  • the encode control unit 170 controls the operation of these parts of the encoding apparatus 160 .
  • Such encoding is generally roughly classified into processing at three processing units, that is, the encoding from the motion vector detection at the motion vector detection unit 161 to the quantization at the quantization unit 164 , the variable length coding in the variable length coding unit 165 for generating the bit stream, and the local decoding from the inverse quantization in the inverse quantization unit 166 to the motion compensation in the motion compensation unit 169 .
  • FIG. 4 is a flow chart of the flow of the processing for generating a bit stream by carrying out MPEG encoding.
  • step S 180 When the encoding is started (step S 180 ), a sequence header is generated (step S 181 ), a GOP header is generated (step S 182 ), a picture header is generated (step S 183 ), and a slice header is generated (step S 184 ).
  • step S 185 macroblock encoding is carried out
  • step S 186 macroblock variable length coding is carried out
  • step S 187 macroblock local encoding
  • the processing routine shifts to the processing of the next slice (step S 188 ).
  • the processing routine shifts to the processing of the next picture (step S 189 ).
  • the processing routine shifts to the processing of the next GOP (step S 190 ). This series of processing is repeated until the sequence is ended (step S 181 ), whereupon the processing is ended (step S 192 ).
  • DSP digital signal processor
  • the processing of the flow chart shown in FIG. 4 is sequentially carried out for every macroblock.
  • the processing “MBx-ENC” indicates the encoding with respect to the data of an (x+1)th macroblock x
  • the processing “MBx-VLC” indicates variable length coding with respect to the data of the (x+1)th macroblock x
  • the processing “MBx-DEC” indicates the local encoding with respect to the data of the (x+1)th macroblock x.
  • FIG. 6 is a block diagram of the configuration of a general processing unit for carrying out the MPEG decoding.
  • a decoding apparatus 200 shown in FIG. 6 has a variable length decoding unit (VLD) 201 , an inverse quantization unit (IQ) 202 , an inverse discrete cosine transform unit (IDCT) 203 , an adder 204 , a motion compensation unit (MC) 205 , and a decode control unit 206 .
  • VLD variable length decoding unit
  • IQ inverse quantization unit
  • IDCT inverse discrete cosine transform unit
  • adder 204 a motion compensation unit
  • MC motion compensation unit
  • decode control unit 206 decode control unit
  • a bit stream of the input encoded data is decoded at the variable length decoding unit 201 to separate the encoding mode, motion vector, quantization information, and quantized DCT coefficient for every macroblock.
  • the decoded quantized DCT coefficient is subjected to inverse quantization at the inverse quantization unit 202 , restored to the DCT coefficient, subjected to IDCT by the inverse discrete cosine transform unit 203 , and transformed to pixel space data.
  • the motion compensation predicted block data is added at the adder 204 to restore and output the original data. Further, the motion compensation unit 205 carries out motion compensation prediction based on the decoded image to generate the data to be added at the adder 204 .
  • decode control unit 206 controls the operations of these units of the decoding apparatus 200 .
  • decoding may be generally roughly classified into processing at two processing units, that is, the variable length decoding at the variable length decoding unit 201 for decoding the bit stream and the decoding from the inverse quantization in the inverse quantization unit 202 to the motion compensation in the motion compensation unit 205 .
  • FIG. 7 is a flow chart showing the flow of the processing for generating the original image data by carrying out MPEG decoding.
  • step S 210 When the decoding is started (step S 210 ), the sequence header is decoded (step S 211 ), the GOP header is decoded (step S 212 ), the picture header is decoded (step S 213 ), and the slice header is decoded (step S 214 ).
  • step S 215 macroblook variable length decoding is carried out
  • step S 216 decoding of the macroblock is carried out
  • the processing routine shifts to the processing of the next slice (step S 217 ).
  • the processing routine shifts to the processing of the next picture (step S 218 ), and when all processing of one GOP is ended, the processing routine shifts to the processing of the next GOP (step S 219 ). This series of processings is repeated until the sequence is ended (step S 220 ), whereupon the processing is ended (step S 221 ).
  • FIG. 8 A timing chart of the sequential execution of such decoding by a processor, for example, a DSP, is shown in FIG. 8 .
  • processing of the flow chart shown in FIG. 7 is sequentially carried out for every slice and for every macroblock inside each slice.
  • the processing “SH-VLD” indicates the slice header decoding
  • the processing “MBx-VLD” indicates the variable length decoding with respect to the encoded data of the (x+1)th macroblock x
  • the processing “MBx-DEC” indicates the decoding with respect to the encoded data of the (x+1)th macroblock x.
  • variable length data since variable length data is to be processed, sequential processing must be carried out as the order of the data processing in the variable length coding and variable length decoding. For this reason, there is the disadvantage that the parallel processing is interrupted at the time of execution of the sequential processing parts or that the processing speed is limited since the sequential processing parts become an obstacle.
  • An object of the present invention is to provide an encoding apparatus and a decoding apparatus having a plurality of processors capable of carrying out the encoding and decoding of for example image data at a high speed and having simple configurations.
  • Another object of the present invention is to provide an encoding method and a decoding method which can be applied to parallel processors having any configurations and capable of carrying out the encoding and decoding of for example image data at a high speed.
  • an encoding apparatus for encoding a data which comprises a plurality of block data including a plurality of element data which are sequentially transferred in a form of a data stream, the encoding apparatus comprising a plurality of signal processing devices connected by a signal transfer means on which the data is transferred, each signal processing device comprising; an encoding means for encoding a block data including a plurality of element data on the signal transfer means, and a variable length coding means for carrying out a variable length coding of the encoded block data and outputting the variable length coded data via the signal transfer means in accordance with the data stream.
  • an encoding method for encoding a data stream having a plurality of element data comprising the steps of; dividing the data stream into a predetermined plurality of block data, successively allotting the divided plurality of block data to a plurality of signal processing devices, encoding the allotted block data based on a predetermined method in each of the plurality of signal processing devices, successively carrying out variable length coding on the encoded data in the same signal processing devices as those for the encoding so that the encoded data for every the block data encoded in the plurality of signal processing devices are successively subjected to the variable length coding according to the order in the data stream, and successively allotting new block data to the signal processing devices for which the variable length coding is ended.
  • a decoding apparatus for decoding encoded and variable length coded data which comprises a plurality of block data including a plurality of element data in a form of a data stream, the decoding apparatus comprising a plurality of signal processing devices, each of the signal processing devices comprising; a variable length decoding means for successively carrying out variable length decoding on variable length coded block data in accordance with the data stream, and a decoding means for decoding the variable length decoded block data.
  • a decoding method for decoding a variable length coded data stream obtained by encoding a data stream having a plurality of element data for every predetermined block data and further carrying out variable length coding comprising the steps of; successively allotting the variable length coded data for every the block data successively arranged in the variable length coded data stream to a plurality of signal processing devices, successively carrying out variable length decoding on the variable length coded data for every allotted block data so that the variable length decoding carried out in the plurality of signal processing devices is successively carried out according to the order of the block data in the data stream in each of the plurality of signal processing devices, decoding the encoded data for every the block image data subjected to the variable length decoding in the same signal processing device in each of the plurality of signal processing devices, and allotting variable length coded data of new block data to be decoded next to the signal processing devices for which the decoding is ended.
  • FIG. 1 is a view of the structure of image data in MPEG encoding
  • FIG. 2 is a view of the structure of an MPEG encoded image data bit stream
  • FIG. 3 is a block diagram of the configuration of a processing unit for carrying out the MPEG encoding
  • FIG. 4 is a flow chart of the flow of processing for generating a bit stream shown in FIG. 15 by carrying out MPEG encoding
  • FIG. 5 is a timing chart of the operation of the processing unit when MPEG encoding is carried out by sequential processing
  • FIG. 6 is a block diagram of the configuration of a processing unit for carrying out MPEG decoding
  • FIG. 7 is a flow chart of the flow of processing for generating a bit stream shown in FIG. 15 by carrying out MPEG decoding
  • FIG. 8 is a timing chart of the operation of a processing unit when MPEG decoding is carried out by sequential processing
  • FIG. 9 is a schematic block diagram of the configuration of a parallel processing unit of an image encoding/decoding apparatus according to the present invention.
  • FIG. 10 is a flow chart of the processing in the case where an image is encoded by the conventional parallel processing method of in a master processor (first processor) of the parallel processing unit shown in FIG. 9 ;
  • FIG. 11 is a flow chart of the processing in the case where an image is encoded by the conventional parallel processing method in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9 ;
  • FIG. 12 is a timing chart of the state of processing in processors in a case where an image is encoded by the conventional parallel processing method in the parallel processing unit shown in FIG. 9 ;
  • FIG. 13 is a flow chart of the processing in the case where an image is decoded by the conventional parallel processing method in the master processor (first processor) of the parallel processing unit shown in FIG. 9 ;
  • FIG. 14 is a flow chart of the processing in the case where an image is decoded by the conventional parallel processing method in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9 ;
  • FIG. 15 is a timing chart of the state of processing in processors in a case where an image is decoded by the conventional parallel processing method in the parallel processing unit shown in FIG. 9 ;
  • FIG. 16 is a flow chart of the processing in the case where an image is encoded by the parallel processing method according to the present invention in the master processor (first processor) of the parallel processing unit shown in FIG. 9 ;
  • FIG. 17 is a flow chart of the processing in the case where an image is encoded by the parallel processing method according to the present invention in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9 ;
  • FIG. 18 is a timing chart of the state of processing in processors in a case where an image is encoded out by the parallel processing method according to the present invention in the parallel processing unit shown in FIG. 9 ;
  • FIG. 19 is a flow chart of the processing in a case where an image is decoded by the parallel processing method according to the present invention in the master processor (first processor) of the parallel processing unit shown in FIG. 9 ;
  • FIG. 20 is a flow chart of the processing in a case where an image is decoded by the parallel processing method according to the present invention in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9 ; and
  • FIG. 21 is a flow chart of the state of processing in processors in a case where an image is decoded by the parallel processing method according to the present invention in the parallel processing unit shown in FIG. 9 .
  • any of the levels shown in FIG. 1 or a pixel can be considered, but in the following embodiment, the explanation will be made of a case where a macroblock is selected as the unit of parallel processing.
  • variable length coding and variable length decoding When using a macroblock as the unit of parallel processing, the encoding, local decoding, and decoding can be executed in parallel inside one slice, but it is necessary to sequentially execute the variable length coding and variable length decoding. This is because, in variable length coding and variable length decoding, the compressed data of the macroblock has a variable length and the header position of the compressed data of a macroblock on the bit stream is not determined until the variable length coding or the variable length decoding of the macroblock immediately before this is completed.
  • FIG. 9 is a schematic block diagram of the configuration of a parallel processing unit of an image encoding/decoding apparatus.
  • the parallel processing unit 9 of the image encoding/decoding apparatus has n number of processors 2 - 1 to 2 -n, a memory 3 , and a connection network 4 .
  • processors 2 - 1 to 2 -n are processors for independently carrying out predetermined processing.
  • the processor 2 -i carries out the predetermined processing according to the program stored in the program ROM or program RAM in advance.
  • n 3, that is, the parallel processing unit 9 has three processors 2 - 1 to 2 - 3 .
  • the processors 2 - 1 to 2 -n carry out the processing as will be explained below in association or in synchronization.
  • the memory 3 is a common memory of the n number of processors 2 - 1 to 2 -n.
  • the image data to be processed and the data of the processing result are stored in the memory 3 .
  • Data is appropriately read and written by n number of processors 2 - 1 to 2 -n.
  • connection network 4 is a connection portion for connecting the n number of processors 2 - 1 to 2 -n and the memory 3 to each other so that the n number of processors 2 - 1 to 2 -n operate in association or the n number of processors 2 - 1 to 2 -n appropriately refer to the memory 3 .
  • the variable length coding of the macroblocks is allotted to one processor (hereinafter, this processor will be referred to as the “master processor”) in a fixed manner and that processor made to sequentially execute the processing, and the encoding and the local decoding are allotted to other processors (hereinafter, these processors will be referred to as “slave processors”) and those processors made to execute the parallel processing.
  • the first processor 2 - 1 is made the master processor
  • the second and the third processors 2 - 2 and 2 - 3 are made the slave processors.
  • the first processor 2 - 1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 10 .
  • step S 10 when the encoding is started (step S 10 ), the sequence header is generated (step S 11 ), the GOP header is generated (step S 12 ), the picture header is generated (step S 13 ), and the slice header is generated (step S 14 ).
  • the master processor activates the slave processors (step S 15 ) and enters into a state waiting for the end of the encoding in the slave processors (step S 16 ).
  • step S 16 When the encoding of the macroblocks in the slave processors is ended (step S 16 ), the variable length coding of those macroblocks is started (step S 17 ). Note that this variable length coding must be sequentially executed due to the limitation as mentioned above. Accordingly, even if the encoding of the macroblock 1 is ended before the encoding of the macroblock 0 , the processor 0 first carries out the variable length coding of the macroblock 0 without fail.
  • the master processor repeats this procedure until all processing inside a slice is ended (step S 18 ). When all processing inside the slice is ended, it waits for the end of all processing in the slave processors (step S 19 ).
  • step S 20 when all processings of one picture are ended, the processing routine shifts to the processing of the next picture (step S 20 ), and when the processing of all pictures of 1 GOP are ended, the processing routine shifts to the processing of the next GOP (step S 21 ). Then, when these processings are repeated until the sequence is ended (step S 22 ), the processing is ended (step S 23 ).
  • the second and third processors 2 - 2 and 2 - 3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 11 .
  • step S 30 when started by the processing of step S 15 in the master processor and starting the encoding (step S 30 ), first each of the processors acquires the number of the macroblock to process (step S 31 ) and encodes that macroblock (step S 32 ).
  • the slave processors wait for the end of the variable length coding in the master processor (step S 33 ).
  • they carry out the local decoding (step S 34 ).
  • step S 35 This procedure is repeated until all processing inside a slice are ended.
  • step S 35 the processing of the slave processors is ended.
  • the programs by which the master processor and slave processors carry out the processing are stored in advance in the program ROMs or the program RAMs provided with respect to the processors 2 -i.
  • the processors 2 -i operate in accordance with these programs so as to carry out these processings.
  • FIG. 12 is a timing chart of the state of the encoding in the three processors 2 - 1 to 2 - 3 .
  • the processing “MBx-ENC” indicates the encoding with respect to the (x+1)th macroblock x (step S 32 in FIG. 11 )
  • the processing “MBx-DEC” indicates the local decoding with respect to the (x+1)th video segment x (step S 34 in FIG. 11 )
  • the processing “MBx-VLC” indicates the variable length coding with respect to the (x+1)th video segment x (step S 17 in FIG. 10 ).
  • first the second processor 2 - 2 and the third processor 2 - 3 carry out the encoding MB 0 -ENC and MB 1 -ENC of the macroblock 0 and the macroblock 1 .
  • the first processor 2 - 1 carries out the variable length coding MB 0 -VLC with respect to the encoded data.
  • the encoding MB 1 -ENC of the macroblock 1 in the third processor 2 - 3 is ended while the variable length coding MB 0 -VLC of the macroblock 0 is being carried out in the first processor 2 - 1 , therefore, the first processor 2 - 1 subsequently carries out the variable length coding MB 1 -VLC with respect to the encoded data of the macroblock 1 .
  • the second processor 2 - 2 when the variable length coding MB 0 -VLC with respect to the macroblock 0 is ended in the first processor 2 - 1 , the local decoding MB 0 -DEC with respect to that data is carried out. Then, when this local decoding MB 0 -DEC is ended, the encoding MB 2 -ENC with respect to the next macroblock 2 is carried out.
  • the third processor 2 - 3 similarly, when the variable length coding MB 1 -VLC with respect to the macroblock 1 is ended in the first processor 2 - 1 , the local decoding MB 0 -DEC with respect to that data is carried out. Then, when this local decoding MB 0 -DEC is ended, the encoding MB 3 -ENC with respect to the next macroblock 3 is carried out.
  • the decoding MBx-VLC of the encoded data is sequentially carried out.
  • the second processor 2 - 2 and the third processor 2 - 3 when the variable length coding MBx-VLC is ended in the first processor 2 - 1 , the local encoding MBx-DEC with respect to the macroblock thereof is carried out, and after the end of the processing, the encoding MBx-ENC with respect to the next macroblock x+1 is subsequently carried out.
  • variable length coding can be divided into the phase for generating the variable length data from the fixed length data by table conversion and the phase for combining the variable length data to generate the bit stream. These two phases may be sequentially executed, or only the latter phase may be sequently executed and the former phase be executed in parallel. Note that a buffer memory becomes necessary between the former phase and the latter phase in the latter method.
  • the variable length decoding of macroblocks is allotted to one processor (hereinafter this processor will be referred to as the “master processor”) in a fixed manner and that processor made to sequentially execute the processing.
  • the decoding is allotted to the other processors (hereinafter, these processors will be referred to as the “slave processors”) and the slave processors made to carry out the parallel processing.
  • the first processor 2 - 1 is made the master processor
  • the second and the third processors 2 - 2 and 2 - 3 are made the slave processors.
  • the first processor 2 - 1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 13 .
  • step S 40 when the decoding is started (step S 40 ), the sequence header is decoded (step S 41 ), the GOP header is decoded (step S 42 ), the picture header is decoded (step S 43 ), and the slice header is decoded (step S 44 ).
  • the master processor activates the slave processors (step S 45 ) and carries out the variable length decoding with respect to a macroblock (step S 46 ).
  • the master processor repeatedly carries out this variable length decoding (step S 4 i 6 ) until this processing is ended for all macroblocks inside the slice.
  • the master processor waits for the end of all processings in the slave processors (step S 48 ).
  • the processing routine shifts to the processing with respect to the next picture (step S 49 ).
  • step S 49 When the processing of all pictures of one GOP is ended (step S 49 ), the processing routine shifts to the processing of the next GOP (step S 50 ). When the processing of all GOPs is ended (step S 50 ), the processing routine shifts to the processing of the next sequence (step S 51 ). This series of processing is repeated until all sequences are ended (step S 51 ), whereby the processing is ended (step S 52 ).
  • the second and third processors 2 - 2 and 2 - 3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 14 .
  • each slave processor when started by the processing of step S 45 in the master processor and starting the decoding (step S 60 ), first each slave processor obtains the number of the macroblock to be processed (step S 61 ) and waits for the end of the variable length decoding of the related macroblock at step S 46 at the master processor (step S 62 ).
  • the slave processor decodes the macroblock using that data (step S 63 ).
  • step S 64 This procedure is repeated until the processing of all macroblocks inside the slice is ended (step S 64 ).
  • step S 64 the processing of the slave processors is ended (step S 65 ).
  • the programs by which the master processor and slave processors carry out the processing are stored in advance in the program ROMs or the program RAMs provided with respect to the processors 2 -i.
  • the processors 2 -i operate in accordance with these programs so as to carry out these processings.
  • the header of the next slice on the bit stream can be found without carrying out the variable length decoding. This becomes possible by finding the slice start code placed at the header of the slice by scanning. Accordingly, a processing method of carrying out only this scanning sequentially and carrying out the other processing containing the variable length decoding in parallel is possible.
  • FIG. 15 is a timing chart of the state of the decoding in the three processors 2 - 1 to 2 - 3 .
  • the processing “MBx-VLD” indicates the variable length decoding with respect to the (x+1)th macroblock x (step S 46 in FIG. 13 ), and the processing “MBx-DEC” indicates the decoding with respect to the (x+1)th video segment x (step S 63 in FIG. 14 ).
  • the first processor 2 - 1 sequentially carries out the variable length decoding from the macroblock 0 .
  • the second processor 2 - 2 carries out the decoding MB 0 -DEC with respect to this data.
  • the third processor 2 - 3 carries out the decoding MB 1 -DEC with respect to this data.
  • the processor which ended the decoding among the second processor 2 - 2 and the third processor 2 - 3 fetches the data of the next macroblock subjected to the variable length decoding at the first processor 2 - 1 and carries out the encoding.
  • the first image encoding/decoding apparatus divides the processing steps of the encoding and decoding into steps able to be processed in parallel and steps relating to variable length coding/decoding not able to be processed in parallel and having to be processed sequentially, allots the steps for which sequential processing is necessary to the master processor and steps which can be processed in parallel to the slave processors, and then carries out the encoding and the decoding.
  • the sequentially input data is sequentially processed at these three processors 2 - 1 to 2 - 3 and transformed to the intended compressed and encoded data or the restored image data.
  • the load of the variable length coding is relatively light, therefore the first processor 2 - 1 frequently is idle. This becomes even more conspicuous in a parallel processing apparatus having two processors.
  • the first processor 2 - 1 since the load of the variable length decoding is relatively light, the first processor 2 - 1 becomes idle at the point of time when one slice's worth of the variable length decoding is ended and until all decoding in the second processor 2 - 2 and the third processor 2 - 3 is ended.
  • an explanation will be made of an image encoding/decoding apparatus according to the present invention, as a second image encoding/decoding apparatus, which solves such disadvantages, in particular, which can encode and decode an image at a further high speed and further which can simplify the structure and control method etc.
  • the hardware structure of the second image encoding/decoding apparatus is the same as that of the first image encoding/decoding apparatus mentioned above.
  • the parallel processing unit 1 has the configuration as shown in FIG. 9 , i.e., has n number of processors 2 - 1 to 2 -n, a memory 3 , and a connection network 4 . Note that these components are the same as those of the case of the parallel processing unit 9 of the first image encoding/decoding apparatus in terms of hardware structure and therefore will be explained by using the same reference numerals.
  • connection network 4 the functions and configurations of the n number of processors 2 - 1 to 2 -n to the connection network 4 are the same as those of the case of the parallel processing unit 9 of the first image encoding/decoding apparatus, so explanations thereof will be omitted.
  • the number n of processors is 3.
  • the programs stored in the program ROMs or the program RAMs provided for the three processors 2 - 1 to 2 - 3 are different from those of the case of the first image encoding/decoding apparatus. Due to this, the parallel processing unit 1 of the second image encoding/decoding apparatus carries out processing different from that of the parallel processing unit 9 of the first image encoding/decoding apparatus as a whole.
  • the processors are made to divide and execute not only the parallel processing part, but also the sequential processing part.
  • the processors divide and sequentially carry out the variable length coding of the macroblocks. Accordingly, each processor carries out all of the encoding, variable length coding, and local decoding for the macroblock it is in charge of. At this time, when the variable length coding of a certain macroblock is started, the end of the variable length coding is awaited only when the variable length coding of the previous macroblock has not yet been ended.
  • the processors divide and sequentially carry out also the variable length decoding of the macroblocks. Accordingly, each processor carries out both of the variable length decoding and decoding for the macroblock it is in charge of. At this time, the end of the variable length decoding is awaited only when the variable length decoding of a certain macroblock has not yet been ended.
  • the parallel processing unit 1 of the second image encoding/decoding apparatus in the same way as the first image encoding/decoding apparatus mentioned above, one processor is decided on as the master process and the others as the slave processors and made to carry out different predetermined processing.
  • the master processor generates the headers and starts the slave processors:
  • the encoding, the variable length coding, and the local decoding regarding the actual encoding are carried out at both of the master processor and the slave processors by similar procedures. Namely, the master processor and the slave processors carry out the processing by different processing procedures, but the main processing part of the encoding is carried out by the same procedure.
  • the first processor 2 - 1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 16 .
  • step S 70 when the encoding is started (step S 70 ), the sequence header is generated (step S 71 ), the GOP header is generated (step S 72 ), the picture header is generated (step S 73 ), and the slice header is generated (step S 74 ).
  • the master processor starts the slave processors (step S 75 ).
  • the master processor carries out the processing relating to the encoding in the same way as that by the slave processors.
  • step S 76 acquires the number of a macroblock to be processed (step S 76 ) and encodes that macroblock (step S 77 ).
  • step S 78 it confirms that the variable length coding of the previous macroblock is ended (step S 78 ), carries out the variable length coding (step S 79 ), and, further, carries out the local decoding (step S 80 ).
  • step S 81 When all processing inside a slice is ended, the end of all processing in the slave processors is awaited (step S 82 ).
  • step S 83 When the processing of all pictures of one GOP is ended, the processing routine shifts to the processing of the next GOP (step S 84 ).
  • step S 85 This processing is repeated until the sequence is ended (step S 85 ), whereupon the processing is ended (step S 86 ).
  • the second and third processors 2 - 2 and 2 - 3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 17 .
  • step S 90 when started by the processing of step S 75 in the master processor and starting the encoding (step S 90 ), first each slave processor obtain the number of the macroblock to be processed (step S 91 ) and encodes that macroblock (step S 92 ).
  • step S 93 it confirms that the variable length coding of the previous macroblock is ended (step S 93 ), carries out the variable length coding (step S 94 ), and further carries out the local decoding (step S 95 ).
  • step S 96 the processing in the slave processor is ended (step S 97 ).
  • FIG. 18 is a timing chart of the state of the encoding in the three processors 2 - 1 to 2 - 3 .
  • the three processors 2 - 1 to 2 - 3 start the encodings MB 0 -ENC, MB 1 -ENC, and MB 2 -ENC of the macroblock 0 , macroblock 1 , and macroblock 2 .
  • the first processor 2 - 1 successively carries out the variable length coding MB 0 -VLC of the macroblock 0 and, further, the local decoding MB 0 -DEC of the macroblock 0 . Further, when the local decoding MB 0 -DEC of the macroblock 0 is ended, it starts the processing with respect to the next macroblock, that is, the macroblock 3 , from the encoding MB 3 -ENC.
  • the variable length coding MB 0 -VLC of the previous macroblock 0 is still being carried out at the first processor 2 - 1 , therefore the second processor 2 - 2 waits for the end of this variable length coding.
  • the variable length coding MB 1 -VLC starts the variable length coding MB 1 -VLC of the macroblock 1 .
  • the variable length coding MB 1 -VLC carries out the local decoding MB 1 -DEC of the macroblock 1 .
  • the local decoding MB 1 -DEC of the macroblock 1 is ended, it starts the encoding MB 4 -ENC with respect to the next macroblock 4 .
  • the variable length coding MB 0 -VLC and MB 1 -VLC of the previous macroblock 0 and macroblock 1 have not yet been ended, therefore, the end of the processing is awaited.
  • the variable length coding of the macroblock 0 and the macroblock 1 is ended, the variable length coding MB 2 -VLC of the macroblock 2 is carried out.
  • the variable length coding MB 2 -VLC is ended, the local decoding of the macroblock 2 is carried out. Further, when the local decoding MB 2 -DEC of the macroblock 2 is ended, the encoding MB 5 -ENC with respect to the next macroblock 5 is started.
  • the processors 2 - 1 to 2 - 3 successively select macroblocks x to be processed and carry out the encoding MBx-ENC, variable length coding MBx-VLC, and the local decoding MBx-DEC with respect to the macroblocks x.
  • variable length coding MBx-VLC In the variable length coding MBx-VLC thereof as well, the encoding is simultaneously started at the processors 2 - 1 to 2 - 3 just at the start of the processing as shown in FIG. 18 . Therefore, requests for the start of the variable length coding are superimposed, and idling occurs in the processors 2 - 2 and 2 - 3 . After this, however, the processing steps in the processors will always be offset from each other and therefore such idling will hardly ever occur. Also in the example shown in FIG. 18 , no idling will occur at all in other parts—it will only be necessary to wait a little in the variable length coding MB 5 -VLC of the macroblock 5 in the third processor 2 - 3 .
  • one processor is decided on as the master processor and the others as the slave processors and made to carry out processing different from each other.
  • the master processor differs from the processing of the slave processors only in the point that it decodes the headers and starts the slave processors: the variable length coding and decoding regarding the actual decoding are carried out by both of the master processor and slave processors by similar procedures. Namely, the master processor and the slave processors carry out processing by different processing procedures, but the main processing part of the decoding is achieved by the same procedure.
  • the first processor 2 - 1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 19 .
  • step S 100 when the decoding is started (step S 100 ), the sequence header is decoded (step S 101 ), the GOP header is decoded (step S 102 ), the picture header is decoded (step S 103 ), and the slice header is decoded (step S 104 ).
  • the master processor starts the slave processors (step S 105 ).
  • the master processor carries out processing relating to the decoding in the same way as that for the slave processors.
  • step S 106 acquires the number of the macroblock to be processed (step S 106 ), confirms that the variable length decoding of the previous macroblock is ended (step S 107 ), and carries out the variable length decoding of that macroblock (step S 108 ).
  • variable length decoding When the variable length decoding is ended, it decodes that macroblock (step S 109 ).
  • step S 110 When all processing inside the slice is ended, it waits for the end of all processing in the slave processors (step S 111 ).
  • the processing routine shifts to the processing of the next picture (step S 112 ).
  • the processing routine shifts to the processing of the next GOP (step S 113 ).
  • step S 114 This processing is repeated until the sequence is ended (step S 114 ), whereupon the processing is ended (step S 115 ).
  • the second and third processors 22 and 2 - 3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 20 .
  • step S 120 when started by the processing of step S 105 in the master processor and starting the decoding (step S 120 ), first each slave processor acquires the number of the macroblock to be processed (step S 121 ), confirms that the variable length decoding of the previous macroblock is ended (step S 122 ), and then carries out the variable length decoding of that macroblock (step S 123 ).
  • variable length decoding when the variable length decoding is ended, it decodes that macroblock (step S 124 ).
  • step S 125 the processing in the slave processors is ended.
  • FIG. 21 is a timing chart of the state of the decoding in the three processors 2 - 1 to 2 - 3 .
  • the first processor 2 - 1 carries out the variable length decoding MB 0 -VLD of the first macroblock 0 .
  • the second processor 2 - 2 carries out the processing with respect to the macroblock 1 , but since it is necessary to successively carry out the processing for every macroblock in variable length decoding, it carries out the variable length decoding MB 1 -VLD of the macroblock 1 after waiting for the end of the variable length decoding MB 0 -VLD of the macroblock 0 at the first processor 2 - 1 .
  • the third processor 2 - 3 similarly carries out the variable length decoding MB 2 -VLD of the macroblock 2 after waiting for the end of the variable length decoding MB 0 -VLD for the macroblock 0 at the first processor 2 - 1 and the variable length decoding MB 1 -VLD for the macroblock 1 at the second processor 2 - 2 .
  • the processors 2 - 1 to 2 - 3 successively select the macroblocks x to be processed and carry out the variable length decoding MBx-VLD and decoding MBx-DEC with respect to the macroblocks x.
  • variable length decoding MBx-VLD In the variable length decoding MBx-VLD thereof as well, the decoding is simultaneously started at the processors 2 - 1 to 2 - 3 at the start of the processing as shown in FIG. 21 , therefore the second processor 2 - 2 and the third processor 2 - 3 are made to wait and the idling occurs in the processing, but, thereafter, the processing steps in the processors will always be offset from each other and such idling will hardly ever occur. Also, in the example shown in FIG. 13 , no idling at all occurs in other processing—though the variable length decoding MB 3 -VLD of the macroblock 3 at the first processor 2 - 1 is made to slightly wait.
  • the second image encoding/decoding apparatus when carrying out MPEG encoding and decoding, the processors can carry out in a dispersed manner not only the encoding part, the local decoding part, and the decoding part which can be processed in parallel, but also the variable length coding part and variable length decoding part which must be sequentially processed.
  • the load of the sequential processing part can be uniformly and equally dispersed among the processors, and, as shown in FIG. 18 and FIG. 21 , the idling time of the processors can be greatly reduced when compared with the first image encoding/decoding apparatus. As a result, the entire encoding and decoding speed can be greatly improved. Note that the effect becomes even more pronounced in a parallel processing apparatus having just two processors.
  • each of a plurality of processors 2 - 1 to 2 -n carries out a series of encoding and a series of decoding for the macroblock to be processed allotted to it on a continuous basis. For this reason, it is possible to synchronize the processors and reduce the load of the data communication etc. Further, as a result, all of the processing time can be used for the encoding and decodings. As a result, the loads at the processors substantially become uniform and equal, and the encoding and the decoding can be carried out efficiently and at a high speed.
  • the present invention provides a scalable parallel processing apparatus not depending upon the number of processors, so can be applied to parallel processing apparatus of various configurations.
  • the macroblock number acquired by a slave processor may be dynamically determined by the operating system, may be statically uniquely determined by a compiler or hardware, or may be determined by any other method.
  • a shared memory type parallel processing apparatus was shown as an example, but the hardware configuration is not limited to this.
  • a so-called “message communication” type parallel processing apparatus not having a common memory and carrying out the transfer etc. of the data “message communication” can be adopted as well.
  • the invention is not restricted to a parallel processing apparatus in which processors are closely connected such as in the present embodiment and can also be applied to a apparatus comprised of respectively independent processors connected by any communication means to cooperate and carry out some intended processing.
  • the actual configuration of the apparatus may be arbitrarily determined.
  • the parallel processing unit of the image encoding/decoding apparatus was configured having a plurality of processors carrying out predetermined operations according to certain programs operating in parallel to carry out the intended processing, but can also be configured having a plurality of processors comprised of dedicated hardware operating in parallel.
  • the present invention can also be applied to a circuit designed exclusively for variable length coding/decoding such as the encoding/decoding circuit of the MPEG, an image coding DSP, or a media processor.
  • DCT was used as the transform system to be carried out at the encoding and decoding.
  • any orthogonal transform system can be used as the transform system.
  • Any transform for example a Fourier transform such as a high speed Fourier transform (FET) and discrete Fourier transform (DFT), a Hadamard transform, and a K-L transform can be used.
  • FET high speed Fourier transform
  • DFT discrete Fourier transform
  • K-L transform K-L transform
  • the present invention is not just applicable to the encoding and decoding of a moving picture as exemplified in the present embodiment.
  • it can also be applied to the encoding and decoding of audio data and text data and the encoding and the decoding of any other data.
  • the loads when carrying out the encoding and the decoding of, for example, image data, the loads can be equally and efficiently distributed among a plurality of processors and the communication for synchronization among the processors and data communication can be reduced.
  • the encoding and decoding can be carried out at a high speed, and the control method and the hardware configuration can be simplified.
  • the loads when carrying out the encoding and the decoding of for example image data by the parallel processing using a plurality of processors, the loads can be equally and efficiently distributed among the processors. Further, the communication for the synchronization among the processors and the data communication can be reduced. As a result, the encoding and decoding can be carried out at a high speed by easy control.
  • the encoding method and the decoding method of the present invention are scalable methods in which the method of distribution of loads does not depend upon the structure of the parallel processor, for example, the number of the processors, so can be applied to parallel processors of a variety of configurations.

Abstract

Encoding and decoding systems for MPEG encoding and decoding at a high speed using a parallel processing system, wherein macroblocks to be processed are designated for first to third processors which are made to carry out all processings of encoding, variable length coding, and local decoding of those macroblocks; the variable length coding is carried out after confirming that the variable length coding with respect to the previous macroblock is ended; the variable length coding which was normally sequentially carried out at a specific processor is carried out at all of the processors; and the encoding and local decoding are carried out at all of the processors; whereby the loads are dispersed, the efficiency is improved as a whole, and the processing speed becomes fast.

Description

    BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The present invention relates to an encoding apparatus for transforming data such as video data and audio data, for example, the MPEG method (high quality moving picture encoding system by Moving Picture Coding Experts Group), to a bit stream composed of variable length data, and to a decoding apparatus of the same, more particularly relates to an encoding apparatus and a decoding apparatus for carrying out encoding and decoding at a high speed by parallel processing and methods of the same.
  • 2. Description of the Related Art
  • First, an explanation will be made of the MPEG method (MPEG1 and MPEG2)—the standard encoding and decoding system of images currently in general used.
  • FIG. 1 is a view of the structure of image data in the MPEG method.
  • As shown in FIG. 1, the image data of the MPEG method is comprised in a hierarchical structure.
  • The hierarchy is, in order from the top, a video sequence (hereinafter simply referred to as a “sequence”), groups of pictures (GOP), pictures, slices, macroblocks, and blocks.
  • In MPEG encoding, the image data is sequentially encoded based on this hierarchical structure so as to be transformed to a bit stream.
  • The structure of a bit stream of MPEG encoded data is shown in FIG. 2.
  • In the bit stream of FIG. 2, each picture has j number of slices, and each slice has i number of macroblocks.
  • Further, each level of data other than the blocks in the hierarchy shown in FIG. 1 has a header in which an encoding mode etc. are stored. Accordingly, when describing the structure of a bit stream from the headers of the video sequence, it becomes a sequence header (SEQH) 151, a GOP header (GOPH) 152, a picture header (PH) 153, a slice header (SH) 154, a macroblock header (MH) 155, compressed data (MB0) 156 of a macroblock 0, a macroblock header (MH) 157, and compressed data (MB1) 158 of a macroblock 1.
  • Note that the size of the compressed data of a macroblock contained in a bit stream is of a variable length and differs depending on the nature of the image etc.
  • In MPEG decoding, this bit stream is sequentially decoded and the image is reconstructed based on the hierarchical structure of FIG. 14.
  • Next, the structure of a processing unit for carrying out the encoding and the decoding by the MPEG method, the processing algorithms, and the flow of the processing will be concretely explained.
  • First, an explanation will be made of the encoding.
  • FIG. 3 is a block diagram of the configuration of a general processing unit for carrying out MPEG encoding.
  • An encoding apparatus 160 shown in FIG. 3 has a motion vector detection unit (ME) 161, a subtractor 162, a Fourier discrete cosine transform (FDCT) unit 163, a quantization unit 164, a variable length coding unit (VLC) 165, an inverse quantization unit (IQ) 166, an inverse discrete cosine transform (IDCT) unit 167, an adder 168, a motion compensation unit (MC) 169, and an encode control unit 170.
  • In an encoding apparatus 160 having such a configuration, when the encoding mode of the input image data is a P (predictive coded) picture or B (bidirectionally predictive coded) picture, the motion compensation prediction is carried out in units of macroblocks at the motion vector detection unit 161, a predicted error is detected at the subtractor 162, DCT is carried out with respect to the predicted error at the discrete cosine transform unit 163, and thereby a DCT coefficient is found. Further, when the encoded picture is an I (Intra-coded) picture, the pixel value is input to the discrete cosine transform unit 163 as it is, DCT is carried out, and thereby the DCT coefficient is found.
  • The found DCT coefficient is quantized at the quantization unit 164 and subjected to variable length coding together with the motion vector or encoding mode information at the variable length coding unit 165, whereby an encoded bit stream is generated. Further, the quantized data generated at the quantization unit 164 is inversely quantized at the inverse quantization unit 166, subjected to IDCT at the inverse discrete cosine transform unit 167 to be restored to an original predicted error, and added to a reference image at the adder 168, whereby a reference image is generated at the motion compensation unit 169.
  • Note that, the encode control unit 170 controls the operation of these parts of the encoding apparatus 160.
  • Such encoding is generally roughly classified into processing at three processing units, that is, the encoding from the motion vector detection at the motion vector detection unit 161 to the quantization at the quantization unit 164, the variable length coding in the variable length coding unit 165 for generating the bit stream, and the local decoding from the inverse quantization in the inverse quantization unit 166 to the motion compensation in the motion compensation unit 169.
  • Next, an explanation will be made of the flow of the processing for carrying out such encoding and generating an encoded bit stream having the structure shown in FIG. 2 by referring to FIG. 4.
  • FIG. 4 is a flow chart of the flow of the processing for generating a bit stream by carrying out MPEG encoding.
  • When the encoding is started (step S180), a sequence header is generated (step S181), a GOP header is generated (step S182), a picture header is generated (step S183), and a slice header is generated (step S184).
  • When the generation of headers of the different levels is ended, macroblock encoding is carried out (step S185), macroblock variable length coding is carried out (step S186), and macroblock local encoding is carried out (step S187).
  • When the encoding is ended for all macroblocks inside a slice, the processing routine shifts to the processing of the next slice (step S188). Below, similarly, when all processing of a picture is ended, the processing routine shifts to the processing of the next picture (step S189). When all processing of one GOP is ended, the processing routine shifts to the processing of the next GOP (step S190). This series of processing is repeated until the sequence is ended (step S181), whereupon the processing is ended (step S192).
  • A timing chart showing the sequential execution of such encoding by a processor, for example, a digital signal processor (DSP), is shown in FIG. 5.
  • As shown in FIG. 5, in the processor, the processing of the flow chart shown in FIG. 4 is sequentially carried out for every macroblock.
  • Note that, in FIG. 5, the processing “MBx-ENC” indicates the encoding with respect to the data of an (x+1)th macroblock x, the processing “MBx-VLC” indicates variable length coding with respect to the data of the (x+1)th macroblock x, and the processing “MBx-DEC” indicates the local encoding with respect to the data of the (x+1)th macroblock x.
  • Next, an explanation will be made of the decoding.
  • FIG. 6 is a block diagram of the configuration of a general processing unit for carrying out the MPEG decoding.
  • A decoding apparatus 200 shown in FIG. 6 has a variable length decoding unit (VLD) 201, an inverse quantization unit (IQ) 202, an inverse discrete cosine transform unit (IDCT) 203, an adder 204, a motion compensation unit (MC) 205, and a decode control unit 206.
  • In a decoding apparatus 200 having such a configuration, a bit stream of the input encoded data is decoded at the variable length decoding unit 201 to separate the encoding mode, motion vector, quantization information, and quantized DCT coefficient for every macroblock. The decoded quantized DCT coefficient is subjected to inverse quantization at the inverse quantization unit 202, restored to the DCT coefficient, subjected to IDCT by the inverse discrete cosine transform unit 203, and transformed to pixel space data.
  • When the block is in the motion compensation prediction mode, the motion compensation predicted block data is added at the adder 204 to restore and output the original data. Further, the motion compensation unit 205 carries out motion compensation prediction based on the decoded image to generate the data to be added at the adder 204.
  • Note that the decode control unit 206 controls the operations of these units of the decoding apparatus 200.
  • Note that such decoding may be generally roughly classified into processing at two processing units, that is, the variable length decoding at the variable length decoding unit 201 for decoding the bit stream and the decoding from the inverse quantization in the inverse quantization unit 202 to the motion compensation in the motion compensation unit 205.
  • Next, an explanation will be made of the flow of the processing for carrying out such decoding to decode an encoded bit stream having the structure shown in FIG. 2 by referring to FIG. 7.
  • FIG. 7 is a flow chart showing the flow of the processing for generating the original image data by carrying out MPEG decoding.
  • When the decoding is started (step S210), the sequence header is decoded (step S211), the GOP header is decoded (step S212), the picture header is decoded (step S213), and the slice header is decoded (step S214).
  • When the decoding of the headers of the different levels is ended, macroblook variable length decoding is carried out (step S215), and decoding of the macroblock is carried out (step S216).
  • When the decoding is ended for all macroblocks inside the slice, the processing routine shifts to the processing of the next slice (step S217). Below, similarly, when all processing of one picture is ended, the processing routine shifts to the processing of the next picture (step S218), and when all processing of one GOP is ended, the processing routine shifts to the processing of the next GOP (step S219). This series of processings is repeated until the sequence is ended (step S220), whereupon the processing is ended (step S221).
  • A timing chart of the sequential execution of such decoding by a processor, for example, a DSP, is shown in FIG. 8.
  • As shown in FIG. 8, in the processor, processing of the flow chart shown in FIG. 7 is sequentially carried out for every slice and for every macroblock inside each slice.
  • Note that, in FIG. 8, the processing “SH-VLD” indicates the slice header decoding, the processing “MBx-VLD” indicates the variable length decoding with respect to the encoded data of the (x+1)th macroblock x, and the processing “MBx-DEC” indicates the decoding with respect to the encoded data of the (x+1)th macroblock x.
  • Summarizing the disadvantage to be solved by the invention, there is a demand that such encoding and decoding of image and other data be efficiently carried out at a high speed by a parallel processor having a plurality of processors. However, the parallel processors and parallel processing methods heretofore have suffered from various disadvantages, so have not been able to carry out high speed processing with a sufficiently high efficiency.
  • Specifically, first, when it is desired to carry out the encoding and decoding efficiently by parallel processing, there is a disadvantage that it is difficult to determine how to allocate which steps to the plurality of processors.
  • Further, in such encoding and decoding, since variable length data is to be processed, sequential processing must be carried out as the order of the data processing in the variable length coding and variable length decoding. For this reason, there is the disadvantage that the parallel processing is interrupted at the time of execution of the sequential processing parts or that the processing speed is limited since the sequential processing parts become an obstacle.
  • Further, if the times for execution of the processing in the processors are equal, the loads become uniform and equal and efficient processing can be carried out, but since the processing times of the different steps are different, there is a disadvantage that the loads of the processors become nonuniform and unequal and therefore high efficiency processing cannot be carried out.
  • Further, in such a parallel processing method, since in the case of for example the above image data, the processing with respect to one set of data like one video segment is carried out divided among a plurality of processors, it is necessary to carry out synchronization along with the transfer of the data or control the communication, so there is the disadvantage that the configuration of the hardware, the control method, etc. become complex.
  • Further, since the processing to be carried out at the different processors differ, processing programs must be prepared for the individual processors and the processing must be separately controlled for the individual processors, so there is the disadvantage that the configuration of the hardware, control method, etc. become even more complex.
  • SUMMARY OF THE INVENTION
  • An object of the present invention is to provide an encoding apparatus and a decoding apparatus having a plurality of processors capable of carrying out the encoding and decoding of for example image data at a high speed and having simple configurations.
  • Further, another object of the present invention is to provide an encoding method and a decoding method which can be applied to parallel processors having any configurations and capable of carrying out the encoding and decoding of for example image data at a high speed.
  • According to a first aspect of the present invention, there is provided an encoding apparatus for encoding a data which comprises a plurality of block data including a plurality of element data which are sequentially transferred in a form of a data stream, the encoding apparatus comprising a plurality of signal processing devices connected by a signal transfer means on which the data is transferred, each signal processing device comprising; an encoding means for encoding a block data including a plurality of element data on the signal transfer means, and a variable length coding means for carrying out a variable length coding of the encoded block data and outputting the variable length coded data via the signal transfer means in accordance with the data stream.
  • According to a second aspect of the present invention, there is provided an encoding method for encoding a data stream having a plurality of element data, comprising the steps of; dividing the data stream into a predetermined plurality of block data, successively allotting the divided plurality of block data to a plurality of signal processing devices, encoding the allotted block data based on a predetermined method in each of the plurality of signal processing devices, successively carrying out variable length coding on the encoded data in the same signal processing devices as those for the encoding so that the encoded data for every the block data encoded in the plurality of signal processing devices are successively subjected to the variable length coding according to the order in the data stream, and successively allotting new block data to the signal processing devices for which the variable length coding is ended.
  • According to a third aspect of the present invention, there is provided a decoding apparatus for decoding encoded and variable length coded data which comprises a plurality of block data including a plurality of element data in a form of a data stream, the decoding apparatus comprising a plurality of signal processing devices, each of the signal processing devices comprising; a variable length decoding means for successively carrying out variable length decoding on variable length coded block data in accordance with the data stream, and a decoding means for decoding the variable length decoded block data.
  • According to a fourth aspect of the present invention, there is provided a decoding method for decoding a variable length coded data stream obtained by encoding a data stream having a plurality of element data for every predetermined block data and further carrying out variable length coding, comprising the steps of; successively allotting the variable length coded data for every the block data successively arranged in the variable length coded data stream to a plurality of signal processing devices, successively carrying out variable length decoding on the variable length coded data for every allotted block data so that the variable length decoding carried out in the plurality of signal processing devices is successively carried out according to the order of the block data in the data stream in each of the plurality of signal processing devices, decoding the encoded data for every the block image data subjected to the variable length decoding in the same signal processing device in each of the plurality of signal processing devices, and allotting variable length coded data of new block data to be decoded next to the signal processing devices for which the decoding is ended.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • These and other objects and features of the present invention will become clearer from the following description of a preferred embodiment given with reference to the accompanying drawings, in which:
  • FIG. 1 is a view of the structure of image data in MPEG encoding;
  • FIG. 2 is a view of the structure of an MPEG encoded image data bit stream;
  • FIG. 3 is a block diagram of the configuration of a processing unit for carrying out the MPEG encoding;
  • FIG. 4 is a flow chart of the flow of processing for generating a bit stream shown in FIG. 15 by carrying out MPEG encoding;
  • FIG. 5 is a timing chart of the operation of the processing unit when MPEG encoding is carried out by sequential processing;
  • FIG. 6 is a block diagram of the configuration of a processing unit for carrying out MPEG decoding;
  • FIG. 7 is a flow chart of the flow of processing for generating a bit stream shown in FIG. 15 by carrying out MPEG decoding;
  • FIG. 8 is a timing chart of the operation of a processing unit when MPEG decoding is carried out by sequential processing;
  • FIG. 9 is a schematic block diagram of the configuration of a parallel processing unit of an image encoding/decoding apparatus according to the present invention;
  • FIG. 10 is a flow chart of the processing in the case where an image is encoded by the conventional parallel processing method of in a master processor (first processor) of the parallel processing unit shown in FIG. 9;
  • FIG. 11 is a flow chart of the processing in the case where an image is encoded by the conventional parallel processing method in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9;
  • FIG. 12 is a timing chart of the state of processing in processors in a case where an image is encoded by the conventional parallel processing method in the parallel processing unit shown in FIG. 9;
  • FIG. 13 is a flow chart of the processing in the case where an image is decoded by the conventional parallel processing method in the master processor (first processor) of the parallel processing unit shown in FIG. 9;
  • FIG. 14 is a flow chart of the processing in the case where an image is decoded by the conventional parallel processing method in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9;
  • FIG. 15 is a timing chart of the state of processing in processors in a case where an image is decoded by the conventional parallel processing method in the parallel processing unit shown in FIG. 9;
  • FIG. 16 is a flow chart of the processing in the case where an image is encoded by the parallel processing method according to the present invention in the master processor (first processor) of the parallel processing unit shown in FIG. 9;
  • FIG. 17 is a flow chart of the processing in the case where an image is encoded by the parallel processing method according to the present invention in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9;
  • FIG. 18 is a timing chart of the state of processing in processors in a case where an image is encoded out by the parallel processing method according to the present invention in the parallel processing unit shown in FIG. 9;
  • FIG. 19 is a flow chart of the processing in a case where an image is decoded by the parallel processing method according to the present invention in the master processor (first processor) of the parallel processing unit shown in FIG. 9;
  • FIG. 20 is a flow chart of the processing in a case where an image is decoded by the parallel processing method according to the present invention in slave processors (second to n-th processors) of the parallel processing unit shown in FIG. 9; and
  • FIG. 21 is a flow chart of the state of processing in processors in a case where an image is decoded by the parallel processing method according to the present invention in the parallel processing unit shown in FIG. 9.
  • DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • An explanation will be made next of a preferred embodiment of the present invention by referring to FIG. 9 to FIG. 21.
  • In the following embodiment, the present invention will be explained by taking as an example an image encoding/decoding apparatus carrying out parallel processing by a plurality of processors to encode and decode a moving picture by MPEG2.
  • Note that, as the units of processing when carrying out the parallel processing of the MPEG encoding and decoding, any of the levels shown in FIG. 1 or a pixel can be considered, but in the following embodiment, the explanation will be made of a case where a macroblock is selected as the unit of parallel processing.
  • When using a macroblock as the unit of parallel processing, the encoding, local decoding, and decoding can be executed in parallel inside one slice, but it is necessary to sequentially execute the variable length coding and variable length decoding. This is because, in variable length coding and variable length decoding, the compressed data of the macroblock has a variable length and the header position of the compressed data of a macroblock on the bit stream is not determined until the variable length coding or the variable length decoding of the macroblock immediately before this is completed.
  • Note that the same limitation applies in the case where the slice is used as the unit of parallel processing.
  • First Image Encoding/Decoding Apparatus
  • First, an explanation will be made of an image encoding/decoding apparatus of the related art for carrying out the encoding and decoding of an image as mentioned above by parallel processing.
  • FIG. 9 is a schematic block diagram of the configuration of a parallel processing unit of an image encoding/decoding apparatus.
  • As shown in FIG. 9, the parallel processing unit 9 of the image encoding/decoding apparatus has n number of processors 2-1 to 2-n, a memory 3, and a connection network 4.
  • First, an explanation will be made of the configuration of this parallel processing unit 9.
  • The n number of processors 2-1 to 2-n are processors for independently carrying out predetermined processing. Each processor 2-i (i=1 to n) has a program read only memory (ROM) or program random access memory (RAM) storing a processing program to be executed and a RAM for storing data etc. regarding the processing. The processor 2-i carries out the predetermined processing according to the program stored in the program ROM or program RAM in advance.
  • Note that, in the present embodiment, it is assumed that n=3, that is, the parallel processing unit 9 has three processors 2-1 to 2-3.
  • Further, in the following explanation, the description will be made of only the processing concerning the encoding and decoding of the image data by the processors 2-1 to 2-n, but the processing for controlling the operation of the entire parallel processing unit 9 is carried out in one of the processors 2-i (i=1 to n) or in each of the n number of processors 2-1 to 2-n in parallel. By this control operation, the processors 2-1 to 2-n carry out the processing as will be explained below in association or in synchronization.
  • The memory 3 is a common memory of the n number of processors 2-1 to 2-n. The image data to be processed and the data of the processing result are stored in the memory 3. Data is appropriately read and written by n number of processors 2-1 to 2-n.
  • The connection network 4 is a connection portion for connecting the n number of processors 2-1 to 2-n and the memory 3 to each other so that the n number of processors 2-1 to 2-n operate in association or the n number of processors 2-1 to 2-n appropriately refer to the memory 3.
  • Next, an explanation will be made of the processing in each processor 2-i (i=1 to 3) and the processing of the parallel processing unit 9 where the parallel processing unit 9 having such a configuration is encoding a moving picture as mentioned above.
  • First, an explanation will be made of the processing in each processor 2-i.
  • In the parallel processing unit 9, the variable length coding of the macroblocks is allotted to one processor (hereinafter, this processor will be referred to as the “master processor”) in a fixed manner and that processor made to sequentially execute the processing, and the encoding and the local decoding are allotted to other processors (hereinafter, these processors will be referred to as “slave processors”) and those processors made to execute the parallel processing. In the parallel processing unit 9 shown in FIG. 9, the first processor 2-1 is made the master processor, and the second and the third processors 2-2 and 2-3 are made the slave processors.
  • First, the first processor 2-1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 10.
  • Namely, when the encoding is started (step S10), the sequence header is generated (step S11), the GOP header is generated (step S12), the picture header is generated (step S13), and the slice header is generated (step S14).
  • When the generation of the slice header is ended, the master processor activates the slave processors (step S15) and enters into a state waiting for the end of the encoding in the slave processors (step S16).
  • When the encoding of the macroblocks in the slave processors is ended (step S16), the variable length coding of those macroblocks is started (step S17). Note that this variable length coding must be sequentially executed due to the limitation as mentioned above. Accordingly, even if the encoding of the macroblock 1 is ended before the encoding of the macroblock 0, the processor 0 first carries out the variable length coding of the macroblock 0 without fail.
  • The master processor repeats this procedure until all processing inside a slice is ended (step S18). When all processing inside the slice is ended, it waits for the end of all processing in the slave processors (step S19).
  • Below, similarly, when all processings of one picture are ended, the processing routine shifts to the processing of the next picture (step S20), and when the processing of all pictures of 1GOP are ended, the processing routine shifts to the processing of the next GOP (step S21). Then, when these processings are repeated until the sequence is ended (step S22), the processing is ended (step S23).
  • Next, the second and third processors 2-2 and 2-3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 11.
  • Namely, when started by the processing of step S15 in the master processor and starting the encoding (step S30), first each of the processors acquires the number of the macroblock to process (step S31) and encodes that macroblock (step S32).
  • When the encoding is ended, the slave processors wait for the end of the variable length coding in the master processor (step S33). When the variable length coding is ended, they carry out the local decoding (step S34).
  • This procedure is repeated until all processing inside a slice are ended (step S35). When all processing inside the slice is ended (step S35), the processing of the slave processors is ended (step S36).
  • Note that, the programs by which the master processor and slave processors carry out the processing are stored in advance in the program ROMs or the program RAMs provided with respect to the processors 2-i. The processors 2-i operate in accordance with these programs so as to carry out these processings.
  • Next, an explanation will be made of the operation of the parallel processing unit 9 when encoding a moving picture by referring to FIG. 12.
  • FIG. 12 is a timing chart of the state of the encoding in the three processors 2-1 to 2-3.
  • Note that, in FIG. 12, the processing “MBx-ENC” indicates the encoding with respect to the (x+1)th macroblock x (step S32 in FIG. 11), the processing “MBx-DEC” indicates the local decoding with respect to the (x+1)th video segment x (step S34 in FIG. 11), and the processing “MBx-VLC” indicates the variable length coding with respect to the (x+1)th video segment x (step S17 in FIG. 10).
  • As shown in FIG. 12, when the encoding is started, first the second processor 2-2 and the third processor 2-3 carry out the encoding MB0-ENC and MB1-ENC of the macroblock 0 and the macroblock 1.
  • When the encoding MB0-ENC of the macroblock 0 in the second processor 2-2 is ended, the first processor 2-1 carries out the variable length coding MB0-VLC with respect to the encoded data.
  • The encoding MB1-ENC of the macroblock 1 in the third processor 2-3 is ended while the variable length coding MB0-VLC of the macroblock 0 is being carried out in the first processor 2-1, therefore, the first processor 2-1 subsequently carries out the variable length coding MB1-VLC with respect to the encoded data of the macroblock 1.
  • On the other hand, in the second processor 2-2, when the variable length coding MB0-VLC with respect to the macroblock 0 is ended in the first processor 2-1, the local decoding MB0-DEC with respect to that data is carried out. Then, when this local decoding MB0-DEC is ended, the encoding MB2-ENC with respect to the next macroblock 2 is carried out.
  • Also in the third processor 2-3, similarly, when the variable length coding MB1-VLC with respect to the macroblock 1 is ended in the first processor 2-1, the local decoding MB0-DEC with respect to that data is carried out. Then, when this local decoding MB0-DEC is ended, the encoding MB3-ENC with respect to the next macroblock 3 is carried out.
  • Below, similarly, in the first processor 2-1, the second processor 2-2, or the third processor 2-3, when the encoding MBx-ENC of the encoding of the macroblock to be processed next is ended, the decoding MBx-VLC of the encoded data is sequentially carried out.
  • Further, in the second processor 2-2 and the third processor 2-3, when the variable length coding MBx-VLC is ended in the first processor 2-1, the local encoding MBx-DEC with respect to the macroblock thereof is carried out, and after the end of the processing, the encoding MBx-ENC with respect to the next macroblock x+1 is subsequently carried out.
  • Note that the variable length coding can be divided into the phase for generating the variable length data from the fixed length data by table conversion and the phase for combining the variable length data to generate the bit stream. These two phases may be sequentially executed, or only the latter phase may be sequently executed and the former phase be executed in parallel. Note that a buffer memory becomes necessary between the former phase and the latter phase in the latter method.
  • Next, an explanation will be made of the processing in each processor 2-i (i=1 to 3) when decoding the moving picture as mentioned above in the parallel processing unit 9 and of the operation of the parallel processing unit 9.
  • First, an explanation will be made of the processing in each processor 2-i.
  • In the parallel processing unit 9, the variable length decoding of macroblocks is allotted to one processor (hereinafter this processor will be referred to as the “master processor”) in a fixed manner and that processor made to sequentially execute the processing. The decoding is allotted to the other processors (hereinafter, these processors will be referred to as the “slave processors”) and the slave processors made to carry out the parallel processing. In the parallel processing unit 9 shown in FIG. 9, the first processor 2-1 is made the master processor, and the second and the third processors 2-2 and 2-3 are made the slave processors.
  • First, the first processor 2-1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 13.
  • Namely, when the decoding is started (step S40), the sequence header is decoded (step S41), the GOP header is decoded (step S42), the picture header is decoded (step S43), and the slice header is decoded (step S44).
  • When the decoding of the slice header is ended, the master processor activates the slave processors (step S45) and carries out the variable length decoding with respect to a macroblock (step S46). The master processor repeatedly carries out this variable length decoding (step S4i6) until this processing is ended for all macroblocks inside the slice.
  • When the variable length decoding with respect to all macroblocks inside a slice is ended, the master processor waits for the end of all processings in the slave processors (step S48). When the processings in the slave processors are ended (step S48), the processing routine shifts to the processing with respect to the next picture (step S49).
  • When the processing of all pictures of one GOP is ended (step S49), the processing routine shifts to the processing of the next GOP (step S50). When the processing of all GOPs is ended (step S50), the processing routine shifts to the processing of the next sequence (step S51). This series of processing is repeated until all sequences are ended (step S51), whereby the processing is ended (step S52).
  • Next, the second and third processors 2-2 and 2-3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 14.
  • Namely, when started by the processing of step S45 in the master processor and starting the decoding (step S60), first each slave processor obtains the number of the macroblock to be processed (step S61) and waits for the end of the variable length decoding of the related macroblock at step S46 at the master processor (step S62).
  • Next, when the variable length decoding is ended, the slave processor decodes the macroblock using that data (step S63).
  • This procedure is repeated until the processing of all macroblocks inside the slice is ended (step S64). When all processing inside the slice is ended (step S64), the processing of the slave processors is ended (step S65).
  • Note that, the programs by which the master processor and slave processors carry out the processing are stored in advance in the program ROMs or the program RAMs provided with respect to the processors 2-i. The processors 2-i operate in accordance with these programs so as to carry out these processings.
  • Further, when a slice is used as the unit of parallel processing in the variable length decoding, the header of the next slice on the bit stream can be found without carrying out the variable length decoding. This becomes possible by finding the slice start code placed at the header of the slice by scanning. Accordingly, a processing method of carrying out only this scanning sequentially and carrying out the other processing containing the variable length decoding in parallel is possible.
  • Next, an explanation will be made of the operation of the parallel processing unit 9 when decoding a moving picture by referring to FIG. 15.
  • FIG. 15 is a timing chart of the state of the decoding in the three processors 2-1 to 2-3.
  • Note that, in FIG. 15, the processing “MBx-VLD” indicates the variable length decoding with respect to the (x+1)th macroblock x (step S46 in FIG. 13), and the processing “MBx-DEC” indicates the decoding with respect to the (x+1)th video segment x (step S63 in FIG. 14).
  • As shown in FIG. 15, when the decoding is started, the first processor 2-1 sequentially carries out the variable length decoding from the macroblock 0.
  • When the variable length decoding of the macroblock 0 is ended in the first processor 2-1, the second processor 2-2 carries out the decoding MB0-DEC with respect to this data.
  • Further, when the variable length decoding of the next macroblock 1 is ended in the first processor 2-1, the third processor 2-3 carries out the decoding MB1-DEC with respect to this data.
  • Thereafter, the processor which ended the decoding among the second processor 2-2 and the third processor 2-3 fetches the data of the next macroblock subjected to the variable length decoding at the first processor 2-1 and carries out the encoding.
  • In this way, the first image encoding/decoding apparatus divides the processing steps of the encoding and decoding into steps able to be processed in parallel and steps relating to variable length coding/decoding not able to be processed in parallel and having to be processed sequentially, allots the steps for which sequential processing is necessary to the master processor and steps which can be processed in parallel to the slave processors, and then carries out the encoding and the decoding.
  • Accordingly, the sequentially input data is sequentially processed at these three processors 2-1 to 2-3 and transformed to the intended compressed and encoded data or the restored image data. By carrying out the encoding and the decoding by parallel processing in this way, the processing can be carried out at a higher speed compared with the usual case where the processing is carried out by one processor.
  • Second Image Encoding/Decoding Apparatus
  • In the first image encoding/decoding apparatus, however, since the sequential processing part (variable length coding and the variable length decoding) was allotted to a specific processor (first processor 2-1) in a fixed manner and that processor made to sequentially execute the processing, there was the disadvantage that the loads became nonuniform among the three processors 2-1 to 2-3.
  • In such a case, if the ratio of execution times of the sequential processing part and the parallel processing part were proportional to the ratio of the numbers of the processors for executing the sequential processing part and the parallel processing part, the loads would become uniform and equal, but if not proportional, the loads of the processors would become nonuniform and unequal resulting in a fall in the performance.
  • For example, in the parallel processing of MPEG encoding shown in FIG. 12, the load of the variable length coding is relatively light, therefore the first processor 2-1 frequently is idle. This becomes even more conspicuous in a parallel processing apparatus having two processors.
  • Further, also in the parallel processing of the MPEG decoding shown in FIG. 15, since the load of the variable length decoding is relatively light, the first processor 2-1 becomes idle at the point of time when one slice's worth of the variable length decoding is ended and until all decoding in the second processor 2-2 and the third processor 2-3 is ended.
  • Further, in the first image encoding/decoding apparatus, since the processing executed at the different processors is different, it is necessary to separately control the processors and synchronize the transfer of data and communication, so there also arises a disadvantage of complicated control.
  • Therefore, an explanation will be made of an image encoding/decoding apparatus according to the present invention, as a second image encoding/decoding apparatus, which solves such disadvantages, in particular, which can encode and decode an image at a further high speed and further which can simplify the structure and control method etc.
  • The hardware structure of the second image encoding/decoding apparatus is the same as that of the first image encoding/decoding apparatus mentioned above.
  • Namely, the parallel processing unit 1 has the configuration as shown in FIG. 9, i.e., has n number of processors 2-1 to 2-n, a memory 3, and a connection network 4. Note that these components are the same as those of the case of the parallel processing unit 9 of the first image encoding/decoding apparatus in terms of hardware structure and therefore will be explained by using the same reference numerals.
  • Further, the functions and configurations of the n number of processors 2-1 to 2-n to the connection network 4 are the same as those of the case of the parallel processing unit 9 of the first image encoding/decoding apparatus, so explanations thereof will be omitted.
  • Further, in the case of the parallel processing unit 1 of the second image encoding/decoding apparatus as well, the number n of processors is 3.
  • In the case of the parallel processing unit 1 of the second image encoding/decoding apparatus having the same hardware structure as that of the parallel processing unit 9 of the first image encoding/decoding apparatus, the method of the encoding and decoding of a moving picture and the operations of the processors 2-i (i=1 to 3) are different from those of the first image encoding/decoding apparatus.
  • Namely, the programs stored in the program ROMs or the program RAMs provided for the three processors 2-1 to 2-3 are different from those of the case of the first image encoding/decoding apparatus. Due to this, the parallel processing unit 1 of the second image encoding/decoding apparatus carries out processing different from that of the parallel processing unit 9 of the first image encoding/decoding apparatus as a whole.
  • In the second image encoding/decoding apparatus, the processors are made to divide and execute not only the parallel processing part, but also the sequential processing part.
  • For encoding, in the parallel processing unit 1 of the second image encoding/decoding apparatus, the processors divide and sequentially carry out the variable length coding of the macroblocks. Accordingly, each processor carries out all of the encoding, variable length coding, and local decoding for the macroblock it is in charge of. At this time, when the variable length coding of a certain macroblock is started, the end of the variable length coding is awaited only when the variable length coding of the previous macroblock has not yet been ended.
  • Further, for the decoding, in the parallel processing unit 1 of the second image encoding/decoding apparatus, the processors divide and sequentially carry out also the variable length decoding of the macroblocks. Accordingly, each processor carries out both of the variable length decoding and decoding for the macroblock it is in charge of. At this time, the end of the variable length decoding is awaited only when the variable length decoding of a certain macroblock has not yet been ended.
  • Below, an explanation will be made of the processing in each processor 2-i (i=1 to 3) when encoding and decoding a moving picture in the parallel processing unit 1 of the second image encoding/decoding apparatus and of the operation of the parallel processing unit 1.
  • First, an explanation will be made of the processing in each processor 2-i when encoding.
  • In the parallel processing unit 1 of the second image encoding/decoding apparatus, in the same way as the first image encoding/decoding apparatus mentioned above, one processor is decided on as the master process and the others as the slave processors and made to carry out different predetermined processing. However, the only difference of processing between the master processor and slave processors is that the master processor generates the headers and starts the slave processors: The encoding, the variable length coding, and the local decoding regarding the actual encoding are carried out at both of the master processor and the slave processors by similar procedures. Namely, the master processor and the slave processors carry out the processing by different processing procedures, but the main processing part of the encoding is carried out by the same procedure.
  • Below, an explanation will be made of the processing of each processor.
  • First, the first processor 2-1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 16.
  • Namely, when the encoding is started (step S70), the sequence header is generated (step S71), the GOP header is generated (step S72), the picture header is generated (step S73), and the slice header is generated (step S74).
  • When the generation of the slice header is ended, the master processor starts the slave processors (step S75).
  • When the start-up of the slave processors is ended, the master processor carries out the processing relating to the encoding in the same way as that by the slave processors.
  • Namely, first, it acquires the number of a macroblock to be processed (step S76) and encodes that macroblock (step S77).
  • Next, it confirms that the variable length coding of the previous macroblock is ended (step S78), carries out the variable length coding (step S79), and, further, carries out the local decoding (step S80).
  • This procedure is repeated until all processing inside the slice is ended (step S81). When all processing inside a slice is ended, the end of all processing in the slave processors is awaited (step S82).
  • Then, when all processing for one picture is ended, the processing routine shifts to the processing of the next picture (step S83). When the processing of all pictures of one GOP is ended, the processing routine shifts to the processing of the next GOP (step S84).
  • This processing is repeated until the sequence is ended (step S85), whereupon the processing is ended (step S86).
  • Next, the second and third processors 2-2 and 2-3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 17.
  • Namely, when started by the processing of step S75 in the master processor and starting the encoding (step S90), first each slave processor obtain the number of the macroblock to be processed (step S91) and encodes that macroblock (step S92).
  • Next, it confirms that the variable length coding of the previous macroblock is ended (step S93), carries out the variable length coding (step S94), and further carries out the local decoding (step S95).
  • This procedure is repeated until all processing inside the slice is ended (step S96). When all processing inside the slice is ended, the processing in the slave processor is ended (step S97).
  • Next, an explanation will be made of the operation of the parallel processing unit 1 when encoding by the operation of three processors 2-1 to 2-3 by such a processing procedure by referring to FIG. 18.
  • FIG. 18 is a timing chart of the state of the encoding in the three processors 2-1 to 2-3.
  • Note that the reference symbols showing processings in FIG. 18 are the same as those shown in FIG. 12, so explanations will be omitted.
  • As illustrated, when the encoding is started, the three processors 2-1 to 2-3 start the encodings MB0-ENC, MB1-ENC, and MB2-ENC of the macroblock 0, macroblock 1, and macroblock 2.
  • Then, when the encoding MB0-ENC is ended, the first processor 2-1 successively carries out the variable length coding MB0-VLC of the macroblock 0 and, further, the local decoding MB0-DEC of the macroblock 0. Further, when the local decoding MB0-DEC of the macroblock 0 is ended, it starts the processing with respect to the next macroblock, that is, the macroblock 3, from the encoding MB3-ENC.
  • On the other hand, when the encoding MB1-ENC of the macroblock 1 is ended, the variable length coding MB0-VLC of the previous macroblock 0 is still being carried out at the first processor 2-1, therefore the second processor 2-2 waits for the end of this variable length coding. When this is ended, it starts the variable length coding MB1-VLC of the macroblock 1. Then, when the variable length coding MB1-VLC is ended, it carries out the local decoding MB1-DEC of the macroblock 1. Further, when the local decoding MB1-DEC of the macroblock 1 is ended, it starts the encoding MB4-ENC with respect to the next macroblock 4.
  • Further, in the third processor 2-3, when the encoding MB2-ENC of the macroblock 2 is ended, the variable length coding MB0-VLC and MB1-VLC of the previous macroblock 0 and macroblock 1 have not yet been ended, therefore, the end of the processing is awaited. When the variable length coding of the macroblock 0 and the macroblock 1 is ended, the variable length coding MB2-VLC of the macroblock 2 is carried out. When the variable length coding MB2-VLC is ended, the local decoding of the macroblock 2 is carried out. Further, when the local decoding MB2-DEC of the macroblock 2 is ended, the encoding MB5-ENC with respect to the next macroblock 5 is started.
  • In this way, the processors 2-1 to 2-3 successively select macroblocks x to be processed and carry out the encoding MBx-ENC, variable length coding MBx-VLC, and the local decoding MBx-DEC with respect to the macroblocks x.
  • By carrying out the processing in this way, the start of the processing need be awaited for only the variable length coding MBx-VLC when the variable length coding MB(x−1)-VLC with respect to the previous macroblock x−1 has not been ended, but the processing can be carried out completely in parallel for other portions.
  • In the variable length coding MBx-VLC thereof as well, the encoding is simultaneously started at the processors 2-1 to 2-3 just at the start of the processing as shown in FIG. 18. Therefore, requests for the start of the variable length coding are superimposed, and idling occurs in the processors 2-2 and 2-3. After this, however, the processing steps in the processors will always be offset from each other and therefore such idling will hardly ever occur. Also in the example shown in FIG. 18, no idling will occur at all in other parts—it will only be necessary to wait a little in the variable length coding MB5-VLC of the macroblock 5 in the third processor 2-3.
  • Next, an explanation will be made of the processing in each processor 2-i when decoding in the second image encoding/decoding apparatus.
  • In the case of decoding as well, in the same way as the first image encoding/decoding apparatus, one processor is decided on as the master processor and the others as the slave processors and made to carry out processing different from each other. The master processor, however, differs from the processing of the slave processors only in the point that it decodes the headers and starts the slave processors: the variable length coding and decoding regarding the actual decoding are carried out by both of the master processor and slave processors by similar procedures. Namely, the master processor and the slave processors carry out processing by different processing procedures, but the main processing part of the decoding is achieved by the same procedure.
  • Below, an explanation will be made of the processing of each processor.
  • First, the first processor 2-1 serving as the master processor carries out the processing as shown in the flow chart of FIG. 19.
  • Namely, when the decoding is started (step S100), the sequence header is decoded (step S101), the GOP header is decoded (step S102), the picture header is decoded (step S103), and the slice header is decoded (step S104).
  • Then, when the decoding of the slice header is ended, the master processor starts the slave processors (step S105).
  • When the start-up of the slave processors is ended, the master processor carries out processing relating to the decoding in the same way as that for the slave processors.
  • Namely, first, it acquires the number of the macroblock to be processed (step S106), confirms that the variable length decoding of the previous macroblock is ended (step S107), and carries out the variable length decoding of that macroblock (step S108).
  • When the variable length decoding is ended, it decodes that macroblock (step S109).
  • This procedure is repeated until all processing inside the slice is ended (step S110). When all processing inside the slice is ended, it waits for the end of all processing in the slave processors (step S111).
  • When all processing for one picture is ended, the processing routine shifts to the processing of the next picture (step S112). When the processing of all pictures of one GOP is ended, the processing routine shifts to the processing of the next GOP (step S113).
  • This processing is repeated until the sequence is ended (step S114), whereupon the processing is ended (step S115).
  • Next, the second and third processors 22 and 2-3 serving as the slave processors carry out the processing as shown in the flow chart of FIG. 20.
  • Namely, when started by the processing of step S105 in the master processor and starting the decoding (step S120), first each slave processor acquires the number of the macroblock to be processed (step S121), confirms that the variable length decoding of the previous macroblock is ended (step S122), and then carries out the variable length decoding of that macroblock (step S123).
  • Next, when the variable length decoding is ended, it decodes that macroblock (step S124).
  • This procedure is repeated until all processing inside the slice is ended (step S125). When all processing inside the slice are ended, the processing in the slave processors is ended (step S126).
  • Next, an explanation will be made of the operation of the parallel processing unit 1 when decoding by the operation of the three processors 2-1 to 2-3 by such a processing procedure by referring to FIG. 21.
  • FIG. 21 is a timing chart of the state of the decoding in the three processors 2-1 to 2-3.
  • Note that reference symbols showing processing in FIG. 21 are the same as those shown in FIG. 15, so explanations will be omitted.
  • As illustrated, when the decoding is started, first, the first processor 2-1 carries out the variable length decoding MB0-VLD of the first macroblock 0.
  • The second processor 2-2 carries out the processing with respect to the macroblock 1, but since it is necessary to successively carry out the processing for every macroblock in variable length decoding, it carries out the variable length decoding MB1-VLD of the macroblock 1 after waiting for the end of the variable length decoding MB0-VLD of the macroblock 0 at the first processor 2-1.
  • The third processor 2-3 similarly carries out the variable length decoding MB2-VLD of the macroblock 2 after waiting for the end of the variable length decoding MB0-VLD for the macroblock 0 at the first processor 2-1 and the variable length decoding MB1-VLD for the macroblock 1 at the second processor 2-2.
  • The first processor 2-1 finishing the variable length decoding MB0-VLD with respect to the macroblock 0 successively carries out the decoding MB0-DEC with respect to the macroblock 0.
  • When that decoding MB0-DEC is ended, the processing with respect to the next macroblock 3 is started. At this time, however, as shown in FIG. 21, if the variable length coding MB2-VLD with respect to the previous macroblock 2 has not been ended, this is waited for before starting and the variable length decoding MB3-VLD with respect to the macroblock 3.
  • Below, similarly, the processors 2-1 to 2-3 successively select the macroblocks x to be processed and carry out the variable length decoding MBx-VLD and decoding MBx-DEC with respect to the macroblocks x.
  • By carrying out the processing in this way, while the start of the variable length decoding MBx-VLD is delayed when the variable length decoding MB(x−1)-VLD with respect to the previous macroblock x−1 has not been ended, the processings can be carried out completely in parallel for other portions.
  • In the variable length decoding MBx-VLD thereof as well, the decoding is simultaneously started at the processors 2-1 to 2-3 at the start of the processing as shown in FIG. 21, therefore the second processor 2-2 and the third processor 2-3 are made to wait and the idling occurs in the processing, but, thereafter, the processing steps in the processors will always be offset from each other and such idling will hardly ever occur. Also, in the example shown in FIG. 13, no idling at all occurs in other processing—though the variable length decoding MB3-VLD of the macroblock 3 at the first processor 2-1 is made to slightly wait.
  • In this way, the second image encoding/decoding apparatus, when carrying out MPEG encoding and decoding, the processors can carry out in a dispersed manner not only the encoding part, the local decoding part, and the decoding part which can be processed in parallel, but also the variable length coding part and variable length decoding part which must be sequentially processed.
  • Accordingly, the load of the sequential processing part can be uniformly and equally dispersed among the processors, and, as shown in FIG. 18 and FIG. 21, the idling time of the processors can be greatly reduced when compared with the first image encoding/decoding apparatus. As a result, the entire encoding and decoding speed can be greatly improved. Note that the effect becomes even more pronounced in a parallel processing apparatus having just two processors.
  • Further, in the parallel processing unit 1 of the second image encoding/decoding apparatus, each of a plurality of processors 2-1 to 2-n carries out a series of encoding and a series of decoding for the macroblock to be processed allotted to it on a continuous basis. For this reason, it is possible to synchronize the processors and reduce the load of the data communication etc. Further, as a result, all of the processing time can be used for the encoding and decodings. As a result, the loads at the processors substantially become uniform and equal, and the encoding and the decoding can be carried out efficiently and at a high speed.
  • Further, all processors can be operated substantially under the same control and processing procedure, therefore the hardware configuration becomes simple.
  • Further, the present invention provides a scalable parallel processing apparatus not depending upon the number of processors, so can be applied to parallel processing apparatus of various configurations.
  • Note that, the present invention is not limited to only the present embodiment. Various modifications are possible.
  • For example, in the parallel processing unit of the embodiment, while there is only one master processor, but there is no restriction on the number of slave processors. Any number is possible.
  • Further, the macroblock number acquired by a slave processor may be dynamically determined by the operating system, may be statically uniquely determined by a compiler or hardware, or may be determined by any other method.
  • Further, it is possible to adopt a configuration in which the programs to be executed at the processors are stored in ROMs in advance and then provided to the parallel processing unit of the image encoding/decoding apparatus or to adopt a configuration in which the programs are stored on a storage medium such as a hard disk or CD-ROM and read into program RAMs or the like at the time of execution.
  • Further, in the present embodiment, as the processor according to the present invention, as shown in FIG. 1, a shared memory type parallel processing apparatus was shown as an example, but the hardware configuration is not limited to this. A so-called “message communication” type parallel processing apparatus not having a common memory and carrying out the transfer etc. of the data “message communication” can be adopted as well.
  • Further, the invention is not restricted to a parallel processing apparatus in which processors are closely connected such as in the present embodiment and can also be applied to a apparatus comprised of respectively independent processors connected by any communication means to cooperate and carry out some intended processing.
  • Namely, the actual configuration of the apparatus may be arbitrarily determined.
  • Further, the parallel processing unit of the image encoding/decoding apparatus was configured having a plurality of processors carrying out predetermined operations according to certain programs operating in parallel to carry out the intended processing, but can also be configured having a plurality of processors comprised of dedicated hardware operating in parallel. For example, the present invention can also be applied to a circuit designed exclusively for variable length coding/decoding such as the encoding/decoding circuit of the MPEG, an image coding DSP, or a media processor.
  • Further, in the present embodiment, DCT was used as the transform system to be carried out at the encoding and decoding. However, any orthogonal transform system can be used as the transform system. Any transform, for example a Fourier transform such as a high speed Fourier transform (FET) and discrete Fourier transform (DFT), a Hadamard transform, and a K-L transform can be used.
  • Further, the present invention is not just applicable to the encoding and decoding of a moving picture as exemplified in the present embodiment. For example, it can also be applied to the encoding and decoding of audio data and text data and the encoding and the decoding of any other data.
  • Summarizing the advantageous effects of the present invention, as explained above, according to the encoding apparatus and decoder of the present invention, when carrying out the encoding and the decoding of, for example, image data, the loads can be equally and efficiently distributed among a plurality of processors and the communication for synchronization among the processors and data communication can be reduced. As a result, the encoding and decoding can be carried out at a high speed, and the control method and the hardware configuration can be simplified.
  • Further, according to the encoding method and the decoding method of the present invention, when carrying out the encoding and the decoding of for example image data by the parallel processing using a plurality of processors, the loads can be equally and efficiently distributed among the processors. Further, the communication for the synchronization among the processors and the data communication can be reduced. As a result, the encoding and decoding can be carried out at a high speed by easy control.
  • Further, the encoding method and the decoding method of the present invention are scalable methods in which the method of distribution of loads does not depend upon the structure of the parallel processor, for example, the number of the processors, so can be applied to parallel processors of a variety of configurations.

Claims (16)

1-19. (canceled)
20. A data processing method for data which is divided into blocks to carry out a predetermined operation consists of a first operation performed sequentially and a second operation performed in parallel, using a multi-processor system including a plurality of processor units, the method comprising:
allotting said first operation to a first processor;
allotting said second operation to a plurality of processors other than said first processor; and
carrying out said allotted operations on each processor successively to perform said predetermined operation of each block in parallel.
21. A data processing method as set forth in claim 1, wherein said first processor carries out said first operation of each block in an order corresponding to the order of said block.
22. A data processing method as set forth in claim 2, wherein said first processor carries out said first operation of each block after said plurality of processors other than said first processor operating a predetermined part of said second operation of the corresponding block.
23. A data processing method as set forth in claim 2, wherein said plurality of processors other than said first processor carry out said second operation of each block after said first operation.
24. A data processing method as set forth in claim 2, wherein said first operation comprises variable length encoding or decoding.
25. A data processing method as set forth in claim 2, wherein said block comprises a macro block or slice of image data.
26. A data processing method as set forth in claim 2, wherein said multi-processor system has a plurality of processors carrying out predetermined operations according to certain programs operating in parallel.
27. A data processing method as set forth in claim 2, wherein said multi-processor system has a plurality of processors comprised of dedicated hardware operating in parallel.
28. A data processing method as set forth in claim 2, wherein said multi-processor system has a plurality of respectively independent processors connected by any communication means to cooperate and carry out some intended processing.
29. A data processing apparatus for data which is divided into blocks to carry out a predetermined operation that consists of a first operation performed sequentially and a second operation performed in parallel, with a multi-processor system including a plurality of processor units, the apparatus comprising:
a first processor for carrying out said first operation; and
a plurality of processor other than said first processor for respectively carrying out said second operation, wherein
each processor carries out each operation successively to perform said predetermined operation of each block in parallel.
30. A data processing apparatus as set forth in claim 10, wherein said first processor carries out said first operation of each block in an order corresponding to the order of said block.
31. A program for processing data divided into blocks to carry out a predetermined operation consists of a first operation performed sequentially and a second operation performed in parallel, using a multi-processor system including a plurality of processor units, comprising:
allotting said first operation to first processor;
allotting said second operation respectively to a plurality of processors other than said first processor; and
carrying out said allotted operation to each processor successively to perform said predetermined operation of each block in parallel.
32. A program for data processing as set forth in claim 12, wherein said first processor carries out said first operation of each block in order corresponding to the order of said block.
33. A multi-processor system including a plurality of processor units for processing data divided into blocks to carry out a predetermined operation consists of a first operation performed sequentially and a second operation performed in parallel, comprising:
a first processor for operating said first operation; and
a plurality of processors other than said first processor for respectively operating said second operation, wherein each processor carries out each operation successively to perform said predetermined operation of each block in parallel.
34. A multi-processor system as set forth in claim 14, wherein said first processor carries out said first operation of each block in order corresponding to the order of said block.
US11/052,190 1998-07-15 2005-02-07 Encoding apparatus and method of same and decoding apparatus and method of same Abandoned US20050147174A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/052,190 US20050147174A1 (en) 1998-07-15 2005-02-07 Encoding apparatus and method of same and decoding apparatus and method of same

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP20035398A JP4427827B2 (en) 1998-07-15 1998-07-15 Data processing method, data processing apparatus, and recording medium
JPP10-200353 1998-07-15
US09/352,422 US6870883B2 (en) 1998-07-15 1999-07-12 Parallel encoding and decoding processor system and method
US11/052,190 US20050147174A1 (en) 1998-07-15 2005-02-07 Encoding apparatus and method of same and decoding apparatus and method of same

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US09/352,422 Continuation US6870883B2 (en) 1998-07-15 1999-07-12 Parallel encoding and decoding processor system and method

Publications (1)

Publication Number Publication Date
US20050147174A1 true US20050147174A1 (en) 2005-07-07

Family

ID=16422895

Family Applications (2)

Application Number Title Priority Date Filing Date
US09/352,422 Expired - Fee Related US6870883B2 (en) 1998-07-15 1999-07-12 Parallel encoding and decoding processor system and method
US11/052,190 Abandoned US20050147174A1 (en) 1998-07-15 2005-02-07 Encoding apparatus and method of same and decoding apparatus and method of same

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US09/352,422 Expired - Fee Related US6870883B2 (en) 1998-07-15 1999-07-12 Parallel encoding and decoding processor system and method

Country Status (2)

Country Link
US (2) US6870883B2 (en)
JP (1) JP4427827B2 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080063082A1 (en) * 2006-09-07 2008-03-13 Fujitsu Limited MPEG decoder and MPEG encoder
US20080111820A1 (en) * 2006-11-15 2008-05-15 Electronics And Telecommunications Research Institute Ocn-based moving picture decoder
US20080212683A1 (en) * 2006-11-14 2008-09-04 Hiroaki Nakata Image Decoding Device, Image Encoding Device and System LSI
US20100061464A1 (en) * 2008-06-03 2010-03-11 Fujitsu Limited Moving picture decoding apparatus and encoding apparatus
US20100166052A1 (en) * 2008-12-25 2010-07-01 Sony Corporation Encoder, decoder, encoding method and decoding method, and recording medium
US20110211642A1 (en) * 2008-11-11 2011-09-01 Samsung Electronics Co., Ltd. Moving picture encoding/decoding apparatus and method for processing of moving picture divided in units of slices
US20120275516A1 (en) * 2010-09-16 2012-11-01 Takeshi Tanaka Image decoding device, image coding device, methods thereof, programs thereof, integrated circuits thereof, and transcoding device
US20140002918A1 (en) * 2012-06-27 2014-01-02 Marvell World Trade Ltd. Systems and methods for reading and decoding encoded data from a storage device
US20140307805A1 (en) * 2013-04-10 2014-10-16 Fujitsu Limited Encoding apparatus, decoding apparatus, encoding method, and decoding method
US9591283B2 (en) 2012-03-17 2017-03-07 Fujitsu Limited Encoding appartus, decoding apparatus, encoding method, and decoding method
US10412409B2 (en) 2008-03-07 2019-09-10 Sk Planet Co., Ltd. Encoding system using motion estimation and encoding method using motion estimation

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1187489B1 (en) * 2000-04-14 2007-08-29 Sony Corporation Decoder and decoding method, recorded medium, and program
US7088720B1 (en) * 2000-08-07 2006-08-08 Sbc Technology Resources, Inc. Multiservice use of network connection capability under user-to-network interface signaling
US7073118B2 (en) * 2001-09-17 2006-07-04 Digeo, Inc. Apparatus and method for saturating decoder values
US7440523B2 (en) * 2001-10-04 2008-10-21 Digeo, Inc. Apparatus and method for decode arbitration in a multi-stream multimedia system
US7181070B2 (en) * 2001-10-30 2007-02-20 Altera Corporation Methods and apparatus for multiple stage video decoding
JP2003150395A (en) * 2001-11-15 2003-05-23 Nec Corp Processor and program transfer method thereof
JP4335516B2 (en) 2001-12-04 2009-09-30 パナソニック株式会社 Video encoding apparatus and method using a plurality of processors
US8401084B2 (en) * 2002-04-01 2013-03-19 Broadcom Corporation System and method for multi-row decoding of video with dependent rows
KR20040098060A (en) * 2002-04-08 2004-11-18 가부시키가이샤 에이팅구 Network game method, network game terminal, and server
JP2005005844A (en) * 2003-06-10 2005-01-06 Hitachi Ltd Computation apparatus and coding processing program
US7782940B2 (en) * 2003-08-01 2010-08-24 Polycom, Inc. Methods for encoding or decoding in a videoconference system to reduce problems associated with noisy image acquisition
JP3879741B2 (en) * 2004-02-25 2007-02-14 ソニー株式会社 Image information encoding apparatus and image information encoding method
JP2006031480A (en) * 2004-07-16 2006-02-02 Sony Corp Information processing system, information processing method, and computer program thereof
US7983342B2 (en) * 2004-07-29 2011-07-19 Stmicroelectronics Pvt. Ltd. Macro-block level parallel video decoder
KR20070105999A (en) 2005-02-16 2007-10-31 톰슨 라이센싱 Agile decoder
JP4374548B2 (en) * 2005-04-15 2009-12-02 ソニー株式会社 Decoding device and method, recording medium, and program
JP4609223B2 (en) * 2005-07-25 2011-01-12 富士ゼロックス株式会社 Image data processing apparatus, image data processing method, and image data processing program
DE602006010402D1 (en) * 2005-08-17 2009-12-24 Nxp Bv VIDEO CODING AND DECODING INCLUDING POSITIONS IN THE CACHE
JP4534910B2 (en) 2005-08-26 2010-09-01 ソニー株式会社 Image processing apparatus, image processing method, recording medium, and program
US8250618B2 (en) * 2006-09-18 2012-08-21 Elemental Technologies, Inc. Real-time network adaptive digital video encoding/decoding
US20080152014A1 (en) * 2006-12-21 2008-06-26 On Demand Microelectronics Method and apparatus for encoding and decoding of video streams
US20080162743A1 (en) * 2006-12-28 2008-07-03 On Demand Microelectronics Method and apparatus to select and modify elements of vectors
KR101343407B1 (en) * 2007-02-26 2013-12-20 삼성전자주식회사 Digital transmission system for transmitting additional data and method thereof
US8184715B1 (en) 2007-08-09 2012-05-22 Elemental Technologies, Inc. Method for efficiently executing video encoding operations on stream processor architectures
JP2009048469A (en) * 2007-08-21 2009-03-05 Fujitsu Ltd Information processing apparatus and information processing method
US8121197B2 (en) * 2007-11-13 2012-02-21 Elemental Technologies, Inc. Video encoding and decoding using parallel processors
EP2269356B1 (en) * 2008-04-18 2018-07-25 Koninklijke Philips N.V. Improved Dual Carrier Modulation Precoding
US8041132B2 (en) * 2008-06-27 2011-10-18 Freescale Semiconductor, Inc. System and method for load balancing a video signal in a multi-core processor
US8311111B2 (en) * 2008-09-11 2012-11-13 Google Inc. System and method for decoding using parallel processing
US20100220215A1 (en) * 2009-01-12 2010-09-02 Jorge Rubinstein Video acquisition and processing systems
JP5275454B2 (en) * 2009-03-31 2013-08-28 パナソニック株式会社 Image decoding device
US8379718B2 (en) 2009-09-02 2013-02-19 Sony Computer Entertainment Inc. Parallel digital picture encoding
US8737825B2 (en) * 2009-09-10 2014-05-27 Apple Inc. Video format for digital video recorder
JP2011066843A (en) * 2009-09-18 2011-03-31 Toshiba Corp Parallel encoding device, program and method for encoding image data
EP2567546A4 (en) * 2010-05-10 2014-01-15 Samsung Electronics Co Ltd Method and apparatus for transmitting and receiving layered coded video
WO2012135019A2 (en) * 2011-03-29 2012-10-04 Lyrical Labs LLC Video encoding system and method
US9100657B1 (en) 2011-12-07 2015-08-04 Google Inc. Encoding time management in parallel real-time video encoding
JP5979406B2 (en) * 2012-02-22 2016-08-24 ソニー株式会社 Image processing apparatus, image processing method, and image processing system
US9588994B2 (en) * 2012-03-02 2017-03-07 International Business Machines Corporation Transferring task execution in a distributed storage and task network
US20140072027A1 (en) * 2012-09-12 2014-03-13 Ati Technologies Ulc System for video compression
US9319702B2 (en) 2012-12-03 2016-04-19 Intel Corporation Dynamic slice resizing while encoding video
US9299166B2 (en) * 2012-12-13 2016-03-29 Qualcomm Incorporated Image compression method and apparatus for bandwidth saving
US10045032B2 (en) 2013-01-24 2018-08-07 Intel Corporation Efficient region of interest detection
US11425395B2 (en) 2013-08-20 2022-08-23 Google Llc Encoding and decoding using tiling
US9794574B2 (en) 2016-01-11 2017-10-17 Google Inc. Adaptive tile data size coding for video and image compression
US10542258B2 (en) 2016-01-25 2020-01-21 Google Llc Tile copying for video compression
US10531134B2 (en) 2017-11-10 2020-01-07 Akamai Technologies, Inc. Determining a time budget for transcoding of video
CN109819178B (en) * 2017-11-21 2022-07-08 虹软科技股份有限公司 Method and device for frame processing

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5115309A (en) * 1990-09-10 1992-05-19 At&T Bell Laboratories Method and apparatus for dynamic channel bandwidth allocation among multiple parallel video coders
US5448655A (en) * 1992-05-26 1995-09-05 Dainippon Screen Mfg. Co., Ltd. Image data processor and image data processing method
US5532744A (en) * 1994-08-22 1996-07-02 Philips Electronics North America Corporation Method and apparatus for decoding digital video using parallel processing
US5650860A (en) * 1995-12-26 1997-07-22 C-Cube Microsystems, Inc. Adaptive quantization
US5663962A (en) * 1994-09-29 1997-09-02 Cselt- Centro Studi E Laboratori Telecomunicazioni S.P.A. Method of multiplexing streams of audio-visual signals coded according to standard MPEG1

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3093499B2 (en) * 1992-03-06 2000-10-03 三菱電機株式会社 Scene change detection device
KR0128244B1 (en) * 1992-09-23 1998-04-02 배순훈 Apparatus for decoding image signal
JP3190755B2 (en) * 1993-01-22 2001-07-23 株式会社東芝 Decryption device
US5583500A (en) * 1993-02-10 1996-12-10 Ricoh Corporation Method and apparatus for parallel encoding and decoding of data
US5376968A (en) * 1993-03-11 1994-12-27 General Instrument Corporation Adaptive compression of digital video data using different modes such as PCM and DPCM
KR100213015B1 (en) * 1994-03-31 1999-08-02 윤종용 Quantization method and circuit
US6005623A (en) * 1994-06-08 1999-12-21 Matsushita Electric Industrial Co., Ltd. Image conversion apparatus for transforming compressed image data of different resolutions wherein side information is scaled
JPH0837662A (en) * 1994-07-22 1996-02-06 Hitachi Ltd Picture encoding/decoding device
KR100197190B1 (en) * 1994-10-12 1999-06-15 모리시다 요이치 Decoding circuit for run length codes
JP3279852B2 (en) * 1994-12-14 2002-04-30 株式会社日立製作所 Signal decoding device
US5757668A (en) * 1995-05-24 1998-05-26 Motorola Inc. Device, method and digital video encoder of complexity scalable block-matching motion estimation utilizing adaptive threshold termination
JPH10164581A (en) * 1996-12-03 1998-06-19 Sony Corp Method and device for coding image signal and signal-recording medium
JPH11215009A (en) * 1998-01-22 1999-08-06 Sony Corp Code decoding device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5115309A (en) * 1990-09-10 1992-05-19 At&T Bell Laboratories Method and apparatus for dynamic channel bandwidth allocation among multiple parallel video coders
US5448655A (en) * 1992-05-26 1995-09-05 Dainippon Screen Mfg. Co., Ltd. Image data processor and image data processing method
US5532744A (en) * 1994-08-22 1996-07-02 Philips Electronics North America Corporation Method and apparatus for decoding digital video using parallel processing
US5663962A (en) * 1994-09-29 1997-09-02 Cselt- Centro Studi E Laboratori Telecomunicazioni S.P.A. Method of multiplexing streams of audio-visual signals coded according to standard MPEG1
US5650860A (en) * 1995-12-26 1997-07-22 C-Cube Microsystems, Inc. Adaptive quantization

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8259810B2 (en) * 2006-09-07 2012-09-04 Fujitsu Limited MPEG decoder and MPEG encoder
US20080063082A1 (en) * 2006-09-07 2008-03-13 Fujitsu Limited MPEG decoder and MPEG encoder
US20080212683A1 (en) * 2006-11-14 2008-09-04 Hiroaki Nakata Image Decoding Device, Image Encoding Device and System LSI
US20080111820A1 (en) * 2006-11-15 2008-05-15 Electronics And Telecommunications Research Institute Ocn-based moving picture decoder
US8526503B2 (en) * 2006-11-15 2013-09-03 Electronics And Telecommunications Research Institute OCN-based moving picture decoder
US10412409B2 (en) 2008-03-07 2019-09-10 Sk Planet Co., Ltd. Encoding system using motion estimation and encoding method using motion estimation
US20100061464A1 (en) * 2008-06-03 2010-03-11 Fujitsu Limited Moving picture decoding apparatus and encoding apparatus
US9042456B2 (en) * 2008-11-11 2015-05-26 Samsung Electronics Co., Ltd. Moving picture encoding/decoding apparatus and method for processing of moving picture divided in units of slices
US20110211642A1 (en) * 2008-11-11 2011-09-01 Samsung Electronics Co., Ltd. Moving picture encoding/decoding apparatus and method for processing of moving picture divided in units of slices
US9432687B2 (en) 2008-11-11 2016-08-30 Samsung Electronics Co., Ltd. Moving picture encoding/decoding apparatus and method for processing of moving picture divided in units of slices
US20100166052A1 (en) * 2008-12-25 2010-07-01 Sony Corporation Encoder, decoder, encoding method and decoding method, and recording medium
US8351722B2 (en) * 2008-12-25 2013-01-08 Sony Corporation Encoder, decoder, encoding method and decoding method, and recording medium
US9185406B2 (en) * 2010-09-16 2015-11-10 Panasonic Intellectual Property Management Co., Ltd. Image decoding device, image coding device, methods thereof, programs thereof, integrated circuits thereof, and transcoding device
US8982964B2 (en) 2010-09-16 2015-03-17 Panasonic Intellectual Property Management Co., Ltd. Image decoding device, image coding device, methods thereof, programs thereof, integrated circuits thereof, and transcoding device
US20120275516A1 (en) * 2010-09-16 2012-11-01 Takeshi Tanaka Image decoding device, image coding device, methods thereof, programs thereof, integrated circuits thereof, and transcoding device
US9591283B2 (en) 2012-03-17 2017-03-07 Fujitsu Limited Encoding appartus, decoding apparatus, encoding method, and decoding method
CN103606378A (en) * 2012-06-27 2014-02-26 马维尔国际贸易有限公司 Systems and methods for reading and decoding encoded data from a storage device
US20140002918A1 (en) * 2012-06-27 2014-01-02 Marvell World Trade Ltd. Systems and methods for reading and decoding encoded data from a storage device
US10255944B2 (en) * 2012-06-27 2019-04-09 Marvell World Trade Ltd. Systems and methods for reading and decoding encoded data from a storage device
US20140307805A1 (en) * 2013-04-10 2014-10-16 Fujitsu Limited Encoding apparatus, decoding apparatus, encoding method, and decoding method
US9948940B2 (en) * 2013-04-10 2018-04-17 Fujitisu Limited Encoding apparatus, decoding apparatus, encoding method, and decoding method

Also Published As

Publication number Publication date
US6870883B2 (en) 2005-03-22
US20020031184A1 (en) 2002-03-14
JP2000030047A (en) 2000-01-28
JP4427827B2 (en) 2010-03-10

Similar Documents

Publication Publication Date Title
US6870883B2 (en) Parallel encoding and decoding processor system and method
USRE48845E1 (en) Video decoding system supporting multiple standards
US9104424B2 (en) Method of operating a video decoding system
Shen et al. Parallel implementation of an MPEG-1 encoder: faster than real time
KR100232992B1 (en) Moving picture decoding circuit
US7403564B2 (en) System and method for multiple channel video transcoding
US6172621B1 (en) Coding apparatus and method of same and decoding apparatus and method of same
US7227589B1 (en) Method and apparatus for video decoding on a multiprocessor system
JPH05268590A (en) Moving picture coding decoding circuit
JP2004514353A (en) Scalable MPEG-2 video system
US8175152B2 (en) Picture coding apparatus and method
KR100266338B1 (en) Moving picture compression unit and method of the moving picture compressing
JP3871348B2 (en) Image signal decoding apparatus and image signal decoding method
WO2011143585A1 (en) Parallel processing of sequentially dependent digital data
JP2000175201A (en) Image processing unit, its method and providing medium
EP1351512A2 (en) Video decoding system supporting multiple standards
JPH10111792A (en) Picture processor
EP1351513A2 (en) Method of operating a video decoding system
Jaspers et al. Embedded compression for memory resource reduction in MPEG systems
JP3861317B2 (en) Variable length decoder
Bozoki et al. Parallel algorithms for MPEG video compression with PVM.
US20230269368A1 (en) Supporting multiple partition sizes using a unified pixel input data interface for fetching reference pixels in video encoders
JP2008288986A (en) Image processor, method thereof, and program

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION