WO2009029797A1 - Architecture for multi-stage decoding of a cabac bitstream - Google Patents

Architecture for multi-stage decoding of a cabac bitstream Download PDF

Info

Publication number
WO2009029797A1
WO2009029797A1 PCT/US2008/074796 US2008074796W WO2009029797A1 WO 2009029797 A1 WO2009029797 A1 WO 2009029797A1 US 2008074796 W US2008074796 W US 2008074796W WO 2009029797 A1 WO2009029797 A1 WO 2009029797A1
Authority
WO
WIPO (PCT)
Prior art keywords
intermediate signal
decoding
cabac
decoder
decode
Prior art date
Application number
PCT/US2008/074796
Other languages
French (fr)
Inventor
Yiliang Bao
Toshiaki Yoshino
Kai Wang
Original Assignee
Qualcomm Incorporated
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Incorporated filed Critical Qualcomm Incorporated
Priority to KR1020127013225A priority Critical patent/KR20120058631A/en
Priority to JP2010523157A priority patent/JP4976553B2/en
Priority to EP08798975.2A priority patent/EP2191650B1/en
Priority to KR1020107007048A priority patent/KR101321012B1/en
Priority to CN200880104665XA priority patent/CN101790889B/en
Publication of WO2009029797A1 publication Critical patent/WO2009029797A1/en

Links

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/4006Conversion to or from arithmetic code
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding

Definitions

  • the present disclosure relates generally to the field of video decoding and, more specifically, to techniques for optimizing Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream decoding.
  • CABAC Context-based Adaptive Binary Arithmetic Coding
  • H.264/AVC has two different entropy coding modes to generate the bitstream.
  • AVC represents adaptive video coding.
  • One coding mode is Context-based Adaptive Variable Length Coding (CAVLC), and the other coding mode is Context- based Adaptive Binary Arithmetic Coding (CABAC).
  • CAVLC is similar to other variable length coding (VLC) schemes. Each syntax element is decoded from one or more integer number of bits.
  • DecodeDecision a binary symbol is decoded in the context of the information already decoded from the bitstream.
  • the probability model of decoding the next binary symbol of the same category is updated based on the value of the binary symbol just decoded.
  • DecodeBypass the symbol is decoded without referring to other information already decoded.
  • DecodeTerminate is similar to DecodeBypass, except that the process is so defined that decoding a symbol of one value consumes only a fractional amount of bits, while decoding a symbol of the other value consumes significantly more bits.
  • the encoding process is the inverse of the decoding process.
  • a CABAC encoder performs binarization to convert each syntax element into one or several binary symbols, and each binary symbol is encoded using one of three binary arithmetic coding processes, namely EncodeDecision, EncodeBypass and EncodeTerminate which correspond to DecodeDecision, DecodeBypass and DecodeTerminate, respectively.
  • EncodeDecision, EncodeBypass and EncodeTerminate which correspond to DecodeDecision, DecodeBypass and DecodeTerminate, respectively.
  • One approach to the problem mentioned above is to transcode the CABAC bitstream into a bitstream without CABAC in the first stage and to perform the decoding of this non-CABAC bitstream and other processing steps in the second stage. Buffering is generally needed between these two stages.
  • the operations normally involve decoding the values of the syntax elements from the CABAC bitstream and re- encoding the syntax elements using another entropy coding scheme, such as H.264 CAVLC or MPEG-2 like V
  • CABAC Context-based Adaptive Binary Arithmetic Coding
  • a device comprising a first decoder stage operable to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order is provided.
  • the device further includes a buffer operable to buffer the intermediate signal in the decoded order and a second decoder stage operable to decode the intermediate signal into a video output signal using a non-CABAC decoding standard.
  • an integrated circuit comprising a first decoder circuit operable to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order.
  • the integrated circuit also includes a buffer operable to buffer the intermediate signal in the decoded order.
  • a second decoder circuit is operable to decode the intermediate signal into a video output signal using a non- CABAC decoding standard.
  • a computer program product includes a computer readable medium having instructions for causing a computer to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order.
  • CABAC Context-based Adaptive Binary Arithmetic Coding
  • the product also includes instructions to cause a computer to buffer the intermediate signal in the decoded order and to decode the intermediate signal into a video output signal using a non-CABAC decoding standard.
  • FIG. 1 shows a general block diagram of a wireless device.
  • FIG. 2A shows an exemplary H.264 standard range register.
  • FIG. 2B shows an exemplary H.264 standard offset register.
  • FIG. 2C shows an exemplary H.264 standard MPS case.
  • FIG. 2D shows an exemplary H.264 standard LPS case.
  • FIG. 3 shows a sample (psuedocode) instruction set of a H.264 standard arithmetic decoding process for one bin.
  • FIG. 4 shows a flowchart of a H.264 standard renormalization process.
  • FIG. 5 shows a flowchart of a H.264 standard normal decoding mode process (DecodeDecision).
  • FIG. 6 shows a flowchart of a H.264 standard bypass decoding mode process
  • FIG. 7 shows a flowchart of a H.264 standard terminate decoding mode process (DecodeTerminte).
  • FIG. 8 shows a general block diagram of a processor performing multi-stage decoding of a CABAC bitstream.
  • FIG. 9 shows a flowchart of a CABAC bitstream multi-stage decoding process.
  • FIG. 10 shows a general block diagram of a multi-stage CABAC decoder engine with a bypass mode.
  • FIG. 11 shows a flowchart of a CABAC bitstream multi-stage decoding process with a bypass mode.
  • exemplary is used herein to mean “serving as an example, instance, or illustration.” Any configuration or design described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other configurations or designs, and the terms “core”, “engine”, “machine”, “processor” and “processing unit” are used interchangeably.
  • Video signals may be characterized in terms of a series of pictures, frames, and/or fields, any of which may further include one or more slices.
  • frame is a broad term that may encompass one or more of frames, fields, pictures and/or slices.
  • Embodiments include systems and methods that facilitate channel switching in a multimedia transmission system.
  • Multimedia data may include one or more of motion video, audio, still images, text or any other suitable type of audio-visual data.
  • Multimedia processing systems such as video encoders, may encode multimedia data using encoding methods based on international standards such as Moving Picture Experts Group (MPEG)-I, -2 and -4 standards, the International Telecommunication Union (ITU)-T H.263 standard, and the ITU-T H.264 standard and its counterpart, ISO/IEC MPEG-4, Part 10, i.e., Advanced Video Coding (AVC), each of which is fully incorporated herein by reference for all purposes.
  • MPEG Moving Picture Experts Group
  • ITU International Telecommunication Union
  • ISO/IEC MPEG-4 Part 10
  • AVC Advanced Video Coding
  • Such encoding, and by extension, decoding, methods generally are directed to compressing the multimedia data for transmission and/or storage. Compression can be broadly thought of as the process of removing redundancy from the multimedia
  • a video signal may be described in terms of a sequence of pictures, which include frames (an entire picture), or fields (e.g., an interlaced video stream comprises fields of alternating odd or even lines of a picture). Further, each frame or field may further include two or more slices, or sub-portions of the frame or field.
  • Multimedia processors such as video encoders, may encode a frame by partitioning it into a subset of pixels. These subsets of pixels may be referred to as blocks or macroblocks and may include, for example, 16x16 pixels. The encoder may further partition each 16x16 macroblock into subblocks. Each subblock may further comprise additional subblocks.
  • subblocks of a 16x16 macroblock may include 16x8 and 8x16 subblocks.
  • Each of the 16x8 and 8x16 subblocks may include, for example, 8x8 subblocks, which themselves may include, for example, 4x4, 4x2 and 2x4 subblocks, and so forth.
  • block may refer to either a macroblock or any size of subblock.
  • a communications system or network may comprise one or more of a wired or wireless communication system, including one or more of a Ethernet, telephone (e.g., POTS), cable, power-line, and fiber optic systems, and/or a wireless system comprising one or more of a code division multiple access (CDMA or CDMA2000) communication system, a frequency division multiple access (FDMA) system, an orthogonal frequency division multiple (OFDM) access system, a time division multiple access (TDMA) system such as GSM/GPRS (General packet Radio Service)/EDGE (enhanced data GSM environment), a TETRA (Terrestrial Trunked Radio) mobile telephone system, a wideband code division multiple access (WCDMA) system, a high data rate (IxEV-DO or IxEV-DO Gold Multicast) system, an IEEE 802.11 system, a MediaFLO system, a D
  • FIG. 1 shows a block diagram of a configuration of a wireless device 10 in a wireless communication system or network.
  • the wireless device 10 may be a cellular or camera phone, a terminal, a handset, a personal digital assistant (PDA), or some other device.
  • the wireless communication system may be a Code Division Multiple Access (CDMA) system, a Global System for Mobile Communications (GSM) system, or some other system.
  • CDMA Code Division Multiple Access
  • GSM Global System for Mobile Communications
  • a handset may be a cellular phone, wireless device, wireless communications device, a video game console, a wirelessly-equipped personal digital assistant (PDA), a laptop computer, or a video-enabled device.
  • CDMA Code Division Multiple Access
  • GSM Global System for Mobile Communications
  • the wireless device 10 is capable of providing bi-directional communications via a receive path and a transmit path.
  • signals transmitted by base stations are received by an antenna 12 and provided to a receiver (RCVR) 14.
  • the receiver 14 conditions and digitizes the received signal and provides samples to a digital section 20 for further processing.
  • a transmitter (TMTR) 16 receives data to be transmitted from the digital section 20, processes and conditions the data, and generates a modulated signal, which is transmitted via the antenna 12 to the base stations.
  • the digital section 20 includes various processing, interface and memory units such as, for example, a modem processor 22, a video processor 24, a controller/processor 26, a display processor 28, an ARM/DSP 32, a graphics processing unit (GPU) 34, an internal memory 36, and an external bus interface (EBI) 38.
  • the modem processor 22 performs processing for data transmission and reception (e.g., encoding, modulation, demodulation, and decoding).
  • the video processor 24 performs processing on video content (e.g., still images, moving videos, and moving texts) for video applications such as camcorder, video playback, and video conferencing.
  • the controller/processor 26 may direct the operation of various processing and interface units within digital section 20.
  • the display processor 28 performs processing to facilitate the display of videos, graphics, and texts on a display unit 30.
  • the ARM/DSP 32 may perform various types of processing for the wireless device 10.
  • the graphics processing unit 34 performs graphics processing.
  • the techniques described herein may be used for any of the processors in the digital section 20, e.g., the video processor 24.
  • the internal memory 36 stores data and/or instructions for various units within the digital section 20.
  • the EBI 38 facilitates the transfer of data between the digital section 20 (e.g., internal memory 36) and a main memory 40 along a bus or data line DL.
  • the digital section 20 may be implemented with one or more DSPs, microprocessors, RISCs, etc.
  • the digital section 20 may also be fabricated on one or more application specific integrated circuits (ASICs) or some other type of integrated circuits (ICs).
  • ASICs application specific integrated circuits
  • ICs integrated circuits
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • FPGA field programmable gate array
  • a general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine.
  • a processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
  • the processor may also include RISCs, ARMs, digital signal processing devices (DSPDs), programmable logic devices (PLDs).
  • RISCs RISCs
  • ARMs digital signal processing devices
  • PLDs programmable logic devices
  • the blocks of a process or algorithm described in connection with the examples disclosed herein may be embodied directly in hardware, in one or more software modules executed by one or more processing elements, or in a combination of the two.
  • a software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form or combination of storage medium known in the art.
  • An example storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium.
  • the storage medium may be integral to the processor.
  • the processor and the storage medium may reside in an Application Specific Integrated Circuit (ASIC).
  • the ASIC may reside in a wireless modem.
  • the processor and the storage medium may reside as discrete components in the wireless modem.
  • FIGS. 2A-2D and 3-7 are related to the H.264 standard such as for decoding a CABAC bitstream.
  • FIGS. 2A-2D and 3-7 disclose generic principles defined by the H.264 standard that may be applied to the configurations of FIGS. 8-11.
  • FIG. 2A shows an exemplary H.264 standard range register 50 and FIG. 2B shows an exemplary H.264 standard offset register 60.
  • the basic idea of the binary arithmetic coding process is recursive interval division.
  • the arithmetic decoding engine core keeps two registers.
  • the first register is a range register 50 with 9-bits.
  • the second register is an offset register 60 which is 9-bits in a regular mode and 10-bits in a bypass mode.
  • FIG. 2C shows an exemplary H.264 standard most probability symbol (MPS) case and FIG. 2D shows an exemplary H.264 standard least probability symbol (LPS) case.
  • the range register 50 keeps track of the width, denoted by the total distance of the subinterval rLPS 52 and subinterval rMPS 54, of the current interval. The offset is from the bit-stream and a point to the current location within the range.
  • the range is divided into two subintervals rLPS 52 and rMPS 54 depending on the context to decode a specific bin.
  • a bin is a CABAC decoded standard format. As will be seen from the description below, a bin when decoded is a binary symbol.
  • the subinterval rLPS 52 and rMPS 54 where the offset falls, decides whether the bin is a MPS or a LPS bin. If the offset is > rMPS, the bin is a LPS bin. Otherwise, the bin is a MPS bin. After the bin is decided, the range and offset are updated.
  • the term pMPS is the probability. The probability should within 0 to 1.
  • the term rMPS is the range*pMPS. The summation of the probabilities of MPS and LPS should be equal to 1.
  • flowchart blocks are performed in the depicted order or these blocks or portions thereof may be performed contemporaneously, in parallel, or in a different order.
  • FIG. 3 shows a sample (psuedocode) instruction set of a H.264 standard arithmetic decoding process 100 for one bin.
  • the instruction set indicates that both the range register 50 and offset register 60 are 9 bits.
  • the range register 50 configuration is also indicated.
  • the bin is a LPS case. Then the range is updated to a new range (range new) set equal to the subinterval rLPS (FIG. 2D) and the new offset (offset new) is set equal to offset - rMPS.
  • FIG. 4 shows a flowchart of a H.264 standard renormalization process 150. After decoding one bin, the range and the offset will be renormalized to keep the precision to decode the next bin.
  • the standard renormalization process 150 ensures that the most significant bit (MSB) of the 9-bit range register 50 is always 1, as represented in FIG. 2A.
  • the standard renormalization process 150 begins with block 152 where a decision is made whether the range is ⁇ 0X100.
  • the value of the range is compared with 256 (or 0x100). If the determination is "No,” the process 150 ends. However, if the determination at block 152 is "Yes,” then block 152 is followed by block 154.
  • the offset is also set to offset (bitwise OR) read bits (1).
  • offset (bitwise OR) read bits (1) represents the value of the RANGE/OFFSET registers shifted left by one bit. After the shift, the least signif ⁇ cant(right most) bit is 0.
  • the expression read bits (1) reads one bit from the bitstream and this one bit is added to the least significant(right most) bit of offset register 60.
  • Block 154 loops back to block 152 described above. The loop of blocks 152 and 154 are repeated until the determination at block 152 is "No," which completes the renormalization process 150.
  • FIG. 5 shows a flowchart of a H.264 standard normal decoding mode process 200.
  • a 64x4 look up table LUT
  • the range is approximated by equal-partitions of the 9-bit range register 50 into four cells.
  • the pLPS is approximated by 64 quantized values indexed by a 6-bit context state.
  • ctxldx is an input to the process 200, represents the index to the context state and provides state information
  • range » 6 represents a right shift by 6 bits or a division by 2 6
  • the result of (range»6)&3 extracts bits 7-6 (the 2 bits after the MSB) in the range register 50 used to address the LUT.
  • the expression ctxldx->state can take a value from 0 to 63 which is used in the 64x4 LUT to get the rLPS.
  • rMPS range-rLPS (4) where rLPS is calculated in equation Eq. (3).
  • !ctxIdx->valMPS denotes an expression where ctxIdx->valMPS can take a value 0 or 1, and "! means bit flip.
  • ctxldx is the input parameter to the function, and it provides the state and valMPS information.
  • valMPS represents the bin output in the MPS case.
  • Block 206 is followed by block 208 where a determination is made whether ctxldx->state is equal to 0. If the determination at bock 208 is "Yes,” then block 208 is followed by block 210 where ctxIdx->valMPS is assigned to equal !ctxIdx->valMPS. Block 210 is followed by block 212. Additionally, if the determination at block 208 is "No,” then block 208 is also followed by block 212. At block 212, ctxldx->state is assigned to equal TransIndexLPS(ctxIDx->state). After each bin is decoded, the state/valMPS associated with each ctxldx needs to be updated. The terms TransIndexLPS/ TransIndexMPS are just 2 LUTs defined in the H.264 standard to calculate the state transition.
  • FIG. 6 shows a general flowchart of a H.264 standard bypass decoding mode process 250.
  • the offset is shifted left by 1 bit and 1 bit is read from the bit stream. The new offset is compared with the range to determine whether the bin is 1 or 0.
  • Block 254 is followed by block 258 where the bin is set equal to zero (0).
  • Blocks 256 and 258 end the process 250. It should be noted that the term bin is also the same as bit.
  • FIG. 7 shows a flowchart of a H.264 standard terminate decoding process 300.
  • the range register 50 (FIG. 2A) is set to OxIFE, 9 bits are read from the bitstream to set the initial offset register 60. [0066] As can be readily seen from above, the 9 bits are used to represent both the range and offset. Therefore, there are a great number of bit wise operations in the CABAC core processes.
  • FIG. 8 shows a general block diagram of a processor 401 performing multistage decoding of a received CABAC bitstream (BS).
  • the processor 401 includes a processing circuit 401 and a multi-stage CABAC decoder engine 402.
  • the multi-stage CABAC decoder engine 402 is implemented in two stages.
  • the CABAC bitstream is decoded only into bins or binary symbols by a CABAC decoding sub-module 404.
  • the bins or binary symbols may be buffered in a bin buffer 406 in the exact order (hereinafter referred to as a "decoded order") as they are decoded from the CABAC bitstream.
  • processing in the first stage is not generally performed after buffering.
  • the bin buffer 406 is shown in phantom to denote that the memory for storing the bins or binary symbols is external to the processor 400.
  • the output of the CABAC decoding sub-module 40 is an intermediate signal and is a CABAC decoded standard format. In general, the operations of the CABAC decoding sub-module 404 can be slow depending on at least the size of the frame.
  • a video decoding sub-module 410 decodes the intermediate signal from the bin buffer 406 and may include a variable length coding (VLC) decoder.
  • VLC variable length coding
  • FIG. 9 shows a flowchart of a CABAC bitstream multi-stage decoding process 500 according to the aspect shown in FIG. 8.
  • the process 500 begins with block 502 where a CABAC bitstream is decoded into an intermediate signal having only bins or binary symbols having a CABAC decoded standard format.
  • Block 602 is followed by block 604 where the binary symbols are buffered in the decoded order which is the exact order as the bins are decoded.
  • Block 604 is followed by block 606.
  • syntax elements reconstruction takes place.
  • Block 606 is followed by block 608 where any final video decoding operations take place to produce a video output signal.
  • FIG. 10 shows a general block diagram of a multi-stage CABAC decoder engine 600 with a bypass mode.
  • the multi-stage CABAC decoder engine 600 includes first and second processing stages 610 and 620, an external memory 630 and a switch 640.
  • the output of the first processing stage 610 is sent to the memory 630 on line 614.
  • the second processing stage 620 is ready to read the data of the buffered intermediate signal in the memory 630, the data is sent to the second processing stage 620 on line 634.
  • the first processing stage 610 and the second processing stage 620 may be separate processing circuits each having their own independent processing speed and processing performance.
  • the first processing stage 610 has a first processing speed and a first performance.
  • the second processing stage 620 has a second processing speed and a second processing performance.
  • the first and second processing speeds may be the same or different.
  • the first processing speed or performance of the first processing stage 610 may be slower than the second processing speed or performance of the second processing stage 620. Nonetheless, the resultant processing performance of first processing stage 610 may diminish as the result of a frame size and/or the implementations of the CABAC decoding process.
  • the switch 640 is operable to pass the buffered intermediate signal stored in the memory 630 when in a non-bypass mode.
  • the non-bypass mode is necessary when the processing speed or performance of the first stage 610 is below a predetermined threshold.
  • the switch 640 is operable to pass the intermediate signal directly from the first processing stage 610 to the second processing stage 620 when the processing speed or performance is above the predetermined threshold.
  • the engine 600 switches to bypass mode.
  • the first processing stage 610 and the second processing stage 620 are implemented in software where the first processing stage 610 is implemented using a first instruction set (e.g. software or source code) and the second processing stage 620 is implemented using a second instruction set (e.g. software or source code). The first instruction set is separate and distinct from the second instruction set.
  • the first processing stage 610 is implemented in hardware and the second processing stage 620 is implemented in software.
  • the first processing stage 610 is implemented in software and the second processing stage 620 is implemented in hardware.
  • the second processing stage 620 receives and decodes the intermediate signal in decoded order from the first processing stage 610 or the memory
  • FIG. 11 shows a flowchart of a CABAC bitstream multi-stage decoding process 700 with a bypass mode.
  • the process 700 begins with CABAC decoding at block 702 of a received bitstream.
  • the CABAC decoding takes place in the first processing stage 610 arranged to only decode the bitstream in accordance with a
  • Block 702 is followed by block 704 where the processing speed and/or processor performance (PP) of the first processing stage 610 is calculated.
  • PP processor performance
  • Block 704 is followed by block 706 where a determination is made whether the PP is greater than a predetermined threshold. If the determination at block 706 is "YES,” then block 706 is followed by block 708 where bin buffering takes place. The intermediate signal is buffered in the exact decoded order.
  • Block 708 is followed by block 710 where the switch 640 is controlled to switch between the bypass mode and the non-bypass modes.
  • Block 710 is followed by block 712 where video decoding takes place to produce a video output signal using a non-CABAC decoding standard.
  • block 706 is followed directly by block 710 where the switch is changed to the bypass mode. In the bypass mode, the intermediate signal having a
  • CABAC decoded standard format is immediately decoded by the second processing stage 620 in accordance with a second decoding process based on a non-CABAC decoding standard without the need for buffering.
  • the objective is to seek a trade-off between the coding efficiency and computation complexity.
  • the processing speed or performance is determined dynamically. The check may be performed periodically or at other designated intervals.
  • the 402 or 600 may be implemented in a Video Bitstream Parser as part of a high definition (HD) decoder or other integrated circuits.
  • HD high definition
  • the functions and/or blocks described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium.
  • Computer- readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another.
  • a storage media may be any available media that can be accessed by a computer.
  • such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer.
  • any connection is properly termed a computer-readable medium.
  • the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave
  • the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium.
  • Disk and disc includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer- readable media.

Abstract

Techniques for optimizing the Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream decoding are disclosed. In one configuration, a device has a first processing circuit operative to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order. A second processing circuit decodes the intermediate signal using a non-CABAC decoding standard. A buffer is provided between the first and second processing circuits to improve processing speeds.

Description

ARCHITECTURE FOR MULTI-STAGE DECODING OF A CABAC BITSTREAM
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] The present Application for Patent claims priority benefit of commonly- assigned Provisional Application Serial No. 60/969,565, filed August 31, 2007 and Provisional Application Serial No. 61/057,009 filed May 29, 2008. This provisional patent application is hereby expressly incorporated by reference herein.
BACKGROUND
I. Field
[0002] The present disclosure relates generally to the field of video decoding and, more specifically, to techniques for optimizing Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream decoding.
II. Background
[0003] H.264/AVC has two different entropy coding modes to generate the bitstream. AVC represents adaptive video coding. One coding mode is Context-based Adaptive Variable Length Coding (CAVLC), and the other coding mode is Context- based Adaptive Binary Arithmetic Coding (CABAC). CAVLC is similar to other variable length coding (VLC) schemes. Each syntax element is decoded from one or more integer number of bits.
[0004] CABAC decodes syntax elements very differently. Each syntax element can be decoded conceptually in two steps. In the first step, CABAC decodes one or more binary symbols using an adaptive binary arithmetic decoder. In the second step, a debinarization process is performed to reconstruct the value of syntax element, and this value is used in the remaining decoding steps of a video decoder. Depending on a bypass flag and context index, a binary symbol is decoded using one of three different decoding processes defined in the adaptive binary arithmetic coder, namely DecodeDecision, DecodeBypass, and DecodeTerminate.
[0005] In DecodeDecision, a binary symbol is decoded in the context of the information already decoded from the bitstream. In addition, the probability model of decoding the next binary symbol of the same category is updated based on the value of the binary symbol just decoded. In DecodeBypass, the symbol is decoded without referring to other information already decoded. In addition, there is no need to maintain a context model after the symbol is decoded. DecodeTerminate is similar to DecodeBypass, except that the process is so defined that decoding a symbol of one value consumes only a fractional amount of bits, while decoding a symbol of the other value consumes significantly more bits.
[0006] Since decoding a symbol, especially using the process DecodeDecision, requires several steps of serial operations, it is very difficult to speed up the overall CABAC decoding process. This has been a challenge in implementing the high- definition (HD) video decoder. Some compressed frames are too large. The CABAC decoder can take significantly more time to finish than other decoding stages. Thus, the overall performance of the decoder is affected.
[0007] The encoding process is the inverse of the decoding process. A CABAC encoder performs binarization to convert each syntax element into one or several binary symbols, and each binary symbol is encoded using one of three binary arithmetic coding processes, namely EncodeDecision, EncodeBypass and EncodeTerminate which correspond to DecodeDecision, DecodeBypass and DecodeTerminate, respectively. [0008] One approach to the problem mentioned above is to transcode the CABAC bitstream into a bitstream without CABAC in the first stage and to perform the decoding of this non-CABAC bitstream and other processing steps in the second stage. Buffering is generally needed between these two stages. In the first stage, the operations normally involve decoding the values of the syntax elements from the CABAC bitstream and re- encoding the syntax elements using another entropy coding scheme, such as H.264 CAVLC or MPEG-2 like VLC encoder.
[0009] One problem with this transcoding approach is the complexity associated with the transcoding operations. The complete CABAC decoder essentially needs to be implemented with the overhead of another VLC encoder and VLC decoder. In addition, the compression efficiency of the VLC encoder may not be satisfactory to minimize the additional traffic.
[0010] There is therefore a continuing need for techniques for optimizing the Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream decoding. SUMMARY
[0011] Techniques for optimizing the Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream decoding are provided. In one configuration, a device comprising a first decoder stage operable to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order is provided. The device further includes a buffer operable to buffer the intermediate signal in the decoded order and a second decoder stage operable to decode the intermediate signal into a video output signal using a non-CABAC decoding standard.
[0012] In another aspect, an integrated circuit comprising a first decoder circuit operable to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order is provided. The integrated circuit also includes a buffer operable to buffer the intermediate signal in the decoded order. A second decoder circuit is operable to decode the intermediate signal into a video output signal using a non- CABAC decoding standard.
[0013] In a further aspect, a computer program product is provided. The computer program product includes a computer readable medium having instructions for causing a computer to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order. The product also includes instructions to cause a computer to buffer the intermediate signal in the decoded order and to decode the intermediate signal into a video output signal using a non-CABAC decoding standard.
[0014] Additional aspects will become more readily apparent from the detailed description, particularly when taken together with the appended drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
[0015] Aspects and configurations of the disclosure will become more apparent from the detailed description set forth below when taken in conjunction with the drawings in which like reference characters identify corresponding elements throughout.
[0016] FIG. 1 shows a general block diagram of a wireless device.
[0017] FIG. 2A shows an exemplary H.264 standard range register. [0018] FIG. 2B shows an exemplary H.264 standard offset register.
[0019] FIG. 2C shows an exemplary H.264 standard MPS case.
[0020] FIG. 2D shows an exemplary H.264 standard LPS case.
[0021] FIG. 3 shows a sample (psuedocode) instruction set of a H.264 standard arithmetic decoding process for one bin.
[0022] FIG. 4 shows a flowchart of a H.264 standard renormalization process.
[0023] FIG. 5 shows a flowchart of a H.264 standard normal decoding mode process (DecodeDecision).
[0024] FIG. 6 shows a flowchart of a H.264 standard bypass decoding mode process
(DecodeBypass).
[0025] FIG. 7 shows a flowchart of a H.264 standard terminate decoding mode process (DecodeTerminte).
[0026] FIG. 8 shows a general block diagram of a processor performing multi-stage decoding of a CABAC bitstream.
[0027] FIG. 9 shows a flowchart of a CABAC bitstream multi-stage decoding process.
[0028] FIG. 10 shows a general block diagram of a multi-stage CABAC decoder engine with a bypass mode.
[0029] FIG. 11 shows a flowchart of a CABAC bitstream multi-stage decoding process with a bypass mode.
[0030] The images in the drawings are simplified for illustrative purposes and are not depicted to scale. To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the
Figures, except that suffixes may be added, when appropriate, to differentiate such elements.
[0031] The appended drawings illustrate exemplary configurations of the invention and, as such, should not be considered as limiting the scope of the invention that may admit to other equally effective configurations. It is contemplated that features or steps of one configuration may be beneficially incorporated in other configurations without further recitation. DETAILED DESCRIPTION
[0032] The word "exemplary" is used herein to mean "serving as an example, instance, or illustration." Any configuration or design described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other configurations or designs, and the terms "core", "engine", "machine", "processor" and "processing unit" are used interchangeably.
[0033] Video signals may be characterized in terms of a series of pictures, frames, and/or fields, any of which may further include one or more slices. As used herein, the term "frame" is a broad term that may encompass one or more of frames, fields, pictures and/or slices.
[0034] Embodiments include systems and methods that facilitate channel switching in a multimedia transmission system. Multimedia data may include one or more of motion video, audio, still images, text or any other suitable type of audio-visual data. [0035] Multimedia processing systems, such as video encoders, may encode multimedia data using encoding methods based on international standards such as Moving Picture Experts Group (MPEG)-I, -2 and -4 standards, the International Telecommunication Union (ITU)-T H.263 standard, and the ITU-T H.264 standard and its counterpart, ISO/IEC MPEG-4, Part 10, i.e., Advanced Video Coding (AVC), each of which is fully incorporated herein by reference for all purposes. Such encoding, and by extension, decoding, methods generally are directed to compressing the multimedia data for transmission and/or storage. Compression can be broadly thought of as the process of removing redundancy from the multimedia data.
[0036] A video signal may be described in terms of a sequence of pictures, which include frames (an entire picture), or fields (e.g., an interlaced video stream comprises fields of alternating odd or even lines of a picture). Further, each frame or field may further include two or more slices, or sub-portions of the frame or field. [0037] Multimedia processors, such as video encoders, may encode a frame by partitioning it into a subset of pixels. These subsets of pixels may be referred to as blocks or macroblocks and may include, for example, 16x16 pixels. The encoder may further partition each 16x16 macroblock into subblocks. Each subblock may further comprise additional subblocks. For example, subblocks of a 16x16 macroblock may include 16x8 and 8x16 subblocks. Each of the 16x8 and 8x16 subblocks may include, for example, 8x8 subblocks, which themselves may include, for example, 4x4, 4x2 and 2x4 subblocks, and so forth. As used herein, the term "block" may refer to either a macroblock or any size of subblock.
[0038] The techniques described herein may be used for wireless communications, computing, personal electronics, etc. An exemplary use of the techniques for wireless communication is described below. A communications system or network may comprise one or more of a wired or wireless communication system, including one or more of a Ethernet, telephone (e.g., POTS), cable, power-line, and fiber optic systems, and/or a wireless system comprising one or more of a code division multiple access (CDMA or CDMA2000) communication system, a frequency division multiple access (FDMA) system, an orthogonal frequency division multiple (OFDM) access system, a time division multiple access (TDMA) system such as GSM/GPRS (General packet Radio Service)/EDGE (enhanced data GSM environment), a TETRA (Terrestrial Trunked Radio) mobile telephone system, a wideband code division multiple access (WCDMA) system, a high data rate (IxEV-DO or IxEV-DO Gold Multicast) system, an IEEE 802.11 system, a MediaFLO system, a DMB system, a DVB-H system, and the like.
[0039] FIG. 1 shows a block diagram of a configuration of a wireless device 10 in a wireless communication system or network. The wireless device 10 may be a cellular or camera phone, a terminal, a handset, a personal digital assistant (PDA), or some other device. The wireless communication system may be a Code Division Multiple Access (CDMA) system, a Global System for Mobile Communications (GSM) system, or some other system. A handset may be a cellular phone, wireless device, wireless communications device, a video game console, a wirelessly-equipped personal digital assistant (PDA), a laptop computer, or a video-enabled device.
[0040] The wireless device 10 is capable of providing bi-directional communications via a receive path and a transmit path. On the receive path, signals transmitted by base stations are received by an antenna 12 and provided to a receiver (RCVR) 14. The receiver 14 conditions and digitizes the received signal and provides samples to a digital section 20 for further processing. On the transmit path, a transmitter (TMTR) 16 receives data to be transmitted from the digital section 20, processes and conditions the data, and generates a modulated signal, which is transmitted via the antenna 12 to the base stations. [0041] The digital section 20 includes various processing, interface and memory units such as, for example, a modem processor 22, a video processor 24, a controller/processor 26, a display processor 28, an ARM/DSP 32, a graphics processing unit (GPU) 34, an internal memory 36, and an external bus interface (EBI) 38. The modem processor 22 performs processing for data transmission and reception (e.g., encoding, modulation, demodulation, and decoding). The video processor 24 performs processing on video content (e.g., still images, moving videos, and moving texts) for video applications such as camcorder, video playback, and video conferencing. The controller/processor 26 may direct the operation of various processing and interface units within digital section 20. The display processor 28 performs processing to facilitate the display of videos, graphics, and texts on a display unit 30. The ARM/DSP 32 may perform various types of processing for the wireless device 10. The graphics processing unit 34 performs graphics processing.
[0042] The techniques described herein may be used for any of the processors in the digital section 20, e.g., the video processor 24. The internal memory 36 stores data and/or instructions for various units within the digital section 20. The EBI 38 facilitates the transfer of data between the digital section 20 (e.g., internal memory 36) and a main memory 40 along a bus or data line DL.
[0043] The digital section 20 may be implemented with one or more DSPs, microprocessors, RISCs, etc. The digital section 20 may also be fabricated on one or more application specific integrated circuits (ASICs) or some other type of integrated circuits (ICs).
[0044] The techniques described herein may be implemented in various hardware units. For example, the various illustrative logical blocks, components, modules, and circuits described in connection with the examples disclosed herein may be implemented or performed with a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration. The processor may also include RISCs, ARMs, digital signal processing devices (DSPDs), programmable logic devices (PLDs). [0045] The blocks of a process or algorithm described in connection with the examples disclosed herein may be embodied directly in hardware, in one or more software modules executed by one or more processing elements, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form or combination of storage medium known in the art. An example storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an Application Specific Integrated Circuit (ASIC). The ASIC may reside in a wireless modem. In the alternative, the processor and the storage medium may reside as discrete components in the wireless modem.
[0046] FIGS. 2A-2D and 3-7, as described below, are related to the H.264 standard such as for decoding a CABAC bitstream. FIGS. 2A-2D and 3-7 disclose generic principles defined by the H.264 standard that may be applied to the configurations of FIGS. 8-11.
[0047] FIG. 2A shows an exemplary H.264 standard range register 50 and FIG. 2B shows an exemplary H.264 standard offset register 60. The basic idea of the binary arithmetic coding process is recursive interval division. The arithmetic decoding engine core keeps two registers. The first register is a range register 50 with 9-bits. The second register is an offset register 60 which is 9-bits in a regular mode and 10-bits in a bypass mode.
[0048] FIG. 2C shows an exemplary H.264 standard most probability symbol (MPS) case and FIG. 2D shows an exemplary H.264 standard least probability symbol (LPS) case. The range register 50 keeps track of the width, denoted by the total distance of the subinterval rLPS 52 and subinterval rMPS 54, of the current interval. The offset is from the bit-stream and a point to the current location within the range. It should be noted that many of the equations and expressions set forth below use syntax similar to C or C++ computer programming language. The expressions are for illustrative purposes and can be expressed in other computer programming languages with different syntax.
[0049] When decoding a bin, the range is divided into two subintervals rLPS 52 and rMPS 54 depending on the context to decode a specific bin. A bin is a CABAC decoded standard format. As will be seen from the description below, a bin when decoded is a binary symbol. The subintervals rLPS 52 and rMPS 54 are defined in equation Eqs.(l) and (2) rLPS = range * pLPS, and (1) rMPS = range * pMPS = range * (1- pLPS) = range - rLPS, (2) where pLPS is the probability of the least probability symbol; and pMPS is the probability of the most probability symbol. The subinterval rLPS 52 and rMPS 54 where the offset falls, decides whether the bin is a MPS or a LPS bin. If the offset is >= rMPS, the bin is a LPS bin. Otherwise, the bin is a MPS bin. After the bin is decided, the range and offset are updated. The term pMPS is the probability. The probability should within 0 to 1. The term rMPS is the range*pMPS. The summation of the probabilities of MPS and LPS should be equal to 1.
[0050] In various configurations below, flowchart blocks are performed in the depicted order or these blocks or portions thereof may be performed contemporaneously, in parallel, or in a different order.
[0051] FIG. 3 showsa sample (psuedocode) instruction set of a H.264 standard arithmetic decoding process 100 for one bin. The instruction set indicates that both the range register 50 and offset register 60 are 9 bits. The range register 50 configuration is also indicated. The instruction set indicates that the range is within 28<=range<29. The arithmetic decoding process 100 is abbreviated and begins at an instruction where a determination is made whether range is >offset >=0. If the determination is "No," the process 100 ends. However, if the determination is "Yes," the next set of instructions is an if-else set. The if statement checks to see if the offset is >= rMPS. If the determination is "Yes," the bin is a LPS case. Then the range is updated to a new range (range new) set equal to the subinterval rLPS (FIG. 2D) and the new offset (offset new) is set equal to offset - rMPS.
[0052] If the if condition is "No," then the bin is a MPS case. Then the range is updated to a new range (range new) set equal to the subinterval rMPS and the new offset (offset new) is set equal to offset. [0053] FIG. 4 shows a flowchart of a H.264 standard renormalization process 150. After decoding one bin, the range and the offset will be renormalized to keep the precision to decode the next bin. The standard renormalization process 150 ensures that the most significant bit (MSB) of the 9-bit range register 50 is always 1, as represented in FIG. 2A. The standard renormalization process 150 begins with block 152 where a decision is made whether the range is <0X100. At block 152, the value of the range is compared with 256 (or 0x100). If the determination is "No," the process 150 ends. However, if the determination at block 152 is "Yes," then block 152 is followed by block 154. At block 154, the range is left shifted by one bit denoted by range = range«l . Likewise, the offset is left shifted by one bit, denoted by offset = offset «1. The offset is also set to offset (bitwise OR) read bits (1). The expression offset (bitwise OR) read bits (1) represents the value of the RANGE/OFFSET registers shifted left by one bit. After the shift, the least signifϊcant(right most) bit is 0. The expression read bits (1) reads one bit from the bitstream and this one bit is added to the least significant(right most) bit of offset register 60.
[0054] Block 154 loops back to block 152 described above. The loop of blocks 152 and 154 are repeated until the determination at block 152 is "No," which completes the renormalization process 150.
[0055] FIG. 5 shows a flowchart of a H.264 standard normal decoding mode process 200. In the standard decoding process 200, to avoid multiplication, a 64x4 look up table (LUT) is used to approximate the rLPS set forth in equation Eq.(l) above. The range is approximated by equal-partitions of the 9-bit range register 50 into four cells. The pLPS is approximated by 64 quantized values indexed by a 6-bit context state. Therefore, at block 202, the rLPS is calculated according to equation Eq.(3) rLPS = lutLPS[ctxIdx->state][(range»6)&3] (3) where ctxldx is an input to the process 200, represents the index to the context state and provides state information; range » 6 represents a right shift by 6 bits or a division by 26; and the result of (range»6)&3 extracts bits 7-6 (the 2 bits after the MSB) in the range register 50 used to address the LUT. The expression ctxldx->state can take a value from 0 to 63 which is used in the 64x4 LUT to get the rLPS. For example, if a range is ObI xx yy yyyy, the range will be within 0x100 to OxIFE, and (range»6)&3 is used to get "xx" of the range. The expression & is a bitwise AND function. [0056] At block 202, the rMPS is also calculated according to equation Eq.(4) rMPS = range-rLPS (4) where rLPS is calculated in equation Eq. (3).
[0057] Block 202 is followed by block 204 where a determination is made whether the offset >= rMPS. If the determination is "Yes," then block 204 is followed by block 206 where the bin, range and offset are calculated according to equations Eq.(5), (6) and
(V) bin = !ctxIdx->valMPS (5) range = rLPS, and (6) offset = offset - rMPS (7) where !ctxIdx->valMPS denotes an expression where ctxIdx->valMPS can take a value 0 or 1, and "!" means bit flip. The term ctxldx is the input parameter to the function, and it provides the state and valMPS information. The term valMPS represents the bin output in the MPS case.
[0058] Block 206 is followed by block 208 where a determination is made whether ctxldx->state is equal to 0. If the determination at bock 208 is "Yes," then block 208 is followed by block 210 where ctxIdx->valMPS is assigned to equal !ctxIdx->valMPS. Block 210 is followed by block 212. Additionally, if the determination at block 208 is "No," then block 208 is also followed by block 212. At block 212, ctxldx->state is assigned to equal TransIndexLPS(ctxIDx->state). After each bin is decoded, the state/valMPS associated with each ctxldx needs to be updated. The terms TransIndexLPS/ TransIndexMPS are just 2 LUTs defined in the H.264 standard to calculate the state transition.
[0059] Returning again to block 204, if the determination at block 204 is "No," then block 204 is followed by block 214 where bin and range are calculated according to equations Eq. (8) and (9) bin = ctxIdx->valMPS; and (8) range = rMPS. (9)
[0060] Block 214 is followed by block 216 where ctxldx->state is assigned to equal TransIndexLPS(ctxIDx->state). Both blocks 212 and 216 proceed to block 218 where the renormalization process 150 takes place. Block 218 ends the process 200. [0061] FIG. 6 shows a general flowchart of a H.264 standard bypass decoding mode process 250. For the bypass decoding mode process 250. In the H.264 standard bypass decoding mode process 250, the offset is shifted left by 1 bit and 1 bit is read from the bit stream. The new offset is compared with the range to determine whether the bin is 1 or 0.
[0062] The standard bypass decoding mode process 250 begins with block 252 where the offset is set equal to offset «1 where «1 represents multiply by 2 or a left shift by 1. Furthermore offset is set equal to offset (bitwise OR) read bits(l). Block 252 is followed by block 254 where a determination is made whether offset is >= range. If the determination is "Yes," then block 254 is followed by block 256 where the bin and offset are calculated according to equations Eq.(lO) and (11)
Bin = l; and (10)
Offset = offset - range. (11)
[0063] If the determination is "No," then block 254 is followed by block 258 where the bin is set equal to zero (0). Blocks 256 and 258 end the process 250. It should be noted that the term bin is also the same as bit.
[0064] FIG. 7 shows a flowchart of a H.264 standard terminate decoding process 300. When decoding the bin indicating the end of slice flag and the I-Pulse Code Modulation (I PCM) mode, a special decoding routine the standard terminate decoding process 300 is called. The standard terminate decoding process 300 begins with block 302 where the range is decremented by 2 (range = range -2). Block 302 is followed by block 304 where a determination is made whether the offset is >= range. If the determination at block 304 is "Yes," then the bin is set equal to one (1) at block 306. However, if the determination at block 304 is "No," then block 304 is followed by block 308 where the bin is set equal to zero (0). Block 308 is followed by block 310 where the renormalization process 150 (FIG. 4) is performed. Both blocks 306 and 310 end the H.264 standard terminate decoding process 300.
[0065] During the CABAC initial stage, the range register 50 (FIG. 2A) is set to OxIFE, 9 bits are read from the bitstream to set the initial offset register 60. [0066] As can be readily seen from above, the 9 bits are used to represent both the range and offset. Therefore, there are a great number of bit wise operations in the CABAC core processes.
[0067] In the H.264 standard normal decoding mode process 200 (FIG 5), whenever an LPS case, since the LPS probability is < 0.5, the new range will be < 0x100. Thus, renormalization is needed to bring the range >= 0x100. In the new exemplary configuration, a count leading zero (CLZ) instruction is used to calculate the amount of left shift needed instead of using a loop. Whenever a MPS case, since the MPS probability is >=0.5, the new range will be from 0x080 to OxIFE. Therefore, at most one left shift is needed for renormalization to bring the most significant bit (MSB) to 1. At the same time, the offset is left shifted by the same amount and new bits are read from the bit stream to fill it up.
[0068] Moreover, in the H.264 standard bypass decoding mode process 250, the offset is always left shifted by 1 and 1 bit is read from the bitstream. This requires very frequent renormalization and reading of bits from the bit-stream both of which are very computationally costly.
[0069] FIG. 8 shows a general block diagram of a processor 401 performing multistage decoding of a received CABAC bitstream (BS). The processor 401 includes a processing circuit 401 and a multi-stage CABAC decoder engine 402. In one aspect, the multi-stage CABAC decoder engine 402 is implemented in two stages. In the first stage (STAGE 1), the CABAC bitstream is decoded only into bins or binary symbols by a CABAC decoding sub-module 404. The bins or binary symbols may be buffered in a bin buffer 406 in the exact order (hereinafter referred to as a "decoded order") as they are decoded from the CABAC bitstream. Further, processing in the first stage is not generally performed after buffering. The bin buffer 406 is shown in phantom to denote that the memory for storing the bins or binary symbols is external to the processor 400. [0070] The output of the CABAC decoding sub-module 40 is an intermediate signal and is a CABAC decoded standard format. In general, the operations of the CABAC decoding sub-module 404 can be slow depending on at least the size of the frame. [0071] In the second stage (STAGE 2) of the multi-stage CABAC decoder engine 402, a video decoding sub-module 410 decodes the intermediate signal from the bin buffer 406 and may include a variable length coding (VLC) decoder. The video decoding sub-module 410 includes a debinarization sub-module 412 operable to reconstruct the values of the syntax elements from the intermediate signal. The syntax elements are then used any remaining decoding operations of the video decoding sub- module 410 in a final decoding sub-module 414 to produce a video output. [0072] FIG. 9 shows a flowchart of a CABAC bitstream multi-stage decoding process 500 according to the aspect shown in FIG. 8. The process 500 begins with block 502 where a CABAC bitstream is decoded into an intermediate signal having only bins or binary symbols having a CABAC decoded standard format. Block 602 is followed by block 604 where the binary symbols are buffered in the decoded order which is the exact order as the bins are decoded. Block 604 is followed by block 606. At block 606, syntax elements reconstruction takes place. Block 606 is followed by block 608 where any final video decoding operations take place to produce a video output signal.
[0073] FIG. 10 shows a general block diagram of a multi-stage CABAC decoder engine 600 with a bypass mode. The multi-stage CABAC decoder engine 600 includes first and second processing stages 610 and 620, an external memory 630 and a switch 640. In operation, the output of the first processing stage 610 is sent to the memory 630 on line 614. When the second processing stage 620 is ready to read the data of the buffered intermediate signal in the memory 630, the data is sent to the second processing stage 620 on line 634.
[0074] The first processing stage 610 and the second processing stage 620 may be separate processing circuits each having their own independent processing speed and processing performance. The first processing stage 610 has a first processing speed and a first performance. The second processing stage 620 has a second processing speed and a second processing performance. The first and second processing speeds may be the same or different. For example, the first processing speed or performance of the first processing stage 610 may be slower than the second processing speed or performance of the second processing stage 620. Nonetheless, the resultant processing performance of first processing stage 610 may diminish as the result of a frame size and/or the implementations of the CABAC decoding process.
[0075] The switch 640 is operable to pass the buffered intermediate signal stored in the memory 630 when in a non-bypass mode. The non-bypass mode is necessary when the processing speed or performance of the first stage 610 is below a predetermined threshold. Alternately, the switch 640 is operable to pass the intermediate signal directly from the first processing stage 610 to the second processing stage 620 when the processing speed or performance is above the predetermined threshold. Thus, the engine 600 switches to bypass mode.
[0076] In one aspect, the first processing stage 610 and the second processing stage 620 are implemented in software where the first processing stage 610 is implemented using a first instruction set (e.g. software or source code) and the second processing stage 620 is implemented using a second instruction set (e.g. software or source code). The first instruction set is separate and distinct from the second instruction set. [0077] In another aspect, the first processing stage 610 is implemented in hardware and the second processing stage 620 is implemented in software. In a still further aspect, the first processing stage 610 is implemented in software and the second processing stage 620 is implemented in hardware.
[0078] In one aspect, the second processing stage 620 receives and decodes the intermediate signal in decoded order from the first processing stage 610 or the memory
630.
[0079] FIG. 11 shows a flowchart of a CABAC bitstream multi-stage decoding process 700 with a bypass mode. The process 700 begins with CABAC decoding at block 702 of a received bitstream. The CABAC decoding takes place in the first processing stage 610 arranged to only decode the bitstream in accordance with a
CABAC decoding standard. Block 702 is followed by block 704 where the processing speed and/or processor performance (PP) of the first processing stage 610 is calculated.
Block 704 is followed by block 706 where a determination is made whether the PP is greater than a predetermined threshold. If the determination at block 706 is "YES," then block 706 is followed by block 708 where bin buffering takes place. The intermediate signal is buffered in the exact decoded order.
[0080] Block 708 is followed by block 710 where the switch 640 is controlled to switch between the bypass mode and the non-bypass modes. Block 710 is followed by block 712 where video decoding takes place to produce a video output signal using a non-CABAC decoding standard. Returning again to block 706, if the determination at block 706 is "NO," then block 706 is followed directly by block 710 where the switch is changed to the bypass mode. In the bypass mode, the intermediate signal having a
CABAC decoded standard format is immediately decoded by the second processing stage 620 in accordance with a second decoding process based on a non-CABAC decoding standard without the need for buffering.
[0081] The objective is to seek a trade-off between the coding efficiency and computation complexity. In the configuration of FIG. 11, the processing speed or performance is determined dynamically. The check may be performed periodically or at other designated intervals.
[0082] The 402 or 600 may be implemented in a Video Bitstream Parser as part of a high definition (HD) decoder or other integrated circuits.
[0083] In one or more exemplary configurations, the functions and/or blocks described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer- readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer- readable media.
[0084] The previous description of the disclosed configurations is provided to enable any person skilled in the art to make or use the disclosure. Various modifications to these configurations will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other configurations without departing from the spirit or scope of the disclosure. Thus, the disclosure is not intended to be limited to the configurations shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein. [0085] WHAT IS CLAIMED IS:

Claims

1. A device comprising: a first decoder stage operable to decode a Context- based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order; a buffer operable to buffer the intermediate signal in the decoded order; and a second decoder stage operable to decode the intermediate signal into a video output signal using a non- CABAC decoding standard.
2. The device according to claim 1, wherein the second decoder stage is operable to reconstruct syntax elements from the intermediate signal and decode the syntax elements.
3. The device according to claim 2, wherein the second decoder stage includes a debinarization sub-module to reconstruct values of the syntax elements.
4. The device according to claim 3, wherein the second decoder stage includes a variable length coding decoder.
5. The device according to claim 1, wherein the intermediate signal includes bins.
6. The device according to claim 1, further comprising: a switch, coupled between the first decoder stage and the second decoder stage, and being operable to directly pass the intermediate signal from the first decoder stage to the second decoder stage in a bypass mode and operable to pass the intermediate signal from the buffer to the second decoder stage in a non-bypass mode.
7. The device according to claim 6, further comprising: a processor performance calculator operable to calculate a processor performance of the first decoder stage; and mode control to control the operation of the switch in response to the calculated processor performance.
8. The device according to claim 1, wherein the device is a cellular phone, a wireless device, a wireless communications device, a video game console, a wirelessly- equipped personal digital assistant (PDA), a laptop computer, or a video-enabled device.
9. An integrated circuit comprising: a first decoder circuit operable to decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order; a buffer operable to buffer the intermediate signal in the decoded order; and a second decoder circuit operable to decode the intermediate signal into a video output signal using a non-CABAC decoding standard.
10. The integrated circuit according to claim 9, wherein the second decoder circuit is operable to reconstruct syntax elements from the intermediate signal and decode the syntax elements.
11. The integrated circuit according to claim 10, wherein the second decoder circuit includes a debinarization sub-module to reconstruct values of the syntax elements.
12. The integrated circuit according to claim 11, wherein the second decoder circuit includes a variable length coding decoder.
13. The integrated circuit according to claim 9, wherein the intermediate signal includes bins.
14. The integrated circuit according to claim 9, further comprising: a switch, coupled between the first decoder circuit and the second decoder circuit, and being operable to directly pass the intermediate signal from the first decoder circuit to the second decoder circuit in a bypass mode and operable to pass the intermediate signal from the buffer to the second decoder circuit in a non-bypass mode.
15. The integrated circuit according to claim 14, further comprising: a processor performance calculator operable to calculate a processor performance of the first decoder circuit; and mode control to control the operation of the switch in response to the calculated processor performance.
16. The integrated circuit according to claim 9, wherein the integrated circuit is a portion of a cellular phone, a wireless device, a wireless communications device, a video game console, a wirelessly-equipped personal digital assistant (PDA), a laptop computer, or a video-enabled device.
17. An integrated circuit comprising: means for decoding a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order; means for buffering the intermediate signal in the decoded order; and means for decoding the intermediate signal into a video output signal using a non-CABAC decoding standard.
18. The integrated circuit according to claim 17, wherein the means for decoding the intermediate signal includes means for reconstructing syntax elements from the intermediate signal and decoding the syntax elements.
19. The integrated circuit according to claim 18, wherein the means for decoding the intermediate signal includes a variable length coding decoder.
20. The integrated circuit according to claim 17, wherein the intermediate signal includes bins.
21. The integrated circuit according to claim 17, further comprising means for switching to pass the intermediate signal from the means for decoding the CABAC bitstream directly to the means for decoding the intermediate signal in a bypass mode and to pass the intermediate signal from the means for buffering to the means for decoding the intermediate signal in a non-bypass mode.
22. The integrated circuit according to claim 21, further comprising: means for calculating a processor performance of the means for decoding the CABAC bitstream; and means for controlling the operation of the switch in response to the calculated processor performance.
23. A computer program product including a computer readable medium having instructions for causing a computer to: decode a Context-based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order; buffer the intermediate signal in the decoded order; and decode the intermediate signal into a video output signal using a non-CABAC decoding standard.
24. The computer program product according to claim 23, wherein the instructions to decode the intermediate signal include instruction to cause the computer to reconstruct syntax elements from the intermediate signal and decode the syntax elements.
25. The computer program product according to claim 24, wherein the instructions to decode the intermediate signal include instructions to cause the computer to decode the intermediate signal according to a variable length coding scheme.
26. The computer program product according to claim 23, wherein the intermediate signal includes bins.
27. The computer program product according to claim 23, further comprising instructions to cause the computer to calculate a processor performance of a processor decoding the CABAC bitstream; and switch, in response to the calculated processor performance, between a non-bypass mode and a bypass mode wherein in the non- bypass mode, the instructions to decode the intermediate signal include instructions to decode the buffered intermediate signal and, in the bypass mode, the instructions to decode the intermediate signal directly.
28. A method comprising: decoding, by a first processing stage, a Context- based Adaptive Binary Arithmetic Coding (CABAC) bitstream into an intermediate signal having a CABAC decoded standard format and a decoded order; buffering the intermediate signal in the decoded order; and decoding, by a second processing stage, the intermediate signal into a video output signal using a non-CABAC decoding standard.
29. The method according to claim 28, wherein the decoding of the intermediate signal includes reconstructing syntax elements from the intermediate signal and decoding the syntax elements.
30. The method according to claim 29, wherein the decoding of the intermediate signal includes a variable length coding decoding.
31. The method according to claim 28, further comprising: selectively bypassing the buffering in a bypass mode; and directly passing the intermediate signal from the first processing stage to the second processing stage in the bypass mode.
32. The method according to claim 28, further comprising: calculating a processor performance of the first processing circuit; and controlling the selectively bypassing in response to the calculated processor performance.
PCT/US2008/074796 2007-08-31 2008-08-29 Architecture for multi-stage decoding of a cabac bitstream WO2009029797A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
KR1020127013225A KR20120058631A (en) 2007-08-31 2008-08-29 Architecture for multi-stage decoding of a cabac bitstream
JP2010523157A JP4976553B2 (en) 2007-08-31 2008-08-29 Architecture for multistage decoding of CABAC bitstreams
EP08798975.2A EP2191650B1 (en) 2007-08-31 2008-08-29 Architecture for multi-stage decoding of a cabac bitstream
KR1020107007048A KR101321012B1 (en) 2007-08-31 2008-08-29 Architecture for multi-stage decoding of a cabac bitstream
CN200880104665XA CN101790889B (en) 2007-08-31 2008-08-29 Architecture for multi-stage decoding of a cabac bitstream

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US96956507P 2007-08-31 2007-08-31
US60/969,565 2007-08-31
US5700908P 2008-05-29 2008-05-29
US61/057,009 2008-05-29
US12/197,133 US7839311B2 (en) 2007-08-31 2008-08-22 Architecture for multi-stage decoding of a CABAC bitstream
US12/197,133 2008-08-22

Publications (1)

Publication Number Publication Date
WO2009029797A1 true WO2009029797A1 (en) 2009-03-05

Family

ID=40279207

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/074796 WO2009029797A1 (en) 2007-08-31 2008-08-29 Architecture for multi-stage decoding of a cabac bitstream

Country Status (7)

Country Link
US (1) US7839311B2 (en)
EP (1) EP2191650B1 (en)
JP (1) JP4976553B2 (en)
KR (2) KR20120058631A (en)
CN (1) CN101790889B (en)
TW (1) TW200926827A (en)
WO (1) WO2009029797A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012172115A1 (en) * 2011-06-16 2012-12-20 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Entropy coding supporting mode switching
US8805099B2 (en) 2011-06-22 2014-08-12 Panasonic Intellectual Property Corporation Of America Image decoding method and image coding method
US9419749B2 (en) 2009-08-19 2016-08-16 Qualcomm Incorporated Methods and apparatus employing FEC codes with permanent inactivation of symbols for encoding and decoding processes
US9438933B2 (en) 2011-11-08 2016-09-06 Samsung Electronics Co., Ltd. Method and device for arithmetic coding of video, and method and device for arithmetic decoding of video
US9538172B2 (en) 2012-04-11 2017-01-03 Qualcomm Incorporated Grouping bypass coded syntax elements in video coding
US10645388B2 (en) 2011-06-16 2020-05-05 Ge Video Compression, Llc Context initialization in entropy coding
US11006115B2 (en) 2011-11-07 2021-05-11 Huawei Technologies Co., Ltd. Video decoder with enhanced CABAC decoding
TWI729135B (en) * 2016-05-20 2021-06-01 美商松下電器(美國)知識產權公司 Encoding device, decoding device, encoding method and decoding method

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9083976B2 (en) * 2008-09-05 2015-07-14 Freescale Semiconductor, Inc. Processing a video stream in real time based on binary information of the video stream
BR112012009445B1 (en) 2009-10-20 2023-02-14 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. AUDIO ENCODER, AUDIO DECODER, METHOD FOR CODING AUDIO INFORMATION, METHOD FOR DECODING AUDIO INFORMATION USING A DETECTION OF A GROUP OF PREVIOUSLY DECODED SPECTRAL VALUES
US20110125987A1 (en) * 2009-11-20 2011-05-26 Qualcomm Incorporated Dedicated Arithmetic Decoding Instruction
CN102792370B (en) 2010-01-12 2014-08-06 弗劳恩霍弗实用研究促进协会 Audio encoder, audio decoder, method for encoding and audio information and method for decoding an audio information using a hash table describing both significant state values and interval boundaries
US10123053B2 (en) * 2011-05-23 2018-11-06 Texas Instruments Incorporated Acceleration of bypass binary symbol processing in video coding
ES2660911T3 (en) * 2011-06-28 2018-03-26 Nec Corporation Method for encoding video quantization parameters and method for decoding video quantization parameters
US9503717B2 (en) 2012-01-09 2016-11-22 Texas Instruments Incorporated Context adaptive binary arithmetic coding (CABAC) with scalable throughput and coding efficiency
AU2012200319B2 (en) 2012-01-19 2015-11-26 Canon Kabushiki Kaisha Method, apparatus and system for encoding and decoding the significance map for residual coefficients of a transform unit
AU2012200345B2 (en) 2012-01-20 2014-05-01 Canon Kabushiki Kaisha Method, apparatus and system for encoding and decoding the significance map residual coefficients of a transform unit
US9445132B2 (en) * 2013-09-09 2016-09-13 Qualcomm Incorporated Two level last significant coefficient (LSC) position coding
CN104980749B (en) * 2014-04-11 2018-04-24 扬智科技股份有限公司 The decoding apparatus and method of arithmetic coding
US9455743B2 (en) * 2014-05-27 2016-09-27 Qualcomm Incorporated Dedicated arithmetic encoding instruction
KR20180031944A (en) 2016-09-21 2018-03-29 삼성전자주식회사 Video decoder and electronic system including the same
US10264264B2 (en) * 2016-09-24 2019-04-16 Apple Inc. Multi-bin decoding systems and methods
US10757412B2 (en) * 2017-01-03 2020-08-25 Avago Technologies International Sales Pte. Limited Architecture flexible binary arithmetic coding system
KR102414164B1 (en) 2017-03-31 2022-06-29 한국전자통신연구원 A method of video processing providing high-throughput arithmetic coding and a method and appratus for decoding and encoding video using the processing.
CN110995285B (en) * 2019-12-27 2023-05-05 成都达安众科技有限公司 UHF RFID step-by-step instruction decoding method and chip

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040085233A1 (en) * 2002-10-30 2004-05-06 Lsi Logic Corporation Context based adaptive binary arithmetic codec architecture for high quality video compression and decompression
US20040260739A1 (en) * 2003-06-20 2004-12-23 Broadcom Corporation System and method for accelerating arithmetic decoding of video data
US20040263361A1 (en) * 2003-06-25 2004-12-30 Lsi Logic Corporation Video decoder and encoder transcoder to and from re-orderable format
WO2006051796A1 (en) * 2004-11-09 2006-05-18 Matsushita Electric Industrial Co., Ltd. Two step arithmetic decoding with conversion into an intermediate format
US20060126744A1 (en) * 2004-12-10 2006-06-15 Liang Peng Two pass architecture for H.264 CABAC decoding process
EP1684435A1 (en) * 2003-10-29 2006-07-26 NEC Corporation Decoding apparatus or encoding apparatus wherein intermediate buffer is inserted between arithmetic sign decoder or encoder and debinarizer or binarizer

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05176187A (en) * 1991-12-20 1993-07-13 Fujitsu Ltd Data compression decoding device
JP3704356B2 (en) * 1995-07-21 2005-10-12 株式会社 日立製作所 Decoded video signal decoding apparatus and storage decoding apparatus using the same

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040085233A1 (en) * 2002-10-30 2004-05-06 Lsi Logic Corporation Context based adaptive binary arithmetic codec architecture for high quality video compression and decompression
US20040260739A1 (en) * 2003-06-20 2004-12-23 Broadcom Corporation System and method for accelerating arithmetic decoding of video data
US20040263361A1 (en) * 2003-06-25 2004-12-30 Lsi Logic Corporation Video decoder and encoder transcoder to and from re-orderable format
EP1684435A1 (en) * 2003-10-29 2006-07-26 NEC Corporation Decoding apparatus or encoding apparatus wherein intermediate buffer is inserted between arithmetic sign decoder or encoder and debinarizer or binarizer
WO2006051796A1 (en) * 2004-11-09 2006-05-18 Matsushita Electric Industrial Co., Ltd. Two step arithmetic decoding with conversion into an intermediate format
US20060126744A1 (en) * 2004-12-10 2006-06-15 Liang Peng Two pass architecture for H.264 CABAC decoding process

Cited By (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9660763B2 (en) 2009-08-19 2017-05-23 Qualcomm Incorporated Methods and apparatus employing FEC codes with permanent inactivation of symbols for encoding and decoding processes
US9419749B2 (en) 2009-08-19 2016-08-16 Qualcomm Incorporated Methods and apparatus employing FEC codes with permanent inactivation of symbols for encoding and decoding processes
US9876607B2 (en) 2009-08-19 2018-01-23 Qualcomm Incorporated Methods and apparatus employing FEC codes with permanent inactivation of symbols for encoding and decoding processes
US9936227B2 (en) 2011-06-16 2018-04-03 Ge Video Compression, Llc Entropy coding of motion vector differences
WO2012172115A1 (en) * 2011-06-16 2012-12-20 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Entropy coding supporting mode switching
US9628827B2 (en) 2011-06-16 2017-04-18 Ge Video Compression, Llc Context initialization in entropy coding
US11012695B2 (en) 2011-06-16 2021-05-18 Ge Video Compression, Llc Context initialization in entropy coding
US10819982B2 (en) 2011-06-16 2020-10-27 Ge Video Compression, Llc Entropy coding supporting mode switching
US9473170B2 (en) 2011-06-16 2016-10-18 Ge Video Compression, Llc Entropy coding of motion vector differences
US9686568B2 (en) 2011-06-16 2017-06-20 Ge Video Compression, Llc Context initialization in entropy coding
US9729883B2 (en) 2011-06-16 2017-08-08 Ge Video Compression, Llc Entropy coding of motion vector differences
US9743090B2 (en) 2011-06-16 2017-08-22 Ge Video Compression, Llc Entropy coding of motion vector differences
US9762913B2 (en) 2011-06-16 2017-09-12 Ge Video Compression, Llc Context initialization in entropy coding
US9768804B1 (en) 2011-06-16 2017-09-19 Ge Video Compression, Llc Context initialization in entropy coding
US9455744B2 (en) 2011-06-16 2016-09-27 Ge Video Compression, Llc Context initialization in entropy coding
US9596475B2 (en) 2011-06-16 2017-03-14 Ge Video Compression, Llc Entropy coding of motion vector differences
US11838511B2 (en) 2011-06-16 2023-12-05 Ge Video Compression, Llc Entropy coding supporting mode switching
US9973761B2 (en) 2011-06-16 2018-05-15 Ge Video Compression, Llc Context initialization in entropy coding
US11533485B2 (en) 2011-06-16 2022-12-20 Ge Video Compression, Llc Entropy coding of motion vector differences
US11516474B2 (en) 2011-06-16 2022-11-29 Ge Video Compression, Llc Context initialization in entropy coding
US9918090B2 (en) 2011-06-16 2018-03-13 Ge Video Compression, Llc Entropy coding supporting mode switching
US9918104B2 (en) 2011-06-16 2018-03-13 Ge Video Compression, Llc Entropy coding of motion vector differences
US9930370B2 (en) 2011-06-16 2018-03-27 Ge Video Compression, Llc Entropy coding of motion vector differences
US9930371B2 (en) 2011-06-16 2018-03-27 Ge Video Compression, Llc Entropy coding of motion vector differences
US10021393B2 (en) 2011-06-16 2018-07-10 Ge Video Compression, Llc Entropy coding of motion vector differences
US10645388B2 (en) 2011-06-16 2020-05-05 Ge Video Compression, Llc Context initialization in entropy coding
US11277614B2 (en) 2011-06-16 2022-03-15 Ge Video Compression, Llc Entropy coding supporting mode switching
US10432939B2 (en) 2011-06-16 2019-10-01 Ge Video Compression, Llc Entropy coding supporting mode switching
US10063858B2 (en) 2011-06-16 2018-08-28 Ge Video Compression, Llc Entropy coding of motion vector differences
US10148962B2 (en) 2011-06-16 2018-12-04 Ge Video Compression, Llc Entropy coding of motion vector differences
US10230954B2 (en) 2011-06-16 2019-03-12 Ge Video Compression, Llp Entropy coding of motion vector differences
US10298964B2 (en) 2011-06-16 2019-05-21 Ge Video Compression, Llc Entropy coding of motion vector differences
US10306232B2 (en) 2011-06-16 2019-05-28 Ge Video Compression, Llc Entropy coding of motion vector differences
US10313672B2 (en) 2011-06-16 2019-06-04 Ge Video Compression, Llc Entropy coding supporting mode switching
US10425644B2 (en) 2011-06-16 2019-09-24 Ge Video Compression, Llc Entropy coding of motion vector differences
US10057603B2 (en) 2011-06-16 2018-08-21 Ge Video Compression, Llc Entropy coding supporting mode switching
US10432940B2 (en) 2011-06-16 2019-10-01 Ge Video Compression, Llc Entropy coding of motion vector differences
US10440364B2 (en) 2011-06-16 2019-10-08 Ge Video Compression, Llc Context initialization in entropy coding
US10630987B2 (en) 2011-06-16 2020-04-21 Ge Video Compression, Llc Entropy coding supporting mode switching
US10630988B2 (en) 2011-06-16 2020-04-21 Ge Video Compression, Llc Entropy coding of motion vector differences
US9344726B2 (en) 2011-06-22 2016-05-17 Sun Patent Trust Image decoding method and image coding method
US8805099B2 (en) 2011-06-22 2014-08-12 Panasonic Intellectual Property Corporation Of America Image decoding method and image coding method
US11445197B2 (en) 2011-11-07 2022-09-13 Huawei Technologies Co., Ltd. Video decoder with enhanced CABAC decoding
US11006115B2 (en) 2011-11-07 2021-05-11 Huawei Technologies Co., Ltd. Video decoder with enhanced CABAC decoding
US9438933B2 (en) 2011-11-08 2016-09-06 Samsung Electronics Co., Ltd. Method and device for arithmetic coding of video, and method and device for arithmetic decoding of video
US9888264B2 (en) 2011-11-08 2018-02-06 Samsung Electronics Co., Ltd. Method and device for arithmetic coding of video, and method and device for arithmetic decoding of video
US9888263B2 (en) 2011-11-08 2018-02-06 Samsung Electronics Co., Ltd. Method and device for arithmetic coding of video, and method and device for arithmetic decoding of video
US9888261B2 (en) 2011-11-08 2018-02-06 Samsung Electronics Co., Ltd. Method and device for arithmetic coding of video, and method and device for arithmetic decoding of video
US9888262B2 (en) 2011-11-08 2018-02-06 Samsung Electronics Co., Ltd. Method and device for arithmetic coding of video, and method and device for arithmetic decoding of video
US9538172B2 (en) 2012-04-11 2017-01-03 Qualcomm Incorporated Grouping bypass coded syntax elements in video coding
TWI729135B (en) * 2016-05-20 2021-06-01 美商松下電器(美國)知識產權公司 Encoding device, decoding device, encoding method and decoding method

Also Published As

Publication number Publication date
CN101790889B (en) 2012-07-04
KR101321012B1 (en) 2013-10-23
CN101790889A (en) 2010-07-28
EP2191650B1 (en) 2018-08-15
JP2010538547A (en) 2010-12-09
EP2191650A1 (en) 2010-06-02
JP4976553B2 (en) 2012-07-18
KR20120058631A (en) 2012-06-07
TW200926827A (en) 2009-06-16
US20090058695A1 (en) 2009-03-05
US7839311B2 (en) 2010-11-23
KR20100058618A (en) 2010-06-03

Similar Documents

Publication Publication Date Title
US7839311B2 (en) Architecture for multi-stage decoding of a CABAC bitstream
JP7231594B2 (en) Entropy encoding of motion vector differences
US8782379B2 (en) H.264 video decoder CABAC core optimization techniques
US8294603B2 (en) System and method for providing high throughput entropy coding using syntax element partitioning
KR101187243B1 (en) Fast parsing of variable-to-fixed-length codes
US11516474B2 (en) Context initialization in entropy coding
KR20070006445A (en) Method and apparatus for hybrid entropy encoding and decoding
JP2022516132A (en) Escape coding for coefficient levels
US10070127B2 (en) Method and apparatus for arithmetic coding and termination
WO2012159295A1 (en) Entropy decoding method and device
Orlandić et al. An efficient hardware architecture of CAVLC encoder based on stream processing
Tian et al. Implementation strategies for statistical codec designs in h. 264/avc standard
Ahangar et al. Real time low complexity VLSI decoder for prefix coded images
TW202349966A (en) Filtering method, filtering model training method and related apparatus
Liu et al. VLD design for AVS and H. 264 dual standards video decoder
Chang et al. A high-performance entropy decoding system for H. 264/AVC
Liu et al. VLD Design for AVS Video Decoder
Wang et al. A CAVLC Decoding Architecture Based on FSM for Real-Time HD Application

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880104665.X

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08798975

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 307/MUMNP/2010

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 2010523157

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2008798975

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 20107007048

Country of ref document: KR

Kind code of ref document: A