Context-based Adaptive Coding and the Emerging H.26L Video Compression Standard - PowerPoint PPT Presentation

1 / 27
About This Presentation
Title:

Context-based Adaptive Coding and the Emerging H.26L Video Compression Standard

Description:

Context-based Adaptive Coding and the Emerging H.26L Video Compression Standard Thomas Wiegand Heinrich Hertz Institute Berlin, Germany wiegand_at_hhi.de – PowerPoint PPT presentation

Number of Views:275
Avg rating:3.0/5.0
Slides: 28
Provided by: GarySu8
Category:

less

Transcript and Presenter's Notes

Title: Context-based Adaptive Coding and the Emerging H.26L Video Compression Standard


1
Context-based Adaptive Codingand the Emerging
H.26L Video Compression Standard
  • Thomas Wiegand
  • Heinrich Hertz Institute
  • Berlin, Germany
  • wiegand_at_hhi.de

2
ITU-T Project H.26L
  • New ITU-T Q.6/SG16 (VCEG) standardization
    activity for video coding especially aimed at 3G
    mobile networks (e.g. UMTS) and broadcast
  • Possible formation of a joint video team with
    MPEG
  • Goal for H.26L 50 bit-rate reduction for same
    fidelity against every existing standard
  • 1999 3 proposals for definition of a first test
    model
  • HHI Warping/OBMC motion model, wavelet- and
    context-based adaptive coding (CABAC)
  • Nokia Affine motion model, multiple block
    transforms
  • Telenor Block-matching with variable
    block-sizes, 4x4-DCT
  • Current Status Definition of 8th test model
    (TML-8) based on Telenor proposal
  • Schedule Final approval in November 2002

3
H.26L Structure
4
The H.26L TML-8 Design, Part 1 of 4
  • Still using a hybrid of DPCM and transform coding
    as in prior standards.
  • Common elements with other standards include
  • 16x16 macroblocks
  • Conventional sampling of chrominance and
    association of luminance and chrominance data
  • Block motion displacement
  • Motion vectors over picture boundaries
  • Variable block-size motion
  • Block transforms (not wavelets or fractals)
  • Scalar quantization

5
H.26L Motion Compensation Accuracy
6
H.26L Multiple Reference Frames
Coder Control
Control Data
Transform/Quantizer
Quant.Transf. coeffs
-
Decoder
Deq./Inv. Transform
0
Motion- Compensated Predictor
Intra/Inter
Motion Data
Motion Estimator
7
The H.26L TML-8 Design, Part 2 of 4
  • Motion Compensation
  • Various block sizes and shapes for motion
    compensation (7 segmentations of the macroblock
    16x16, 16x8, 8x16, 8x8, 8x4, 4x8, 4x4)
  • Multiple reference pictures (per H.263 Annex U)
  • Temporally-reversed motion
  • B picture prediction weighting
  • New SP transition pictures for sequence
    switching
  • 1/4 sample (sort of per MPEG-4) and 1/8 sample
    accuracy motion
  • 6x6 tap filtering to 1/2 sample accuracy,
    bilinear filtering to 1/4 sample accuracy,
    special position with heavier filtering
  • 8x8 tap filtering applied repeatedly for 1/8 pel
    motion

8
H.26L Residual Coding
  • Residual coding is based on 4x4 blocks
  • Integer Transform

9
The H.26L TML-8 Design, Part 3 of 4
  • Transform
  • Integer transform approximating a DCT
  • Based primarily on 4x4 transform size (all prior
    standards used 8x8)
  • Expanded to 8x8 for chroma by 2x2 DC transform
  • Intra Coding Structure
  • Directional spatial prediction (6 types luma, 1
    chroma)
  • Expanded to 16x16 for luma intra by 4x4 DC Xfm

10
The H.26L TML-8 Design, Part 4 of 4
  • Quantization
  • Two inverse scan patterns
  • Logarithmic step size control
  • Smaller step size for chroma (per H.263 Annex T)
  • Deblocking Filter (in loop)
  • Distinct Network Adaptation Layer (NAL) design
    for network transport
  • Slice-structure coding
  • Data partitioning

11
H.26L Entropy Coding
12
Entropy Coding in H.26L Type 1 of 2
  • Universal Variable Length Code (UVLC)
  • Simple design with some disadvantages
  • Probability distribution is static
  • Correlations between symbols are ignored, i.e.
    no conditional probabilities are used
  • Code words must have integer number of bits(Low
    coding efficiency for highly peaked pdfs)

13
Entropy Coding in H.26L Type 2 of 2
  • Context-based adaptive binary arithmetic
    codes(CABAC)
  • Usage of adaptive probability models
  • Exploiting symbol correlations by using contexts
  • Non-integer number of bits per symbol by using
    arithmetic codes
  • Restriction to binary arithmetic coding
  • Simple and fast adaptation mechanism
  • But Binarization is needed for non-binary
    symbols
  • Binarization enables partitioning of state space

14
CABAC Technical Overview
update probability estimation
Context modeling
Probability estimation
Coding engine
Binarization
Adaptive binary arithmetic coder
Uses the provided model for the actual
encodingand updates the model
Maps non-binary symbols to a binary sequence
Chooses a model conditioned on past observations
15
Binarization
  • Mapping to a binary sequence using the unary code
    tree
  • Applies to all non-binary syntax elements except
    for macroblock type
  • Ease of implementation
  • Optimal codes for a geometric pdf p(x) 2(x1)
  • Discriminate between binary decisions (bins) by
    their position in the binary sequence
  • ? Usage of different models for different
    bin_numbers in the arithmetic coder

16
Example Context Modeling and Binarization
Neighboring symbols A and B used for
conditioning of current symbol C
Context determination rule
Current symbol
C4
A2, B3 ? ctx_no(C)1
Context selection
0 0 0 0 1
Binarization
Choice of model depends on bin_no
(bit, model_no.) (0,1) (0,2) (0,3) (0,3)
(1,3)
Feed into the arithmetic coder
17
Probability Estimation and Adaptation
  • Each model only consists of two counters
    counts(0), counts(1)
  • Coding with multiple contexts (models) is easy to
    obtain because of a clean interface between model
    and coder
  • Model (probability estimate) is updated after
    each symbol is encoded ? adaptive model

18
Binary Arithmetic Coding
  • Standard implementations use integer arithmetic
  • Fast, multiplication-free variants of binary
    arithmetic coder exists e.g. MQ-coder used in
    JBIG-2, JPEG-LS, JPEG-2000
  • Estimation Increase in computational complexity
    lower than 10 (MQ) and 20 (Standard-AC) of the
    total decoder execution time at medium bitrate

19
Results Bit-Rate Reduction
20
Results Bit-Rate Reduction
21
Comparison of H.26L to MPEG-4
  • MPEG-4 Advanced Simple Profile (ASP)
  • Motion Compensation 1/4 pel
  • Global Motion Compensation
  • H.26L
  • Motion Compensation 1/4 pel (QCIF), 1/8 pel
    (CIF)
  • Using CABAC entropy coding
  • 5 reference frames in 7 of 8 cases (News 17 /
    25)
  • Both
  • Sequence structure IBBPBBP...
  • QPBQPP2 (step size 25)
  • Search range 32x32 around 16x16 predictor
  • Well-known DlR optimization techniques

22
RD Curves Foreman (QCIF, 10Hz)
39
38
37
36
35
34
33
Average PSNR(Y) dB
32
31
30
29
28
MPEG-4
27
H.26L
26
0
16
32
48
64
80
96
112
128
Bit-rate kbit/s
23
RD Curves Flowergarden (CIF, 30Hz)
38
37
36
35
34
33
32
31
Average PSNR(Y) dB
30
29
28
27
26
25
MPEG-4
24
23
H.26L
22
0
256
512
768
1024
1280
1536
1792
2048
2304
Bit-rate kbit/s
24
PSNR Results H.26L TML8vs. MPEG-4 ASP Anchor
Average A,B,C 2.0 dB Average A,B,C,D,E,F 2.1 dB Average A,B,C 2.0 dB Average A,B,C,D,E,F 2.1 dB Average A,B,C 2.0 dB Average A,B,C,D,E,F 2.1 dB Average PSNR of Luminance Average PSNR of Luminance Average PSNR of Luminance
Average A,B,C 2.0 dB Average A,B,C,D,E,F 2.1 dB Average A,B,C 2.0 dB Average A,B,C,D,E,F 2.1 dB Average A,B,C 2.0 dB Average A,B,C,D,E,F 2.1 dB H.26L TML-8 MPEG-4 ASP Gain
Case A (Avg. 1.9) 32 kbit/s QCIF, 10 fps Foreman 31.9 dB 30.1 dB 1.8 dB
Case A (Avg. 1.9) 32 kbit/s QCIF, 10 fps News 36.0 dB 33.3 dB 2.8 dB
Case A (Avg. 1.9) 32 kbit/s QCIF, 10 fps Container Ship 38.5 dB 36.8 dB 1.8 dB
Case A (Avg. 1.9) 32 kbit/s QCIF, 10 fps Tempete 29.3 dB 27.8 dB 1.5 dB
Case B (Avg. 2.3) 64 kbit/s QCIF, 15 fps Foreman 34.7 dB 32.8 dB 1.9 dB
Case B (Avg. 2.3) 64 kbit/s QCIF, 15 fps News 39.4 dB 35.8 dB 3.7 dB
Case B (Avg. 2.3) 64 kbit/s QCIF, 15 fps Container Ship 40.5 dB 38.6 dB 1.9 dB
Case B (Avg. 2.3) 64 kbit/s QCIF, 15 fps Tempete 31.3 dB 29.4 dB 1.9 dB
Case C (Avg. 1.9) 128 kbit/s CIF, 15 fps Foreman 33.3 dB 31.3 dB 2.0 dB
Case C (Avg. 1.9) 128 kbit/s CIF, 15 fps News 38.7 dB 35.9 dB 2.9 dB
Case C (Avg. 1.9) 128 kbit/s CIF, 15 fps Container Ship 36.7 dB 35.4 dB 1.3 dB
Case C (Avg. 1.9) 128 kbit/s CIF, 15 fps Tempete 28.8 dB 27.6 dB 1.3 dB
25
PSNR Results H.26L TML8vs. MPEG-4 ASP Anchor
Average D,E,F 2.1 dB Average A,B,C,D,E,F 2.1 dB Average D,E,F 2.1 dB Average A,B,C,D,E,F 2.1 dB Average D,E,F 2.1 dB Average A,B,C,D,E,F 2.1 dB Average PSNR of Luminance Average PSNR of Luminance Average PSNR of Luminance
Average D,E,F 2.1 dB Average A,B,C,D,E,F 2.1 dB Average D,E,F 2.1 dB Average A,B,C,D,E,F 2.1 dB Average D,E,F 2.1 dB Average A,B,C,D,E,F 2.1 dB H.26L TML-8 MPEG-4 ASP Gain
Case D (Avg. 1.7) 256 kbit/s CIF, 15 fps Bus 29.4 dB 28.2 dB 1.2 dB
Case D (Avg. 1.7) 256 kbit/s CIF, 15 fps Mobile Calendar 29.6 dB 27.1 dB 2.4 dB
Case D (Avg. 1.7) 256 kbit/s CIF, 15 fps Flower Garden 27.7 dB 26.1 dB 1.5 dB
Case D (Avg. 1.7) 256 kbit/s CIF, 15 fps Tempete 31.5 dB 29.9 dB 1.6 dB
Case E (Avg. 2.0) 512 kbit/s CIF, 30 fps Bus 31.5 dB 29.8 dB 1.8 dB
Case E (Avg. 2.0) 512 kbit/s CIF, 30 fps Mobile Calendar 31.3 dB 28.6 dB 2.8 dB
Case E (Avg. 2.0) 512 kbit/s CIF, 30 fps Flower Garden 28.1 dB 29.9 dB 1.8 dB
Case E (Avg. 2.0) 512 kbit/s CIF, 30 fps Tempete 32.7 dB 30.9 dB 1.8 dB
Case F (Avg. 2.5) 1024 kbit/s CIF, 30 fps Bus 35.0 dB 32.8 dB 2.1 dB
Case F (Avg. 2.5) 1024 kbit/s CIF, 30 fps Mobile Calendar 34.9 dB 31.3 dB 3.6 dB
Case F (Avg. 2.5) 1024 kbit/s CIF, 30 fps Flower Garden 33.6 dB 31.4 dB 2.2 dB
Case F (Avg. 2.5) 1024 kbit/s CIF, 30 fps Tempete 35.5 dB 33.3 dB 2.1 dB
26
Subjective Comparison
27
Conclusions
  • Draft H.26L design is based on hybrid video
    coding
  • Similar in spirit to other standards but with
    important differences
  • Entropy coding can be conducted using
  • One VLC
  • Context-based adaptive arithmetic coding
  • Context-based adaptive arithmetic coding provides
    improvements between 5-15
  • H.26L shows a significant performance gain over
    existing standards including MPEG-4
  • Bit-rate savings up to 50 against MPEG-4 ASP
Write a Comment
User Comments (0)
About PowerShow.com