JPEG and MPEG Standards - PowerPoint PPT Presentation

1 / 32

About This Presentation

Title:

JPEG and MPEG Standards

Description:

Then they are scanned in a zig-zag order into a 1D sequence to be subject to AC ... Zig-Zag scan order. 18. ECE734 VLSI Array Processor for Digital Image Processing ... – PowerPoint PPT presentation

Number of Views:103

Avg rating:3.0/5.0

Slides: 33

Provided by: YuHe8

Category:

more less

Transcript and Presenter's Notes

Title: JPEG and MPEG Standards

1
JPEG and MPEG Standards
2
Outline

Transform-based Image and Video Coding
Linear Transformation DCT
Quantization
Scalar Quantization
Vector Quantization
Entropy Coding
Video Coding Motion Compensation

3
Transform-based Image Coding
Binary bit stream
Input Image
Linear Transform
Quanti- zatioin
Entropy Coding
4
Linear Transform

If the signal is formatted as a vector, a linear
transform can be formulated as a matrix-vector
product that transform the signal into a
different domain.
Examples
K-L Expansion
Discrete Fourier Transform
Discrete cosine transform
Discrete wavelet transform

Energy compaction property The transformed
signal vector has few, large coefficients and
many nearly zero small coefficients. These few
large coefficients can be encoded efficiently
with few bits while retaining the majority of
energy of the original signal.

5
Block-based Image Coding

An image is a 2D signal of pixel intensities
(including colors).
A block-based image coding scheme partitions the
entire image into 8 by 8 or 16 by 16 (or other
size) blocks. Then the coding algorithm is
applied to individual blocks independently.

Blocks may be overlapping or non-overlapping.
Advantage parallel processing can be applied to
process individual blocks in parallel. For
hand-held devices, only one block needs be loaded
into main memory each time.

6
JPEG Image Coding Algorithms
7
JPEG Decoding
DC
DC
IDPCM
8x8
Huffman
block
IDCT
IQ
AC
Huffman
AC
JPEG Decoding Process
8
Pre-Processing

Color sub-sampling
A color image is converted from RGB to YUV color
space. Each pixel in each dimension is 1 byte.
Sub-sample U-V planes 411 scheme.
For every 16 by 16 block of a color image, six 8
by 8 blocks are encoded.
Level shifting Each pixel value is subtracted by
128 so it ranges (128, 127).

Four 8?8 blocks of luminance pixels, plus two 8?8
sub-sampled chrominance components makes a 16 by
16 macro-block
9
Discrete Cosine Transform

8?8 two-dimensional separable DCT
DCT is chosen because it leads to superior energy
compaction for natural images.
F(0,0) DC coefficient ranges (-128x64/4,127x16)
needs 12 bits to represent (including sign bit).
12 bits are more than enough for the remaining AC
coefficients (u gt 0, or v gt 0)

10
Inverse DCT (IDCT)

8?8 two-dimensional separable IDCT
IDCT can be computed using the same routine as DCT

11
DCT Basis Functions
12
Quantization of DCT Coefficients
13
DPCM of DC coefficients

DC coding All DC coefficients of each 8 by 8
blocks of the entire image are combined to make a
sequence of DC coefficients.
Next, DPCM is applied
DiffDC(blocki) DC(blocki) DC(blocki1)
Then DiffDCs will be encoded using Hoffman entropy

Example
Original
1216 ? 1232 ? 1224 ? 1248 ? 1248 ? 1208
After DPCM
1216 ? 16 ? -8 ? 24 ? 0 ? -40

14
Huffman Encoding of DC Coefficients

Encoding and decoding of Huffman code is done via
look-up table.
In JPEG, DC coefficients (after DPCM) are first
grouped according to their magnitudes. Each
category is assigned as a symbol and a Hoffman
table is given. For example, 7 to 4 and 4 to 7
are listed as category 3 which has a code "00.

If the number is positive, the binary
representation of the number will be append to
the Hoffman code of the category number directly.
For example, 6 is encoded as 00 110. If the
number is negative, the appended code is the 1s
complement of that number. For example, -5 is
encoded as 00 010.
Question Given such a table, how to devise a
dedicated hardware to implement the encoding
procedure?

15
JPEG Huffman Table Categories
16
JPEG DC Entropy Coding

Example
-9 category 4. Hence Base code 101
1s complement of (-9) 1C(1001) 0110
Code word 101 0110 1010110
Note that category 3 occurs most frequent and
hence has shortest base code word.

17
AC Coefficients

AC coefficients are first weighted with a
quantization matrix
C(i,j)/q(i,j) Cq(i,j)
Then quantized.
Then they are scanned in a zig-zag order into a
1D sequence to be subject to AC Huffman encoding.
Question Given a 8 by 8 array, how to convert it
into a vector according to the zig-zag scan
order? What is the algorithm?

Zig-Zag scan order
18
AC Coefficients Huffman Encoding

The symbols for encoding AC coefficient consists
both the number of significant bits, as well as
runs of 0s preceding the nonzero AC coefficient.
For example,
5 0 2 0 0 1 is encoded as 100101 11100110
110110
This is according to the table below

19
Huffman Decoding

A look-up table procedure.
Challenge How to perform decoding fast?
Example a Huffman table for six symbols

The decoding process can be modeled as a finite
state machine with the following state diagram.
It decodes one bit of input bit stream per clock
cycle.
Question How to make this process fast enough to
match any input bit rate?

20
Huffman Decoding Implementation

The FSM decoding model decodes one bit per clock
cycle ? constant input rate.
A FSM is a nonlinear recursive equation! ?
look-ahead transform may be applied to expedite
evaluation.
Look-ahead means to exam two or more bits of the
input stream per clock cycle.

If the maximum code word length is examined per
clock cycle, then it is possible to produce one
output symbol per clock cycle, giving a constant
output rate realization.
The complexity of the combinational logic that is
required for look-ahead transformation is the
most difficult part.

21
Video Coding

Video coding is often implemented as encoding a
sequence of images. Motion compensation is used
to exploit temporal redundancy between successive
frames.
Examples MPEG-I, MPEG-II, MPEG-IV, H.323, H.263,
H.263, etc.
Existing video coding standards are based on JPEG
image compression as well as motion compensation.

22
MPEG Encoding
Buffer control
Current frame x(t)
r
Bit stream Buffer

VLC
DCT
Q
?
Q-1
IDCT

Qr(t) reconstructed residue
x(t) predicted frame

x(t) reconstructed current frame

Motion Estimation Compensation
x(t-1)
x(t)
Frame Buffer
This is a simplified block diagram where the
encoding of intra coded frames is not shown.
Motion vectors
23
MPEG Decoding
VLD Variable Length Decoding
Received bit stream
Bit stream Buffer
VLD
Q-1
IDCT

Qr(t) reconstructed residue
x(t) predicted frame

x(t) reconstructed current frame

Frame Buffer
Motion Compensation
x(t-1)
Motion vectors
24
Motion Estimation

Three types of frames
Intra (I) the frame is coded as if it is an
image
Predicted (P) predicted from an I or P frame
Bi-directional (B) forward and backward
predicted from a pair of I or P frames.
A typical frame arrangement is (subscripts are
used to distinguish them)
I1 B1 B2 P1 B3 B4 P2 B5 B6 I2
P1, P2 are both forward-predicted from I1. B1, B2
are interpolated from I1 and P1, B3, B4 are
interpolated from P1, P2, and B5, B6 are
interpolated from P2, I2.

25
Forward Motion Estimation
1
2
3
4
2
4
1
3
8
5
5
6
7
8
7
6
12
11
9
9
10
11
12
10
15
13
16
13
14
15
16
14
Current frame constructed From different
parts of reference frame
Reference frame
26
Block Motion Estimation

MAD Mean absolute difference between the I,j-th
pixel of the current block x(i,j) and the
(Im,jn)-th pixel of the reference frame.
(-p?m,n ? p) is the motion vector corresponding
to the macro-block. M and N are search range.
It is similar to DPCM in the temporal domain, and
has less to do with object motion.

27
Video sequence Tennis frame 0
Prepared by Surin Kittitornkun
28
Video sequence Tennis frame 1
Prepared by Surin Kittitornkun
29
Frame Difference
Prepared by Surin Kittitornkun
30
What is motion estimation?
Prepared by Surin Kittitornkun
31
What is motion compensation ?
Prepared by Surin Kittitornkun
32
Motion Compensated Frame Difference
Prepared by Surin Kittitornkun

Write a Comment

User Comments (0)