Visual Optimizations for Wavelet-Based Lossy Image Compression - PowerPoint PPT Presentation

1 / 109
About This Presentation
Title:

Visual Optimizations for Wavelet-Based Lossy Image Compression

Description:

Visual Optimizations for WaveletBased Lossy Image Compression – PowerPoint PPT presentation

Number of Views:111
Avg rating:3.0/5.0
Slides: 110
Provided by: DC277
Category:

less

Transcript and Presenter's Notes

Title: Visual Optimizations for Wavelet-Based Lossy Image Compression


1
Visual Optimizations for Wavelet-Based Lossy
Image Compression
  • Damon Chandler
  • Visual Communications Lab
  • School of Electrical and Computer Engineering

2
Lossy vs. Lossless Compression
  • CompressionRepresent a signal using fewer
    bits/sample
  • Lossless compression Invertible reconstructed
    signal original signal GIF, PNG, TIFF,
    PKZip, gzip, StuffIt.
  • Lossy compression Non-invertible, reconstructed
    signal ? original signal MP3, JPEG,
    JPEG-2000.

3
Lossy vs. Lossless Compression
  • CompressionRepresent a signal using fewer
    bits/sample
  • Lossless compression Invertible reconstructed
    signal original signal GIF, PNG, TIFF,
    PKZip, gzip, StuffIt.
  • Lossy compression Non-invertible, reconstructed
    signal ? original signal MP3, JPEG,
    JPEG-2000.
  • Focus only on lossy compression of images.
  • Focus only on natural images.
  • Focus only on grayscale natural images.

4
Lossy Image Compression
  • Pixel-value amplitude quantization

8 bits/pixel (256 shades of gray)
2 bits/pixel (4 shades of gray)
5
Lossy Image Compression
  • Pixel-value amplitude quantization

8 bits/pixel (256 shades of gray)
2 bits/pixel (4 shades of gray)
6
Lossy Image Compression
  • Transmit every other row and column

8 bits/pixel (1/4 size)
2 bits/pixel required to reconstruct
7
Lossy Image Compression
2 bits/pixel (dithered)
2 bits/pixel (interpolated)
8
Lossy Image Compression
GoalRepresent the image using building blocks
such that
  • The correlation between elements is reduced.
  • Quantization of element amplitudes produces
    visually pleasing artifacts.

2 bits/pixel (dithered)
2 bits/pixel (interpolated)
9
Discrete Wavelet Transform
  • Invertible, linear transform.
  • Represents an image as sum of little waves.
  • Localized in space and frequency.
  • Subband filtering implementation.

10
Discrete Wavelet Transform
11
Discrete Wavelet Transform
12
Discrete Wavelet Transform
13
Discrete Wavelet Transform
14
Quantization of Subband Coefficient Amplitudes
15
Quantization of Subband Coefficient Amplitudes
Goal Quantize the subbands in a visually optimal
manner.
  • Most image compression methods minimize MSE
    (i.e., minimize the Euclidean distance between
    original and reconstructed images).

16
Quantization of Subband Coefficient Amplitudes
Goal Quantize the subbands in a visually optimal
manner.
  • Most image compression methods minimize MSE
    (i.e., minimize the Euclidean distance between
    original and reconstructed images).

Original
Quantized finest scale (MSE118)
Quantized mid. scale (MSE118)
17
Quantization of Subband Coefficient Amplitudes
GoalQuantize the subbands in a visually optimal
manner.
  • Most image compression methods minimize MSE
    (i.e., minimize the Euclidean distance between
    original and reconstructed images).

To maximize visual quality
  • Need to understand human visual sensitivity to
    the distortions.
  • Need to understand visual processing of the image.

Original
Quantized finest scale (MSE118)
Quantized mid. scale (MSE118)
18
Outline
  • Background Visual detection, contrast
    sensitivity, visual summation.
  • Experiments Detection of wavelet quantization
    distortions, contrast matching.
  • Application Contrast-based quantization.
  • Work in progress State-space of natural images,
    medical-image compression.

19
Contrast Metrics
  • Simple contrast
  • Michelson contrast
  • RMS contrast

20
Contrast Metrics
  • Simple contrast
  • Michelson contrast
  • RMS contrast
  • Contrast detection threshold (CT) Minimum
    contrast required to visually detect a target.
  • Contrast sensitivity ? 1 / CT.

21
Contrast Sensitivity Function
From Peli 1996
22
Contrast Sensitivity Function
From Peli 1996
23
  • Targets Wavelet subband quantization
    distortions 1.15, 2.3, 4.6, 9.2, 18.4 c/deg.
  • Backgrounds (masks)

Unmasked (10.1 cd/m2 gray)
Natural-image maskers
24
Experiment 1 Detection of Simple Wavelet
Distortions
luminance (cd/m2)
  • Apparatus
  • HP4033A 21 monitor,
  • Stimuli
  • Subbands obtained using 9/7 Daubechies filters.
  • Distortions via uniform scalar quantization of
    one subband.
  • Two natural images balloon and horse.
  • Procedures
  • 3AFC paradigm (choose the odd one out).
  • Interleaved conditions (alternate images).

pixel value
25
DWT
Quantize one band
Subtract orig. image
DWT-1
Add 128
26
(No Transcript)
27
(No Transcript)
28
Experiment 1 Unmasked Detection Thresholds
29
Experiment 1 Masked Detection Thresholds
30
Experiment 1 Threshold Elevations
31
Experiment 1 Observations
  • CTs vary with spatial frequency.
  • Unmasked CSF for wavelet distortions much more
    low-pass than CSF for gratings.
  • Differences in bandwidth?
  • Similar results for 1-octave Gabors (Peli 93)
    and for wavelet noise (Watson 97).
  • Masked CSF shows greatest elevations in threshold
    at lower spatial frequencies.
  • Attributable to 1/f spectrum?

32
Compound Wavelet Distortions
33
Quicks Vector Magnitude Summation
34
Quicks Vector Magnitude Summation
35
Quicks Vector Magnitude Summation
(Graham 1977,1989)
36
Quicks Vector Magnitude Summation
(Graham 1977,1989)
37
Quicks Vector Magnitude Summation
(Graham 1977,1989)
Relative Contrast
38
Quicks Vector Magnitude Summation
(Graham 1977,1989)
Relative Contrast
39
Quicks Vector Magnitude Summation
Compound _at_ threshold
Relative Contrast Threshold
40
Quicks Vector Magnitude Summation
Compound _at_ threshold
Relative Contrast Threshold
With N2 components
41
Relative Contrast Space
42
Experiment 2 Detection of Compound Wavelet
Distortions
  • Apparatus
  • HP4033A 21 monitor
  • Stimuli
  • Subbands obtained using 9/7 Daubechies filters.
  • Distortions via uniform scalar quantization of
    two subbands.
  • Two natural images balloon and horse.
  • Procedures
  • 3AFC paradigm (choose the odd one out).
  • Interleaved conditions (alternate images).

43
DWT
Quantize two bands
Subtract orig. image
DWT-1
Add 128
44
HV, 4.6 c/deg
HV, 2.3 c/deg
HV, 1.15 c/deg
H, 2.34.6 c/deg
H, 1.152.3 c/deg
V, 2.34.6 c/deg
V, 1.152.3 c/deg
45
HV, 4.6 c/deg
HV, 2.3 c/deg
HV, 1.15 c/deg
H, 2.34.6 c/deg
H, 1.152.3 c/deg
V, 2.34.6 c/deg
V, 1.152.3 c/deg
46
Experiment 2 Unmasked Summation HV
47
Experiment 2 Unmasked Summation SF
48
Experiment 2 Masked Summation HV
49
Experiment 2 Masked Summation SF
50
Experiment 2 Observations
  • Unmasked summation more in line with probability
    summation
  • ? 4-5
  • Non-significant effect of spatial-frequency or
    orientation.
  • Masked summation closer to energy or linear
    summation
  • ? ? 1.5
  • Off-frequency looking (channel switching)?
  • Natural-image backgrounds activate higher-levels?

51
Image with Compound Distortions _at_ Threshold (?
?)
52
Image with Compound Distortions _at_ Threshold (?
1.5)
53
What about suprathreshold distortions?
  • Can use CSF to generate an image with distortions
    _at_ threshold.
  • Detection thresholds reveal little about visual
    responses _at_ suprathreshold contrasts
  • Contrast constancy (Georgeson et al. 1975, Brady
    et al. 1995).

54
Contrast Constancy Unmasked
55
Contrast Constancy Masked
56
Scaled CSF proportions
Contrast-matching proportions
Total C 0.18
57
What about suprathreshold distortions?
  • Can use CSF to generate an image with distortions
    _at_ threshold.
  • Detection thresholds reveal little about visual
    responses _at_ suprathreshold contrasts
  • Contrast constancy (Georgeson et al. 1975, Brady
    et al. 1995).
  • Must consider visual processing of image.

58
Visual Scale-Space Integration
  • Natural images ? many low frequencies.
  • Global-to-local analysis
  • Global Precedence (Navon 1977)
  • Global structure influences processing of local
    structure (Schyns Oliva 1994, Hucka Kaplan
    1996).
  • Edges tend to show power across scales
  • Perception of edge-structure requires a continuum
    across scale-space (Witkin 1983).
  • Global-to-local integration across scale-space
    (Hayes 1989).
  • Edge detection algorithms
  • Marr Hildreth (no integration) vs. Canny
    (integration).

59
(No Transcript)
60
Coarse
Intermediate
Fine
61
Coarse
Intermediate
Fine
62
(No Transcript)
63
Coarse
Intermediate
Fine
64
(No Transcript)
65
Coarse
Intermediate
Fine
66
(No Transcript)
67
What about suprathreshold distortions?
  • Proportion the contrasts to preserve continuity
    across scale space

CSNR ? Cimage / Cdistortions
68
What about suprathreshold distortions?
  • Proportion the contrasts to preserve continuity
    across scale space

CSNR ? Cimage / Cdistortions
69
What about suprathreshold distortions?
  • Proportion the contrasts to preserve continuity
    across scale space

CSNR ? Cimage / Cdistortions
70
Edge-Preserving Ratios
  • Contrast Signal-to-Noise Ratio

RMS contrast of image _at_ scale s, orientation ?
RMS contrast of distortions _at_ scale s,
orientation ?
71
Edge-Preserving Ratios
  • Contrast Signal-to-Noise Ratio
  • Model best CSNR curve _at_ a given visual
    distortion (VD)

RMS contrast of image _at_ scale s, orientation ?
RMS contrast of distortions _at_ scale s,
orientation ?
72
Application to Compression
  • Display model

73
Application to Compression
  • Display model
  • Relate MSE and RMS contrast in the image

74
Application to Compression
  • Display model
  • Relate MSE and RMS contrast in the image

T.S.A.
75
Application to Compression
  • Display model
  • Relate MSE and RMS contrast in the image

T.S.A.
76
Application to Compression
  • Display model
  • Relate MSE and RMS contrast in the image

T.S.A.
77
Application to Compression
  • Display model
  • Relate MSE and RMS contrast in the image

T.S.A.
where
78
Application to Compression
  • Display model
  • Relate MSE and RMS contrast in the image

where
79
Application to Compression
  • Display model
  • Relate MSE and RMS contrast in the image
  • MSE in subband _at_ (s, ? )

where
80
  • Measure Cimage(s,? ) or estimate using
  • Compute CSNR(s,?,VD ) using
  • Compute D (s,?,VD ) using

81
  • Measure Cimage(s,? ) or estimate using
  • Compute CSNR(s,?,VD ) using
  • Compute D (s,?,VD ) using

Visually tuned distortions for scalable coding
where
82
JPEG-2000
JPEG-2000 Contrast-Based
_at_ 0.4 bits/pixel
83
JPEG-2000 WMSE
JPEG-2000 Contrast-Based
_at_ 0.25 bits/pixel
84
JPEG-2000 WMSE
JPEG-2000 Contrast-Based
_at_ 0.1 bits/pixel
85
JPEG-2000 WMSE
JPEG-2000 Contrast-Based
_at_ 0.1 bits/pixel
86
(No Transcript)
87
(No Transcript)
88
(No Transcript)
89
Conclusions and Future Work
  • Can generate better-looking compressed images by
    accounting for properties of vision
  • Image and rate adaptive via CSNR(VD).
  • Implemented using MSE.
  • Integrates easily with embedded coders.

90
Conclusions and Future Work
  • Can generate better-looking compressed images by
    accounting for properties of vision
  • Image and rate adaptive via CSNR(VD).
  • Implemented using MSE.
  • Integrates easily with embedded coders.
  • Spatially local contrast-based quantization.

91
Conclusions and Future Work
  • Can generate better-looking compressed images by
    accounting for properties of vision
  • Image and rate adaptive via CSNR(VD).
  • Implemented using MSE.
  • Integrates easily with embedded coders.
  • Spatially local contrast-based quantization.
  • Quantify VD using psychophysical scaling.

92
Future Work Visual Distortion Metric
Same MSE
93
Future Work Visual Distortion Metric
94
Future Work Visual Distortion Metric
Pixel-value quantized
Pixel-value quantized dithered
Interpolated
95
Future Work Visual Distortion Metric
Pixel-value quantized
Pixel-value quantized dithered
Interpolated
96
Conclusions and Future Work
  • Can generate better-looking compressed images by
    accounting for properties of vision
  • Image and rate adaptive via CSNR(VD).
  • Implemented using MSE.
  • Integrates easily with embedded coders.
  • Spatially local contrast-based quantization.
  • Quantify VD using psychophysical scaling.
  • Compression of medical images.

97
Future Work Medical Image Compression
98
Future Work Medical Image Compression
99
Conclusions and Future Work
  • Can generate better-looking compressed images by
    accounting for properties of vision
  • Image and rate adaptive via CSNR(VD).
  • Display adaptive via contrast.
  • Integrates easily with embedded coders.
  • Spatially local contrast-based quantization.
  • Compression of medical images.
  • Quantify VD using psychophysical scaling.
  • Better model of natural images.

100
Future Work Better Building Blocks?
  • Ultimate coder
  • Digital image compression One index for every
    image that would be encountered.
  • Efficient visual encoding One cell for every
    image that would be encountered.
  • Enormous memory requirements.
  • Can we find a compromise between memory and
    efficiency?
  • Provide an index (or cell) only for the most
    likely image structures.

101
Future Work Better Building Blocks?
  • What are the most likely natural-image structures?

102
Future Work Better Building Blocks?
  • What are the most likely 8x8 natural-image
    structures?

103
Future Work Better Building Blocks?
  • Using 260K unique 8x8 puzzle pieces

104
Backup Slides
105
Lossy Image Compression
  • Transmit every other row and column

8 bits/pixel (original)
2 bits/pixel (reconstructed)
106
Lossy Image Compression
  • The pixel-values of natural images are highly
    correlated
  • Scale-invariance model (Field)
  • Amplitude( f ) ? 1 / f
  • Autocorrelation model (Girod et al.)
  • R(d) ? exp(-d)
  • ?(?) ? 1 / ?2
  • ? Amplitude(? ) ? 1 / ?

107
Contrast Sensitivity Function
Do these results hold when
  • Targets are wavelet subband quantization
    distortions?
  • These distortions are presented against a
    natural-image background?

108
Future Work Better Building Blocks?
  • Using 16K unique 32x32 puzzle pieces

109
Future Work Better Building Blocks?
  • Using 65K unique 16x16 puzzle pieces
Write a Comment
User Comments (0)
About PowerShow.com