Reducing Complexity in Signal Processing Algorithms for Communication Receiver and Image Display Software - PowerPoint PPT Presentation

About This Presentation
Title:

Reducing Complexity in Signal Processing Algorithms for Communication Receiver and Image Display Software

Description:

Wireless Networking and Communications Group Reducing Complexity in Signal Processing Algorithms for Communication Receiver and Image Display Software – PowerPoint PPT presentation

Number of Views:264
Avg rating:3.0/5.0
Slides: 37
Provided by: Bria4178
Category:

less

Transcript and Presenter's Notes

Title: Reducing Complexity in Signal Processing Algorithms for Communication Receiver and Image Display Software


1
Reducing Complexity inSignal Processing
Algorithms forCommunication Receiver andImage
Display Software
Wireless Networking and Communications Group
Brian L. Evans Prof. Brian L. Evans
Seminar at the American University of Beirut
27 July 2010
2
Outline
  • Embedded digital systems
  • Generating sinusoidal waveforms
  • Discrete-time filters
  • Multicarrier equalizers
  • Image halftoning algorithms
  • Conclusion

2004
2005
2006
2007
2008
2009
2010
3
Embedded Digital Systems
  • Often work on application-specific tasks
  • In consumer products (2008 units)
  • 1200M cell phones 70M DSL modems
  • 300M PCs 55M
    cars/light trucks
  • 100M digital cameras 30M gaming consoles
    (2007)
  • 100M DVD players
  • iPhone has six programmable processors (2008)
  • Embedded programmable processors
  • Inexpensive with small area and volume
  • Predictable off-chip input/output (I/O) rates
  • Low power (TI C5504 45mW _at_ 300MHz)

Limited on-chip memory Fixed-point arithmetic
4
Embedded Digital Systems
  • Memory access in processors
  • External I/O block data transfers to/from
    on-chip memory
  • Internal I/O on-chip memory to CPU registers
    using data buses (e.g. TI C6000 processor has two
    32-bit data buses)
  • Common word sizes for signal processing software
  • 64-bit floating-point for desktop computing (e.g.
    Matlab)
  • 32-bit floating-point for pro-audio and sonar
    beamforming
  • 16-bit fixed-point for speech, consumer audio,
    image proc.
  • IEEE floating-point operations
  • Handles many special cases (e.g. 8, -8 and not a
    number)
  • Add, multiply, divide have comparable hardware
    complexity

5
Embedded Digital Systems
  • Fixed-pointoperations
  • Multiplicationbased on addition operations
  • Division takes 1-2instructions perbit of
    accuracy
  • Multiplication canconsume muchdynamic power
  • Truncate constantsfor power savings

56
Multiplier used in TI C64 processors
Han, Evans Swartzlander, 2005
6
Generating Sinusoidal Waveforms
  • Sample continuous-time cosine signal at rate fs
  • Discrete-time fixed frequency ?0 2 ? f0 / fs
  • Example f0 1200 Hz and fs 8000 Hz, ?0 3/10
    ?
  • Discrete-time realization drops fs term in front
    of cosine
  • Math library call to cos function in C
  • Uses double-precision floating-point arithmetic
  • No standard in C for internal implementation
  • Generally meant for high-accuracy desktop
    calculations
  • Call to gsl_sf_cos_e in GNU scientific library
    1.8
  • 20 multiply, 30 add, 2 divide, 2 power
    calculations/output

7
Generating Sinusoidal Waveforms
  • Difference equation with input xn and output
    yn
  • yn (2 cos ?0) yn-1 - yn-2 xn - (cos
    ?0) xn-1
  • From inverse z-transform of z-transform of cos(?0
    n) un
  • Impulse response gives cos(?0 n) un
  • 2 multiplications and 3 adds per output value
  • Buildup in error as n increases due to feedback
  • Lookup table pre-compute samples offline
  • Discrete-time frequency ?0 2 ? f0 / fs 2 ? N
    / L
  • All common factors between integers N and L
    removed
  • ? 2 ? k 2 ? (N / L) n ? n L ? store L
    samples
  • Entries in either floating-point or fixed-point
    format
  • Table would contain N periods of the cosine

Initial conditions are all zero
8
Generating Sinusoidal Waveforms
  • Signal quality vs. implementation complexity in
    generating cos(?0 n) un with ?0 2 ? N / L

Method MACs/ sample ROM (words) RAM (words) Quality in floating pt. Quality in fixed point
C math library call 30 22 1 Second Best N/A
Difference equation 2 2 3 Worst Second Best
Lookup table 0 L 0 Best Best
MAC Multiplication-accumulationRAM Random
Access Memory (writeable) ROM Read-Only
Memory
9
Discrete-Time Filters
  • Finite impulse response (FIR) filter
  • Impulse response hk has finite extent k 0,,
    M-1

xk-1
z-1
z-1
z-1

xk

h0
h1
h2
hM-1
S
yk
Discrete-time convolution
10
Discrete-Time Filters
  • Infinite impulse response (IIR) filter
  • Biquad building block 2 poles and 0-2 zeros
  • Generally, coefficients a1, a2, b0, b1, b2 are
    real-valued

Biquad is short for biquadratic- transfer
function is ratio of two quadratic polynomials
11
Discrete-Time Filters
FIR Filters IIR Filters
Implementation complexity (1) Higher Lower (sometimes by factor of four)
Minimum order design Parks-McClellan (Remez exchange) algorithm (2) Elliptic design algorithm
Stable? Always May become unstablewhen implemented (3)
Linear phase If impulse response is symmetric or anti-symmetric about midpoint No, but phase may made approximately linear over passband (or other band)
(1) For same piecewise constant magnitude
specification(2) Algorithm to estimate minimum
order for Parks-McClellan algorithm by
Kaiser may be off by 10. Search for minimum
order is often needed.(3) Algorithms can tune
design to implementation target to minimize risk
12
Discrete-Time Filters
  • Keep roots computed by filter design algorithms
  • Polynomial deflation (rooting) reliable in
    floating-point
  • Polynomial inflation (expansion) may degrade
    roots
  • Choice of IIR filter structure matters
  • Direct form IIR structures expand zeros and
    poles, and may become unstable for large order
    filters (order gt 12)
  • Cascade of biquads expands zeros and poles in
    each biquad
  • Minimum order design not always most efficient
  • Efficiency depends on target implementation
  • Consider power-of-two coefficient design
  • Efficient designs may require search of 8 design
    space

13
Halftime AUB Summer 2005
  • EECE 503 Real-Time DSP Lab
  • Embedded digital systems
  • Generating sinusoidal waveforms
  • Discrete-time filters
  • Multicarrier equalizers
  • Image halftoning algorithms
  • Conclusion

14
Channel Equalization
  • Channel degrades transmitted signal
  • Nonlinear distortion, e.g. amplitude
    nonlinearities
  • Linear distortion, e.g. convolution by channel
    impulse response
  • Additive noise, e.g. thermal (Gaussian) and
    impulsive
  • Equalization compensates linear distortion
  • Spreading/attenuation in time
  • Magnitude/phase distortion in frequency

15
Multicarrier Modulation
  • Divide channel into narrowband subchannels
  • Discrete multitone modulation
  • Baseband transmission based on fast Fourier
    transform (FFT)
  • Each subchannel carries single-carrier
    transmission
  • Standardized for digital subscriber line (DSL)
    communication

channel
carrier
magnitude
subchannel
frequency
Subchannels are 4.3 kHz wide in DSL systems
16
Channel Equalization
nk
Channel
Equalizer
  • Equalizer
  • Shortens channelimpulse response(time domain
    eq.)
  • Compensates phase/magnitude distortion(freq.
    domain eq.)
  • Single carrier system g is scalar constant
  • FIR filter w performs time and frequency domain
    equalization
  • Multicarrier system g is FIR filter of length
    n1
  • Time domain equalizer (w) then FFT freq.
    domain equalizer

yk
xk
rk
ek
h
w



Training signal
-
Ideal Channel
Receiver generates xk
g
z-?
Discretized Baseband System
Equalization in DSL receivers increases bit rate
by 10x
17
Multicarrier Equalization
  • Maximum shortening SNR time domain equalizer
  • Minimize energy leakage outside shortened channel
    length
  • For each position of window ? Melsa, Younce
    Rohrs, 1996
  • Cholesky decomposition of Bleads to optimal
    eigensolution
  • Computationally-intensive O(Lw3)
  • Floating-point multiplications/divisions
  • Restricts TEQ length to be less than n1

18
Time Domain Equalizer Design
Bit Rate (Mbps)
TEQ length of 17 Data rates averaged over eight
standard DSL test lines Martin et al., 2006
Training complexity in log10(multiply-add
operations)
Most efficient floating-point versions of
algorithms used
19
Time Domain Equalizer Design
  • Unified framework Martin et al., 2006
  • A and B are square (Lw ? Lw) and depend on choice
    of ?
  • Constraint prevents trivial non-practical
    solution w 0
  • Find eigenvector for largest generalized
    eigenvalue
  • Formulation
  • Power method
  • Alternating
  • Lagrangian

Iterative Methods
division-free
20 iterations to converge for 17-tap MSSNR TEQ
design
20
Digital Image Halftoning
  • For display on devices with fewer bits
    ofgray/color resolution than original image
  • Grayscale 8-bit image to 1-bit image
  • Color 24-bit RGB image to 12-bit RGB display
  • Produces artifacts

Each pixel in original image is 8-bit unsigned
intensity in 0, 255 For display, 0 is black
and 255 is white
21
Quantization with Feedback
  • Consider 4-bit data on 2-bit display (unsigned)
  • Feedback quantization error
  • For constant input 1001 9
  • Average output value
  • ¼ (10101011) 1001
  • 4-bit resolution at DC !
  • Noise shaping
  • Truncating from 4 to 2 bits increases noise by
    12dB
  • Feedback removes noise at DC increases HF noise

Adder Inputs
OutputTime Upper Lower Sum to display
1 1001 00 1001 10 2
1001 01 1010 10 3 1001
10 1011 10 4 1001 11
1100 11
Periodic
22
Error Diffusion Halftoning
  • Quantize each pixel
  • Diffuse filtered quantization error to future
    pixels

current pixel
Floyd Steinberg, 1976
error filter weights
23
Error Diffusion Halftoning
Artifact Model Compensation Added Complexity
Sharpening Linear Sharpnesscontrol 1 multiplication and1 addition per pixel
False textures Nonlinear Deterministic bit flipping quantization 1 comparison per pixel
  • Deterministic bit flipping quantizer
    (DBF)Damera-Venkata Evans, 2001
  • Thresholds input to black (0) or white (255)
  • Flip quantized value about mid-gray (128)
  • Reduces false textures in mid-grays
  • Implemented with two comparisons

DBF(x)
255
x
128
x2
x1
24
Sharpness Control
  • Model quantizer as gain plus noise Kite, Evans
    Bovik, 1997
  • Signal transfer function models sharpening
  • Ks 2 for Floyd-Steinberg
  • Noise transfer function models noise-shaping
  • Kn 1

Ks 2
Plots for ideal lowpass H(?)
25
Sharpness Control
  • Adjust by threshold modulation Eschbach Knox,
    1991
  • Scale image by gain L and add it to quantizer
    input
  • Flatten signal transfer function Kite, Evans
    Bovik, 2000

26
Results
Floyd-Steinberg
Original
DBF quantizer
Unsharpened
27
Conclusion
  • Processor architecture
  • Decrease data sizes to reduce on-chip memory
    usage and increase data bus efficiency
  • Truncate multiplicand constants to reduce power
  • Compute signal values by recursion or lookup
    table
  • Algorithm design
  • Keep offline design results in full precision
    until end
  • Order of calculations matters in implementation
  • Exploit problem structure in developing
    fixed-point algorithms
  • Linearize nonlinear systems to leverage linear
    system methods
  • Many other ways to reduce complexity exist

28
Invitations
  • Panel discussion on graduate studies
  • Tomorrow (Wednesday) 130 230 pm in this room
    (RCR)
  • Panelists Prof. Zaher Dawy (AUB), Prof. Imad
    El-Hajj (AUB) and Prof. Brian Evans (UT Austin)
  • IEEE Workshop on Signal Processing Systems
  • Early October 2011
  • Short walk from the AUB campus
  • Organizers include Prof. Magdy Bayoumi (Univ. of
    Louisiana at Lafayette), Prof. Brian Evans (UT
    Austin), Dean Ibrahim Hajj (AUB) and Prof.
    Mohammad Mansour (AUB)

29
Thank You!
30
Digital Signal Processors
DSP Processor Market
  • Market
  • 1/3 of 25B embedded digital signal processing
    market
  • 2007 cholesterol loweringPzifer Lipitor sales
    13B
  • Applications (2007)

Source Forward Concepts
Source Forward Concepts
31
Screening (Masking) Methods
Introduction
  • Periodic thresholds to binarize image
  • Periodic application leads to aliasing (gridding
    effect)
  • Clustered dot screening is more resistant to ink
    spread
  • Dispersed dot screening has higher spatial
    resolution
  • Blue larger masks (e.g. 1 by 1)

Clustered dot mask
Dispersed dot mask
index
Threshold Lookup Table
32
Linear Gain Model for Quantizer
  • Extend sigma-delta modulation analysis to 2-D
  • Linear gain model for quantizer in 1-D Ardalan
    and Paulos, 1988
  • Linear gain model for grayscale image Kite,
    Evans, Bovik, 1997
  • Error diffusion is modeled as linear,
    shift-invariant
  • Signal transfer function (STF) quantizer acts as
    scalar gain
  • Noise transfer function (NTF) quantizer acts as
    additive noise


us(m)
Ks us(m)
Signal Path
u(m)
b(m)
n(m)
un(m)
un(m) n(m)
Noise Path
33
Spatial Domain
34
Magnitude Spectra
35
Human Visual System Modeling
  • Contrast at particular spatialfrequency for
    visibility
  • Bandpass non-dimbackgroundsManos Sakrison,
    1974 1978
  • Lowpass high-luminance officesettings with
    low-contrast imagesGeorgeson G. Sullivan,
    1975
  • Exponential decay Näsäsen, 1984
  • Modified lowpass versione.g. J. Sullivan, Ray
    Miller, 1990
  • Angular dependence cosinefunction Sullivan,
    Miller Pios, 1993

36
Linear Gain Model for Quantizer
Analysis and Modeling
  • Best linear fit for Ks between quantizer input
    u(m) and halftone b(m)
  • Stable for Floyd-Steinberg
  • Can use average value to estimate Ks from only
    error filter
  • Sharpening proportional to Ks Kite, Evans
    Bovik, 2000
  • Value of Ks Floyd Steinberg lt Stucki lt Jarvis
  • Weighted SNR using unsharpened halftone
  • Floyd-Steinberg gt Stucki gt Jarvis at all viewing
    distances
Write a Comment
User Comments (0)
About PowerShow.com