RealTime VLSI Architecture for Detection of Moving Object Using Wronskian Determinant - PowerPoint PPT Presentation

1 / 35
About This Presentation
Title:

RealTime VLSI Architecture for Detection of Moving Object Using Wronskian Determinant

Description:

Real-Time VLSI Architecture for Detection of Moving Object Using Wronskian ... The most instinctive technique is Frame Differencing followed by thresholding. ... – PowerPoint PPT presentation

Number of Views:425
Avg rating:3.0/5.0
Slides: 36
Provided by: cacsLou
Category:

less

Transcript and Presenter's Notes

Title: RealTime VLSI Architecture for Detection of Moving Object Using Wronskian Determinant


1
Real-Time VLSI Architecture for Detection of
Moving Object Using Wronskian Determinant
  • R. Aguilar-Ponce, J. Tessier, C. Emmela, A.
    Baker, J. Das, J.L. Tecpanecatl-Xihuitl, A. Kumar
    and Magdy Bayoumi
  • Center for Advance Computer Studies
  • University of Louisiana at Lafayette

2
Agenda
1. Introduction
2. Proposed Architecture
3. Results
4. Conclusion
3
Introduction
  • Change detection takes one or several references
    frames and models the background and foreground
    of the image.

Background Subtraction Technique
  • Detect
  • Moving objects
  • Appearing objects
  • Disappearing objects
  • Discard
  • Changes due to global illumination variations
  • Shadow cast by moving objects

4
Introduction
  • Applications that extract high level information
    from raw data, i.e. video stream require accurate
    and robust Change Detection Systems.
  • Such applications include
  • Video surveillance
  • Remote sensing
  • Object-based video coding
  • Smart cameras

5
Introduction
  • Video Surveillance Systems must determined when
    an intruder has appear on the scene
  • Tracking of moving automobiles and persons are
    issues of interests on these systems.
  • In order to achieve these task, change detection
    must be performed

6
Introduction
  • Handheld devices such as cellular phones or PDAs
    include acquisition, storage and/or transmission
    of images. In order to achieve these operations,
    images must be compressed.
  • In an Object-based Video Coding approach a scene
    is represented as a composition of objects, which
    can be independently processed and coded.
  • In the object-based approach, the moving objects
    in the video scene are extracted, and each object
    is represented by its shape, motion, and texture.

7
Introduction
  • While todays digital cameras capture images,
    smart cameras capture high-level descriptions of
    the scene and analyze what they see
  • A smart camera combines video sensing, high-level
    video processing and communication within a
    single embedded device.

8
Goal
  • Change detection has been performed purely in
    software.
  • The problem of object detection, however, becomes
    critical in the upcoming wireless visual sensors
    because of size and power constraints.
  • The need for low-power, small size, hardware
    implementations is greatly felt.
  • This paper introduces a VLSI architecture for
    Wronskian Change Detector (WCD).

9
Background Subtraction Techniques
  • The most instinctive technique is Frame
    Differencing followed by thresholding.
  • Change is detected if the difference of the
    corresponding pixels exceeds a preset threshold.
  • The advantage of this technique is its low
    computational complexity, however it is very
    susceptible to noise and illumination changes.

10
Background Subtraction Techniques
  • Median filter is one of the most popular
    background subtraction techniques.
  • Median of each pixel of all the frames in the
    buffer constitutes the background estimation.
  • Background pixels are considered to be those that
    stay on more than half of the frames on the
    buffer.
  • However, this technique requires a buffer large
    enough to store L frames.

11
Background Subtraction Techniques
  • Mixture of Gaussian is a recursive background
    technique, that recursively updates the
    background model based on each input frame.
  • This method models each background pixel by a
    mixture of K Gaussian distributions (K is a
    number between 3 and 5).
  • Different Gaussians are assumed to represent
    different colors. The probable background colors
    are the ones that stay longer and more static.
  • This technique is computationally intensive its
    parameters require careful tuning and it is very
    sensitive to sudden changes in global
    illumination. Any error in the background
    estimation can remain for a long period due to
    its recursive nature

12
Background Subtraction Techniques
  • Wronskian Change Detector employs the Wronskian
    of intensity ratios as a measure of change.
  • A large mean or large variance of the intensity
    ratios increases the Wronskian value.
  • This method can detect object interiors and
    structural changes. Also, WCD is robust against
    illumination changes.
  • WCD is a suitable algorithm to be implemented in
    real-time due to its low complexity. Also, this
    technique requires only one previous frame
    therefore it is appropriate for applications
    where resources are limited

13
Background Subtraction Techniques
14
Background Subtraction Techniques
15
Background Subtraction Techniques
Frame Differencing
Median Filter
Wronskian Change Detector
16
Wronskian Change Detector
  • In order to determine if a change has occurred, a
    region of support is assigned to each pixel.
  • The size of the region of support can vary from 3
    3, 5 5 and 9 9 pixels

17
Wronskian Change Detector
Window size 3 3
Window size 5 5
Window size 9 9
18
Wronskian Change Detector
  • Wronskian Change Detector employs the following
    equation
  • W(x/y) detects changes corresponding to dark
    zones, while its inverse ration W(y/x) finds if a
    change has occurred in bright zones. Therefore,
    computing both values allows robust detection
    against global illumination changes.
  • In our simulations, sizes of region of support
    larger than 3 do not provide better results but
    increases the computational complexity. Therefore
    a fixed value of 3 is employed in our approach.

19
NTSC and PAL Standards
  • American Video standard, National Television
    System Committee (NTSC).
  • The NTSC standard displays 60 fields per second.
    Each field is composed by even and odd lines.
  • The NTSC signal transmits the odd fields first
    and then the even fields
  • The even and odd fields are displayed
    sequentially, thus interlacing the full frame.
  • PAL (Phase Alternation by Line) standard is the
    dominant television standard in Europe.
  • The distinction between these standards is that
    color is handled differently.

20
NTSC/PAL
Odd Field
Even Field
21
Wronskian Change Detector
where
Previous Frame
Current Frame
22
Proposed Architecture
  • Proposed architecture is composed by three units
  • Processing unit
  • Main Controller
  • Memory Unit
  • Decoder and encoder are used to process both
    standards NTSC and PAL

NTSC/PAL
Memory Unit
Decoder
Main Controller
Processing Unit
Pipeline Processing Element
Adder Tree
Queue 1
Encoder
Queue 2
VGA Output
23
Pipeline Processing Element
  • To achieve a low-power implementation a 8-bit
    unsigned integer arithmetic was used.
  • There are two main concerns
  • The first one is how to capture the range of the
    function with only 8-bit unsigned arithmetic.
  • The second concern is guaranteeing precision,
    considering that threshold values are in the
    range of 0.6 to 0.7 to detect a change

24
Pipeline Processing Element
  • The PE must be designed to capture the range of
    D(xi,yi) that could indicate a change.
  • Therefore, the equation must be scaled so that an
    unsigned 8-bit integer threshold can be used and
    all overflows are saturated.
  • Only the partial range of D(xi,yi) where
  • THmin D(xi,yi) nTHmax
  • is significant, where THmin and THmax are the
    minimum and maximum threshold to be used

25
Pipeline Processing Element
  • This solved the problem of precision, but creates
    results that are too large to add n times.
  • For that reason, the five least significant bit
    of the product are discarded after
    multiplication, and the rest of the bits are
    employed as the result

26
Pipeline Processing Element
  • The implementation of the system is done with a
    fixed region of support size of 3 3.
  • The main components of the PE are divider, adder
    and multiplier.
  • Multiplication is done by Booth algorithm because
    it represents a good trade-off between speed and
    power for 8 bit fixed point arithmetic
  • Integer division using 8 conditional subtractors
    is simple and fast enough for our application
  • The architecture is capable of analyze frame size
    of 640 480 pixels

27
Processing Unit
  • The design uses control signals to pad the image
    by grounding the bus whenever it is required.
  • The adder trees sum PE outputs to produce the
    final results.
  • These results are compared to the threshold, and
    the change/no change bits are then stored into
    the output frame

28
Main Controller
  • The system is managed by the control unit. The
    controller has three states
  • Process, the system input and calculates
    Wronskian value
  • Display shows the output through the encoder
  • Idle, the process unit does not performed any
    action
  • The maximum frame rate is 15 frames per second.
    If the application require less than 15 fps, the
    system will remain idle for the rest of the
    frames of a second

29
Memory Unit
  • For storing the preceding frame data, we used a
    300Kb memory.
  • Another memory of same size is required to store
    the output values. T
  • The memory is addressed by 19 bits that includes
    field index for a frame, vertical addressing and
    horizontal addressing.
  • The values for 2-pixels, i.e., 16 bits of data is
    read and stored at a time.

30
Implementation
  • Implementation of the proposed architecture was
    done in VHDL using Mentor Graphic Modelsim
    Simulator.
  • Synthesis was done using Synopsis Synthesis Tools
    targeting Xilinx Virtex II XCV800 FPGA.
  • The XSV-800 board can accept PAL, SECAM, or NTSC
    video with up to 9-bits of resolution on the red,
    green, and blue channels and can output video
    images through a 110 MHz, 24-bit digital to
    analog converter.
  • Two independent banks of 512K x 16 SRAM are
    provided for local buffering of signals and data

31
Simulation Results
32
Simulation Results
  • Most of the area and power consumption is
    occupied by the processing unit.
  • The adder tree is fully asynchronous and its
    maximum delay of critical path is 12 ns.
  • The PE is synchronous with four stages of
    asynchronous logic.
  • One stage's maximum delay of critical path is 27
    ns.
  • The total power consumption of the system is 121
    mW. The total area of the system in LUT is 2312
    (7247 slices).

33
Conclusion
  • A background subtraction architecture using the
    Wronskian Change Detector algorithm has been
    presented for VLSI realization in wireless visual
    applications.
  • The proposed architecture consists of three
    units processing, memory and controller.
  • The processing unit is composed by pipeline
    processing element that performs the basic
    operation.

34
Conclusion
  • Partial results are stored and used on the adder
    tree to obtain final results.
  • Memory unit consists of two buffers, one stored
    the previous frame (Frame Buffer) and the other
    stored partial results and output.
  • The architecture is capable of computing
    Wronskian, Conjugate Wronskian and both.
  • The maximum frame rate is 15 fps. The power
    dissipated by the whole system is 121 mW. The
    total area of the system in LUT is 2312.

35
Thank you
Write a Comment
User Comments (0)
About PowerShow.com