Artificial Intelligence: Vision - PowerPoint PPT Presentation

1 / 17
About This Presentation
Title:

Artificial Intelligence: Vision

Description:

Another 'mundane' task involves being able to make sense of what we see. ... from visual signal (digitized image) to representation of the scene adequate for ... – PowerPoint PPT presentation

Number of Views:48
Avg rating:3.0/5.0
Slides: 18
Provided by: supp161
Category:

less

Transcript and Presenter's Notes

Title: Artificial Intelligence: Vision


1
Artificial Intelligence Vision
  • Lecture 12
  • Intro
  • Stages of analysis
  • Low level vision

2
Introduction
  • Another mundane task involves being able to
    make sense of what we see.
  • We can handle images of objects
  • differing in
  • size
  • orientation
  • colour
  • lighting
  • expression (for faces etc)
  • obscured by other objects
  • .. And recognise the objects in the scene, and
    what is happening in the scene.

3
Vision task
  • Ultimate task
  • from visual signal (digitized image) to
    representation of the scene adequate for carrying
    out actions on the objects on the scene.
  • E.g., image of parts of device --gt representation
    of location, orientation, shape, type etc of
    parts enabling robot to assemble device.
  • More limited task
  • recognise objects (from limited set) - is it a
    widget, wodget or wadget?

4
Stages of processing
  • Like NLP, we are mapping from an unstructured raw
    signal to a structured meaningful representation.
  • Like NLP we do it in stages
  • Digitisation - raw data -gt digitised image (e.g.,
    2d array of intensity/brightness)
  • Low level processing - identify features like
    lines/edges from the raw image.
  • Medium level - determine distances and
    orientation of surfaces.
  • High level - Create useful high level
    representation (e.g., 3-d models, with objects
    and parts identified)

5
Low level Processing
  • Ignore digitisation. First task then is to
    extract some primitive features from the image.
  • We might have a 512x512 image, where each image
    point (pixel) has a certain image intensity or
    brightness, represented by a number 0-255.
  • For colour need three numbers per image point
    (blue, green, red), but start just considering
    bw.
  • We start with a grey-level image. Image
    intensity sometimes called grey level.

6
Edge Detection
  • Consider the image below
  • First task is to find the edges of the image.
    We obtain, from the array of grey levels, a
    sketch consisting of a number of lines.

7
Simplifying..
  • Lets see what it might look like as an array of
    intensity values (ignoring door, window)
  • Edges occur where the intensity value changes
    significantly.
  • We find the difference between intensity values
    at neighbouring points, and if large, mark poss.
    edge.

8
Applying difference operation
  • Just considering horizontal differences, and
    marking when the difference is greater than a
    threshold of 3, we get the following
  • Have found vertical sides of house and bits of
    roof.
  • But how to we set the threshold?

9
Difference Operators
  • Consider intensities 0 0 5 11 23 27 27
  • Just taking difference and thresholding would
    give 0 0 1 1 1 1 0. - Edge covers large region.
  • Hard to come up with fixed thresholds that avoids
    this.
  • We really want to find the points in the image
    where the intensity is changing most rapidly.
  • Mathematically when the rate of change of
    intensity is at a maximum, this corresponds to
    where the second derivative is zero..

10
Difference Operators
  • Image Intensity
  • Rate of change of image(first derivative)
  • Rate of change of this.(second derivative)
  • Edge

11
Edge Detection
  • Edge detection therefore often based on finding
    second derivative of imagine intensity, then
    finding zero crossings in this resulting array.
  • This method is not sensitive to the values at
    which thresholds are set in looking for big
    changes in intensity.

12
Line Fitting
  • Weve now got a simplified image with points
    corresponding to edges in an image marked in.
  • Next task is to get from that to a set of lines.
  • This reduces the amount of data and gets closer
    to useful representation.

13
Simple Approach Tracking
  • Find an edge point.
  • Look at all surrounding points to find connected
    edge points.
  • Keep going while the points you are finding form
    a straight line. When no more points in that
    direction, stop and make last one end point of
    line.

14
Problems..
  • What about
  • curved lines
  • obscured lines (e.g., edge of an object, when
    parts of that edge are obscured by another
    object).
  • Solution is to try and find candidate likes such
    that the number of edge points falling on that
    line is maximised.
  • Formally this is done with something called Hough
    Transform. Will attempt to sketch idea informally.

15
Line Fitting
  • Suppose we have a possible line expressed as an
    equation ymxc
  • We want to find values for m and c that mean,
    over all possible values of x and y on that line,
    as many as possible are edge points.
  • E.g., consider y3x
  • Two points on line, sopossible line?

16
Line fitting
  • Approach generalises well to curves.
  • Just use equation of curve, not line.
  • Also works fine given breaks in line.

17
Summary
  • Vision - from grey level image to recognised
    object and model of scene.
  • Start with low level vision
  • Find candidate edge points where intensity level
    changes quickly.
  • Find lines (where many edge points fall on
    possible line).
Write a Comment
User Comments (0)
About PowerShow.com