Computer Vision - PowerPoint PPT Presentation

1 / 42
About This Presentation
Title:

Computer Vision

Description:

Title: Slide 1 Author: hkpu Last modified by: T43 Created Date: 8/31/2005 10:44:29 AM Document presentation format: On-screen Show Company: hkpu Other titles – PowerPoint PPT presentation

Number of Views:116
Avg rating:3.0/5.0
Slides: 43
Provided by: hkpu
Category:

less

Transcript and Presenter's Notes

Title: Computer Vision


1
Computer Vision
  • Filename eie426-computer-vision-0809.ppt

2
Contents
  • Perception generally
  • Image formation
  • Color vision
  • Edge detection
  • Image segmentation
  • Visual attention
  • 2D ? 3D
  • Object recognition

3
Perception generally
  • Stimulus (percept) S, World W
  • S g(W)
  • E.g., g graphics."
  • Can we do vision as inverse graphics?
  • W g-1(S)
  • Problem massive ambiguity!

Missing depth information!
4
Better approaches
  • Bayesian inference of world configurations
  • P(WS) P(SW) x P(W) / P(S) a x P(SW) x
    P(W)
  • graphics prior
    knowledge
  • Better still no need to recover exact scene!
  • Just extract information needed for
  • navigation
  • manipulation
  • recognition/identification

5
Vision subsystems
Vision requires combining multiple cues
6
Image formation
  • P is a point in the scene, with coordinates (X
    Y Z)
  • P is its image on the image plane, with
    coordinates (x y z)
  • x -fX/Z y -fY/Z (by similar triangles)
  • Scale/distance is indeterminate!

7
Len systems
f the focal length of the lens
8
Images
9
Images (cont.)
  • I(x y t) is the intensity at (x y) at time t
  • CCD camera 4,000,000 pixels human eyes
    240,000,000 pixels

10
Color vision
  • Intensity varies with frequency ?
    infinite-dimensional signal
  • Human eye has three types of color-sensitive
    cells each integrates the signal ? 3-element
    vector intensity

11
Color vision (cont.)
12
Edge detection
  • Edges are straight lines or curves in the image
    plane across which there is significant changes
    in image brightness.
  • The goal of edge detection is to abstract away
    from messy, multi-megabyte image and towards a
    more compact, abstract representation.

13
Edge detection (cont.)
  • Edges in image ? discontinuities in scene
  • 1) Depth discontinuities
  • 2) surface orientation
  • 3) reflectance (surface markings) discontinuities
  • 4) illumination discontinuities (shadows, etc.)

14
Edge detection (cont.)
15
Edge detection (cont.)
  • Sobel operator

the location of the origin (the image pixel to be
processed)
Other operators Roberts (2x2), Prewitt (3x3),
Isotropic (3x3)
16
Edge detection (cont.)
A color picture of a steam engine.
The Sobel operator applied to that image.
2013-12-3
16
EIE426-AICV
17
Edge detection application 1
An edge extraction based method to produce the
pen-and-ink like drawings from photos
2013-12-3
17
EIE426-AICV
18
Edge detection application 2


Leaf (vein pattern) characterization

19
Image segmentation
  • In computer vision, segmentation refers to the
    process of partitioning a digital image into
    multiple segments (sets of pixels).
  • The goal of segmentation is to simplify and/or
    change the representation of an image into
    something that is more meaningful and easier to
    analyze.
  • Image segmentation is typically used to locate
    objects and boundaries (lines, curves, etc.) in
    images. More precisely, image segmentation is the
    process of assigning a label to every pixel in an
    image such that pixels with the same label share
    certain visual characteristics.

20
Image segmentation (cont.)
21
Image segmentation the quadtree partition based
split-and-merge algorithm
  • Split into four disjoined quadrants any region Ri
    where P(Ri) FALSE.
  • Merge any adjacent regions Ri and Rk for which
    P(Ri ? Rk ) TRUE and
  • Stop when no further merging or splitting is
    possible.
  • P(Ri) TRUE if all pixels in Ri have the same
    intensity or are uniform in some measure.

22
Image segmentation the quadtree partition based
split-and-merge algorithm (cont.)
23
Visual attention
  • Attention is the cognitive process of selectively
    concentrating on one aspect of the environment
    while ignoring other things.
  • Attention mechanism of human vision system has
    been applied to serve machine visual system for
    sampling data nonuniformly and utilizing its
    computational resources efficiently.

24
Visual attention (cont.)
  • The visual attention mechanism may have at least
    the following basic components
  • (1) the selection of a region of interest in the
    visual field (2) the selection of feature
    dimensions and values of interest (3) the
    control of information flow through the network
    of neurons that constitutes the visual system
    and (4) the shifting from one selected region to
    the next in time.

25
Attention-driven object extraction
The more attentive a object/region, the higher
priority it has
26
Attention-driven object extraction (cont.)
Objects 1, 2, ,background
27
Motion
  • The rate of apparent motion can tell us something
    about distance. A nearer object has a larger
    motion.
  • Object tracking

28
Motion Estimation
2013-12-3
28
EIE426-AICV
29
Stereo
The nearest point of the pyramid is shifted to
the left in the right image and to the right in
the left image. Disparity (x difference in two
images) ?? Depth
30
Disparity and depth
31
Disparity and depth (cont.)
Depth is inversely proportional to disparity.
32
Example Electronic eyes for the blind
33
Example Electronic eyes for the blind (cont.)
34
Example Electronic eyes for the blind (cont.)
Left x549 Right x476 ?73
Left x333 Right x273 ?60
35
Texture
  • Texture a spatially repeating pattern on a
    surface that can be sensed visually.
  • Examples the pattern windows on a building, the
    stitches on a sweater,
  • The spots on a leopards skin, grass on a lawn,
    etc.

36
Edge and vertex types
  • and - labels represent convex and concave
    edges, respectively. These are associated with
    surface normal discontinuities wherein both
    surfaces that meet along the edge are visible.
  • A ? or a ? represents an occluding convex
    edge. As one moves in the direction of the arrow,
    the (visible) surfaces are to the right.
  • A ?? or a ?? represents a limb. Here, the
    surface curves smoothly around to occlude itself.
    As one moves in the direction of the twin arrow,
    the (visible) surfaces lies to the right.

37
Object recognition
  • Simple idea
  • - extract 3-D shapes from image
  • - match against shape library
  • Problems
  • - extracting curved surfaces from image
  • - representing shape of extracted object
  • - representing shape and variability of library
    object classes
  • - improper segmentation, occlusion
  • - unknown illumination, shadows, markings,
    noise, complexity, etc.
  • Approaches
  • - index into library by measuring invariant
    properties of objects
  • - alignment of image feature with projected
    library object feature
  • - match image against multiple stored views
    (aspects) of library object
  • - machine learning methods based on image
    statistics

38
Biometric identification
  • Criminal investigations and access control for
    restricted facilities require the ability to
    indentify unique individuals.

(the blueish area)
39
Content-based image retrieval
  • The application of computer vision to the image
    retrieval problem, that is, the problem of
    searching for digital images in large databases.
  • Content-based means that the search will
    analyze the actual contents of the image. The
    term content in this context might refer to
    colors, shapes, textures, or any other
    information that can be derived from the image
    itself. Without the ability to examine image
    content, searches must rely on metadata such as
    captions or keywords, which may be laborious or
    expensive to produce.

40
Content-based image retrieval (cont.)
  • http//labs.systemone.at/retrievr/?sketchName2009
    -03-26-01-22-37-828150.3sketchName2009-03-26-01-
    23-35-358087.4

41
Handwritten digit recognition
  • 3-nearest-neighbor 2.4 error
  • 400-300-10 unit MLP (a neural network approach)
    1.6 error
  • LeNet 768-192-30-10 unit MLP 0.9 error

42
Summary
  • Vision is hard -- noise, ambiguity, complexity
  • Prior knowledge is essential to constrain the
    problem
  • Need to combine multiple cues motion, contour,
    shading, texture, stereo
  • Library object representation shape vs.
    aspects
  • Image/object matching features, lines, regions,
    etc.
Write a Comment
User Comments (0)
About PowerShow.com