SCSC 585 Imaging and Image Representation - PowerPoint PPT Presentation

1 / 28
About This Presentation
Title:

SCSC 585 Imaging and Image Representation

Description:

Receptors in human eye sensitive to radiation (light) with wave lengths ranging ... The coordinates of 3D corner point B relative to the object frame are [xb,0,zb] ... – PowerPoint PPT presentation

Number of Views:20
Avg rating:3.0/5.0
Slides: 29
Provided by: SVAND6
Category:

less

Transcript and Presenter's Notes

Title: SCSC 585 Imaging and Image Representation


1
SCSC 585 Imaging and Image Representation
  • Sebastian van Delden
  • USC Upstate
  • svandelden_at_uscupstate.edu

2
Sensing Light
  • Receptors in human eye sensitive to radiation
    (light) with wave lengths ranging from about 400
    nanometers (violet) to 800 nanometers (red).

3
Simple Model of Common Photography
  • Light reaches
  • surfaces in 3D
  • Surfaces reflect
  • Sensor element
  • receives light energy
  • Intensity counts
  • Angles count
  • Material counts

4
Imaging Devices
  • CCD Camera
  • Charge-coupled Device (CCD)
  • Instead of chemical film that reacts to light
    (like 35mm film), tiny solid cells convert light
    energy into electrical charge.

5
CCD Cameras
  • Camera inputs
  • to frame buffer
  • a high speed
  • image store
  • which can
  • store several
  • images

6
Image Formation
  • Each point of 3D scene is projected through the
    center of projection or lens center onto the
    image plane.
  • Pin-Hole camera model
  • Lenses
  • Light Collector light reaches image point from
    entire cone of rays
  • Circle of Confusion spot on image plane is
    blurred.
  • Each discrete CCD sensor receives light from
    several neighboring 3D points.

7
CCD Arrays
  • Usually about 1 cm x 1 cm 640 x 480 pixels or
    512 x 512 pixels actual width 0.001 inch.
  • Other ways to position sensor cells

8
Video Cameras
  • Sequences of images at 30 frames per second
  • 60 half frames per second are used all odd rows
    followed by all even rows
  • MPEG (Motion Picture Expert Group) compression
    scheme (based on spatial and temporal redundancy)
    to reduce the amount of data that would need to
    be transmitted
  • Consider a 640x480 CCD array where each sensor
    stores 1 byte. 30 frames per seconds equals
    9216000 bytes almost 1 GB per second.

9
The Human Eye
  • 100M sensing elts in retina
  • Rods sense intensity
  • Cones sense color
  • Fovea has tightly packed elts, more cones
  • Periphery has more rods
  • Focal length is about 20mm
  • Pupil/iris controls light entry
  • Eye scans, or saccades to image details on fovea
  • 100M sensing cells funnel to 1M optic nerve
    connections to the brain

10
Problem with Digital Images
  • Blooming
  • Difficult to insulate adjacent sensing elements.
  • Charge often leaks from hot cells to neighbors,
    making bright regions larger.

11
Problem with Digital Images
  • Clipping or Wraparound
  • Dark grid intersections at left were actually
    brightest of scene.
  • In A/D conversion the bright values were clipped
    to lower values.

12
Problem with Digital Images
  • Lens distortion distorts image
  • Barrel distortion of rectangular grid is common
    for cheap lenses (50)
  • Precision lenses can cost 1000 or more.
  • Zoom lenses often show severe distortion.

13
Problem with Digital Images
  • CCD Variations
  • CCD sensors imperfections can cause different
    reading for the same light intensity.
  • Chromatic Distortion
  • Different light wavelength bent differently.
  • Quantization Effects
  • Mixing and Rounding problems.

14
Resolution
  • Nominal Resolution of a CCD sensor is the size of
    the scene that images to a pixel.
  • Resolution
  • Pixels per unit length (nominal resolution)
  • Another definition number of pixels captured.
  • Field of View (FOV) size of scene that can be
    captured often denoted with angles.

15
Resolution cont
  • Resolution decreases by one half in cases at left
  • Human faces can be recognized at 64 x 64 pixels
    per face

16
Spatial Quantization Problems
  • Both pixel size relative position make a
    difference.
  • Mixed pixel represents a mixture of intensity
    values in a real scene.
  • Small features can be lost or blended.

17
Digital Image Formats
  • Portable Gray Map (PGM/PBM/PPM) older form
  • GIF was early commercial version
  • JPEG (JPG) is modern version
  • Many others exist
  • Do they handle color?
  • Do they provide for compression?
  • Need to have size parameters pixels

18
Portable Bit Map Image (PGM)
  • P2 means
  • ASCII gray
  • Comments
  • W16 H8
  • 192 is max
  • intensity
  • Can be made
  • with editor

19
Compression
  • Lossless if a decompression method exists to
    precisely recover the original image.
  • Lossy information is lost during compression
    and cannot to recovered during decompression
    (JPG, GIF)
  • GIF (Graphics Interchange Format) only 8 bits
    used for color can contain transparency and
    animation.
  • JPG (Joint Photographic Experts Group) for high
    quality images considers human vision systems
    and uses discrete cosine transform and Huffman
    coding.

20
Run-Length Encoding
  • Label consecutive runs of like value
  • Consider a binary image row
  • Image Row 00000000111110001111000000111
  • Run Code A 8(0)5(1)3(0)4(1)6(0)3(1)
  • Run Code B (8,12) (16,19) (26,28)

21
3D Structure from 2D Images
  • Interposition if object A is in front of object
    B then object is closer to the camera than object
    B
  • Vanishing point - Where parallel lines meet in a
    image.
  • Foreshortening closer objects appear larger
    than farther away ones.
  • Texture Gradient texture of an object changes
    with distance.

22
(No Transcript)
23
Five frames of reference
W world O object (P, B) C camera F real
image I pixel image
24
Pixel Coordinate Frame (I)
  • In the pixel array, each point has integer pixel
    coordinates.
  • Pixel coordinate frame can be useful in obtaining
    information about the object such as texture,
    color, intensity etc.
  • Pixel coordinates however cannot to be used to
    obtain size of objects and depth information.

25
Object coordinate frame (O)
  • An object coordinate frame is used to model ideal
    objects for computer graphics and computer
    vision.
  • The coordinates of 3D corner point B relative to
    the object frame are xb,0,zb.
  • The object coordinate frame is needed to inspect
    an object, for example, to check if a particular
    hole relatively in the correct place.

26
Camera coordinate frame (C)
  • The camera coordinate frame C is often needed for
    egocentric view (cameracentric).
  • Helps represent whether the object is in front of
    the camera or moving etc.
  • Computer graphics systems allow the user to
    select different camera views of the 3D scene
    being viewed.

27
Real image coordinate frame (F)
  • 3D points project to the real image plane at
    coordinates xf,yf,f, where f is the focal
    length. xf,yf are not subscripts of the pixels in
    the image array but relate to the pixel size and
    pixel position along the optical axis. xf,yf are
    based on the world coordinate system.
  • Frame F contains the picture function that is
    digitized to form the digital image I.

28
World coordinate frame (W)
  • The world coordinate frame is needed to relate
    objects in 3D.
  • Humans robots etc operate based on the world
    coordinate system.
Write a Comment
User Comments (0)
About PowerShow.com