Composition-Guided Image Acquisition presentation

About This Presentation

Transcript and Presenter's Notes

Title: Composition-Guided Image Acquisition

1
Composition-Guided Image Acquisition
Serene Banerjee Ph.D. Defense, April 28th, 2004
http//www.ece.utexas.edu/serene
Committee Members Prof. Ross Baldick Prof.
Alan C. Bovik Prof. Brian L. Evans
(Advisor) Prof. Wilson S. Geisler Prof. Joydeep
Ghosh Prof. Robert W. Heath, Jr.
Computer Engineering Curriculum Track Dept. of
Electrical and Computer Engineering The
University of Texas at Austin
2
One day Alice came to a fork in the road and saw
a Cheshire cat in a tree. Which Road do I
take? she asked. Where do you want to go? was
his response. I dont know, Alice answered.
Then, said the cat, it doesnt matter.
Lewis Carroll Alice in Wonderland
3
Outline

Introduction
Motivation
Overview of contributions
Summary of previous research for main subject
detection
Contributions
Online main subject detection
Aesthetic enhancements, given main subject
Blur background objects merging with main subject
Conclusions

4
Motivation

Problem Amateur photographers take unappealing
pictures (e.g. personal and business use)
Help users take better pictures with digital
cameras

5
Enhance Picture Appeal

Improving photograph appeal Savakis, Etz Loui
2000
Photographic composition
Objective measures
People/expression
Examples of photographic composition rules

6
Enhance Acquired Picture Appeal

Goal Provide well-composed alternative pictures
during image acquisition in digital still cameras
Solution Framework for in-camera automation of
photographic composition rules
Acquire picture user intended to take
Locate main subject by combining optical and
digital image processing on a supplementary
picture
Apply composition rules to user-intended picture
Place main subject according to rule-of-thirds
Blur entire background given main subject
location
Blur background objects that merge with main
subject
User takes intended picture and framework also
returns three alternative pictures

7
Offline Main Subject Detection

Neural network based training Luo, Etz, Singhal
Gray 2000-2001
Cluster multi-level wavelet coefficients Wang et
al. 1999-2001
Iterative classification from variance maps Won,
Pyan Gray 2002

Algorithm Training complexity Runtime complexity
Neural network Difficult to form widely applicable training set High (e.g. feature extraction, grouping)
Wavelet-based No training required High (e.g. wavelet, k-means clustering)
Variance-based No training required High (e.g. iterations, watershed)
8
Automating Composition Rules
Original color image

In-camera online framework
Provide alternatives to user during image
acquisition
One-pass low-complexity algorithms Banerjee
Evans 2003-04
Independent of scene content and setting
Amenable to fixed-point implementation
Match processing on digital still cameras

Detect main subject
Supplementary picture
Generated picture with rule-of-thirds
Rule-of-thirds
Background blur
Generated picture with blur
Mitigate merger
Generated picture without mergers
9
Digital Still Cameras

Converts optical image to electric signal
Software control
Shutter aperture and speed
Focus
Zoom
White balance
Additional hardware could control
Camera angle
Aspect ratio landscape or portrait

10
Outline

Introduction
Contributions
Online main subject detection
In-camera segmentation of the main subject
Low-complexity one-pass algorithm
Amenable to implementation in digital still
cameras
Aesthetic enhancement, given main subject
Mitigation of mergers with background objects
Conclusions

11
Online Main Subject Detection
Contribution 1

Auto-focus main subject
Take supplementary picture
Open shutter aperture (takes 1s) to blur objects
not in focus
In-focus edges stronger than out-of-focus edges
Process supplementary picture to find main
subject mask
Enhance in-focus edges
Detect strong edges
Close boundary

Scene
Auto-focus filter
Open shutter for blur
Supplementary picture
Compute intensity
3x3 Highpass filter
Detect sharper edges
Close boundary
Binary main subject mask
12
Main Subject Detection Formulation
Contribution 1

Supplementary picture has intensity function, I
IH and IL are highpass and lowpass versions
For background image, contribution from IL is
greater
Goal Identify pixels contributing high
frequencies
I is modeled as mixture of IH and IL
Highpass filtering of I enhances main subject
edges

where k ? 1
13
Step 1 Enhance In-focus Edges
Contribution 1

Subtract smoothed image from sharpened one
Strong edges in main subject, weak edges in
background

Highboost image

S
-
Edge-enhanced image with stronger main subject
edges
Supplementary image
Lowpass image
14
Step 2 Detect Strong Edges
Contribution 1

Canny edge detector detects strong edges Canny
1986
Selects weak edges only if they are connected to
strong edges
Laplacian of Gaussian detector Burt Adelson
1983
Selects edges based on zero crossings of second
derivative
Either detects weak and strong edges or
eliminates weak edges from main subject (depends
on threshold)

Laplacian of Gaussian
Canny edge detector
15
Step 3 Generate Mask
Contribution 1

Goal Generate closed contour from strong edges
Gradient vector flow Xu, Yezzi Prince 2001
Balances forces
Internal spline characteristics
External normal of gradient of detected strong
edges
Outer boundary of detected sharp edges is initial
contour
Change shape of initial contour, depending on
gradient
Approximate lower complexity method
Select leftmost rightmost ON pixel and
make row pixels in between them ON
Can detect convex regions but fails at concavities

16
Main Subject Detection Results
Contribution 1
Supplementary image
Step 2 Strong edge detection
Step 3a Gradient of strong edges
Step 1 Edge map
Step 3b Gradient vector flow field
Step 3c Initial contour
Step 3d Contour after 5 iterations (not
mandatory)
Main subject mask
17
Implementation Complexity
Contribution 1

Per-pixel complexity for algorithm Banerjee
Evans 2003-04
Multi-level wavelet based Wang, Lee, Gray,
Wiederhold 1999-2001
Variance of multi-level wavelet coefficients 2X
increase
k-means clustering 2(image size)(no. of
iterations)X increase
Iterative classification from variance maps Won
et al. 2002
Iterative maximum a posteriori segmentation 3X
increase
Watershed refinement 6 passes per pixel

Process/Operation Multiply-accumulates Compares Memory accesses
Pre-filtering (3x3) 9
Edge detection 9 2 5
Close boundary 2 1
18
Comparison With Previous Methods
Contribution 1
Proposed algorithm
Original image
Banerjee Evans 2003-4
19
Limitations
Contribution 1

Frequency-based features not applicable if
Main subject does not have enough high
frequencies
Background not blurry enough
Could incorporate region-based features

Example of an image where the proposed algorithm
fails to detect the main subject, the flower
20
Outline

Introduction
Contributions
Main subject detection
Aesthetic enhancement, given main subject
Reposition main subject to follow rule-of-thirds
Simulate background blur for motion or clarity
Mitigation of mergers with background objects
Conclusions

21
Rule-of-Thirds
Contribution 2

Better interaction of main subject with image
background
Center of mass of main subject at 1/3 or 2/3
picture width (or height) from the left (or top)
edge

Main subject in center of picture
Main subject follows rule-of-thirds
Outdoor setting the flower is main subject
22
Rule-of-Thirds Algorithm
Contribution 2

Compute center-of-mass of main subject
2 multiply-accumulates, 1 memory read per pixel
1 division per image
Locate closest one-third corner
8 compares per image (4 comparisons of (x,y)
points)
Shift picture so center-of-mass falls at desired
corner
Mirror undefined boundary pixels
Best case no change to image
Worst case 1/3 rows/columns need to be shifted
Average (main subject in middle) shift 1/6
rows/columns
0 to 2 memory accesses per pixel

23
Ideal Background Blur Example
Contribution 2
Background blur emphasizes main subject, the
shell, and aids in constrained image
communication
Indoor setting no humans in picture
24
Simulated Background Blur
Contribution 2

Possible camera blurs
Background blur shutter aperture
Linear blur subject or camera motion
Radial blur camera rotation
Zoom change in zoom
Digital alternatives
Original image masked with detected main subject
mask
Region of interest filtering performed on
non-masked pixels
Complexity 9 multiply-accumulates and 4 memory
accesses per pixel for convolution with symmetric
3x3 filter

25
Results (1)
Contribution 2
Supplementary image with main subject(s) in focus
Detected main subject mask
Rule-of-Thirds Main subject repositioned
Simulated background blur
Outdoor setting human main subject
26
Results (2)
Contribution 2
Supplementary image with main subject(s) in focus
Detected main subject mask
Rule-of-Thirds Main subject repositioned
Simulated background blur
Outdoor setting human main subject
27
Results (3)
Contribution 2
Supplementary image with main subject(s) in focus
Detected main subject mask
Rule-of-Thirds Main subject repositioned
Simulated background blur
Indoor setting no human subjects
28
Outline

Introduction
Contributions
Main subject detection
Aesthetic enhancement, given main subject
Mitigation of mergers with background objects
Framework for background analysis and merger
detection
Low-complexity one-pass algorithm for merger
mitigation
Conclusions

29
Ideal Merger Mitigation Example
Contribution 3
Unwanted mergers avoided
Background bar merges with gymnasts hand
30
Mitigation of Mergers Overview
Contribution 3

Goal Identify background objects merging with
main subject
In-focus background object
Connected to main subject mask
Large area relative to image size
Merger detection
Color segmentation based on hue
Identify distracting background object based on
distance to main subject and frequency content
Blur merging background objects to induce a sense
of distance

Merging background objects trees and bush over
right shoulder
31
Segmentation of Background Objects
Contribution 3

Hues above histogram average are dominant hues
Background is a mixture of dominant hues
Thresholds average of two consecutive dominant
hues

Histogram of background hues and identified
objects
Background hues
Thresholds 87, 151
32
Merger Object Detection
Contribution 3

Define Frequency Inverse Distance Measure for
each disjoint background object Oi
Decreases with nearest distance (di) from main
subject
Increases with high spatial frequency
coefficients (?iH)
Merged object Object with highest transform value

33
Measure Selection
Contribution 3

Linear, division, and exponential forms to
combine
High frequencies computed with residual in
Gaussian pyramid decomposition
Euclidean distance measured from main subject
mask

Attribute Linear Divisional Exponential
Computational complexity Low High High
Merged objects size Large Small Small
34
Merger Mitigation Results
Contribution 3
Background tree and bush merging with main subject
Blurred tree and bush appear to be farther away
High frequency and inv. distance values for
background
35
Per-pixel Implementation Complexity
Contribution 3
Process /Operation Multiply-accumulates Compares Memory accesses
RGB to hue 3 6 4
Histogram and thresholding 1 2
RGB to intensity 2
Gaussian pyramid 9 4
Approx. inv. distance measure 2 1 2
Detect merged object 1 1
Gaussian pyramid reconstruction 9 1 5
TOTAL 27 11 15
For comparison, JPEG compression takes 60
operations/pixel
36
System Prototype
Scene
Original color image
Color Gaussian pyramid
Transform coefficients
Inverse distance transform
Supplementary image
Compute intensity
Grayscale image
Grayscale image
Background segmentation
X
3x3 Highpass filter
Detect sharper edges
Close boundary
Intensity Gaussian pyramid
Measure how close rule-of-thirds followed
Binary main subject mask
Detect merging object
Automate rule-of-thirds
Simulate background blur
Reconstruct color pyramid
Generated picture with rule-of-thirds
Generated picture with blur
Merger mitigated picture
37
Conclusion

Contributions
Combined optical/digital image acquisition
Provide online feedback to amateur photographers
Low-complexity one-pass method for main subject
detection
Rule-of-thirds placement of the main subject on
the canvas
Simulated background blur motion and
depth-of-field
Mitigation of mergers with background objects
Deliverables
Prototype development for digital still image
acquisition
Copies of MATLAB code, slides, and papers,
available at
http//www.ece.utexas.edu/bevans/projects/dsc/in
dex.html

38
Future Work

Automate other photographic composition rules
Best zoom
Available frames, lines of interest, best angle,
balanced picture
Extension for video acquisition
Frame-by-frame basis
Compressed domain
Digital image stabilization Subject mask as
feature
Potential research impact
Video cameras, Surveillance, Image/video
retrieval, Constrained image/video communication,
Main subject detection for specific applications

Write a Comment

User Comments (0)

About PowerShow.com

Composition-Guided Image Acquisition PowerPoint PPT Presentation