Title: Motion Segmentation over Image Sequences Using Multiway Cuts and Affine Transformations
1Motion Segmentation over Image Sequences Using
Multiway Cuts and Affine Transformations
2Organization
Motion Segmentation
Energy minimization, graph cuts, multiway cuts
Motion models, affine transformation
Combining multiway cuts and affine transformations
Algorithms and results
3Motion segmentation
- Image segmentation by color, texture, shape,
and motion - Motion very important cue
- Divide image into regions exhibiting relatively
different coherent motion - Ideal motion
- segmentation
4Why motion segmentation?
- Number of applications
- Robotics
- Video coding/compression
- Video indexing/retrieval
- Object tracking, surveillance
- Intermediate image processing task output given
to high level computer vision problems
5Motion representation
- Pixel motion represented by a 2D vectorEither
dense (optical flow) or sparse (features) - Models
translation 2 parameters
affine 6 parameters
6Lucas-Kanade affine estimation
- Minimize nonlinear equation
- Iterative linear solution (Newtons method)
current image
2x2 matrix
2x1 vector
residue
pixel location
next image
unknown parameters (A, d), 6x1 vector
6x6 gradient matrix
6x1 error vector
Lucas Kanade, 1981 Shi Tomasi, 1994
7(No Transcript)
8Energy minimization
- Motion segmentation as energy minimization
- challenge Thousands of dimensions!
- Solution Graph cut methods Boykov, Veksler,
and Zabih 1999 - accurate
- fast
Smoothness penalties
Data penalties
9Image Correspondence
- Motion segmentation comes under the general
category of image correspondence - Goal of image correspondence assign labels to
every pixel in the image - Energy functions can be devised once labels are
defined and listed - What do labels mean?
10Stereo Correspondence
11Motion
12Binary cut
- Maximum flow-minimum cut, Ford-Fulkerson 1956
- graphs constructed - a node per pixel
- source and sink terminals binary variables 0 and
1 - t-link weight data penalty, n-link weight
smoothness penalty - Minimum s-t cut, pixels get label 0 or 1 based on
what links are cut
t-link
n-link
cut n-link
13Multiway cuts
- More than 2 labels typical for motion and stereo
multiway cuts - Repeated binary cuts by forming binary graphs for
every pair of labels alpha-beta swap - Repeated binary cuts by forming binary graphs for
a particular label and the existing label, for
all labels alpha expansion
14Multiway cuts
15Parent algorithm
- Multiway cut for stereo and motion with slanted
surfaces, Birchfield and Tomasi 1999 - Combines multiway cuts and affine transformation
- Works iteratively by progressive refinement of
displacement functions of labels - Algorithm re-implemented, proposed algorithms are
extensions to this paper
16(No Transcript)
17(No Transcript)
18Motion segmentation over image sequences
- Parent algorithm when employed on sequence of
images, does not produce consistent results - Also computationally inefficient to exhaustively
search over all translational displacement
functions for every frame.
frame1, 5 segments
frame2, 6 segments
19Changes to parent algorithm
result from previous frame pair
control number of loop iterations
do affine merge at the end
20Algorithm
frame t
frame t1
parent algorithm, affine merge
- Run parent algorithm on first frame and get
correct number of segments, parameters are fixed - Set number of iteration loop for affine update of
displacement functions to a constant - Initial motion segment image for next frame is
predicted by affine warping of current motion
segments and re-estimation of displacement
functions - Final iterative affine merge step merges
neighboring regions
predict label image
initialize parent algorithm for next frame
frame t1
frame t2
21- Affine merging of regions if neighbor regions
within threshold, then merge if number of
segments is still more relax threshold and repeat - This step similar to over segmentation step but
does not involve energy computation, hence
threshold dependent
22- Results for frames 2, 10, 19 and 25 are shown.
- Number of motion segments maintained
- Algorithm took 71.18 seconds to run on 27 frames,
parent algorithm took 97.04 seconds - Boundaries between segments are not crisp due to
occlusions and lack of texture
23- Taxi sequence, algorithm works well for frames 1
to 36. Frame 5, 18 are shown. - Failure for 36 to 40 due to small motion of taxi
and two components for right vehicle - Frame 40 failure of affine merge shown
- Right vehicle segmentation poor due to occlusion
by tree
24Hard constraint points for stereo
- Another extension to stereo correspondence
- Cost functional not able to preserve small and
thin long objects in depth maps - Multiway cuts smoothes out small regions
25Normalized correlation
- Normalized correlation performed, unambiguous
disparity points are chosen and initialized as
hard constraint points - These points initialize multiway cuts, number of
iterations of affine updating is controlled
sum of squared differences inside window
scan line
left image
right image
clear minimum
ambiguous minimum
26(No Transcript)
27Occlusion detection
- Errors of regions in between motion layers due to
movement of foreground over background - Selective occlusion detection is done using
estimated affine parameters - Assumption multiway cuts labels occluded areas
with the label of the foreground -
28- Compute residues of region a and region b based
on affine parameters of both region 1 and region
2 and pick the worst. - The occluded region has the worst residue because
it has no matching region in the next frame.
29(No Transcript)
30Conclusions
- Studied, analyzed and implemented multiway cuts
and affine transformation techniques - All implementation in C from scratch, using
Kolmogorov and Blepo - Two extensions to the parent algorithm motion
segmentation over image sequences and hard
constraint points for stereo - Simple occlusion detection presented
- Results are reasonable
31Future work
- Spatiotemporal multiway cuts for segmenting
object in the video volume - Redesigning cost functionals to improve
segmentation results - Integrate occlusion detection with multiway cuts
for getting cleaner borders.
32Thank You