Visual Object Recognition - PowerPoint PPT Presentation

About This Presentation
Title:

Visual Object Recognition

Description:

Visual Object Recognition – PowerPoint PPT presentation

Number of Views:264
Avg rating:3.0/5.0
Slides: 39
Provided by: robfe
Learn more at: https://cs.nyu.edu
Category:

less

Transcript and Presenter's Notes

Title: Visual Object Recognition


1
Visual Object Recognition
  • Rob Fergus
  • Courant Institute, New York University

http//cs.nyu.edu/fergus/icml_tutorial/
2
Agenda
  • Introduction
  • Bag-of-words models
  • Visual words with spatial location
  • Part-based models
  • Discriminative methods
  • Segmentation and recognition
  • Recognition-based image retrieval
  • Datasets Conclusions

3
Recognizing and Learning Object Categories Year
2007
  • Li Fei-Fei, Princeton
  • Rob Fergus, NYU
  • Antonio Torralba, MIT

http//people.csail.mit.edu/torralba/shortCourseRL
OC
4
Agenda
  • Introduction
  • Bag-of-words models
  • Visual words with spatial location
  • Part-based models
  • Discriminative methods
  • Segmentation and recognition
  • Recognition-based image retrieval
  • Datasets Conclusions

5
So what does object recognition involve?
6
Classification are there street-lights in the
image?
7
Detection localize the street-lights in the image
8
Object categorization
mountain
tree
building
banner
street lamp
vendor
people
9
Scene and context categorization
  • outdoor
  • city

10
Application Assisted driving
Pedestrian and car detection
Lane detection
  • Collision warning systems with adaptive cruise
    control,
  • Lane departure warning systems,
  • Rear object detection systems,

11
ApplicationComputational photography
12
Application Improving online search
Query STREET
Organizing photo collections
13
Challenges 1 view point variation
Michelangelo 1475-1564
14
Challenges 2 scale
15
Challenges 3 illumination
slide credit S. Ullman
16
Challenges 4 background clutter
Bruegel, 1564
17
Challenges 5 occlusion
http//lh5.ggpht.com/_wJc6t2hDl2M/RrL7Gh6sS7I/AAAA
AAAAAYY/n3xaHc2opls/DSC00633.JPG
18
Challenges 6 deformation
http//img.timeinc.net/time/asia/magazine/2007/111
2/racehorse_1112.jpg
Xu, Beihong 1943
19
History single object recognition
Object 1
Object 2
Object 3
20
David Lowe 1985
Single object recognition history Geometric
methods
Rothwell et al. 1992
21
Single object recognition history
Appearance-based methods
  • Murase Nayer 1995
  • Schmid Mohr 1997
  • Lowe, et al. 1999, 2003
  • Mahamud and Herbert, 2000
  • Ferrari et al. 2004
  • Rothganger et al. 2004
  • Moreels and Perona, 2005

22
Challenges 7 intra-class variation
Shoe class
Instance 1
Instance 2
Instance 3
23
History early object categorization
24
  • Fischler, Elschlager, 1973
  • Turk and Pentland, 1991
  • Belhumeur, Hespanha, Kriegman, 1997
  • Rowley Kanade, 1998
  • Schneiderman Kanade 2004
  • Viola and Jones, 2000
  • Heisele et al., 2001
  • Amit and Geman, 1999
  • LeCun et al. 1998
  • Belongie and Malik, 2002
  • DeCoste and Scholkopf, 2002
  • Simard et al. 2003
  • Poggio et al. 1993
  • Argawal and Roth, 2002
  • Schneiderman Kanade, 2004
  • ..

25
10,000 to 30,000
26
Three main issues
  • Representation
  • How to represent an object category
  • Learning
  • How to form the classifier, given training data
  • Recognition
  • How the classifier is to be used on novel data

27
Representation
  • Generative / discriminative / hybrid

28
Representation
  • Generative / discriminative / hybrid
  • Appearance only or location and appearance

29
Representation
  • Generative / discriminative / hybrid
  • Appearance only or location and appearance
  • Invariances
  • View point
  • Illumination
  • Occlusion
  • Scale
  • Deformation
  • Clutter
  • etc.

30
Representation
  • Generative / discriminative / hybrid
  • Appearance only or location and appearance
  • Invariances
  • Part-based or
  • global with sub-window

31
Representation
  • Generative / discriminative / hybrid
  • Appearance only or location and appearance
  • Invariances
  • Parts or global w/sub-window
  • Use set of features or each pixel in image

32
Learning
  • Unclear how to model categories, so learn rather
    than manually specify

33
Learning
  • Unclear how to model categories, so learn rather
    than manually specify
  • Methods of training generative vs. discriminative

34
Learning
  • Unclear how to model categories, so learn rather
    than manually specify
  • Methods of training generative vs.
    discriminative
  • Level of supervision
  • Manual segmentation bounding box image labels
    noisy labels

Contains a motorbike
35
Learning
  • Unclear how to model categories, so learn rather
    than manually specify
  • Methods of training generative vs.
    discriminative
  • Level of supervision
  • Manual segmentation bounding box image labels
    noisy labels
  • -- Training images
  • Issue of over-fitting (typically limited training
    data)
  • Negative images for discriminative methods

36
Learning
  • Unclear how to model categories, so learn rather
    than manually specify
  • Methods of training generative vs.
    discriminative
  • Level of supervision
  • Manual segmentation bounding box image labels
    noisy labels
  • -- Training images
  • Issue of over-fitting (typically limited training
    data)
  • Negative images for discriminative methods
  • -- Priors

37
Recognition
  • Scale / orientation range to search over
  • Speed
  • Context

38
Recognition
  • Context enables pruning of detector output

Hoiem, Efros, Herbert, 2006
Write a Comment
User Comments (0)
About PowerShow.com