• No results found

Visual Object Recognition

N/A
N/A
Protected

Academic year: 2021

Share "Visual Object Recognition"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

67777

part

part 5 5: categorization : categorization

VisualObject Recognition -67777

(2)

67777

Main problem with bag

Main problem with bag--of of--words words

VisualObject Recognition -67777

2

o All have equal probability for bag-of-words methods

o Location information is important

(3)

67777

Constellation Models: Parts and Structure

VisualObject Recognition -67777

3

(4)

67777

Representation Representation

o Model has two components

 parts

(2D image fragments)

 structure

(configuration of parts)

VisualObject Recognition -67777

4

o Issues:

 How to model location

 How to represent appearance

 Sparse or dense (pixels or regions)

 How to handle occlusion/clutter

Figure from [Fischler73]

(5)

67777

Example scheme Example scheme

o Model shape using Gaussian distribution on location between parts

o Model appearance as pixel templates

VisualObject Recognition -67777

5

(6)

67777

Sparse representation

+ Computationally tractable (105 pixels  101 -- 102 parts) + Generative representation of class

+ Avoid modeling global variability + Success in specific object recognition

VisualObject Recognition -67777

6

- Throw away most image information

- Parts need to be distinctive to separate from other classes

(7)

67777

The correspondence problem The correspondence problem

o Model with P parts

o Image with N possible locations for each part

VisualObject Recognition -67777

7

 NP combinations!!!

(8)

67777

Different graph structures Different graph structures

1

3

4 5

2

1

3 6

2

1

3

4

5

6 2

VisualObject Recognition -67777

8

4 5

6 4 5

Fully

connected Star structure

4 6

Tree structure

O(N6) O(N2) O(N2)

• Sparser graphs cannot capture all interactions between parts

(9)

67777

Spatial Models Considered Here Spatial Models Considered Here

x1

x6 x2

“Star” shape model

x1

x6 x2

Fully connected shape model

VisualObject Recognition -67777

9

x3 x4

x5 x3

x4 x5

e.g. Constellation Model

Parts fully connected

Recognition complexity: O(NP)

Method: Exhaustive search

e.g. ISM

Parts mutually independent

Recognition complexity: O(NP)

Method: Gen. Hough Transform

(10)

67777

How to model location?

How to model location?

o

Explicit: Probability density functions

o

Implicit: Voting scheme

o

Invariance

VisualObject Recognition -67777

10

o

Invariance

 Translation

 Scaling

 Similarity/affine

 Viewpoint

Similarity transformationTranslation and ScalingAffine transformationTranslation

(11)

67777

Learning situations Learning situations

o Varying levels of supervision

 Unsupervised

 Image labels

 Object centroid/bounding box

 Segmented object

Contains a motorbike

VisualObject Recognition -67777

11

 Segmented object

 Manual correspondence (typically sub-optimal)

o Generative models naturally incorporate labelling information (or lack of it)

o Discriminative schemes require labels for all data points

(12)

67777VisualObject Recognition -67777

12

(13)

67777

Constellation Model Constellation Model

Gaussian shape pdf Gaussian part appearance pdf

Gaussian relative scale pdf

Log(scale)

VisualObject Recognition -67777

13 Uniform shape pdf

Clutter model

Gaussian appearance pdf Uniform relative scale pdf

Log(scale)

Slide credit: Grauman & Leibe

(14)

67777

o Task: Estimation of model parameters

Learning using EM

o Let the assignments be a hidden variable and use EM algorithm to o Chicken and Egg type problem, since we initially know neither:

 Model parameters

 Assignment of regions to parts

VisualObject Recognition -67777

14

o Let the assignments be a hidden variable and use EM algorithm to learn them and the model parameters

(15)

67777

Learning procedure

E-step: Compute assignments for which regions belong to which part M-step: Update model parameters

o Find regions & their location & appearance

o Initialize model parameters

o Use EM and iterate to convergence:

Trying to maximize likelihood – consistency in shape & appearance

VisualObject Recognition -67777

o Trying to maximize likelihood – consistency in shape & appearance

15

(16)

67777

Example scheme, using EM for maximum Example scheme, using EM for maximum

likelihood learning likelihood learning

1. Current estimate of θ

...

2. Assign probabilities to constellations

Large P

pdf

VisualObject Recognition -67777

16

Image 1 Image 2 Image i

Small P

3. Use probabilities as weights to re-estimate parameters. Example: µµµµ

Large P x + Small P x

new estimate of µµµµ + … =

(17)

67777

Example: Motorbikes Example: Motorbikes

VisualObject Recognition -67777

17

Slide credit: Grauman & Leibe

(18)

67777

Example: Motorbikes ( Example: Motorbikes (2 2))

VisualObject Recognition -67777

18

Slide credit: Grauman & Leibe

(19)

67777

Example: Spotted Cats Example: Spotted Cats

VisualObject Recognition -67777

19

Slide credit: Grauman & Leibe

(20)

67777

Implicit Shape Model (ISM) Implicit Shape Model (ISM)

o Basic ideas

 Learn an appearance codebook

 Learn a star-topology structural model

(Features are considered independent given obj. center)

x1

x3

x4

x6

x5

x2

VisualObject Recognition -67777

o Algorithm: probabilistic Gen. Hough Transform

20

Slide credit: Grauman & Leibe

(21)

67777

Codebook Representation Codebook Representation

o

Extraction of local object features

 Interest Points (e.g. Harris detector)

 Sparse representation of the object appearance

VisualObject Recognition -67777

o

Collect features from whole training set

o

Example:

21

Slide credit: Grauman & Leibe

(22)

67777

Appearance Codebook Appearance Codebook

VisualObject Recognition -67777

 Visual similarity preserved

 Wheel parts, window corners, fenders, ...

 Store cluster centers as Appearance Codebook

22

Slide credit: Grauman & Leibe

(23)

67777

Gen. Hough Transform with Local Gen. Hough Transform with Local

Features Features

o

For every feature, store possible “occurrences”

– Object identity – Pose

– Relative position

VisualObject Recognition -67777

23

– Relative position

For new image, let the matched features vote for possible object positions

Slide credit: Grauman & Leibe

(24)

67777

Implicit Shape Model

Implicit Shape Model -- Recognition Recognition

Interest Points Matched Codebook Entries

Probabilistic Voting

y

VisualObject Recognition -67777

24

Slide credit: Grauman & Leibe

3D Voting Space (continuous)

x s

(25)

67777

Implicit Shape Model

Implicit Shape Model -- Recognition Recognition

Interest Points Matched Codebook Entries

Probabilistic Voting

y

VisualObject Recognition -67777

Backprojected Hypotheses

3D Voting Space (continuous)

x s

Backprojection of Maxima

25 Slide credit: Grauman & Leibe

(26)

67777

Discussion: Constellation Model Discussion: Constellation Model

o Advantages

 Works well for many different object categories

 Can adapt well to categories where

 Shape is more important

 Appearance is more important

 Everything is learned from training data

VisualObject Recognition -67777

 Weakly-supervised training possible

o Disadvantages

 Model contains many parameters that need to be estimated

 Cost increases exponentially with increasing number of parameters

⇒ Fully connected model restricted to small number of parts.

26

Slide credit: Grauman & Leibe

(27)

67777

Summary Summary

o Main issues in categorization:

 Representation - how to represent an object category

 Learning - how to form the classifier, given training data

 Recognition - how the classifier is to be used on novel data

VisualObject Recognition -67777

27

 Recognition - how the classifier is to be used on novel data

o Methods reviewed here

 Bag of features

 Parts and structure (via generative models)

 Discriminative methods

References

Related documents

Additional components such as problem identification, locating information, evaluation or analysis of information, as well as synthesis of information may all act as elements

Since the local level model is silent about the drivers of the estimated r ∗ , we investigate the rise and fall of r ∗ by estimating a Panel ECM using data on the observed real

(2014) reported in their parametric study on the continuum level that the grain misorientation has significant influence on the mechanical response as well as the hydrogen diffusion

The personality types do not differ from each other in terms of punishment, while mean values of three subscales of worry, positive beliefs about worry, and uncontrollability

The opinion must be “written to the standards of the profession” and must provide three assurances: (1) the insured intellectual property is valid; (2) the existence of infringement

• Both lung and colorectal patients had previously presented at their hospital with cancer symptoms or as emergency on more than one occasion. CADIAS (East of

This category can be used to record hours in Hazardous Materials Awareness, Operations, and Technician level that can be used as refresher training, once