Greedy structure learning
● Two phases:
○ Bottom-up compositional clustering.
○ top-down model composition phase.
● Bottom up phase (“EM”): at each layer,
select two part models , (learned with
matching pursuit) from image patches that
are randomly sampled from the training
data. Then repeat:
○ Detection (E-step): Detect part models in
the training images at different locations
and orientations. Cut out patches at the
detected positions which serve as new
training data for the M-step.
○ Learning (M-step): Learn a part model from
the training patches with matching pursuit.
● Dealing with background:
○ Update only one of the models at each
iteration. The other only participates in the
E-step.
○ It will serve as a generic background model.