Statistical models of shape and appearance, which are often referred to as combined appearance models, encapsulate variation across the set from which they are built. To construct these models, one needs to establish dense point-to-point correspondence across all images in the set, thereby highlighting analogous structures.
Fig. 1. Appearance models showing the effect
of varying the first, second, and third model parameters. Each of
the models is subjected to variation of at most standard
deviations.
Construction of appearance models involves a stage where variation
in shape and texture (image intensities) are learned in turn. Shape
can be represented as a vector while texture represented
as a vector
. Both shape and texture can be directly
controlled by models of the form
![]() |
(1) |
In the formulation above,
are the shape
parameters,
are the texture parameters,
and
are the mean shape and texture while
and
are the principal modes of
shape and texture variation respectively. In practice, there is a
tight correlation between shape and in intensity so a combined statistical
model of the form
![]() |
(2) |
appears to work even more gracefully, integrating both sources
of variation. The model parameters control both shape
and texture simultaneously and
,
are matrices describing the modes of variation derived from the training
set.