Statistical models of shape and appearance, which are often referred to as combined appearance models, encapsulate variation across the set from which they are built. To construct these models, one needs to establish dense point-to-point correspondence across all images in the set, thereby highlighting analogous structures.
Fig. 1. Appearance models showing the effect of varying the first, second, and third model parameters. Each of the models is subjected to variation of at most standard deviations.
Construction of appearance models involves a stage where variation in shape and texture (image intensities) are learned in turn. Shape can be represented as a vector while texture represented as a vector . Both shape and texture can be directly controlled by models of the form
(1) |
In the formulation above, are the shape parameters, are the texture parameters, and are the mean shape and texture while and are the principal modes of shape and texture variation respectively. In practice, there is a tight correlation between shape and in intensity so a combined statistical model of the form
(2) |
appears to work even more gracefully, integrating both sources of variation. The model parameters control both shape and texture simultaneously and , are matrices describing the modes of variation derived from the training set.