Statistical models of shape and appearance (combined appearance models) were introduced by Cootes, Edwards, Lanitis and Taylor [2,3,9]. They have been applied extensively in medical image analysis [11,17,24], among other related domains. Brain morphometry has been one main focus, while cardiac imaging has extended the models to a third and a fourth dimension by incorporating time series [23].
The construction of an appearance model depends on establishing a dense correspondence across a training set of images. That correspondence is defined using a set of landmark points marked consistently on each training image. Landmark points are often prominent anatomical positions, which can easily be identified as they lie on strong edges. Moreover, they have meaningful properties such as marking the boundary of an organ or, as in our case, a brain compartment or the skull.
Using the notation of Cootes [3], the shape (configuration of landmark points) can be represented as a vector $x$ and the texture (intensity values) as a vector $g$. The two vectors are formed by simple concatenation of values, either intensity (usually grayscale) values or geometric positions of landmark points in the image. Using Principal Component Analysis (PCA) [13], the variation in terms of shape and texture can be learned and decomposed. The shape and texture are controlled by linear statistical models of the form

$$x = \bar{x} + P_s b_s, \qquad g = \bar{g} + P_g b_g \qquad (2)$$

where $b_s$ are shape parameters, $b_g$ are texture parameters, $\bar{x}$ and $\bar{g}$ are the mean shape and texture, and $P_s$ and $P_g$ are the principal modes of shape and texture variation respectively. By varying $b_s$ and $b_g$, the image produced by the model can be altered.
Since shape and texture are often correlated, we can take this into account by applying a further PCA stage. We then obtain a combined statistical model (encapsulating both shape and intensity) of the form
$$x = \bar{x} + Q_s c, \qquad g = \bar{g} + Q_g c \qquad (3)$$

where the model parameters $c$ control the shape and texture simultaneously and $Q_s$, $Q_g$ are matrices describing the modes of variation derived from the training set. The effect of varying one element of $c$ for a model built from a set of 2D MR brain images is shown in Fig. 1.
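The second PCA stage can be sketched as follows. This is a simplified, hypothetical version: it applies PCA to the concatenated per-example shape and texture parameters and splits the resulting modes; in the literature a weighting matrix $W_s$ is commonly used to commensurate shape units with intensity units, which is reduced here to a single scalar `w_s`:

```python
import numpy as np

def combined_model(B_s, B_g, w_s=1.0):
    """Second PCA stage coupling shape and texture parameters.

    B_s: (n_samples, n_s) shape parameters and B_g: (n_samples, n_g)
    texture parameters of the training set (one row per example).
    w_s is a scalar stand-in for the weighting matrix W_s. Returns
    the combined modes split into their shape and texture parts.
    """
    B = np.hstack([w_s * B_s, B_g])           # concatenated parameters
    mean = B.mean(axis=0)                     # ~0 when b's are PCA coords
    _, _, vt = np.linalg.svd(B - mean, full_matrices=False)
    Q = vt.T                                  # combined modes, one per column
    n_s = B_s.shape[1]
    return Q[:n_s], Q[n_s:]                   # (n_s, k) and (n_g, k) blocks

# Toy parameter matrices for 20 training examples.
rng = np.random.default_rng(1)
B_s = rng.normal(size=(20, 3))
B_g = rng.normal(size=(20, 4))
Q_s_par, Q_g_par = combined_model(B_s, B_g)
```

Note these modes live in parameter space (they map $c$ to $b_s$ and $b_g$); to obtain the image-frame matrices $Q_s$, $Q_g$ of Eq. (3), they would be composed with the first-stage matrices $P_s$, $P_g$.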
To generate the positions of points in an image we use

$$X = T_t(x) \qquad (4)$$

where $x$ are the points in the model frame, $X$ are the points in the image, and $T_t$ applies a global transformation with parameters $t$. For instance, in 2D, $T_t$ is commonly a similarity transform with four parameters describing the translation, rotation and scale.
The texture in the image frame is generated by applying a scaling and offset to the intensities, $g_{im} = (u_1 + 1)\,g + u_2 \mathbf{1}$, where $u = (u_1, u_2)^T$ is the vector of transformation parameters.
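The intensity mapping above is a one-liner; the sketch below (an illustrative helper, following the common convention that $u = 0$ gives the identity mapping) applies it to a flat list of sampled intensities:

```python
def texture_transform(g, u1, u2):
    """Map model-frame intensities g to the image frame:
    g_im = (u1 + 1) * g + u2, i.e. scaling (u1 + 1) and offset u2.
    u1 = u2 = 0 leaves the intensities unchanged."""
    return [(u1 + 1.0) * gi + u2 for gi in g]
```

Parameterising the scaling as $u_1 + 1$ rather than $u_1$ keeps the identity at $u = 0$, which is convenient when the transformation parameters are perturbed incrementally during model fitting.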