To identify which weighting scheme provides the best behaviour, a measure of sensitivity was desirable. Performance of a measure can be described it term of its ability to discern a good registration from a worse one. Put differently, the problem in question is revealing howsmall a degradation can be and still get detected. This enables us to compare the merits of our model-based assessement method and comparing it with an overlap-based method. It also makes it clearer to see which weighting scheme works best in terms of performance.
It is worth paying attention to the fact that slopes and errors vary systematically. This affects the size of perturbation that can be detected. To make a quantitative comparison of the different methods, we define the sensitivity, as a function of perturbation as , where is the quality measured for a given value of displacement, is the measured quality at registration, is the degree of deformation and is the mean error in the estimate of over the range.
Sensitivity averaged of the range of perturbations shown in figures 10, 11, and 12 is plotted in Figure 13 for all the methods of assessment. This shows that the Specificitymeasure with shuffle radius 1.5 or 2.1 is the most sensitive of the measures studied, and that this difference is statistically significant.