GISdevelopment.net ---> Technology---> Image Processing

Introducing correctness coefficient as an accuracy measure for sub pixel classification results

Hassan Emami
Email: hemamy@yahoo.com
Maybod Azad University

S.B. Fatemi
Email: sbfatemi@yahoo.com
K.N.T Univ. of Technology

M.Mojarradi
Email: Mojaradi @yhoo.com
K.N.T Univ. of Technology




Abstract
After each classification its results must be evaluated and their accuracy must be assessed. In respect of the result's type (thematic map/fraction map), an adequate strategy for accuracy assessment must be chosen. Methods of accuracy assessment for traditional pixel-based classifications are not fully suitable for sub pixel classifications. Because, training and ground truth data are pixel-based and they can not be used directly for accuracy assessment of the sub pixel classification results (fraction maps).

Generally there is no common and standard sub pixel accuracy assessment method for evaluation of the sub pixel classification results. Very few methods and measures such as entropy and cross entropy have been proposed for the sub pixel accuracy assessment. These have some limitations to be used in accuracy asseement of the sub pixel classiifcation results. Cross entropy needs to a fuzzy ground truth data set, the matter that is not available simply. For this purpose, we introduce the correctness coeffient parameter for the sub pixel accuracy assessment. Correctness coeffient expresses the matching rate of the results of subpixel classification (fraction maps) with the ground truth data. Correctness coefficient is one of the efforts to ensure the flexibility and consistency of the sub pixel accuracy assessment regarding the type of the available data and classification methods. The proposed method for the accuracy assessment of the sub pixel classifiers make possible to inspect the classes individually. Additionally each class can be investigated individually in respect of the corresponding commission and omission errors. An experiment using the real data has been implemented in order to show the ability of the new accuracy measure.

1. Introduction
One of the most important information extraction methods for the remotely sensed images is classification. Classification traditionally is defined as a mapping function from the image apace into a nominal space in which each pixel has one label. Usually the result of the classification is a thematic map in which each pixel is allocated to a specific class. In some cases classification tries to delineate objects on the real world and this is done by preprocessing the image (e.g. segmentation). In the other hand, some of the classification methods defines the per pixel fraction of each class and allow calculating the correct area estimation of the classes.

There are many classification methods that so far have been proposed and some of them exist in the state of the art softwares like as Maximum likelihood (MLC), Minimum Distance, etc. Traditional classifiers often are oriented to generate a thematic map but this leads to the incomplete area estimation of the classes. Because of the mismatching of the sensor grid and the real object boundaries, some mixed pixels (mixels) will appear in the image (Fisher 1997).

The gray value of such mixels is a composition of the radiometric properties of the several classes (objects) and therefore generates some confusion in classification procedures. Traditional classifiers like MLH assign each pixel to only one class. Consequently the mixels usually are labeled erroneously. After each classification the results of it must be investigated and the accuracy of them must be reported. In respect of the result type (Thematic map, fraction map, ...) we can choose an adequate strategy for accuracy assessment. At last, some parameters, tables and maps will be calculated and generated to show the accuracy of the result. In this paper we try to show some aspects of the sub pixel accuracy assessment of the classified maps resulted from the sub pixel classifiers. However a rectified version of traditional accuracy measures is proposed for using in accuracy assessment of the sub pixel classifiers.

2. Sub-pixel classification methods
The main problem and limitation of traditional hard (pixel based) image classification procedures is in the classification of mixed pixels. Mixed pixel classification is a process which tries to extract the proportions of the pure components of each mixed pixel. To resolve the mixed pixel problem, there are different approaches. Some of the most important soft classification methods are: (i) Deterministic approaches; (ii) Fuzzy set theory based approaches; (iii) Neural network based approaches and (iv) Linear mixture modeling approach (Emami 2002). Among of these approaches we chose the linear mixture modeling approach to produce some (semi) fuzzy results and use them to test the accuracy assessment approaches.

There are two different mixture models for mixed pixel classification: the nonlinear mixing and the linear mixture model. The nonlinear mixture model for unmixing analysis considers not only the pixel of interest, but also involving the neighboring pixels i.e. each photon that reaches the sensor has interacted with multiple scattering between the different class types. In the linear mixture model, each pixel is modeled as a linear combination of a number of pure materials or endmembers. The linear mixture model is known as the spectral unmixing. Spectral unmixing is a method in which the user allowed to determine information on a sub pixel level and to study decomposition of mixed pixels. The basic idea under linear mixture model is that each photon which reaches the sensor has interacted with just one class (Mather 1999).


Figure 1. Non-linear mixture model                 Figure 2. The linear mixture model

In this research, the linear mixture model is concerned and linear unmixing model is used to classify a hyperspectral image. This linear mixture model can be mathematically described as a set of linear vector-matrix equations,




Solving the equation 1 results the unconstrained unmixing using no constrain. The resulting fractions may have negative values and are not constrained to sum to unity. In order to avoid this, the sum to unit constraint is added to the equations of the unmixing process. Applying the condition that all the resulting fractions must sum to unit is referred to partially constrain unmixing. However, fraction values which are negative or greater than one are still possible. Fully constrained unmixing implies an additional condition in that all determined endmember fractions must be between 0 to1. It should be noted that the final results of unmixing algorithm depend to the type and number of endmembers. Therefore, any changes applied to the reference endmembers will cause changes on the fraction map results.

A solution for the linear unmixing problem requires that; the sum of the coefficients equals one, because ensure the whole pixel area is represented in the model and also each of the fraction coefficients be nonnegative to avoid negative subpixel areas. The first requirement can be modeled by a constraint equation, for the second requirement, the coefficients need to be constrained by :


Together, the mixing equations and the constraints describe a model that must be solved for each pixel which should be decomposed, i.e. given and , we have to determine and in equation 1(Mather 1999).

3. Accuracy assessment of the classification results
Accuracy assessment is an essential post classification stage. Accuracy of the results is expressed in various forms relative to the classification results and method. The result of the common classification methods (e.g. MLC) is in the form of land cover/use map and usually the accuracy of it is assessed by comparing it to a ground truth map. The ground truth or reference map is usually stored in the digital form and defines well known land cover types for some pixels of the scene. Pixel by pixel comparison of these two maps results an error (or confusion) matrix.

From the error matrix some error and accuracy measurements are derived which each of them show some error or accuracy aspects of the final results. One of the most popular parameter calculated on the basis of the error matrix is overall accuracy. This parameter equals the ratio of sum of the diagonal elements of the error matrix on the number of pixels which have been correctly classified. For each category (class), an accuracy parameter is also defined. In each row, the ratio of the diagonal component (for each class) on the sum of pixels of that row is called user's accuracy. Analogously this ratio is calculated for each column and is called producer's accuracy.

Based on the error matrix another measure for accuracy is defined which is called Kappa coefficient. This accuracy criterion is calculated by (Richards 1993):


Commision error is defined as the ratio of the sum of the off diagonal components in each row to the number of pixels of that row. Ommision is a similar error mesurment for columns. Hence for each row we can calculate commision and user's accuracy, and for each column ommision and producer's accuracy are calculated. These fasctors need ground truth data and comparing thematic map and ground truth which results an error matrix.

Avoiding the dependency to the ground truth data, entropy is defined.Entropy measures the uncertainty in a single value of a statistical variable and is defined as the information content of a piece of information that would reveal this value with perfect accuracy. This quantity is weighted by the probability that value occurs and summed overall values, which gives (Gorte 1998):


In which, N is number of classes and (Xi/Ci) is the posteriori probability of the class Ci in the pixel xp. Thus entropy is calculated per pixel.

Sub pixel classification tries to define fraction of each class per pixel therefor these techniques have no absolute decision on the pixel label. Then generating a thematic map in this manner is not straight forward and some other postprocessings (e.g. thresholding ) must be applied. For this reason, accuracy assessment of the sub pixel classification results is not similar to the common accuray assessment methods. If we want to use the traditional accuracy assessment (e.g. confusion matrix) we have to generate a thematic map and then compare it with a ground truth map. The next section deals with the available proposed sub pixel accuracy assessment techniques.

4. Sub pixel accuracy assessment methods
In respect to the result's type (thematic map/fraction map), an adequate strategy for accuracy assessment of the classification results must be chosen. Finally some parameters, tables and maps will be calculated and generated to show the accuracy of the results. Methods of accuracy assessment for traditional pixel-based classification (previous section) are not fully suitable for subpixel classification accuracy assessment. Because, training data and ground truth are pixel-based and we can not use directly any pixel based method for accuracy assessment of the sub pixel classification results. Although in some cases the only way to compute the accuracy of a sub pixel classifier is to harden its results (Foody 1996).

On the basis of the subpixel classification results, some methods have been proposed to estimate the accuracy of such a classification method. Foody (1996) has an excellent review on the available sub pixel accuracy assessment approaches. The most of the methods that he mentioned need to a fuzzy ground truth map the matter that is not available simply.

Entropy was defined in the previous section, have several limits and disadvantages. One of the limitations of the entropy is that it can't show how much the classification accuracy is reasonable (Maselli et al. 1996). When pixels are mixed, the entropy parameter can not to be a good index for classification accuracy, because the mixed pixels have a high entropy and ratio entropy. Therofore in this cases alwayes the classification accuracy will be best. Other limit of entropy is that it can not show, how much the classification accuracy is best or poor. Therfore entropy parameter can not be used to campare the accuracy of two classification procedures. Therefore, Foody (1996) propose that we can use of cross entropy for subpixel classification accuracy, if a subpixel or fuzzy ground truth map exist. Cross entropy parameter is determined as the following equation :



P(Xp)is the posteriori probability of the classification result in pixel xp P(Xp)is the posteriori probability of the pixel xp in ground truth map. Thus cross entropy is calculated for each pixel and defined as the expected information content of a piece of information that would reveal its true class. The major problem of this method is that it needs the fuzzy ground truth map; the matter which often is hard to be available.

One of the other accuracy assessment methods for the LUM results is the area estimation and comparison.In this method the area of each class using the appropriate fraction map is calculated (Zhu et al. 2001). These values are compared with the same other areas which come from the other reliable sources (e.g. Old maps, databased or other classifications).More similar values the more accurate classifications. This approach uses the fraction maps to calculate the area covered by each class. In this manner we just sum the fractions of each class ignoring the spatial distributuion of the errors. The nonsite-specific nature of this approach is, however, a major limitation as a map could easily dsiplay the classes in the correct proportions but in the incorrect locations (Foody 2002).

Additionally this method dose note give any accuracy parameter that can be used at the comparing two or more classifcations.Logically the closeness of the estimated and true area is the basic criterion for accuracy of the classification. Thus relativly we can just say "this classification is more accurate than the other one".

5. Correctness coefficient
As mentioned in the previous section, we can not use traditional accuracy assessment procedures for sub pixel accuracy assessment. For accuracy assessment of this kind of classification results we have to use fraction maps as the main results of the linear unmixing classification.

In the first step we need a parameter to express the matching rate of the results (fraction maps) with the ground truth data. For this purpose, we introduce the correctness coeffient (CC) parameter. In order to calculation of correctness coeffient, a binary map for each class is generated from the ground truth:




In fact by this multiplication, for each pixel with value 1, the calculated fraction remains and zero components of the binary map dismiss the other fractions which have no any corresponding ground truth data. So, in this manner for each grtound truth pixel of a particular class, the relevant fraction value will be remained. Then it will be possible to calculate the correspondence of the resulted fraction with the ground truth data. after this step, correctness coeffient can be esdtimated using the folowing formula:


Ng is the number of known pixels in the ground truth map. CC also can be computed for each calass individuaaly:


NPi is the number of known pixels for ith class in the ground truth map. Correctness coeffient is expressed as the percentage and resembels the overall accuracy in the traditional error matrices and it can be used as an overall acuracy measure for the sub pixel results.

In addiotion to the accuracy parameters, some error mesures can also be derived to express the contained errors in the results. As the commision and ommision error are defined in the tradditional accuracy assessment, we introduce these parameters on the basis of the ground truth binary maps and classification resulted fraction maps. In a tradditional error matrix, commision errors define the percentaeg of those pixels that have been labled as a particular class but in ground truth are in a different category. Anagolously ommision defines the percentage of pixels from a particular class which have been labled as the other classes. By this concept we can calculate ommision and commision errors for each class using sub pixel classification results. Firstly we subtract each fraction map from the binary map for each class individually.


The resulted map has some positive and negative values. Positive values are for those pixels which have the value 1 in the corresponding binary map. Therefor the negative values are the result of the subtraction of the zero values from the fraction values. In fact the positive values are the values which have been allocated to other classes. This resembels the ommision error in the tradditional error matrices. considering the same concept we can define the commision error using the negative values:



These error mesures are defined per class and can explain the error rate of the resulted fraction map. The next section pertains to these concepts and shows one case study using the real data.

6. Case study
Showing the applicability of the proposed method for the sub pixel accuracy assessment we applied the proposed accuracy assessment procedure on a sub window image, a portion of the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) of hyperspectral data taken over an agricultural area of California, USA in 1994. This data has 220 spectral bands about 10 nm apart in the spectral region from 0.4 to 2.45 m with a spatial resolution of 20 m. The image has a size of 145 rows by 145 columns. Its corresponding ground truth map has been shown in the figure 3. This map consists of 12 classes including of Alfalfa, Corn, Corn_min, Corn_notill, Grass_pasture, Grass_pasture_moved, Grass_Trees, Hay_windrowed, Oatas, Soybeans_clean, Soybeans_notill, and Wood. Although there are more than 12 classes in the reality but regarding the radiometric overlap and number of pixels for each class we merge some of the (spectrally) similar classes and therefore in this case study the image is classified assuming 12 spectral classes exist in the study area. This assumption can not affect the obtained results. Table 1 shows the number of pixels for each class in the ground truth map.



Figure 3. Color composite image of the study area (RGB bands: 31,19 and 9).




Figure 4. The ground truth map (includes 12 classes) of the study area.

The basic assumption of the linear spectral unmixing is that the most of pixels are mixtures of objects. Once all the endmembers are found in an image (using the mean ROI procedure), all the remaining pixels are considered to be linear combinations of these endmember pixels. Some of the hyperspectral image bands are the subject of absorption and they contain only little signal and more noise. By applying Minimum Noise Fraction (MNF) transformation on the hyperspectral image, the correlation and noises are removed from the bands and they are sorted in respect of their variance (Emami 2002).

Table .1 number of pixels per class on the ground truth map



In the bands with a high variance, the features can be distinguished from each other in a better mode, therefore classification accuracy increased sensibly. Also, applying MNF transformation on the hyperspectral images reduces the dimensionality of hyperspectral image data for next processing steps. Hence this method was applied upon MNF images which lead to the better results than the case which uses the original images. In this procedure, 20, 50, 87, 120, 155, 175 and 195 number of bands were tested for classification. In the final classification 120 bands have been used extracted of the MNF produce. Four fraction images resulted of the LSU have been shown in the figure 5.

Although LSU classification dose not label directly any pixel, however a classified map, which is a pixel-based representation of classification, can be generated by choosing the class with the highest pixel value in the fraction images for the output map (maximum value rule). By applying this rule on the fraction maps, a thematic map has been generated in which each pixel has a specific class label (harden result). This map can be used to compare the proposed sub pixel and traditional pixel based accuracy assessment methods. Hardening the LSU results can lead to some lose of the information and accuracy but this is one of the common approaches to evaluate the accuracy of the subpixel classification results and also we used it only for the comparison of the two methods.

Therefore using the ground truth data (figure 4) we calculate the mentioned accuracy measures for the resulted fraction maps and generated thematic map. Table 2 shows the confusion matrix of the thematic map, overall accuracy outputted from this matrix is 86% and the corresponding Kappa coefficient is about 84%.



Figure 5. Four sample fraction maps resulted from the LSU performed on the [120] MNF bands.


Table 2. Confusion matrix resulted from applying the maximum value rule on the fraction maps



Considering the number of pixels for each class in the ground truth map and the radiometric overlap between classes this can be an optimistic evaluation of the accuracy and in this level of the information (hard) we have not any tool to show the error distribution over the entire image. But using the difference and accuracy maps of the classes (Equation. 9) we can show the error involved in the results and spatial distribution relative to the ground truth map. (Figure 6)

Using the concept presented at Section 5, overall CC and individual CCs have been calculated. The calculated overall CC using 120 bands in LSU is about 84.9%. The calculated CCs for individual classes have been shown at Table 3.

Table 3. The calculated individual CCs for linear unmixing of the 120 MNF bands.



Using the defined formulas at Section 5, commision and omission errors for each class are the same as Table 4. As the table 3 and table 4 show, we can calculate the soft omission error using the individual CCs:

OEi (%) = 100 - CCi (%)


Table 4. Estimation of Comission and Omission errors in LUM method for MNF images (120 bands)


7. Conclusion
Accuracy assessment is one of the challenging aspects of the subpixel classifiers. Traditional accuracy assessment methods have been designed on the basis of the hard classifiers and therefore there is no straightforward approach for this aim. In the other hand ground truth data that are inherently hard in nature force the accuracy assessment to be hard.

Very few methods have been proposed so far for the sub pixel accuracy assessment. These approaches can not ensure desired flexibility and consistency. Correctness coefficient, recommended by the authors is one of the efforts to ensure the flexibility and consistency of the subpixel accuracy assessment regarding the type of the available data and classification methods.

Obtained results from the case study show that correctness coefficient (soft) and overall accuracy and kappa (hard) are approximately the same or similar in value enough. The proposed method for the accuracy assessment of the subpixel classifiers make possible to inspect the classes individually (individual CCs). Additionally each class can be investigated individually in respect of the corresponding commission and omission errors.

Although in this manner we used hard ground truth data, but this is the common case and there is no any fuzzy ground truth in the traditional image analysis. However, using the correctness coefficient parameter and its family we can take into account the major subpixel properties of such a classifiers and the effect of the inherent limitation of the hard ground truth data can be reduced.

8. References

  • Conese C., Maselli F.,1992, Use of error matrices to improve area estimates with maximum likelihood classification procedures, Remote sensing of Environment, 40:113-124
  • Emami H., 2002, Evaluation and decomposition of mixed pixels in remotely sensed hyperspectral images for accuracy improvement of classification results, MSc thesis, K.N.Toosi University, Tehran, Iran
  • Fisher, P., 1997, The Pixel: a Snare and a Delusion, International Journal of Remote Sensing,
  • Vol 18, no.3, 679-685.
  • Foody G.M., 1996, Approaches for the production and evaluation of fuzzy land cover classifications from remotely sensed data, INT. J. REMOTE SENSING, 1996, VOL. 17, 1317-1340.
  • Foody G.M., 2002, Status of land cover classification accuracy assessment, Remote Sensing of Environment 80 (2002) 185-201.
  • Gorte B ,1998, Probabilistic segmentation of remotely sensed images, PhD Thesis, Wayeningen Agricultural University (WAU), ITC, Enschlend, The Netherlands, ISBN 90-6164-157-8, ITC Publication 63.
  • Maselli F., Rodolfi A., Conese C.,1996 , Fuzzy classification of spatially degraded Thematic mapper data for the estimation of sub-pixel components, Remote sensing of Environment,
  • Mather P.M, 1999, Computer processing of remotely sensed images : an introduction, Second edition, John Wiley & Sons
  • Richards J.A ,1993 ,Remote Sensing Digital Image Analysis: An Introduction, Second Edition, Springer, ISBN 0-387-5480-8.
© GISdevelopment.net. All rights reserved.