Logo GISdevelopment.net

GISdevelopment > Proceedings > ACRS > 1999


1989 | 1990 | 1991 | 1992 | 1994 | 1995 | 1996 | 1997 | 1998 | 1999 | 2000 | 2002
Sessions

Agriculture/Soil

Water Resources

Disasters

Measurement and Modeling

Land Use

Forest Resources

Mapping from Space

Oceanography/Coastal Zone

Topics Including Education

Hyper Spectral Image Processing

Image Processing

Geology

Environment

GIS

Global Change

Airborne Remote Sensing

Poster Sessions
  • Session 1
  • Session 2
  • Session 3
  • Session 4
  • Session 5
  • Session 6



  • ACRS 1999


    Hyper Spectral Image Processing
    Adaptable Class Data Representation for Hyperspectral Image Classification

    2.2 Cluster Assignments Using Bayes’ Rule
    After each class training data has been represented by the given set of clusters, clusters can be recognized as an associated spectral classes of information classes.

    The cluster vector is identified by its index number k, k= 1,2,3, . . .K.   Hi(k), k=1,2,3, . . .K represents the probability distribution for class wi in the cluster-space. Hi(k), i=1,2,3, . . .M gives the relative likelihoods that the cluster vector k belongs to each defined class. Cluster index k can be classified into the information classes using Bayes’ Theorem. Assuming that prior probabilities are equal for all the classes, the decision rule is:

    kÎ wi,    if Hi(k)>Hj(k)     for all j>¹i     (1)

    This cluster assignment process is simple, fast and reliable (assume that the training data is reliable) for the following reasons: (i) This is a one dimensional problem; (ii) The number of data to classify (K) is small; (iii) The true probability distribution shapes are used.

    The clusters labels, wi, are stored for later use in classifying unknown data. If the probability values Hi(k) are recorded as well, it will be possible to provide information on how reliable the classification results are.

    Since several clusters may be labeled as the same class generally, each class has been defined by a group of clusters. In other words, instead of using a single mean position as a class signature, several cluster mean positions are used. This multi-signature structure generates a flexible and adaptable representation which is wholly determined by the training data itself.

    2.3 Unknown Data Classification
    There are two steps in labeling an unknown pixel vector.

    An unknown pixelvector x will be first classified into one of the given set of clusters (k) using the minimum distance classifier. The dicision rule is:

    xÎk,     if d(x, ck)2<d(x,cj)2     for all k¹j    (2)

    where ck is the mean vector of the cluster k data and d(x, ck)2=(x –ck)t (x –ck).

    The second step is to label the data into the classes with which the cluster is associated.

    xÎ wi,     if kÎ wi.    (3)

    3. Experiments
    An AVIRIS data set covering an area of mixed agriculture and forestry in Northwestern Indiana, USA, was used in the following tests. It was recorded in June, 1992 with 220 bands. Water absorption bands, 104 to 108 and 150 to 162, were removed, leaving 202 bands. A traditional principal components transformation was applied to the data and the first 40 new features, which contained 99.93% of total variance in the original data, were used for the tests. 4 classes were selected as shown in Fig.1.



    Class Name Training Fields No. of Training Pixels Testing Fields No. of Testing Pixels
    Soybean 1, 2 121 3 156
    Corn 4 105 5, 6 75
    Grass 7 56 8 60
    Hay 9 95 10 96

    Fig. 1. The Image and the Reference Data.


    The ISODATA clustering algorithm was run using the software package MultiSpec (Landgrebe and Biehl, 1999). 20 clusters were generated from the training data for the four classes. The training samples were then labeled into the defined clusters. The results have been normalised to show the percentage (density) of the training pixels which were labeled into each of the clusters. They are plotted in Fig. 2.

    It can be seen from Fig. 2 that three clusters, Clusters 2, 4 and 6, were shared with more than one class. The rest of clusters has been fully dominated by one class only, ie., Hj(k)=0 for j.i. Nevertheless, all the 20 clusters have been assigned to one of the 4 classes, using the decision rule given in (1). The allocation results for all the clusters are given in Table 1.

    Each class has been represented by several clusters. The cluster vectors form the multiple signatures for each class.



    Fig. 2. The Cluster-Space Training Data Representation.


    Table 1. The Subclasses of the Information Classes
      Soybeans Corn Grass Hay
    Clusters (Subclasses) 1,3,5,9,11,13 2,7,14,15,17,19 4,6,10 8,12,16,18,20


    To assess the algorithm quantitatively, the training data and the testing data have been classified using the two-step decision rules given in (2) and (3) and the conventional minimum distance classifier (single signature) for comparison purposes. Table 2 shows the classification accuracy obtained from the two methods. The classification results on the training data and the testing data have both been improved significantly.


    Table 2. Overall Classification Results Comparison
      Training Data Testing Data
    Conventional Single Signature 78% 59%
    Multi-Signature 97% 74%

    Page 2 of 3
    | Previous | Next |

    Applications | Technology | Policy | History | News | Tenders | Events | Interviews | Career | Companies | Country Pages | Books | Publications | Education | Glossary | Tutorials | Downloads | Site Map | Subscribe | GIS@development Magazine | Updates | Guest Book