Advanced Supervised Sub-Pixel Multi-Spectral Image Classifier

Anil Kumar
Indian Institute of Remote Sensing, Dehradun,India.
E mail:- iirs_anil@yahoo.com

Dr. S. K. Ghosh
Deptt of Civil Engineering, IIT Roorkee, Roorkee - 247 667,India.
E mail:- scnagosh@datainfosys.net

Dr. V. K. Dadhwal
Indian Institute of Remote Sensing, Dehradun,India.
E mail:- vkdadhwal@iirs.gov.in


Abstract
Sub-pixel level classification is essential for the successful description of many land cover patterns of less than ~1 km and has been widely used in global or continental scale land cover mapping with remote sensing data. There are many algorithms available for sub-pixel classification. Some are known as statistical classifiers like maximum likelihood (MLC), Linear Mixture Model (LMM) etc. Some are based on fuzzy set theory like Fuzzy c-Means (FCM), Possibilistic c-Means (PCM) and some are neural network based. Statistical sub-pixel classifiers are dependent on multivariate normal distribution, while in the case of Linear Mixture Model (LMM) number of classes to be extracted from the data should be less than or equal to dimensionality of the data plus one. Artificial Neural Network is a learning algorithm but does not always generalize well. Support Vector Machines (SVM) are for remote sensing image classification for a number of reasons. SVM can work well with a small training data set yielding high classification accuracy. It is well suited for high dimensionality data such as classification of hyperspectral data, where data is available in hundreds of bands. Further, SVM is robust to the overfitting problem as it relies on margin maximization rather than finding a decision boundary from the training samples. The structure of an SVM is less complex in comparison to Neural Network, for example, which has a complex structure for processing high dimensional data. Finally, as compared to another recent nonparametric classifier namely the decision tree classifier, SVM does not require the generation of rules that heavily depend on the knowledge from experts. This is crucial to achieve high classification accuracy.

Statistical based algorithms, like Linear Mixture Model (LMM); Fuzzy Set Theory based algorithms, like Fuzzy c-means (FCM), and Artificial Neural Network (ANN) based classification algorithms have been incorporated in different commercially available digital image processing softwares; like neural network and fuzzy c-mean in PCI Geomatica, Linear Mixture Model (LMM) in ERDAS, ENVI etc. But there are hardly any digital image processing software's available having Support Vector Machines (SVM) based algorithms for multi-spectral remote sensing data sup-pixel classification. This paper presents the capabilities of "Advanced Supervised Sub-Pixel Multi-Spectral Image Classifier" system developed at Indian Institute of Remote Sensing, Dehradun, India, using JAVA programming language for classifying multi-spectral data from remote sensing satellites at sub-pixel level using advance classifiers like, Support Vector Machine (SVM). This system can handle multi-spectral image of any bands. In this system four different classification algorithms (like; Fuzzy c-Means, Possibilistic c-Means, Maximum Likelihood Classifier, Support Vector Machine) have been incorporated in supervised mode for sup-pixel classification. As it is well known that in supervised classification reference data is required. This reference data can be generated from this system in two modes i.e. pure reference data as well as mixed reference data. Reference data in pure or in mixed form can be used to all the classification algorithms incorporated in this system for generating fraction images. There is also provision in this system for saving the membership values generated using different classifiers for sup-pixel classification. Accuracy assessment is very critical component in classification for assessing the accuracy of fraction images generated using sub-pixel classifier. As this system is capable for generating hard as well as sub-pixel level outputs, so for both types of outputs, different accuracy assessment methods have been incorporated. For hard classification output error matrix as well as Khat coefficient have been used for assessing the accuracy of classified output. For sup-pixel classification, when reference as well as output is soft, Fuzzy Error Matrix (FERM) has been incorporated in this system for assessing the accuracy of soft classified output. As this system is developed in JAVA language, so this system is platform independent. Presently no commercial software is available for classification of multi-spectral remote sensing images using advance classifier like; support vector machine. So it will be quite useful for incorporating this system in different professional programs run by Indian Institute of Remote Sensing, Dehradun, India, in the field of Geoinformatics.

Introduction:
Preparation of the land cover maps from remote sensing images is viewed as a classification problem. In general, classification algorithms are statistical in nature and assign a unique value to each pixel. However, a pixel may contain more than one class, and such pixels are known as mixed pixels. As the remote sensing images become courser, the problems of mixed pixel increases, leading to erroneous classification. To account for information with a mixed pixel, the analyst has to make an assessment regarding the proportion of classes within a mixed pixel. Normal statistical classifiers are not able to provide such information and hence different types of classifications are required. These classifiers are known as soft classifiers.

There are large numbers of soft classifier algorithms in digital image processing for capturing the respective proportions of land cover classes within mixed pixels and characterize land cover more accurately. A fraction images represents the proportions of each land cover within pixel. This land cover information is very important for land resource management. One important use of the fraction image information is that discrete classification maps of any type can be produced out of the continuous land cover information if desired. Soft classifier algorithms typically require training pixels, often called "image endmembers," which can be either pure pixels or mixed pixels with known class proportions.

The algorithms for sub pixel classification can be grouped under statistical, fuzzy set theory and some neural network based. Statistical based sub-pixel classifiers are dependent on multivariate normal distribution, while in the case of Linear Mixture Model (LMM) number of classes to be extracted from the data should be less than or equal to dimensionality of the data plus one. Artificial Neural Network is a learning algorithm but does not always generalize well. Upcoming machine learning Support Vector Machines (SVM) is a group of theoretically superior machine learning algorithms. It is found to be competitive with the best available machine learning algorithms in classifying high-dimensional data sets (Huang et al. 2002).

SVM can work well with a small training data set yielding high classification accuracy. It is well suited for high dimensionality data such as classification of hyperspectral data, where data is available in hundreds of bands. Further, SVM is robust to the overfitting problem as it relies on margin maximization rather than finding a decision boundary from the training samples. The structure of an SVM is less complex in comparison to Neural Network, which has a complex structure for processing high dimensional data. Further, when compared to some of nonparametric classifier, namely Decision Tree Classifier, SVM does not require the generation of rules that heavily depend on the knowledge from experts. This is crucial for achieving high classification accuracy.

A review of some of the commercial software reveals that only one or two soft classifiers are implemented, e.g. Linear Mixture Model is available in ERDAS Imagine, Fuzzy C- Means is available in PCI Geomatica. However, all soft classifiers are not available in any of the commercial software. SVM based algorithms are not available in any of the software, which is fast becoming popular amongst remote sensing analyst. This paper presents the capabilities of "Advanced Supervised Sub-Pixel Multi-Spectral Image Classifier" system developed at Indian Institute of Remote Sensing, Dehradun, India, using JAVA programming language for classifying multi-spectral data from remote sensing satellites at sub-pixel level using advance classifiers like, Support Vector Machine (SVM).

Soft Classification - Classifiers and Methods:
Land cover information has been identified as one of the crucial data components for many aspects of global change studies and environmental applications (Sellers et al. 1995). The derivation of such information increasingly relies on remote sensing technology due to its ability to acquire measurements of land surfaces at various spatial and temporal scales. One of the major approaches to deriving land cover information from remotely sensed images is classification. Numerous classification algorithms have been developed since the first Landsat image was acquired in early 1970s (Townshend 1992, Hall et al. 1995). Among the most popular are the maximum likelihood classifier (MLC), neural network classifiers and decision tree classifiers. The MLC is a parametric classifier based on statistical theory. Despite limitations due to its assumption of normal distribution of class signature (e.g. Swain and Davis 1978), it is perhaps one of the most widely used classifiers (e.g. Wang 1990, Hansen et al. 1996). Neural networks avoid some of the problems of the MLC by adopting a non-parametric approach. Their potential discriminating power has attracted a great deal of research effort. As a result, many types of neural networks have been developed (Lippman 1987); the most widely used in the classification of remotely sensed images is a group of networks called a multi-layer perceptron (MLP) (e.g. Paola and Schowengerdt 1995, Atkinson and Tatnall 1997).

A decision tree classifier takes a different approach to land cover classification. It breaks a complex classification problem into multiple stages of simple decision-making processes (Safavian and Landgrebe, 1991). Depending on the number of variables used at each stage, there are univariate and multivariate decision trees (Friedl and Brodley, 1997). Univariate decision trees have been used to develop land cover classifications at a global scale (DeFries et al. 1998, Hansen et al. 2000). Though multivariate decision trees are often more compact and can be more accurate than univariate decision trees (Brodley and Utgov 1995), they involve more complex algorithms and, as a result, are affected by a suite of algorithm-related factors (Friedl and Brodley, 1997).

The support vector machine (SVM) represents a group of theoretically superior machine learning algorithms. SVM employs optimization algorithms to locate the optimal boundaries between classes. Statistically, the optimal boundaries should be generalized to unseen samples with least errors among all possible boundaries separating the classes, therefore minimizing the confusion between classes. In practice, the SVM has been applied to optical character recognition, handwritten digit recognition and text categorization (Vapnik 1995, Joachims 1998b). These experiments reveals that SVM to be competitive with the best available classification methods, including neural networks and decision tree classifiers. The superior performance of the SVM has been demonstrated in classifying hyperspectral images acquired from the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) (Gualtieri and Cromp, 1998). While hundreds of variables were used as the input in the experiments mentioned above, there are far fewer variables in data acquired from operational sensor systems such as Landsat, the Advanced Very High Resolution Radiometer (AVHRR) and the Moderate Resolution Imaging Spectro radiometer (MODIS). As these are some of the major sensor systems from which land cover information is derived, an evaluation of the performance of the SVM using images from such sensor systems should have practical implications for land cover classification. A Java based software implementing different types of soft classifiers known as SMIC (Sub-pixel Multispectral Image Classifier) has been developed. The details of this classifier are given ahead.

Components of SMIC:
This system has been designed to handle multi-spectral image of any sensor. In this system, four different classification algorithms (Fuzzy c-Means, Possibilistic c-Means, Maximum Likelihood Classifier, Support Vector Machine have been incorporated in a supervised mode for sub-pixel classification. Fig 1. shows the different components of SMIC and are discussed below.

File: This option is for performing the file read, display and storing operations. One of the unique aspect incorporated regarding reading of a file is that any number of bands in file can be read simultaneously. Thus high dimensionality data sets can be read and displayed on the screen.



Fig 1. Main Window of the SMIC System


Utilities: This option allows for creation of a False Color Composite, and also allows for zooming in and out. Further simple linear enhancement facilities can be carried out which helps while generating reference information. Signature files: The option allows the user to create of reference data, both pure and mixed in nature. The reference data can then be used for any of the four soft classifiers.

Classifiers: In this option four classifiers had been incorporated. There is also option for saving the membership values generated through sub-pixel classifier option using any four classifiers algorithms incorporated in the SMIC system.

Accuracy Assessment: This option allows for accuracy assessment, which is a critical part for evaluating the output. Here the hard classifier output is evaluated through error matrix as well as through kappa coefficient. The accuracy of sub-pixel classifier is checked using fuzzy error matrix.

When carrying out soft classification, it requires a number of input parameters to be defined or assessed beforehand as the classifiers have to be trained before the data can be classified. Some of these parameters are weight matrix 'A' known as the norm, a weighting exponent m etc.

In this system provision has been kept that the user can select weight matrix 'A' according to type of norm, while calculating squared distance in fuzzy c-means as well as in possibilistic c-means. As mentioned in following equation;





Where xi is the vector denoting spectral response of a pixel i (i.e. a vector of spectral response of a pixel in various bands of a multi-spectral remote sensing image), vj is the vector of cluster center, A is the weight matrix, used as three namely Euclidean, Diagonal and Mahalonobis norm. The formulations of each norm are given as (Bezdek, 1981),





Where, I is the identity matrix, Dj is the diagonal matrix having diagonal elements as the eigen values of the variance covariance matrix, Cj given by;









While incorporating maximum likelihood classifier algorithm, provision had been kept that in the Gaussian expression the weight matrix can be any of three norms as mentioned in equation (1). The probability density function P (x/j) can be written as (Polubinskas et al. 1995),





Page 1 of 2
Next