Classification of Remotely Sensed Data using Gravitational Symbolic Clustering D.S.Vinod Research scholar Department of Computer Science and Engineering Sri Jayachamarajendra College of Engineering Manasagangothri, Mysore 570 006 Karnataka, India Email:-ds_vinod@mailcity.com, vinod@sjce.ac.in Tel:-0821-512568(college), 481314(Res.) T. N. Nagabhushana Asst. Prof. Department of Computer Science and Engineering Sri Jayachamarajendra College of Engineering Manasagangothri, Mysore 570 006,Karnataka, India. Email:-tnn@sjce.ac.in Tel:-0821-512568(college)
Abstract
Remotely sensed data is obtained through artificial satellites from various geographic and astronomical sources. Remotely sensed image data are well adopted for monitoring and analyzing the behavior of different covers and also the temporal changes occurring on the target. Observations are made by periodically collecting the data and then comparing previous data. Using various sensors, data is collected and further analyzed to obtain information about objects and areas under investigation. The number of features in each sample of the 'Remotely sensed data' or 'multispectral image' depends upon the number of channels from which the information is collected. Data from different channels are combined to obtain the detailed information about the body or area under observation. We propose an improvised method of classification of such data using gravitational symbolic clustering. Symbolic data is a special case of conventional data, which is closer to real life interpretation and analysis. In conventional data, the objects are individualized, whereas in symbolic data sets they are more unified by means of relationships. The multispectral image is a quantitative interval type of symbolic data. It requires lot of time and memory space to analyze or classify the data. In order to overcome such limitations a data reduction technique is proposed. The data reduction technique uses storage bin arrays to store useful information of the reduced image data. The major idea is to present a clustering algorithm based on gravitational symbolic approach. The procedure is based on the physical phenomenon in which a system of particles in space converges to the centroid of the system due to the gravitational attraction between the particles. Both Agglomerative and Disaggregative Gravitational Symbolic approaches are considered. The concept of mutual pairs is used to merge the samples. The process of merging reduces the number of samples each time they are available for consideration. The process terminates at a stage when no more mutual pairs are available for merging. A detailed study has been carried out on different sets of multispectral images. We have also compared our results using cluster indices. Introduction Symbolic objects are extensions of classical data types. In conventional data sets, the objects are individualized, whereas in symbolic data sets they are more unified by means of relationships (2-4,16-17). Gowda and Diday(2-3) have presented an agglomerative clustering algorithm clustering algorithm for symbolic objects. They form composite symbolic objects (CSO) using a Cartesian join operator whenever a mutual pairs of symbolic objects are selected for agglomeration based on minimum dissimilarity and maximum similarity. The combined usage of similarity and dissimilarity measures for agglomerative and divisive clustering of symbolic objects is presented by Gowda and Ravi(5-6). To manage time and memory requirements, the clustering method involves the use of data reduction techniques(11-12). Here the data reduction as proposed by K.C.Gowda and S.K.Prakash(15) is modified to accommodate the task without dimensionality reduction. We have also incorporated the gravitational clustering techniques to cluster multispectral images of satellites(7,10). We have also compared our results using cluster indices(4). Both Agglomerative and Disaggregative Gravitational Symbolic approaches are considered. The concept of mutual pairs is used to merge or split the samples. Data Reduction Techniques Data reduction techniques are used before clustering the data. The data reduction technique uses bin arrays to store useful information of the reduced image. Here the m samples of d-dimensions are mapped on to bin arrays. Two cases are: 1) When dimension d ł2 and 2) d = 1. The results of the reduced data must be mapped on to the original data after clustering the reduced data. This requires a set of labels of all the m samples, which identifies the location of each sample in the nonempty bins. This is used for making the classification-output map after clustering the reduced data. 1 When dimension d ł 2 This works by assigning d-dimensional m number of samples to any one of the 2-dimensional arrays. The d 2-dimensional R Ž R bin arrays (R is a positive integer as required) are used to store the useful information of all the features. The data before reduction requires normalization between 1 to R. The normalized features are assigned to the RŽR arrays. The idea that the first feature is used to determine the column position of the bin to which the sample is to be assigned. The combination of the remaining feature values is used to determine the row position of the bin. If the first sample have the feature values f11, f12, f13, ,f1d, which are normalized between 1 to R. This sample is assigned to a bin having a column value of f11. As this is the first sample, the row value of its bin is also 1. If the second sample have the feature values f21, f22, f23, ,f2d. Say if f11 = f21, then the bin to which this sample is assigned also has a column value of f11 and its bin has a row value of 1 if the below condition is satisfied:
The steps are as follows: If S1, S2, S3, Sd be the RŽR 2-dimensional arrays to store updated information of each feature and W is another such RŽR 2-dimensional such bin-weight array to update number of samples assigned to each corresponding bin.
This works by assigning 1-dimensional m number of samples to any one of the 1-dimensional arrays. The one 1-dimensional R bin arrays are used to store the useful information of all the features. The data before reduction requires normalization between 1 to R. The normalized features are assigned to the R arrays. The idea that the first feature is used to determine the column position of the bin to which the sample is to be assigned. If the first sample has the feature values f11, which is normalized between 1 to R. This sample is assigned to a bin having a column value of f11. If the second sample have the feature values f21. Say if f11 = f21, then the bin to which this sample is assigned also has a column value of f11. The steps are as follows: Let S1 be the R 1-dimensional arrays to store updated information of the feature and W is another such R 1-dimensional bin-weight array to update number of samples assigned to each corresponding bin.
Cluster Coglometrate and Global Coglomerate Strengths The cluster coglomerate strength (CCS) of cluster Cj is :
Where mj cluster weight of Cj Xm Composite object of Cj Dj max Xi Î Cj D ( Xi , Xm ) or maximum dissimilarity. s: a suitable positive number and r: a suitable negative number depends on the input sample set. The global coglomerate strength (GCS) of cluster at any level is defined as the sum of all the CCS of all the prevalent clusters at that level. Thus the GCS when cluster Cj is formed, is defined as:
Symbolic Gravitational Clustering Algorithm 1 Agglomerative Method
Image of Moon taken by Galileo space craft on December 9 1990, with catalog number PIA00113 is taken for testing. Size of image is 535 * 535 with 3 features. Both Agglomerative and Divisive Gravitational Symbolic methods are considered, results are as shown in figure 1 and 2 respectively . Plot of Cluster Index Vs Number of Clusters is also shown in figure 3. ![]() Fig 1. Classification Map and details of Classification Cover of Moon (Agglomerative method) ![]() Fig 2. Classification Map and details of Classification Cover of Moon (Divisive method) ![]() Fig 3. Variation for CI for Moon Image ( Series 1 Agglomerative Method Series 2 Divisive metod.) Conclusions We have proposed a clustering methodology to classify multispectral images. The algorithm is based on the physical phenomenon in which a system of particles in space converge due to the gravitational attraction between the particles. To accomplish space and time complexity requirements data reduction techniques are used. We have proposed both Agglomerative and Divisive methods of Clustering. Cluster indices for both the methods are shown in figure 3. Analyzing the plot (figure 3) it is evident that Divisive technique requires less number of iterations when compared to Agglomerative technique. The process ends at a stage when there are no more mutual pairs available for merging or splitting. References
| ||
|
|