|
GISdevelopment.net --> Application --> Miscellaneous
Land Cover Classification from Remote Sensing Imagery: Revisiting and Reevaluating Classification AccuracyRamita Manandhar Inakwu Odeh Faculty of Agriculture Food and Natural Resources The University of Sydney NSW, Australia Abstract The remote sensing community has long been interested in image classification of aerially sensed and satellite imageries for land cover maps as land use information is the basis for many environmental and socioeconomic applications. However, classifying remote sensing imageries to obtain land use and land cover information still remains a challenge that depends on many factors such as complexity of landscape, selected remote sensing data, image processing and classification methods, etc. In most cases, land cover maps derived from remote sensing imageries are often judged to be insufficient in quality and thus are not considered reliable for quantitative environmental applications. This has also led to the questioning of the suitability of remotely sensed imageries for thematic mapping. Heavy dependence on spectral characteristics or black and white tone is the one of the main reason for poor reliability of classified map. The use of ancillary data before or during or after classification is one way of improving the classification accuracy and reliability of the resulting maps. In this paper we applied the most popular Maximum likelihood classifier for the classification of the land cover of Lower Hunter region of New South Wales, Australia, using Landsat-TM for the year 2005. The major land cover and land use types are Woodland, Pasture and scrubland, Vineyard, Built-up and Water body. For classification purpose seven classes (Woodland, “pasture and scrubland”, Vineyard, Builtup, Water body, “Mine and quarry” and Olive) were identified. For accuracy assessment, only the five major classes are considered while maintaining enough sample size for each land cover class. However, while built-up and vineyard land cover types were found to have a high commission error, the “pasture and scrubland” land type is characterised by high omission error. By applying post-classification sorting using ancillary data, such as DEM, land use boundaries, roads, along with spatial texture and a vegetation index, the overall classification accuracy was improved from 79.5 % to 85.4 % with overall Kappa statistics from 0.74 to 0.81. The individual class user’s accuracy of the post classification corrected map ranged from 73.1 % for vineyard to 96.1 % for water body. Therefore, we conclude that post classification refinement with the use of ancillary data is effective in improving the accuracy of land cover maps. Introduction Remote sensing community has been interested in image classification of remote sensing imageries as classification results are the basis for many environmental and socioeconomic applications and to bring the satellite imageries to usable geographic products (Lu and Weng, 2007; Wilkinson, 2005; Foody, 2002). However, classifying a remote sensing imagery still remains a challenge that depends on many factors such as complexity of landscape in a study area, selected remote sensing data, and image processing and classification approaches etc (Steele et.al, 1998; Lu and Weng, 2007). Most of the time, land cover maps derived from remote sensing are often judged to be insufficient in quality and thus not trusted for quantitative environmental application purpose (Stow et.al., 1990; Foody, 2002; Wilkinson, 2005) thus leading to questioning of the spectral and radiometric suitability of remotely sensed data sets for thematic mapping. This means that fairly specific types of change must be identified using aerial photography and ground reconnaissance (Stow et.al., 1990). Wilkinson (2005) based on a review of 15 years of peer-reviewed experiments on satellite image classification, observed that, there has been no demonstrable improvement in classification performance over the 15 years period though a considerable inventiveness occurred in establishing and testing new classification methods during the period. He even raised doubts about the value of continued research efforts to improve classification algorithms in remote sensing. Jensen (2005) opined that there is no surprise of low reliability of remote sensing classification as 95 % of the scientists attempt to accomplish classification only using one variable i.e. spectral characteristic (color) or black and white tone. However, some of the researchers have started utilizing ancillary data in combination of remote sensing data to improve classification accuracy (Stefanov et.al., 2001; Watson and Wilcock, 2001; Xiuwan, 2002; Abdul-salem, 2002; Currit, 2005; Yuan et.al., 2005; Judex et.al., 2006). This article is trying to show usefulness of ancillary data in improving the classification of satellite imageries. TM/ETM+ are the most frequently used data sets at a regional scale (Lu and Weng, 2007; Stefanov et.al., 2001; Yang and Lo, 2002; Judex et.al., 2006; Yuan et.al., 2006) due to their relatively lower cost, longer history and higher frequency of archive. Information regarding the land covers over time and space is a fundamental requirement for environmental monitoring in order to prevent from detrimental environmental impacts before it becomes irreparable. In Australia, little research has been undertaken on land covers change especially in the eastern fringe of Australian continent where changes had occurred more drastically. This study is broadly aimed to fill this gap, and the study site is the Lower Hunter Valley, a well known tourist destination, within the Lower Hunter Region of New South Wales, Australia. Landsat thematic mapper imagery of the year 2005 were classified with the most widely used parametric classifier, maximum likelihood (ML) decision rule combined with a few ancillary data (e.g. DEM and knowledge of the locality, Land use data, vegetation index and textural analysis of the landsat imagery) through an expert (or hypothesis testing) system to improve the classification accuracy. The aim of this paper is therefore to test the hypothesis that the use of ancillary data could lead to improvement of land use classification. This premise is particularly pertinent because good quality satellite imageries of the study region are not available due to cloud cover and atmospheric haziness- fairly common phenomena in the study region. Materials and Methods Study area The study area is located in the lower Hunter region of New South Wales Australia, about 160 km north of Sydney. The key industries of mining, winery and tourism which are the economic engine of the area, are contributing to rapid economic growth. The study area, also called the Hunter Wine Country Private Irrigation District (HWCPID), covers approximately 379 km2. It is located within an undulating plain of the lower Hunter valley, centred on little town of Pokolbin. Geographically it lies between 151°09’43” E to 151°24’58”E Longitude and 32°37’21” S to 32°51’45” S Latitude (Fig 1). The area has been gaining popularity as a tourist attraction due to winery, stretching grape vineyard beyond the horizon, and golf courses. Pokolbin’s image of a bucolic rural landscape with its varied mosaic of vineyards, pastures, scattered woodlands and wineries, is being threatened by overdevelopment (Holmes and Hartig, 2007). Therefore there is the concern for environmental protection of the region. In spite of the importance of land cover for environmental modelling and planning, our knowledge of land cover/land use and its dynamics for the region is limited. ![]() Fig 1. Map of NSW with the study region Land user types for the area: In HWCPID land use ranges from viticulture and dairying to extensive grazing and forestry (Robinson and Helyar, 1996). Pastoral systems have been the dominant agricultural land use in the region for past 100 years and grape vines was started in the 1820s, but has expanded to 3500 hectares of vines today with an annual crush of 35,678 tonnes of grapes (Hunter valley wine country tourism paper distributed by the information centre in Cessnock). The other land uses include livestock production for beef, orchard, and vegetable production. In order to protect the booming wine grape cultivation from drought, the HWCPID was established. The Pokolbin Pipeline Project (PPP) consists of about 128 km of network PVC pipes pumping 5100 million litres annually from Hunter River. The network was designed to supply water to nearly 400 properties spread through out the project area (information collected from Mr. Ken Bray, operational manager of HWCPID, during the field visit to the site). Data sets A 2005 subscene of path/row 89/83 of Landsat 5 Thematic Mapper (TM) was procured from the Australian Centre for Remote Sensing. As it was difficult to obtain cloud-free Landsat imagery for the period between November and March, the actively growing period for vineyard, the TM image was obtained for June 8th, 2005. Orthorectified aerial photographs of the period 2004-06 was also procured from Plateau Images, Alstonville, New South Wales, which were used as reference image for selecting training sites and for validating the classification results. Land use map of Singleton covering the study area projected to GDA 1994 was also obtained from the Department of Natural Resource; this map contains vector layers of land covers. The meta-data specified the data set belonged to land use between June, 2000 and June, 2007. Preprocessing: The procured landsat TM image of 2005 was projected to WGS 1984 system, which was reprojected to GDA 1994, and then clipped off for the study area. The orthorectified aerial photographs were mosaicked together for displaying as one sheet. Initial land cover classification based on maximum likelihood algorithm In this study we adopted the maximum likelihood classifier (MLC), the most widely adopted parametric classification algorithm (Jensen, 2005; Bailly et.al., 2007; Liu et.al., 2002; Currit, 2005; Weng, 2002). It is the optimal choice if the assumption of a normal distribution (in feature space) for each class training area is correct (Liu et. al., 2002; Currit, 2005). The algorithm is based on probability distributions and decision rules which assume the data values to be a set of multivariate normal distributions. MLC assign a particular class to each pixel based on the shortest modified “Mahalanobis distance” of the pixel from the class mean. The algorithm also considers shape, size and orientation of the training samples. The total number of land cover classes delineated by the classification is seven considering the characteristic of satellite data used, and knowledge of land use of the study region (Table1). Table1. Land cover classes delineated for the classification.
Jensen (2005) suggested that it is not appropriate to attempt to derive some of level II classes (US Geological Survey Land-use Land-Cover Classification System) using Landsat TM data due to issues of spatial resolution and interpretability. Pastures could not be separated into irrigated and non-irrigated ones. There were very small patches of “Mine and quarry” and olive. Though olive covered very small area, we tried to delineate it due to its growing coverage in recent years. Processing was done as follows:
Post classification refinements were applied to reduce the errors caused by similarity of spectral responses of certain classes such as “Pasture and scrubland” and Vineyards and “Pasture and scrubland” and Builtup. Based on the accuracy assessment of the initial classified map, omission and commission errors need to be reduced to improve the accuracy of the map produced. To reduce the commission error of Builtup class, textural analysis of the landsat data was utilized. Additionally in order to reduce the commission error of classifying vineyard, boundary vector of military area, and western state forest were used in the expert system classification. Texture analysis Builtup areas typically have significant texture resulting from buildings and street grids, whereas homogenous areas such as vineyards have little to no texture. Stefanov et.al. (2001) had successfully utilized texture analysis of landsat imagery for improving the classification accuracy of urban centres. Here, we performed the texture analysis for the TM band 3 using a 3 X 3 moving window and the variance [V; Eq. (1)] (Erdas Field Guide, 2005): ----------------------[1]where xij= DN value of pixel (i, j); = n number of pixels in a window; and Mis the Mean of the moving window which is defined as [Eq.(2)] ----------------------[2]Normalized Difference Vegetation Index (NDVI) Normalized Difference Vegetation Index (NDVI) is the most widely used vegetation index to distinguish healthy vegetation from others or from non-vegetated areas. NDVI was derived using the expression [Eq.(3)]: NDVI = (NIR-R) / (NIR+R)------------------------[3] where NIR= Near infra red (band 4 of Landsat TM image); R=Red (band 3). Expert system classification
![]() Fig 2. The left white box is the hypothesis being tested, the ellipses represent the conjuctive decision rules and right coloured boxes represent the variables used. MLC did not produce encouraging result for classifying land covers especially for Builtup and Vineyard classes. The accuracy assessment of the classified map resulting from the initial classification with MLC showed high commission error for the Builtup and Vineyard classes, meaning that there is a probability (proportionate to the error) that pixels classified as builtup and vineyard may not actually exist on the ground. “Pasture and scrubland” had high omission error, meaning that there is a probability (proportionate to the errors) that ground reference points for this class is classified incorrectly (Table 2). As stated earlier, the imageries were acquired during the dormancy period of vineyard cultivation; it was therefore very difficult to spectrally distinguish land covers of vineyards from that of pasture area with scanty vegetation specifically in the North West of study area (military area) and also vineyards with western rocky mountainous forest (Fig.3.a). Additionally, builtup areas also were overestimated. ![]() Lu et.al. (2003) found that most of the time, the traditional approach to classification only distinguishes clearly between forest and non forest land covers. In this study, the classification accuracy for Woodland and Water body were found to be good just using the traditional MLC. But for other land cover classes, the MLC performed quite poorly. Jensen (2005b) also reasoned that the low reliability of remote sensing classification of land use and land cover is because of heavy dependence on only one variable; spectral characteristic (colour) or black and white tone. Although MLC is one of the widely used classifier, it requires input samples to have a normal distribution and heavily dependent on statistics of data. The new thinking is to let the geographical data itself “have a stronger voice” rather that let statistics derived from dataset dictate the analysis (Jensen, 2005). Expert system allows integration of remotely sensed data with other sources of georeferenced information such as previous land use data, spatial texture, and digital elevation models (DEMs), slope, aspect, geology, soils, hydrology, transportation network, vegetation to obtain greater classification accuracy (Stefanov et.al., 2001; Judex et.al., 2002; Jensen, 2005). Therefore expert system classification was constructed for post classification sorting and improvement of accuracy of the initial classification map to reduce the errors of commission and omission. The accuracy assessment of the final map corrected with the expert system showed an increase in overall accuracy from 79.5% to 85.4% and increased overall Kappa statistics from 0.74 to 0.81 (Table 2 and 3). The commission errors of the Builtup and Vineyard classes (as indicated from improved user’s accuracy) and omission error of the “Pasture and scrubland” class (as indicated by the increased producer’s accuracy of the class) were also reduced. The misclassified patches of vineyards in the western forest as well as in the military reserve have disappeared in addition to the reduction of overly estimated builtup patches resulted from initial classification (Fig 3.b). Overestimation of builtup was noticed in the area of low builtup areas in the initial classification; therefore they were reduced using the logic that only the builtup of the initial classification map with a textural value = 5 are reclassified as builtup in the final classification. No modification from the initial classification was performed for heavily builtup areas as they also were found to have lesser textural value at some regions due to their homogeneity. Likewise, commission error of Vineyard class was reduced removing the vineyard patches in the military reserve and western forest. Additionally, vineyards and olives at high elevation were removed utilizing DEM of the region as these classes are not expected at higher elevation. NDVI is another widely used and useful index for the classification and improving the classification. NDVI for water body is usually a negative value, while areas of healthy vegetation have high NDVI. Builtup and vineyard in dormancy stage also are found to have NDVI < 0.15, except in the vineyard areas with grass covers on ground. With the use of various ancillary data of the area, we were able to get more than 70 % user’s accuracy for the individual classes of the final map. ![]() Fig 3. Initial classified map with MLC (a) and the final map after correction using ancillary data (b). ![]() Conclusion The expert system of classification allows the integration of remotely sensed data with other sources of georeferenced information, such as land use, spatial texture and DEM to obtain greater accuracy. Here we have used the widely adopted maximum likelihood classifier for initial classification and then attempted to improve the classification accuracy incorporating additional data, such as land use, DEM, spatial texture and NDVI value of the landsat imagery for post classification correction in the expert system classification and conclude that the incorporation of ancillary data along with the spectral classification is beneficial for improvement of land cover maps. References
|
||||||||||||||||||||||||
| © GISdevelopment.net. All rights reserved. |