Segmentation of High Resolution Imagery B. Krishna Mohan Centre of Studies in Resources Engineering Indian Institute of Technology, Bombay Powai, Mumbai 400076, India Tel: +91-22-25767684, Fax: +91-22-25723190 Email: bkmohan@iitb.ac.in S. U. Kadam Reliance Infocomm Ltd. Dhirubhai Ambani Knowledge City Navi Mumbai 400709, India Tel: +91-22-30373333, Fax: +91 22 2762 4213 E. P. Rao Department of Civil Engineering Indian Institute of Technology, Bombay Powai, Mumbai - 400076, India Tel: +91-22-25767345, Fax: +91-22-25723480
Abstract High resolution remote sensing is revolutionizing the way earth resources information is extracted from spaceborne sensors (Ikonos, Quickbird, Orbview, TES). Detailed information that was hitherto unavailable such as minor and major roads including widths and surface markings is now possible to derive from remote sensing. In medium resolution images (up to 5 metres resolution), using operators such as line detectors thin features like road networks could be extracted. However, at higher resolutions (1-metre and sub-metre), a road object is really an elongated feature of sizeable width that is far from negligible. Buildings are no longer point objects, but polygons with prominent shadows. In such cases the traditional approaches of image segmentation using per-pixel methods are bound to fail. There are two key issues to be considered when using high spatial resolution images - we see objects having structure (such as broad, elongated, compact, textured) as well as relationship with other objects (buildings have neighboring road objects). If we reduce the resolution and progressively increase it towards full, we see objects being under-segmented (at coarser resolutions) to over-segmented (at finer resolutions). However, the coarse-to-fine tracking provides global view initially and helps focus on specific details at local level at higher resolutions. This is quite similar to the way human vision system works - from a distance we first mark out broad classes within our view, and then proceed to study the objects of our interest in detail, by going closer to them. A general region based segmentation / classification system for remote sensing proceeds by initial extraction of regions at multiple levels of resolution. These regions are linked using a number of rules that govern the relationship between the object at a given level with other objects at the same level or other levels. Training of classifiers can now proceed the same way per-pixel classifiers do, with the difference that instead of pixels, each object or region is classified as a separate entity. In this paper, line based, texture based and region based segmentation techniques are applied to 1-metre IKONOS and 5.8 metre IRS -1C panchromatic remotely sensed images and the results are discussed. Introduction High resolution imagery from resources satellites are revolutionizing the generation of geo-spatial information and building up geographic databases. The amount of detail presented by the 1-metre and sub-metre resolution imagery enables analysis and mapping of the terrain to a level not attempted before. Extraction of information from the high resolution imagery is a challenging task since the per-pixel methods adopted here are likely to fail due to their inability to capture the increased natural variability in the reflectance, and also the fact that each landcover category is comprised of several spatially adjacent pixels. Therefore the methodology to be adopted in such a case has to be region based or object oriented. Another aspect of information extraction from remotely sensed images (for that matter from any images) is that the objects of interest exhibit tonal / textural structure at different scales. The meaning of scale here is that some objects may contain a large amount of fine detail, while some objects are (intensity-wise) flat with little variation across their spatial extent. Therefore the objects that indeed contain a lot of important detail are studied at one scale, while objects that have only noise induced variations are studied often at a coarser scale. Scale space approach to image analysis was introduced by Marr (1980) and has since then been extensively investigated. One of the most recent examples is the eCognition TM software (Definiens, 2003) wherein the image is segmented based on scale, texture, shape and color (tone). Image Segmentation by Texture Texture in images is an important cue for the purpose of segmentation and has been under investigation for over three decades. Among the statistical texture measures, none has been studied in greater detail than the second-order statistics based gray level co-occurrence matrix method proposed by Haralick et al.(1973). An early analysis of texture feature extraction methods can be found in (Weszka, 1976). Dengru et al., (1999) reviewed the existing methods for finding texture classification and they introduced new texture measures. A comparison of performance is made between structured feature and Haralick's features extracted from grey level co-occurrence matrices, they found that structure feature measures minimize the internal calculations with in the window and are relatively simple. The structured feature allows additional information extracted from Remotely Sensed imagery. Argenti et al., (1990) give a fast algorithm for calculating the parameters of co-occurrence matrices. Co-occurrence matrices are quite effective for discriminating the different textures but had a disadvantage of high computational cost. With the help of their fast algorithm they reduced computational cost and experimental results shows improvement of classification rates when compared to a single window classification. They have used these features for a pixel based classification giving particular attention to edge detection. A well-known signal processing method is based on Gabor filters. In this approach texture features are obtained by subjecting each (selected) multichannel filtered image to a non-linear transformation and computing a measure of energy in window around each pixel (Jain 1991). In model based texture analysis, images are often modeled as Markov random fields (Geman and Geman, 1990) or its variants. Analysis of natural textures has also been advanced by the use of metrics derived from fractal geometry (Mandelbrot 1982). Fractal based descriptions of image texture have been used effectively in characterization and segmentation of natural imagery. (Keller et al., 1989), shape from texture and medical images enhancement and classification. This fractal metrics provides accurate measures of roughness or irregularity in scale invariant forms. Some of the methods that are widely used are: (reviewed by Santhosh Kumar, 1996)
Region Based Segmentation Region Based segmentation is a technique by which the image is segmented by identifying regions of common property. This can be further divided into Region Growing and Region Split and Merge approaches. This paper focuses only on region growing, a procedure that groups pixels or subregions into larger regions based on some predefined criterion. The general idea of the region growing methods is to create a partition of the picture into homogeneous regions by any kind of process that starts with small regions and then grows them according to some homogeneity criterion. The object-oriented image processing techniques rely on successful image segmentation of image features based on contextual information such as texture, connectivity, and multiresolution hierarchy. Cheng et al. (2000) define image segmentation as a process of dividing images into different regions such that each region is, but the union of two different regions is not, homogeneous. Image segmentation methods are split into two main domains: knowledge driven (top down) methods and data driven (bottom up) methods (Gao Yan 2003). See also, Lobo (1997) for a philosophical discussion on the interpretation of objects in natural scenes, and Bock and Lessing, (2002). Baatz and Schape (2000) discuss a fractal net evolution approach for image segmentation wherein metrics are defined for merging regions with their neighbors. Multiscale Linear Feature Detection One of the approaches to linear feature extraction is to regard the image as a two-dimensional function z(x,y) of the spatial variables x and y; and extract lines from this function using differential geometry properties. Steger (1998) used Guassian kernels in his method. Steger proposed an approach to line detection that uses an explicit model for lines and line profile models of increasing sophistication. A scale-space analysis is carried out for each of the models. This analysis is used to derive an algorithm in which lines and their widths can be extracted with subpixel accuracy. The algorithm uses a modification of the differential geometric approaches to detect lines and their corresponding edges. Since Gaussian masks are used to estimate the derivatives of the image, the algorithm scales to lines of arbitrary widths while always yielding a single response. The Gaussian kernels are given by equations 1, 2 and 3. For discrete signals, the convolution masks are given by:
For the proposed line detector, the parameters that are taken into consideration are the line width w and its contrast h. To convert thresholds on w and h into thresholds the operator can use, first a s should be chosen such that
The Gaussian operator involves compromises, One has to make a judgement and compromise between losing line pixels and eliminating noise pixels without distorting the shape of lines. On the one hand, by using a large value of s, the noise is reduced but one loses genuine line pixels. On the other hand, using a small values of s, local information about lines is not lost but noise and other unnecessary details are enhanced. To overcome this problem, Bergholm (1987) introduced the concept of coarse to fine feature tracking known as edge focusing. He has applied this principle for edge detection. The steps in the line tracking algorithm based on Bergholm's approach are as follows:
A non-parametric method was first developed for contour extraction in grey images, which relied in defining the contours as watershed. (Beucher, 1992) This method has been considerably improved with the tools of mathematical morphology using watershed transformation. The non-parametric method of contour extraction using watershed transformation in the earlier stages lead to over-segmentation of images. Later a strategy called Marker Controlled Segmentation. In this approach, user defines object markers and background markers so as to differentiate between the two and later segment the image using the gradient image. The gradient image is often used in watershed transformation, as the main criterion of segmentation is image homogeneity. (Beucher 1992). Watershed analysis is well recognized as being useful for image segmentation and has been made computationally practical thanks to a fast technique presented by Vincent and Soille (1991). Watershed analysis uses an image's gradient magnitude as input to subdivide the image into low-gradient catchment basins surrounded by high-gradient watershed lines. The catchment basins consist of locally homogeneous connected sets of pixels. The watershed lines are made up of connected pixels exhibiting local maxima in gradient magnitude; to achieve a final segmentation, these pixels are typically absorbed into adjacent catchment basins. Results and Discussion In this paper an attempt is made to study different image segmentation algorithms from the point of view of segmenting images of different spatial resolutions including 5.8 metre IRS and 1-metre IKONOS images. The results of line detection using the IRS panchromatic image can be seen in Fig. 1 (a)-(d). The multiscale line tracking has been found to fill the gaps in the line map at sigma=3 and reduce the noise at sigma=1. Similar results can be seen with the 1-metre IKONOS images in Fig. 1 (e)-(h). To compare the textural segmentation and the line detection methods, the line network extracted is overlaid on the textural classification as can be seen in Fig. 2. The classes extracted using the textural classification include water, marshy land, open areas, densely built-up areas and sparsely built-up areas. ![]() Fig 1(a) Original Image - 5.8 metre resolution ![]() Fig 1(b) Line detection, sigma=1.0 ![]() Fig 1(c) Line detection, sigma=3.0 ![]() Fig 1(d) Multiscale coarse-fine line tracking ![]() Fig 1(e) Original image - 1-metre resolution ![]() Fig 1(f) Line detection using sigma=1.0 ![]() Fig 1(g) Line detection at sigma=3 ![]() Fig 1(h) Coarse-fine tracking ![]() Fig 2(a) Landuse classification using textural features ![]() Fig 2(b) Linear features overlaid on texture classification To compare region segmentation of the 1-metre IKONOS imagery, the fractal dimension texture image suggested by Sarkar and Choudhary (1995) using differential box counting was implemented and the result of clustering the original image and the texture image can be found in Fig. 3 (a)-(b). Again, the clusters indicate the categories such as building tops, water, roads, open areas and vegetation. The image is also segmented using a region growing algorithm based on the morphological watershed transform (Beucher, 1992). This result can be seen in Fig. 3 (c). The region segmentation visually compares well with the other segmentation results. A detailed classification system is being developed to utilize the watershed transform output as the first step, followed by computing the region statistics, region connectivity with neighboring regions and finally region classification using neural networks, maximum likelihood method, fuzzy and hard clustering. Integration of edge/line and region information is important for fusing information extracted about the image from different approaches. This approach is also being built into the above system. ![]() Fig. 3(a) Fractal feature using Box-counting ![]() Fig. 3(b) Classification based on texture ![]() Fig. 3(c) Watershed based Segmentation Acknowledgements This work is supported by a research grant from Indian Space Research Organization under the ISRO-IIT(Bombay) Space Technology Cell. Contributions to the work reported here from the research associates J. Kannan and G. Bimal Raj, and former students G.M.Bhat and M. Lakshmikant is acknowledged. The encouragement received from Head, CSRE during the course of this work is gratefully acknowledged. The IKONOS image used in this academic research was downloaded from the website of Space Imaging Inc., USA (www.spaceimaging.com) who have been producing magnificent quality high resolution imagery for civilian applications since 1999. References
| ||
|
|