ASPECTS OF THE DSM PRODUCTION WITH HIGH RESOLUTION IMAGES
Charles Lemaire / Karl-Ludwig Lothhammer
INPHO GmbH
Germany
Karl-Ludwig.Lothhammer@inpho.de
ABSTRACT
An approach is introduced to derive digital surface models (DSM) from high resolution
arial images. The dependence on high quality input data and dense DSM will be
described as well. Well known MATCH-T has been redesigned to fulfil sophisticated
requirements on highly densed DSM data. Therefore a sequential multi-image matching
algorithm has been intrduced into MATCH-T, which will be from now on called
MATCH-T DSM. Point extraction is no longer based on static models, but on
computation units choosing best suited pairs of images. Thereafter, a point cloud is
derived by each unit to be filtered by robust analysis. Two case studies show precision
and completeness of derived surface models by MATCH-T DSM.
1 Introduction
Matching algorithms undergo a renaissance. Increasing demand on high accurate
and low price DSM cause this resurgence and can hardly be compared with
previous matching strategies. Current technologies offer much more computing
power and the introduction of state-of-the-art digital photogrammetric cameras have
revolutionised traditional photogrammetric workflow as well as project scale and
overlap. Requirements for production of very dense DSMs from high resolution
imagery, which means rersolution higher than 20cm, will be described within this
paper. Further, the changes of matching technologies to achieve high quality results
will aslo be discussed.
2 Input and data compilation
The quality of a dense DSM depends on the quality of input data and data
compilation. The input data are images, orientations and camera calibration data.
Additionally, results will be influenced by the stability of the hardware and flight
planning. Since MATCH-T DSM can correlate with sub-pixel accuracy, it is
essential to use a digital metric camera with reliable stability and precision.
2.1 Overlap
The traditional photogrammetric workflow is based on 60% forward- and 30%
side overlap. This standard overlap creates occlusion areas and reduces the
redundancy of image information. Overlaps of 60/60 or 80/30 allow just enough
redundancy for the DSM extraction as 4 images cover any open area. Only very
high overlap configurations like 80/60 or 90/70 reduce significantly the
occlusion areas in forest- or urban areas.
Most of the new digital large frame cameras have a non-square format, hence
the viewing angles along the line of flight and across the line of flight differ.
The longer side which has the larger angle of view exhibits more occlusions
than the short side. Thus the camera should be mounted in a way that the
smaller side of the sensor is perpendicular to the flight direction.
The high overlap allows a higher probability of successful matches, as the
features are very similar. On the other hand the base line is smaller so the height
accuracy is lower. That means that the extraction needs both: Models with high
overlap in order to minimise the occlusion areas, and models with large base
lines to get better height accuracy. Hence the DSM quality has two facets height
precision and completeness.
2.2 Direct geo-referencing
Direct geo-referencing has become more and more popular in the last decade.
Direct geo-referencing is mainly used for orthophoto production. With high
resolution images an aerial triangulation is still mandatory because the sub-pixel
precision potential of the high resolution images cannot be achieve with direct
geo-referencing [Cramer 2005]. Without this accuracy, the DSM extraction
performance is reduced.
2.3 Grid correction
Insito calibration is more and more done to minimise the effects of the sensor
instability. For lower precision photogrammetric production the insito camera
calibration is not mandatory, as the achieved correction is within sub pixel
range. For high accurate matching with high overlapping images this correction
reduces the noise of the point cloud because remaining image errors caused by
the sensor instability are better compensated [Cramer 2007]. Usually, the
benefits of the self calibration are mostly visible at the model border and
corners.
2.4 Ground sampling distance
As the large frame and push broom digital cameras have a fixed focal length the
only way to modify the GSD is to change the flying height but this also changes
the perspective of the images. Therefore high resolution digital image capture is
traditionally flown at low heights, but here the amount of occluded areas rises
quickly.
Of course a strong overlap of 80% reduces the amount of excluded areas.
Nevertheless, because of the perspective changes, the image features are less
similar than if it were captured from a higher altitude. This reduces the
matching accuracy and augments the risk of miss matching. In general it can be
said that DSM extraction from high resolution images is more complicated than
DSM extraction from middle resolution digital imageries. The situation may
change with the introduction of digital cameras with a smaller angle of view.
3 DSM extraction method
INPHO’s automatic DTM derivation tool MATCH-T DSM has been redesigned to
produce very dense DSM data. The most important improvement was the
introduction of the sequential multi-image matching and a new robust algorithm for
point filtering.
3.1 Short review of the MATCH-T method
The automatic DTM generation approach in MATCH-T is mainly characterised
by the feature-based matching technique being hierarchically applied in image
pyramids and a robust surface reconstruction with finite elements.
For DTM extraction the measured 3D points, together with curvature and
torsion constraints are introduced as observations. The weights for the curvature
and torsions observations both regularize and smooth the DTM.
A complete description of the MATCH-T design can be found in Krzystek, P.
and Ackermann, F., 1995.
3.2 Introduction to the MATCH-T DSM method
The key idea of the MATCH-T DSM method is the automatic measurement of
an extremely large number of irregularly distributed surface points. Robust
statistics can successfully eliminate gross error to reduce the noise of the point
cloud, as long as most of those points represent the surface and outliers caused
by mismatches or displacement in the scene deviate from the majority of “good”
points in a statistical sense.
3.3 Sequential Multi-Matching
In order to increase the amount of 3D points, the point extraction is no longer
based on static models, but on computation units. Each computation unit in
MATCH-T DSM chooses the best suited image pair. Each image pair delivers a
point cloud. The combined point clouds are filtered by a robust analysis. INPHO
calls this extraction method sequential multi-matching.
3.4 From FBM to LSM
The previous MATCH-T versions used feature based matching for the autocorrelation,
where sub-pixel precision is up to one third of a pixel. In order to
improve the matching precision, LSM can be optionally selected in the new
MATCH-T DSM version. The improvement in height accuracy of the raster is
about 20%, but computation time increases by a factor of two, thus LSM is
optional. It is up to the user to decide whether 20% accuracy improvement is
worth spending that extra time.
3.5 Model selection
The selection of the best suited image pair is based on the analysis of the DSM
slope. The algorithm chooses images that have the best viewing angle of the
matching unit. The algorithm allows a limitation of the number of models which
are used for the DSM extraction in one matching unit. Indeed with high
overlapping images, the amount of image pair combinations increases quickly
by ½ (n-1) (n) with n the number of images.
As a significant parameter, the model azimuth direction has been selected. The
point extraction is made in 6 main directions. If one model delivers not enough
3D points then MATCH-T DSM selects the next best suited model for this
azimuth.
It is possible that some matching units do not have any texture. For this reason,
MATCH-T DSM analyses the quantity of extracted 3D points and recognizes if
the image area has poor or no texture. Hence, MATCH-T DSM tries up to 20
models combinations per matching unit.
3.6 True 3D filtering
Filtering must be used to eliminate mismatched points. Such filtering is a
classification in correctly matched points. Often, the filtering is performed using
an interpolation of the terrain surface because the end product is a DTM. Thus
MATCH-T has used a finite element interpolation in order to filter the point
cloud. This interpolation describes a 2.5D surface. The finite element filtering
has to choose one Z value for one X,Y coordinate pair. It is well suited for DTM
extraction but the real world is 3D. This method cannot be used to extract 3D
Surface Models (3DSM): the extracted point cloud of MATCH-T DSM delivers
a true 3D representation (figure 1).

Figure 1 True 3D filtered MATCH-T DSM point cloud from aerial images
The new filter algorithm of MATCH-T DSM works in 3D and can select more
than one Z for one X,Y coordinate pair. A statistical analysis recognizes points
with high redundancy and then selects those with the best accuracy. The filter
realizes both a noise and data reduction without loss of information.
3.7 Point distribution
Figure 2 illustrates the 3D point distribution. One can recognize that the
distribution is similar to an image that has been processed with an edge
detection operator. Indeed, as MATCH-T DSM uses the ‘Förstner operator’ to
extract points, this point distribution is as expected.

Figure 2 Filtered MATCH-T DSM point cloud distribution
Once the point cloud has been filtered the distribution is more regular but areas
with poor textures are still easy to recognize.
4 Case studies
INPHO has made two case studies using different digital camera geometries and
different GSDs. The goals have been to determine the accuracy, the completeness
and the reliability of the MATCH-T 3DSM point cloud. In each case, the analysis
has been made with high resolution images, the image orientation parameters have
been determined by aerial triangulation.
4.1 Case Study 1: 80/30 compare to 80/60 Overlap
This case study compares the quality of DSMs extracted from two project
configurations using the same imagery. The project with 80/30 overlap has been
derived from the 80/60 project by omitting each second strip.

Information about the project is given in table 1.
The result summarized in the table 2 shows clearly the benefit of the higher side
overlap. The amount of extracted points is twice, the final point density is
almost 50% higher. With a completeness of 93% the DSM covers effectively
the complete surface. Only poor textured areas are not covered. The precision is
significantly better and the mean Z offset is considerably reduced. Thanks to the
high resolution images, the point density is very high. Such a point density for
photogrammetric products is unconventional and opens new fields of research
and applications.

Table 2 Summary of the results of case study 1
4.2 Case study 2: MATCH-T DSM from ADS40 compared to ALS 50 First pulse
point cloud
This case study estimates the accuracy of the MATCH-T DSM point cloud from
a reference surface. This surface model was generated from LIDAR first pulse
point cloud using SCOP++ software. The information about the project is given
in table 3.

Table 3 Input information of case study 1
From the filtered MATCH-T DSM point cloud a reduced point cloud was
obtained. For each point a height difference to the interpolated LIDAR surface
is computed, from those differences the accuracy of the MATCH-T DSM point
cloud has been estimated.
LIDAR data are used as reference because at this image scale the accuracy of
the interpolated surface from the LIDAR points is higher than the MATCH-T
DSM point cloud. The result is given in table 4.

Table 4 Summary of the results of case study 2
As expected the MATCH-T DSM point cloud is more accurate on the flat
terrain than on the roof surfaces. But height differences on sloped surfaces like
roofs do not directly correspond to the residual error, which is measured
perpendicular to the sloped surface. Furthermore MATCH-T DSM delivers a
point cloud that contains approximately twice as many points as the comparable
LIDAR flight. Thus some deviations in the comparison between LIDAR and
MATCH-T DSM result from small structures like chimneys and jutties which
are not always completely represented in the LIDAR point cloud or the
MATCH-T DSM.
The high percentage of accepted points shows that the MATCH-T DSM point
cloud is very well filtered. The few gross errors can be eliminated through a
second filter process.
The achieved mean accuracy corresponds to a matching accuracy better than
half of a pixel. Then, the accuracy of the MATCH-T DSM point cloud is well
suited for automatic building generation or high precision DTM production
from high resolution images.
5 Conclusion
This paper has shown that MATCH-T DSM delivers a highly reliable and highly
accurate result. The point cloud extracted with MATCH-T DSM from high
resolution images delivers a better 3D representation than a traditional raster. The
point cloud extracted with MATCH-T DSM is well suited for building extraction
(figure 3), high accurate DTM production and object recognition. The studies show
that MATCH-T DSM is competitive to LIDAR for large surface DSM production
especially if coupled with high resolution orthophoto production. One can consider
MATCH-T DSM as a passive point scanner, the measurement speed only
depending on office computing resources.

Figure 3 Extracted building from MATCH-T DSM point cloud
6 References
Cramer, M., 2007. The EuroSDR Performance Test for Digital Aerial Camera
Systems, Photogrammetric Week '05, Wichmann, Heidelberg, pp. 79-92.
Cramer, M., 2005. 10 Years ifp Test Site Vaihingen/Enz: An Independent
Performance Study, 50 Photogrammetric Week, Stuttgart, Wichmann (.ed),pp. 89-
106.
Ackermann, F. and P. Krzystek, New Investigations into the Technical Performance
of Automatic DEM Generation, Proceedings 1995 of ACSM/ASPRS Annual
Convention, Charlotte, NC, Vol. 2, pp. 488-500.