Review of Digital Image Orthorectification Techniques
Rigorous physical sensor models are more accurate (Rosenholm and Akerman, 1998; Tao and Hu, 2001). An example is the orbital model of the satellite track and the rotation angles of the instrument for orthorectification of IRS images (Rosenholm and Akerman, 1998). However the development of generalized sensor models independent of sensor platforms and sensor types is very attractive. In a generalized sensor model, the transformation between the image and the object space is represented as some general function without modeling the physical imaging process.
Rational function model rectification
Generalized sensor models, such as the usage of the Rational Function sensor Model (RFM) (Tao and Hu, 2001), have alleviated the requirement to obtain a physical sensor model, and with it, the requirement for a comprehensive understanding of the physical model parameters. Furthermore, as the RFM sensor model implicitly provides the interior and exterior sensor orientation, the availability of GCPs is no longer a mandatory requirement. Consequently, the use of the RFM for photogrammetric mapping is becoming a new standard in high-resolution satellite imagery. This has already been implemented in various high-resolution sensors, such as IKONOS and QuickBird (Croitoru et al., 2004).
The RFM sensor model describes the geometric relationship between the object space and image space. It relates object point coordinates (X,Y,Z) to image pixel coordinates (r,c) or vice versa using 78 rational polynomial coefficients (RPCs). For the ground-to-image transformation, the defined ratios of polynomials have the following form (Croitoru et al., 2004):

Where (r
n,c
n) are the normalized row (line) and column (sample) index of pixels in image space; X
n) ,Y
n) and Z
n) are normalized coordinate values of object points in ground space; and the polynomial coefficients a
ijk, b
ijk, c
ijk, are called Rational Function Coefficients (RFCs).
Orthorectification reprojection
Orthorectification algorithms are often performed in conjunction with reprojection procedure, where rays from the image are reprojected onto a model of the terrain. Fundamentally reprojection can be done in two ways:
- Forward projection or direct projection
- Backward projection or indirect method
In the first case of forward projection, the pixels from the original image are projected on top of the DEM of the 3D model and the pixels’ object space coordinates are calculated. Then, the object space points are projected into the orthoimage. Because of the gap between the points projected into the orthoimage, due to the terrain deviation and perspective effects, the forward projection projects regularly spaced points in the source image to a set of irregular spaced points. Therefore they have to be interpolated into a regular array of pixel that can be stored in a digital image. This is why the backward projection is often preferred (Mikhail et al., 2001). If the corner of the orthoimage is placed at X
0) ,Y
0) , the pixel coordinate column and row,(c,r) in the orthoimage is found by (Mikhail et al., 2001):

Where
GSD is the Ground Sample Distance, which is the distance between each pixel, also referred to as the pixel size. Notice that the equation takes into account that pixel coordinate system has the Y-axis downwards and the world coordinate system has the Y-axis pointing upwards/north.
In the case of backward projection, the object space X, Y coordinates related to every pixel of the final orthoimage are determined. The height Z at a specific X, Y point is calculated from the DEM or the 3D model and then the X, Y, Z object space coordinates are projected in the original image in order to acquire the gray level value for the orthoimage pixel. Interpolation or resampling process in the original image is also essential because of the fact that the projected coordinates will not fit accurately at the original image pixel centers. In the backward projection instead of interpolating in the orthoimage, the interpolation is done in the source image. This is easier to implement, and the interpolation can be done right away for each output pixel. Further more only pixels that are needed in the orthoimage are reprojected. In the backward projection method, the pixel-to-world transformation is again given by (Mikhail et al., 2001) as

The position in the same image that corresponds to the found XYZ can be obtained by a sensor model (i.e collinearity equations as applied in the differential orthorectification method).
General orthorectification workflow
Most commercial off-the-shelf software (COTS) for software (i.e Erdas Imagine, PCI Geomatics, ENVI, ZI Imagine, LH System, TNT products, etc) contain comprehensive suite of products designed for performing tasks required in producing high quality, seamless digital orthophoto imagery products from aerial (standard and digital) and commercial satellite imagery. These software products include programs required to produce the input components for orthorectification including project setup, DEM interpolation/formatting, and GCP and tie point collection. These products can also be used to create imagery mosaics, and they have the ability for orthorectification using RPFs
Most of these software products currently support processing of ephemeris data and orthorectification for the following commercial imaging satellites:
- Landsat 4, 5 and 7
- SPOT 1,2,3, and 4
- IRS 1-A, 1-B, 1-C, and1-D
- AVHRR
- ASTER
- Ikonos
- Quickbird
In these software, project setup requires the user to specify project, name, input and output projections, formats (i.e. what projection the GCPs are being collected in), whether aerial or satellite modeling, type of sensor, etc.) Depending on the availability and format of the control points and the method used, GCPs can be input using a variety of methods including extraction from geocoded images and vector files, importing from text files and captured from hardcopy sources using a digitizing tablet. A minimum of four GCPs is required for each satellite image, although six to eight are recommended.
Tie points are produced by identifying the same pixel location in two or more images within their overlap areas. These software products can process tie points for an unlimited number of input images.
To get an overview of the control point residuals and identify possible outliers, these software provide the ability to generate residual reports for all scenes within a project or for individual scenes.
Mosaicking requires delineation of cutlines and, in many cases, radiometric adjustment of adjacent images to hide the seamlines and produce a more visually pleasing product. These software have both manual and automatic mosaicking options.
General orthorectification workflow includes the following tasks:
- Importing raw satellite imagery, including orbital metadata for a variety of airborne cameras and sensors
- Collecting ground control points from an existing geocoded images
- Refining the collected ground control points based on criteria such as high residual error
- Calculating the updated model for the images with the refined GCPs
- Given the model and a DEM (or an approximate elevation), generate the orthorectified images
Conclusion
This article has described, in a succinct form, some of the popular conventional techniques of orthorectification. These methods range from those with simple algorithm to those with sophisticated routines, such as the physical and the generalized models. Modern trend has shown more inclination to the use of generalized models. One of such state-of-the-art method is the application of RFM. In future RFM will have the advantage that the photogrammetric processing software can be kept unchanged when dealing with different sensor data since the generalized sensor model is sensor independent. For new sensors, only the values of coefficients in the generalized sensor model need to be updated.
Again since there are no functional relations between the parameters of the rigorous model and those of the RPF, the physical parameters cannot be recovered from the RPF and the sensor information can be kept confidential. Currently, in order to protect the undisclosed sensor information, some commercial satellite data vendors, such as Space Imaging Inc., only provide users with the RFM parameters but do not provide any information regarding the physical sensor model. Even under this condition, users are still able to achieve a reasonable accuracy without knowing the rigorous sensor models for photogrammetric processing.
References
- Croitoru, A. et al., (2004). "The Rational Function Model: A Unified 2d And 3d Spatial Data Generation Scheme". Proceedings of ASPRS Annual Conference, Denver, Colorado, USA
- Hemmleb, M. and Wiedemann, A., (1997). "Digital Rectification and Generation of Orthoimages in Architectural Photogrammetry". CIPA International Symposium, IAPRS, XXXII, Part 5C1B: 261-267, Göteborg, Sweden,
- Kraus, K., (1992). "Photogrammetry Fundamentals and Processes". Dummler Verlag, Bonn.
- Mayr, W. and Heipke, C., (1988). "A Contribution to Digital Orthophoto Generation." International Archives of Photogrammetry and Remote Sensing, 27, Part B11-IV: 430 - 439.
- Mikhail, E.M., Bethel, J.S. and McGlone, J.C., (2001). "Introduction to Modern Photogrammetry." Wiley, New York.
- Novak, K., (1992). "Rectification of digital imagery". Photogrammetric Engineering and Remote Sensing, 58(3): 339-344.
- Rosenholm, D. and Akerman, D., (1998). "Digital Orthophotos from IRS - Production and Utilization". GIS - Between Visions and Applications, ISPRS, 32, Stuttgart, Germany
- Tao, C.V. and Hu, Y., (2001). "The Rational Function Model: a tool for processing high resolution imagery". Earth Observation Magazine (EOM), 10(1).