Have Data, Will Travel: A Data-Centric Approach to Enterprise Systems Development
John A. Zumbado
GeorgiaPowerCompany 241 RalphMcGillBlvd., Bin 20020 Atlanta,GA30144-3374 Email: jazumbad@southemco.com PhillipA. Naecker GatekeeperSystems,Inc. 1010E. UnionStreet, Suite 101 Pasadena,CA9110-69756 Email:pan@gatekeeper.com WilliamH. Iller OrbitalImagingSystems, 21700AtlanticBlvd. Dunes,VA20166 Email:mailto:iler.bill@orbital.com Introduction-The Data Centric Approach Applied To AM/FM/GIS The development of a large AM/FM/GIS system is always a challenge. Such systems often involve a large number of diverse data sources, with complex interrelationships between the data elements. These data sources typically include those both internal and external to the enterprise. Moreover, AM/FM/GIS systems are often burdened with very broad application requirements, including complex spatial analysis and display operations. To make matters more difficult, large AM/FM/GIS systems often involve interaction between multiple-user departments and support organizations. Traditional application requirement definition and deployment approaches involve extended requirements gathering, business process reengineering, infrastructure upgrades, and a plethora of other activities. The result is typically a lengthy development process fraught with delayed deployment of critical information systems. These delays often result in a failed or much under-delivered project. This paper describes an approach that enables the rapid development and deployment of AM/FM/GIS systems capabilities in the face of such challenges. This approach has been successfully used in implementing large-scale, enterprise-wide AM/FM/GIS projects in very short timeframes. The data-centric approach, as this new methodology has been dubbed, focuses principal attention on delivery of geospatial data and related tabular data to end users rather than providing new geospatial data or new geospatial analysis applications. This approach concentrates on identifying, collecting, coordinating, and linking together enterprise data resources, and then providing the widest possible access to this spatially enabled data via Internet tools. The business driver for this approach is that properly organized data has tremendous value to workers throughout an enterprise. Organizations gain business value by quickly providing data access instead of waiting for the development and implementation of complex applications. With the data in place and accessible, increasingly complex applications can evolve as the business need is identified and addressed. The principal components of the data-centric approach are:
Experience to date with the data-centric approach applied to AM/FM/GIS applications development has been exceedingly positive. This success has spawned some observations as to why this has been the case. These are:
The data-centric approach has already gained acceptance in all corners of the enterprise. Analyzing organizations that rapidly adopt this approach reveals these key business drivers:
As with other systems development methodologies, the data-centric approach has some inherent risks, but these are minimized with its proper application. Typical risks are discussed below: Risk Number 1: Cost and Schedule Inflation Ballooning costs and schedule creep is probably the greatest risk for any AM/FM/GIS project. The authors have found that a data-centric application development approach has a lower risk of these undesirable effects than other software development approaches, for some fimdamental reasons. First, the data-centric approach avoids complex "complete" applications. Most of the benefit of many GIS applications is achieved simply by "getting the data out" to the user. In the data-centric approach, the data is typically organized and deployed in a map-enabled Web environment Complex GIS processing systems and extensive requirement analysis are avoided. Second, the data-centric approach is centered on deploying existing spatial data, and spatially existing tabular data. These tasks are fundamentally less risky because the data sets involved have been previously captured and are already understood. Third, a phased-in or departmental approach can be used where specific data is added or brought on-line. Departmental fimding is typically less costly and the results of the effort can be demonstrated more quickly. Risk Number 2: Hostile Res~onse from Traditional IT or GIS Organizations Traditional information technology organizations generally prefer a highly planned and carefully managed approach to applications development. For many, the risks of deploying an application that does not fit into the long-range IT plan are dominant decision factors. The data-centric methodology is an advocate of proper systems planning, but provides a mechanism to build upon existing company assets (data), without having to construct a complete system to do so. Likewise, traditional GIS organizations may feel threatened by the data-centric approach. In the data-centric approach, the idea is to get the data to the end users as quickly as possible, with minimal GIS applications and tools. This can cause consternation among GIS data producers; exposing data to end-users will create additional demand for data maintenance, and potentially reducing the end-user's dependence on the GIS department for map production. Risk Number 3: Data Source Instability Central to the data-centric approach is the availability of quality data. One of the greatest problems the authors have encountered in deploying data-centric applications is reliability of the data sources. Other departments or organizations develop the data sources, often for completely different purposes. Oftentimes, departments that produce GIS and spatial data are not managed and staffed by data processing professionals familiar with change control and data interchange. Mitigating this risk is difficult. Experience has shown that the best strategy is to work with the other departments at the point where management structures intersect. Most departments welcome the opportunity to demonstrate to management that their work-products are being broadly used in the rest of the enterprise. Risk Number 4: Confusion about GIS Vendors and Tools At its core, the data-centric application development methodology is a rapid application development (RAD) technique. The idea is to quickly deploy a solution that will satisfy a large portion of the organization's demand for data. Any resistance to moving quickly is detriment to the data-centric approach. A slow data-centric development project misses the point of this approach. Software vendors, on the other hand, are interested in keeping a customer within a family of tools. If an organization wants to choose a software tool from outside their product line, vendors will encourage them to wait for a tool that "integrates fully" with existing tools and data. Vendors may attempt to insert fear, uncertainty, and doubt (FUD) into development plans. The author's experience has shown that the data-centric approach enables data to be deployed in much less time than vendors can deploy new products. Therefore, it is recommended to adhere to a data-centric application development methodology and ignore the pleas of software vendors, and simply choose whatever tool can most quickly deploy the data. Software will come and go, but the data is the organizational asset that will be used forever. How to do it: management, tools, candidate applications and techniques Many people may now be convinced of the benefit of a data-centric application development methodology for geospatial information, and they may be ready to try this approach for their next (or current) AM/FM/GIS project. The authors have found that the approach works well in environments with a small number of essential elements. Project Selection First and foremost, the data-centric approach works best when there is a clear and very short-term requirement to address by "getting the data out". Projects where a single, bounded business need can be satisfied by providing access to a variety of data sources, both spatial and non-spatial, without the need for extensive data conversion or complex processing. Data Modeling Ensure that the project team is skilled and experienced in developing solid data models and rapidly implementing those models. The data-centric approach depends on a well-structured database that can link together both spatial and non-spatial elements. Therefore, one key tasks is to bring together all targeted data resources into a single, integrated environment. Remember that a "publishing" environment is being built not a data maintenance environment, so data resources should be organized around web-based deployment and integration with maps not around data maintenance tasks. Tools Success in the rapid deployment of geospatial data is tied to the use of extremely lightweight and widely deployable mapping tools specificallyy tools that work well within Web browsers. Previous generations of GIS tools have required too much training and were to complex to allow for simple operation by casual users. Additionally, these GIS's were too costly to be widely deployed, did not work well over the Intranet. Web-based tools that can display maps and let the user interact and query objects on the map are key to the wide deployment of geospatial data. An obstacle to look out for is vendor-initiated "tool wars", wherein vendors argue that organizations must use one tool or another in order to maintain an integrated GIS capability within an organization. It is recommended that users utilize whatever tool will allow the organization to quickly deploy data to the entire enterprise. Experience has shown that project teams should be able to widely deploy geospatial data integrated with large relational databases in a few weeks to a month. If this goal is unattainable, the project is probably being unreasonably constrained by its soflware tools, by the skills of its development resources, or both. Identifying Candidate Applications As mentioned above, the authors have found that the best candidate applications for a data-centric geospatial application development are those that focus on a narrow and very short-term critical business need. Applications that are part of a grand, "strategic" vision are less likely to be sufficiently flexible to gain benefits from this approach. It should be noted that a substantial portion of the benefit of the data-centric methodology derives from the ability to spatially enable, link and deploy existing data to a wide audience. Therefore, the data-centric approach will be much more difficult to implement if there are only limited preexisting spatial resources to support the application. In this situation, purchasing basic spatial information (such as a relatively inexpensive basic street network, often available for only a few hundred dollars) and using that to spatially enable the rest of your data is a realistic option. From these constraints, some obvious candidate applications for the data-centric geospatial application development approach may come to mind, including:
From the authors' successes at using the data-centric approach to geospatial application development, a few suggestions are offered on how to be successfid with this approach.
In this section, three case studies are highlighted. In each of the following systems, the authors have used a data-centric applications development methodology to successfully deploy geospatial data to hundreds or thousands of users in just a few months. PacifiCorp's Operations Visualization System PacifiCorp is one of the largest electric utilities in the United States, providing electricity to more than 2.3 million customers in a seven-state service area. Like many utilities, PacifiCorp has installed a sophisticated Outage Management System to capture trouble calls received by customer service agents and an automated voice response system. The system allowed operations managers to work with trouble call data to improve response time and allocation of resources. In 1997, company management realized that the outage management system could not easily be deployed to the dozens of locations where the information was needed. The system required client software to be installed on fairly powerful computers, a high-performance network, and significant training. Company management also realized that the outage system data was the real resource. Using a data centric approach, Pacificorp was able to deploy a complete spatially enabled Operations Visualization System in four months. Pacificorp initiated the effort by purchasing a commercial y available street network and applying it to in-house AM/FM/GI S data. Service area imported from another call-management effort and customer data from the customer information system rounded out the base data sources. From the outage management system the trouble calls themselves were displayed on the maps using a very lightweight polling mechanism. The imported data was deployed using Web technology and Autodesk MapGuide to draw maps. Web browsers minimized the installation overhead at client desktops. The new system does not provide sophisticated outage management functionality, but lets users company-wide visualize customers, facilities, service areas, and trouble calls throughout the sevenstate service area. Users can navigate to calls spatially, by call time, by service area, or by customer. They can then report on calls, customers, or a wide variety of other data. Users also create and download spreadsheets containing trouble call information, services, or any other data. The Operations Visualization System has since been enhanced to further simplify its use. Using the data-centric approach, the system is now being enhanced to import real-time facility status data from a new network control system. And the system has been enhanced with a few simple reports that provide summarized and historical data on outages. The Contra Costa County Department of Public Works The Contra Costa County Department of Public Works has had an ongoing project of GIS data development. The Department has also built a sophisticated and powerfil database of land ownership information, expanding greatly upon the information available from the County Assessor, and linking that expanded database to spatial information from a variety of map sources. Like most county governments, the Public Works Department has a long-term strategy for providing GIS-based applications to information workers throughout the enterprise. But that strategy will take considerable time and resources to implement. Meanwhile, the rich store of information about land and public works facilities was locked inside the department's GIS systems. In the fall of 1998, Department management decided to use Web technology and a data-centric methodology to deploy this data to the enterprise. Specific applications to manipulate the data were not critical: they wanted instead to allow users to navigate through the spatial information, and spatially enable the tabular data sources. The Public Works Department set up two server machines dedicated to publishing GIS and land ownership information to the Intranet. These machines were populated with data from a variety of data sources, including assessor's records, a variety of internal Public Works data sources, commercial street networks, and sources from the public domain such as census data. To date, the data is expected to be fully available to users throughout County government by early 1999. Southern Company The Gabrielle Energy Information System is an Intranet-based application that provides a complete system level overview of various Power Delivery organization data within Southern Company's subsidiary Alabama Power Company. The primary objective of the Gabrielle EIS is its ability to capture department "rules of thumb," and integrate these known rules into a system that provides an overall view of company operations. The Gabrielle EIS coalesces both spatial and non-spatial information to increase operational effectiveness. Gabrielle EIS was developed as an enterprise solution, capturing data from multiple departments and sharing this data with various decision-making users. Gabrielle EIS users can research information starting with plant coal delivery schedules, down to the residential homeowner meter. The significance the data integration is in the ability to access this information so those users can determine the effects of a decision beyond the typical department boundaries. The benefits of the Gabrielle EIS is the ability to streamline and improve business processes, tighten operational coordination, reduce system administrative cost, and effectively leverage existing investments in legacy applications and databases. Data integration and fimding evolved departmentally, making it an easier sell to upper management. Departmental allocation of $25K to $50K for a data solution developed and implemented in a few short months could be easily justified. As compared to a system wide solution costing upwards to $300K to $500K, developed over a year plus time frame. The Gabrielle EIS was developed using common Web based tools and links, with no specific ties to any vendor development product. It was determined that current systems provide only a fractured jigsaw puzzle view whose individual pieces conflicted with each other, this contradictory data or data overlap is now leveraged by Gabrielle to improve data constancy and to accurately reflect system wide operations. Conclusion The mantra of this new approach to AM/FM/GIS applications development is "collect, organize, link, and publish". Complex analysis and other time-consuming and limited-value software development activities are specifically avoided. Instead, the concentration is on "getting the data" out and the first 60 to 80°/0 of the value to the enterprise. Since the data described in this paper is primarily geospatial data, Web-based map display tools and simple tabular reporting tools, linked to the objects on the map, are utilized to publish and visually interact with the data. | ||
| © GISdevelopment.net. All rights reserved. |