Non-utility customer information—What’s the value?
Dean A. Zastava UGC Consulting 6200 S. Syracuse Way, Suite 222 Englewood, Colo. 80111 (303) 773-6166, (303) 773-6618 (fax) Abstract Business Geographies applications, data providers, and consultants have created a high level of interest in non-utility customer information due to the increasingly competitive utility marketplace. A utility company today can purchase literally gigabytes of information about non-customers. These data range from digital images that show the location of all buildings to economic, cultural, and education profiles of residents within individual buildings. These data are certainly interesting; however, they are not available without cost, and they may not bring added value to the utility if the data are not analyzed properly, or if the data are purchased for an area that does not need to be analyzed. Frequently-asked questions address the potential value of non-customer information to a utility company, and what requirements need to be considered before loading this data into an AM/FM/GIS. Key factors inresolving theseissues willinclude thefollowing:
Several AM/FM/GIS application opportunities exist to further a utility’s goals of adopting a proactive approach to compiling and using non-customer information:
Resolving The Address Matching Problem A top priority for virtually any competitive-minded utility today is to know the location of its customers and potential customers in relation to the location of its facilities. This is not a trivial issue. Utility companies’ customer information systems have historically focused on getting bills delivered and knowing meter locations, and have not been concerned with addresses that were not part of a billing or a service location. Decades of doing business using paper-based maps and records, and relying on cost-plus rate making rather than market forces have left many utilities with a deficient understanding of their market areas and their customers. To utilize a customer’s or non customer’s location in an AM/FM/GIS, geographic coordinates need to be assigned to each customer and non-customer address. This is accomplished using “geocoding”, the process of matching a data file of address information against a geographically located dataset such as street centerlines (recommended minimum) or Zip+4 centroids (50,000-foot level) sothata latitudellonghude orothercoordinate valuemay be assigned toeachrecord.Thisthenallowstheaddressinformation tobegraphically displayed withinthe AM/FM/GIS. A recent pilot project for a gas utility client demonstrated that commercially available geocoded addresses from two different vendors was nearly as good as addresses that were digitized by a data conversion contractor from digital aerial photos. The commercial data was deemed adequate from an accuracy perspective, and the cost of all addresses from the commercial data vendor was less than having the data conversion contractor digitize the services. There is also a related problem in obtaining addresses from certain municipalities in the service area without performing a field survey. Progressive data conversion contractors are evaluating commercially available geocoded addresses as a more efficient and lower cost way of providing their clients with address information. A number of components are required to evaluate the usefulness of such a geocoding strategy. These include a data file of existing utility customer addresses, street address file linked to streets centerlines or Zip+4 centroids, a geographic layer showing utility infrastructure and an address matching and geocoding software program. Customer Address File The first piece needed in conducting a geocoding evaluation is a data file with current utility customer address information. These data are acquired by performing an extract of the customer information system to create an ASCII text file containing a number of fields, including:
Street Address File The street address file uses address ranges on street segments as a reference to interpolate the location of a given address. For example, if a street segment is known to have an address range from 500 to 598 on one side of the street, house number 550 would be located (interpolated) at approximately the midpoint. Street address file information is available from a number of commercial and government sources. Utilitv Infrastructure Locations To check the commercially-derived address locations for accuracy, a layer of utility infrastructure locations derived from digital orthophotography and field checks may be used to compare how far apart similarly addressed houses were located between sources. To derive the infrastructure locations, the GIS system may be used to create a view of street centerlines, address numbers, street names, and infrastructure locations. This view then may be exported into a DXF file and imported as a geographic map layer. Address Matching and Geocoding Software The ASCII extract of customer address from the utility customer information file is compared to the Zip+4 compliant commercial address data to identify and resolve all mismatches. Most utility companies have performed some level of address matching and cleanup process; however, these processes sometimes rely upon postal workers returning improperly addressed items. This is not a reliable process since the postman is more interested in delivering the mail then in performing an audit of address numbering and street spelling. If the postman recognizes the name of the addressee, the mail gets delivered. If your company has not done a Zip+4 audit of customer address, this is the first step that needs to be done, and you should consider contracting this work out to a company who is expert at it. The commercial data address data is built from telephone books and other sources and is audited in several ways including compliance to the postal Zip+4 database. In addition, the commercial address data is updated every six months to keep the resident’s name and profile data current. Geocoding (digitizing of address locations), like the Zip+4 audit is best done by company who is expert at it. Geocoding to be successful, requires tools, processes, and previous experience in gee-coding. Cost when contracted out is on the order of $40 or less per one thousand addresses depending upon the volume. If you are not yet convinced to contract for address matching and geocoding work, read on about some of the issues that must be dealt with. Locational Accuracy of Geocoded Data The locational accuracy of geocoded data is dependent upon two factors:
1. Non-matching records due to:
Premise identifications representing customer locations often proves to be both difficult and time-consuming during an AM/FM/GIS implementation. An alternative methodology is known as geocoding, or locating customers using a commercially available landbase and address file, which matches and plots addresses to road segments. If some 10SS in positional accuracy is tolerable in exchange for cost savings, ease of implementation, and additional attribute information, then this alternative should be given strong consideration. Another potential advantage to using a commercial landbase and geocoding service is that non-customer information can also be easily obtained and mapped into the Iandbase. Including non-customer data can greatly facilitate the use of a number of AM/FM/GIS software applications, especially Sales/Marketing and Emergency Response Outage Analysis. Adding Non-Customer Information A variety of issues pertaining to non-customer data should be explored to determine:
Residential Name Telephone number Home owner or renter Occupation code (of head(s) of household) Age of head(s) of household Marital status of head(s) of household Length of residence Income code of head(s) of household Dwelling type (single or multi-family) Business Business name Key contact names and titles Primary and secondary SIC codes Telephone Number of employees Year business started Annual sales Non-customer Aggregate Level Demographic Information In addition to site-specific information, commercial sources of non-customer data also supply consumer demographics information representative of typical characteristics for an area or neighborhood. While commercial suppliers attach this information to their records, the most common source of this type of information is through federal government sources. This includes data compiled by the U.S. Bureau of the Census. A census of the U.S. population is formally canvassed every 10 years, and population estimates are projected annually. This information is collected for every family and business in the United States; however, it is geographically aggregated at the national, region, division, state, county, minor civil division, places, census tract, and block group levels. All tabulated information is available at these geographic levels. Additionally, population counts are available at the block level, which is roughly equivalent to a city block. An additional component of viewing census demographics geographically is addressed by the Census Bureau’s TIGER (Topologically Integrated Geographic Encoding and Referencing) boundary files. Boundary coordinates of states, counties, places, census tracts, block groups, and blocks may be extracted from these digital files and processed for use within an AM/FM/GIS. This allows demographic information to be linked to specific geographic areas. Population characteristics may then be queried geographically to determine a population profile at a specific location, or thematic maps may be prepared. Again, this information is available for the cost of reproduction, but can be found as well on the Internet at no cost. Additional demographic information is available commercially through a variety of providers. These include data on consumer demand, service industry statistics, grocery store and retail statistics, and household statistics. Data are typically packaged by industry or geographic segments and is variably priced. This type of information paired with publicly available data and the customer information system can greatly enhance gee-marketing efforts at the micro level. This recommended approach should also be coordinated with a clean up of the addresses in the customer information system. Looking Into The Crystal Ball Demographic and Boundary File Resources are already available on the Internet. This information will continue to become more available, more accurate, and more affordable. I predict that in the not too-distant future you will be able to purchase digital orthophotos along with intelligent street center lines and up to date addressing from a one-stop “information vendor”. The individual data slices will come from different sources and will be packaged and marketed by the information vendor on the Internet. Listed below are some of the Internet locations where pieces of this data are available today:
| ||
|
|