|
|
| GeoCommunity Mailing List |
| |
| Mailing List Archives |
| Subject: | Re: [gislist] Address Parsing for Standardization and Geocoding |
| Date: |
03/24/2005 09:05:00 PM |
| From: |
Sonny Parafina |
|
|
Hey Bill,
You might want to look at Schuyler Erle's geocoder.us
http://geocoder.us/
Source is available and its written in perl. Its pretty nifty but it doesn't handle mis-spelling, you would probably need a soundex to handle that.
sonny
-----Original Message----- From: gislist-bounces@lists.thinkburst.com [mailto:gislist-bounces@lists.thinkburst.com]On Behalf Of Bill Thoen Sent: Thursday, March 24, 2005 9:23 PM To: gislist@lists.thinkburst.com Subject: [gislist] Address Parsing for Standardization and Geocoding
I'm looking for advice and algorithms for splitting US addresses into street number, prefix direction, street name, street type, suffix direction and unit. The problem I have is that the addresses I'm working with have all these logical fields combined into one physical field and the elements are not standardized. For example, the information in the street field may vary a lot. There may or may not be direction information or even street types. You can't be sure that the second word represents the prefix direction, and it's really hard to tell which word is the last one of the street name and whether the next word is the street type, suffix direction or unit. Also some street names are spelled differently, like "Woody Creek Rd" and "Woody Crk Rd."
Any suggestions on how to approach this problem? I'm currently working on this in an Access database, and I can handle SQL and VBA programming without too much difficulty. I'm just wondering how big a problem this is, and how to break it down into smaller problems.
I did find some general articles on street elements via Google, and I know where to get the USPS abbreviations for street types and directions, but I haven't found any technical details on how to parse and standardize addresses, so before I try to start from scratch, I thought I'd ask and see what ideas and pointers that others might have.
- Bill Thoen
_______________________________________________ gislist mailing list gislist@lists.geocomm.com http://lists.geocomm.com/mailman/listinfo/gislist
_________________________________ This list is brought to you by The GeoCommunity http://www.geocomm.com/
Get Access to the latest GIS & Geospatial Industry RFPs and bids http://www.geobids.com
_______________________________________________ gislist mailing list gislist@lists.geocomm.com http://lists.geocomm.com/mailman/listinfo/gislist
_________________________________ This list is brought to you by The GeoCommunity http://www.geocomm.com/
Get Access to the latest GIS & Geospatial Industry RFPs and bids http://www.geobids.com
|
|

Sponsored by:

For information regarding advertising rates Click Here!
|