Opened 12 years ago

Last modified 7 years ago

#1461 new defect

Tiger Geocoder doesn't anticipate irregular spacing in road name

Reported by: arencambre Owned by: robe
Priority: medium Milestone: PostGIS Fund Me
Component: pagc_address_parser Version: master
Keywords: Cc: aren@…, woodbri

Description

The tx_edges table uses I- 635 instead of the more consistent I-635.

This looks weird but works:

SELECT ST_AsEWKT(geomout) FROM geocode_intersection('N. Belt Line', 'I- 635', 'TX', 'Coppell') ORDER BY rating ASC LIMIT 1;

This looks correct but doesn't work:

SELECT ST_AsEWKT(geomout) FROM geocode_intersection('N. Belt Line', 'I-635', 'TX', 'Coppell') ORDER BY rating ASC LIMIT 1;

Seems like the geocoder needs to work around these kind of errors in the Tiger data.

Change History (10)

comment:1 by robe, 12 years ago

Milestone: PostGIS 2.0.0PostGIS 2.1.0

yah — may not be that trivial as spacing is used to designate separation of elements so putting logic like this in is libel to break something else without some extensive testing. I'll push to 2.1.0 but may get to it before then.

comment:2 by robe, 12 years ago

Milestone: PostGIS 2.1.0PostGIS 2.0.1

comment:3 by robe, 12 years ago

Milestone: PostGIS 2.0.1PostGIS 2.1.0

comment:4 by woodbri, 11 years ago

Cc: woodbri added

PAGC tools handle this correctly.

comment:5 by robe, 11 years ago

Component: tiger geocoderpagc_address_parser

comment:6 by woodbri, 11 years ago

I should correct my last comment, PAGC still parses based on token so names like "SUN VALLEY" will parse as two tokens and "SUNVALLEY" will parse as one token. In my geocoder, I handle this under the fuzzy search by joining all the name tokens, then picking the best match to the input via scoring the results.

comment:7 by woodbri, 11 years ago

Also, I have found at least one very pathological case where the name is "MAINSTREET" and if it is entered as "MAIN STREET" then it is impossible to match, because "STREET" is classified as SUFFIX_TYPE token and "MAIN" is too short to match the fuzy key of "MAINSTREET".

There are ways to find this, but they tend to make everything else slower and return too many unwanted results. — Good Times!

comment:8 by robe, 11 years ago

Milestone: PostGIS 2.1.0PostGIS 2.2.0

comment:9 by robe, 9 years ago

Milestone: PostGIS 2.2.0PostGIS Future

comment:10 by robe, 7 years ago

Milestone: PostGIS FuturePostGIS Fund Me

Milestone renamed

Note: See TracTickets for help on using tickets.