#5299 closed defect (fixed)

The address_standardizer_data_us missing lexers

Reported by: robe Owned by: robe
Priority: medium Milestone: PostGIS 3.3.3
Component: pagc_address_parser Version: 3.3.x
Keywords: address_standardizer_data_us Cc:

Description

The address_standardizer_data_us is missing some lexer entries that prevents it from parsing correctly addresses such as:

SELECT * FROM
standardize_address('us_lex',  'us_gaz', 'us_rules',
'1 Timepiece POINT',
'Boston, MA, 02220');

it should be treating POINT as a suffix, but instead it returns

 building | house_num | predir | qual | pretype |      name       | suftype | sufdir | ruralroute | extra |  city   |     state     | country | postcode | box | unit
----------+-----------+--------+------+---------+-----------------+---------+--------+------------+-------+---------+---------------+---------+----------+-----+------
          | 1         |        |      |         | TIMEPIECE POINT |         |        |            |       | BOSTON | MASSACHUSETTS |         | 02220    |     |
(1 row)

instead of:

building | house_num | predir | qual | pretype |   name    | suftype | sufdir | ruralroute | extra |  city  |     state     | country | postcode | box | unit
----------+-----------+--------+------+---------+-----------+---------+--------+------------+-------+--------+---------------+---------+----------+-----+------
          | 1         |        |      |         | TIMEPIECE | POINT   |        |            |       | BOSTON | MASSACHUSETTS |         | 02220    |     |
(1 row)

Similarly has issue with PIECE as a street name

SELECT * FROM 
standardize_address('us_lex',  'us_gaz', 'us_rules',
'50 Gold Piece Drive',
'Boston, MA, 02020') as std1;

resolves to nothing:

 building | house_num | predir | qual | pretype | name | suftype | sufdir | ruralroute | extra | city | state | country | postcode | box | unit
----------+-----------+--------+------+---------+------+---------+--------+------------+-------+------+-------+---------+----------+-----+------
          |           |        |      |         |      |         |        |            |       |      |       |         |          |     |
(1 row)

When it should give:

building | house_num | predir | qual | pretype |    name    | suftype | sufdir | ruralroute | extra |  city  |     state     | country | postcode | box | unit
----------+-----------+--------+------+---------+------------+---------+--------+------------+-------+--------+---------------+---------+----------+-----+------
          | 50        |        |      |         | GOLD PIECE | DRIVE   |        |            |       | BOSTON | MASSACHUSETTS |         | 02020    |     |
(1 row)

I'm still getting a list of these issues and will add them to this ticket.

Change History (5)

comment:1 by Regina Obe <lr@…>, 17 months ago

In 147700b/git:

  1. Change to VALUES so easier to skip over entries already present
  2. Add entries for PIECE and POINT

References #5299 for PostGIS 3.4

comment:2 by Regina Obe <lr@…>, 17 months ago

In 722d887/git:

address_standardizer_data_us corrections:

  1. Change to VALUES so easier to skip over entries already present
  2. Add entries for PIECE and POINT

References #5299 for PostGIS 3.3.3

comment:3 by Regina Obe <lr@…>, 17 months ago

In e6fad28/git:

Update address_standardizer_data_us tests. References #5299 for PostGIS 3.4

comment:4 by pramsey, 15 months ago

Close?

comment:5 by robe, 15 months ago

Resolution: fixed
Status: newclosed
Note: See TracTickets for help on using tickets.