Wrong street parsing #400

@Katoshy

The parser mis-segments multi-word street and place names, particularly when the address contains long names or a place name of the form "[City Name] [Location Type]" (e.g., "Elk Grove Village"). In the example below, the street name absorbs "Elk Grove" across the comma and "Village" is labeled as a street post type. The error then cascades: the state ("Illinois") is labeled as the city (PlaceName), and no StateName is produced at all.

Input

1050 Elk Grove Town Center, Elk Grove Village, Illinois, 60007

Actual Output

{
'AddressNumber': '1050', 
'StreetName': 'Elk Grove Town Center, Elk Grove', 
'StreetNamePostType': 'Village', 
'PlaceName': 'Illinois', 
'ZipCode': '60007'
}

Expected Output

{
'AddressNumber': '1050', 
'StreetName': 'Elk Grove Town Center', 
'PlaceName': 'Elk Grove Village', 
'StateName': 'Illinois', 
'ZipCode': '60007'
}
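
To make the cascade easier to see, here is a minimal sketch that compares the two outputs above label by label. It does not call the parser; the two dictionaries are copied from this report, and `diff_labels` is a hypothetical helper written only for illustration.

```python
# Outputs copied verbatim from this report (not produced by running the parser here).
actual = {
    "AddressNumber": "1050",
    "StreetName": "Elk Grove Town Center, Elk Grove",
    "StreetNamePostType": "Village",
    "PlaceName": "Illinois",
    "ZipCode": "60007",
}
expected = {
    "AddressNumber": "1050",
    "StreetName": "Elk Grove Town Center",
    "PlaceName": "Elk Grove Village",
    "StateName": "Illinois",
    "ZipCode": "60007",
}


def diff_labels(actual, expected):
    """Hypothetical helper: map each differing label to (actual, expected).

    A label missing from one side is reported with None on that side.
    """
    labels = sorted(set(actual) | set(expected))
    return {
        label: (actual.get(label), expected.get(label))
        for label in labels
        if actual.get(label) != expected.get(label)
    }


mismatches = diff_labels(actual, expected)
for label, (got, want) in mismatches.items():
    print(f"{label}: got {got!r}, expected {want!r}")
```

Four of the six labels differ (StreetName, StreetNamePostType, PlaceName, StateName), while AddressNumber and ZipCode match, which suggests the problem starts at the first comma and propagates through every component after it.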
