A Multi-Semantic Feature Fusion Method for Complex Address Matching of Chinese Addresses

Accurate address matching is crucial for the analysis, integration, and intelligent management of urban geospatial data and is also a key step in achieving geocoding. However, due to the complexity, diversity, and irregularity of address expression, address matching becomes a challenging task. This...

Full description

Saved in:
Bibliographic Details
Main Authors: Pengpeng Li, Qing Zhu, Jiping Liu, Tao Liu, Ping Du, Shuangtong Liu, Yuting Zhang
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:ISPRS International Journal of Geo-Information
Subjects:
Online Access:https://www.mdpi.com/2220-9964/14/6/227
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurate address matching is crucial for the analysis, integration, and intelligent management of urban geospatial data and is also a key step in achieving geocoding. However, due to the complexity, diversity, and irregularity of address expression, address matching becomes a challenging task. This paper proposes a multi-semantic feature fusion method for complex address matching of Chinese addresses that formulates address matching as a classification task that directly predicts whether two addresses refer to the same location, without relying on predefined similarity thresholds. First, the address is resolved into address elements, and the Word2vec model is trained to generate word vector representations using these address elements. Then, multi-semantic features of the addresses are extracted using a Text Recurrent Convolutional Neural Network (Text-RCNN) and a Graph Attention Network (GAT). Finally, the Enhanced Sequential Inference Model (ESIM) is used to perform both local inference and inference composition on the multi-semantic features of the addresses to achieve accurate matching of addresses. Experiments were conducted using Points of Interest (POI) address data from Baidu Maps, Tencent Maps, and Amap within the Chengdu area. The results demonstrate that the proposed method outperforms existing address matching methods, with precision, recall, and F1 values all exceeding 95%. In addition, transfer experiments using datasets from five other cities including Beijing, Shanghai, Xi’an, Guangzhou, and Wuhan show that the model maintains strong generalization ability, achieving F1 values above 84% in cities such as Xi’an and Wuhan.
ISSN:2220-9964