Address matching algorithm based on chinese natural language understanding[J]. Journal of Remote Sensing, 2013,17(4):788-801. DOI: 10.11834/jrs.20132164.
Address matching algorithm that has broad application prospects is the core and key technology for location-based services. This paper analyzes the existing three major address matching algorithms which are the level based matching algorithm
the full-text search algorithm and the regular expression algorithm. An address matching algorithm based on Chinese natural language understanding is proposed in this paper. The complete process of this new algorithm includes five parts as pretreatment
address parsing
address elements standardization
reasoning about address matching and matching registration. This paper focuses on address parsing and reasoning matching the two most important parts. The paper establishes a complete Chinese address matching algorithm based on natural language understanding. In the principle of Chinese segmentation and semantic reasoning in natural language understanding
the new algorithm achieves the goal to combine natural language understanding with address matching by processing Chinese address of unstructured format. To check the new algorithm
an address matching experimental system was developed. The matching experiment using 1000 resident addresses of Puyang city
Henan province shows that the matching rate can be 95% or more and the accuracy rate is above 93% .
关键词
自然语言理解地址匹配地址要素地址解析隐马尔科夫模型
Keywords
natural language understandingaddress matchingaddress elementaddress parsingHidden Markov Model