A String-Similarity-Based Method for Automatic Conversion of Geographic Information Classification Schemes
Approach to Automatic Conversion of Geographic Information Classification Schemes
- 2008, Issue 3, pp. 433-441
Print publication date: 2008
DOI: 10.11834/jrs.20080359
[1] ZHANG Xue-ying (张雪英), LU Guo-nian (闾国年). Approach to Automatic Conversion of Geographic Information Classification Schemes[J]. Journal of Remote Sensing (遥感学报), 2008, (3): 433-441.
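Converting between geographic information classification schemes is essential for semantic information sharing and interoperability among heterogeneous geographic information systems. Manual conversion performs well but places high demands on time, funding, and domain experts. This paper proposes an automatic conversion method for geographic information classification schemes based on string similarity, comprising a method for computing the semantic similarity of classes, a class conversion model, and a classification scheme conversion algorithm. Experiments show that the method can build class conversion relations between different geographic information classification schemes fairly effectively and thereby convert between them automatically.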
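
The idea of scoring two classes by the similarity of their names can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which string similarity measure the authors selected, so a character-bigram Dice coefficient is swapped in here, and the names `dice_similarity`, `best_matches`, and the `threshold` value are hypothetical.

```python
from collections import Counter


def bigrams(name):
    """Return the overlapping character bigrams of a class name."""
    return [name[i:i + 2] for i in range(len(name) - 1)]


def dice_similarity(a, b):
    """Character-bigram Dice coefficient in [0, 1].

    Assumption: a stand-in measure, not the one selected in the paper.
    """
    if a == b:
        return 1.0
    ba, bb = Counter(bigrams(a)), Counter(bigrams(b))
    total = sum(ba.values()) + sum(bb.values())
    if total == 0:
        # Two different single-character names share no bigrams.
        return 0.0
    overlap = sum((ba & bb).values())  # multiset intersection
    return 2 * overlap / total


def best_matches(source_names, target_names, threshold=0.5):
    """Map each source class name to its most similar target class name.

    Assumption: the threshold of 0.5 is illustrative only.
    """
    matches = {}
    for s in source_names:
        best = max(target_names, key=lambda t: dice_similarity(s, t), default=None)
        if best is not None and dice_similarity(s, best) >= threshold:
            matches[s] = best
    return matches


source = ["高速公路", "河流"]
target = ["公路", "河流", "湖泊"]
print(best_matches(source, target))  # {'高速公路': '公路', '河流': '河流'}
```

Working on characters rather than whitespace-separated words suits Chinese class names, which are not segmented; a real system would likely need the preprocessing and conversion patterns the paper describes.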
Since the 1980s, classification standards for specialized Chinese geographical information have increasingly been constructed, e.g., Classification and Codes for the National Land Information (GB/T 13923-1992, GB/T 13923-2006), Classification and Codes for the Features 1:500 1:1000 1:2000 Topographic Maps (GB/T 14804-1993), and Classification and Codes for the Features 1:5000 1:10000 1:25000 1:50000 1:100000 Topographic Maps (GB/T 15660-1995). However, there is serious semantic heterogeneity between different geographical information classification schemes. The commonly used manual (intellectual) approaches normally achieve good performance, but problems arise when time, money, or domain experts are insufficient. To tackle these problems, effective and efficient automatic approaches are needed.
Semantic conversion between classification schemes is directional, i.e., converting from classification scheme A (source scheme) to classification scheme B (target scheme) is not the same as converting from classification scheme B (source scheme) to classification scheme A (target scheme). Theoretically, if every entry in the source scheme can locate a closely related entry in the target scheme, then the source scheme is fully compatible with the target scheme, and vice versa. After investigating string similarity measures for English and Chinese, we select a measure that takes into account the characteristics of the Chinese language and of the class names in geographical information classification schemes. On the assumption that the semantic similarity between two entries from different schemes can be measured by the string similarity of their class names, a model consisting of four conversion patterns is defined. These patterns are prioritized according to how well they represent the semantic content of the corresponding entries. An algorithm is then developed to carry out automatic semantic conversion of geographical information classification schemes. In this algorithm, data preprocessing is a key step: natural language processing techniques such as syntax processing, punctuation processing, and entry splitting are applied so that every entry represents a complete and distinct concept. The remaining steps implement the conversion model and then filter for the optimal semantic relations.
Judging the semantic relevance between entries of classification schemes is subjective in any real-world task. Nevertheless, there is no doubt that manual approaches can achieve the best performance. Four measures are defined to estimate the performance of manual and automatic approaches. Relative to the total number of entries in the source scheme, (1) the recall ratio is the number of unique source entries in the semantic relations created by an automatic or manual conversion approach, and (2) the precision ratio is the number of unique target entries in the semantic relations created by an automatic or manual conversion approach; (3) the accuracy ratio is the proportion of correct semantic relations among all semantic relations created by the automatic approach; (4) the matching ratio is the proportion of the semantic relations created by the manual approach that are also generated by the automatic approach. Finally, we carried out experiments on four standard Chinese geographical information classification schemes. The experimental results show that the proposed approach achieves satisfactory performance.
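
The four evaluation measures can be sketched as below, with each semantic relation represented as a `(source_entry, target_entry)` pair. This is a minimal sketch rather than the authors' evaluation code: the function name `evaluate`, the argument names, and the toy data are hypothetical, and taking both the recall and precision ratios relative to the number of source entries is one literal reading of the abstract's wording.

```python
def ratio(numerator, denominator):
    """Safe division that returns 0.0 for an empty denominator."""
    return numerator / denominator if denominator else 0.0


def evaluate(source_entries, auto_relations, manual_relations, judged_correct):
    """Compute the four ratios for the automatic approach.

    source_entries   : set of entries in the source scheme
    auto_relations   : set of (source, target) pairs from the automatic approach
    manual_relations : set of (source, target) pairs from the manual approach
    judged_correct   : subset of auto_relations judged correct (assumed input)
    """
    unique_sources = {s for s, _ in auto_relations}
    unique_targets = {t for _, t in auto_relations}
    n_source = len(source_entries)
    return {
        # Assumption: both ratios use the source scheme size as denominator.
        "recall": ratio(len(unique_sources), n_source),
        "precision": ratio(len(unique_targets), n_source),
        "accuracy": ratio(len(auto_relations & judged_correct), len(auto_relations)),
        "matching": ratio(len(auto_relations & manual_relations), len(manual_relations)),
    }


# Toy data, invented purely for illustration.
source_entries = {"river", "lake", "road", "bridge"}
auto = {("river", "river"), ("lake", "lake"), ("road", "highway")}
manual = {("river", "river"), ("lake", "lake"), ("road", "road"), ("bridge", "bridge")}
correct = {("river", "river"), ("lake", "lake")}

scores = evaluate(source_entries, auto, manual, correct)
print(scores["recall"], scores["precision"], scores["matching"])  # 0.75 0.75 0.5
```

Passing the manual relations as `auto_relations` gives the recall and precision ratios for the manual approach under the same reading.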
Keywords: geographical information classification scheme; semantic conversion; string similarity; conversion model