智能与分布计算实验室

A Semantic Matching of Information Segments for Tolerating Error Chinese Words

出版社:
  • 会议名称:Web Information Systems Engineering(WISE 2006)
  • 举办地点:Wuhan,China
  • 举办日期:October 23-26 ,2006
  • 页数:48-59
摘要内容:

There exist new words and error words in Chinese information of web pages. In this paper, we introduce our definition of semantic similarity between sememes and their theorems. On the base of proving the theorems, the influence of the parameter is analyzed. Moreover, this paper presents a novel definition of the word similarity based on the sememe similarity, which can be used to match the new Chinese words with the existing Chinese words and match the error Chinese words with correct Chinese words. And also, based on the novel word similarity, a matching method of information segments is presented to recognize the category of Chinese web information segments, in which new words and error words occur. In addition, the experiment of the matching methods is presented. Therefore, the novel matching method is an efficient method both in theory and from experimental results.

关键词: