GAN Rong. Resolution Algorithm of Cross Ambiguity in Chinese Word Segmentation[J]. Journal of Xihua University(Natural Science Edition), 2018, 37(6): 32-36. DOI: 10.3969/j.issn.1673-159X.2018.06.006

Resolution Algorithm of Cross Ambiguity in Chinese Word Segmentation

  • Chinese word segmentation is the foundation of natural language processing, and cross ambiguity is one of the bottlenecks limiting its accuracy. This paper proposes a method that combines the maximum matching algorithm with the passive-aggressive (PA) algorithm to resolve cross ambiguity. First, a segmentation model is trained with PA. Second, the positions of cross ambiguity are located by comparing the outputs of the forward maximum matching algorithm and the backward (reverse) maximum matching algorithm. Third, each ambiguous position, together with its context, is submitted to the segmentation model and decoded. Finally, the decoded results are merged to give the final segmentation. Experimental results on the Renmin Daily 2014 corpus show that the precision, recall and F-score for cross ambiguity are 98.32%, 98.14% and 98.23%, respectively.
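
The detection step described in the abstract can be illustrated with a minimal sketch. It assumes a toy dictionary and hypothetical function names (`forward_max_match`, `backward_max_match`, `disagreement_regions`), none of which come from the paper. It only finds the regions where forward and backward maximum matching disagree. The PA-based re-decoding of those regions is not shown.

```python
def forward_max_match(text, vocab, max_len):
    """Greedy left-to-right segmentation; unknown characters become single-character words."""
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                words.append(piece)
                i += size
                break
    return words


def backward_max_match(text, vocab, max_len):
    """Greedy right-to-left segmentation, returned in reading order."""
    words, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                words.append(piece)
                j -= size
                break
    words.reverse()
    return words


def boundaries(words):
    """End offsets of every word in a segmentation."""
    ends, pos = set(), 0
    for w in words:
        pos += len(w)
        ends.add(pos)
    return ends


def disagreement_regions(text, vocab, max_len=4):
    """Return (start, end) spans where the two segmentations place different word boundaries."""
    fwd = boundaries(forward_max_match(text, vocab, max_len))
    bwd = boundaries(backward_max_match(text, vocab, max_len))
    regions, start = [], 0
    # Walk over boundaries both segmentations share; any span between two
    # shared boundaries that contains an unshared boundary is a candidate.
    for cut in sorted(fwd & bwd):
        inner_f = {p for p in fwd if start < p < cut}
        inner_b = {p for p in bwd if start < p < cut}
        if inner_f != inner_b:
            regions.append((start, cut))
        start = cut
    return regions


# Toy dictionary (an assumption for illustration only).
vocab = {"研究", "研究生", "生命", "起源"}
text = "研究生命起源"
print(forward_max_match(text, vocab, 3))
print(backward_max_match(text, vocab, 3))
print([text[a:b] for a, b in disagreement_regions(text, vocab, 3)])
```

In a pipeline like the one the abstract describes, each reported span plus some surrounding context would then be handed to the trained segmentation model for re-decoding. How much context to include is a design choice this sketch leaves open.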