Rongliang Luo
Department of Computer Science and Engineering, Zhejiang University City College, Hangzhou, 310015, China
Hongxi Zhang
Department of Computer Science and Engineering, Zhejiang University City College, Hangzhou, 310015, China
Minghui Wu
Department of Computer Science and Engineering, Zhejiang University City College, Hangzhou, 310015, China
ABSTRACT
We propose a word segment algorithm based on word group. In this model, word group is used for Ambiguity analysis. At first step, statistical information is used for build information base. In the process of dealing with sentence, a small step is triggered for counting information of adjoining situation and word frequency and calculates parameters of this model according to size of window. When get different word sequences, we use Analysis Tree to find the prime sequence. Because of short in decision distance, we get a low time complexity. Algorithm analyzing and result of experiment show that segmentation algorithm based on word group has higher efficiency and accuracy.
PDF References Citation
How to cite this article
Rongliang Luo, Hongxi Zhang and Minghui Wu, 2013. Ambiguity Analysis Model of Word Segmentation Based on Word Group. Journal of Applied Sciences, 13: 3153-3160.
DOI: 10.3923/jas.2013.3153.3160
URL: https://scialert.net/abstract/?doi=jas.2013.3153.3160
DOI: 10.3923/jas.2013.3153.3160
URL: https://scialert.net/abstract/?doi=jas.2013.3153.3160
REFERENCES
- Wang, H.S., J. Zhu, S. Tang and X. Fan, 2011. A new unsupervised approach to word segmentation. Comput. Linguist., 37: 421-454.
CrossRefDirect Link - Li, Z., 2011. Parsing the internal structure of words: A new paradigm for chinese word segmentation. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, June 19-24, 2011, Portland, ON., USA., pp: 1405-1414.
Direct Link - Zhang, C.P., L.L. Zhao and C.M. Wu, 2010. Method of Chinese word segmentation based on character-word classification. J. Comput. Appl., 30: 2034-2037.
Direct Link