A Lao word segmentation method based on bidirectional long short-term memory neural network model

Computer Engineering & Science

HE Li, ZHOU Lanjiang, ZHOU Feng, GUO Jianyi

(Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China)

Received: 2018-07-18; Revised: 2018-11-08; Online: 2019-07-25; Published: 2019-07-25

Abstract

Lao is written as continuous text, so it must first be segmented into words, the smallest independent and meaningful units of the language. We propose a Lao word segmentation method based on a bidirectional long short-term memory (BLSTM) neural network model. The model is trained on a Lao corpus containing 913,487 manually tagged words. Word segmentation is cast as a syllable-based sequence tagging task in which each Lao syllable receives one of four tags: begin-word (B), middle-word (M), end-word (E), or single-word (S). First, Lao sentences are split into syllables and the syllables are trained into vectors. Second, these vectors are fed into the BLSTM neural network model, which predicts a label for each syllable. Third, a sequence inference algorithm determines the final label of each syllable. We carry out experiments on the manually labeled word segmentation corpus. The experimental results show that the proposed method reaches an accuracy of 87.48%, which is clearly higher than that of existing word segmentation methods.
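The B/M/E/S tagging scheme and the final inference step can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the sequence inference algorithm, so the Viterbi-style `constrained_decode` below is an assumption, as are the transition table `ALLOWED`, the additive per-syllable scores (standing in for BLSTM outputs), and all function names. Syllables are shown as hypothetical placeholders such as `"s1"`.

```python
from typing import Dict, List, Sequence

TAGS = ("B", "M", "E", "S")

# Assumption: the tag transitions that are legal in a B/M/E/S scheme.
ALLOWED = {
    "B": {"M", "E"},  # a word that has begun must continue or end
    "M": {"M", "E"},
    "E": {"B", "S"},  # after a word ends, a new word starts
    "S": {"B", "S"},
}


def words_to_tags(words: Sequence[Sequence[str]]) -> List[str]:
    """Convert words (each a list of syllables) into one tag per syllable."""
    tags: List[str] = []
    for syllables in words:
        n = len(syllables)
        if n == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (n - 2) + ["E"])
    return tags


def tags_to_words(syllables: Sequence[str], tags: Sequence[str]) -> List[List[str]]:
    """Group syllables back into words, closing a word at every E or S tag."""
    words: List[List[str]] = []
    current: List[str] = []
    for syllable, tag in zip(syllables, tags):
        current.append(syllable)
        if tag in ("E", "S"):
            words.append(current)
            current = []
    if current:  # tolerate a sequence that ends mid-word
        words.append(current)
    return words


def constrained_decode(scores: Sequence[Dict[str, float]]) -> List[str]:
    """Viterbi-style search for the highest-scoring legal tag sequence.

    `scores` holds one dict per syllable mapping each tag to an additive
    score (for example a log-probability from a tagger).
    """
    if not scores:
        return []
    neg_inf = float("-inf")
    # A sentence can only start with B or S.
    best = {t: (scores[0][t] if t in ("B", "S") else neg_inf) for t in TAGS}
    back: List[Dict[str, str]] = []
    for step in scores[1:]:
        new_best: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for t in TAGS:
            # Best previous tag among those allowed to precede t.
            prev = max((p for p in TAGS if t in ALLOWED[p]), key=lambda p: best[p])
            new_best[t] = best[prev] + step[t]
            pointers[t] = prev
        best = new_best
        back.append(pointers)
    # A sentence can only end with E or S.
    last = max(("E", "S"), key=lambda t: best[t])
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path
```

In this sketch, a three-syllable word followed by a one-syllable word is tagged B, M, E, S, and `tags_to_words` inverts that mapping. The decoding step matters because choosing the best tag for each syllable independently can yield an illegal sequence such as B followed by B, which `constrained_decode` rules out.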

Keywords: neural network, syllable, bidirectional long short-term memory, Lao word segmentation

HE Li, ZHOU Lanjiang, ZHOU Feng, GUO Jianyi. A Lao word segmentation method based on bidirectional long short-term memory neural network model [J]. Computer Engineering & Science.
