An unsupervised phoneme segmentation method for Lao language with multi-feature interaction fusion

Computer Engineering & Science ›› 2024, Vol. 46 ›› Issue (05): 937-944.

• Artificial Intelligence and Data Mining • Previous Articles Next Articles

An unsupervised phoneme segmentation method for Lao language with multi-feature interaction fusion

LI Xin-jie1,2,WANG Wen-jun1,2,DONG Ling1,2,LAI Hua1,2,YU Zheng-tao1,2,GAO Sheng-xiang1,2

(1.Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500;
2.Yunnan Key Laboratory of Artificial Intelligence,Kunming University of Science and Technology,Kunming 650500,China)

Received:2023-09-04 Revised:2023-10-20 Accepted:2024-05-25 Online:2024-05-25 Published:2024-05-30

Abstract

Abstract: Aiming at the inaccurate phoneme segmentation problem caused by the lack of consideration of Lao language tone changes and audio diversity in existing methods, this paper proposes an unsupervised phoneme segmentation method for Lao language with multi-feature interaction fusion. Firstly, self-supervised features, spectral features and pitch features are independently coded to avoid the insufficiency of a single feature. Secondly, multiple independent features are gradually fused based on the attention mechanism, so that the model can more comprehensively capture the information of Lao language tone changes and phoneme boundaries. Finally, a learnable framework is adopted to optimize the phoneme segmentation model. The experimental results show that the proposed method improves the R-value by 27.88% on the Lao phoneme segmentation task compared with the baseline methods.

Key words: unsupervised learning, feature fusion, Lao language, phoneme segmentation, speech representation

LI Xin-jie, WANG Wen-jun, DONG Ling, LAI Hua, YU Zheng-tao, GAO Sheng-xiang, . An unsupervised phoneme segmentation method for Lao language with multi-feature interaction fusion[J]. Computer Engineering & Science, 2024, 46(05): 937-944.

[1]	MA Jin-lin, YAN Qi, MA Zi-ping. A multi-layer mask recognition method for Tangut characters [J]. Computer Engineering & Science, 2024, 46(12): 2227-2238.
[2]	FU Yan, YANG Xu, YE Ou. A smoke recognition method based on CNN and Transformer feature fusion [J]. Computer Engineering & Science, 2024, 46(11): 2045-2052.
[3]	LIU Xiao-hua, XU Ru-zhi, YANG Cheng-yue. A Chinese named entity recognition model based on multi-feature fusion embedding#br# [J]. Computer Engineering & Science, 2024, 46(08): 1473-1481.
[4]	Xie-zhong, CHEN Xu, JING Yong-jun, WANG Shu-yang. Semi-supervised website topic classification based on hetero-geneous graph neural networkWANG [J]. Computer Engineering & Science, 2024, 46(04): 635-646.
[5]	YU Tian-ci, GAO Shang. A code summarization generation model fusing multi-structure data [J]. Computer Engineering & Science, 2024, 46(04): 667-675.
[6]	YANG Xiao-qiang, HUANG Jia-cheng. A multi-branch fine-grained recognition method based on dynamic localization and feature fusion [J]. Computer Engineering & Science, 2024, 46(02): 253-263.
[7]	JIANG Zhi-peng, WANG Zi-quan, ZHANG Yong-sheng, YU Ying, CHENG Bin-bin, ZHAO Long-hai, ZHANG Meng-wei. A vehicle object detection algorithm in UAV video stream based on improved Deformable DETR [J]. Computer Engineering & Science, 2024, 46(01): 91-101.
[8]	LI Zhuo-xuan, ZHOU Ya-tong. iSFF-DBNet:An improved text detection algorithm in e-commerce images [J]. Computer Engineering & Science, 2023, 45(11): 2008-2017.
[9]	DONG Zi-ping, CHEN Shi-guo, LIAO Guo-qing. A dense multi-face detection algorithm based on YOLOv5s [J]. Computer Engineering & Science, 2023, 45(10): 1838-1846.
[10]	ZENG Fan-feng, WANG Chun-zhen, LI Chen. An unsupervised video summarization algorithm based on deep and shallow feature fusion [J]. Computer Engineering & Science, 2023, 45(09): 1602-1610.
[11]	CUI Ke-bin, CUI Ye-wei. A circuit breaker moving contact tracking methods based on convolution and Transformer [J]. Computer Engineering & Science, 2023, 45(07): 1236-1244.
[12]	PU Zi-jun, ZHANG Shou-ming. A sound event localization and detection algorithm based on feature fusion and Transformer model [J]. Computer Engineering & Science, 2023, 45(06): 1097-1105.
[13]	DENG Shan-shan, HUANG Hui, MA Yan. A small object detection algorithm based on improved Faster R-CNN [J]. Computer Engineering & Science, 2023, 45(05): 869-877.
[14]	ZHANG Hai-yan, FU Ying-na, DING Gui-jiang, MENG Qing-yan. Towards Anchor-free object detection with diverse receptive fields attention feature refinement network [J]. Computer Engineering & Science, 2022, 44(11): 1995-2002.
[15]	WU Cong-zhong, DONG Hao, FANG Jing. An adaptive filtering remote sensing image segmentation network based on attention mechanism [J]. Computer Engineering & Science, 2022, 44(11): 2010-2018.

An unsupervised phoneme segmentation method for Lao language with multi-feature interaction fusion

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 15

Recommended Articles 0

Metrics

Comments