An unbiased offensive text detection method based on BERT and sentiment analysis

doi:10.3969/j.issn.1007-130X.2026.05.014

Computer Engineering & Science ›› 2026, Vol. 48 ›› Issue (5): 906-913.doi: 10.3969/j.issn.1007-130X.2026.05.014

• Artificial Intelligence and Data Mining • Previous Articles Next Articles

An unbiased offensive text detection method based on BERT and sentiment analysis

YUAN Liang,GUO Weibin

(School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China)

Received:2024-07-23 Revised:2024-11-06 Online:2026-05-25 Published:2026-05-21

Abstract

Abstract: Offensive information on the internet poses severe harm to individuals and society. In offensive text detection methods, existing methods suffer from misjudging non-offensive texts containing profanity and bias against special groups. To address the former issue, this paper proposes a sentiment analysis-based offensive text detection (SAOD) model, which uses sentiment features to assist in predict- ing whether a text is offensive. To tackle the latter issue, a debiasing data augmentation method called special groups mask (SGM) is proposed. This method masks special groups during training, ensuring that special groups are not directly involved in model training, thereby reducing the model's bias towards these groups. Using BERT+LSTM as the base model, experiments were conducted on publicly avail- able datasets ToxiCN and COLD. The experimental results show that the former method improved the base model’s F1-score from 80.18% to 82.67%. Based on this, the latter method reduces the false positive rate (FPR) from 18.27% to 12.77%.

Key words: BERT model, offensive text detection, sentiment analysis, debiasing, data augmentation

YUAN Liang, GUO Weibin. An unbiased offensive text detection method based on BERT and sentiment analysis[J]. Computer Engineering & Science, 2026, 48(5): 906-913.

[1]	ZHANG Yang, HU Huijun, LIU Maofu. Aspect-based multimodal sentiment analysis based on panoramic semantics and multi-level feature fusion [J]. Computer Engineering & Science, 2026, 48(2): 341-352.
[2]	ZHANG Feng1, SHAO Yubin1, DU Qingzhi1, LONG Hua1, MA Dinan2. Multimodal aspect-based sentiment analysis based on dual channel graph convolutional network [J]. Computer Engineering & Science, 2025, 47(7): 1321-1330.
[3]	WANG Luyao, HU Huijun, LIU Maofu. Imagetext emotion classification based on visual feature enhancement and bidirectional interaction fusion [J]. Computer Engineering & Science, 2025, 47(11): 2056-2066.
[4]	ZHANG Yu-ying, ZHU Guang-li, TAN Guang-pu, . A financial implicit sentiment analysis model based on sentiment enhancement and semantic dependency [J]. Computer Engineering & Science, 2024, 46(6): 1112-1120.
[5]	ZENG Tao, WANG Jing-jing, ZHANG Han, LIU Yi-ding. A word-pair relationship modeling method for aspect-based sentiment information extraction in dialogue text [J]. Computer Engineering & Science, 2024, 46(12): 2239-2251.
[6]	SUN Jie, CHE Wen-gang, GAO Sheng-xiang. A low-rank cross-modal Transformer for multimodal sentiment analysis [J]. Computer Engineering & Science, 2024, 46(10): 1888-1900.
[7]	ZHAO Wen-hui, WU Xiao-ling, LING Jie, HOON Heo. Multi-domain sentiment analysis of Chinese text based on prompt tuning [J]. Computer Engineering & Science, 2024, 46(1): 179-190.
[8]	DONG Peng-shan, ZHANG Jing, JIN Ri-ze. Sentiment analysis of Chinese product reviews based on dual-channel gated composite network [J]. Computer Engineering & Science, 2023, 45(5): 911-919.
[9]	YANG Chun-xia, YAO Si-cheng, SONG Jin-jian, . A Chinese sentiment analysis model combining character and word information [J]. Computer Engineering & Science, 2023, 45(3): 512-519.
[10]	CHEN Jing-jing, HAN Hu, XU Xue-feng. A multi-aspect oriented dual channel knowledge-enhanced graph convolutional network model [J]. Computer Engineering & Science, 2023, 45(12): 2246-2255.
[11]	YANG Chun-xia, GUI Qiang, MA Wen-wen, XU Ben, . Aspect-level sentiment analysis of graph attention network fused with graph walk information [J]. Computer Engineering & Science, 2023, 45(10): 1858-1865.
[12]	YANG Chun-xia, YAO Si-cheng, SONG Jin-jian, . An aspect-level sentiment analysis model based on word co-occurrence [J]. Computer Engineering & Science, 2022, 44(11): 2071-2079.
[13]	YANG Chun-xia, SONG Jin-jian, YAO Si-cheng, . Aspect-level sentiment analysis based on deep BiLSTM and graph convolutional networks [J]. Computer Engineering & Science, 2022, 44(10): 1893-1900.
[14]	WANG Chun-dong, ZHANG Hui, MO Xiu-liang, YANG Wen-jun. Overview on sentiment analysis of microblog [J]. Computer Engineering & Science, 2022, 44(1): 165-175.
[15]	ZHANG Bao-hua, LI En-lin, ZHANG Hua-ping, SHANG Jian-yun. A sentiment unit representation method based on layer hierarchy [J]. Computer Engineering & Science, 2022, 44(1): 149-158.

An unbiased offensive text detection method based on BERT and sentiment analysis

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 15

Recommended Articles

Metrics

Comments