基于BERT和情感分析的无偏见攻击性文本检测方法

doi:10.3969/j.issn.1007-130X.2026.05.014

计算机工程与科学 ›› 2026, Vol. 48 ›› Issue (5): 906-913.doi: 10.3969/j.issn.1007-130X.2026.05.014

基于BERT和情感分析的无偏见攻击性文本检测方法

袁亮，郭卫斌

(华东理工大学信息科学与工程学院，上海 200237)

收稿日期:2024-07-23 修回日期:2024-11-06 出版日期:2026-05-25 发布日期:2026-05-21
基金资助:
国家自然科学基金(62076094)

An unbiased offensive text detection method based on BERT and sentiment analysis

YUAN Liang,GUO Weibin

(School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China)

Received:2024-07-23 Revised:2024-11-06 Online:2026-05-25 Published:2026-05-21

摘要/Abstract

摘要： 互联网中的攻击性信息会对个人和社会造成严重危害。在攻击性文本检测方法中,现有的方法存在对含有脏话的非攻击性文本的误判问题和对特殊群体存在偏见的问题。针对前者,提出了一种基于情感分析的攻击性文本检测模型SAOD,利用情感特征辅助预测文本是否具有攻击性；针对后者,提出了一种去偏见的数据增强方法SGM,在训练时将特殊群体进行掩盖,使特殊群体不经过模型训练,从而降低模型对特殊群体的偏见。以BERT+LSTM为基础模型,基于公开数据集ToxiCN和COLD,进行了相应的实验验证。实验结果表明,前者以F1为评价指标,将基础模型的F1分数由80.18%提升到了82.67%;后者实验建立在前者基础上,以误报率FPR为指标,将其由18.27%降低到12.77%。

关键词: BERT模型, 攻击性文本检测, 情感分析, 去偏见, 数据增强

Abstract: Offensive information on the internet poses severe harm to individuals and society. In offensive text detection methods, existing methods suffer from misjudging non-offensive texts containing profanity and bias against special groups. To address the former issue, this paper proposes a sentiment analysis-based offensive text detection (SAOD) model, which uses sentiment features to assist in predict- ing whether a text is offensive. To tackle the latter issue, a debiasing data augmentation method called special groups mask (SGM) is proposed. This method masks special groups during training, ensuring that special groups are not directly involved in model training, thereby reducing the model's bias towards these groups. Using BERT+LSTM as the base model, experiments were conducted on publicly avail- able datasets ToxiCN and COLD. The experimental results show that the former method improved the base model’s F1-score from 80.18% to 82.67%. Based on this, the latter method reduces the false positive rate (FPR) from 18.27% to 12.77%.

Key words: BERT model, offensive text detection, sentiment analysis, debiasing, data augmentation

袁亮, 郭卫斌. 基于BERT和情感分析的无偏见攻击性文本检测方法[J]. 计算机工程与科学, 2026, 48(5): 906-913.

YUAN Liang, GUO Weibin. An unbiased offensive text detection method based on BERT and sentiment analysis[J]. Computer Engineering & Science, 2026, 48(5): 906-913.

[1]	张洋, 胡慧君, 刘茂福. 基于全景语义和多层次特征融合的方面级多模态情感分析[J]. 计算机工程与科学, 2026, 48(2): 341-352.
[2]	张凤1, 邵玉斌1, 杜庆治1, 龙华1, 马迪南2. 基于双通道图卷积网络的多模态方面级情感分析[J]. 计算机工程与科学, 2025, 47(7): 1321-1330.
[3]	王露瑶, 胡慧君, 刘茂福. 基于视觉特征增强与双向交互融合的图文情绪分类[J]. 计算机工程与科学, 2025, 47(11): 2056-2066.
[4]	张玉莹, 朱广丽, 谈光璞, . 基于情感增强和语义依存的金融隐式情感分析模型[J]. 计算机工程与科学, 2024, 46(6): 1112-1120.
[5]	曾涛, 王晶晶, 张涵, 刘一丁. 一种针对对话文本属性级情感信息抽取的词对关系建模方法[J]. 计算机工程与科学, 2024, 46(12): 2239-2251.
[6]	孙杰, 车文刚, 高盛祥. 面向多模态情感分析的低秩跨模态Transformer[J]. 计算机工程与科学, 2024, 46(10): 1888-1900.
[7]	赵文辉, 吴晓鸰, 凌捷, HOON Heo. 基于prompt tuning的中文文本多领域情感分析研究[J]. 计算机工程与科学, 2024, 46(1): 179-190.
[8]	董芃杉, 张晶, 金日泽. 基于双通道门控复合网络的中文产品评论情感分析[J]. 计算机工程与科学, 2023, 45(5): 911-919.
[9]	杨春霞, 姚思诚, 宋金剑, . 一种融合字词信息的中文情感分析模型[J]. 计算机工程与科学, 2023, 45(3): 512-519.
[10]	陈景景, 韩虎, 徐学锋. 面向多方面的双通道知识增强图卷积网络模型[J]. 计算机工程与科学, 2023, 45(12): 2246-2255.
[11]	杨春霞, 桂强, 马文文, 徐奔, . 融合图游走信息的图注意力网络方面级情感分析[J]. 计算机工程与科学, 2023, 45(10): 1858-1865.
[12]	杨春霞, 姚思诚, 宋金剑, . 基于词共现的方面级情感分析模型[J]. 计算机工程与科学, 2022, 44(11): 2071-2079.
[13]	杨春霞, 宋金剑, 姚思诚, . 基于深度BiLSTM和图卷积网络的方面级情感分析[J]. 计算机工程与科学, 2022, 44(10): 1893-1900.
[14]	王春东, 张卉, 莫秀良, 杨文军. 微博情感分析综述[J]. 计算机工程与科学, 2022, 44(1): 165-175.
[15]	张宝华 , 李奀林 , 张华平 , 商建云. 基于层次体系的情感单元表示方法[J]. 计算机工程与科学, 2022, 44(1): 149-158.

基于BERT和情感分析的无偏见攻击性文本检测方法

An unbiased offensive text detection method based on BERT and sentiment analysis

PDF

可视化

摘要/Abstract

引用本文

使用本文

相关文章 15

编辑推荐

Metrics

本文评价