Biomedical named entity recognition based on BERT and BiLSTM-CRF

Computer Engineering & Science ›› 2021, Vol. 43 ›› Issue (10): 1873-1879.

Previous Articles Next Articles

Biomedical named entity recognition based on BERT and BiLSTM-CRF

XU Li,LI Jian-hua

(School of Information Science and Engineering,East China University of Science and Technology,Shanghai 200237,China)

Received:2020-04-12 Revised:2020-09-14 Accepted:2021-10-25 Online:2021-10-25 Published:2021-10-22

Abstract

Abstract: In biomedical field, the named entity recognition method based on static word vector achieves low precision. To solve this problem, a method of combining pre-training model BERT and BiLSTM-CRF for biomedical named entity recognition is proposed. Firstly, the BERT is used for semantic extraction and the generation of dynamic word vector. Part of speech and chunking features are added to improve the model precision. Secondly, the word vector is sent to the BiLSTM model for further training to obtain the context features. Finally, the CRF is used to decode sequence and output the result with maximum probability. The average F1 score of this model reaches 89.45% on BC4CHEMD, BC5CDR-chem and NCBI-disease datasets. Experimental results show that the proposed model can effectively improve the precision of the model in the biomedical named entity recognition task.

Key words: biomedicine, named entity recognition, pre-training language model, part of speech, chunk- ing

XU Li, LI Jian-hua. Biomedical named entity recognition based on BERT and BiLSTM-CRF[J]. Computer Engineering & Science, 2021, 43(10): 1873-1879.

[1]	DING Jian-ping, LI Wei-jun, LIU Xue-yang, CHEN Xu. A review of named entity recognition research [J]. Computer Engineering & Science, 2024, 46(07): 1296-1310.
[2]	TIAN Hong-peng, WU Jing-wei. RIB-NER:A span-based Chinese named entity recognition model [J]. Computer Engineering & Science, 2024, 46(07): 1311-1320.
[3]	CHEN Huan-huan, WANG Jian, Muhammad Naeem Ul Hassan, . Chinese-Urdu neural machine translation interacting POS sequence prediction in Urdu language [J]. Computer Engineering & Science, 2024, 46(03): 518-524.
[4]	YU Jin-ping, ZHU Wei-feng, LIAO Lie-fa. Entity recognition of support policy text based on RoBERTa-wwm-BiLSTM-CRF [J]. Computer Engineering & Science, 2023, 45(08): 1498-1507.
[5]	LI Hong-fei, LIU Pan-yu, WEI Yong. Military named entity recognition based on self-attention and Lattice-LSTM [J]. Computer Engineering & Science, 2021, 43(10): 1848-1855.
[6]	QIU Zeng-hui, HE Ming-jie, LIN Zheng-kui. A named entity recognition method for online shopping comments based on deep learning [J]. Computer Engineering & Science, 2020, 42(12): 2287-2294.
[7]	XIAO Xinfeng1,2,LI Shijun2,YU Wei2,LIU Jie2,LIU Beixiong1. English-Chinese translation based on an improved seq2seq model [J]. Computer Engineering & Science, 2019, 41(07): 1257-1265.
[8]	LI Jianlong，WANG Panqing，HAN Qiyu. Military named entity recognition based on bidirectional LSTM [J]. Computer Engineering & Science, 2019, 41(04): 711-718.
[9]	. [J]. J4, 2007, 29(11): 152-156.
[10]	. [J]. J4, 2006, 28(6): 135-139.

Biomedical named entity recognition based on BERT and BiLSTM-CRF

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 10

Recommended Articles

Metrics

Comments