Indexed in:
Abstract:
Named Entity Recognition (NER) is a fundamental task in natural language processing and an indispensable component of machine translation, knowledge graph construction, and other applications. This paper proposes a fusion model for Chinese named entity recognition that combines BERT, a Bidirectional LSTM (BiLSTM), and a Conditional Random Field (CRF). In this model, a Chinese BERT serves as the word embedding model and generates word vectors; the BiLSTM then learns the label distribution over these word vectors; finally, a CRF imposes sentence-level syntactic constraints to produce the annotation sequence. In addition, Whole Word Masking (wwm) can replace the original random masking in BERT's pre-training, which effectively addresses the problem that Chinese words in NER are only partially masked, thereby improving the performance of the NER model. Comparative experiments using BERT-wwm (BERT pre-trained with Whole Word Masking), BERT, ELMo, and Word2Vec demonstrate the effect of BERT-wwm in this fusion model. The results show that using Chinese BERT-wwm as the language representation model of the NER model yields better recognition ability. © 2020 ACM.
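The CRF layer described in the abstract selects, for each sentence, the globally best tag sequence given the per-token scores produced by the BiLSTM. A minimal sketch of that decoding step (Viterbi decoding) is shown below in plain Python; the tag set, transition scores, and emission scores are illustrative stand-ins, not values from the paper.

```python
# Viterbi decoding, as used by the CRF layer of a BERT-BiLSTM-CRF tagger.
# Emission scores stand in for BiLSTM outputs; tags/transitions are hypothetical.

TAGS = ["O", "B-PER", "I-PER"]

# transitions[i][j]: score of moving from tag i to tag j.
# A large negative score forbids illegal moves such as "O -> I-PER".
NEG = -10000.0
transitions = [
    [0.0, 0.0, NEG],   # from O
    [0.0, 0.0, 1.0],   # from B-PER
    [0.0, 0.0, 1.0],   # from I-PER
]

def viterbi_decode(emissions, transitions):
    """Return the highest-scoring tag sequence for one sentence.

    emissions: per-token score lists, shape [seq_len][num_tags],
    as would be produced by the BiLSTM layer.
    """
    num_tags = len(emissions[0])
    # scores[j]: best score of any tag path ending in tag j at the current token
    scores = list(emissions[0])
    backpointers = []
    for emit in emissions[1:]:
        bp, new_scores = [], []
        for j in range(num_tags):
            best_i = max(range(num_tags),
                         key=lambda i: scores[i] + transitions[i][j])
            new_scores.append(scores[best_i] + transitions[best_i][j] + emit[j])
            bp.append(best_i)
        scores = new_scores
        backpointers.append(bp)
    # Trace the best path backwards from the best final tag.
    best_last = max(range(num_tags), key=lambda j: scores[j])
    path = [best_last]
    for bp in reversed(backpointers):
        path.append(bp[path[-1]])
    path.reverse()
    return [TAGS[t] for t in path]

# Example: emission scores for a 3-token sentence.
emissions = [
    [0.1, 2.0, 0.0],  # token 1: likely B-PER
    [0.2, 0.1, 1.5],  # token 2: likely I-PER
    [2.0, 0.1, 0.0],  # token 3: likely O
]
print(viterbi_decode(emissions, transitions))  # -> ['B-PER', 'I-PER', 'O']
```

Unlike greedy per-token prediction, this sentence-level search lets the transition scores rule out invalid label sequences, which is the role the CRF plays on top of the BiLSTM in the proposed model.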
Keywords:
Corresponding author:
Email address: