Indexed in:
Abstract:
Many machine learning methods have been applied to Named Entity Recognition (NER). Such methods generally rely on a large, manually annotated training set. However, the training set is usually limited because human labeling is costly and time-consuming. Compared to the training set, the unlabeled corpus is usually much larger and contains rich information about the language. In this paper, a hybrid Deep Neural Network (DNN) is proposed to take advantage of the implicit information embedded in the unlabeled corpus. The experiments show that, compared with Conditional Random Fields (CRFs), the F1-score improves from 85% to 90% for person names, from 75% to 81% for location names, and from 74% to 78% for organization names.
Keywords:
Corresponding author:
Email address:
Source:
2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS)
ISSN: 2376-5933
Year: 2014
Pages: 433-438
Language: English
Affiliated department: