• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

朱青 (朱青.) (Scholars:朱青) | 吕晓旭 (吕晓旭.)

Indexed by:

CQVIP CSCD

Abstract:

标题是描述一个HTML文档主题的重要信息,但常常不能被准确指明.本文通过对过去标题抽取方法优缺点的总结和进一步分析,提出了通过机器学习策略进行标题抽取的方法.我们将HTML格式及DOM树结构等信息引入了机器学习标题抽取过程中,并通过实验验证了我们提出方法的可行性.

Keyword:

标题 机器学习 信息抽取

Author Community:

  • [ 1 ] [朱青]北京工业大学
  • [ 2 ] [吕晓旭]北京工业大学

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

微计算机信息

ISSN: 1008-0570

Year: 2010

Issue: 9

Volume: 26

Page: 15-16,11

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count: 8

Chinese Cited Count:

30 Days PV: 0

Affiliated Colleges:

Online/Total:631/5411562
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.