Abstract:
Purpose: To design an efficient, high-performance algorithm for semantic annotation of biodiversity documents in Chinese.
Design/methodology/approach: The data set consists of 1,000 randomly selected documents from Flora of China. The proposed approach was evaluated against the Naïve Bayes algorithm, which had been developed earlier for the same purpose.
Findings: Experimental results show that the heuristics based algorithm outperformed the Naïve Bayes algorithm. The use of leading words helped improve annotation performance, while prioritizing rule application based on rule weights had no significant impact on algorithm performance.
Research limitations: ICTCLAS was used off the shelf to identify word boundaries, without optimization for the biodiversity domain. This may not have made the best use of the tool.
Practical implications & Originality/value: The heuristics based approach, enhanced by leading word analysis, reached an F value of 0.9216, which is sufficiently accurate for practical use.
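Document information
Title: Heuristics based semantic annotation of biodiversity documents in Chinese
Source journal: 中国文献情报:英文版 (Chinese Journal of Library and Information Science)
Discipline: Biology
Keywords: Heuristics based method; Leading word analysis; TA
Year, volume (issue): 2013, (2)
Page range: 33-46
Page count: 14
Classification number: Q16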
References: 55
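Journal information
Journal: 数据与情报科学学报:英文版 (Journal of Data and Information Science)
Frequency: Quarterly
ISSN: 2096-157X
CN: 10-1394/G2
Address: 33 Beisihuan Xilu, Zhongguancun, Beijing
Postal distribution code: 82-563
Articles published: 445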