基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Part of Speech (POS) Tagging can be applied by several tools and several programming languages. This work focuses on the Natural Language Toolkit (NLTK) library in the Python environment and the gold standard corpora installable. The corpora and tagging methods are analyzed and com- pared by using the Python language. Different taggers are analyzed according to their tagging ac- curacies with data from three different corpora. In this study, we have analyzed Brown, Penn Treebank and NPS Chat corpuses. The taggers we have used for the analysis are;default tagger, regex tagger, n-gram taggers. We have applied all taggers to these three corpuses, resultantly we have shown that whereas Unigram tagger does the best tagging in all corpora, the combination of taggers does better if it is correctly ordered. Additionally, we have seen that NPS Chat Corpus gives different accuracy results than the other two corpuses.
推荐文章
Hydrogeochemical evaluation and statistical analysis of groundwater of Sylhet, north-eastern Banglad
Arsenic
Groundwater
Hydrogeochemistry
Multivariate statistics
Spatial distribution
Comprehensive geochemical/hydrochemical and geo-thermometry analysis of Unai geothermal field, Gujar
Geothermal energy
Hydrochemical
Geochemical
Geothermometery
Renewable energy
Part-Join:基于划分的字符串相似性连接
相似性连接
划分
频率
编辑距离
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Tagging Accuracy Analysis on Part-of-Speech Taggers
来源期刊 电脑和通信(英文) 学科 医学
关键词 POS Tagger BROWN CORPUS Penn TREEBANK CORPUS NPS CHAT CORPUS
年,卷(期) 2014,(4) 所属期刊栏目
研究方向 页码范围 157-162
页数 6页 分类号 R73
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2014(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
POS
Tagger
BROWN
CORPUS
Penn
TREEBANK
CORPUS
NPS
CHAT
CORPUS
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
电脑和通信(英文)
月刊
2327-5219
武汉市江夏区汤逊湖北路38号光谷总部空间
出版文献量(篇)
783
总下载数(次)
0
总被引数(次)
0
论文1v1指导