基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Stemming is used to produce stem or root of words. The process is vital to different research fields such as text mining, sentiment analysis, and text categorization, etc. Several techniques have been proposed to stemming Arabic text and among them, Khoja and light-10 stemmers are the most widely used. In this paper, we propose and evaluate two different stemming techniques to Arabic that are based on light stemming techniques. The new stemmers are compared to best reported light stemmer, which is light-10. Results and experiments, which were conducted using standard collections, reveal that The proposed stemmers yield 5.13% and 13.1% improvement in retrieval performance over light 10 with 0.369 average precision and 0.397, respectively and the improvement is statistically significant.
推荐文章
基于 EPICS 的 J-TEXT CODAC系统
CODAC系统
托卡马克
ITER
EPICS
J-TEXT托卡马克数据采集系统设计
J-TEXT
数据采集
MDSplus
In-situ nitrogen fate in the vadose zone of different soil types and its implications for groundwate
Vadose zone
Silty-loam
Silty-clay-loam
Nitrogen transformation
Groundwater vulnerability
Stable isotopes
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Developing Two Different Novel Techniques for Arabic Text Stemming
来源期刊 智能信息管理(英文) 学科 医学
关键词 ARABIC Language ARABIC Information RETRIEVAL LIGHT STEMMING LIGHT 10 EXTENDED Light-Stemmer Linguistic-Based Stemmer
年,卷(期) 2019,(1) 所属期刊栏目
研究方向 页码范围 1-23
页数 23页 分类号 R73
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2019(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
ARABIC
Language
ARABIC
Information
RETRIEVAL
LIGHT
STEMMING
LIGHT
10
EXTENDED
Light-Stemmer
Linguistic-Based
Stemmer
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
智能信息管理(英文)
半月刊
2160-5912
武汉市江夏区汤逊湖北路38号光谷总部空间
出版文献量(篇)
114
总下载数(次)
0
总被引数(次)
0
论文1v1指导