基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Purpose:This paper aims to analyze the effectiveness of two major types of features—metadata-based(behavioral)and content-based(textual)—in opinion spam detection.Design/methodology/approach:Based on spam-detection perspectives,our approach works in three settings:review-centric(spam detection),reviewer-centric(spammer detection)and product-centric(spam-targeted product detection).Besides this,to negate any kind of classifier-bias,we employ four classifiers to get a better and unbiased reflection of the obtained results.In addition,we have proposed a new set of features which are compared against some well-known related works.The experiments performed on two real-world datasets show the effectiveness of different features in opinion spam detection.Findings:Our findings indicate that behavioral features are more efficient as well as effective than the textual to detect opinion spam across all three settings.In addition,models trained on hybrid features produce results quite similar to those trained on behavioral features than on the textual,further establishing the superiority of behavioral features as dominating indicators of opinion spam.The features used in this work provide improvement over existing features utilized in other related works.Furthermore,the computation time analysis for feature extraction phase shows the better cost efficiency of behavioral features over the textual.Research limitations:The analyses conducted in this paper are solely limited to two wellknown datasets,viz.,Yelp Zip and Yelp NYC of Yelp.com.Practical implications:The results obtained in this paper can be used to improve the detection of opinion spam,wherein the researchers may work on improving and developing feature engineering and selection techniques focused more on metadata information.Originality/value:To the best of our knowledge,this study is the first of its kind which considers three perspectives(review,reviewer and product-centric)and four classifiers to analyze the effectiveness of opinion spam detection using two m
推荐文章
Metadata机制研究
Metadata
Meta演算
Web深度计算
MetaManager
Diffusion in garnet: a review
High temperature and high pressure
Diffusion
Garnet
Point defects
A review of geoanalytical databases
Database
Geochemistry
Geology
Geoanalysis
Information system
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Effective Opinion Spam Detection: A Study on Review Metadata Versus Content
来源期刊 数据与情报科学学报:英文版 学科 工学
关键词 Opinion spam Behavioral features Textual features Review spammers Spam-targeted products
年,卷(期) 2020,(2) 所属期刊栏目
研究方向 页码范围 76-110
页数 35页 分类号 TP181
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2020(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
Opinion
spam
Behavioral
features
Textual
features
Review
spammers
Spam-targeted
products
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
数据与情报科学学报:英文版
季刊
2096-157X
10-1394/G2
北京市中关村北四环西路33号
82-563
出版文献量(篇)
445
总下载数(次)
1
总被引数(次)
0
论文1v1指导