基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Purpose:Ever increasing penetration of the Internet in our lives has led to an enormous amount of multimedia content generation on the internet.Textual data contributes a major share towards data generated on the world wide web.Understanding people’s sentiment is an important aspect of natural language processing,but this opinion can be biased and incorrect,if people use sarcasm while commenting,posting status updates or reviewing any product or a movie.Thus,it is of utmost importance to detect sarcasm correctly and make a correct prediction about the people’s intentions.Design/methodology/approach:This study tries to evaluate various machine learning models along with standard and hybrid deep learning models across various standardized datasets.We have performed vectorization of text using word embedding techniques.This has been done to convert the textual data into vectors for analytical purposes.We have used three standardized datasets available in public domain and used three word embeddings i.e Word2Vec,GloVe and fastText to validate the hypothesis.Findings:The results were analyzed and conclusions are drawn.The key finding is:the hybrid models that include Bidirectional LongTerm Short Memory(Bi-LSTM)and Convolutional Neural Network(CNN)outperform others conventional machine learning as well as deep learning models across all the datasets considered in this study,making our hypothesis valid.Research limitations:Using the data from different sources and customizing the models according to each dataset,slightly decreases the usability of the technique.But,overall this methodology provides effective measures to identify the presence of sarcasm with a minimum average accuracy of 80%or above for one dataset and better than the current baseline results for the other datasets.Practical implications:The results provide solid insights for the system developers to integrate this model into real-time analysis of any review or comment posted in the public domain.This study has various other practical implications fo
推荐文章
Identification of bacterial fossils in marine source rocks in South China
South China
Excellent marine source rocks
Bacterial fossil
Sedimentary environment
基于语义的Data Cube数字水印技术
数字水印
语义
数据立方体
版权
Data Transfer Object模式探讨
Data Transfer Object 三层应用 DataSet
Identification of terrigenous and autochthonous organic carbon in sediment cores from cascade reserv
Cascade reservoirs
Terrigenous OC
Lignin
Autochthonous OC
Three-end-member mixing model
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Identification of Sarcasm in Textual Data:A Comparative Study
来源期刊 数据与情报科学学报:英文版 学科 工学
关键词 Machine learning Artificial NEURAL networks Word EMBEDDING TEXT VECTORIZATION ACCURACY
年,卷(期) sjyqbkxxbywb_2019,(4) 所属期刊栏目
研究方向 页码范围 56-83
页数 28页 分类号 TP3
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2019(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
Machine
learning
Artificial
NEURAL
networks
Word
EMBEDDING
TEXT
VECTORIZATION
ACCURACY
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
数据与情报科学学报:英文版
季刊
2096-157X
10-1394/G2
北京市中关村北四环西路33号
82-563
出版文献量(篇)
445
总下载数(次)
1
论文1v1指导