基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Purpose:Existing researches of predicting queries with news intents have tried to extract the classification features from extemal knowledge bases,this paper tries to present how to apply features extracted from query logs for automatic identification of news queries without using any external resources.Design/methodology/approach:First,we manually labeled 1,220 news queries from Sogou.com.Based on the analysis of these queries,we then identified three features of news queries in terms of query content,time of query occurrence and user click behavior.Afterwards,we used 12 effective features proposed in literature as baseline and conducted experiments based on the support vector machine (SVM) classifier.Finally,we compared the impacts of the features used in this paper on the identification of news queries.Findings:Compared with baseline features,the F-score has been improved from 0.6414 to 0.8368 after the use of three newly-identified features,among which the burst point (bst)was the most effective while predicting news queries.In addition,query expression (qes) was more useful than query terms,and among the click behavior-based features,news URL was the most effective one.Research limitations:Analyses based on features extracted from query logs might lead to produce limited results.Instead of short queries,the segmentation tool used in this study has been more widely applied for long texts.Practical implications:The research will be helpful for general-purpose search engines to address search intents for news events.Originality/value:Our approach provides a new and different perspective in recognizing queries with news intent without such large news corpora as blogs or Twitter.
推荐文章
钢弹簧浮置板中低频振动特性分析
振动与波
城市轨道交通
钢弹簧浮置板
振动特性
刚度
压延胶帘布自动测厚装置的设计及优化
胶鞋
布面胶鞋
压延
胶帘布
测厚装置
基于有限元分析的一种超超高效异步电机?
有限元分析
超超高效
损耗
异步电机
不稳定心绞痛TIMI危险分层与中医血瘀证相关性研究
不稳定心绞痛
心肌梗死溶栓危险评分
血瘀证
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Exploring features for automatic identification of news queries through query logs
来源期刊 中国文献情报(英文刊) 学科
关键词 Query intent News query News intent Query classification Automatic identification
年,卷(期) 2014,(4) 所属期刊栏目
研究方向 页码范围 31-45
页数 15页 分类号
字数 语种 英文
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (5)
共引文献  (8)
参考文献  (8)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
1960(1)
  • 参考文献(1)
  • 二级参考文献(0)
1977(2)
  • 参考文献(1)
  • 二级参考文献(1)
2001(1)
  • 参考文献(1)
  • 二级参考文献(0)
2008(4)
  • 参考文献(1)
  • 二级参考文献(3)
2010(2)
  • 参考文献(1)
  • 二级参考文献(1)
2011(1)
  • 参考文献(1)
  • 二级参考文献(0)
2012(2)
  • 参考文献(2)
  • 二级参考文献(0)
2014(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
Query intent
News query
News intent
Query classification
Automatic identification
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
中国文献情报(英文刊)
季刊
1674-3393
11-5670/G2
eng
出版文献量(篇)
199
总下载数(次)
0
总被引数(次)
213
论文1v1指导