基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
In this paper we investigate the effectiveness of ensemble-based learners for web robot session identification from web server logs. We also perform multi fold robot session labeling to improve the performance of learner. We conduct a comparative study for various ensemble methods (Bagging, Boosting, and Voting) with simple classifiers in perspective of classification. We also evaluate the effectiveness of these classifiers (both ensemble and simple) on five different data sets of varying session length. Presently the results of web server log analyzers are not very much reliable because the input log files are highly inflated by sessions of automated web traverse software’s, known as web robots. Presence of web robots access traffic entries in web server log repositories imposes a great challenge to extract any actionable and usable knowledge about browsing behavior of actual visitors. So web robots sessions need accurate and fast detection from web server log repositories to extract knowledge about genuine visitors and to produce correct results of log analyzers.
推荐文章
嵌入式Web Server中SQLite访问技术的研究
嵌入式Web Server
SQLite数据库
Common Gateway Interface(CGI)
应用下位机Web Server监控联合站运行
联合站
分布式子系统
CGI程序
Web Server
基于ARM7的嵌入式Web Server的实现
嵌入式Web Server
ARM7TDMI
μC/OS-Ⅱ
嵌入式网关接口(EGI)
网络安全
IIC总线在嵌入式WEB Server中的应用
ⅡC
S3C4510
嵌入式WEB Server
ZLG7290
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Agglomerative Approach for Identification and Elimination of Web Robots from Web Server Logs to Extract Knowledge about Actual Visitors
来源期刊 数据分析和信息处理(英文) 学科 医学
关键词 WEB Robots WEB Server Log REPOSITORIES Ensemble Learning Bagging Boosting and Voting Actionable KNOWLEDGE Usable KNOWLEDGE BROWSING Behavior GENUINE VISITORS
年,卷(期) 2015,(1) 所属期刊栏目
研究方向 页码范围 1-10
页数 10页 分类号 R73
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2015(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
WEB
Robots
WEB
Server
Log
REPOSITORIES
Ensemble
Learning
Bagging
Boosting
and
Voting
Actionable
KNOWLEDGE
Usable
KNOWLEDGE
BROWSING
Behavior
GENUINE
VISITORS
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
数据分析和信息处理(英文)
季刊
2327-7211
武汉市江夏区汤逊湖北路38号光谷总部空间
出版文献量(篇)
106
总下载数(次)
0
论文1v1指导