作者:
基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Built upon an iterative process of resampling without replacement and out-of-sample prediction, the delete-d cross validation statistic CV(d) provides a robust estimate of forecast error variance. To compute CV(d), a dataset consisting of n observations of predictor and response values is systematically and repeatedly partitioned (split) into subsets of size n – d (used for model training) and d (used for model testing). Two aspects of CV(d) are explored in this paper. First, estimates for the unknown expected value E[CV(d)] are simulated in an OLS linear regression setting. Results suggest general formulas for E[CV(d)] dependent on σ2 (“true” model error variance), n – d (training set size), and p (number of predictors in the model). The conjectured E[CV(d)] formulas are connected back to theory and generalized. The formulas break down at the two largest allowable d values (d = n – p – 1 and d = n – p, the 1 and 0 degrees of freedom cases), and numerical instabilities are observed at these points. An explanation for this distinct behavior remains an open question. For the second analysis, simulation is used to demonstrate how the previously established asymptotic conditions {d/n → 1 and n – d → ∞ as n → ∞} required for optimal linear model selection using CV(d) for model ranking are manifested in the smallest sample setting, using either independent or correlated candidate predictors.
推荐文章
LT码译码算法的研究
LT码
喷泉码
MPGE
译码算法
基于LT码数据分发协议性能分析
LT码
分发协议
无线传感网络
LT-B转基因烟草植株的建立
大肠杆菌热不稳定肠毒素B亚单位
转基因烟草
植物疫苗
根瘤农杆菌
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Small Sample Behaviors of the Delete-<i>d</i>Cross Validation Statistic
来源期刊 统计学期刊(英文) 学科 医学
关键词 EXPECTED Value FORECAST Error Variance Linear Regression Model Selection Simulation
年,卷(期) 2015,(5) 所属期刊栏目
研究方向 页码范围 382-392
页数 11页 分类号 R73
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2015(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
EXPECTED
Value
FORECAST
Error
Variance
Linear
Regression
Model
Selection
Simulation
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
统计学期刊(英文)
半月刊
2161-718X
武汉市江夏区汤逊湖北路38号光谷总部空间
出版文献量(篇)
584
总下载数(次)
0
总被引数(次)
0
论文1v1指导