基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
The concept of missing data is important to apply statistical methods on the dataset. Statisticians and researchers may end up to an inaccurate illation about the data if the missing data are not handled properly. Of late, Python and R provide diverse packages for handling missing data. In this study, an imputation algorithm, cumulative linear regression, is proposed. The proposed algorithm depends on the linear regression technique. It differs from the existing methods, in that it cumulates the imputed variables;those variables will be incorporated in the linear regression equation to filling in the missing values in the next incomplete variable. The author performed a comparative study of the proposed method and those packages. The performance was measured in terms of imputation time, root-mean-square error, mean absolute error, and coefficient of determination (R^2). On analysing on five datasets with different missing values generated from different mechanisms, it was observed that the performances vary depending on the size, missing percentage, and the missingness mechanism. The results showed that the performance of the proposed method is slightly better.
推荐文章
A re-assessment of nickel-doping method in iron isotope analysis on rock samples using multi-collect
Fe isotope
Ni-doping
Stable isotope
Precision and accuracy
Mass bias correction
Pseudo-high mass resolution
Groundwater quality assessment using multivariate analysis, geostatistical modeling, and water quali
Groundwater
Multivariate analysis
Geostatistical modeling
Geochemical modeling
Mineralization
Ordinary Kriging
Spatial analysis of carbon storage density of mid-subtropical forests using geostatistics: a case st
Carbon storage density
Geostatistics
Mid-subtropical forests
Spatial autocorrelation
Spatial heterogeneity
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Imputing missing values using cumulative linear regression
来源期刊 智能技术学报 学科 社会科学
关键词 Imputing MISSING VALUES CUMULATIVE LINEAR regression STATISTICAL METHODS
年,卷(期) 2019,(3) 所属期刊栏目
研究方向 页码范围 182-200
页数 19页 分类号 G
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2019(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
Imputing
MISSING
VALUES
CUMULATIVE
LINEAR
regression
STATISTICAL
METHODS
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
智能技术学报
季刊
2468-2322
重庆市巴南区红光大道69号
出版文献量(篇)
142
总下载数(次)
4
总被引数(次)
0
论文1v1指导