基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
Spark is a distributed data processing framework based on memory.Memory allocation is a focus question of Spark research.A good memory allocation scheme can effectively improve the efficiency of task execution and memory resource utilization of the Spark.Aiming at the memory allocation problem in the Spark2.x version,this paper optimizes the memory allocation strategy by analyzing the Spark memory model,the existing cache replacement algorithms and the memory allocation methods,which is on the basis of minimizing the storage area and allocating the execution area according to the demand.It mainly including two parts:cache replacement optimization and memory allocation optimization.Firstly,in the storage area,the cache replacement algorithm is optimized according to the characteristics of RDD Partition,which is combined with PCA dimension.In this section,the four features of RDD Partition are selected.When the RDD cache is replaced,only two most important features are selected by PCA dimension reduction method each time,thereby ensuring the generalization of the cache replacement strategy.Secondly,the memory allocation strategy of the execution area is optimized according to the memory requirement of Task and the memory space of storage area.In this paper,a series of experiments in Spark on Yarn mode are carried out to verify the effectiveness of the optimization algorithm and improve the cluster performance.
推荐文章
Mechanism of accelerated dissolution of mineral crystals by cavitation erosion
Cavitation erosion
Mineral dissolution
Plastic deformation
Stepwave
Gibbs free energy
An experimental study on dynamic coupling process of alkaline feldspar dissolution and secondary min
Alkaline feldspar
Dissolution rate
Precipitation
Mineral conversion
Secondary porosity
Effect of Zn deficiency and excessive bicarbonate on the allocation and exudation of organic acids i
Adaptation
Excessive bicarbonate
Organic acids
Organs
Root exudates
Zn deficiency
Spark数据倾斜问题研究
大数据
Spark
数据倾斜
数据处理
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 A Dynamic Memory Allocation Optimization Mechanism Based on Spark
来源期刊 计算机、材料和连续体(英文) 学科 工学
关键词 MEMORY calculation MEMORY ALLOCATION OPTIMIZATION CACHE REPLACEMENT OPTIMIZATION
年,卷(期) 2019,(8) 所属期刊栏目
研究方向 页码范围 739-757
页数 19页 分类号 TP3
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2019(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
MEMORY
calculation
MEMORY
ALLOCATION
OPTIMIZATION
CACHE
REPLACEMENT
OPTIMIZATION
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
计算机、材料和连续体(英文)
月刊
1546-2218
江苏省南京市浦口区东大路2号东大科技园A
出版文献量(篇)
346
总下载数(次)
4
总被引数(次)
0
论文1v1指导