Spark is a fast, unified analytics engine for big data and machine learning, in which memory is a crucial resource. Resilient Distributed Datasets (RDDs) are parallel data structures that allow users to explicitly persist intermediate results in memory or on disk, and each RDD can be divided into several partitions. During task execution, Spark automatically monitors cache usage on each node; when an RDD must be stored in a cache with insufficient space, the system evicts old data partitions in a least-recently-used (LRU) fashion to release space. However, Spark has no mechanism dedicated to RDD caching, and LRU takes neither the dependencies among RDDs nor the needs of future stages into consideration. In this paper, we propose an optimization approach for RDD caching and LRU eviction based on the features of partitions, which consists of three parts: a prediction mechanism for persistence, a weight model built with the entropy method, and an update mechanism for weights and memory based on RDD partition features. Finally, experiments on the Spark platform show that our strategy can effectively reduce execution time and improve memory utilization.
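The entropy method mentioned above assigns objective weights to partition features: a feature whose values vary widely across candidate partitions (low entropy) receives a large weight, while a feature that is nearly uniform (high entropy) receives a small one. The following is a minimal sketch of the standard entropy-weight calculation; the feature names in the comments (size, recompute cost, reference count) are illustrative assumptions, not the paper's exact feature set.

```python
import math

def entropy_weights(matrix):
    """Compute entropy-method weights for feature columns.

    matrix: rows are candidate partitions, columns are positive feature
    values (e.g., partition size, recompute cost, future reference count
    -- illustrative choices, not necessarily the paper's features).
    Requires at least two rows.
    """
    m = len(matrix)       # number of partitions
    n = len(matrix[0])    # number of features
    k = 1.0 / math.log(m)
    entropies = []
    for j in range(n):
        col = [row[j] for row in matrix]
        total = sum(col)
        # proportion contributed by each partition within feature j
        probs = [v / total for v in col]
        # Shannon entropy, normalized to [0, 1] by the factor k
        e = -k * sum(p * math.log(p) for p in probs if p > 0)
        entropies.append(e)
    # a feature with more dispersion (lower entropy) gets more weight
    diff = [1.0 - e for e in entropies]
    s = sum(diff)
    return [d / s for d in diff]
```

A feature column whose values are identical across partitions has entropy 1 and therefore weight 0, which matches the intuition that a non-discriminating feature should not influence which partition is evicted.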