Similarity Search Algorithm over Data Supply Chain Based on Key Points
基本信息来源于合作网站,原文需代理用户跳转至来源网站获取
摘要:
In this paper,we target a similarity search among data supply chains,which plays an essential role in optimizing the supply chain and extending its value.This problem is very challenging for application-oriented data supply chains because the high complexity of the data supply chain makes the computation of similarity extremely complex and inefficient.In this paper,we propose a feature space representation model based on key points,which can extract the key features from the subsequences of the original data supply chain and simplify it into a feature vector form.Then,we formulate the similarity computation of the subsequences based on the multiscale features.Further,we propose an improved hierarchical clustering algorithm for a similarity search over the data supply chains.The main idea is to separate the subsequences into disjoint groups such that each group meets one specific clustering criteria;thus,the cluster containing the query object is the similarity search result.The experimental results show that the proposed approach is both effective and efficient for data supply chain retrieval.