Identify crystal structures by a new paradigm based on graph theory for building materials big data
基本信息来源于合作网站,原文需代理用户跳转至来源网站获取
摘要:
Material identification technique is crucial to the development of structure chemistry and materials genome project.Current methods are promising candidates to identify structures effectively,but have limited ability to deal with all structures accurately and automatically in the big materials database because different material resources and various measurement errors lead to variation of bond length and bond angle.To address this issue,we propose a new paradigm based on graph theory (GT scheme) to improve the efficiency and accuracy of material identification,which focuses on processing the "topological relationship" rather than the value of bond length and bond angle among different structures.By using this method,automatic deduplication for big materials database is achieved for the first time,which identifies 626,772 unique structures from 865,458 original structures.Moreover,the graph theory scheme has been modified to solve some advanced problems such as identifying highly distorted structures,distinguishing structures with strong similarity and classifying complex crystal structures in materials big data.