nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo searchdiv qikanlogo popupnotification paper paperNew
2025, 09, v.53 2643-2650
机器学习优化Clausius-Mossotti介电常数计算方程
基金项目(Foundation): 国家自然科学基金联合基金(U24A2052); 国家资助博士后研究人员计划C档(GZC20232825); 上海市东方英才计划(QNKJ2024026)
邮箱(Email): mamingsheng@mail.sic.ac.cn;liuzf@mail.sic.ac.cn;
DOI: 10.14062/j.issn.0454-5648.20250039
摘要:

介电常数的机器学习预测模型准确性虽然较经典Clausius-Mossotti方程更高,但模型解释性仍需提升。本工作提出了领域知识和机器学习双向嵌入的方法,在C–M方程中引入机器学习修正项以提升方程准确性。以机器学习模型所识别出的介电常数关键参量作为修正项的自变量,结合符号回归和遗传算法生成修正项的数学表达式。基于Pareto前沿选出了平衡精准度和复杂度的结果,并采用偏导数揭示了“低值补偿、高值抑制”的极化纠偏机制。对于采用Shannon极化率数据的介电常数计算,修正后的C–M方程R2提升了42.30%,RMSE下降了14.72%。本工作初步探索了“双向嵌入”方法在兼顾模型准确性和解释性方面的潜力,并尝试为材料设计提供新的思考角度。

Abstract:

Introduction The Clausius-Mossotti(C–M) equation is one of the most fundamental theoretical models to calculate the dielectric constant of materials. As was proposed over a century ago, it has been widely applied in dielectric material research due to its relatively simple mathematical form and clear physical interpretation. The equation, derived based on the Lorentz effective field approximation and the assumption of constant polarization, establishes a relationship among the macroscopic dielectric constant and the microscopic polarizability and unit cell volume. However, with the growing understanding of dielectric materials, it becomes clear that dielectric constant is affected by a range of additional factors, such as bond length, packing density and symmetry. These factors are not captured by the conventional C–M equation, indicating the limitations of the equation. Recently, machine learning(ML) methods show a promise in improving the prediction accuracy of dielectric properties. The ML models can outperform classical equations in terms of accuracy, but they often lack interpretability, which hinders their widespread use in material design and optimization. To address these issues, this study proposed a novel bidirectional embedding approach with domain knowledge and machine learning. The ML-based correction term was integrated into the classical C–M equation, enhancing its predictive accuracy, while retaining physical interpretability. This work could explore a potential of combining physical formulas with ML methods to create more robust and explainable models for material design. Methods The data of single-phase microwave dielectric ceramics were collected from published literatures and the Materials Project database. The dataset included materials whose dielectric constants were measured by the Hakki-Coleman method in a frequencies range from 5 to 18 GHz. A machine learning model identified molecular dielectric polarizability per volume(ppv), average bond length(blm) and unit cell volume(va) as the most significant contributors to the dielectric constant in a previous study. To enhance the accuracy of the C–M equation, correction terms were introduced based on the three key features. Symbolic regression, combined with genetic algorithms, was used to derive mathematical expressions for the correction terms. The symbolic regression technique utilized evolutionary algorithms to generate and evolve potential expressions. The structures and parameters of expressions were optimized to minimize prediction errors. Hyperparameters were optimized by a grid search method to identify the best-performing models. The process was carried out iteratively. A total of 16,000 candidate mathematical expressions were generated and compared. Finally, the Pareto front analysis was used to select the optimal expression that balances accuracy and complexity. Results and Discussion The revised C–M equation results in significant improvements in its predictive performance. For the Shannon polarizability dataset, the R2 value of the revised C–M equation is increased by 42.29%, and the RMSE value is decreased by 14.72%. The correction term compensates for the inaccuracies of the original equation, particularly in cases where the molecular dielectric polarizability per volume of Shannon value(ppvs) is either overestimated or underestimated. For low-polarizability materials with ppvs values less than 0.2087, the Shannon database tends to underestimate the dielectric constant, and the correction term compensates for this error. Conversely, for high-polarizability materials, the dielectric constant is often overestimated, and the correction term reduces the overestimation. The analysis of the relationship between the correction term(Δppvs) and features reveals that the correction term is most sensitive to ppvs, followed by blm, with va having the least impact. This finding indicates that the dielectric polarizability plays a dominant role in determining the accuracy of the revised C–M equation. The influence of the features on the correction term is further explored through partial derivatives. The first-order partial derivatives show that the correction term's contribution from ppvs is much greater than that from blm or va. The second-order partial derivatives reveal a non-linear relationship between the correction term and ppvs and blm, while the relationship with va is linear. The results indicate that the compensatory effect of the correction term gradually diminishes, and the suppression effect becomes more pronounced as ppvs increases. This behavior is consistent with the correction mechanism of “elevating the underrated, reducing the overrated”. Conclusions This study introduced a bidirectional embedding approach that could integrate machine learning with domain knowledge to enhance both the accuracy and interpretability of the Clausius-Mossotti equation. The revised equation achieved a higher prediction accuracy with a 42.29% increase in R2 and a 14.72% decrease in RMSE via incorporating the correction term derived from symbolic regression. The revised equation retained the physical meaning of the original one. The research could highlight a potential of bidirectional embedding approach to create models that could balance both accuracy and interpretability, offering a promising perspective on optimizing classical equations and material design.

参考文献

[1]KIRKWOOD J G.On the theory of dielectric polarization[J].1936,4(9):592-601.

[2]QIN J C,LIU Z F,MA M S,et al.Machine learning approaches for permittivity prediction and rational design of microwave dielectric ceramics[J].J Materiomics,2021,7(6):1284-1293.

[3]SHANNON R D.Dielectric polarizabilities of ions in oxides and fluorides[J].1993,73(1):348-366.

[4]SONG X Q,ZOU Z Y,LU W Z,et al.Crystal structure,lattice energy and microwave dielectric properties of melilite-type Ba1-xSrxCu2Si2O7solid solutions[J].J Alloys Compd,2020,835:155340.

[5]KIM E S,CHUN B S,FREER R,et al.Effects of packing fraction and bond valence on microwave dielectric properties of A2+B6+O4 (A2+:Ca,Pb,Ba;B6+:Mo,W) ceramics[J].J Eur Ceram Soc,2010,30(7):1731-1736.

[6]WILSON G J,CHAN R K,DAVIDSON D W,et al.Dielectric properties of ices II,III,V,and VI[J].1965,43(7):2384-2391.

[7]QIN J C,LIU Z F,MA M S,et al.Structure and microwave dielectric properties of gillespite-type ACu Si4O10 (a=Ca,Sr,Ba) ceramics and quantitative prediction of the Q×f value via machine learning[J].ACSAppl Mater Interfaces,2021,13(15):17817-17826.

[8]WU S,SONG K X,LIU P,et al.Effect of Ti O2 doping on the structure and microwave dielectric properties of cordierite ceramics[J].J Am Ceram Soc,2015,98(6):1842-1847.

[9]唐莹,相怀成,李洁,等.Ca3-xMgxYb2Ge3O12(0≤x≤3)石榴石的A位Rattling效应与微波介电性能[J].硅酸盐学报,2023,51(4):872-881.TANG Ying,XIANG Huaicheng,LI Jie,et al.J Chin Ceram Soc,2023,51(4):872-881.

[10]杜康,尹长志,杨佳庆,等.Ge4+取代对Ba Sn Si3O9陶瓷烧结行为和微波介电性能的优化[J].硅酸盐学报,2023,51(4):866-871.DU Kang,YIN Changzhi,YANG Jiaqing,et al.J Chin Ceram Soc,2023,51(4):866-871.

[11]MURDOCK R J,KAUWE S K,WANG A Y,et al.Is domain knowledge necessary for machine learning materials properties?[J].Integr Mater Manuf Innov,2020,9(3):221-227.

[12]刘悦,邹欣欣,杨正伟,等.材料领域知识嵌入的机器学习[J].硅酸盐学报,2022,50(3):863-876.LIU Yue,ZOU Xinxin,YANG Zhengwei,et al.J Chin Ceram Soc,2022,50(3):863-876.

[13]秦锦成,刘志甫,马名生,等.数据驱动范式下微波介质陶瓷材料及器件研究进展[J].硅酸盐学报,2023,51(4):921-933.QIN Jincheng,LIU Zhifu,MA Mingsheng,et al.J Chin Ceram Soc,2023,51(4):921-933.

[14]TAKAHASHI A,KUMAGAI Y,MIYAMOTO J,et al.Machine learning models for predicting the dielectric constants of oxides based on high-throughput first-principles calculations[J].Phys Rev Mater,2020,4(10):103801.

[15]MORITA K,DAVIES D W,BUTLER K T,et al.Modeling the dielectric constants of crystals using machine learning[J].J Chem Phys,2020,153(2):024503.

[16]NODA Y,OTAKE M,NAKAYAMA M.Descriptors for dielectric constants of perovskite-type oxides by materials informatics with first-principles density functional theory[J].Sci Technol Adv Mater,2020,21(1):92-99.

[17]LIN X Y,LI C J,HAO H,et al.Accelerated search for ABO3-type the electronic contribution of polycrystalline dielectric constants by machine learning[J].Comput Mater Sci,2021,193:110404.

[18]YAO T Z,CHEN L P,HU N,et al.Discovering ABO3-type perovskite with different dielectric constants via intelligent optimization algorithm[J].AIP Adv,2024,14(7):075012.

[19]LIU X B,SU C,HUANG Q X,et al.Machine learning enhanced prediction of permittivity of spinel microwave dielectric ceramics compared to traditional C-M calculation[J].Modelling Simul Mater Sci Eng,2024,32(3):035002.

[20]RUDIN C.Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead[J].Nat Mach Intell,2019,1(5):206-215.

[21]孙中体,李珍珠,程观剑,等.机器学习在材料设计方面的研究进展[J].科学通报,2019,64(32):3270-3275.SUN Zhongti,LI Zhenzhu,CHENG Guanjian,et al.Chin Sci Bull,2019,64(32):3270-3275.

[22]YUAN R H,LIU Z,BALACHANDRAN P V,et al.Accelerated discovery of large electrostrains in Ba Ti O3-based piezoelectrics using active learning[J].Adv Mater,2018,30(7):1702884.

[23]JIANG L,FU H D,ZHANG H T,et al.Physical mechanism interpretation of polycrystalline metals’yield strength via a data-driven method:A novel Hall-Petch relationship[J].Acta Mater,2022,231:117868.

[24]BUTLER K T,DAVIES D W,CARTWRIGHT H,et al.Machine learning for molecular and materials science[J].Nature,2018,559(7715):547-555.

[25]LIU Y,GUO B R,ZOU X X,et al.Machine learning assisted materials design and discovery for rechargeable batteries[J].Energy Storage Mater,2020,31:434-450.

[26]GUO W J,WANG L Y,ZHU L A,et al.Exploration of high-temperature oxidation resistance laws in ultra-high temperature boride ceramics through data-driven approaches[J].Corros Sci,2024,230:111943.

[27]LEE J A,FIGUEIREDO R B,PARK H,et al.Unveiling yield strength of metallic materials using physics-enhanced machine learning under diverse experimental conditions[J].Acta Mater,2024,275:120046.

[28]QIN J C,LIU Z F,MA M S,et al.Optimizing and extending ion dielectric polarizability database for microwave frequencies using machine learning methods[J].NPJ Comput Mater,2023,9:132.

[29]PENN S J,ALFORD N M,TEMPLETON A,et al.Effect of porosity and grain size on the microwave dielectric properties of sintered alumina[J].J Am Ceram Soc,1997,80(7):1885-1888.

[30]JAIN A,ONG S P,HAUTIER G,et al.Commentary:The Materials Project:A materials genome approach to accelerating materials innovation[J].2013,1(1):011002.

[31]ONG S P,RICHARDS W D,JAIN A,et al.Python materials genomics(pymatgen):A robust,open-source Python library for materials analysis[J].Comput Mater Sci,2013,68:314-319.

[32]KOZA J R.Genetic programming as a means for programming computers by natural selection[J].Stat Comput,1994,4(2):87-112.

[33]ANGELIS D,SOFOS F,KARAKASIDIS T E.Artificial intelligence in physical sciences:Symbolic regression trends and perspectives[J].Arch Comput Methods Eng,2023(30):3845-3865.

基本信息:

DOI:10.14062/j.issn.0454-5648.20250039

中图分类号:TP181;TB30

引用信息:

[1]秦锦成,马名生,张发强等.机器学习优化Clausius-Mossotti介电常数计算方程[J].硅酸盐学报,2025,53(09):2643-2650.DOI:10.14062/j.issn.0454-5648.20250039.

基金信息:

国家自然科学基金联合基金(U24A2052); 国家资助博士后研究人员计划C档(GZC20232825); 上海市东方英才计划(QNKJ2024026)

检 索 高级检索

引用

GB/T 7714-2015 格式引文
MLA格式引文
APA格式引文