UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels

Dral, PO; Owens, A; Yurchenko, SN; Thiel, W; (2017) Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels. The Journal of Chemical Physics , 146 (24) , Article 244108. 10.1063/1.4989536. Green open access

[thumbnail of Yurchenko_1%252E4989536.pdf]
Preview
Text
Yurchenko_1%252E4989536.pdf - Published Version

Download (3MB) | Preview

Abstract

We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level ab initio energies are required only for the points in the training set, while the energies for the remaining points are provided by the ML model with negligible computational cost. The proposed sampling procedure is shown to be superior to random sampling and also eliminates the need for training several ML models. Self-correcting machine learning has been implemented such that each additional layer corrects errors from the previous layer. The performance of our approach is demonstrated in a case study on a published high-level ab initio PES of methyl chloride with 44 819 points. The ML model is trained on sets of different sizes and then used to predict the energies for tens of thousands of nuclear configurations within seconds. The resulting datasets are utilized in variational calculations of the vibrational energy levels of CH3Cl. By using both structure-based sampling and self-correction, the size of the training set can be kept small (e.g., 10% of the points) without any significant loss of accuracy. In ab initio rovibrational spectroscopy, it is thus possible to reduce the number of computationally costly electronic structure calculations through structure-based sampling and self-correcting KRR-based machine learning by up to 90%.

Type: Article
Title: Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels
Location: United States
Open access status: An open access version is available from UCL Discovery
DOI: 10.1063/1.4989536
Publisher version: http://dx.doi.org/10.1063/1.4989536
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Physics and Astronomy
URI: https://discovery-pp.ucl.ac.uk/id/eprint/1562670
Downloads since deposit
18,788Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item