Interatomic potential

From testwiki
Jump to navigation Jump to search

Template:Short description

Typical shape of an interatomic pair potential.

Interatomic potentials are mathematical functions to calculate the potential energy of a system of atoms with given positions in space.[1][2][3][4] Interatomic potentials are widely used as the physical basis of molecular mechanics and molecular dynamics simulations in computational chemistry, computational physics and computational materials science to explain and predict materials properties. Examples of quantitative properties and qualitative phenomena that are explored with interatomic potentials include lattice parameters, surface energies, interfacial energies, adsorption, cohesion, thermal expansion, and elastic and plastic material behavior, as well as chemical reactions.[5][6][7][8][9][10][11]

Functional form

Interatomic potentials can be written as a series expansion of functional terms that depend on the position of one, two, three, etc. atoms at a time. Then the total potential of the system V can be written as [3]

V=i=1NV1(ri)+i,j=1NV2(ri,rj)+i,j,k=1NV3(ri,rj,rk)+

Here V1 is the one-body term, V2 the two-body term, V3 the three body term, N the number of atoms in the system, ri the position of atom i, etc. i, j and k are indices that loop over atom positions.

Note that in case the pair potential is given per atom pair, in the two-body term the potential should be multiplied by 1/2 as otherwise each bond is counted twice, and similarly the three-body term by 1/6.[3] Alternatively, the summation of the pair term can be restricted to cases i<j and similarly for the three-body term i<j<k, if the potential form is such that it is symmetric with respect to exchange of the j and k indices (this may not be the case for potentials for multielemental systems).

The one-body term is only meaningful if the atoms are in an external field (e.g. an electric field). In the absence of external fields, the potential V should not depend on the absolute position of atoms, but only on the relative positions. This means that the functional form can be rewritten as a function of interatomic distances rij=|rirj| and angles between the bonds (vectors to neighbours) θijk. Then, in the absence of external forces, the general form becomes

VTOT=i,jNV2(rij)+i,j,kNV3(rij,rik,θijk)+

In the three-body term V3 the interatomic distance rjk is not needed since the three terms rij,rik,θijk are sufficient to give the relative positions of three atoms i,j,k in three-dimensional space. Any terms of order higher than 2 are also called many-body potentials. In some interatomic potentials the many-body interactions are embedded into the terms of a pair potential (see discussion on EAM-like and bond order potentials below).

In principle the sums in the expressions run over all N atoms. However, if the range of the interatomic potential is finite, i.e. the potentials V(r)0 above some cutoff distance rcut, the summing can be restricted to atoms within the cutoff distance of each other. By also using a cellular method for finding the neighbours,[1] the MD algorithm can be an O(N) algorithm. Potentials with an infinite range can be summed up efficiently by Ewald summation and its further developments.

Force calculation

The forces acting between atoms can be obtained by differentiation of the total energy with respect to atom positions. That is, to get the force on atom i one should take the three-dimensional derivative (gradient) of the potential Vtot with respect to the position of atom i:

Fi=riVTOT

For two-body potentials this gradient reduces, thanks to the symmetry with respect to ij in the potential form, to straightforward differentiation with respect to the interatomic distances rij. However, for many-body potentials (three-body, four-body, etc.) the differentiation becomes considerably more complex [12] [13] since the potential may not be any longer symmetric with respect to ij exchange. In other words, also the energy of atoms k that are not direct neighbours of i can depend on the position ri because of angular and other many-body terms, and hence contribute to the gradient rk.

Classes of interatomic potentials

Interatomic potentials come in many different varieties, with different physical motivations. Even for single well-known elements such as silicon, a wide variety of potentials quite different in functional form and motivation have been developed.[14] The true interatomic interactions are quantum mechanical in nature, and there is no known way in which the true interactions described by the Schrödinger equation or Dirac equation for all electrons and nuclei could be cast into an analytical functional form. Hence all analytical interatomic potentials are by necessity approximations.

Over time interatomic potentials have largely grown more complex and more accurate, although this is not strictly true.[15] This has included both increased descriptions of physics, as well as added parameters. Until recently, all interatomic potentials could be described as "parametric", having been developed and optimized with a fixed number of (physical) terms and parameters. New research focuses instead on non-parametric potentials which can be systematically improvable by using complex local atomic neighbor descriptors and separate mappings to predict system properties, such that the total number of terms and parameters are flexible.[16] These non-parametric models can be significantly more accurate, but since they are not tied to physical forms and parameters, there are many potential issues surrounding extrapolation and uncertainties.

Parametric potentials

The arguably simplest widely used interatomic interaction model is the Lennard-Jones potential [17][18][19]

VLJ(r)=4ε[(σr)12(σr)6]

where ε is the depth of the potential well and σ is the distance at which the potential crosses zero. The attractive term proportional to 1/r6 in the potential comes from the scaling of van der Waals forces, while the 1/r12 repulsive term is much more approximate (conveniently the square of the attractive term).[6] On its own, this potential is quantitatively accurate only for noble gases and has been extensively studied in the past decades,[20] but is also widely used for qualitative studies and in systems where dipole interactions are significant, particularly in chemistry force fields to describe intermolecular interactions - especially in fluids.[21]

Another simple and widely used pair potential is the Morse potential, which consists simply of a sum of two exponentials.

VM(r)=De(e2a(rre)2ea(rre))

Here De is the equilibrium bond energy and re the bond distance. The Morse potential has been applied to studies of molecular vibrations and solids, [22] and also inspired the functional form of more accurate potentials such as the bond-order potentials.

Ionic materials are often described by a sum of a short-range repulsive term, such as the Buckingham pair potential, and a long-range Coulomb potential giving the ionic interactions between the ions forming the material. The short-range term for ionic materials can also be of many-body character .[23]

Pair potentials have some inherent limitations, such as the inability to describe all 3 elastic constants of cubic metals or correctly describe both cohesive energy and vacancy formation energy.[7] Therefore, quantitative molecular dynamics simulations are carried out with various of many-body potentials.

Repulsive potentials

For very short interatomic separations, important in radiation material science, the interactions can be described quite accurately with screened Coulomb potentials which have the general form

V(rij)=14πε0Z1Z2e2rijφ(r/a)

Here, φ(r)1 when r0. Z1 and Z2 are the charges of the interacting nuclei, and a is the so-called screening parameter. A widely used popular screening function is the "Universal ZBL" one.[24] and more accurate ones can be obtained from all-electron quantum chemistry calculations [25] In binary collision approximation simulations this kind of potential can be used to describe the nuclear stopping power.

Many-body potentials

The Stillinger-Weber potential[26] is a potential that has a two-body and three-body terms of the standard form

VTOT=i,jNV2(rij)+i,j,kNV3(rij,rik,θijk)

where the three-body term describes how the potential energy changes with bond bending. It was originally developed for pure Si, but has been extended to many other elements and compounds [27] [28] and also formed the basis for other Si potentials.[29] [30]

Metals are very commonly described with what can be called "EAM-like" potentials, i.e. potentials that share the same functional form as the embedded atom model. In these potentials, the total potential energy is written

VTOT=iNFi(jρ(rij))+12i,jNV2(rij)

where Fi is a so-called embedding function (not to be confused with the force Fi) that is a function of the sum of the so-called electron density ρ(rij). V2 is a pair potential that usually is purely repulsive. In the original formulation [31][32] the electron density function ρ(rij) was obtained from true atomic electron densities, and the embedding function was motivated from density-functional theory as the energy needed to 'embed' an atom into the electron density. .[33] However, many other potentials used for metals share the same functional form but motivate the terms differently, e.g. based on tight-binding theory [34][35] [36] or other motivations [37] [38] .[39]

EAM-like potentials are usually implemented as numerical tables. A collection of tables is available at the interatomic potential repository at NIST [1]

Covalently bonded materials are often described by bond order potentials, sometimes also called Tersoff-like or Brenner-like potentials. [10] [40] [41]

These have in general a form that resembles a pair potential:

Vij(rij)=Vrepulsive(rij)+bijkVattractive(rij)

where the repulsive and attractive part are simple exponential functions similar to those in the Morse potential. However, the strength is modified by the environment of the atom i via the bijk term. If implemented without an explicit angular dependence, these potentials can be shown to be mathematically equivalent to some varieties of EAM-like potentials [42] [43] Thanks to this equivalence, the bond-order potential formalism has been implemented also for many metal-covalent mixed materials.[43][44] [45] [46]

EAM potentials have also been extended to describe covalent bonding by adding angular-dependent terms to the electron density function ρ, in what is called the modified embedded atom method (MEAM).[47][48][49]

Force fields

Template:Main A force field is the collection of parameters to describe the physical interactions between atoms or physical units (up to ~108) using a given energy expression. The term force field characterizes the collection of parameters for a given interatomic potential (energy function) and is often used within the computational chemistry community.[50] The force field parameters make the difference between good and poor models. Force fields are used for the simulation of metals, ceramics, molecules, chemistry, and biological systems, covering the entire periodic table and multiphase materials. Today's performance is among the best for solid-state materials,[51][52] molecular fluids,[21] and for biomacromolecules,[53] whereby biomacromolecules were the primary focus of force fields from the 1970s to the early 2000s. Force fields range from relatively simple and interpretable fixed-bond models (e.g. Interface force field,[50] CHARMM,[54] and COMPASS) to explicitly reactive models with many adjustable fit parameters (e.g. ReaxFF) and machine learning models.

Non-parametric potentials

It should first be noted that non-parametric potentials are often referred to as "machine learning" potentials. While the descriptor/mapping forms of non-parametric models are closely related to machine learning in general and their complex nature make machine learning fitting optimizations almost necessary, differentiation is important in that parametric models can also be optimized using machine learning.

Current research in interatomic potentials involves using systematically improvable, non-parametric mathematical forms and increasingly complex machine learning methods. The total energy is then writtenVTOT=iNE(𝐪i)where 𝐪i is a mathematical representation of the atomic environment surrounding the atom i, known as the descriptor.[55] E is a machine-learning model that provides a prediction for the energy of atom i based on the descriptor output. An accurate machine-learning potential requires both a robust descriptor and a suitable machine learning framework. The simplest descriptor is the set of interatomic distances from atom i to its neighbours, yielding a machine-learned pair potential. However, more complex many-body descriptors are needed to produce highly accurate potentials.[55] It is also possible to use a linear combination of multiple descriptors with associated machine-learning models.[56] Potentials have been constructed using a variety of machine-learning methods, descriptors, and mappings, including neural networks,[57] Gaussian process regression,[58][59] and linear regression.[60][16]

A non-parametric potential is most often trained to total energies, forces, and/or stresses obtained from quantum-level calculations, such as density functional theory, as with most modern potentials. However, the accuracy of a machine-learning potential can be converged to be comparable with the underlying quantum calculations, unlike analytical models. Hence, they are in general more accurate than traditional analytical potentials, but they are correspondingly less able to extrapolate. Further, owing to the complexity of the machine-learning model and the descriptors, they are computationally far more expensive than their analytical counterparts.

Non-parametric, machine learned potentials may also be combined with parametric, analytical potentials, for example to include known physics such as the screened Coulomb repulsion,[61] or to impose physical constraints on the predictions.[62]

Potential fitting

Since the interatomic potentials are approximations, they by necessity all involve parameters that need to be adjusted to some reference values. In simple potentials such as the Lennard-Jones and Morse ones, the parameters are interpretable and can be set to match e.g. the equilibrium bond length and bond strength of a dimer molecule or the surface energy of a solid .[63][64] Lennard-Jones potential can typically describe the lattice parameters, surface energies, and approximate mechanical properties.[65] Many-body potentials often contain tens or even hundreds of adjustable parameters with limited interpretability and no compatibility with common interatomic potentials for bonded molecules. Such parameter sets can be fit to a larger set of experimental data, or materials properties derived from less reliable data such as from density-functional theory.[66][67] For solids, a many-body potential can often describe the lattice constant of the equilibrium crystal structure, the cohesive energy, and linear elastic constants, as well as basic point defect properties of all the elements and stable compounds well, although deviations in surface energies often exceed 50%. [30][43][45][46][65][50] [68] [69] [70] Non-parametric potentials in turn contain hundreds or even thousands of independent parameters to fit. For any but the simplest model forms, sophisticated optimization and machine learning methods are necessary for useful potentials.

The aim of most potential functions and fitting is to make the potential transferable, i.e. that it can describe materials properties that are clearly different from those it was fitted to (for examples of potentials explicitly aiming for this, see e.g.[71][72][73][74][75]). Key aspects here are the correct representation of chemical bonding, validation of structures and energies, as well as interpretability of all parameters.[51] Full transferability and interpretability is reached with the Interface force field (IFF).[50] An example of partial transferability, a review of interatomic potentials of Si describes that Stillinger-Weber and Tersoff III potentials for Si can describe several (but not all) materials properties they were not fitted to.[14]

The NIST interatomic potential repository provides a collection of fitted interatomic potentials, either as fitted parameter values or numerical tables of the potential functions.[76] The OpenKIM [77] project also provides a repository of fitted potentials, along with collections of validation tests and a software framework for promoting reproducibility in molecular simulations using interatomic potentials.

Machine-learned interatomic potentials

Template:Main Since the 1990s, machine learning programs have been employed to construct interatomic potentials, mapping atomic structures to their potential energies. These are generally referred to as 'machine learning potentials' (MLPs)[78] or as 'machine-learned interatomic potentials' (MLIPs).[79] Such machine learning potentials help fill the gap between highly accurate but computationally intensive simulations like density functional theory and computationally lighter, but much less precise, empirical potentials. Early neural networks showed promise, but their inability to systematically account for interatomic energy interactions limited their applications to smaller, low-dimensional systems, keeping them largely within the confines of academia. However, with continuous advancements in artificial intelligence technology, machine learning methods have become significantly more accurate, positioning machine learning as a significant player in potential fitting.[80][81][82]

Modern neural networks have revolutionized the construction of highly accurate and computationally light potentials by integrating theoretical understanding of materials science into their architectures and preprocessing. Almost all are local, accounting for all interactions between an atom and its neighbor up to some cutoff radius. These neural networks usually intake atomic coordinates and output potential energies. Atomic coordinates are sometimes transformed with atom-centered symmetry functions or pair symmetry functions before being fed into neural networks. Encoding symmetry has been pivotal in enhancing machine learning potentials by drastically constraining the neural networks' search space.[80][83]

Conversely, message-passing neural networks (MPNNs), a form of graph neural networks, learn their own descriptors and symmetry encodings. They treat molecules as three-dimensional graphs and iteratively update each atom's feature vectors as information about neighboring atoms is processed through message functions and convolutions. These feature vectors are then used to directly predict the final potentials. In 2017, the first-ever MPNN model, a deep tensor neural network, was used to calculate the properties of small organic molecules. Advancements in this technology led to the development of Matlantis in 2022, which commercially applies machine learning potentials for new materials discovery.[84] Matlantis, which can simulate 72 elements, handle up to 20,000 atoms at a time, and execute calculations up to 20 million times faster than density functional theory with almost indistinguishable accuracy, showcases the power of machine learning potentials in the age of artificial intelligence.[80][85][86]

Another class of machine-learned interatomic potential is the Gaussian approximation potential (GAP),[87][88][89] which combines compact descriptors of local atomic environments[90] with Gaussian process regression[91] to machine learn the potential energy surface of a given system. To date, the GAP framework has been used to successfully develop a number of MLIPs for various systems, including for elemental systems such as Carbon[92][93] Silicon,[94] and Tungsten,[95] as well as for multicomponent systems such as Ge2Sb2Te5[96] and austenitic stainless steel, Fe7Cr2Ni.[97]

Reliability of interatomic potentials

Classical interatomic potentials often exceed the accuracy of simplified quantum mechanical methods such as density functional theory at a million times lower computational cost.[51] The use of interatomic potentials is recommended for the simulation of nanomaterials, biomacromolecules, and electrolytes from atoms up to millions of atoms at the 100 nm scale and beyond. As a limitation, electron densities and quantum processes at the local scale of hundreds of atoms are not included. When of interest, higher level quantum chemistry methods can be locally used.[98]

The robustness of a model at different conditions other than those used in the fitting process is often measured in terms of transferability of the potential.

See also

References

Template:Reflist

  1. 1.0 1.1 M. P. Allen and D. J. Tildesley. Computer Simulation of Liquids. Oxford University Press, Oxford, England, 1989.
  2. Daan Frenkel and Berend Smit. Understanding molecular simulation: from algorithms to applications. Academic Press, San Diego, second edition, 2002.
  3. 3.0 3.1 3.2 R. Lesar. Introduction to Computational Materials Science. Cambridge University Press, 2013.
  4. Template:Cite journal
  5. N. W. Ashcroft and N. D. Mermin. Solid State Physics.Saunders College, Philadelphia, 1976.
  6. 6.0 6.1 Charles Kittel. Introduction to Solid State Physics. John Wiley & Sons, New York, third edition, 1968.
  7. 7.0 7.1 Template:Cite journal
  8. Template:Cite journal
  9. Template:Cite journal
  10. 10.0 10.1 Template:Cite journal
  11. Template:Cite journal
  12. Template:Cite journal
  13. Template:Cite journal
  14. 14.0 14.1 Template:Cite journal
  15. Template:Cite journal
  16. 16.0 16.1 Template:Cite journal
  17. Template:Cite journal.
  18. Template:Cite journal
  19. Template:Cite journal
  20. Template:Cite journal
  21. 21.0 21.1 Template:Cite journal
  22. Template:Cite journal
  23. Template:Cite journal
  24. J. F. Ziegler, J. P. Biersack, and U. Littmark. The Stopping and Range of Ions in Matter. Pergamon, New York, 1985.
  25. Template:Cite journal
  26. Template:Cite journal
  27. Template:Cite journal
  28. Template:Cite journal
  29. Template:Cite journal
  30. 30.0 30.1 Template:Cite journal
  31. Template:Cite journal
  32. Template:Cite journal
  33. Template:Cite journal
  34. Template:Cite journal
  35. Template:Cite journal
  36. Template:Cite journal
  37. Template:Cite journal
  38. Template:Cite journal
  39. Template:Cite journal
  40. Template:Cite journal
  41. Template:Cite journal
  42. Template:Cite journal
  43. 43.0 43.1 43.2 Template:Cite journal
  44. Template:Cite journal
  45. 45.0 45.1 Template:Cite journal
  46. 46.0 46.1 Template:Cite journal
  47. Template:Cite journal
  48. Template:Cite journal
  49. Template:Cite journal
  50. 50.0 50.1 50.2 50.3 Template:Cite journal
  51. 51.0 51.1 51.2 Template:Cite journal
  52. Template:Cite journal
  53. Template:Cite journal
  54. Template:Cite journal
  55. 55.0 55.1 Template:Cite journal
  56. Template:Cite journal
  57. Template:Cite journal
  58. Template:Cite journal
  59. Template:Cite journal
  60. Template:Cite journal
  61. Template:Cite journal
  62. Template:Cite journal
  63. Template:Cite journal
  64. Template:Cite journal
  65. 65.0 65.1 Template:Cite journal
  66. Template:Cite journal
  67. Template:Cite journal
  68. Template:Cite journal
  69. Template:Cite journal
  70. Template:Cite journal
  71. Template:Cite journal
  72. Template:Cite journal
  73. Template:Cite journal
  74. Template:Cite journal
  75. Template:Cite journal
  76. Template:Cite web
  77. Template:Cite web
  78. Template:Cite journal
  79. Template:Cite journal
  80. 80.0 80.1 80.2 Template:Cite journal
  81. Template:Cite journal
  82. Template:Cite journal
  83. Template:Cite journal
  84. Template:Cite journal
  85. Template:Cite journal
  86. Template:Cite web
  87. Template:Cite journal
  88. Template:Cite journal
  89. Template:Cite journal
  90. Template:Cite journal
  91. Template:Cite book
  92. Template:Cite journal
  93. Template:Cite journal
  94. Template:Cite journal
  95. Template:Cite journal
  96. Template:Cite journal
  97. Template:Cite journal
  98. Template:Cite journal