Anda di halaman 1dari 27

PHYSICAL REVIEW B VOLUME 50, NUMBER 24 15 DECEMBER 1994-II

Projector augmented-+rave method


P. E. Blochl
IBM Research Division, Zurich Research Laboratory, CH-8808 Ruschlikon, Switzerland
(Received 13 June 1994; revised manuscript received 22 August 1994)
An approach for electronic structure calculations is described that generalizes both the pseu-
dopotential method and the linear augmented-plane-wave (LAPW) method in a natural way. The
method allows high-quality first-principles molecular-dynamics calculations to be performed using
the original fictitious Lagrangian approach of Car and Parrinello. Like the LAPW method it can be
used to treat first-row and transition-metal elements with affordable effort and provides access to the
full wave function. The augmentation procedure is generalized in that partial-wave expansions are
not determined by the value and the derivative of the envelope function at some muKn-tin radius,
but rather by the overlap with localized projector functions. The pseudopotential approach based
on generalized separable pseudopotentials can be regained by a simple approximation.

I. INTRODUCTION the transferability of the pseudopotential. Vanderbilt's


ultrasoft pseudopotentials9 ~o have improved this situa-
In the past few decades, electronic structure calcula- tion significantly by relaxing the norm-conservation con-
tions have made significant contributions to our under- dition that is usually imposed on the pseudopotential ap-
standing of solid-state properties. The majority of such proach. This xnethod also allows first-rom and transition-
calculations are based on the local-density approximation metal elexnents to be dealt with in an economical way.
(LDA) of the density-functional theory. ~ 2 The density- Car and Parrinello have combined the density-
functional theory maps the ground state of an interacting functional theory with molecular-dynamics techniques.
electron gas onto the ground state of noninteracting elec- Here both the electronic structure problexn and the dy-
trons, which experience an effective potential. namics of the atoms are solved simultaneously by a set of
Numerous methods have been developed to solve the Newton's equations. In this way not only has the struc-
resulting one-particle Schrodinger equation of the LDA. ture deterxnination become a straightforward technique,
The most widely used electronic structure methods can but the fully dynaxnic time evolution of the atomic struc-
be divided into two classes: (i) the linear methodss de- ture has also become accessible.
veloped by Andersen &om the augmented-plane-wave The Car-Parrinello method was first applied in the con-
(APW) method4's and the Korringa-Kohn-Rostocker text of the plane-wave pseudopotential method. There
methods' and (ii) the pseudopotential method based on is considerable interest in applying the same technique
norm-conserving O,b initio pseudopotentials invented by to all-electron (AE) methods, which allow one to deal
Hamann, Schluter, and Chiang. A third class, primar- efficiently with first-row and transition-xnetal elements
ily employed in chemistry, uses Gaussian basis sets to and which supply inforxnation about the wave func-
expand the full wave functions. tion close to the nucleus probed by several experimen-
The linear methods can be subdivided into a vari- tal techniques, but not provided by the pseudopoten-
ety of methods ranging &om the most accurate linear tial approach. These are, among many others, hyper-
augmented-plane-wave (LAPW) method to the linear fine parameters and electric field gradients. ' Sev-
muffin-tin orbital (LMTO) method, which, in a simpli- eral features of the Car-Parrinello method have been im-
fied version, even allows some electronic structure calcu- plexnented into existing AE methods such as the com-
lations to be performed with paper and pencil. The lin- bined minimization of electronic and nuclear degrees of
ear methods deal with the full wave functions and treat &eedom. To my knowledge, however, no energy-
all elements in the Periodic Table, i.e., 8-, p-, d-, and conserving molecular-dynaxnics simulation has been per-
f-electron systems, on the same footing. forxned to date that can compare in terms of quality with
The pseudopotential method, when used in combina- simulations using the pseudopotential approach.
tion with a plane-wave basis set, on the other hand, has This article describes an approach that combines the
the advantage of formal simplicity. When applied to ei- versatility of the LAPW method with the formal sim-
f
ther first-row elements or systems with d or electrons, plicity of the traditional plane-wave pseudopotential ap-
even pseudopotentials become very "hard, " so that in proach. The method extends the augmented-wave meth-
practice either very large or complicated basis sets in- ods, such as the LAPW method, and the pseudopoten-
stead of plane waves have to be used. Similarly, treating tial method in a natural way. As an AE method it pro-
semicore states as valence states, which is often necessary vides the full wave functions that are not directly acces-
for early transition-metal elements and allmli and alkaline sible with the pseudopotential approach, and the poten-
earth metals, results in hard pseudopotentials and sects tial is determined properly &oxn the full charge densi-

0163-1829/94/50(24)/17953(27)/$06. 00 50 17 953 Q~1994 The American Physical Society


17 954 P. E. BLOCHL

ties. It will be demonstrated that the accuracy of the cillations, which make a numerical treatment cumber-
method described here compares well with the most ac- some. Therefore, we transform the wave functions of this
curate existing electronic structure methods based on Hilbert space into a new, so-called pseudo (PS) Hilbert
the local-density approximation. The quality of first- space. Mapping the physical valence wave functions onto
principles molecular dynamics obtained with the present the fictitious PS wave functions thus de6ned shall be a
AE approach is in line with that of state-of-the-art Car- linear transformation and it shall transform the physi-
Parrinello calculations. Hence the first energy-conserving cally relevant AE wave functions onto computationally
molecular-dynamics calculations based on the full wave convenient PS wave functions. The PS wave functions
functions were made possible. Finally, it can be imple- will be identified with the envelope functions of the jin-
mented with relatively minor efFort into existing pseu- ear methods or the wave functions of the pseudopoten-
dopotential codes. tial approach. An AE wave function is a full one-electron
The method has many similarities with both the ex- Kohn-Sham wave function and is not to be confused with
isting linear methods and the pseudopotential approach. a many-electron wave function. All quantities related to
We can therefore expect that this method will close the the PS representation of the wave functions will hence-
gap between the two. The LAPW method is a special forth be indicated by a tilde.
case of the present method, and the pseudopotential for- This transformation changes the representation of the
malisrn is obtained by a well-defined approximation. wave functions in a way reminiscent of the change from a.
This article is organized as follows. Section II estab- Schrodinger to a Heisenberg picture. Knowing the trans-
lishes the principles of the method. Section III describes formation 7 from the PS wave function to the AE wave
which approximations are required in real calculations. functions, we can obtain physical quantities, represented
Section IV derives the expressions for the Hamilton op- as the expectation value (A) of some operator A, frorii
erator and forces. Section V describes the implemen- the PS wave functions ~4) either directly as (@~A~4) af-
tation in a first-principles molecular-dynamics scheme. ter transformation to the true AE wave functions ~@) =
Section VI describes the basic ingredients used in the 7 ~4) or as the expectation value (A) = (4~A~4) of a PS
method, such as partial waves and projector functions. operator A = 7 tA7 in the Hilbert space of the PS wave
Section VII contains a detailed analysis of the errors in- functions. Similarly we can evaluate the total energy
troduced in Section III. Section VIII is devoted to nu- directly as a functional of the PS wave functions. The
merical test calculations. Section IX shows the relation ground-state PS wave functions can be obtained from
between the new method and existing approaches.
~&71+)j, ~t~(@)
II. FORMALISM
Next, we choose a particular transformation. Since we
A. Projector augmented-wave functions
will exploit the characteristics of particular atom types,
we consider only transformations that differ &om identity
Wave functions of real materials have very different by a sum of local, atom-centered contributions 7R such
signatures in difFerent regions of space: in the bonding that
region the wave function is fairly smooth, whereas close
to the nucleus the wave function oscillates rapidly owing 7 =&+). 4.
to the large attractive potential of the nucleus. This is the R
source of the diKculty of electronic structure methods to
describe the bonding region to a high degree of accuracy Each local contribution 7~ acts only within some aug-
while accounting for the large variations in the atom cen- mentation region OR enclosing the atom. This implies
ter. The strategy of the augmented-wave methods has that AE and PS wave functions coincide outside the aug-
been to divide the wave function into parts, namely, a mentation regions. The equivalent of the augmentation
partial-wave expansion within an atom-centered sphere region in the linear methods is the mufBn-tin or atomic
and envelope functions outside the spheres. The enve- sphere. In the pseudopotential method the augmentation
lope function is expanded into plane waves or some other region corresponds to the so-called core region.
convenient basis set. Envelope function and partial-wave The local terms 7R are defined for each augmenta-
expansions are then matched with value and derivative tion region individually by specifying the target func-
at the sphere radius. tions ~P;) of the transformation 7 for set of initial func-
Even though the present method has been inspired by tions ~P;) that is orthogonal to the core states and other-
the existing augmented-wave methods, I approach the wise complete in the augmentation region, namely, by
problem in a somewhat difFerent way. The relation of ~P;) = (1 + 7~)~P, ) within O~. I call the initial states
my approach to the commonly used one described above ~P;) PS partial waves and the corresponding target func-
will be described in Sec. IXB. Concerning the following tions ~P;) AE partial waves. A natural choice for these
derivation it is emphasized that the present method is, in functions for the AE partial waves are solutions of the
a certain sense, the most general augmentation scheme. radial Schrodinger equation for the isolated atom, which
Let us consider the Hilbert space of all wave functions are orthogonalized to the core states if necessary. Hence
orthogonal to the core states. The physically relevant the index i refers to the atomic site R, the angular mo-
wave functions in this Hilbert space exhibit strong os- mentum quantum numbers I = (I., m), and an additional
50 PROJECTOR AUGMENTED-WAVE METHOD 17 955

index n to label diferent partial waves for the same site between the valence wave functions and fictitious PS
and angular mornentuxn. For each such AE partial wave wave functions has been established. Using this trans-
let us choose a PS partial wave denoted by 1$;). The PS formation, the AE wave function can be obtained from
partial waves must be identical to the corresponding AE the PS wave function by
partial waves outside the augmentation region and should
themselves form a complete set of functions within the
augmentation region. The rexnaining degree of &eedom
in the choice of the PS partial waves will be exploited
to map the physically relevant AE wave functions onto The three quantities that determine this transformation
computationally convenient PS wave functions. In our are (i) the AE partial waves 1$;) obtained by radially
case these are smooth functions. integrating the Schrodinger equation of the atomic en-
This forrnal definition must be turned into a closed ergy for a set of energies e~ and orthogonalization to the
expression for the transformation operator. We make use core states; (ii) one PS partial wave 1$;), which coincides
of the fact that, within the augmentation region, every with the corresponding AE partial wave outside some
PS wave function can be expanded into PS partial waves: augmentation region for each AE partial wave; and (iii)
one projector function Ip;) for each PS partial wave lo-
I@) = ) 1$;)c; within OR. calized within the augmentation region and which obeys
the relation (p;1$~) = b;z.
The partial waves are functions on a radial grid, mul-
Since 1$, ) = 7 1$;), the corresponding AE wave function tiplied with spherical harxnonics. In our case the PS
is of the form wave functions are expanded into plane waves, but other
choices are equally possible. The projectors are also cal-
14) = 714') = ) Ip;)c; within OR, (4) culated as a radial function times spherical harxnonics,
but are then transformed into the same representation
as the PS wave functions, which, in our case, is a plane-
with identical coefficients c, in both expansions. Hence wave representation. Since the projectors are tied to the
we can express the AE wave function as
atomic positions and since their shape is independent of
the potential, their Fourier components are expressed as
a product of a form factor and a structure factor.
The core states 14") are decomposed in a way simi-
where the expansion coefficients for the partial wave ex- lar to the valence wave functions. They are decomposed
pansions rexnain to be determined. into three contributions: a PS core wave function I@'),
Since we require the transformation 7 to be linear, which is identical to the true core state outside the aug-
the coefficients must be linear functionals of the PS wave mentation region and a smooth continuation inside; an
functions. Hence the coefficients are scalar products "AE core partial wave" IgP), which is identical to the AE
core state 14") and is expressed as a radial function times
(6) spherical harmonics; and finally a "PS core partial wave"
1$'), which is identical to the PS core state 14'), but rep-
of the PS wave function with some fixed functions (p;I, resented as a radial function times spherical harmonics.
which I will call projector functions. There is exactly one The core state is therefore expressed as
projector function for each PS partial wave.
The projector functions must fulfill the condition (IO)
P,. 1$,)(p;I = 1 within OR, so that the one-center ex- In contrast to the valence states, no projector functions
pansion P,. 1$;)(plilI) of a PS wave function is identical need be defined for the core states, and the "coefficients"
to the PS wave function 14') itself. This implies that of the one-center terms are always unity. Furthermore,
consistent with the &ozen-core approximation, the core
states are imported &om an isolated atom. In prac-
tice, a soft core scheme with core states that adjust
The projector functions are localized in the augmentation to the instantaneous potential is also conceivable (see
region, even though more extended projector functions Sec. VII D 2), but has not been implemented. In the fol-
could in principle also be chosen. The most general form lowing, the core states are implicitly included when sum-
for the projector functions is (p;I = g.
(((fi, lg~))),. (f&I, ming over energy states. Note that the corresponding
where the fz) form an arbitrary, linearly independent set
I
coefficients are not defined via the scalar product with a
of functions. The projector functions are localized if the PS wave function, even though, for the sake of simplicity,
functions f;) are localized. The reader interested at this
I
I will still use the symbol for all states.
point in a practical procedure to determine partial waves It should be noted that the frozen-core approximation
and projector functions might wish to jump to Sec. VI. allows certain nontrivial changes of the core wave func-
In summary, a linear transformation tion during the self-consistency or molecular-dynamics
simulation. The frozen-core approximation only restricts
the variational degree of freedom to a simple unitary
transformation among the core states (and occupied va-
17 956 P. E. BLOCHL 50

lence states). It does allow mixing among the core states The general form of an operator is strongly reminiscent
due to changing potential. Therefore, to test the accu- of generalized separable pseudopotentials. The PS op-
racy of the frozen-core approximation one should never erator contains three parts: The first part is an operator
compare the core states of the isolated atom on a one-to- that directly acts on the PS wave function and is evalu-
one basis with those obtained &om a relaxed-core calcu- ated either in real or reciprocal space. The remaining two
lation in a crystal or molecule. parts contain the projectors and the expectation value of
At this point I will not discuss the components of the the operator either between the AE or the PS partial
projector augmented-wave (PAW) method further. They waves, which can be easily obtained on radial grids using
are described in Sec. VI of this article. I will, however, spherical harmonics and Clebsch-Gordan coefBcients. If
continue to impose the condition that the AE and PS the partial waves are unbound, the individual terms A'
partial waves form complete sets of functions within the and A are not defined. However, since the PS and AE
augmentation regions. In practical calculations the num- partial waves are identical outside the augmentation re-
ber of partial waves and projectors needs to be truncated. gion, these tails cancel exactly for each pair of partial
The way to truncate the series and the errors involved are waves. In practice, this problem is solved by truncat-
also described in detail in later sections of the paper. ing the AE and PS partial waves somewhere outside the
Here and in the following I will make extensive use of augmentation region in a completely identical way.
Dirac's bra and ket notation. A wave function in real There is an additional freedom to add a term of the
space is written as (rl@) = 4(r); its complex conjugate form
function is (@lr& = 4'*(r). The Fourier components of
the wave function are (Gl@) = 4(G) with a similar def-
inition of its complex conjugate. A plane wave is of the
form (rlG) = exp(iGr). I have adopted the convention
for the Fourier transform that the forward transform of to the right-hand side of Eq. (11), where B is an ar-
a function f is (7'If) = g&(rlG)(GI f) and the backward bitrary operator that is localized within the augmen-
transform is of the form (Glf) = 1/V fv dr(GIr&&rI f&, tation regions. It is easily shown that the expectation
where V is the volume of the unit cell. value of this term is zero for any PS wave function,
since I@& = g, . IP, &(p;I@& within the augmentation re-
gion. This freedom can be exploited when the operator 2
B. Operators cannot easily be evaluated in a plane-wave expansion. An
Since in the PAW method the PS wave functions in- example is the Coulomb potential of the nucleus, which
stead of the AE functions play the role of the variational is problematic due to its singularity at the nuclear site.
parameters, we need to be able to obtain observable In this case we may construct a new potential that is
quantities as the expectation values of the PS wave func- identical to the true potential outside the augmentation
tions. As the representation of the wave functions has region and a smooth continuation inside. The difFerence
beenf„(@„IAIDO'„),
changed, we also need to transform our operators between the two potentials is localized within the aug-
into new, so-called PS operators. mentation region and therefore can act as the operator
Consider soine operator A: Its expectation value (A) = B in Eq. (13). By adding a term of the kind of Eq. (13)
P„ where n is the band index and f„ is the to the PS electrostatic potential obtained from Eq. (11),
occupation of the state, can be obtained alternatively as we can cancel the Coulomb singularity in the plane-wave
(A) = g„ f„&@„IAIDO'„). For quasilocal operators, such part and obtain an expression that is less sensitive to a
as the kinetic-energy operator —V'2/2 and the real-space truncation of number of plane waves.
projection operator Ir)(rl, which are needed to evaluate We now obtain the charge density following the de-
total energy and charge density, the PS operator has the scription given in Eq. (11). The charge density at a
form point r in space is the expectation value of the real-space
projection operator Ir&(rl. Hence the charge density is
given by

where
To arrive at Eq. (11) I expanded 7 using Eq. (8) and
deleted terms that cancel because P,IP, ) (p, = 1 within
I

the augmentation region OR and IP;& = IP;) outside the


augmentation region. Note that only on-site terms con-
tribute.
For truly nonlocal operators we need to add a term
b, A to the expression Eq. (11) given as

n(i j)
50 PROJECTOR AUGMENTED-WAVE METHOD 17 957

Note also that n contains the contribution of the core In addition, let us introduce the so-called compensa-
states P„(P'„lr)(rig'„) and that n~ as well as n contain tion charge density n. After adding an appropriate com-
the contribution of the PS core states P„(P'„lr)(rig'„) pensation charge density to the PS charge density and
its one-center expansion, the difference of the AE and
and g (@' lr)(rl@'), respectively.
In practice, we do not construct a PS c;ore state for the PS one-center contributions (n + n ) —(n + n)
each core state individually unless we ar~: interested in to the charge density has vanishing electrostatic multi-
the physics related directly to the core states. Instead, pole moments and hence no longer interacts with charges
we construct only a PS core density. outside the augmentation region: This energy has been
transferred to E. Here I made use of the fact that a
localized charge distribution produces a potential that,
C. Total energy outside the region of localization, depends only on the
electrostatic multipole moments, but not on the shape of
the charge distribution.
Similar to the expectation values, the expression for
the total energy functional
The identity E = E+ E —E for a complete set of
partial waves can be seen as follows. (Those not inter-
E= ) f„(C„I— v'IC„) -',
ested to follow through this detail may proceed to the
next paragraph. ) One divides space into augmentation

+— dr dr', + drne„, n
regions and an interstitial region. Now we use the identi-
ties n = n and n = n inside the augmentation regions
and the identities n = n and n = n in the interstitial
region. One can convince oneself easily that the decom-
position is true for the kinetic energy (see Sec. IIB), for
can also be divided as E = E+ E — into a smooth E, the exchange and correlation energy, and for the term
part E, which is evaluated on regular grids in Fourier or proportional to 8. The decomposition for the electro-
real space, and two one-center contributions E and E, static energy is more complex to show: Let us add a
which are evaluated on radial grids in an angular mo- charge density n~ + n+ —n~ —n to n + n in Eq. (19)
mentum representation. Let us denote the point charge and to n~ + n in Eq. (21). The effect of this addition
density of the nucleus by n+ and the energy per electron vanishes: First, the term quadratic in n + n —n —n
&om exchange and correlation as e„,. Here and in the cancels exactly because E and E are added with oppo-
following I use hartree atomic units (5 = e = m, = 1). site sign. Second, the terms linear in n + n —n —n
The three contributions to E are are proportional to n —n, which is zero within the aug-
mentation regions, and to the electrostatic potential of
E= ) f„(C„I— v'I4„) -', n + n —n —n, which is zero in the interstitial re-
gion, because the density itself is localized within the
„,(n+ rn)—(n+
r'
n) augmentation region and has zero electrostatic multipole
moments. Once this term has been added, the electro-
static contributions of the last two terms Eqs. (20) and
+ drnexc n ) (21) are identical and cancel, while the first term is the
true electrostatic interaction of the full charge density
E' = ):j) f
n(i
(@-lp')(&'I ——,
'&'I& )(p I@-) ~ + ~ = ~ + ~ —~ + ~ . This special form of the
total energy has been chosen in order to obtain a strict
separation into partial-wave and plane-wave expansions
+-1 dr dr ,
(n'+ n~)(n'+ n~) and to achieve rapid convergence for both expansions.
r —r' The compensation charge density n = P& nR with

+ dr A Exc A (2o) = )

1,
nR(r) gRL(r)QRL (22)
L
E' = )
n(i j)
—-', &'l&, )%1@„)
f„(@„I&;)(&;I is expressed as a sum of generalized Gaussians

+-2 (n~ + n) (n~ + n) = +&Ir —Bl YL, (r —B)e (I" Il"')

+
dr

dr% Exc
dr'
Ir —r'I
+ drA v

(21)
gR&(r)

with the normalization constant


its multipole moment j
|
g determined such that
dr r YL, (r)gr, (r) is unity. YL, is a
spherical harmonic function or its real counterpart. The
The potential 8 is an arbitrary potential localized in decay length r is sufFiciently small so that the compensa-
the augmentation regions. Its contribution to the total tion charge density is localized within the augmentation
energy vanishes exactly because n = n within the aug- regions. The value of r depends on the particular atom
mentation region. Since the potential 8 contributes only type; B stands for a particular nuclear site and L = (E, m)
if the partial wave expansion is not complete, it is used represents the angular momenta in the spherical harmon-
to minimize truncation errors. ics expansion. The multipole moments Q~L, are given by
17 958 P. E. BLOCHL

Finally, we need to evaluate the energy of exchange and


QRL
f «~l~ —&~l'I~4(~) + ~~(~) —~R(~)]Q(~ —&j .
correlation for a one-center expansion. We adopt a proce-
dure &om previous full-potential LMTO calculations
(24) and expand the corresponding energy density in the de-
Since the Gaussians are required to decay within the viation of the one-center charge density &om its spherical
augmentation regions, they often have high Fourier com- part nI1 &
ponents. This would require a large plane-wave cutoff
in the PS charge density. The problem is solved by
a well-known trick already used in the pseudopotential
.
1 =
d" "R& (~R) 1
d™ee, 1
e=o& (~z, e=o)

approach: We introduce a second, primed compensa-


tion charge density n, which has the same multipole mo-
ments as n, but uses generalized Gaussians ejIel (r) with
+-1 )
I.
, l +0
~&x~ (~ee, e=o)
AQ
(~R, e, )

a larger decay constant r,' than the unprimed compensa- +o((~ee, L, )') (29)
tion charge density. It may extend over several atomic
where p, (n) = d[ne„, (n)]/dn. The angular momentum
„,

„,1,
sites, but should not contribute higher Fourier compo-
nents than the PS charge density itself does. components of the one-center charge density are denoted
Now we rewrite the electrostatic energy in E: by n& L. In practice a Taylor expansion up to the second
1 (n+ n)(n+ n) order has show'n to be sufficiently accurate. 2 The one-
2 /r —r'f
center contribution of the PS charge density is treated
identically.
(6+n')(6+
—r'[
+'),
2 [r
III. FROM AN EXACT FORMALISM
+ dr6r 6r + UR~e. TO A PRACTICAL SCHEME
R, R'

The first term in the new expression (25) involves only Up to this point the PAW method is an exact irn-
smooth functions and can be evaluated in Fourier space plementation of the density-functional theory within the
as &ozen-core approximation. However, we have required
certain completeness conditions for the plane-wave basis
2 ~~- l~(&) + ~'(G) I'
Q'2 (26)
set for the PS wave functions and the AE and PS partial
waves. In order to arrive at a practical scheme, let me
now introduce two approximations.
The second part of Eq. (25) introduces a potential (i) Plane waves are included only up to a given plane-

j, n(r') —n'(r')
/r —r'/
which has high Fourier components just as the original
(27)
wave cutoff' Epw defined as the maximum of G2/2.
(ii) The number of AE partial waves, PS partial waves,
and projectors is finite. However, the truncation of AE
and PS partial waves and projector functions are done in
exactly the same way. That is, for each AE partial wave
compensation charge density does. However, they do not there is a corresponding PS partial wave and its projector
contribute to the total energy because they are multi- function.
plied with the high Fourier components of the PS charge Both approximations can be controlled in a straight-
density, which are exactly zero if a plane-wave cutoff is forward way, by increasing either the plane-wave cutoff
imposed. Hence this term can also be exactly evaluated and/or the number of partial waves. The convergence for
in Fourier space. The spacial extent of this potential in both is rapid if a suitable set of partial waves and pro-
real space is identical to that of the smooth compensation jectors has been selected. Typically good convergence
charge density nR. is obtained for plane-wave cutoffs of 30 Ry and one or
The last term in Eq. (25) is a short-ranged pair po- two partial waves per site and angular momentum, with
tential between the atoms a maximum angular momentum of typically E = I or
8 = 2. The partial-wave truncation will be discussed in
detail in Sec. VII.
The two approximations define a new total energy
(28) functional, and we have to establish that this new func-
tional is suKciently close to the correct functional for
which can be evaluated analytically. The range of the physically relevant states. Once this new functional
this pair potential is twice that of the smooth compen- is defined, no further approximations are allowed be-
sation charge densities n'. It depends explicitly on the cause they would destroy the energy conservation in a
charge distributions via the multipole moments Qu'il. . molecular-dynamics simulation. Energy conservation is
Note that the potential 6 and the pair potential U~ ~ the most important test of the quality of any molecular-
contain nonspherical terms and adjust to the actual dynamics simulation. Many previous electronic struc-
charge density. ture methods have concentrated on providing a satisfac-
50 PROJECTOR AUGMENTED-WAVE METHOD 17 959

tory description of the potential. For molecular-dynamics p = P„ I@„)f„(4' I, where f„denotes the occupations
simulations the primary quantity is the total energy func- and I%'„) the orthogonal eigenfunctions of the density
tional because sxnall inconsistencies between forces and operator. This can be explained as follows: Since the ex-
total energy can create substantial difficulties in a simula- pectation value of any one-particle operator A is the trace
tion. An accurate description of the potential follows, of (A) = Tr[pA] of the product between the one-particle op-
course, from an accurate description of the total energy erator and the density operator, the first derivative of the
functional. total energy with respect to the density operator can be
There are two further approximations that are not nec- written as
essary, but are employed to accelerate the calculations:
One can introduce a plane-wave cutofF in the representa- BT [ ,'&-'p-] BE BT [lr)(rip]
+ dr
tion of the PS charge density and an angular momentum I(9P BP Bn(r) Bp
truncation in the one-center PS and AE densities. With- 1 V' 2
out these cutoffs, the PS charge density has plane-wave = —— +v,
2
components corresponding to four times the plane-wave
cutoff for the wave function and the one-center expan-
where the potential is v(r) = Ir) & ~„l(rl, which is the
sions have angular momentum components of up to twice
the maxixnum angular xnomentum of the partial waves. well-known form of the Hamilton operator.
However, a n»mber of these terms contribute little to the As the variational parameters of the PAW method are
total energy, so that these approximations are convenient the PS wave functions, we construct a PS Hamilton op-
ways to save computation time. One can truncate the erator H defined as the derivative of the total energy,
angular momentum expansion safely at 8 = 2, and the given by Eqs. (19)-(21), with respect to the PS density
plane-wave cutofF for the density can be chosen in many operator p = P,
. I@„)f„(@„lwith wave functions that
cases to be only twice the value of the wave function. obey the orthogonality condition (4'„IOI4' ) = In h„.
this section we shall derive the explicit expressions for
the PS Hamilton operator that will be needed to set up
IV. FORCES, HAMILTON OPERATORS, AND the Kohn-Sham equations or the equations of motion for
OVERLAP MATRICES first-principles molecular dynamics.
Let us treat the potential energy as a functional of four
In order to find the ground state of the density func- arguments n, n, n,and the multipole moments QRL, .
tional or to propagate wave functions and atoms in a The multipole xnoments, which determine the compen-
sation charge densities, are themselves unique functions
molecular-dynaxnics simulation, one needs to calculate
the gradients of the total energy functional with respect —
of the one-center densities. This choice which is not an
to all the variational paraxneters, namely, the PS wave approximation —
will simplify the bookkeeping in the fol-
functions and the atomic positions. In the following lowing derivation. The derivative with respect to the PS
subsections I shall derive explicit expressions for forces, density operator is then obtained as
Hamilton operators, and overlap matrices.
BTr[pT] BE Bn
A. Overlap operator
The overlap matrix in the AE representation is given + dr
BE
+
. BE BQRL, i Bn
Bn~ BQRL, Bn~ J Bp
simply by the matrix elements of the unity operator. R, L
Consequently plane waves form an orthogonal basis set
+
BE . BE BQRL, i Bn
~ ~+ L
in the AE representation.
dr
The PS version of the unity
operator obtained via Eq. (11), however, is a nonlocal R,
operator of the form where

(3o)

Hence, in the PS representation, plane waves are no


longer orthogonal, that is, (GIOIG') g 8G. ~,if the PS
overlap operator 0 differs from unity. This is a direct
consequence of relaxing the condition of norm conser- is the PS version of the kinetic-energy operator T
vation: The PS overlap operator obviously reduces to —V /2. Note that the three densities n,
the unity operator if the norm-conservation condition
and n n,
are linear functions of the PS density operator and their
(P;IP~) = (P;IP~) is imposed. derivatives with respect to the PS density operator are
obtained easily &om Eqs. (15)—(17).
B. Hamilton operator The individual terms in Eq. (32) are evaluated as fol-
lows.
The Hamilton operator is the first derivative of the to- (i) The derivative with respect to the PS charge density
tal energy functional with respect to the density operator is obtained &om Eqs. (19) and (25) as
50 PROJECTOR AUGMENTED-WAVE METHOD 17 961

arbitrarily far from the electronic ground state. The rea- a unitary matrix can be written in the 6rst order in B
son for using this expression is not that this force theorem as 1 + B. Since the total energy is invariant with re-
is particularly insensitive to deviations from the Born- spect to a unitary transformation between the occupied
Oppenheimer surface, but rather that only the direct par- wave functions, the forces are invariant with respect to
tial derivative avoids the double counting of forces acting the antisymmetric part of A, given that A is block diag-
on the wave functions and those acting on the atoms. onal separating occupied and unoccupied states. Hence
To calculate the derivative with respect to the atomic we obtain
positions, one 6rst has to calculate the forces on the
nucleus and second to include the change of the AE
wave functions for fixed PS wave functions, but changing
&~l@-) = —— ) .I@-)I:(4'-1&~01@-)+ B-] .
atomic positions. This term appears because the aug-
mentation depends on the atomic position. The force Using Eqs. (46) and (47) and
on the nucleus is obtained as Ig = —gl~@l. It is
the product of the electric field at the nucleus and the
nuclear charge. Its value is derived from an infinitesimal
BE[4, R]
BR
I4-)
)
= -f„(4„IvRHI~„), (47)
displacement of the nuclear charge density n . The forces
resulting &om an infinitesimal change in the wave func- we can write the total force including the force on the
tions due to the atomic displacement can be written as nucleus and the Pulay force as
FR ——g1~4, l + g ~@l and are called Pulay forces. To
) f„(~„Iv~aI~„)
I

use the language of the force theorem, the Pulay forces rR = —


describe forces on the electrons that are dragged along

+) "+
with the nucleus due to the position-dependent basis set.
With the PAW method we must consider Pulay forces (e IV'nOle„)(e„lull )
&om the &ozen-core electrons, which shift rigidly with
the nucleus, and the contributions &om the augmenta-
tion.
)-f— B „(4„IHI4 ) . (48)
When calculating the Pulay forces, we must also con-
sider the change in the overlap between the wave func- The last term in Eq. (48) describes the effect of elec-
tions. An in6nitesixnal change of the atomic positions tronic excitations resulting &om a unitary transformation
must be accompanied by a change in the wave func- between occupied and unoccupied states and cannot be
tions that restores the orthogonality. The new occu- specified further. However, its contribution to the force
pied wave functions must span the same portion of the vanishes if the Hamilton matrix (O'„IHI@ ) commutes
Hilbert space that was occupied before displacement of with the occupations (for example, if it is diagonal). This
the atoms. Hence the new wave function can be expressed is the case if the wave functions are obtained by diagonal-
as a linear combination ization of a Hamiltonian. In a Lagrangian formalism this
(R+ dR)) = 1@„(R))+ ) I@ (R))A „dR (43) term must be chosen in a well-de6ned way, as described
in a later section.

of the PS wave functions with undistorted atomic po-


sitions. Note that these wave functions should not be 2. Forces in the PAW method
confused with the self-consistent wave functions for dis-
placed atomic positions. The new wave functions obey
The change of the AE wave functions is related to a
the orthogonality condition
displacement of the projector functions and the partial
(4„(R+dR)10(R+ dR)14 (R+ dR)) = h„(44) waves

to linear order in the displacement,


A„by
which determines &Rl~-) = — ). (I&4') —1&4'))%1~-)

A„+ At = —(4„lv'&014 ) . (45)


-) . (14,) —14;))(&p'I@-)

Here VR corresponds to a derivative of a nuclear coor-


dinate rather than to a derivative with respect to an .
--, ) l~-)(~-l&nol~-) (49)
electronic coordinate, which is denoted by V. To ar-
rive at this expression we used the orthonormality of the The first summation corresponds to a rigid displacexnent
wave function 1@„(R))before the displacement. We re- of the partial waves, the second to change of shape of the
tain the freedom of adding an arbitrary antisyxnmetric one-center expansions, and the third is the force due to
matrix to A, which is reminiscent of the invariance of the the change of the overlap. For sixnplicity, we derive the
total energy with respect to a unitary transformation of forces that result &om the three contributions indepen-
the occupied wave functions. This can be seen as fol- dently.
lows: A unitary matrix has the general form
B is an anti-Hermitean matrix (i.e. , B = Bt).
e, Hence—
where The force resulting from a rigid shift of the partial
waves is treated together with those functions that do not
17 962 P. E. BLOCHL

explicitly depend on the PS wave functions but are tied = (&'lr) (r I&')
directly to the nuclear positions, such as the rigid shift
of the nucleus nR, the potential 8R, and the Gaussians
gRI, and gRI used to expand the coxnpensation charge = (&*lr)(rl&~)
densities nR and nR .
Let us first analyze the contribution of the smooth part This force can be evaluated using the one-center expan-
E, given in Eq. (19), of the total energy sions v (r) and v (r) of the AE and the PS potential
/

FR = ~~+~R d~ + d~~ +'UR + +~R F~ ' = — ) .~~ov ((O'I


—-'7'+ "'1»)

drn V'V — V'R gUR R (5O) —(»I —-~'+ "'I»)) . (57)


R'
Finally, we consider the forces resulting Rom the
The first three terms are evaluated in G space. The
change of the orthogonality of the AE wave functions.
fourth is related to the derivative of the pair potential
The corresponding force is of the form
for fixed multipole moments Q~L, and is evaluated ana-
lytically. Note that the multipole moments do not change
under an in6nitesimal rigid shift of the partial waves be-
cause the center of the Gaussians is also shifted analo-
gously.
Second, let us consider the contribution of the rigid
shift from the one-center terms E and E . Their contri-
bution is exactly zero, because all contributing densities
and potentials are rigidly shifted, so that the change can
be reduced to a change of coordinates, which does not
a6ect the energy. Note that this term also contains the The total force is given by the sum of the three terms
force on the nucleus.
Next we consider the change of shape of the one-center FR = FR(i) +FR(2) +FR(3) (59)
densities. Here we can use quantities that have already
been calculated in the Hamiltonian. This force is propor-
tional to the gradient
V. FIRST-PRINCIPLES
v~e;, = — ) f„((e„lv&,)(A~le„) + (4'„lp MOLECULAR DYNAMICS

(51) A. Fictitious Lagrangian

of 0;~, which is de6ned as The 6rst-principles molecular dynamics is imple-


mented in a straightforward manner once the exact ex-
(52) pressions for the Hamiltonian and the forces have been
obtained. After inclusion of the real kinetic energy of the
nuclei and the 6ctitious kinetic energy of the PS wave
where 0;~ is a density matrix for the one-center expan- functions, we obtain a Lagrangian
sions in terms of partial waves.
The force due to the change of shape has the form

FR
(2) = ) .B0OE +Re'i
~
(53)

The energy derivative with respect to 0;~ is obtained


where the last term ensures orthogonality of the AE wave
sixnilarly to the corresponding term in the Hamiltonian
functions via the method of Lagrange parameters. The
[cf. Eq. (32)]
resulting Euler-Lagrange equations have the form
BE BE BE Bn'
BO,
dr
Bn' + - BQ~I,
BQ~L, Bni B8; m~14„) = —H14„)+) 014 )A „—
7

MRR = FR
where and are integrated with the Verlet algorithxn. The La-
50 PROJECTOR AUGMENTED-WAVE METHOD 17 963

grange parameters for a system at rest are related to the time step. It difFers from O(0) only if the atoms are mov-
Hamiltonian via A„= (e„lHle~)(f„+ )/2 .Note f ing. The trajectories determined with this procedure are
that for a system at rest the Hamiltonian commutes with exactly symmetric under time reversal, which is crucial
the occupations. to obtaining energy conservation and predicts the wave
functions accurately to the order 4,
consistent with the
overall accuracy of the Verlet algorithm. Furthermore,
B. Propagation of the wave functions the constraints cannot deteriorate if a finite time step is
used.
The equations of motion for the electrons are inte- In practice we first evaluate the forces of constraints
grated using the Verlet algorithm3~

e„(+)) = 2le„(o)) —le„(—)) —HI4 (o)) Ix ) = o(0)le„(0)) (67)

+) OI4 (0))A „—.


n
(63) Using Eq. (65) I rewrite the constraint equation Eq. (66)
Tn as
The following notation is used here for the time steps. A( ) + XtB+ BtX+ Xtt X = 1 (68)
The time step is denoted by A. The wave func-
tions have an integer argument denoting the num- with the definitions
ber of the particular time step relative to the ac-
tual configuration. Hence the series of coordinates is A."' = (e-Io(+)le-), (69)
~" Ie ( —2)) Ie ( —)) Ie (o)) Ie (+)) Ie (+2))
For convenience let us abbreviate the arguments for the
previous and the subsequent time step. Similar notation
B- = 4-Io(+)le-) (70)
is used for the atomic positions. c- = 4 lo(+)l~ (71)
As a rule of thumb, the equations of motion for the
wave functions are integrated properly if the time step and
4 is related to the fictitious mass of the wave functions Q2
m@ so that b, 2/m@ lies in the range 0.1— 0.15 a.u. ss For X „=A (72)
most systems the mass m@ can be chosen to be 1000 a.u. , my f„
resulting in a time step of about 10 a.u. or:0.25 fsec.
During the dynamical simulation, we have to ensure Equation (68) cannot be solved directly for X. There-
the orthogonality of the wave functions in an energy- fore we obtain X iteratively. Let us first analyze Eq. (68)
conserving manner consistent with the accuracy of the in orders of the time step A.
Verlet algorithm. 3~ The methods were originally invented We will see that the leading order is proportional to
for molecular-dynamics simulations of polymers that A = h„+
O(Az), because the forces of con-
obey bond-length constraints. Car and Parrinello straint contribute only in the second order and therefore
adopted this algorithm in their formulation of first- le(+)) = Ie) + O(b, ). As the leading order of X„ is
principles molecular dynamics. Later Laasonen et Ol. proportional to 6,
the term Xtt X vanishes in leading
order and only the zeroth order of B contributes. The
extended this method to the case of a position-dependent
overlap matrix, as used for Vanderbilt's ultrasoft pseu- zeroth order of B is equal to (e„(0)IO(0) Ie (0)) and
dopotentials. I have adopted their strategy and extended therefore Hermitian. Hence the lowest order of Eq. (68)
it to include the possibility of different occupations for in 6is
different states. This will be described in the following.
The wave functions are first propagated without con-
straints, which yields le)
)
l
„', , + „', ~f = —(A„' —b„), (73)

2 where i = 0 and B' = (B + Bt)/2 is the Hermitian


I4' ) = 2le-(o)) —Ie-( —)) —Hle-(o)) (64) part of B. Equation (73) determines X = X~ol + O(b, s)

„,
accurately in leading order.
The forces of constraint are added in a second step Analogously to the previous discussion we find that
also the higher orders of X' = . X~'l are obtained from
P,
A~'l given by
)
e„(+)) = le„) — o(o)le (0))w (65)
Eq. (73), with
(i) g(i —1) + X(i—1)tB + Q~(i —1) + X(i—1)tax(i —1)
and the Lagrange parameters A are determined itera- (74)
tively so that the constraints for the next time step
However, there are many solutions to Eq. (73). They dif-
(e-(+) Io(+) le-(+)) = h- (66) fer by a matrix hX = (B') ~D, where D is an arbitrary
antisymmetric matrix. Only a solution that fulfills
are exactly fulfilled within a given tolerance for the over-
lap matrix. 0(+) is the PS overlap operator for the next Xag fg —f*Xg' (75)
17 964 P. E. BLOCHL

will conserve the energy and is of interest. F. = RF&+ R) f„(4„~V'RH~4„)


To obtain X, we diagonalize B„=P& U„&b~U~~ and
obtain its eigenvalues 6; and the unitary matrix U
formed by its eigenvectors. From that we determine
+) [A„(4 iOi4„) + (4 iOii„)A„ I.

) . U„„Up, (A . *„— Now we use the first energy derivative of the constraint
(.+, ) . 8~I ) Ui
i'~ 2 f„ equation (4„~0~4 ) and the requirement that A be Her-
bp+ 6I f. +f mitian to obtain
(76) F. =R F~+) f„(4.„~vRHI4„)
The iterative procedure Eq. (76) for X = g,. X' has
a fixed point at the correct solution for Eqs. (75) and )
— (4„~V&0~4 )A „.
(68). In each iteration Eq. (75) is exactly fulfilled, which
ensures energy conservation in each step.
This iterative scheme for X is a Taylor expansion in
The requirement E=0 results in an expression for the
force
4 if either all occupations are identical or if B' is unity
times a constant factor. As these requirements often are
not ful6lled, each order in 6 requires an additional itera-
Ftt = ) .—f-(@-l&JtHI@-) + ) .(@-I&ttOI@-)A-
tion, which is similar to that described above, but with-
out the term quadratic in X~'~ and the non-Hermitian
(8I)
part of B in Eq. (74). Owing to the close relationship which in the stationary case is identical to Eq. (48) above.
between the two nested iteration schemes, in practice I The propagation of the ionic positions is straightfor-
perform only the outer iteration, which now also plays ward once the Lagrange multipliers are known. As seen
the role of the inner iteration. in Sec. VB, those will be calculated only after the new
Evaluation of the overlap matrix A~'~ does not require positions are determined because the PS overlap opera-
that the scalar products of the wave functions be reevalu- tor and hence the orthogonalization that yields the La-
ated. Instead, the matrix A~'j is calculated from Eq. (68) grange multipliers depend explicitly on the atomic po-
using the matrices X~'~ from the previous iteration steps. sitions. However, the Verlet algorithm has only limited
Note also that B' needs to be diagonalized only once for accuracy in the time step A. Hence it is sufhcient if we
every time step, i.e. , once for the entire iterative scheme can predict A up to the order 6. This implies that a
described in this subsection. Convergence is reached if linear extrapolation from the last two time steps
every element of the right-hand side in Eq. (73) is smaller
than a certain given tolerance. Finally we can predict the A„(0) = 2A„(—) —A„(—2) + 0(b, ') (82)
new PS wave functions according to Eq. (65). is sufficient.It is therefore not necessary to achieve self-
consistency of atomic positions and Lagrange multipliers
iteratively, which would be computationally extremely
C. Propagating the atoms demanding. This is in contrast to the approach used
by Laasonen et al. , who suggest that A and forces be
The equations of motion for the ions are integrated as determined self-consistently.

Q2
R, (+) = 2R, (o) —R, ( —) + FR, (77) 2. Renormalization of the atonw cmasses'
R,.
The fictitious dynamics of the electronic wave func-
Forces consistent with the Lagrenge multip/iers tions has two main eKects on the atomic trajectories.
First, energy is transferred constantly to the wave func-
As mentioned before, the force component due to the tions, which have a temperature that is very low com-
pared to that of the ionic subsystem. The rate of
changing overlap operator described in Eq. (58) must be
modified in the molecular-dynamics formalism. Instead heat transfer is roughly proportional to the magnitude
of the forces acting on the ions and to the band gap
of (4'„~H~4 )(f + f)/2, one has to use the Lagrange
between occupied and unoccupied states. This efI'ect
multipliers. This results f'rom the condition of energy
can be controlled for long simulations using two Nose
conservation. The change of the total energy is
thermostats, ' one to keep the ions at their physical
temperature and one to keep the wave functions close to
a = MRR+ R„+) f„m+((4„~4„)+ (C.~4.)) the Born-Oppenheimer surface.
The second e8'ect is that the ions propagate as quasi-
dE dE particles dressed by the wave functions, which increases
+(4-I - + - l~-) (78) the e8'ective mass. This e8'ect can be compensated by
d(i„i di4„) renormalizing the masses of the nuclei. The magni-
We insert the equations of motion and resolve the energy tude of the efFect can be estimated from an isolated
derivatives using Eqs. (42) and (47) to obtain atom that experiences external forces, described by a
50 PROJECTOR AUGMENTED-WAVE METHOD 17 965

potential V,„i(R) acting on the nuclei. Assuxning that


the wave functions of the atom reside exactly on the
Born-Oppenheimer surface, the wave functions do not
change, except that they undergo a rigid displacement
and ~4) i ——~V4) tR. Hence the Lagrangian can be
simplified to

) R l
2m+ ) fat, ~(Va@n~V~@~)st

+M&b;, R, —Eo[@j —V.„,.

Here Eo is the total energy of the isolated atom, which


is constant during the simulation. The constraints of
orthonormal wave functions are automatically fu16lled
because here 4 denotes rigidly displaced PS wave func-
I I

0 1
tions of the isolated atom. The efFective mass tensor
(ao) r (ao)
2m@ g„ f„(V;0'„~V~4'„) +. MRb;~ is diagonal because
the isolated atom is spherically symmetric. The efFec- FIG. 1. Partial waves and projectors for Mn. Left panel:
tive mass AE partial waves (solid lines) and PS partial waves (dashed
and dash-dotted lines). The "first" PS partial wave is a
dash-dotted line. Right panel: first (solid line) and second
(dashed line) projector functions. (a) and (d) show the re-
is therefore one-third of the trace of the mass tensor, sults for the 6rst and the second partial wave of the s angular
which has been modi6ed after applying Gauss's theorem. momentum channel, respectively, (b) and (e) for the p chan-
The expectation value f„(4~ —2V ~@)« is nothing
P„ nel, and (c) and (f) for the d channel. 3s and 3p functions are
other than the plane-wave part of the true electronic ki- treated as valence states. Functions are scaled individually.
netic energy. Hence the bare mass of the ions used in the
Lagrangian should be reduced by

bM = -m+ ) f, „(4„~—'V'~4'„)„.
—, (85)

This correction has been included in all our simulations outward for the self-consistent atomic AE potential v q
described here. The quality of this correction can be and a set of energies e;. In practice we use the scalar
estimated by comparing the kinetic energy related to the relativistic version of Koelling and Harmon.
PS wave functions of the system of interest to that of the The AE partial waves are chosen to describe the phys-
isolated atoms. ically relevant states, i.e. , those &om the valence band
region, reasonably well. The energy e of the first partial
VI. CONSTRUCTION OF PARTIAL WAVES wave per angular momentum and site is usually chosen as
AND PROJECTORS the energy of the 1owest bound valence state of the atom.
The energy of the second partial wave is chosen after in-
The basic ingredients of the PAW method are partial specting the scattering properties of the PAW method
waves and projectors. There is an in6nite number of ways for the isolated atom with only one partial wave. It is
to construct them. I will describe here in detail the par- placed at the energy where the scattering properties be-
ticular choice I made for this application. Even though gin to deteriorate. There is no need for the partial wave
the solution of the problem is quite satisfactory, there to be bound states because the exponentially increasing
may be better choices than the ones described here. In tail will be canceled exactly by the identical behavior of
particular the construction of PS partial waves is com- the PS partial waves. The number of partial waves is
pletely analogous to the construction of pseudopoten- then further increased until the scattering properties of
tials with the pseudopotential method. The expertise the valence band region are described satisfactorily.
acquired with the pseudopotential method is likely An equally justi6ed approach, more similar in spirit
to create choices that permit a further reduction of the to the linear methods, is to use increasingly higher en-
number of plane waves. The partial waves and projector ergy derivatives of the energy-dependent partial waves
functions obtained with the procedure described below obtained at one fixed energy. In principle, one can also
are shown in Fig. l. add partial waves &om atoms with various occupations.
If core states extend beyond the augmentation re-
A. All-electron partial maves gion, we subsequently orthogonalize the AE partial waves
within the augmentation region to core states centered
The AE partial waves are obtained by radially inte- on the same site. We 6nd that one AE partial wave per
grating the Schrodinger equation angular momentum and site is often sufBcient and that
P. E. BI.OCHL

two terms yield a satisfactory description even for diK- TABLE I. Parameters for the construction of PS partial
cult cases such as transition metals and systems in which waves. The cutoK parameter has been chosen as A = 6 for all
semicore states are treated as valence states. However, if atoms and angular momentum channels.
one is interested in states that lie far above the valence Symbol v{0)[II) ~. [H) &ks
band region, the number of AE partial waves can be in- H —3.43 -0.234 0.45
creased further until the desired accuracy is achieved. Li —1.43 —0.106 1.20
The entire construction is fairly insensitive to the Be —1.40 —0.207 1.20 —0.079 1.20
choice of the energies of the AE partial waves. Since 8 —2.20 -0.346 1.00 -0.137 1.00
the valence band region can be described fairly well with —2.47 --0.501 1.00 —0.199 1.00
two partial waves, as shown in the linear methods, any -2.58 -0.677 1.00 -0.266 1.00
two partial waves &om this region will span a, very similar 0 —3.19 —0.873 0.90 -0.338 0.90
portion of the Hilbert space. Even though the individual Og —2.60 --0.873 1.00 -0.338 1.00
partial waves and projectors will deafer, they represent an F —2.55 -1.090 1.00 —0.415 1.00
almost equivalent choice. Fe 1.88 —0.020 1.50 —0.058 1.50 -0.287 1.50
0.000 1.50 0.000 1.50
B. Pseudopartial waves Mn 0.0 -0.194 1.50 —0.054 1.50 —0.257 1.50
1.50 1.50 0.500 1.50
To construct PS partial waves, I proceed in loose anal-
—3.20 -3.138 1.50 -2.002 1.50 —0.257 1.50
ogy to the pseudopotential approach described in the -0.194 1.50 —0.054 1.50 0.500 1.50
work of Hamann et al. , ' o' which I have extended to
include several terms into the separable form. However,
in general we do not perform the norm-conservation step
suggested there. Is') '&'+ v-~ —e,')I&') .
= {--, {90)
I 6rst select a PS potential 6 t. This is done in two
If the result is zero,we set the projector function equal to
different ways, depending on the element.
the cutoff function k(r). These projector functions must
(i) For transition metals, a polynomial of fourth order
is matched diKerentially to the AE potential. Outside be modified such that they fulfill the condition (p;Ig~) =
the matching radius the two potentials coincide. The b, ~. This condition is now imposed iteratively beginning
remaining &ee parameter, the value of v ~ at the nuclear with the lowest partial wave. The following equations
site, is adjusted by hand. [(91)—(96)] should not be read as mathematical identities
(ii) For elements without d electrons in the valence but rather like a computer program: The left-hand side
shell, v~t is obtained from the AE potential as v~t(r) = is the product of the right-hand side, with the new value
v i (0)k(r) + [1 —k(r)]v t(r), using a cutoff function of the overwriting previous values for the same symbol. I have
form done this to avoid multiple symbols for the same quantity.
For a given partial wave denoted by subscript i, and
k(r) = exp[ —(r/rg, )"]. (87) assuming that (pi, lp~) = bi~ for k, j(
i, the projector
functions are erst orthogonalized to all lower PS partial
In order to obtain the PS partial wave, I de6ne for each waves by
AE partial wave a PS potential of the form —
).
2 jL

v(rt) = v, (r) + c;k(r) . (88) lp') = IP') — lI ~)(&~ I&') . (91)


j=l
The values of the cutoH' radius rI, and the exponent A are
chosen such that this potential is virtually identical to the Then the AE and PS partial waves are modi6ed in order
AE atomic potential outside the augmentation region. to ensure the orthogonality of the PS partial wave with
Often we choose A = 6 and rg as three-quarters of the the lower projector functions
covalent radius. The values used in our calculations are t —1
listed in Table I. The PS partial wave is obtained as a
solution of the nonrelativistic Schrodinger equation 14') = I@'& — ).
j=1
I+')(p I+'&

1 i —1
—— V + iv, (r) —e 1 IP;) = 0
( 2

for the energy of the corresponding AE partial wave and


{89)
I&*-& = l4') — ).
j=l
I@ &(A~ I+'& . (93)

the potential iv;(r). The &ee coefficient c; in Eq. (88) Finally the projector function and partial waves are
is then determined such that the PS partial wave coin- scaled so that Q, IP;) is unity
cides with the corresponding AE partial wave outside the
augmentation region.
IP') = lp')/(p'I&*) « (94)
C. Projector functions = I@-)/c
I@'&
Next we calculate preliminary projector functions ac-
cording to I+'& = l0. )/c. (96)
50 PROJECTOR AUGMENTED-WAVE METHOD 17 967

The free constant c is used to avoid very large projector in detail. In particular, this approach ensures that the
functions, while the partial waves are very small and vice states used to construct the PS partial waves are repro-
versa. It has no influence other than to prevent very sxnall duced correctly as atomic PS wave functions of the PAW
and very large numbers, which may create problems on method, irrespective of the quality of the partial-wave
the computer. Once the set of projectors and partial expansion.
waves with index i are modified to obey the orthogonality Let me summarize which quantities we import from the
condition, one proceeds to the next set of projectors and atomic calculation into the ab initio molecular dynamics
partial waves Ip;+&&, 14;+&&, 14;+&). simulation: (i) the AE and PS core densities, (ii) AE and
PS partial waves Ig;& and IP;& and PS projector functions
— — —
(s 'I (iii) the matrices (O'*I 2&'14~& (&'I 2't7'l&~& and
D. The potential 8 —
(P;IP~& (P;IP~& for the calculation of the one-center
con-
tributions of kinetic energy and overlap matrix (note that
The potential 8 is now obtained by subtracting the the Laplacian for the AE partial waves is replaced by its
potential of the self-consistent atomic PS density &om scalar relativistic counterpart), and (iv) the cutofF r, that
the PS potential used to define the PS waves: determines the range of the short-ranged compensation
densities.

VII. ANALYSIS OF THE PARTIAL-%PAVE


This step is the analog to the unscreening of a pseudopo- TRUNCATION ERROR AND EXTENSIONS
tential performed in the pseudopotential approach. OF THE PAW METHOD
Since the PS partial waves do not necessarily corre-
spond to the atomic bound states, which are needed in In the previous sections we have taken the point of
Eq. (97), the latter are obtained from the radial, separa- view that the PAW method is an exact formulation of
ble Schrodinger equation the Kohn-Sham equations, from which a practical scheme
is obtained by truncating two rapidly converging series
expansions. Here I will analyze the truncation errors of
the partial-wave expansion in detail and thus justify the
(98) choices I made for wave functions and total energy ex-
pressions. This section can be skipped by the practi-
with dH;~ and dO;z given as tioner. I recommend this section to those who are inter-
ested in the underlying principles and possible extensions
of the present implementation of the PAW method.

A. Truncation error in the wave function


A way to solve this equation is sketched in the Appendix.
Orthogonality to the corn states
To obtain the PS density we still need to define its
core contribution. The PS core density 6' is obtained by
matching a parabola differentially to the AE core density. The main effect of the truncation of the partial-wave
Outside the matching radius PS and AE core densities expansions for the wave function is to redefine the trans-
are identical. Using the wave functions and occupations formation &om the valence wave functions to the PS wave
of the atom one constructs a PS charge density n and functions. This in itself does not introduce errors, but it
from that 8 according to Eq. (97). affects the orthogonality to the core states. Whereas the
AE partial waves are constructed to be orthogonal to the
core states, a nonzero remainder of I%'& — IP;)(p;I%'&
P,
E. Outlook can create a nonzero overlap with the core states.
Therefore, I introduce a new definition of the trans-
The procedure described above is by no means the formation 7 that explicitly ensures that any PS wave
only way to create PS partial waves. There are a function is transformed onto an AE wave function that
number of different ways to construct first-principles is exactly orthogonal to the core states IP'):
pseudopotentials. These xnethods can easily be
adapted to relax the norm-conservation condition, to al-
low a larger augmentation region, and to include un-
bound states. Each of them can be used to construct (101)
PS partial waves corresponding to given AE partial waves
and, with the procedure outlined above, to construct pro-
jector functions. In the analysis of truncation errors it should be kept in
Once the PS partial waves are defined, it is recom- mind that Eq. (101) rather than Eq. (9) is the true def-
mended that the procedure described above be followed inition of the transforxnation between PS and AE wave
17 968 P. E. BLOCHL

functions. Of course, if the partial waves form a complete with the "AE projector functions" defined as
set, the two expressions are identical.
(103}
2. Additive augnaentation
Note the difference between the AE and the PS projector
When truncating the partial-wave expansions it is im-
functions.
portant that the partial-wave expansions of the AE and
To show that the transformation is unique, we test
the PS wave functions are truncated in a completely anal-
whether any nonzero function orthogonal to the core
ogous way. This principle is called additive augmentation
states is mapped onto a zero PS wave function. This
and has important advantages.
would be the case only if
First, the wave functions of the PAW method are dif-
ferentiable to an arbitrary order if the PS partial waves (104)
have been constructed to be differentiable to an in6nite
order. (In many implementations of the LAPW method
the wave function is even discontinuous. ) for any function 14) orthogonal to the core states. Hence
Second, higher partial waves not explicitly included in such a function I@) must be a superposition P,
(ig;)--
the partial-wave expansions are represented by the tails 1$;))c;. If we insert this ansatz into Eq. (104) and exploit
of the plane-wave part that extend into the augmentation (p;1$~) = S,z, it is clear that the coefBcients must fulfill
region.
Finally, the PAW basis set is complete whenever the (105)
plane waves form a complete basis set, irrespective of
the partial-wave truncation. This justi6es the use of
partial waves imported &om the isolated atom with- The matrix (p;1$~) is none other than the matrix a, ~ de-
out adjusting them to the actual potential, as done in fined above, as it fulfills the relation g&(p; i/i, ) (pi, i/~) =
the linear methods. The use of &ozen partial waves b;~. Equation (105) can only be fulfilled if either a;~ has
has substantial advantages in combination with the first- a zero right-hand eigenvalue or the functions on the left-
principles molecular-dynamics approach because it elim- hand side can add up to zero; these are the exceptions
inates a large number of parameters that otherwise have given above. This concludes the proof of the complete-
to be treated as dynamical variables or determined vari- ness of the PAW basis set.
ationally in each time step to a very high degree of accu-
racy.
The principle of additive augmentation itself is not B. Truncation error in the expectation values
new and has been exploited to some extent in the
I MTO method and in the APW method of Soler and While evaluating expectation values in Sec. VIIA,
Williams. There the angular momentum expansions it was reasonable to neglect cross terms between the
of the wave function and charge density were truncated three contributions of the wave function because 14')—
in the same way, resulting in a very rapid 8 convergence.
As a result of the projector augmentation, however, we
g, i/, ) (p; I@) = 0 in the augmentation regions, if the par-
tial waves form a complete set of functions in the aug-
can exploit this principle even on the level of individual mentation region. However, if the partial-wave expan-
partial waves. sions are truncated, this condition is no longer exactly
Here I will show that truncation of the partial-wave ful611ed.
expansions does not affect the completeness of the ba- In the following I will use the symbols
sis set: If a set of PS wave functions forms a complete
basis, the corresponding basis of projector augmented
P, ig, )(p;14') for the one-center expansion of the AE
wave function and 14'i) = P,
. Ig, )(p;14) for the one-
wave functions is complete in the Hilbert space orthog- center expansion of the PS wave function. The difference
onal to the core states. For this to be true two weak
conditions must be ful6lled: There is a matrix a,.~ such
AA„=- AA„+ AA„between the matrix elements
of an operator A calculated directly from Eq. (101) and
that P& a;g(pl, [g~) = b;~, which has no zero right-hand
eigenvalues, and the differences between AE and PS (@„IAIDO' ), with the PS operator from Eq. (11), is given
partial waves 1$, ) —1$, ) are not linearly dependent. by
To prove this statement, it must be shown that for
every AE wave function orthogonal to the core states
there exists one and only one well-de6ned PS wave func-
tion. For linear transformations such as the ones consid-
ered here, this implies first that we can define an inverse
transformation 7 from the AE wave function to the PS
wave function. This is indeed possible and the expression
is formally very sirailar to the forward transformation: (107)
(102) where P' = g,. 1$;)(P;. I
is the projection operator on the
core states.
50 PROJECTOR AUGMENTED-WAVE METHOD 17 969

The first term AA„ is proportional to the difference the shape of the potential: High-energy electrons pass
l@) —l4' ) between a PS wave function and its one-center through the atom too fast to be seriously affected by
expansion, whereas AA depends quadratically on it. the potential. Hence the difference between the AE and
Consequently both terms converge to zero as the partial- PS partial waves vanishes for high energies, even though
wave expansion is xnade complete. each partial wave itself is still sizable. Since they appear
The first term is a matrix element between the differ- in pairs of opposite sign in the truncation error, the con-
ence between the PS wave function and its one-center sistent truncation of both expansions is highly favorable.
expansion, which is largest at the surface of the aug- I shall return to this point when comparing the PAW
mentation region, and a function that is localized in the with the LAPW method.
center of the atom, namely, the difference between the
one-center expansion of the AE wave function and the
one for the PS wave functions. For quasilocal operators C. Truncation error in the total energy
such as the kinetic energy or the real-space projection op-
erator needed for the total energy, one profits &om the The error in the total energy can be divided into two
fact that the two functions are largest in opposite regions parts. The first is due to the difference between expres-
of space, resulting in small errors. sions (19)—(21) and the total energy calculated directly
The fact that the dominant truncation error AA using the expectation values for charge density, kinetic
is proportional to the difference between AE and PS energy, and overlap obtained via Eq. (11). The second
partial waves is the reason for truncating both partial- source of the error is due to the approximations described
wave expansions in exactly the same way. Partial waves in Sec. VIIB.
for higher energies become increasingly insensitive to The first error term is of the form
I

dd], E~'l —
) (fr[n(r) ni (r)] dr' l
& ) @ &( ) ( )
6(r) l

dr([n(r) + n'(r) —n'(r)]e„, (n + n —n') —n(r)e„, (n) —n'(r)e„, (n') —n'(r)e„, (n. ')) . (108)
AR

It is easily seen that both terms vanish as 6 —n, i.e. , if where the operator A is —zV + v —e„, where v is the
the partial-wave expansions are coxnplete. Furthermore exact potential and e„ the exact energy eigenvalues. The
the integrands go smoothly to zero at the boundary of error is the sum of the diagonal matrix elements with
the augmentation region. the eigenfunctions. Let us consider again only the lowest
To get a better idea of these terms, we can expand order in [4') —i@i):
them in orders of n —n and ]C') —i@i) and consider
only terms up to the first order: b, E~ ) = 2Re )
R, n, i
f„(4'„—4'„l
= y- dr[R(r) —vR(r)][vv(r) —vR(r)]
~Z
R f [(v—w;) —(e —.
,')]l4"))(p;le )
+O(n —n')'
+o(I4') —l4') )
= 2Re ) .f-(~- —~.'l(v~ —
Rn
vh) l~.')
using Eqs. (86) and (89), which define the partial waves,
(110)

+&(14) —l~'))' (109) and (106). (Note that we need to apply a linear trans-
forxnation to partial waves and projectors to undo the
Before returning to AE~ ~, I consider the errors that scrambling of partial waves described in Sec. VIC. )
propagate from the approxixnation of the expectation val- Coxnbining the two sources of error, AE~ &om ~

ues. The error can be obtained via Eqs. (106) and (107), Eq. (109) and b, E(2) from Eq. (110), we find
I

b, E( ) + b, E( l = 2Re )
R, n, i
f„(C'~ —4' l((1 —P')[(v —v~t) —(e —e;))lP;)

[(v —v )
—(m; —v )
—(e —e;)llg )k(p
Let us simplify Eq. (111) by replacing v with v, which is justified since the difference between them also vanishes
if the partial-wave expansions are complete and therefore does not contribute to the lowest order of the truncation
error in l@) —l4 ). Furthermore, we exploit the fact that (@ —@ l(v);)v t) lP, ) vanishes because (v); —v t) lP,. ) can be
expressed as a superposition of projector functions [Eqs. (89) and (90)] and (4 —@ lp;) = (4](1— ]pe)(4)~ l) lp;) = 0: P
P. E. BLOCHL

~Z~'l+ ~E~'l = 2R. ) f„(e„—e„'I


R, n
xi(1 —P ) [(v' —v-i) —(e- —e,') ll@*& —Nv' —v-i) —(e- —e,') jl&'&)(p*l~-& (112&

Equation (112) is an important result. It tells us that molecular or crystalline environment. This set of par-
the strongly varying potential of the core and the nu- tial waves will be divided into a subset of "lower" partial
cleus does not contribute to the error. This is a result of waves that will be used as partial waves in the way out-
an efficient cancellation of two errors, namely, AE~ and~
lined in the previous sections and a subset of "higher"
AE~ ~. In a hand-waving way, the term proportional to partial waves. The lower partial waves are adjusted to
(1 —P')(v —v i)lg;) —(v —v i)lg;) describes the charge the actual potential by mixing with the higher partial
density transferability error because it depends on the de- waves. This approach avoids the inclusion of additional
viation of the potential from the atomic one, whereas the projector functions and the corresponding increase of the
term proportional to (e„— e, )1$, —P, ) describes the error computational eH'ort by determining the coefficients of
in the scattering properties or the energy transferability. the higher partial waves self-consistently &om the po-
Note that the constant term in v and v almost cancels a tential within the sphere. The procedure can be termed
similar term in e —t, . It is also worthwhile to note that downfolded augmentation.
the right-hand side of the product vanishes diEerentiably Let us denote the higher partial waves by the sub-
at the boundary of the augmentation region, whereas the scripts h, 6' and the lower partial waves that adjust to the
term (41 —(@ is a small quantity, which is expected to
I
potential by the subscripts l, jt'. The rigid partial waves
be largest far kom the center of the atom. The resulting that have been constructed &om an atom and that will be
expression could actually be used to estimate the partial- used as a reference are distinguished &om the adjusting
wave truncation error in practice. partial waves by the superscript Q.
I make the following ansatz for the lower partial waves
and projectors
D. Extensions of the PAW method
I«& = I&(') + )h
144&«~
The PAW method lends itself to a number of extensions
which, though not yet implemented, may be interesting 14~&
= I«')+). I&h&a«
to keep in mind. These extensions concern the use of
partial waves that adjust to the actual potential and the
relaxation of the &ozen-core approximation. I will show
Ip~&
= ) lpga')b«+ ). I pa)c~i
in Sec. VIII that the PAW method is highly accurate even
without these features owing to the rigorous exploitation The coefficients will be determined such that (i) the or-
of the principle of additive augmentation. However, I thogonality relation between projector functions and par-
want to demonstrate that the PAW method is sufficiently tial waves is fulfilled and (ii) the scattering into the higher
Qexible to accommodate them, if desired. This will turn partial waves vanishes:
the linear transformation between AE and PS wave func- (p~ I&i& = ~~, ~

tions into a nonlinear one. As in the linear methods with —«)l&t& = o


adjusting partial waves, the nonlinear degrees of freedom (41~r (-2&'+ v'
can be relaxed during a self-consistent procedure. How-
ever, some caution is required if they are to be used in
combination with a Gctitious Lagrangian formalism be-
cause all nonlinearities must be treated consistently. In
contrast to the linear methods in their present implemen-
X (dHj' t" &idol', I" ) (pl"
tation, the partial waves will be adjusted here to both I

spherical and nonspherical parts of the on-center poten-


tials. Inserting the ansatz, we obtain expressions for the ma-
trices a, b, e

Optifaisation of partial viaves


to the actual potential
a~, i =— ) =~~ («)(4a I~n( 2&'+ v——«) 14i & .

Here I describe how the partial waves can be adjusted


to the actual potential within the frozen-core approxima-
tion. In Sec. VIID2, I will describe how to relax also
«, i = — ). (&a ion(
—2&'+ v' —«) I&i') (d~ —«dOli i'

the &ozen-core approximation.


We start out with a large set of partial waves, one that
is suKciently complete to describe the wave functions ac- —~l, l' &Lb+hi'
curately for all possible potentials that may occur in a
50 PROJECTOR AUGMENTED-WAVE METHOD 17 971

where =i ~i, (e) is defined by where ='. &(e) is defined by

) =-i, g~. (.)(&,-len( -,'V'+v'-g)ly„, ) =b„„,. (11g) =-; ()(&"l(1—~ )( —'v'+ ' — )
—,

x (1 —I' ) lyi"} = a;i (»&)


The matrices de i and dOi i are defined as in Eqs. (gg)
and (100) using only the lower, optimized partial waves P =P; IP;. }(P;. denotes the projection onto the relaxed-
I
and the actual one-center potentials instead of the atomic core states. The validity of the result can be verified by
potential. If the partial waves are xnade self-consistent inserting it into the de6ning equations. The result can be
in each time step, also the projections (peal@„) must be further sixnplified using the fact that the core states are
transformed according to the change in the projector solutions to some Schrodinger equations and the known
functions. When adjusting the partial waves and pro- orthogonality relations. It should, however, be noted
jectors, the energies e~ are chosen relative to a potential that (1 —P )IP; ) vanishes as the &ozen-core states and
reference, such as the average potential in the augmenta- relaxed-core states become similar. Hence these func-
tion sphere, in order to avoid a dependence on a arbitrary tions should be normalized before inserting them into
overall shift of the potential. the above equation in order to avoid a division by zero.
If sexnicore states are present, the same transformation
2. Beyond the frozen-corn approaimation should be performed between the PS partial waves and
Even though the present method has been imple- PS core states in order to guarantee that PS and AE
mented in the &ozen-core approximation, it is not lim- partial waves match at the boundary of the augmenta-
ited to it. The core states can of course always be re- tion region. This linear system of equations can be solved
laxed within an internal self-consistency loop for core and iterated until self-consistency is achieved among core
wave functions and one-center potentials. The only diffi- states, AE valence partial waves, and the potential.
culty is that the orthogonality between AE partial waves
and core states must be restored. One could simply imag-
ine using a Gram-Schmitt orthogonalization procedure. VIII. NUMERICAL TESTS
However, this would produce partial waves that are no
longer close to the solution of the Schrodinger equation, A. Scattering properties
resulting in large partial-wave truncation errors.
Let us therefore mix the partial waves with both the It can easily be shown that the scattering properties
&ozen- and the relaxed-core states to impose orthogonal- of the atom are reproduced correctly in the neighbor-
hood of the energies for which partial waves have been
ity and to minimize the total energy. We can make an
ansatz for the new AE partial waves included. The logarithmic derivative of the PS and AE
wave functions and their 6rst derivatives agree beyond
I&'} = I&')+). I& )a '+). l4;}b' (11g) the core region. The proof is analogous to that for local
potentials, which can be found in Skriver's book.
Thus the scattering properties of the atom can be im-
where Ig;} are the AE partial waves orthogonalized to the proved systematically for an arbitrarily large energy re-
relaxed-core states, lge} are the partial waves orthogonal- gion by increasing the number of partial waves. This
ized to the &ozen-core states, IP;} are the relaxed-core principle is illustrated in Fig. 2 for the example of man-
states, and lg;o} are the kozen-core states. The coef- ganese. The semicore states have been treated as va-
ficients u~, and b~; are determined such that the total lence states, which is not necessary in most applications.
energy is minixnized The logarithmic derivatives obtained with only one par-
(4'I — v'+ v' —el', } = o, tial wave per angular momentuxn — not to be confused
-', (120) —
with the setup Mn~ in Table I result in a poor descrip-
(4;I ——,'v'+ v' —.
l4, } = o. (121) tion of the valence region. If the number of partial waves
is doubled, the scattering properties are accurate up to
The relaxed-core states IP;} depend explicitly on the co- approxixnately 1.5 Ry above the occupied states as shown
efficients a;~ and 6;~. Using the fact that the core states in Fig. 2, which is more than sufficient for most calcula-
are solutions to some Schrodinger equations, we see that tions. This choice corresponds to the number of partial
the second equation reduces directly to the orthogonality waves used in the linear methods. Our experience is that
condition between relaxed core states and the new par- 6rst-row elements can be well described with only one
tial waves. The resulting partial waves for the relaxed partial wave per angular momentum and that two pro-
core are jectors per angular momentum are sufficient for the nar-
row d states and if semicore states are treated as valence
I&*) =(1-& ) I&,'}-).
j, k
l0 )=-;a(e;) states.

(&"l(1 —& )(—-'v'+ ' — ') B. Accuracy of the AE wave functions

(1 —& )I4,'} (122) In order to analyze the accuracy of the AE wave func-
tions obtained with the PAW method, let us calculate the
17 972 P. E. BLOCHL

20 wave function is smaller than 1%.


In order to see the deviations let us compare the wave
functions for an energy of +13.6 eV; see Figs. 3(d) —3(f).
This is about 20 eV above the valence states. %hereas
10 the wave functions of the p and d angular momenta are
still quite accurate, we see deviations of 15% in the s-type
wave function. This type of deviation, namely, an under-
estimation of the maxima of the wave function close to
the nucleus, is a typical signature of partial-wave trunca-
tion errors in the wave function. We can conclude that
the PAW method predicts the wave function with high
—10
—20 —10 0 10 20 accuracy over a wide energy range.

FIG. 2. Scattering properties of the Mn atom. Logarithmic C. Structural properties


derivative function Dr(e) = rB„gr(r, e)/P(r, e) with r = 3ao
for s, p, and d angular momenta versus energy. Triangles,
I have performed a number of test calculations on
circles, and squares indicate the exact result for 8, p, and d
simple molecules to establish the accuracy of the PAW
angular momenta, respectively. Solid lines are obtained with
method. I have chosen small molecules because they
the PAW method using the setup denoted as Mng in Table I.
exhibit the strongest nonspherical potentials and the
Dashed lines have been obtained with the same setup, but
without the second partial wave per E. smallest bond lengths and therefore should provide strin-
gent test systems for any electronic structure method.
The results are summarized in Table II together with
scattering states of an isolated manganese atom using the the results of other recent accurate all-electron LDA
PAW method and compare the resulting AE wave func-
tions with the direct integration of the radial Schrodinger
equation. Figures 3(a) —3(c) shows the wave functions for TABLE II. Comparison of binding energies, structural
an energy of — 8.16 eV, which lies in the range of the oc- properties, and vibrational properties for dimers obtained
cupied states. The deviation between the PS and the AE with the PAW method at a plane-wave cutoft' of 30 Ry with
those of other all-electron LDA calculations. Note that the
B~ and the 02 dimer of Ref. 54 are non-spin-polarized.
Molecule Quantify PAW Other LDA
H2 EJs (eV) 4.62 4.65, 4.6
d(a) 1.46 1.45, ' 1.45
(u (cm ') 4040 4160
Li2 En (eV) 1.04 1.00, 1.02
d (ao) 5.13 5.12,' 5.20
(u(cm ') 335 322b
Beg En (eV) 0.53
" (ao) 4. 51 4.52'
~ (cm ') 367
B2 En (ev) 3.78 3.5
d (ao) 3.03 3.05
~ (cm ') 1060 1030
Ng En (eV) 11.38 11.47, 11.3
d (ao) 2.09 2.07,' 2.07
(u (cm ') 2417 2380"
02 En (eV) 7.33 7.48, 6.2
0 d (ao) 2.32 2.29
r (a) (cm ') 1660 1620"
En (eV) 3 11 3.33,
2.62, ' 2.63
~

FIG. 3. Comparison of atomic wave functions of Mn using d (ao) 2.67


the PAW method with the exact result. Each graph shows the (u (cm ') 1148 1060
wave function obtained from the PAW method (solid line), the En (eV) 3.99 4.05, 2.89'
exact AE wave function (bullets), their difference magnified d («) 3.68 3.74, 3.70'
by a factor of 10 (dash-dotted line), and the PS wave func- ~ (cm ') 441 418, 412'
tion (dashed line) for a given energy and angular momentum.
(a) —(c) are obtained at an energy of —8.16 eV, which lies in Reference 54.
the valence band region; (d) —(f) sre obtained at sn energy of Reference 55.
+13.61 eV, which is far above the valence band region. (a) 'Reference 56.
and (d) are s-type wave functions, (b) and (e) are p-type wave Reference 58.
functions, and (c) and (f) sre d-type wave functions. 'Reference 57.
50 PROJECTOR AUGMENTED-WAVE METHOD 17 973

calculations. The results agree very well with the AE calculations


I used fcc supercell with a lattice constant of 30 a.u. of Chen et aL, who predict dM„F — 3.20ao and
The PS wave functions have been expanded up to a plane dM ~ —2.96ao. As a comparison, the pseudopoten-
wave cutofF of 30 Ry. The charge density has been ex- tial calculations of Chen et al. overestimate these bond
panded to 60 Ry. The augmentation charge density has distances by about 2.5% relative to the AE results.
been expanded up to 8 = 2. I used the Perdew-Zunger The calculations of MnFO3 demonstrate that the PAW
parameterization of Ceperley and Alder's Monte Carlo method faces no problems in dealing with high-oxidation
calculations of the &ee-electron gas. The parameters states. Furthermore they show that also calculations
used to de6ne the PS partial waves are listed in Table I. with unfrozen-semicore states are feasible with moderate
For the 02 dimer calculation we used the setup denoted computational efFort.
by 0 in Table I. The setup Op resulted in a &equency I conclude that the PAW method matches the accu-
about 5'%%uptoo low due to the overlap of the augmenta- racy of the best existing schemes within the LDA. Even
tion regions in the 02 molecule. This setup (Os) has been though we have examined simple molecules here, stud-
used in the calculations of MnFO3 described below. ies of larger systems have also yielded a similar accu-
The atoms have been calculated with integer occupa- racy of structural parameters for which data existed for
tions, which makes them nonspherical. An exception is F, comparison.
which prefers to spread the two at electrons over three
orbitals. In the case of Fe2 we use a numerical spherical,
spin-polarized calculation on radial grids in the experi- D. First-principles molecular dynamics
mentally observed valence configuration d&d&s . This is
not the ground-state con6guration of the LSDA, but it To illustrate the quality of dynamical simulations I
has been used to allow comparisons with previous calcu- show in Fig. 4 the various energy contributions to the
lations. dynamics. The potential energy and the fictitious ki-
Dioxygen and the boron dimer have been calculated netic energy undergo a regular oscillation with a period
in the triplet con6guration, and the iron dimer has been that corresponds to 441 cm,
which agrees well with
treated in the septet configuration. All other dimers have the previously calculated vibration kequency of 412— 418
been calculated in a non-spin-polarized fashion. Binding cm . The solid line represents the conserved energy
energies have been reduced by the zero-point vibration which should be constant in a high-quality molecular-
energies. dynamics simulation. Here the conserved energy devi-
Vibrational &equencies have been obtained by dynam- ates less than 0.8 meV &om the initial value. No drift has
ical simulation. Hereby I expanded the bond length been observed within 10 H. Hence the quality of energy
by approximately 5%%up and let the system evolve unper- conservation is as good as that obtained with traditional
turbed according to equations of motion. No thermo- Car-Parrinello simulations using pseudopotentials.
stating has been used for electrons or ions. I used a time The oscillations of the fictitious kinetic energy should
step of 10 a.u. for all molecules, except for the hydrogen not be misinterpreted as deviations &om the Born-
molecule, for which I used a time step of 1 a. u. The H2 Oppenheimer surface. The latter are irreversible and
vibrational &equency obtained with a time step of 10 a.u. their signature is a monotonous drift of the fictitious ki-
is about 10'%%up lower and that obtained with 5 a.u. is 1.2% netic energy to higher values, accompanied by a simul-
lower. For N2, the reduction of the time step to 5 a.u. taneous drift of the nuclear kinetic energy towards lower
lowers the frequency by less than 1%. values. The oscillations of the 6ctitious kinetic energy
The results agree very well with other AE calculations. seen in Fig. 4 represent the motion of the wave function
The bond length deviates typically by less than 1% and
vibrational f'requencies deviate typically by 4%. Binding
energies have deviations of 0.1— 0.2 eV, which is within the
accuracy of previous calculations. Note that the largest
discrepancies can be reduced further by increasing the —20
plane-wave cutofF.
In order to study the other worst case for the PAW, E -40
i.e., that in which the density deviates strongly &om the
atomic density, I studied MnFO3. In this compound, the
Uj
60
I' ll 'I, 'I,II, II,II II,
I
II
,
, II
.
, I
II ~,

" "
II ' II I, il
manganese occurs in a formal oxidation state of seven. I'
II
"
II
,I
I
I I
II
I
I I I
il
'i I
'I
I
I'
il , I

I I' 'l I' I I


The structure is that of a slightly distorted tetrahedron 80 1 „' ~t J ij 'g'
J

formed by the oxygen and fluorine atoms, with the man-


I I

0.0 0.1 0.2 0.3 0.& 0.5


ganese in its center. In this system I also compared a
calculation with &ozen-semicore 3s and 3p states with t(psec)
one that included these electrons explicitly as valence FIG. 4. Energy contributions during a first-principles
electrons. Treating the semicore states as valence states molecular-dynamics simulation of an iron dimer. The
results in bond distances of dM„F = 3.205ap (ap is the dash-dotted line is the LDA total energy, the dashed line the
Bohr radius) and dM„O = 2.973ap, which agrees very fictitious kinetic energy of the wave functions, and the solid
well with the results when keeping the semicore state line the conserved energy. All energies are plotted relative to
frozen, namely, dM„F —3.189ao and dM„~ —2.976ao. their initial values. See text for discussion.
17 974 P. E. BLOCHL

on the Born-Oppenheimer surface. Its inQuence on the


motion has been compensated by the renormalization of
the nuclear masses by 8% as described in Sec. V C 2.
In conclusion it has been demonstrated that the PAW
0.0 '-

method allows high-quality energy-conserving molecular


dynamics. Molecular-dynamics simulations have been
performed on larger systems and the results are published
elsewhere.

E. Plane-wave convergence 10 20 30 40 50
Epw(R&)
Figures 5—7 show the plane-wave convergence of total FIG. 6. Plane-wave convergence of the binding energy for
energy, binding energies, and bond distances obtained dimers. AE is the binding energy relative to the result ob-
with the PAW method for all first-row elements (except tained with Epw —50 Ry. The symbols are the same as in
carbon) and iron as an example of a transition metal. Fig. 5. See text for details.
Convergence to 0.1 eV is achieved at about 30— 40 Ry
even for difFicult cases, such as oxygen and fIuorine, and
cutofF that is substantially larger than twice the wave-
substantially earlier for other elements. Structural prop-
function plane-wave cutofF or where one has to resort to
erties such as the bond distances are accurate to 0.02ao
multigrid techniques such as described by Laasonen et
at a cutoff of 30 Ry, which is less than 1% of the bond
al. 4~
length. Binding energies converge faster than absolute
The plane-wave convergence of the PAW method is al-
total energies, and the error at 30 Ry is as small as 0.1 eV.
ready close to that of the LAPW method, which predicts
For oxygen, we can compare this plane-
mRy convergence of the total energy at a plane-wave cut-
wave convergence with that of Vanderbilt's ultrasoft off given by Epw = (5+/ ) /rMTao Ry, where /~a„ is
pseudopotential. The accuracy and the plane-wave con- the highest angular momentum of the wave functions and
vergence of our calculations are apparently comparable
rMT is the muffin-tin radius. For a system such as oxy-
to the "hardest" pseudopotential used. It is not unex-
gen with a muffin-tin radius of 1.1ao as is necessary for
pected that a similar plane-wave convergence is obtained molecular bonds, the plane-wave cutofF should be about
for the PAW method and Vanderbilt's ultrasoft pseu-
30 Ry. This is a better convergence than that produced
dopotentials if a similar choice of PS wave functions is
by the present implementation of the PAW method since
used in our calculation and in the construction of the
the former corresponds to mRy convergence.
ultrasoft pseudopotentials.
In all calculations shown here we expand the charge
density in plane waves up to a plane-wave cutoK that IX. COMPARISON WITH EXISTING METHODS
is only twice that used to expand the wave functions.
Hence many operations, such as Fourier transforms eval-
uation of the potential-energy part of E, are performed as One can observe that AE and the pseudopotential
if the plane-wave cutofF were only one-half of that actu- methods introduced in the past few years seem to con-
ally used. This is one advantage over Vanderbilt's ultra-
verge. Vanderbilt's ultrasoft pseudopotentials opened
soft pseudopotentials, which either require a plane-wave the way to the efficient study of first-row and transi-
tion metals using a plane-wave-based pseudopotential
approach. Conversely Goedecker and Maschke have

04
0.3
0.2
E5
D
0.1
I I
0.0
10 20 30 40 50
Epw(R&) —0.1
10 20 30 40 50
FIG. 5. Plane-wave convergence of the atomic total energy E,~(Ry)
for 6rst-row elements and iron. Ep~ is the plane-wave cutofF
for the wave function. AE is the total energy relative to the FIG. 7. Plane-wave convergence of dimer bond lengths.
result obtained with Ep~ = 50 Ry. The following symbols AE is the bond length relative to the result obtained with
are used: H (A), Li (*), Be (D ), 8 (~), N (a), 0 (0), F (CI), Ep~ = 50 Ry. The symbols are the same as in Fig. 5. See
aud Fe (Q). For detai1s see text. text for details.
50 PROJECTOR AUGMENTED-WAVE METHOD 17 975

shown that the LAPW xnethod can be simplified to yield


pseudopotentials and introduced techniques very similar
to the pseudopotential approach into the LAPW method.
I believe that the PAW method actually bridges the gap (i25)
between these two approaches, namely, the pseudopoten- and the constant is the so-called self-energy
tial approach and the augmented-wave methods. On the
one hand, it is an augmented-wave method that yields
the full wave functions. It can be regarded, in a sense,
E-~r = E'~ —E'a — ) (@ ~, I" ~l@ ~, ). (126)
as a generalization of the LAPW method. On the other
hand, xnost of the operations done on the plane-wave part The potentials v ~, v ~, and v are obtained from the
are in fact identical to those perforxned in the plane-wave- charge densities n and n of the atoms. Also compen-
based pseudopotential approach. I view this as a promis- sation charge densities in E are kept &ozen and are im-
ing developxnent, as it allows the virtues of two separate ported &om the isolated atoms. The first-order change of
techniques to be combined. In this section I will there- the total energy with respect to a change in the compen-
fore discuss the relationship between the PAW method sation densities has been absorbed by the nonlocal PS
and other approaches. potential. The approximations are well justified as long
as the PS charge density is sufficiently close to the AE
charge density. If the condition of norm conservation is
A. Pseudopotential method ixnposed, the overlap operator is identical to unity.
We thus obtain the form for the total-energy functional
1. No~-coneeming peeudopotentials that is well known &om the pseudopotential method,
which is none other than that actually implemented in
The pseudopotential approach, based on generalized the original Car-Parrinello codes using generalized sep-
separable potentials, ' can actually be derived &oxn arable pseudopotentials. Achieving a close similarity of
the PAW method by making one well-defined approxi- the computationally demanding plane-wave expressions
xnation. This way of approaching the pseudopotential to existing Car-Parrinello codes was one of my goals while
theory sheds light on the underlying principles of the deriving the PAW xnethod. This allows us to make direct
pseudopotential approach. Furthermore, the comparison use of the technology of existing methods, while incorpo-
will provide a one-to-one correspondence between quanti- rating the full-wave functions.
ties in the PAW method and those in the pseudopotential The major di8'erence in the computational eff'ort re-
method. In order to alert the reader to this fact, I have quired for the PAW method and the traditional pseu-
called the plane-wave part of the wave function the "PS dopotential approach is the explicit inclusion of the one-
wave function. " center terms in the PAW method. The related eff'ort,
Before we start comparing PAW and the pseudopoten- however, scales linearly with the number of atoms and
tial approach, let us recall what is termed the "pseudopo- is in practice negligibly small. Furthermore, the PAW
tential approach. " A more extensive description can be method provides new Qexibility to increase computa-
found in Refs. 8 and 50. tional efficiency and accuracy. We can use larger core
A valid first-principles pseudopotential obeys the fol- radii, and by relaxing the norm-conservation condition
lowing conditions. we can use smoother PS partial waves and reduce the
(i) For the atom, the PS wave functions agree with the basis set just as in Vanderbilt's ultrasoft pseudopoten-
AE wave functions beyond the core region. tials. The trade-ofF of this, however, is that the PS wave
(ii) Atomic eigenvalues and the first energy deriva- functions have to be orthogonalized in the presence of a
tive of the logarithmic derivative of the energy-dependent nontrivial overlap operator. Hence the decision in favor
partial waves agree with those of the AE calculation. of or against the norm-conservation condition depends
(iii) The norm of the atomic PS wave function is iden- on the system under study.
tical to that of the true wave function.
Owing to the construction described in Sec. VI, the
PAW xnethod fulfills the first two conditions automat-
2. Vanderbilt's altraeoft peeudopotentials
ically. Even though the third condition is usually not
fulfilled, it can be imposed. Vanderbilt recently introduced ultrasoft pseudopoten-
In order to reduce the PAW method to the traditional tials that relax the norm-conservation condition. ' This
pseudopotential approach, we must make a Taylor expan- approach has two basic ingredients: First, the wave
sion of the total energy in terms of the deviation of the functions are not norm conserving and second, general-
ized separable pseudopotentials — an extension of the
one-center densities n and n &oxn their isolated atom
values. This yields, to first order, Kleinman-Bylander potentials — are used.
These two ingredients of the ultrasoft pseudopotentials
.
E = E, &&+ E+ ) (4„1v„&14„). (i24) and the PAW method are similar. In addition, the projec-
tor augmentation uses concepts &om the generalized sep-
arable pseudopotentials and, like other augxnented wave
The term linear in the density operator is a nonlocal pseu- schemes, its PS wave function or envelope function has a
dopotential diferent norm than that of the AE wave function. The
17 976 P. E. BLOCHL

difference, however, is that the PAW method is an all- respect to the one-particle energy, resulting in energy-
electron method and Vanderbilt's approach a pseudopo- independent basis functions of a type previously sug-
tential method. gested by Marcus in the context of the APW method,
(i) The PAW method avoids "pseudization" steps This allowed the wave functions to be obtained by ma-
to which one resorts when using pseudopotential ap- trix diagonalization. It also led to the definition of basis
proaches, be they norm-conserving pseudopotential ap- functions such as linear augmented-plane waves, linear
proaches or not. The PAW method, on the other hand, augmented muKn-tin orbitals, and many more.
works directly with the full-wave functions and poten- All linear methods have in common that two partial
tials and includes the core states. This is a nontrivial waves of the spherical part of the potential within the
problem; the notion that the full-wave functions cannot muon-tin sphere are matched at the sphere radius to a
be treated in a reasonable way on a regular grid was the so-called envelope function, which corresponds to the PS
reason to introduce augmented-wave and pseudopoten- wave function of the PAW method. The PAW method
tial methods. The PAW method follows here the tradi- modi6es precisely that principle. Instead of determining
tion of the augmented-wave methods, where the full-wave the coefEcients of the partial-wave expansion &om the
function is decomposed into various parts, each of which value and derivative of the PS wave function at some
can be handled conveniently in its own representation. sphere radius, it uses the more general principle of pro-
(ii) The PAW method provides a prescription to go jector augmentation. We have seen in Sec. II that the
back and forth between the PS wave functions and the scalar product with some projector function is the most
physical AE wave functions. The analogy between the genera/ way to determine these coeKcients linearly from
PAW method and the pseudopotential approach has been the PS wave function. There are other differences t;o
exploited successfully by Van de Walle and myself to re- the linear methods made possible by this more general
construct an approximate full-wave function &om a pseu- approach which provide the important practical advan-
dopotential calculation and obtain quantities that are not tages. Those will be discussed later.
directly accessible by the pseudopotential approach. The Here we show that the LAP W method is, in some
PAW method is more rigorous than a mere reconstruc- sense, a special case of the PAW method, namely, that it
tion of the wave functions because the full wave function is possible to formulate the augmentation by matching as
takes part in the screening process. in the linear methods by projector functions. That the
(iii) In the ultrasoft pseudopotentials the overlap oper- matching of the LAPW method can be expressed by pro-
ator and the local charges have been introduced to restore jector functions has been observed earlier by Goedecker
the scattering properties of the pseudopotential when the and Maschke. 69
norm-conservation condition is relaxed in order to obtain In the LAPW method the wave functions are expressed
smoother PS wave functions. In the PAW method, the by partial waves ~P ) and their energy derivatives ~P„) at
non-norm-conserving PS wave functions enter naturally sorae energy e within atom-centered mufBn-tin spheres
as in all other augmented-wave methods. O~ and by plane waves in the interstitial region 0, :
(iv) From the point of view of computational effort,
the PAW method and the ultrasoft pseudopotentials are
equivalent with respect to plane-wave convergence if a
similar construction. of non-norm-conserving PS wave where 0g„ is a step function that is unity within the aug-
functions is used. However, the PAW method is more mentation sphere O~ and zero outside. The coeKcients
eScient because it treats the one-center expansions on a and b,
radial grids, which effectively eliminates the related com-
putational cost, rather than in a plane-wave representa-
tion. Furthermore, the plane-wave cutoff for the charge
density can be chosen substantially lower in the PAW
method because the augmentation density is not directly are determined such that the wave function is differen-
added to the density grid. This cuts the computational tiable at the sphere radius. For reasons of simplicity we
effort for the Fourier transforms substantially. can write these equations for only one angular momen-
tum component of the wave function.
The same result can also be reproduced using projec-
B. Linear augmented-plane-wave method tor functions: First we construct two AE partial waves
per angular momentum and site as the partial wave ~P(~))
The present method is in the tradition of exist- for a given energy e, yielding ~P„), and its energy deriva-
ing augmented-wave methods. Augmented-wave func- tive ~P), where the overdot stands for energy derivative.
tions were originally invented by Slater. There, the Then we construct PS partial waves analogously to the
Lippmann-Schwinger equation for a mufBn-tin type po- procedure described in Sec. VI. When constructing the
tential is solved by matching the energy-dependent par- projector functions, the augmented-wave methods devi-
tial solution of the Schrodinger equation inside and out- ate from the recipe given in Sec. VI and build them up
side the mufBn-tin sphere. As this matching procedure is as superpositions ~p;) = P. (V' 8~„)~gz)c~;. The coeffi-
computationally extremely demanding, Andersen intro- cients c,-~ are determined such that the orthogonality con-
duced the linear methods, where the partial solutions dition (p;~P~) = b, ~ is fulfilled. The projector functions
from within the muon-tin sphere are linearized with corresponding to the linear methods are localized on the
50 PROJECTOR AUGMENTED-%AVE METHOD

two-dimensional muffin-tin surface, which excludes real den" variables in the APW method, e.g. , the shape of the
space summation to obtain the scalar products with the partial waves, that adjust to the actual spherical poten-
PS wave function. However, the latter can be evaluated tial. As the PA% xnethod uses frozen partial waves, this
in G space for any other representation of the PS wave problexn does not arise.
function in terms of analytical functions. There are ex- Second, in the Soler-Williaxns APW the linear approxi-
tensions of the LAPW method that use more than two mation has been abandoned in favor of so-called infinites-
partial waves, but they will not be discussed here, as I imal paneling. This means that every state has its own
only wish to illustrate the principle. The computational P, P functions obtained from the spherical potential at the
effort for this part of the augmentation is similar in the energy of that state. As a result the scattering properties
LAPW and the PAW methods if such a projection-type are correct for all one-particle energies given a spherical
approach is used in the LAP% method. potential. The PAW xnethod reproduces the scattering
In contrast to the PAW method, the LAPW method properties correctly over the entire energy range only in
requires the one-center expansion of the PS wave func- the ideal case of an infinite number of partial waves. In
tion to be exactly identical to the PS wave function it- this case the PAW wave functions are exact solutions for
self, which is computationally demanding. In the PAW the full nonspherical potential. In practice the wave func-
method there is no need to make the one-center expan- tions of the PAW are accurate in an arbitrarily large, but
sion of the PS wave function more accurate than the one- finite energy region.
center of the AE wave function and both are therefore Like previous implexnentations of the LMTO method,
obtained in a completely analogous way. In fact, this pro- the Soler-Williams APW employs the principle of addi-
cedure results in an extremely rapid convergence of the tive augmentation. In the PAW method the principles of
partial-wave expansion, which is due to the error cancel- additive augmentation are applied even more rigorously.
lations discussed earlier. The one-center expansions of Not only the 8 convergence, but also the partial-wave
the PS wave function are obtained toithout extra egort convergence for each angular momentum channel is ac-
because their coefficients are identical to those obtained celerated using this idea.
previously with the AE wave function.
Another difference between the LAPW and the PA%
methods is the use of &ozen partial waves imported from 2. Singh's projectov -basis techniqtce
an isolated atom as opposed to partial waves that ad-
just to the actual potential. At first glance this appears Recently, Singh 4 introduced an implementation of the
to be a disadvantage of the PAW method. However, if LAPW method {and the mixed-basis pseudopotential)
we count the number of variational degrees of freedom, method that is substantially more efficient than previous
we 6nd that, given the same number of plane waves, the implementations. Instead of matching the partial waves
PAW method has a more Bexible basis set, even though directly to the plane-wave part of the wave function, a
the partial-wave expansions often have fewer terms than set of analytical functions is first 6tted to the PS wave
the LAPW method. Whereas the LAPW method has function on the real-space grid points in a localized re-
complete fIexibility to adjust to the spherical part of the gion around each atom. Once an analytical expression
potential inside the muffin-tin spheres, the PAW method for the plane-wave part is given, the matching of partial
lets the plane waves of the unaugmented part of the PS waves and the integrations over the xnuffin-tin sphere can
wave function extend into the spheres and thus allows the be evaluated quickly. The advantage of this approach
wave functions to adjust to both spherical and nonspher- is that computational effort for the augmentation scales
ical potentials. This is the result of what is often called quadratically [or N21n(N)) with the number of atoms
"additive augmentation" and has been discussed in the N, compared to an N3 scaling in a pure G space formu-
Sec. VII. In combination with a fictitious Lagrangian for- lation. Another solution of the same problem has been
malism, the use of 6xed partial waves has the important developed in the pseudopotential approach.
advantage that the total energy is a unique function of There has been some confusion about the terms "pro-
only the PS wave functions and the atomic positions. jector basis function" in Singh's paper and the "projector
functions" of the PA% method. The two refer to different
objects. Since the projector functions are used to repre-
The APW method of Soles and Williams sent the PS wave function, the projector basis functions
of Singh's approach relate more closely to the PS partial
The APW method of Soler and Williams differs waves of the PAW method rather than to its projector
&om other ixnplementations of the LAPW method in sev- functions. However, also here are difFerences: The PS
eral ways. I will compare the main difFerences between partial waves of the PAW method are per construction
their APW method and the PAW xnethod. Most of what adapted to the PS potential, and the coeKcients of the
has already been said about the difference between the partial wave expansion are obtained by a scalar prod-
PAW method and the LAPW method also applies here. uct of my projector functions with the PS wave func-
First, Soler's APW method employs an iterative min- tion, whereas in Singh's approach these coefficients are
imization of the total energy, which is similar in spirit obtained by a least-squares-type 6t to the plane-wave
to the Car-Parrinello method. To my knowledge no part at the real-space grid points. The difFerence between
molecular-dynaxnics features have been implemented. the two approaches is most apparent &om the number of
The reason has been attributed to the problexn of "hid- projector functions and PS waves, which in PAW vary
17 978 P. E. BLOCHL

typically between four and fourteen per site, whereas in for his continuous support and encouragement. I would
Singh's approach several hundred functions are used. also like to thank K. Schwarz, D. Vanderbilt, P. Margi,
Singh's approach can be applied equally to the J. Hutter, and M. Parrinello for useful comments after
pseudopotential formalism and to all augmented-wave reading the manuscript.
schemes including the PA%i method. %hen convergence
with the number of Singh's projector functions and plane-
wave cutoff is obtained, it reproduces exactly the elec- APPENDIX A: RADIAL SCHRODINGER
tronic structure method to which it has been applied. EQUATION WITH A SEPARABLE POTENTIAL

In order to test the scattering properties it is necessary


X. CONCLUSIONS to solve the radial Schrodinger equation using a separable
pseudopotential. An approach is sketched here brie6y:
An electronic structure method has been described
that works directly on the full valence and core wave ——V' +8 —E
functions and allows highly accurate first-principles 2
molecular-dynaxnics simulation to be performed. It has
been demonstrated that the accuracy of the PAW method +).Ip')(dH, , +.do;, )(P sl~) =o. (AI)
matches that of other state-of-the-art electronic structure
methods based on the I DA and that high-quality first- An ansatz for the solution is
principles molecular-dynamics simulations are possible
using this approach. To the best of my knowledge, this I@) = lu)+). I~*)c*
method was used in the first molecular-dynamics simula-
tion using an all-electron method. The method bridges
the gap between the existing augmented-wave methods with lu) and Iut;) defined as
and the pseudopotential methods and underscores the re-
lationship between these two approaches. Compared to
(--', V'+ e —.
)Iu) = O (A3)
the existing approaches, the present method is expected and
to be more efBcient, given a similar level of optimization.
The method can be incorporated into existing pseudopo-
(
—2'7'+ u —e)l~') = lp')
tential codes with relatively minor additional effort.
After inserting this ansatz into Eq. (Al) we obtain an
equation that determines the coefFicients c, as
ACKNOW'LEDGMENTS

I wouldlike to express my gratitude to O. K. Anderson,


who taught me the basics that allowed me to develop
c, =— )j, l
8,,+ ) Ie
(dH;t, —edO;t, )(pt, lut, )

PAW. Furthermore, I am indebted to S. T. Pantelides x(dH, i —edO, i)(poilu) . (A5)

P. Hohenberg and W. Kohn, Phys. Rev. 136, B664 (1964). J.M. Soler and A. R. Williams, Phys. Rev. B 42, 9728
W. Kohn and L.J. Sham, Phys. Rev. 140, B1133 (1965). (1990).
O.K. Andersen, Phys. Rev. B 12, 3060 (1975). J.M. Soler and A. R. Williams, Phys. Rev. B 4'F, 6784
J.C. Slater, Phys. Rev. 51, 846 (1937). (1993).
P.M. Marcus, Int. J. Quantum. Chem. 1S, 567 (1967). D. Singh, Phys. Rev. B 40, 5428 (1989).
J. Korringa, Physica 13, 392 (1947). T. Oguchi, in Interatomic Potential and Structural Stabit-
W. Kohn and N. Rostocker, Phys. Rev. 94, ill (1954). ity, edited by K. Terakura and H. Akai (Springer, Berlin,
D.R. Hamann, M. Schliiter, and C. Chiang, Phys. Rev. 1993), p. 33.
Lett. 4$, 1494 (1979). A set of functions that is complete in a region can be used
D. Vanderbilt, Phys. Rev. B 41, 7892 (1985) to expand any function vrithin that region.
K. Laasonen, R. Car, C. Lee, and D. Vanderbilt, Phys. ' P.E. Blochl, Phys. Rev. B 41, 5414 (1990).
Rev. B 43, 6796 (1991). R. Car (private communication).
R. Car and M. Parrinello, Phys. Rev. Lett. 55, 2471 (1985). S.F. Boys, Proc. R. Soc. London Ser. A 200, 542 (1950).
G.G. Van de Walle and P.E. Blochl, Phys. Rev. B 4V, 4244 S. Obara and A. Saika, J. Chem. Phys. 84, 3963 (1986).
(1993). V.R. Saunders, in Methods in Cmnputationa/ Molecular
P. Blaha, K. Schwarz, and P.H. Dederichs, Phys. Rev. B Physics, edited by G.H. F. Diercksen and S. Wilson (Reidel,
37, 2792 (1988). Dordrecht, 1983), pp. 1—36.
K. Schwarz and P. Blaha, Z. Naturforsch. A 47, 197 (1992). K.H. Weyrich, Phys. Rev. B O'7, 10269 (1988).
J.M. Soler and A. R. Williams, Phys. Rev. B 40, 1560 P.E. Blochl, Ph. D. thesis, University of Stuttgart, Ger-
(1989). many, 1989.
50 PROJECTOR AUGMENTED-WAVE METHOD 17 979

P. Ehrenfest, Z. Phys. 45, 455 (1927). Sciences Vol. 41 (Springer, Berlin, 1984).
H. Hellmann, Einfiihrung in die qnantenchemie (Deuticke, A.D. Becke, J. Chem. Phys. 97, 9173 (1992).
Leipzig, 1937). P.A. Serena, A. BaratofF, and J.M. Soler, Phys. Rev. B 48,
R.P. Feynman, Phys. Rev. 56, 340 (1939). 2046 (1993).
D. G. Pettifor, Commun. Phys. 1, 141 (1979). R.M. Dickson and A. Becke, J. Chem. Phys. 99, 3898
A. R. Mackintosh and O.K. Andersen, in Electrons at the (1993).
Fermi Surface, edited by M. Springford (Cambridge Uni- S. Dhar and N. R. Kestner, Phys. Rev. A 38, 1111 (1988).
versity Press, Cambridge, England, 1980), p. 187. J.L. Chen, C.S. Wang, K.A. Jackson, and M.R. Pederson,
V. Heine, Solid State Phys. 35, 1 (1980). Phys. Rev. B 44, 6558 (1991).
O. H. Nielsen and R.M. Martin, Phys. Rev. B 32, 1780 J.P. Perdew and A. Zunger, Phys. Rev. B 23, 5048 (1981).
(1985). D.M. Ceperley and B.J. Alder, Phys. Rev. Lett. 45, 566
P. Pulay, Mol. Phys. 17, 197 (1969). (1980).
J. Harris, R.O. Jones, and J. E. Miiller, J. Chem. Phys. H. Chen, M. Krasowski, and G. Fitzgerald, J. Chem. Phys.
75, 3904 (1981). 98, 8710 (1993).
L. Verlet, Phys. Rev. 159, 98 (1967). P. Margi, P.E. Blochl, and K. Schwarz, in Computations
G. Pastore, E. Smargiassi, and F. Buda, Phys. Rev. A 44, for the Nano Scale, edited by P.E. Blochl et al. (Kluwer,
6334 (1991). Dordrecht, 1993), p. 153.
J.-P. Ryckaert, G. Ciccotti, and H. J.C. Berendsen, J. Com- P. Margi, P.E. Blochl, and K. Schwarz, J. Chem. Phys.
put. Phys. 23, 327 (1977). 100, 8194 (1994).
R. Car and M. Parrinello, in Simple Molecular Systems at A. J. Fisher and P.E. Blochl, Phys. Rev. Lett. 70, 3263
Very High Density, edited by A. Polian et al. (Plenum, New (1993).
York, 1989), p. 455. A. J. Fisher and P.E. Blochl, in Computations for the Nano
K. Laasonen, A. Pasquarello, R. Car, C. Lee, and D. Van- Scale (Ref. 62), p. 185.
derbilt, Phys. Rev. B 47, 10142 (1993). R. Nesper, K. Vogel, and P.E. Blochl, Angew. Chem. Int.
S. Nose, J. Chem. Phys. 81, 511 (1984). Ed. Engl. 32, 701 (1993).
P.E. Blochl and M. Parrinello, Phys. Rev. B 45, 9413 P. Carloni, P.E. Blochl, and M. Parrinello (unpublished).
(1992). The convergence of the binding energy of Fez has been stud-
M. Parrinello (private communication). ied by comparing a plane-wave PAW calculation of Fe +.
G. P. Kerker, J. Phys. C 13, 1189 (1980). S. Goedecker and K. Maschke, Phys. Rev. B 42, 8858
A. M. Rappe, K.M. Rabe, E. Kaxiras, and J.D. Joannopou- (1990).
los, Phys. Rev. B 41, 1227 (1990). L. Kleinman and D.M. Bylander, Phys. Rev. Lett. 48, 1425
J.S. Lin, A. Qteish, M. C. Payne, and V. Heine, Phys. Rev. (1982).
B 47, 4174 (1993). Matrix elements of V 8O can be resolved by performing
N. Troullier and J.L. Martins, Phys. Rev. B 43, 1993 partial integration two times and by the application of
(1991). Gauss's theorem as (fr, ~(V' Hn)~gri) = bl, rir„(fr8„gl, +
D.D. Koelling and B.N. Harmon, J. Phys. C 10, 3107 gr, 8 fr. ), where rn is the radius of the muFin-tin sphere.
(1977). fr, and B„fr, are the value and the derivative of the radial
G.B. Bachelet, D.R. Hamann, and M. Schluter, Phys. Rev. part of fr, ) at the muKn-tin radius.
~

B 26, 4199 (1982). D.J. Singh, Phys. Rev. B 43, 6388 (1991).
D.R. Hamann, Phys. Rev. B 40, 2980 (1989). J.M. Soler (private communication).
A right-hand eigenvalue is a value A that has a nontrivial D. Singh, Phys. Rev. B 46, 13065 (1992).
solution c; to P. a;ic~. = c, A. R.D. King-Smith, M. C. Payne, and J.S. Lin, Phys. Rev. B
H. L. Skriver, in The LMTO Method, edited by M. Cardona, 44, 13063 (1991).
P. Fulde, and H. J. Queisser, Springer Series in Solid State

Anda mungkin juga menyukai