next up previous contents
Next: Corrected penalty functional method Up: Kohn's method Previous: Variational principle   Contents


Implementation problems

The conjugate gradients algorithm for minimising functions is described in appendix B. Throughout the lengthy derivation it is clear that the useful results obtained and the remarkably simple final result are due to the special properties of quadratic functions. Any function may be expanded in the form of a Taylor series about an analytic point, and around a minimum where the first order term from the gradient vanishes, a quadratic function is generally a good approximation. However, we note that the Kohn penalty functional has a branch point from the square-root function exactly at the ground-state minimum which we seek, and so the function cannot be Taylor-expanded there. Local information from the gradient cannot be used to infer the global shape of the function. This is illustrated in figure 6.3 for the case of a parabolic interpolation to find a line minimum based upon the gradient and a trial step, but the problem is even worse in the multi-dimensional space since the ``conjugate'' directions constructed from the gradients will not point in the direction of the ground-state minimum.

Figure 6.3: Failure of quadratic interpolation for Kohn's penalty functional.

\includegraphics [height=8cm]{oops.eps}

This problem is reflected in the very poor convergence when an attempt is made to minimise the functional using conjugate gradients: the steepest descents method actually performs better because it does not assume global quadratic behaviour. Also, the penalty functional does not vanish at the minimum sufficiently quickly as the parameter $\alpha $ is increased. However, the root-mean-square error in the occupation numbers $\overline{\delta f}$, given by

\begin{displaymath}
\overline{\delta f} = \frac{{\cal P}[\rho]}{\sqrt{N/2}}
\end{displaymath} (6.14)

in fact decays more rapidly, so that the total energy calculated at the minimum is quite accurate, although it is neither variational nor an upper bound. Also, although the total functional ${\tilde {\cal Q}}[\rho;\mu,\alpha]$ decreases monotonically, the total energy does not. Thus no advantage is gained by using the variational property, since it can only be applied to the total energy when ${\cal P}[\rho] = 0$. The variational property of the total functional is that it is minimal at the ground-state, but this minimum is defined in terms of the functional taking its minimum value there, not in terms of a vanishing gradient (the gradient being undefined at the ground-state). Because of the non-variational behaviour of the total energy with respect to the occupation numbers at the ground-state, it is impossible to construct a penalty functional which has a continuous first derivative at the ground-state and also results in a variational principle for the total energy.

Figure 6.4: Convergence properties of Kohn's penalty functional: behaviour of penalty functional and occupation numbers with $\alpha $.

=1mm
\begin{picture}(150,85)
\put(25,5){\includegraphics [width=10cm]{kohn.eps}}
\put...
...{$\alpha {\cal P}[\rho]$\ /eV}
\put(130,50){$\overline{\delta f}$}
\end{picture}

In figure 6.4 we present the results of tests on an 8-atom silicon cell to demonstrate the behaviour of the functional. As the penalty functional parameter $\alpha $ is increased, both the contribution of the penalty functional to the total functional $\alpha {\cal P}[\rho]$, and the root mean square error in the occupation numbers $\overline{\delta f}$ decrease, but not rapidly enough with $\alpha $ since the number of iterations required to reach convergence increases with $\alpha $ making the calculations too expensive for practical applications. For example, the number of iterations required to converge the total functional to $0.01$ eV per atom increases by a factor of more than ten when $\alpha $ is increased from 100 eV to 1000 eV. Even with the smaller value for $\alpha $, the rate of convergence is much slower than traditional methods, and this is due to the incompatibility of the functional with the conjugate gradients scheme.


next up previous contents
Next: Corrected penalty functional method Up: Kohn's method Previous: Variational principle   Contents
Peter D. Haynes
1999-09-21