This note gives a derivation of the Born-Oppenheimer Hamiltonian eigenvalue problems (9.13) for the wave functions of the nuclei.
First consider an exact eigenfunction
of the complete system,
including both the electrons and the nuclei fully. Can it be related
somehow to the simpler electron eigenfunctions
that ignored nuclear kinetic energy? Yes
it can. For any given set of nuclear coordinates, the electron
eigenfunctions are complete; they are the eigenfunctions of an
Hermitian electron Hamiltonian. And that means that you can for any
given set of nuclear coordinates write the exact wave function as
So, to be really precise, the wave function of
electrons and
nuclei can be written as:

Consider what this means physically. By construction, the square
electron eigenfunctions
give the probability of
finding the electrons assuming that they are in eigenstate
and that the nuclei are at the positions listed in the final
arguments of the electron eigenfunction. But then the probability
that the nuclei are actually at those positions, and that the
electrons are actually in eigenstate
, will have to be
. After all, the full wave function
must describe
the probability for the entire system to actually be in a
specific state. That means that
must be the nuclear wave
function
for when the electrons are in energy eigenstate
. So from now on, just call it
instead of
. The full wave function is then
| (D.29) |
In the unsteady case, the
, hence the
, will also
be functions of time. The
will remain time independent
as long as no explicitly time-dependent terms are added. The
derivation then goes exactly the same way as the time-independent
Schrödinger equation (Hamiltonian eigenvalue problem) derived below,
with
replacing
.
So far, no approximations have been made; the only thing that has been
done is to define the nuclear wave functions
. But the
objective is still to derive the claimed equation (9.13)
for them. To do so plug the expression
into the exact Hamiltonian
eigenvalue problem:
Note first that the eigenfunctions can be taken to be real since the
Hamiltonian is real. If the eigenfunctions were complex, then their
real and imaginary parts separately would be eigenfunctions, and both
of these are real. This argument applies to both the electron
eigenfunctions separately as well as to the full eigenfunction. The
trick is now to take an inner product of the equation above with a
chosen electron eigenfunction
. More precisely, multiply
the entire equation by
, and integrate/sum over the
electron coordinates and spins only, keeping the nuclear positions and
spins at fixed values.
What do you get? Consider the terms in reverse order, from right to
left. In the right hand side, the electron-coordinate inner product
is zero unless
, and
then it is one, since the electron wave functions are orthonormal for
given nuclear coordinates. So all we have left in the right-hand side
is
, Check,
is the correct right hand side in
the nuclear-wave-function Hamiltonian eigenvalue problem
(9.13).
Turning to the latter four terms in the left-hand side, remember that
by definition the electron eigenfunctions satisfy
That leaves only the nuclear kinetic term, and that one is a bit
tricky. Recalling the definition (9.4) of the kinetic
energy operator
in terms of the nuclear coordinate Laplacians,
you have
Remember that not just the nuclear wave functions, but also the
electron wave functions depend on the nuclear coordinates. So, if you
differentiate the product, you get
Now if you take the inner product with electron eigenfunction
, the first term gives you what you need, the expression
for the kinetic energy of the nuclei. But you do not want the other
two terms; these terms have the nuclear kinetic energy
differentiations at least in part on the electron wave function
instead of on the nuclear wave function.
Well, whether you like it or not, the exact equation is, collecting
all terms and rearranging,
The first thing to note is the final sum in (D.30). Unless
you can talk away this sum as negligible, (9.13) is not
valid. The “off-diagonal” coefficients, the
for
, are particularly bad news, because they produce
interactions between the different potential energy surfaces, shifting
energy from one value of
to another. These off-diagonal terms are
called “vibronic coupling terms.” (The word is a contraction of
“vibration” and “electronic,” if you are
wondering.)
Let’s have a closer look at (D.31) and
(D.32) to see how big the various terms really are. At
first appearance it might seem that both the nuclear kinetic energy
and the coefficients
can be ignored, since both are
inversely proportional to the nuclear masses, hence apparently
thousands of times smaller than the electronic kinetic energy included
in
. But do not go too quick here. First ballpark the
typical derivative,
when applied
to the nuclear wave function. You can estimate such a derivative as
, where
is the typical length over which there
are significant changes in a nuclear wave function
. Well,
there are significant changes in nuclear wave functions if you go from
the middle of a nucleus to its outside, and that is a very small
distance compared to the typical size of the electron blob
.
It means that the distance
is small. So the relative
importance of the nuclear kinetic energy increases by a factor
relative to the electron kinetic energy,
compensating quite a lot for the much higher nuclear mass. So keeping
the nuclear kinetic energy is definitely a good idea.
How about the coefficients
? Well, normally the
electron eigenfunctions only change appreciable when you vary the
nuclear positions over a length comparable to the electron blob scale
. Think back of the example of the hydrogen molecule. The
ground state separation between the nuclei was found as 0.87Å. But
you would not see a dramatic change in electron wave functions if you
made it a few percent more or less. To see a dramatic change, you
would have to make the nuclear distance 1.5Å, for example. So the
derivatives
applied to the
electron wave functions are normally not by far as large as those
applied to the nuclear wave functions, hence the
terms are
relatively small compared to the nuclear kinetic energy, and ignoring
them is usually justified. So the final conclusion is that equation
(9.13) is usually justified.
But there are exceptions. If different energy levels get close
together, the electron wave functions become very sensitive to small
effects, including small changes in the nuclear positions. When the
wave functions have become sensitive enough that they vary
significantly under nuclear position changes comparable in size to the
nuclear wave function blobs, you can no longer ignore the
terms and (9.13) becomes invalid.
You can be a bit more precise about that claim with a few tricks.
Consider the factors
For
, the following trick works:

As far as the final term in
is concerned, like the second
term, you would expect it to become important when the scale of
nontrivial changes in electron wave functions with nuclear positions
becomes comparable to the size of the nuclear wave functions. You can
be a little bit more precise by taking one more derivative of the
inner product expression derived above,
The diagonal part of matrix
, i.e. the
terms, is
somewhat interesting since it produces a change in effective energy
without involving interactions with the other potential energy
surfaces, i.e. without interaction with the
for
. The diagonal part is called the “Born-Oppenheimer diagonal correction.” Since as noted above,
the first term in the expression (D.32) for the
does not have a diagonal part, the diagonal correction is given by the
second term.
Note that in a transient case that starts out as a single nuclear wave
function
, the diagonal term
multiplies the
predominant nuclear wave function
, while the off-diagonal
terms only multiply the small other nuclear wave functions. So
despite not involving any derivative of the nuclear wave function, the
diagonal term will initially be the main correction to the
Born-Oppenheimer approximation. It will remain important at later
times.