This note deduces the canonical probability distribution. Since the derivations in typical textbooks seem crazily convoluted and the made assumptions not at all as self-evident as the authors suggest, a more mathematical approach will be followed here.
Consider a big system consisting of many smaller subsystems
with a given total energy
. Call the combined system
the collective. Following the same reasoning as in derivation
{D.59} for two systems, the thermodynamically stable
equilibrium state has shelf occupation numbers of the subsystems
satisfying

An individual system, take
as the example, no longer has an
individual energy that is for certain. Only the collective has that.
That means that when
is taken out of the collective, its shelf
occupation numbers will have to be described in terms of
probabilities. There will still be an expectation value for the
energy of the system, but system energy eigenfunctions
with somewhat different energy
can no longer be excluded
with certainty. However, still assume, following the fundamental
assumption of quantum statistics, {N.23}, that the
physical differences between the system energy eigenfunctions do not
make (enough of) a difference to affect which ones are likely or not.
So, the probability
of a system eigenfunction
will be assumed to depend only on its energy
:
For the isolated example system
, the question is now no longer
“What shelf numbers have the most eigenfunctions?” but
“What shelf numbers have the highest probability?” Note
that all system eigenfunctions
for a given set of
shelf numbers
have the same system energy
. Therefore, the
probability of a given set of shelf numbers
will be
the number of eigenfunctions with those shelf numbers times the
probability of each individual eigenfunction:
Mathematically, the function whose partial derivatives must be zero to
find the most probable shelf numbers is
Substituting
, taking
apart the logarithm, and differentiating, produces
It follows that
at least in
the vicinity of the most probable energy
. Hence in
the vicinity of that energy
Note that the canonical probability is self-consistent: if two systems
at the same temperature are combined, the probabilities of the
combined eigenfunctions multiply, as in