Physics:Bose–Einstein statistics
Statistical mechanics 

In quantum statistics, Bose–Einstein (B–E) statistics describes one of two possible ways in which a collection of noninteracting, indistinguishable particles may occupy a set of available discrete energy states at thermodynamic equilibrium. The aggregation of particles in the same state, which is a characteristic of particles obeying Bose–Einstein statistics, accounts for the cohesive streaming of laser light and the frictionless creeping of superfluid helium. The theory of this behaviour was developed (1924–25) by Satyendra Nath Bose, who recognized that a collection of identical and indistinguishable particles can be distributed in this way. The idea was later adopted and extended by Albert Einstein in collaboration with Bose.
The Bose–Einstein statistics applies only to the particles not limited to single occupancy of the same state – that is, particles that do not obey the Pauli exclusion principle restrictions. Such particles have integer values of spin and are named bosons.
Bose–Einstein distribution
At low temperatures, bosons behave differently from fermions (which obey the Fermi–Dirac statistics) in a way that an unlimited number of them can "condense" into the same energy state. This apparently unusual property also gives rise to the special state of matter – the Bose–Einstein condensate. Fermi–Dirac and Bose–Einstein statistics apply when quantum effects are important and the particles are "indistinguishable". Quantum effects appear if the concentration of particles satisfies
[math]\displaystyle{ \frac{N}{V} \ge n_q, }[/math]
where N is the number of particles, V is the volume, and n_{q} is the quantum concentration, for which the interparticle distance is equal to the thermal de Broglie wavelength, so that the wavefunctions of the particles are barely overlapping.
Fermi–Dirac statistics applies to fermions (particles that obey the Pauli exclusion principle), and Bose–Einstein statistics applies to bosons. As the quantum concentration depends on temperature, most systems at high temperatures obey the classical (Maxwell–Boltzmann) limit, unless they also have a very high density, as for a white dwarf. Both Fermi–Dirac and Bose–Einstein become Maxwell–Boltzmann statistics at high temperature or at low concentration.
B–E statistics was introduced for photons in 1924 by Bose and generalized to atoms by Einstein in 1924–25.
The expected number of particles in an energy state i for B–E statistics is:
[math]\displaystyle{ \bar{n}_i = \frac{g_i}{e^{(\varepsilon_i\mu) / k_\text{B} T}  1} }[/math]
with ε_{i} > μ and where n_{i} is the occupation number (the number of particles) in state i, [math]\displaystyle{ g_i }[/math] is the degeneracy of energy level i, ε_{i} is the energy of the ith state, μ is the chemical potential, k_{B} is the Boltzmann constant, and T is absolute temperature.
The variance of this distribution [math]\displaystyle{ V(n) }[/math] is calculated directly from the expression above for the average number.^{[1]}
[math]\displaystyle{ V(n) = kT\frac{\partial}{\partial \mu}\bar{n}_i = \langle n\rangle(1+\langle n\rangle) = \bar{n} + \bar{n}^2 }[/math]
For comparison, the average number of fermions with energy [math]\displaystyle{ \varepsilon_i }[/math] given by Fermi–Dirac particleenergy distribution has a similar form:
[math]\displaystyle{ \bar{n}_i(\varepsilon_i) = \frac{g_i}{e^{(\varepsilon_i  \mu)/k_\text{B}T} + 1}. }[/math]
As mentioned above, both the Bose–Einstein distribution and the Fermi–Dirac distribution approaches the Maxwell–Boltzmann distribution in the limit of high temperature and low particle density, without the need for any ad hoc assumptions:
 In the limit of low particle density, [math]\displaystyle{ \bar{n}_i = \frac{g_i}{e^{(\varepsilon_i\mu)/k_\text{B}T}\pm 1} \ll 1 }[/math], therefore [math]\displaystyle{ e^{(\varepsilon_i\mu)/k_\text{B}T} \pm 1 \gg 1 }[/math] or equivalently [math]\displaystyle{ e^{(\varepsilon_i\mu)/k_\text{B}T} \gg 1 }[/math]. In that case, [math]\displaystyle{ \bar{n}_i \approx \frac{g_i}{e^{(\varepsilon_i\mu)/k_\text{B}T}}=\frac{1}{Z}e^{\varepsilon_i/k_\text{B} T} }[/math], which is the result from Maxwell–Boltzmann statistics.
 In the limit of high temperature, the particles are distributed over a large range of energy values, therefore the occupancy on each state (especially the high energy ones with [math]\displaystyle{ \varepsilon_i  \mu \gg k_\text{B}T }[/math]) is again very small, [math]\displaystyle{ \bar{n}_i = \frac{g_i}{e^{(\varepsilon_i\mu)/k_\text{B}T} \pm 1} \ll 1 }[/math]. This again reduces to Maxwell–Boltzmann statistics.
In addition to reducing to the Maxwell–Boltzmann distribution in the limit of high [math]\displaystyle{ T }[/math] and low density, B–E statistics also reduces to Rayleigh–Jeans law distribution for low energy states with
[math]\displaystyle{ \varepsilon_i  \mu \ll k_\text{B}T }[/math], namely
[math]\displaystyle{ \begin{align} \bar{n}_i & = \frac{g_i}{e^{(\varepsilon_i\mu)/k_{\text{B}}T}1} \\ &\approx \frac{g_i}{(\varepsilon_i\mu)/k_\text{B}T} = \frac{g_i k_\text{B}T}{\varepsilon_i  \mu}. \end{align} }[/math]
History
Władysław Natanson in 1911 concluded that Planck's law requires indistinguishability of "units of energy", although he did not frame this in terms of Einstein's light quanta.^{[2]}^{[3]}
While presenting a lecture at the University of Dhaka (in what was then British India and is now Bangladesh) on the theory of radiation and the ultraviolet catastrophe, Satyendra Nath Bose intended to show his students that the contemporary theory was inadequate, because it predicted results not in accordance with experimental results. During this lecture, Bose committed an error in applying the theory, which unexpectedly gave a prediction that agreed with the experiment. The error was a simple mistake—similar to arguing that flipping two fair coins will produce two heads onethird of the time—that would appear obviously wrong to anyone with a basic understanding of statistics (remarkably, this error resembled the famous blunder by d'Alembert known from his Croix ou Pile article^{[4]}^{[5]}). However, the results it predicted agreed with experiment, and Bose realized it might not be a mistake after all. For the first time, he took the position that the Maxwell–Boltzmann distribution would not be true for all microscopic particles at all scales. Thus, he studied the probability of finding particles in various states in phase space, where each state is a little patch having phase volume of h^{3}, and the position and momentum of the particles are not kept particularly separate but are considered as one variable.
Bose adapted this lecture into a short article called "Planck's law and the hypothesis of light quanta"^{[6]}^{[7]} and submitted it to the Philosophical Magazine. However, the referee's report was negative, and the paper was rejected. Undaunted, he sent the manuscript to Albert Einstein requesting publication in the Zeitschrift für Physik. Einstein immediately agreed, personally translated the article from English into German (Bose had earlier translated Einstein's article on the general theory of relativity from German to English), and saw to it that it was published. Bose's theory achieved respect when Einstein sent his own paper in support of Bose's to Zeitschrift für Physik, asking that they be published together. The paper came out in 1924.^{[8]}
The reason Bose produced accurate results was that since photons are indistinguishable from each other, one cannot treat any two photons having equal quantum numbers (e.g., polarization and momentum vector) as being two distinct identifiable photons. By analogy, if in an alternate universe coins were to behave like photons and other bosons, the probability of producing two heads would indeed be onethird, and so is the probability of getting a head and a tail which equals onehalf for the conventional (classical, distinguishable) coins. Bose's "error" leads to what is now called Bose–Einstein statistics.
Bose and Einstein extended the idea to atoms and this led to the prediction of the existence of phenomena which became known as Bose–Einstein condensate, a dense collection of bosons (which are particles with integer spin, named after Bose), which was demonstrated to exist by experiment in 1995.
Derivation
Derivation from the microcanonical ensemble
In the microcanonical ensemble, one considers a system with fixed energy, volume, and number of particles. We take a system composed of [math]\displaystyle{ N = \sum_i n_i }[/math] identical bosons, [math]\displaystyle{ n_i }[/math] of which have energy [math]\displaystyle{ \varepsilon_i }[/math] and are distributed over [math]\displaystyle{ g_i }[/math] levels or states with the same energy [math]\displaystyle{ \varepsilon_i }[/math], i.e. [math]\displaystyle{ g_i }[/math] is the degeneracy associated with energy [math]\displaystyle{ \varepsilon_i }[/math] of total energy [math]\displaystyle{ E = \sum_i n_i \varepsilon_i }[/math]. Calculation of the number of arrangements of [math]\displaystyle{ n_i }[/math] particles distributed among [math]\displaystyle{ g_i }[/math] states is a problem of combinatorics. Since particles are indistinguishable in the quantum mechanical context here, the number of ways for arranging [math]\displaystyle{ n_i }[/math] particles in [math]\displaystyle{ g_i }[/math] boxes (for the [math]\displaystyle{ i }[/math]th energy level) would be (see image).
[math]\displaystyle{ w_{i,\text{BE}} = \frac{(n_i+g_i1)!}{n_i! (g_i1)!} = C^{n_i+g_i1}_{n_i}, }[/math]
where [math]\displaystyle{ C^m_k }[/math] is the kcombination of a set with m elements. The total number of arrangements in an ensemble of bosons is simply the product of the binomial coefficients [math]\displaystyle{ C^{n_i+g_i1}_{n_i} }[/math] above over all the energy levels, i.e. [math]\displaystyle{ W_\text{BE} =\prod_i w_{i,\text{BE}}=\prod_i\frac{(n_i +g_i 1)!}{(g_i1)! n_i!}, }[/math]
The maximum number of arrangements determining the corresponding occupation number [math]\displaystyle{ n_i }[/math] is obtained by maximizing the entropy, or equivalently, setting [math]\displaystyle{ \mathrm{d}(\ln W_\text{BE}) = 0 }[/math] and taking the subsidiary conditions [math]\displaystyle{ N=\sum n_i, E=\sum_i n_i\varepsilon_i }[/math] into account (as Lagrange multipliers).^{[9]} The result for [math]\displaystyle{ n_i\gg 1 }[/math], [math]\displaystyle{ g_i\gg 1 }[/math], [math]\displaystyle{ n_i/g_i=O(1) }[/math] is the Bose–Einstein distribution.
Derivation from the grand canonical ensemble
The Bose–Einstein distribution, which applies only to a quantum system of noninteracting bosons, is naturally derived from the grand canonical ensemble without any approximations.^{[10]} In this ensemble, the system is able to exchange energy and exchange particles with a reservoir (temperature T and chemical potential µ fixed by the reservoir).
Due to the noninteracting quality, each available singleparticle level (with energy level ϵ) forms a separate thermodynamic system in contact with the reservoir. That is, the number of particles within the overall system that occupy a given single particle state form a subensemble that is also grand canonical ensemble; hence, it may be analysed through the construction of a grand partition function.
Every singleparticle state is of a fixed energy, [math]\displaystyle{ \varepsilon }[/math]. As the subensemble associated with a singleparticle state varies by the number of particles only, it is clear that the total energy of the subensemble is also directly proportional to the number of particles in the singleparticle state; where [math]\displaystyle{ N }[/math] is the number of particles, the total energy of the subensemble will then be [math]\displaystyle{ N\varepsilon }[/math]. Beginning with the standard expression for a grand partition function and replacing [math]\displaystyle{ E }[/math] with [math]\displaystyle{ N \varepsilon }[/math], the grand partition function takes the form
[math]\displaystyle{ \mathcal Z = \sum_N \exp((N\mu  N\varepsilon)/k_\text{B} T) = \sum_N \exp(N(\mu  \varepsilon)/k_\text{B} T) }[/math]
This formula applies to fermionic systems as well as bosonic systems. Fermi–Dirac statistics arises when considering the effect of the Pauli exclusion principle: whilst the number of fermions occupying the same singleparticle state can only be either 1 or 0, the number of bosons occupying a single particle state may be any integer. Thus, the grand partition function for bosons can be considered a geometric series and may be evaluated as such:
[math]\displaystyle{ \begin{align}\mathcal Z & = \sum_{N=0}^\infty \exp(N(\mu  \varepsilon)/k_\text{B} T) = \sum_{N=0}^\infty [\exp((\mu  \varepsilon)/k_\text{B}T)]^N \\ & = \frac{1}{1  \exp((\mu  \varepsilon)/k_\text{B} T)}.\end{align} }[/math]
Note that the geometric series is convergent only if [math]\displaystyle{ e^{(\mu  \varepsilon)/k_\text{B}T}\lt 1 }[/math], including the case where [math]\displaystyle{ \epsilon = 0 }[/math]. This implies that the chemical potential for the Bose gas must be negative, i.e., [math]\displaystyle{ \mu\lt 0 }[/math], whereas the Fermi gas is allowed to take both positive and negative values for the chemical potential.^{[11]}
The average particle number for that singleparticle substate is given by
[math]\displaystyle{ \langle N\rangle = k_\text{B} T \frac{1}{\mathcal Z} \left(\frac{\partial \mathcal Z}{\partial \mu}\right)_{V,T} = \frac{1}{\exp((\varepsilon\mu)/k_\text{B} T)1} }[/math] This result applies for each singleparticle level and thus forms the Bose–Einstein distribution for the entire state of the system.^{[12]}^{[13]}
The variance in particle number, [math]\displaystyle{ \sigma_N^2 = \langle N^2 \rangle  \langle N \rangle^2 }[/math], is:
[math]\displaystyle{ \sigma_N^2 = k_\text{B} T \left(\frac{d\langle N\rangle}{d\mu}\right)_{V,T} = \frac{\exp((\varepsilon\mu)/k_\text{B} T)}{(\exp((\varepsilon\mu)/k_\text{B} T)1)^2} = \langle N\rangle(1 + \langle N\rangle). }[/math]
As a result, for highly occupied states the standard deviation of the particle number of an energy level is very large, slightly larger than the particle number itself: [math]\displaystyle{ \sigma_N \approx \langle N\rangle }[/math]. This large uncertainty is due to the fact that the probability distribution for the number of bosons in a given energy level is a geometric distribution; somewhat counterintuitively, the most probable value for N is always 0. (In contrast, classical particles have instead a Poisson distribution in particle number for a given state, with a much smaller uncertainty of [math]\displaystyle{ \sigma_{N,{\rm classical}} = \sqrt{\langle N\rangle} }[/math], and with the mostprobable N value being near [math]\displaystyle{ \langle N \rangle }[/math].)
Derivation in the canonical approach
It is also possible to derive approximate Bose–Einstein statistics in the canonical ensemble. These derivations are lengthy and only yield the above results in the asymptotic limit of a large number of particles. The reason is that the total number of bosons is fixed in the canonical ensemble. The Bose–Einstein distribution in this case can be derived as in most texts by maximization, but the mathematically best derivation is by the Darwin–Fowler method of mean values as emphasized by Dingle.^{[14]} See also MüllerKirsten.^{[9]} The fluctuations of the ground state in the condensed region are however markedly different in the canonical and grandcanonical ensembles.^{[15]}
Suppose we have a number of energy levels, labeled by index [math]\displaystyle{ i }[/math], each level having energy [math]\displaystyle{ \varepsilon_i }[/math] and containing a total of [math]\displaystyle{ n_i }[/math] particles. Suppose each level contains [math]\displaystyle{ g_i }[/math] distinct sublevels, all of which have the same energy, and which are distinguishable. For example, two particles may have different momenta, in which case they are distinguishable from each other, yet they can still have the same energy. The value of [math]\displaystyle{ g_i }[/math] associated with level [math]\displaystyle{ i }[/math] is called the "degeneracy" of that energy level. Any number of bosons can occupy the same sublevel.
Let [math]\displaystyle{ w(n,g) }[/math] be the number of ways of distributing [math]\displaystyle{ n }[/math] particles among the [math]\displaystyle{ g }[/math] sublevels of an energy level. There is only one way of distributing [math]\displaystyle{ n }[/math] particles with one sublevel, therefore [math]\displaystyle{ w(n,1)=1 }[/math]. It is easy to see that there are [math]\displaystyle{ (n+1) }[/math] ways of distributing [math]\displaystyle{ n }[/math] particles in two sublevels which we will write as:
[math]\displaystyle{ w(n,2)=\frac{(n+1)!}{n!1!}. }[/math]
With a little thought (see Notes below) it can be seen that the number of ways of distributing [math]\displaystyle{ n }[/math] particles in three sublevels is
[math]\displaystyle{ w(n,3) = w(n,2) + w(n1,2) + \cdots + w(1,2) + w(0,2) }[/math] so that
[math]\displaystyle{ w(n,3)=\sum_{k=0}^n w(nk,2) = \sum_{k=0}^n\frac{(nk+1)!}{(nk)!1!}=\frac{(n+2)!}{n!2!} }[/math]
where we have used the following theorem involving binomial coefficients: [math]\displaystyle{ \sum_{k=0}^n\frac{(k+a)!}{k!a!}=\frac{(n+a+1)!}{n!(a+1)!}. }[/math]
Continuing this process, we can see that [math]\displaystyle{ w(n,g) }[/math] is just a binomial coefficient (See Notes below)
[math]\displaystyle{ w(n,g)=\frac{(n+g1)!}{n!(g1)!}. }[/math]
For example, the population numbers for two particles in three sublevels are 200, 110, 101, 020, 011, or 002 for a total of six which equals 4!/(2!2!). The number of ways that a set of occupation numbers [math]\displaystyle{ n_i }[/math] can be realized is the product of the ways that each individual energy level can be populated:
[math]\displaystyle{ W = \prod_i w(n_i,g_i) = \prod_i \frac{(n_i + g_i1)!}{n_i!(g_i1)!} \approx \prod_i \frac{(n_i+g_i)!}{n_i!(g_i)!} }[/math]
where the approximation assumes that [math]\displaystyle{ n_i \gg 1 }[/math].
Following the same procedure used in deriving the Maxwell–Boltzmann statistics, we wish to find the set of [math]\displaystyle{ n_i }[/math] for which W is maximised, subject to the constraint that there be a fixed total number of particles, and a fixed total energy. The maxima of [math]\displaystyle{ W }[/math] and [math]\displaystyle{ \ln(W) }[/math] occur at the same value of [math]\displaystyle{ n_i }[/math] and, since it is easier to accomplish mathematically, we will maximise the latter function instead. We constrain our solution using Lagrange multipliers forming the function:
[math]\displaystyle{ f(n_i)=\ln(W)+\alpha(N\sum n_i)+ \beta(E\sum n_i \varepsilon_i) }[/math]
Using the [math]\displaystyle{ n_i \gg 1 }[/math] approximation and using Stirling's approximation for the factorials [math]\displaystyle{ \left(x!\approx x^x\,e^{x}\,\sqrt{2\pi x}\right) }[/math] gives
[math]\displaystyle{ f(n_i)=\sum_i (n_i + g_i) \ln(n_i + g_i)  n_i \ln(n_i) +\alpha\left(N\sum n_i\right)+\beta\left(E\sum n_i \varepsilon_i\right)+K. }[/math]
Where K is the sum of a number of terms which are not functions of the [math]\displaystyle{ n_i }[/math]. Taking the derivative with respect to [math]\displaystyle{ n_i }[/math], and setting the result to zero and solving for [math]\displaystyle{ n_i }[/math], yields the Bose–Einstein population numbers:
[math]\displaystyle{ n_i = \frac{g_i}{e^{\alpha+\beta \varepsilon_i}1}. }[/math]
By a process similar to that outlined in the Maxwell–Boltzmann statistics article, it can be seen that: [math]\displaystyle{ d\ln W = \alpha\,dN + \beta\,dE }[/math]
which, using Boltzmann's famous relationship [math]\displaystyle{ S=k_\text{B}\,\ln W }[/math] becomes a statement of the second law of thermodynamics at constant volume, and it follows that [math]\displaystyle{ \beta = \frac{1}{k_\text{B}T} }[/math] and [math]\displaystyle{ \alpha =  \frac{\mu}{k_\text{B}T} }[/math] where S is the entropy, [math]\displaystyle{ \mu }[/math] is the chemical potential, k_{B} is Boltzmann's constant and T is the temperature, so that finally:
[math]\displaystyle{ n_i = \frac{g_i}{e^{(\varepsilon_i\mu)/k_\text{B}T}1}. }[/math]
Note that the above formula is sometimes written:
[math]\displaystyle{ n_i = \frac{g_i}{e^{\varepsilon_i/k_\text{B}T}/z1}, }[/math]
where [math]\displaystyle{ z=\exp(\mu/k_\text{B}T) }[/math] is the absolute activity, as noted by McQuarrie.^{[16]}
Also note that when the particle numbers are not conserved, removing the conservation of particle numbers constraint is equivalent to setting [math]\displaystyle{ \alpha }[/math] and therefore the chemical potential [math]\displaystyle{ \mu }[/math] to zero. This will be the case for photons and massive particles in mutual equilibrium and the resulting distribution will be the Planck distribution.
A much simpler way to think of Bose–Einstein distribution function is to consider that n particles are denoted by identical balls and g shells are marked by g1 line partitions. It is clear that the permutations of these n balls and g − 1 partitions will give different ways of arranging bosons in different energy levels. Say, for 3 (= n) particles and 3 (= g) shells, therefore (g − 1) = 2, the arrangement might be ●●●, or ●●●, or ●●● , etc. Hence the number of distinct permutations of n + (g − 1) objects which have n identical items and (g − 1) identical items will be:
[math]\displaystyle{ \frac{(g1 + n)!}{(g1)! n!} }[/math]
See the image for a visual representation of one such distribution of n particles in g boxes that can be represented as g − 1 partitions.OR
The purpose of these notes is to clarify some aspects of the derivation of the Bose–Einstein (B–E) distribution for beginners. The enumeration of cases (or ways) in the B–E distribution can be recast as follows. Consider a game of dice throwing in which there are [math]\displaystyle{ n }[/math] dice, with each die taking values in the set[math]\displaystyle{ \{ 1, \dots, g \} }[/math], for [math]\displaystyle{ g \ge 1 }[/math]. The constraints of the game are that the value of a die [math]\displaystyle{ i }[/math], denoted by [math]\displaystyle{ m_i }[/math], has to be greater than or equal to the value of die [math]\displaystyle{ (i1) }[/math], denoted by [math]\displaystyle{ m_{i1} }[/math], in the previous throw, i.e., [math]\displaystyle{ m_i \ge m_{i1} }[/math]. Thus a valid sequence of die throws can be described by an ntuple [math]\displaystyle{ ( m_1 , m_2 , \dots , m_n) }[/math], such that [math]\displaystyle{ m_i \ge m_{i1} }[/math]. Let [math]\displaystyle{ S(n,g) }[/math] denote the set of these valid ntuples:
[math]\displaystyle{ S(n,g) = \left\{ ( m_1 , m_2 , \dots , m_n ) \, \Big \, m_i \ge m_{i1}, m_i \in \left\{ 1, \ldots, g \right\}, \forall i = 1, \dots , n \right\}. }[/math] 

( ) 
Then the quantity [math]\displaystyle{ w(n,g) }[/math] (defined above as the number of ways to distribute [math]\displaystyle{ n }[/math] particles among the [math]\displaystyle{ g }[/math] sublevels of an energy level) is the cardinality of [math]\displaystyle{ S(n,g) }[/math], i.e., the number of elements (or valid ntuples) in [math]\displaystyle{ S(n,g) }[/math]. Thus the problem of finding an expression for [math]\displaystyle{ w(n,g) }[/math] becomes the problem of counting the elements in [math]\displaystyle{ S(n,g) }[/math].
Example n = 4, g = 3: [math]\displaystyle{ S(4,3) = \left\{ \underbrace{(1111), (1112), (1113)}_{(a)}, \underbrace{(1122), (1123), (1133)}_{(b)}, \underbrace{(1222), (1223), (1233), (1333)}_{(c)}, \underbrace{(2222), (2223), (2233), (2333), (3333)}_{(d)} \right\} }[/math] [math]\displaystyle{ w(4,3) = 15 }[/math] (there are [math]\displaystyle{ 15 }[/math] elements in [math]\displaystyle{ S(4,3) }[/math])
Subset [math]\displaystyle{ (a) }[/math] is obtained by fixing all indices [math]\displaystyle{ m_i }[/math] to [math]\displaystyle{ 1 }[/math], except for the last index, [math]\displaystyle{ m_n }[/math], which is incremented from [math]\displaystyle{ 1 }[/math] to [math]\displaystyle{ g=3 }[/math]. Subset [math]\displaystyle{ (b) }[/math] is obtained by fixing [math]\displaystyle{ m_1 = m_2 = 1 }[/math], and incrementing [math]\displaystyle{ m_3 }[/math] from [math]\displaystyle{ 2 }[/math] to [math]\displaystyle{ g=3 }[/math]. Due to the constraint [math]\displaystyle{ m_i \ge m_{i1} }[/math] on the indices in [math]\displaystyle{ S(n,g) }[/math], the index [math]\displaystyle{ m_4 }[/math] must automatically take values in [math]\displaystyle{ \left\{ 2, 3 \right\} }[/math]. The construction of subsets [math]\displaystyle{ (c) }[/math] and [math]\displaystyle{ (d) }[/math] follows in the same manner.
Each element of [math]\displaystyle{ S(4,3) }[/math] can be thought of as a multiset of cardinality [math]\displaystyle{ n=4 }[/math]; the elements of such multiset are taken from the set [math]\displaystyle{ \left\{ 1, 2, 3 \right\} }[/math] of cardinality [math]\displaystyle{ g=3 }[/math], and the number of such multisets is the multiset coefficient [math]\displaystyle{ \left\langle \begin{matrix} 3 \\ 4 \end{matrix} \right\rangle = {3 + 4  1 \choose 31} = {3 + 4  1 \choose 4} = \frac {6!} {4! 2!} = 15 }[/math]
More generally, each element of [math]\displaystyle{ S(n,g) }[/math] is a multiset of cardinality [math]\displaystyle{ n }[/math] (number of dice) with elements taken from the set [math]\displaystyle{ \left\{ 1, \dots, g \right\} }[/math] of cardinality [math]\displaystyle{ g }[/math] (number of possible values of each die), and the number of such multisets, i.e., [math]\displaystyle{ w(n,g) }[/math]is the multiset coefficient
[math]\displaystyle{ w(n,g) = \left\langle \begin{matrix} g \\ n \end{matrix} \right\rangle = {g + n  1 \choose g1} = {g + n  1 \choose n} = \frac{(g + n  1)!}{n! (g1)!} }[/math] 

( ) 
which is exactly the same as the formula for [math]\displaystyle{ w(n,g) }[/math], as derived above with the aid of a theorem involving binomial coefficients, namely
[math]\displaystyle{ \sum_{k=0}^n\frac{(k+a)!}{k!a!}=\frac{(n+a+1)!}{n!(a+1)!}. }[/math] 

( ) 
To understand the decomposition
[math]\displaystyle{ w(n,g) = \sum_{k=0}^n w(nk, g1) = w(n, g1) + w(n1, g1) + \dots + w(1, g1) + w(0, g1) }[/math] 

( ) 
or for example, [math]\displaystyle{ n=4 }[/math] and [math]\displaystyle{ g=3 }[/math] [math]\displaystyle{ w(4,3) = w(4,2) + w(3,2) + w(2,2) + w(1,2) + w(0,2), }[/math]
let us rearrange the elements of [math]\displaystyle{ S(4,3) }[/math] as follows [math]\displaystyle{ S(4,3) = \left\{ \underbrace{ (1111), (1112), (1122), (1222), (2222) }_{(\alpha)}, \underbrace{ (111{\color{Red}{\underset{=}{3}}}), (112{\color{Red}{\underset{=}{3}}}), (122{\color{Red}{\underset{=}{3}}}), (222{\color{Red}{\underset{=}{3}}}) }_{(\beta)}, \underbrace{ (11{\color{Red}{\underset{==}{33}}}), (12{\color{Red}{\underset{==}{33}}}), (22{\color{Red}{\underset{==}{33}}}) }_{(\gamma)}, \underbrace{ (1{\color{Red}{\underset{===}{333}}}), (2{\color{Red}{\underset{===}{333}}}) }_{(\delta)} \underbrace{ ({\color{Red}{\underset{====}{3333}}}) }_{(\omega)} \right\}. }[/math]
Clearly, the subset [math]\displaystyle{ (\alpha) }[/math] of [math]\displaystyle{ S(4,3) }[/math] is the same as the set [math]\displaystyle{ S(4,2) = \left\{ (1111), (1112), (1122), (1222), (2222) \right\}. }[/math]
By deleting the index [math]\displaystyle{ m_4=3 }[/math] (shown in red with double underline) in the subset [math]\displaystyle{ (\beta) }[/math] of [math]\displaystyle{ S(4,3) }[/math], one obtains the set
[math]\displaystyle{ S(3,2) = \left\{ (111), (112), (122), (222) \right\}. }[/math]
In other words, there is a onetoone correspondence between the subset [math]\displaystyle{ (\beta) }[/math] of [math]\displaystyle{ S(4,3) }[/math] and the set [math]\displaystyle{ S(3,2) }[/math]. We write [math]\displaystyle{ (\beta) \longleftrightarrow S(3,2) . }[/math]
Similarly, it is easy to see that [math]\displaystyle{ (\gamma) \longleftrightarrow S(2,2) = \left\{ (11), (12), (22) \right\} }[/math] [math]\displaystyle{ (\delta) \longleftrightarrow S(1,2) = \left\{ (1), (2) \right\} }[/math] [math]\displaystyle{ (\omega) \longleftrightarrow S(0,2) = \{\} = \varnothing. }[/math]
Thus we can write [math]\displaystyle{ S(4,3) = \bigcup_{k=0}^{4} S(4k,2) }[/math]
or more generally,
[math]\displaystyle{ S(n,g) = \bigcup_{k=0}^{n} S(nk,g1); }[/math] 

( ) 
and since the sets
[math]\displaystyle{ S(i,g1), \text{ for } i = 0, \dots , n }[/math]
are nonintersecting, we thus have
[math]\displaystyle{ w(n,g) = \sum_{k=0}^{n} w(nk,g1), }[/math] 

( ) 
with the convention that
[math]\displaystyle{ w(0,g) = 1 \ , \forall g, \text{ and } w(n,0) = 1 \ , \forall n. }[/math] 

( ) 
Continuing the process, we arrive at the following formula [math]\displaystyle{ w(n,g) = \sum_{k_1=0}^n \sum_{k_2=0}^{nk_1} w(n  k_1  k_2, g2) = \sum_{k_1=0}^n \sum_{k_2=0}^{nk_1} \cdots \sum_{k_g=0}^{n\sum_{j=1}^{g1} k_j} w(n  \sum_{i=1}^{g} k_i, 0). }[/math] Using the convention (7)_{2} above, we obtain the formula
[math]\displaystyle{ w(n,g) = \sum_{k_1=0}^{n} \sum_{k_2=0}^{nk_1} \cdots \sum_{k_g=0}^{n\sum_{j=1}^{g1} k_j} 1, }[/math] 

( ) 
keeping in mind that for [math]\displaystyle{ q }[/math] and [math]\displaystyle{ p }[/math] being constants, we have
[math]\displaystyle{ \sum_{k=0}^{q} p = q p. }[/math] 

( ) 
It can then be verified that (8) and (2) give the same result for [math]\displaystyle{ w(4,3) }[/math], [math]\displaystyle{ w(3,3) }[/math], [math]\displaystyle{ w(3,2) }[/math], etc.
Interdisciplinary applications
Viewed as a pure probability distribution, the Bose–Einstein distribution has found application in other fields:
 In recent years, Bose–Einstein statistics has also been used as a method for term weighting in information retrieval. The method is one of a collection of DFR ("Divergence From Randomness") models,^{[17]} the basic notion being that Bose–Einstein statistics may be a useful indicator in cases where a particular term and a particular document have a significant relationship that would not have occurred purely by chance. Source code for implementing this model is available from the Terrier project at the University of Glasgow.
 The evolution of many complex systems, including the World Wide Web, business, and citation networks, is encoded in the dynamic web describing the interactions between the system's constituents. Despite their irreversible and nonequilibrium nature these networks follow Bose statistics and can undergo Bose–Einstein condensation. Addressing the dynamical properties of these nonequilibrium systems within the framework of equilibrium quantum gases predicts that the "firstmoveradvantage", "fitgetrich" (FGR) and "winnertakesall" phenomena observed in competitive systems are thermodynamically distinct phases of the underlying evolving networks.^{[18]}
See also
 Bose–Einstein correlations
 Bose–Einstein condensate
 Bose gas
 Einstein solid
 Higgs boson
 Parastatistics
 Planck's law of black body radiation
 Superconductivity
 Fermi–Dirac statistics
 Maxwell–Boltzmann statistics
Notes
 ↑ Pearsall, Thomas (2020). Quantum Photonics, 2nd edition. Graduate Texts in Physics. Springer. doi:10.1007/9783030473259. ISBN 9783030473242. https://www.springer.com/us/book/9783030473242.
 ↑ Jammer, Max (1966). The conceptual development of quantum mechanics. McGrawHill. p. 51. ISBN 0883186179.
 ↑ Passon, Oliver; GrebeEllis, Johannes (20170501). "Planck's radiation law, the light quantum, and the prehistory of indistinguishability in the teaching of quantum mechanics". European Journal of Physics 38 (3): 035404. doi:10.1088/13616404/aa6134. ISSN 01430807. Bibcode: 2017EJPh...38c5404P. https://iopscience.iop.org/article/10.1088/13616404/aa6134.
 ↑ d'Alembert, Jean (1754). "Croix ou pile" (in fr). L'Encyclopédie 4.
 ↑ d'Alembert, Jean (1754). "Croix ou pile". http://www.cs.xu.edu/math/Sources/Dalembert/croix_ou_pile.pdf.
 ↑ See p. 14, note 3, of the thesis: Michelangeli, Alessandro (October 2007). Bose–Einstein condensation: Analysis of problems and rigorous results (PDF) (Ph.D.). International School for Advanced Studies. Archived (PDF) from the original on 3 November 2018. Retrieved 14 February 2019.
 ↑ Bose (2 July 1924). "Planck's law and the hypothesis of light quanta" (PostScript). University of Oldenburg. http://www.condmat.unioldenburg.de/TeachingSP/bose.ps.
 ↑ Bose (1924), "Plancks Gesetz und Lichtquantenhypothese" (in de), Zeitschrift für Physik 26 (1): 178–181, doi:10.1007/BF01327326, Bibcode: 1924ZPhy...26..178B
 ↑ ^{9.0} ^{9.1} H. J. W. MüllerKirsten, Basics of Statistical Physics, 2nd ed., World Scientific (2013), ISBN:9789814449533.
 ↑ Srivastava, R. K.; Ashok, J. (2005). "Chapter 7". Statistical Mechanics. New Delhi: PHI Learning Pvt. Ltd.. ISBN 9788120327825.
 ↑ Landau, L. D., Lifšic, E. M., Lifshitz, E. M., & Pitaevskii, L. P. (1980). Statistical physics (Vol. 5). Pergamon Press.
 ↑ "Chapter 6". Statistical Mechanics. January 2005. ISBN 9788120327825.
 ↑ The BE distribution can be derived also from thermal field theory.
 ↑ R. B. Dingle, Asymptotic Expansions: Their Derivation and Interpretation, Academic Press (1973), pp. 267–271.
 ↑ Ziff R. M.; Kac, M.; Uhlenbeck, G. E. (1977). "The ideal Bose–Einstein gas, revisited". Physics Reports 32: 169–248.
 ↑ See McQuarrie in citations
 ↑ Amati, G.; C. J. Van Rijsbergen (2002). "Probabilistic models of information retrieval based on measuring the divergence from randomness " ACM TOIS 20(4):357–389.
 ↑ Bianconi, G.; Barabási, A.L. (2001). "Bose–Einstein Condensation in Complex Networks". Physical Review Letters 86: 5632–5635.
References
 Annett, James F. (2004). Superconductivity, Superfluids and Condensates. New York: Oxford University Press. ISBN 0198507550.
 Carter, Ashley H. (2001). Classical and Statistical Thermodynamics. Upper Saddle River, New Jersey: Prentice Hall. ISBN 0137792085.
 Griffiths, David J. (2005). Introduction to Quantum Mechanics (2nd ed.). Upper Saddle River, New Jersey: Pearson, Prentice Hall. ISBN 0131911759.
 McQuarrie, Donald A. (2000). Statistical Mechanics (1st ed.). Sausalito, California 94965: University Science Books. p. 55. ISBN 1891389157. https://archive.org/details/statisticalmecha00mcqu_0/page/55.
Original source: https://en.wikipedia.org/wiki/Bose–Einstein statistics.
Read more 