8b. Free probability
As a solution to the difficulties met in the previous section, let us turn to probability. This is surely not geometry, in a standard sense, but at a more advanced level, geometry that is. For instance if you have a quantum manifold [math]X[/math], and you want to talk about its Laplacian, or its Dirac operator, you will certainly need to know a bit about [math]L^2(X)[/math]. And isn't advanced measure theory the same as probability theory, hope we agree on this.
Let us start our discussion with something that we know since chapter 5:
Let [math]A[/math] be a [math]C^*[/math]-algebra, given with a trace [math]tr:A\to\mathbb C[/math].
- The elements [math]a\in A[/math] are called random variables.
- The moments of such a variable are the numbers [math]M_k(a)=tr(a^k)[/math].
- The law of such a variable is the functional [math]\mu:P\to tr(P(a))[/math].
Here the exponent [math]k=\circ\bullet\bullet\circ\ldots[/math] is as before a colored integer, with the powers [math]a^k[/math] being defined by multiplicativity and the usual formulae, namely:
As for the polynomial [math]P[/math], this is a noncommuting [math]*[/math]-polynomial in one variable:
Generally speaking, the above definition is something quite abstract, but there is no other way of doing things, at least at this level of generality. However, in the special case where our variable [math]a\in A[/math] is self-adjoint, or more generally normal, we have:
The law of a normal variable [math]a\in A[/math] can be identified with the corresponding spectral measure [math]\mu\in\mathcal P(\mathbb C)[/math], according to the following formula,
This is something that we again know well, either from chapter 5, or simply from chapter 3, coming from the spectral theorem for normal operators.
Let us discuss now independence, and its noncommutative versions. As a starting point, we have the following update of the classical notion of independence:
We call two subalgebras [math]B,C\subset A[/math] independent when the following condition is satisfied, for any [math]x\in B[/math] and [math]y\in C[/math]:
It is possible to develop some theory here, but this leads to the usual CLT. As a much more interesting notion now, we have Voiculescu's freeness [1]:
Given a pair [math](A,tr)[/math], we call two subalgebras [math]B,C\subset A[/math] free when the following condition is satisfied, for any [math]x_i\in B[/math] and [math]y_i\in C[/math]:
As a first observation, there is a certain lack of symmetry between Definition 8.12 and Definition 8.13, because the latter does not include an explicit formula for quantities of type [math]tr(x_1y_1x_2y_2\ldots)[/math]. But this can be done, the precise result being as follows:
If [math]B,C\subset A[/math] are free, the restriction of [math]tr[/math] to [math] \lt B,C \gt [/math] can be computed in terms of the restrictions of [math]tr[/math] to [math]B,C[/math]. To be more precise, we have
With [math]x'=x-tr(x)[/math], we can start our computation as follows:
Thus, we are led to a kind of recurrence, and this gives the result.
Let us discuss now some examples of independence and freeness. We first have the following result, from [1], which is something elementary:
Given two algebras [math](A,tr)[/math] and [math](B,tr)[/math], the following hold:
- [math]A,B[/math] are independent inside their tensor product [math]A\otimes B[/math], endowed with its canonical tensor product trace, given on basic tensors by [math]tr(a\otimes b)=tr(a)tr(b)[/math].
- [math]A,B[/math] are free inside their free product [math]A*B[/math], endowed with its canonical free product trace, given by the formulae in Proposition 8.14.
Both the assertions are indeed clear from definitions, with just some standard discussion needed for (2), in connection with the free product trace. See [1].
More concretely now, we have the following result, also from Voiculescu [1]:
We have the following results, valid for group algebras:
- [math]L(\Gamma),L(\Lambda)[/math] are independent inside [math]L(\Gamma\times\Lambda)[/math].
- [math]L(\Gamma),L(\Lambda)[/math] are free inside [math]L(\Gamma*\Lambda)[/math].
In order to prove these results, we can use the general results in Proposition 8.15, along with the following two isomorphisms, which are both standard:
Alternatively, we can check the independence and freeness formulae on group elements, which is something trivial, and then conclude by linearity. See [1].
We have already seen limiting theorems in classical probability, in chapter 6. In order to deal now with freeness, let us develop some tools. First, we have:
We have a well-defined operation [math]\boxplus[/math], given by
We need to check here that if [math]a,b[/math] are free, then the distribution [math]\mu_{a+b}[/math] depends only on the distributions [math]\mu_a,\mu_b[/math]. But for this purpose, we can use the formula in Proposition 8.14. Indeed, by plugging in arbitrary powers of [math]a,b[/math] as variables [math]x_i,y_j[/math], we obtain a family of formulae of the following type, with [math]Q[/math] being certain polyomials:
Thus the moments of [math]a+b[/math] depend only on the moments of [math]a,b[/math], and the same argument shows that the same holds for [math]*[/math]-moments, and this gives the result.
In order to advance now, we would need an analogue of the Fourier transform, or rather of the log of the Fourier transform. Quite remarkably, such a transform exists indeed, the precise result here, due to Voiculescu [1], being as follows:
Given a probability measure [math]\mu[/math], define its [math]R[/math]-transform as follows:
This is something quite tricky, the idea being as follows:
(1) In order to model the free convolution, the best is to use creation operators on free Fock spaces, corresponding to the semigroup von Neumann algebras [math]L(\mathbb N^{*k})[/math]. Indeed, we have some freeness here, a bit in the same way as in the free group algebras [math]L(F_k)[/math].
(2) The point now, motivating this choice, is that the variables of type [math]S^*+f(S)[/math], with [math]S\in L(\mathbb N)[/math] being the shift, and with [math]f\in\mathbb C[X][/math] being an arbitrary polynomial, are easily seen to model in moments all the possible distributions [math]\mu:\mathbb C[X]\to\mathbb C[/math].
(3) Now let [math]f,g\in\mathbb C[X][/math] and consider the variables [math]S^*+f(S)[/math] and [math]T^*+g(T)[/math], where [math]S,T\in L(\mathbb N*\mathbb N)[/math] are the shifts corresponding to the generators of [math]\mathbb N*\mathbb N[/math]. These variables are free, and by using a [math]45^\circ[/math] argument, their sum has the same law as [math]S^*+(f+g)(S)[/math].
(4) Thus the operation [math]\mu\to f[/math] linearizes the free convolution. We are therefore left with a computation inside [math]L(\mathbb N)[/math], which is elementary, and whose conclusion is that [math]R_\mu=f[/math] can be recaptured from [math]\mu[/math] via the Cauchy transform [math]G_\mu[/math], as in the statement.
With the above linearization technology in hand, we can now establish the following remarkable free analogue of the CLT, also due to Voiculescu [1]:
Given self-adjoint variables [math]x_1,x_2,x_3,\ldots,[/math] which are f.i.d., centered, with variance [math]t \gt 0[/math], we have, with [math]n\to\infty[/math], in moments,
We follow the same idea as in the proof of the CLT:
(1) At [math]t=1[/math], the [math]R[/math]-transform of the variable in the statement can be computed by using the linearization property from Theorem 8.18, and is given by:
(2) On the other hand, some standard computations show that the Cauchy transform of the Wigner law [math]\gamma_1[/math] satisfies the following equation:
Thus, by using Theorem 8.18, we have the following formula:
(3) We conclude that the laws in the statement have the same [math]R[/math]-transforms, and so they are equal. The passage to the general case, [math]t \gt 0[/math], is routine, by dilation.
In the complex case now, we have a similar result, also from [1], as follows:
Given random variables [math]x_1,x_2,x_3,\ldots[/math] which are f.i.d., centered, with variance [math]t \gt 0[/math], we have, with [math]n\to\infty[/math], in moments,
This follows indeed from the free CLT, established before, simply by taking real and imaginary parts of all the variables involved.
Now that we are done with the basic results in continuous case, let us discuss as well the discrete case. We can establish a free version of the PLT, as follows:
The following limit converges, for any [math]t \gt 0[/math],
Let [math]\mu[/math] be the measure in the statement, appearing under the convolution sign. The Cauchy transform of this measure is elementary to compute, given by:
By using Theorem 8.18, we want to compute the following [math]R[/math]-transform:
We know that the equation for this function [math]R[/math] is as follows:
With [math]n\to\infty[/math] we obtain from this the following formula:
But this being the [math]R[/math]-transform of [math]\pi_t[/math], via some calculus, we are done.
As a first application now of all this, following Voiculescu [2], we have:
Given a sequence of complex Gaussian matrices [math]Z_N\in M_N(L^\infty(X))[/math], having independent [math]G_t[/math] variables as entries, with [math]t \gt 0[/math], we have
We know from chapter 6 that the asymptotic moments are:
On the other hand, the free Fock space analysis done in the proof of Theorem 8.18 shows that we have, with the notations there, the following formulae:
By doing some combinatorics, this shows that an abstract noncommutative variable [math]a\in A[/math] is circular, following the law [math]\Gamma_t[/math], precisely when its moments are:
Thus, we are led to the conclusion in the statement. See [2].
Next in line, comes the main result of Voiculescu in [2], as follows:
Given a family of sequences of Wigner matrices,
We can assume that we are dealing with 2 sequences of matrices, [math]Z_N,Z_N'[/math]. In order to prove the asymptotic freeness, consider the following matrix:
This is then a complex Gaussian matrix, so by using Theorem 8.22, we have:
We are therefore in the situation where [math](Z_N+iZ_N')/\sqrt{N}[/math], which has asymptotically semicircular real and imaginary parts, converges to the distribution of a free combination of such variables. Thus [math]Z_N,Z_N'[/math] become asymptotically free, as desired.
Getting now to the complex case, we have a similar result here, as follows:
Given a family of sequences of complex Gaussian matrices,
This follows indeed from Theorem 8.23, which applies to the real and imaginary parts of our complex Gaussian matrices, and gives the result.
Finally, we have as well a similar result for the Wishart matrices, as follows:
Given a family of sequences of complex Wishart matrices,
Here the first assertion is the Marchenko-Pastur theorem, from chapter 6, and the second assertion follows from Theorem 8.23, or from Theorem 8.24.
Let us develop now some further limiting theorems, classical and free. We have the following definition, extending the Poisson limit theory developed before:
Associated to any compactly supported positive measure [math]\rho[/math] on [math]\mathbb C[/math] are the probability measures
In what follows we will be interested in the case where [math]\rho[/math] is discrete, as is for instance the case for [math]\rho=t\delta_1[/math] with [math]t \gt 0[/math], which produces the Poisson and free Poisson laws. The following result allows one to detect compound Poisson/free Poisson laws:
For [math]\rho=\sum_{i=1}^sc_i\delta_{z_i}[/math] with [math]c_i \gt 0[/math] and [math]z_i\in\mathbb C[/math], we have
Let [math]\mu_n[/math] be the measure appearing in Definition 8.26. We have:
In the free case we can use a similar method, and we obtain the above formula.
We have the following result, providing an alternative to Definition 8.26, which will be our formulation here of the Compond Poisson Limit Theorem, classical and free:
For [math]\rho=\sum_{i=1}^sc_i\delta_{z_i}[/math] with [math]c_i \gt 0[/math] and [math]z_i\in\mathbb C[/math], we have
This follows indeed from the fact that the the Fourier/[math]R[/math]-transform of the variable in the statement is given by the formulae in Proposition 8.27.
Following [3], [4], we will be interested here in the main examples of classical and free compound Poisson laws, which are constructed as follows:
The Bessel and free Bessel laws are the compound Poisson laws
- At [math]s=1[/math] we obtain the usual Poisson and free Poisson laws, [math]p_t,\pi_t[/math].
- At [math]s=2[/math] we obtain the “real” Bessel and free Bessel laws, denoted [math]b_t,\beta_t[/math].
- At [math]s=\infty[/math] we obtain the “complex” Bessel and free Bessel laws, denoted [math]B_t,\mathfrak B_t[/math].
There is a lot of theory regarding these laws, and we refer here to [3], [4], where these laws were introduced. We will be back to these laws, in a moment.
General references
Banica, Teo (2024). "Principles of operator algebras". arXiv:2208.03600 [math.OA].
References
- 1.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 D. Voiculescu, Addition of certain noncommuting random variables, J. Funct. Anal. 66 (1986), 323--346.
- 2.0 2.1 2.2 D. Voiculescu, Limit laws for random matrices and free products, Invent. Math. 104 (1991), 201--220.
- 3.0 3.1 T. Banica, S.T. Belinschi, M. Capitaine and B. Collins, Free Bessel laws, Canad. J. Math. 63 (2011), 3--37.
- 4.0 4.1 T. Banica, J. Bichon and B. Collins, The hyperoctahedral quantum group, J. Ramanujan Math. Soc. 22 (2007), 345--384.