Bistochastic form
10a. Basic theory
In this chapter and in the next one we discuss some further analytic aspects of the complex Hadamard matrices, which this time are brand new or almost, going back to the mid 10s and onwards, and are very exciting too. The general idea is that any Hadamard matrix, real or complex, can be put in bistochastic form over the complex numbers [math]\mathbb C[/math], and with this bistochastic form looking much better than the original form.
Thus, we have here a potentially far-reaching idea, consisting in reformulating everything that we know, including our favorite questions from the real case, the HC and CHC, in complex bistochastic form. But, and here comes the second point, putting an Hadamard matrix in bistochastic form is something non-trivial, in general done by a non-explicit result of Idel-Wolf [1], based on some non-trivial symplectic geometry results of Biran-Entov-Polterovich [2] and Cho [3], motivated by a deep conjecture of Arnold.
And isn't this exciting. We have been commenting in the last chapter on open questions in mathematics, our point being that the closer you get to classical mechanics, the better that is, for the fate of your open problem. And since in classical mechanics all roads lead to Arnold, we are probably on the right track here. Perhaps for the first time, since the beginning of this book. That is, plenty of reasons to be optimistic.
All this is however very new, and our presentation here will be quite modest. Lots of further work are needed, and it is a pity that nothing much is going on here, so far. Young reader, if I have an excellent question to recommend to you, this is the one, continuation of what will be said here. Get to know and love classical mechanics, which is the mother of everything, in mathematics and physics, than read some books of Arnold, starting with [4], which are a must-read anyway, no matter what mathematics or physics you want to do, and then start solving some Hadamard matrix questions, using this technology.
In order to get started now, we have already talked about bistochastic Hadamard matrices, in the real case, on several occasions, in chapters 1-4. Our first purpose will be that of carefully reviewing and extending that material, in the complex Hadamard matrix case. Let us start our discussion with the following definition:
A complex Hadamard matrix [math]H\in M_N(\mathbb C)[/math] is called bistochastic when the sums on all rows and all columns are equal,
The bistochastic Hadamard matrices are quite interesting objects, and include for instance all the circulant Hadamard matrices, that we discussed in chapter 9. Indeed, assuming that [math]H_{ij}=\xi_{j-i}[/math] is circulant, all rows and columns sum up to [math]\lambda=\sum_i\xi_i[/math]:
We will be back to this, in a moment. Let us begin, however, with some considerations regarding the real case. Our point here is that the real Hadamard matrices often “look better” in complex bistochastic form, and that there is some potentially interesting mathematics behind all this. As a first and trivial remark, the first Walsh matrix [math]W_2=F_2[/math] looks better in complex bistochastic form, modulo the standard equivalence relation:
To be more precise, the matrix on the right, while having the slight disadvantage of being complex instead of real, is something very nice, circulant and symmetric. Regarding the second Walsh matrix [math]W_4=W_2\otimes W_2[/math], this looks as well better in bistochastic form, because it becomes in this way equivalent to [math]K_4[/math], the most beautiful matrix ever:
As before, the matrix on the right looks better than the one on the left, because it is circulant and symmetric. And all this is quite interesting, philosophically speaking. Indeed, we have here a new idea, in connection with the various questions explained in chapters 1-4, namely that of studying the real Hadamard matrices [math]H\in M_N(\pm1)[/math] by putting them in complex bistochastic form, [math]H'\in M_N(\mathbb T)[/math], and then studying these latter matrices. Let us record here, as a partial conclusion, the following simple fact:
All Walsh matrices can be put in bistocastic form, as follows:
- The matrices [math]W_N[/math] with [math]N=4^n[/math] admit a real bistochastic form, namely:
[[math]] W_N\sim\begin{pmatrix} -1&1&1&1\\ 1&-1&1&1\\ 1&1&-1&1\\ 1&1&1&-1 \end{pmatrix}^{\otimes n} [[/math]]
- The matrices [math]W_N[/math] with [math]N=2\times4^n[/math] admit a complex bistochastic form, namely:
[[math]] W_N\sim\begin{pmatrix}i&1\\1&i\end{pmatrix}\otimes\begin{pmatrix} -1&1&1&1\\ 1&-1&1&1\\ 1&1&-1&1\\ 1&1&1&-1 \end{pmatrix}^{\otimes n} [[/math]]
This follows indeed from the above discussion.
Let us review now the material in chapter 9. According to the results there, and to the above-mentioned fact that circulant implies bistochastic, we have:
The class of bistochastic Hadamard matrices is stable under permuting rows and columns, and under taking tensor products. As examples, we have:
- The circulant and symmetric forms [math]F_N'[/math] of the Fourier matrices [math]F_N[/math].
- The bistochastic and symmetric forms [math]F_G'[/math] of the Fourier matrices [math]F_G[/math].
- The circulant and symmetric Backelin matrices, having size [math]MN[/math] with [math]M|N[/math].
In this statement the claim regarding permutations of rows and columns is clear. Assuming now that [math]H,K[/math] are bistochastic, with sums [math]\lambda,\mu[/math], we have:
We have as well the following computation:
Thus, the matrix [math]H\otimes K[/math] is bistochastic as well. As for the assertions (1,2,3), we already know all this, coming from our study from from chapter 9.
In the above list of examples, those in (2), which are not necessarily circulant, are the key ones. Indeed, while many interesting complex Hadamard matrices, such as the usual Fourier ones [math]F_N[/math], can be put in circulant form, this is something quite exceptional, which does not work any longer when looking for instance at the general Fourier matrices [math]F_G[/math]. To be more precise, consider a finite abelian group, written as follows:
We can then consider the following matrix, with [math]F_N'[/math] standing as before for the circulant and symmetric form of the Fourier matrix [math]F_N[/math], which is equivalent to [math]F_G[/math]:
Now since the tensor product of circulant matrices is bistochastic, but not necessarily circulant, we can only say that this matrix [math]F_G'[/math] is bistochastic, as stated in (2) above.
As a conclusion to all this, the bistochastic complex Hadamard matrices are interesting objects, covering all the generalized Fourier matrices, up to equivalence, and which are definitely worth some study. So, let us develop now some general theory, for such matrices. As a first result, regarding the unitary bistochastic matrices in general, we have:
The real and complex bistochastic groups, which are the sets
Let us pick a unitary matrix [math]F\in U_N[/math] satisfying the following condition, where [math]e_0,\ldots,e_{N-1}[/math] is the standard basis of [math]\mathbb C^N[/math], and where [math]\xi[/math] is the all-one vector:
Observe that such matrices [math]F\in U_N[/math] exist indeed, the basic example being the normalized Fourier matrix [math]F_N/\sqrt{N}[/math]. We have then, by using the above property of [math]F[/math]:
Thus we have isomorphisms as in the statement, given by:
But this gives both the assertions.
Now back to the Hadamard matrices, we first have the following elementary result:
For a complex Hadamard matrix [math]H\in M_N(\mathbb C)[/math], the following conditions are equivalent:
- [math]H[/math] is bistochastic, with sums [math]\lambda[/math].
- [math]H[/math] is row-stochastic, with sums [math]\lambda[/math], and [math]|\lambda|^2=N[/math].
Both the implications are elementary, as follows:
[math](1)\implies(2)[/math] If we denote by [math]H_1,\ldots,H_N\in\mathbb T^N[/math] the rows of [math]H[/math], we have indeed:
[math](2)\implies(1)[/math] Consider the all-one vector [math]\xi=(1)_i\in\mathbb C^N[/math]. The fact that [math]H[/math] is row-stochastic with sums [math]\lambda[/math] reads:
Also, the fact that [math]H[/math] is column-stochastic with sums [math]\lambda[/math] reads:
We must prove that the first condition implies the second one, provided that the row sum [math]\lambda[/math] satisfies [math]|\lambda|^2=N[/math]. But this follows from the following computation:
Thus, we have proved both the implications, and we are done.
Here is another basic result, that we will need as well in what follows:
For a complex Hadamard matrix [math]H\in M_N(\mathbb C)[/math], and a number [math]\lambda\in\mathbb C[/math] satisfying [math]|\lambda|^2=N[/math], the following are equivalent:
- We have [math]H\sim H'[/math], with [math]H'[/math] being bistochastic, with sums [math]\lambda[/math].
- [math]K_{ij}=a_ib_jH_{ij}[/math] is bistochastic with sums [math]\lambda[/math], for some [math]a,b\in\mathbb T^N[/math].
- The equation [math]Hb=\lambda\bar{a}[/math] has solutions [math]a,b\in\mathbb T^N[/math].
Once again, this is an elementary result, the proof being as follows:
[math](1)\iff(2)[/math] Since the permutations of the rows and columns preserve the bistochasticity condition, the equivalence [math]H\sim H'[/math] that we are looking for can be assumed to come only from multiplying the rows and columns by numbers in [math]\mathbb T[/math]. Thus, we are looking for scalars [math]a_i,b_j\in\mathbb T[/math] such that the following matrix is bistochastic with sums [math]\lambda[/math]:
Thus, we are led to the conclusion that (1) and (2) are equivalent, as claimed.
[math](2)\iff(3)[/math] The row sums of the matrix [math]K_{ij}=a_ib_jH_{ij}[/math] are given by:
Thus [math]K[/math] is row-stochastic with sums [math]\lambda[/math] precisely when [math]Hb=\lambda\bar{a}[/math], and by using the equivalence in Proposition 10.5, we obtain the result.
Finally, here is an extension of the excess inequality from chapter 2:
For a complex Hadamard matrix [math]H\in M_N(\mathbb C)[/math], the excess,
In terms of the all-one vector [math]\xi=(1)_i\in\mathbb C^N[/math], we have:
Now by using the Cauchy-Schwarz inequality, along with the fact that [math]U=H/\sqrt{N}[/math] is unitary, and hence of norm 1, we obtain, as claimed:
Regarding now the equality case, this requires the vectors [math]H\xi,\xi[/math] to be proportional, and so our matrix [math]H[/math] to be row-stochastic. Now, let us assume:
We have then [math]|\lambda|^2=N[/math], and by Proposition 10.5 we obtain the result.
The above result was just an introduction to what can be said about the excess, and we refer to Kharaghani-Seberry [5] for more on all this. In what concerns us, we will be back to the excess in chapter 11 below, with some probabilistic computations.
Let us go back now to the fundamental question of putting an arbitrary Hadamard matrix in bistochastic form. As already explained in the above, we are interested in solving this question in general, and in particular in the real case, with potential complex reformulations of the HC and CHC, and other real Hadamard questions, at stake. What we know so far on this subject can be summarized as follows:
An Hadamard matrix [math]H\in M_N(\mathbb C)[/math] can be put in bistochastic form when one of the following conditions is satisfied:
- The equations [math]|Ha|_i=\sqrt{N}[/math], with [math]i=1,\ldots,N[/math], have solutions [math]a\in\mathbb T^N[/math].
- The quantity [math]|E|[/math] attains its maximum [math]N\sqrt{N}[/math] over the equivalence class of [math]H[/math].
This follows indeed from Proposition 10.5 and Proposition 10.6, which altogether gives the equivalence between the two conditions in the statement.
Thus, we have two approaches to the problem, one algebraic, and one analytic.
10b. Idel-Wolf theorem
Let us first discuss the algebraic approach, coming from Proposition 10.8 (1). What we have there is a certain system of [math]N[/math] equations, having as unknowns [math]N[/math] real variables, namely the phases of [math]a_1,\ldots,a_N[/math]. This system is highly non-linear, but can be solved, however, via a certain non-explicit method, as explained by Idel-Wolf in [1]. In order to discuss this material, which is quite advanced, let us begin with some preliminaries. The complex projective space appears by definition as follows:
Inside this projective space, we have the Clifford torus, constructed as follows:
With these conventions, we have the following result, from [1]:
For a unitary matrix [math]U\in U_N[/math], the following are equivalent:
- There exist [math]L,R\in U_N[/math] diagonal such that the following matrix is bistochastic:
[[math]] U'=LUR [[/math]]
- The standard torus [math]\mathbb T^N\subset\mathbb C^N[/math] satisfies:
[[math]] \mathbb T^N\cap U\mathbb T^N\neq\emptyset [[/math]]
- The Clifford torus [math]\mathbb T^{N-1}\subset P^{N-1}_\mathbb C[/math] satisfies:
[[math]] \mathbb T^{N-1}\cap U\mathbb T^{N-1}\neq\emptyset [[/math]]
These equivalences are all elementary, as follows:
[math](1)\implies(2)[/math] Assuming that [math]U'=LUR[/math] is bistochastic, which in terms of the all-1 vector [math]\xi[/math] means [math]U'\xi=\xi[/math], if we set [math]f=R\xi\in\mathbb T^N[/math] we have:
Thus we have [math]Uf\in\mathbb T^N\cap U\mathbb T^N[/math], which gives the conclusion.
[math](2)\implies(1)[/math] Given [math]g\in\mathbb T^N\cap U\mathbb T^N[/math], we can define [math]R,L[/math] as follows:
With these values for [math]L,R[/math], we have then the following formulae:
Thus the matrix [math]U'=LUR[/math] is bistochastic, because:
[math](2)\implies(3)[/math] This is clear, because [math]\mathbb T^{N-1}\subset P^{N-1}_\mathbb C[/math] appears as the projective image of [math]\mathbb T^N\subset\mathbb C^N[/math], and so [math]\mathbb T^{N-1}\cap U\mathbb T^{N-1}[/math] appears as the projective image of [math]\mathbb T^N\cap U\mathbb T^N[/math].
[math](3)\implies(2)[/math] We have indeed the following equivalence:
But [math]U\in U_N[/math] implies [math]|\lambda|=1[/math], and this gives the result.
The point now is that the condition (3) above is something familiar in symplectic geometry, and known to hold for any [math]U\in U_N[/math]. Thus, following [1], we have:
Any unitary matrix [math]U\in U_N[/math] can be put in bistochastic form,
As already mentioned, the condition [math]\mathbb T^{N-1}\cap U\mathbb T^{N-1}\neq\emptyset[/math] in Proposition 10.9 (3) is something quite natural in symplectic geometry. To be more precise:
-- The Clifford torus [math]\mathbb T^{N-1}\subset P^{N-1}_\mathbb C[/math] is a Lagrangian submanifold.
-- The map [math]\mathbb T^{N-1}\to U\mathbb T^{N-1}[/math] is a Hamiltonian isotopy.
-- A non-trivial result of Biran-Entov-Polterovich [2] and Cho [3] states that [math]\mathbb T^{N-1}[/math] cannot be displaced from itself via a Hamiltonian isotopy.
Thus, the results in [2], [3] tells us that [math]\mathbb T^{N-1}\cap U\mathbb T^{N-1}\neq\emptyset[/math] holds indeed, for any [math]U\in U_N[/math]. We therefore obtain the result, via Proposition 10.9. See Idel-Wolf [1].
In relation now with our Hadamard matrix questions, we have:
Any complex Hadamard matrix can be put in bistochastic form, up to the standard equivalence relations for such matrices.
This follows indeed from Theorem 10.10, because if [math]H=\sqrt{N}U[/math] is Hadamard then so is [math]H'=\sqrt{N}U'[/math], and with the remark that, in what regards the equivalence relation, we just need the multiplication of the rows and columns by scalars in [math]\mathbb T[/math].
There are many further things that can be said here. As explained in [1], the various technical results from [2], [3] show that in the generic, “transverse” situation, there are at least [math]2^{N-1}[/math] ways of putting a unitary matrix [math]U\in U_N[/math] in bistochastic form, and this modulo the obvious transformation [math]U\to zU[/math], with [math]|z|=1[/math].
Thus, the question of explicitely putting the Hadamard matrices [math]H\in M_N(\mathbb C)[/math] in bistochastic form remains open, and open as well is the question of finding a simpler proof for the fact that this can be done indeed, without using [2], [3].
10c. Complex glow
Regarding the above questions, a possible approach comes from the excess result from Theorem 10.7. Indeed, we know from there that the excess [math]E(H)=\sum_{ij}H_{ij}[/math] satisfies the following inequality, with equality precisely when [math]H[/math] is bistochastic:
Thus, in order to put a complex Hadamard matrix [math]H\in M_N(\mathbb T)[/math] in bistochastic form, it is enough to show that the law of [math]|E|[/math] over the equivalence class of [math]H[/math] has [math]N\sqrt{N}[/math] as upper support bound. In order to comment on this, let us first formulate:
The glow of [math]H\in M_N(\mathbb C)[/math] is the measure [math]\mu\in\mathcal P(\mathbb C)[/math] given by:
Here [math]H[/math] can be any complex matrix, but the equivalence relation is the one for the complex Hadamard matrices. To be more precise, let us call two complex matrices [math]H,K\in M_N(\mathbb C)[/math] Hadamard equivalent if one can pass from one to the other by permuting rows and columns, or by multiplying the rows and columns by numbers in [math]\mathbb T[/math]. Now since permuting rows and columns does not change the quantity [math]E=\sum_{ij}H_{ij}[/math], we can restrict attention from the full equivalence group [math]G=(S_N\rtimes\mathbb T^N)\times(S_N\rtimes\mathbb T^N)[/math] to the smaller group [math]G'=\mathbb T^N\times\mathbb T^N[/math], and we obtain in this way the measure [math]\mu[/math] in Definition 10.12.
As in the real case, the terminology comes from a picture of the following type, with the stars [math]*[/math] representing the entries of our matrix, and with the switches being supposed now to be continuous, randomly changing the phases of the concerned entries:
In short, what we have here is a complex generalization of the Gale-Berlekamp game [6], [7], and this is where a main motivation for studying the glow comes from.
As a first remark, simplifying our study, exactly as in the real case, we are in fact interested in computing a real measure, due to the following simple fact:
With [math]E=\sum_{ij}H_{ij}[/math], the laws [math]\mu,\mu^+[/math] of the variables
By definition of the excess [math]E[/math], as being the total sum of the entries of the matrix, we have the following equality, valid for any [math]\lambda\in\mathbb T[/math]:
We conclude from this that [math]\mu=law(E)[/math] is invariant under the action of [math]\mathbb T[/math]. Thus [math]\mu[/math] must decompose as follows, with [math]\mu^+[/math] being a certain probability measure on [math][0,\infty)[/math]:
But, according to our definitions, this measure [math]\mu^+[/math] is precisely the measure in the statement, that of variable [math]|E|[/math], and this gives the result.
In particular, we can see from the above result that the glow is invariant under rotations. With this observation made, we can formulate the following result:
The glow of any Hadamard matrix [math]H\in M_N(\mathbb C)[/math], or more generally of any [math]H\in\sqrt{N}U_N[/math], satisfies the following conditions, where [math]\mathbb D[/math] is the unit disk,
We have two inclusions to be proved, the idea being as follows:
(1) The inclusion on the right comes indeed from Cauchy-Schwarz, as explained in the proof of Theorem 10.7, with the remark that the computation there only uses the fact that the rescaled matrix [math]U=H/\sqrt{N}[/math] is unitary.
(2) Regarding now the inclusion on the left, we know from Theorem 10.10 that [math]H[/math] can be put in bistochastic form. According to Proposition 10.8, this tells us that we have:
Now by using the rotational invariance of the glow, and hence of its support, coming from Proposition 10.13, we obtain from this:
Thus, we are led to the conclusions in the statement.
The challenging question now is that of proving the above result, which comes from heavy symplectic geometry, by using standard probabilistic techniques. Indeed, as explained in chapter 9, in the context of the questions investigated there, the support of a real measure can be recaptured from the moments, by computing a limit. Thus, knowing the moments of the glow well enough would solve the problem.
Regarding these moments, the general formula is as follows:
For [math]H\in M_N(\mathbb T)[/math] the even moments of [math]|E|[/math] are given by
We have indeed the following computation:
Now since the integrals at right equal respectively the Kronecker symbols [math]\delta_{[i],[k]}[/math] and [math]\delta_{[j],[l]}[/math], we are led to the formula in the statement.
With this formula in hand, the main result, regarding the fact that the complex Hadamard matrices can be put in bistochastic form, reformulates as follows:
For a complex Hadamard matrix [math]H\in M_N(\mathbb T)[/math] we have
This follows from the well-known fact that the maximum of a bounded function [math]\Theta:X\to[0,\infty)[/math] can be recaptured via following formula:
We can use this estimate for the following function, over [math]X=\mathbb T^N\times\mathbb T^N[/math]:
We conclude that the limit in the statement is the square of the upper bound of the glow. But, according to Theorem 10.14, this upper bound is known to be [math]\leq N^3[/math] by Cauchy-Schwarz, and the equality holds by the results in [1].
To conclude now, the challenging question is that of finding a direct proof for Theorem 10.16. All this would provide an alternative aproach to the results in [1], which would be of course still not explicit, but which would use at least some more familiar tools. We will discuss such questions in chapter 11 below, with the remark however that the problems at [math]N\in\mathbb N[/math] fixed being quite difficult, we will do a [math]N\to\infty[/math] study only.
10d. Fourier matrices
Getting away now from these difficult questions, we have nothing concrete so far, besides the list of examples from Theorem 10.3, coming from the circulant matrix considerations in chapter 9. So, our purpose will be that of extending that list. A first natural question is that of looking at the Butson matrix case. To start with, we have:
Assuming that the Butson class [math]H_N(l)[/math] contains a bistochastic matrix, the equations
This is a reformulation of the following equality, from Proposition 10.5, regarding the row sums of a bistochastic Hadamard matrix:
Indeed, if we set [math]w=e^{2\pi i/l}[/math], and we denote by [math]a_i\in\mathbb N[/math] the number of [math]w^i[/math] entries appearing in the first row of our matrix, then the row sum of the matrix is given by:
Thus, we obtain the system of equations in the statement.
The point now is that, in practice, we are led precisely to the Turyn obstructions from chapter 9. At small values of [math]l[/math], the obstructions are as follows:
Assuming that [math]H_N(l)[/math] contains a bistochastic matrix, the following equations must have solutions, over the integers:
- [math]l=2[/math]: [math]4n^2=N[/math].
- [math]l=3[/math]: [math]x^2+y^2+z^2=2N[/math], with [math]x+y+z=0[/math].
- [math]l=4[/math]: [math]a^2+b^2=N[/math].
This follows indeed from the results that we have:
(1) This is something well-known, which follows from Proposition 10.17.
(2) This is best viewed by using Proposition 10.17, and the following formula, that we already know, from chapter 5 above:
At the level of the concrete obstructions, we must have for instance [math]5\!\!\not|N[/math]. Indeed, this follows as in the proof of the de Launey obstruction for [math]H_N(3)[/math] with [math]5|N[/math].
(3) This follows again from Proposition 10.17, and from [math]|a+ib|^2=a^2+b^2[/math].
As a conclusion, nothing much interesting is going on in the Butson matrix case, with various arithmetic obstructions, that we partly already met, appearing here. In order to reach, however, to a number of positive results, beyond those in Theorem 10.3, we can investigate various special classes of matrices, such as the Di\c t\u a products. In order to formulate our results, we will use the following notion:
We say that a complex Hadamard matrix [math]H\in M_N(\mathbb C)[/math] is in “almost bistochastic form” when all the row sums belong to [math]\sqrt{N}\cdot\mathbb T[/math].
Observe that, assuming that this condition holds, the matrix [math]H[/math] can be put in bistochastic form, just by multiplying its rows by suitable numbers from [math]\mathbb T[/math]. We will be particularly interested here in the special situation where the affine deformations [math]H^q\in M_N(\mathbb C)[/math] of a given complex Hadamard matrix [math]H\in M_N(\mathbb C)[/math] can be put in almost bistochastic form, independently of the value of the parameter [math]q[/math]. For the simplest deformations, namely those of [math]F_2\otimes F_2[/math], this is indeed the case, as shown by the following result:
The deformations of [math]F_2\otimes F_2[/math], with parameter matrix [math]Q=(^p_r{\ }^q_s)[/math],
By multiplying the columns of the matrix in the statement with [math]1,1,-1,1[/math] respectively, we obtain the following matrix:
The row sums of this matrix are as follows:
Thus, by multiplying by suitable scalars, namely the complex conjugates of these numbers, we can put our matrix in bistochastic form, as desired.
We will see later that [math]F_2\otimes''_QF_2[/math] is equivalent to a certain matrix [math]F_2\otimes'F_2[/math], which is part of a series [math]F_N\otimes'F_N[/math]. Now back to the general case, we have:
A deformed tensor product [math]H\otimes_QK[/math] can be put in bistochastic form when there exist numbers [math]x^i_a\in\mathbb T[/math] such that with
According to our tensor product conventions, the deformed tensor product [math]L=H\otimes_QK[/math] is given by the following formula:
By multiplying the columns by scalars [math]R_{jb}\in\mathbb T[/math], this matrix becomes:
The row sums of this matrix are given by:
Consider now the following variables:
In terms of these variables, the rows sums are given by:
Thus [math]H\otimes_QK[/math] can be put in bistochastic form when we can find scalars [math]R_{jb}\in\mathbb T[/math] and [math]x^i_a\in\mathbb T[/math] such that, with [math]C^i_b=Q_{ib}(HR)_{ib}[/math], the following condition is satisfied:
But this condition is equivalent to the following condition:
Now by multiplying to the left by [math]K^*[/math], we are led to the following condition:
Now by recalling that [math]C^i_b=Q_{ib}(HR)_{ib}[/math], this condition is equivalent to:
Consider now the variables in the statement, namely:
In terms of these variables, the above condition reads:
But this condition is equivalent to:
Now by multiplying to the left by [math]H^*[/math], we are led to the following condition:
Thus, we have obtained the condition in the statement.
As an illustration for the above result, assume that [math]H,K[/math] can be put in bistochastic form, by using vectors [math]y\in\mathbb T^M,z\in\mathbb T^N[/math], and let us set:
Then with the choice [math]Q=1[/math] for our parameter matrix, we have:
We therefore obtain the following formula:
Thus the usual tensor product [math]H\otimes K[/math] can be put in bistochastic form as well, which is of course something that we already know, from the above. Now back to the general case, that of the arbitrary Di\c t\u a deformations in Theorem 10.21, the point is that in the particular case [math]H=F_M[/math] the equations simplify, and we have the following result:
A deformed tensor product [math]F_M\otimes_QK[/math] can be put in bistochastic form when there exist numbers [math]x^i_a\in\mathbb T[/math] such that with
With notations from Theorem 10.21, and with [math]w=e^{2\pi i/M}[/math], we have:
The absolute value of this number can be computed as follows:
If we denote by [math]v^b_l[/math] the sum on the right, we obtain:
Now if we denote by [math]\xi[/math] the all-one vector in [math]\mathbb C^M[/math], the condition [math]|(H^*G)_{ib}|=\sqrt{MN}[/math] for any [math]i,b[/math] found in Theorem 10.21 reformulates as follows:
By multiplying to the left by [math]F_M^*/M[/math], this condition is equivalent to:
Let us examine the first equation, [math]v^b_0=MN[/math]. By definition of [math]v^b_l[/math], we have:
Now recall from Theorem 10.21 that we have, for certain numbers [math]x^j_b\in\mathbb T[/math]:
Since we have [math]Q_{jb}\in\mathbb T[/math] and [math]K^*/\sqrt{N}\in U_N[/math], we obtain:
Thus the [math]M\times N[/math] matrix [math]|G_{jb}|^2[/math] is row-stochastic, with sums [math]N^2[/math], and our equations [math]v^b_0=MN[/math] for any [math]b[/math] state that this matrix must be column-stochastic, with sums [math]MN[/math]. Regarding now the other equations that we found, namely [math]v^b_l=0[/math] for [math]l\neq0[/math], by definition of [math]v^b_l[/math] and of the variables [math]G_{jb}[/math], these state that we must have:
Thus, we are led to the conditions in the statement.
As an illustration for this result, let us go back to the [math]Q=1[/math] situation, explained after Theorem 10.21. By using the formula [math]G_{ib}=y_i(K^*z)_b[/math] there, we have:
Thus, if [math]K[/math] can be put in bistochastic form, then so can be put [math]F_M\otimes K[/math]. As a second illustration now, let us go back to the matrices [math]F_2\otimes'_QF_2[/math] from the proof of Proposition 10.20. For these matrices, the vector of the row sums is as follows:
Thus, with the above notations, we have the following formula:
We therefore obtain the following formulae for the upper entries of [math]G[/math]:
As for the lower entries of [math]G[/math], these are as follows:
Thus, in this case the matrix [math]G[/math] is as follows, independently of [math]Q[/math]:
In particular, we see that the conditions in Proposition 10.22 are satisfied. Now back to the general case, as a main application of our results so far, we have:
The Di\c t\u a deformations of tensor squares of Fourier matrices,
We use Proposition 10.22, with [math]M=N[/math], and with [math]K=F_N[/math]. Let [math]w=e^{2\pi i/N}[/math], and consider the vectors [math]x^i\in\mathbb T^N[/math] given by:
Since [math]K^*K=N1_N[/math], and [math]x^i[/math] are the column vectors of [math]K[/math], shifted by 1, we have:
We conclude that we have the following formula:
Thus the matrix [math]G[/math] is given by the following formula:
With this formula in hand, the sums in Proposition 10.22 are given by:
In the case [math]l\neq0[/math] we clearly get [math]0[/math], because the products of Kronecker symbols are [math]0[/math]. In the case [math]l=0[/math] the denominators are [math]|Q_{jb}|^2=1[/math], and we obtain:
Thus, the conditions in Proposition 10.21 are satisfied, and we obtain the result.
In relation with the various questions raised above, regarding the Di\c t\u a deformations of the Fourier matrices, this is best result that we have, so far. Here is an equivalent formulation of the above result, which is quite useful, in practice:
The matrix [math]F_N\otimes'_QF_N[/math], with [math]Q\in M_N(\mathbb T)[/math], defined by
Our claim is that this is the matrix constructed in the proof of Theorem 10.23. Indeed, let us first go back to the proof of Theorem 10.21. In the case [math]M=N[/math] and [math]H=K=F_N[/math], the Di\c t\u a deformation [math]L=H\otimes_QK[/math] studied there is given by:
As explained in the proof of Theorem 10.23, if the conditions in the statement there are satisfied, then the matrix [math]L_{ia,jb}'=R_{jb}L_{ia,jb}[/math] is almost bistochastic, where:
In our case now, [math]M=N[/math] and [math]H=K=F_N[/math], we know from the proof of Proposition 10.22 that the choice of [math]G[/math] which makes work Theorem 10.23 is as follows:
With this formula in hand, we can compute the matrix [math]R[/math], as follows:
Thus, the modified version of [math]F_N\otimes_QF_N[/math] which is almost bistochastic is given by:
Thus we have obtained the formula in the statement, and we are done.
As an illustration, let us work out the case [math]N=2[/math]. Here the root of unity is [math]w=-1[/math]. Let us denote the deformation matrix as follows:
With the notations [math]u=p/r[/math], [math]v=s/q[/math], we obtain the following matrix:
In general, the question of putting the Di\c t\u a deformations of the tensor products in explicit bistochastic form remains open. Open as well is the question of putting the arbitrary affine deformations of the Fourier matrices in explicit bistochastic form.
We would like to end this chapter by discussing a related interesting question, which can serve as a very good motivation for all this, namely the question on whether the real Hadamard matrices, [math]H\in M_N(\pm1)[/math], can be put or not in bistochastic form, in an explicit way. This is certainly true for the Walsh matrices, as explained before, but for the other basic examples, such as the Paley or the Williamson matrices, no results seem to be known so far. Having such a theory would be potentially very interesting, with a complex reformulation of the HC and of the other real Hadamard questions at stake.
We already know that we are done with the case [math]N\leq8[/math]. The next problem regards the Paley matrix at [math]N=12[/math], which is the unique real Hadamard matrix there:
This matrix is as follows, with the [math]\pm[/math] signs standing for [math]\pm1[/math] entries:
This matrix cannot be put of course in real bistochastic form, its size being not of the form [math]N=4n^2[/math]. Nor can it be put in bistochastic form over [math]\{\pm1,\pm i\}[/math], because the Turyn obstruction for matrices over [math]\{\pm1,\pm i\}[/math] is [math]N=a^2+b^2[/math], and we have:
However, the question of putting [math]P_{12}[/math] in bistochastic form over the 3-roots of unity makes sense, because the Turyn obstruction here is:
And, we do have solutions to these equations at [math]N=12[/math], as follows:
Another question is whether [math]P_{12}[/math] can be put in bistochastic form over the 8-roots of unity. In order to comment on this, let us first work out the Turyn obstruction, for the bistochastic matrices having as entries the 8-roots of unity. The result is as follows:
The Turyn obstruction for the bistochastic matrices having as entries the [math]8[/math]-roots of unity is
The 8-roots of unity are as follows, with [math]w=e^{\pi i/4}[/math]:
Thus, we are led to an equation as follows, with [math]x,y,z,t\in\mathbb Z[/math]:
We have the following computation:
Thus, we are led to the conclusion in the statement.
In relation with the above, the point now is that the equations in Proposition 10.25 do have solutions at [math]N=12[/math], namely:
Summarizing, the Paley matrix [math]P_{12}[/math] cannot be put in bistochastic form over the 4-roots, but the question makes sense over the 3-roots, and over the 8-roots. However, the computations here are not exactly trivial, and the answer is not known.
There are many interesting questions here, and as already mentioned above, the interest in this subject comes from the fact that all this can potentially lead to a complex reformulation of the HC and of the other real Hadamard matrix questions.
General references
Banica, Teo (2024). "Invitation to Hadamard matrices". arXiv:1910.06911 [math.CO].
References
- 1.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 M. Idel and M.M. Wolf, Sinkhorn normal form for unitary matrices, Linear Algebra Appl. 471 (2015), 76--84.
- 2.0 2.1 2.2 2.3 2.4 P. Biran, M. Entov and L. Polterovich, Calabi quasimorphisms for the symplectic ball, Commun. Contemp. Math. 6 (2004), 793--802.
- 3.0 3.1 3.2 3.3 3.4 C.H. Cho, Holomorphic discs, spin structures, and Floer cohomology of the Clifford torus, Int. Math. Res. Not. 35 (2004), 1803--1843.
- V.I. Arnold, Mathematical methods of classical mechanics, Springer (1974).
- H. Kharaghani and J. Seberry, The excess of complex Hadamard matrices, Graphs Combin. 9 (1993), 47--56.
- P.C. Fishburn and N.J.A. Sloane, The solution to Berlekamp's switching game, Discrete Math. 74 (1989), 263--290.
- R. Roth and K. Viswanathan, On the hardness of decoding the Gale-Berlekamp code, IEEE Trans. Inform. Theory 54 (2008), 1050--1060.