Geometry, defect
7a. Affine deformations
We have seen so far that some theory for the complex Hadamard matrices [math]H\in M_N(\mathbb T)[/math] can be developed with some inspiration for the real case, [math]H\in M_N(\pm1)[/math], by looking at the Butson matrix case, [math]H\in M_N(\mathbb Z_l)[/math] with [math]l \lt \infty[/math]. However, all this root of unity business ultimately leads into questions of HC flavor, and to put it squarely, wrong way. In this chapter we take a radically different approach to the study of the complex Hadamard matrices. Let us recall that the complex Hadamard manifold appears as:
This intersection is far from being smooth, and given a point [math]H\in X_N[/math], the problem is that of understanding the structure of [math]X_N[/math] around [math]H[/math], which is often singular. And this is what we will do, real algebraic geometry, for studying [math]X_N[/math] and its singularities. For this purpose, let us begin with some notations. We denote by [math]X_p[/math] an unspecified neighborhood of a point in a manifold, [math]p\in X[/math]. Also, for [math]q\in\mathbb T_1[/math], meaning that [math]q\in\mathbb T[/math] is close to [math]1[/math], we define [math]q^r[/math] with [math]r\in\mathbb R[/math] by [math](e^{it})^r=e^{itr}[/math]. With these conventions, we have:
For [math]H\in X_N[/math] and [math]A\in M_N(\mathbb R)[/math], the following are equivalent:
- The following is an Hadamard matrix, for any [math]q\in\mathbb T_1[/math]:
[[math]] H_{ij}^q=H_{ij}q^{A_{ij}} [[/math]]
- The following equations hold, for any [math]i\neq j[/math] and any [math]q\in\mathbb T_1[/math]:
[[math]] \sum_kH_{ik}\bar{H}_{jk}q^{A_{ik}-A_{jk}}=0 [[/math]]
- The following equations hold, for any [math]i\neq j[/math] and any [math]\varphi:\mathbb R\to\mathbb C[/math]:
[[math]] \sum_kH_{ik}\bar{H}_{jk}\varphi(A_{ik}-A_{jk})=0 [[/math]]
- For any [math]i\neq j[/math] and any [math]r\in\mathbb R[/math], with [math]E_{ij}^r=\{k|A_{ik}-A_{jk}=r\}[/math], we have:
[[math]] \sum_{k\in E_{ij}^r}H_{ik}\bar{H}_{jk}=0 [[/math]]
If these conditions are satisfied, we call the matrix [math]H^q[/math] an affine deformation of [math]H[/math].
These equivalences are all elementary, and can be proved as follows:
[math](1)\iff(2)[/math] Indeed, the scalar products between the rows of [math]H^q[/math] are:
[math](2)\implies(4)[/math] This follows from the following formula, and from the fact that the power functions [math]\{q^r|r\in\mathbb R\}[/math] over the unit circle [math]\mathbb T[/math] are linearly independent:
[math](4)\implies(3)[/math] This follows from the following formula:
[math](3)\implies(2)[/math] This simply follows by taking [math]\varphi(r)=q^r[/math].
In order to understand the above deformations, which are “affine” in a certain sense, as suggested at the end of the statement, it is convenient to enlarge the attention to all types of deformations. We keep using the neighborhood notation [math]X_p[/math] introduced above, and we consider functions of type [math]f:X_p\to Y_q[/math], which by definition satisfy [math]f(p)=q[/math]. We have the following definition, further clarifying the terminology in Proposition 7.1:
Let [math]H\in M_N(\mathbb C)[/math] be a complex Hadamard matrix.
- A deformation of [math]H[/math] is a smooth function [math]f:\mathbb T_1\to (X_N)_H[/math].
- The deformation is called “affine” if [math]f_{ij}(q)=H_{ij}q^{A_{ij}}[/math], with [math]A\in M_N(\mathbb R)[/math].
- We call “trivial” the deformations of type [math]f_{ij}(q)=H_{ij}q^{a_i+b_j}[/math], with [math]a,b\in\mathbb R^N[/math].
Here the adjective “affine”, which is used in the same context as in Proposition 7.1, comes from the formula [math]f_{ij}(e^{it})=H_{ij}e^{iA_{ij}t}[/math], because the function [math]t\to A_{ij}t[/math] which produces the exponent is indeed affine. As for the adjective “trivial”, this comes from the fact that the affine deformations of type [math]f(q)=(H_{ij}q^{a_i+b_j})_{ij}[/math] are obtained from [math]H[/math] by multiplying the rows and columns by certain numbers in [math]\mathbb T[/math], so are automatically Hadamard.
The basic example of an affine deformation comes from the Di\c t\u a deformations [math]H\otimes_QK[/math], by taking all parameters [math]q_{ij}\in\mathbb T[/math] to be powers of [math]q\in\mathbb T[/math]. As an example, here are the exponent matrices coming from the left and right Di\c t\u a deformations of [math]F_2\otimes F_2[/math]:
There are of course many other examples, which are less trivial, as for instance the Haagerup matrix, that we met in chapters 5-6, which is as follows:
Observe that this is indeed an affine deformation of [math]H_6=H_6^1[/math], in the sense of Definition 7.2 (2), the corresponding matrix of exponents being as follows:
We will see that there are many other interesting examples of affine deformations, and that some general theory for such deformations can be developed. In order to investigate now the above types of deformations, we will use the corresponding tangent vectors. So, let us recall that the complex Hadamard matrix manifold [math]X_N[/math] is given by:
This observation leads to the following definition, where in the first part we denote by [math]T_pX[/math] the tangent space to a point in a smooth manifold, [math]p\in X[/math]:
Associated to a point [math]H\in X_N[/math] are the following objects:
- The enveloping tangent space: [math]\widetilde{T}_HX_N=T_HM_N(\mathbb T)\cap T_H\sqrt{N}U_N[/math].
- The tangent cone [math]T_HX_N[/math]: the set of tangent vectors to the deformations of [math]H[/math].
- The affine tangent cone [math]T_H^\circ X_N[/math]: same as above, using affine deformations only.
- The trivial tangent cone [math]T_H^\times X_N[/math]: as above, using trivial deformations only.
Observe that [math]\widetilde{T}_HX_N,T_H^\times X_N[/math] are real linear spaces, and that [math]T_HX_N,T_H^\circ X_N[/math] are two-sided cones, in the sense that they satisfy the following condition:
Observe also that we have inclusions of cones, as follows:
In more algebraic terms now, these various tangent cones are best described by the corresponding matrices, and we have here the following result:
The cones [math]T_H^\times X_N\subset T_H^\circ X_N\subset T_HX_N\subset\widetilde{T}_HX_N[/math] are as follows:
- [math]\widetilde{T}_HX_N[/math] can be identified with the linear space formed by the matrices [math]A\in M_N(\mathbb R)[/math] satisfying the following condition:
[[math]] \sum_kH_{ik}\bar{H}_{jk}(A_{ik}-A_{jk})=0 [[/math]].
- [math]T_HX_N[/math] consists of those matrices [math]A\in M_N(\mathbb R)[/math] appearing as [math]A_{ij}=g_{ij}'(0)[/math], where [math]g:M_N(\mathbb R)_0\to M_N(\mathbb R)_0[/math] satisfies:
[[math]] \sum_kH_{ik}\bar{H}_{jk}e^{i(g_{ik}(t)-g_{jk}(t))}=0 [[/math]]
- [math]T^\circ_HX_N[/math] is formed by the matrices [math]A\in M_N(\mathbb R)[/math] satisfying the following condition, for any [math]i\neq j[/math] and any [math]q\in\mathbb T[/math]:
[[math]] \sum_kH_{ik}\bar{H}_{jk}q^{A_{ik}-A_{jk}}=0 [[/math]]
- [math]T^\times_HX_N[/math] is formed by the matrices [math]A\in M_N(\mathbb R)[/math] which are of the form [math]A_{ij}=a_i+b_j[/math], for certain vectors [math]a,b\in\mathbb R^N[/math].
All these assertions can be deduced by using basic differential geometry:
(1) This result is well-known, the idea being as follows. First, [math]M_N(\mathbb T)[/math] is defined by the algebraic relations [math]|H_{ij}|^2=1[/math], and with [math]H_{ij}=X_{ij}+iY_{ij}[/math] we have:
Consider now an arbitrary vector [math]\xi\in T_HM_N(\mathbb C)[/math], written as follows:
This vector belongs then to the tangent space [math]T_HM_N(\mathbb T)[/math] if and only if we have:
We therefore obtain the following formula, for the tangent cone:
We also know that the rescaled unitary group [math]\sqrt{N}U_N[/math] is defined by the following algebraic relations, where [math]H_1,\ldots,H_N[/math] are the rows of [math]H[/math]:
The relations [math] \lt H_i,H_i \gt =N[/math] being automatic for the matrices [math]H\in M_N(\mathbb T)[/math], if for [math]i\neq j[/math] we let [math]L_{ij}= \lt H_i,H_j \gt [/math], then we have:
On the other hand, differentiating the formula of [math]L_{ij}[/math] gives:
Now if we pick [math]\xi\in T_HM_N(\mathbb T)[/math], written as above in terms of [math]A\in M_N(\mathbb R)[/math], we obtain:
Thus we have reached to the description of [math]\widetilde{T}_HX_N[/math] in the statement.
(2) We pick an arbitrary deformation, written as [math]f_{ij}(e^{it})=H_{ij}e^{ig_{ij}(t)}[/math]. Observe first that the Hadamard condition corresponds to the equations in the statement, namely:
Observe also that by differentiating this formula at [math]t=0[/math], we obtain:
Thus the matrix [math]A_{ij}=g_{ij}'(0)[/math] belongs indeed to [math]\widetilde{T}_HX_N[/math], so we obtain in this way a certain map, as follows:
In order to check that this map is indeed the correct one, we have to verify that, for any [math]i,j[/math], the tangent vector to our deformation is given by:
But this latter verification is just a one-variable problem. So, by dropping all [math]i,j[/math] indices, which is the same as assuming [math]N=1[/math], we have to check that for any point [math]H\in\mathbb T[/math], written [math]H=X+iY[/math], the tangent vector to the deformation [math]f(e^{it})=He^{ig(t)}[/math] is:
But this is clear, because the unit tangent vector at [math]H\in\mathbb T[/math] is [math]\eta=-i(Y\dot{X}-X\dot{Y})[/math], and its coefficient coming from the deformation is:
(3) Observe first that by taking the derivative at [math]q=1[/math] of the condition (2) in Proposition 7.1, of just by using the condition (3) there with the function [math]\varphi(r)=r[/math], we get:
Thus we have a map [math]T_H^\circ X_N\to\widetilde{T}_HX_N[/math], and the fact that is map is indeed the correct one comes for instance from the computation in (2), with [math]g_{ij}(t)=A_{ij}t[/math].
(4) Observe first that the Hadamard matrix condition is satisfied, because:
As for the fact that [math]T_H^\times X_N[/math] is indeed the space in the statement, this is clear.
Let [math]Z_N\subset X_N[/math] be the real algebraic manifold formed by all the dephased [math]N\times N[/math] complex Hadamard matrices. Observe that we have a quotient map [math]X_N\to Z_N[/math], obtained by dephasing. With this notation, we have the following refinement of (4) above:
We have a direct sum decomposition of cones
If we denote by [math]M_N^\circ(\mathbb R)[/math] the set of matrices having [math]0[/math] outside the first row and column, we have a direct sum decomposition, as follows:
Now by looking at the affine cones, and using Theorem 7.4, this gives the result.
Summarizing, we have so far a number of theoretical results about the tangent cones [math]T_HX_N[/math] that we are interested in, and their versions coming from the trivial and affine deformations, and from the intersection formula [math]X_N=M_N(\mathbb T)\cap\sqrt{N}U_N[/math] as well. In practice now, passed a few special cases where all these cones collapse to the trivial cone [math]T_N^\times X_N[/math], which by Proposition 7.5 means that the image of [math]H\in X_N[/math] must be isolated in the dephased manifold [math]X_N\to Z_N[/math], things are quite difficult to compute. However, as a concrete numerical invariant arising from all this, which can be effectively computed in many cases of interest, we have, following Tadej-\.Zyczkowski [1]:
The real dimension [math]d(H)[/math] of the enveloping tangent space
In view of Proposition 7.5, it is sometimes convenient to replace [math]d(H)[/math] by the following related quantity, also introduced in [1], and called dephased defect of [math]H[/math]:
In what follows we will rather use the quantity [math]d(H)[/math] defined before, which behaves better with respect to a number of operations, and simply call it “defect” of [math]H[/math]. We already know, from Theorem 7.4, what is the precise geometric meaning of the defect, and how to compute it. Let us record again these results, that we will use many times in what follows, in a slightly different form, closer to the spirit of [1]:
The defect [math]d(H)[/math] is the real dimension of the linear space
Here the first assertion is something that we already know, from Theorem 7.4 (1), and the second assertion follows either from Theorem 7.4 and its proof, or directly from the definition of the enveloping tangent space [math]\widetilde{T}_HX_N[/math], as used in Definition 7.6.
Still following [1], here are a few basic properties of the defect:
Let [math]H\in X_N[/math] be a complex Hadamard matrix.
- If [math]H\simeq\widetilde{H}[/math] then [math]d(H)=d(\widetilde{H})[/math].
- We have [math]2N-1\leq d(H)\leq N^2[/math].
- If [math]d(H)=2N-1[/math], the image of [math]H[/math] in the dephased manifold [math]X_N\to Z_N[/math] is isolated.
All these results are elementary, the proof being as follows:
(1) If we let [math]K_{ij}=a_ib_jH_{ij}[/math] with [math]|a_i|=|b_j|=1[/math] be a trivial deformation of our matrix [math]H[/math], the equations for the enveloping tangent space for [math]K[/math] are:
By simplifying we obtain the equations for [math]H[/math], so [math]d(H)[/math] is invariant under trivial deformations. Since [math]d(H)[/math] is invariant as well by permuting rows or columns, we are done.
(2) Consider the inclusions [math]T_H^\times X_N\subset T_HX_N\subset\widetilde{T}_HX_N[/math]. Since [math]\dim(T_H^\times X_N)=2N-1[/math], the inequality at left holds indeed. As for the inequality at right, this is clear.
(3) If [math]d(H)=2N-1[/math] then [math]T_HX_N=T_H^\times X_N[/math], so any deformation of [math]H[/math] is trivial. Thus the image of [math]H[/math] in the quotient manifold [math]X_N\to Z_N[/math] is indeed isolated, as stated.
7b. Defect computations
As an illustration for the above notions, let us discuss now the computation of the defect for the most basic examples of complex Hadamard matrices that we know, namely the real ones, and the Fourier ones. In order to deal with the real case, it is convenient to modify the general formula from Theorem 7.7, via a change of variables, as follows:
We have a linear space isomorphism as follows,
Given a matrix [math]A\in M_N(\mathbb C)[/math], if we set [math]R_{ij}=A_{ij}H_{ij}[/math] and [math]E=RH^*[/math], the correspondence [math]A\to R\to E[/math] is then bijective onto [math]M_N(\mathbb C)[/math], and we have:
In terms of these new variables, the equations in Theorem 7.7 become:
Thus, when taking into account these conditions, we are simply left with the conditions [math]A_{ij}\in\mathbb R[/math]. But these correspond to the conditions [math](EH)_{ij}\bar{H}_{ij}\in\mathbb R[/math], as claimed.
With the above result in hand, we can now compute the defect of the real Hadamard matrices. The result here, from Szöllősi [2], is as follows:
For any real Hadamard matrix [math]H\in M_N(\pm1)[/math] we have
We use Proposition 7.9. Since [math]H[/math] is now real the condition [math](EH)_{ij}\bar{H}_{ij}\in\mathbb R[/math] there simply tells us that [math]E[/math] must be real, and this gives the result.
As another computation now, let us discuss the case [math]N=4[/math]. Here we know from chapter 5 that the only complex Hadamard matrices are, up to equivalence, the Di\c t\u a deformations of [math]F_4[/math]. To be more precise, we have the following result:
The complex Hadamard matrices at [math]N=4[/math] are, up to equivalence, the following matrices, appearing as Di\c t\u a deformations of [math]F_4[/math]:
- At [math]q=1[/math] we have [math]F_{2,2}^q=F_2\otimes F_2[/math].
- At [math]q=-1[/math] we have [math]F_{2,2}^q\simeq F_2\otimes F_2[/math].
- At [math]q=\pm i[/math] we have [math]F_{2,2}^q\simeq F_4[/math].
The first assertion is something that we already know, from chapter 5. Regarding now the [math]q=1,i,-1,-i[/math] specializations, the situation here is as follows:
(1) This is clear from definitions.
(2) This follows from (1), by permuting the third and the fourth columns:
(3) This follows from the following computation:
Here we have interchanged the second column with the third one in the case [math]q=i[/math], and we have used a cyclic permutation of the last 3 columns in the case [math]q=-i[/math].
Let us compute now the defect of the above matrices. We will work out everything in detail, as an illustration for how the equations in Theorem 7.7 work. The result is:
The defect of the [math]4\times4[/math] complex Hadamard matrices is given by
Our starting point are the equations in Theorem 7.7, namely:
Since the [math]i \gt j[/math] equations are equivalent to the [math]i \lt j[/math] ones, and the [math]i=j[/math] equations are trivial, we just have to write down the equations corresponding to indices [math]i \lt j[/math]. And, with [math]ij=01,02,03,12,13,23[/math], these equations are:
Assume first [math]q\neq\pm 1[/math]. Then [math]q[/math] is not real, and appears in 4 of the above equations. But these 4 equations can be written in the following way:
Now since the unknowns are real, and [math]q[/math] is not, we conclude that the terms between braces in the left part must be all equal, and that the same must happen at right:
Thus, the equations involving [math]q[/math] tell us that [math]A[/math] must be of the following form:
Let us plug now these values in the remaining 2 equations. We obtain:
Thus we must have [math]a+g=c+e[/math] and [math]b+h=d+f[/math], which are independent conditions. We conclude that the dimension of the space of solutions is [math]10-2=8[/math], as claimed.
Assume now [math]q=\pm 1[/math]. For simplicity we set [math]q=1[/math], and we compute the dephased defect. The dephased equations, obtained by setting [math]A_{i0}=A_{0j}=0[/math] in our system, are:
The first three equations tell us that our matrix must be of the following form:
Now by plugging these values in the last three equations, these become:
Thus we must have [math]a=c[/math], [math]b=f[/math], [math]d=e[/math], and since these conditions are independent, the dephased defect is 3, and so the undephased defect is [math]3+7=10[/math], as claimed.
In general, the defect computation for the Di\c t\u a deformations, of even for the usual tensor products, is a difficult question. We will be back to this in chapter 8 below.
7c. Fourier matrices
Let us discuss now a fundamental question, namely the computation of the defect of the Fourier matrix [math]F_G[/math]. The main idea here goes back to a 1989 preprint of Karabegov [3], with some supplementary contributions from Nicoara [4], in 2006, and then the main formula, in the cyclic group case, was obtained by Tadej-\.Zyczkowski in [1], and the corresponding deformations of [math]F_G[/math] were studied by Nicoara-White in [5]. As a first result on this subject, we have, following Tadej-\.Zyczkowski [1]:
For a Fourier matrix [math]F=F_G[/math], the matrices [math]A\in\widetilde{T}_FX_N[/math] with [math]N=|G|[/math], are those of the form [math]A=PF^*[/math], with [math]P\in M_N(\mathbb C)[/math] satisfying
We use the system of equations in Theorem 7.7, namely:
By decomposing our finite abelian group as [math]G=\mathbb Z_{N_1}\times\ldots\times\mathbb Z_{N_r}[/math] we can assume:
Thus with [math]w_k=e^{2\pi i/k}[/math] we have the following formula:
With [math]N=N_1\ldots N_r[/math] and [math]w=e^{2\pi i/N}[/math], we obtain the following formula:
Thus the matrix of our system of equations is given by:
Now by plugging in a multi-indexed matrix [math]A[/math], our system becomes:
Now observe that in the above formula we have in fact two matrix multiplications, so our system can be simply written as:
Now recall that our indices have a “cyclic” meaning, so they belong in fact to the group [math]G[/math]. So, with [math]P=AF[/math], and by using multi-indices, our system is simply:
With [math]i=I+J,j=I[/math] we obtain the condition [math]P_{I+J,J}=P_{IJ}[/math] in the statement. In addition, [math]A=PF^*[/math] must be a real matrix. But, if we set [math]\tilde{P}_{ij}=\bar{P}_{i,-j}[/math], we have:
Thus we have [math]\overline{PF^*}=\tilde{P}F^*[/math], so the fact that the matrix [math]PF^*[/math] is real, which means by definition that we have [math]\overline{PF^*}=PF^*[/math], can be reformulated as [math]\tilde{P}F^*=PF^*[/math], and hence as [math]\tilde{P}=P[/math]. So, we obtain the conditions [math]P_{ij}=\bar{P}_{i,-j}[/math] in the statement.
We can now compute the defect, and we are led to the following formula:
The defect of a Fourier matrix [math]F_G[/math] is given by
According to the formula [math]A=PF^*[/math] from Theorem 7.13, the defect [math]d(F_G)[/math] is the dimension of the real vector space formed by the matrices [math]P\in M_N(\mathbb C)[/math] satisfying:
Here, and in what follows, the various indices [math]i,j,\ldots[/math] will be taken in [math]G[/math]. Now the point is that, in terms of the columns of our matrix [math]P[/math], the above conditions are:
(1) The entries of the [math]j[/math]-th column of [math]P[/math], say [math]C[/math], must satisfy [math]C_i=C_{i+j}[/math].
(2) The [math](-j)[/math]-th column of [math]P[/math] must be conjugate to the [math]j[/math]-th column of [math]P[/math].
Thus, in order to count the above matrices [math]P[/math], we can basically fill the columns one by one, by taking into account the above conditions. In order to do so, consider the subgroup [math]G_2=\{j\in G|2j=0\}[/math], and then write [math]G[/math] as a disjoint union, as follows:
With this notation, the algorithm is as follows. First, for any [math]j\in G_2[/math] we must fill the [math]j[/math]-th column of [math]P[/math] with real numbers, according to the periodicity rule:
Then, for any [math]j\in X[/math] we must fill the [math]j[/math]-th column of [math]P[/math] with complex numbers, according to the same periodicity rule [math]C_i=C_{i+j}[/math]. And finally, once this is done, for any [math]j\in X[/math] we just have to set the [math](-j)[/math]-th column of [math]P[/math] to be the conjugate of the [math]j[/math]-th column.
So, let us compute the number of choices for filling these columns. Our claim is that, when uniformly distributing the choices for the [math]j[/math]-th and [math](-j)[/math]-th columns, for [math]j\notin G_2[/math], there are exactly [math][G: \lt j \gt ][/math] choices for the [math]j[/math]-th column, for any [math]j[/math]. Indeed:
(1) For the [math]j[/math]-th column with [math]j\in G_2[/math] we must simply pick [math]N[/math] real numbers subject to the condition [math]C_i=C_{i+j}[/math] for any [math]i[/math], so we have indeed [math][G: \lt j \gt ][/math] such choices.
(2) For filling the [math]j[/math]-th and [math](-j)[/math]-th column, with [math]j\notin G_2[/math], we must pick [math]N[/math] complex numbers subject to the condition [math]C_i=C_{i+j}[/math] for any [math]i[/math]. Now since there are [math][G: \lt j \gt ][/math] choices for these numbers, so a total of [math]2[G: \lt j \gt ][/math] choices for their real and imaginary parts, on average over [math]j,-j[/math] we have [math][G: \lt j \gt ][/math] choices, and we are done again.
Summarizing, the dimension of the vector space formed by the matrices [math]P[/math], which is equal to the number of choices for the real and imaginary parts of the entries of [math]P[/math], is:
But this is exactly the number in the statement. Regarding now the second assertion, according to the definition of [math]F_G[/math], the number of [math]1[/math] entries of [math]F_G[/math] is given by:
Thus, the second assertion follows from the first one.
Let us finish now the work, and explicitely compute the defect of [math]F_G[/math]. It is convenient to consider the following quantity, which behaves better:
As a first example, consider a cyclic group [math]G=\mathbb Z_N[/math], with [math]N=p^a[/math] power of a prime. The count here is very simple, over sets of elements having a given order:
In order to extend this kind of count to the general abelian case, we use two ingredients. First is the following result, which splits the computation over isotypic components:
For any finite groups [math]G,H[/math] we have:
Indeed, we have the following estimate, coming from definitions:
Regarding the last assertion, in the case [math](|G|,|H|)=1[/math], the least common multiple appearing on the right becomes a product:
Thus, we have equality in this case, as desired.
We deduce from this that we have the following result:
For a finite abelian group [math]G[/math] we have
This is clear from Proposition 7.15, the order of [math]G_p[/math] being a power of [math]p[/math].
As an illustration for the above results, we can recover in this way the following key defect computation, from Tadej-\.Zyczkowski [1]:
The defect of a usual Fourier matrix [math]F_N[/math] is given by
The underlying group here is the cyclic group [math]G=\mathbb Z_N[/math], whose isotypic components are the following cyclic groups:
By applying now Proposition 7.16, and by using the computation for cyclic [math]p[/math]-groups performed before Proposition 7.15, we obtain:
But this is exactly the formula in the statement.
Now back to the general case, where we have an arbitrary Fourier matrix [math]F_G[/math], we will need, as a second ingredient for our computation, the following result:
For the [math]p[/math]-groups, the quantities
Indeed, for a product of [math]p[/math]-groups we have:
We recognize at right [math]c_k(G)c_k(H)[/math], and we are done.
Let us compute now [math]\delta[/math] in the general isotypic case. We have here:
For [math]G=\mathbb Z_{p^{a_1}}\times\ldots\times\mathbb Z_{p^{a_r}}[/math] with [math]a_1\leq a_2\leq\ldots\leq a_r[/math] we have
First, in terms of the numbers [math]c_k[/math], we have the following formula:
In the case of a cyclic group [math]G=\mathbb Z_{p^a}[/math] we have [math]c_k=p^{\min(k,a)}[/math]. Thus, in the general isotypic case [math]G=\mathbb Z_{p^{a_1}}\times\ldots\times\mathbb Z_{p^{a_r}}[/math] we have the following formula:
Now observe that the exponent on the right is a piecewise linear function of [math]k[/math]. More precisely, by assuming [math]a_1\leq a_2\leq\ldots\leq a_r[/math] as in the statement, the exponent is linear on each of the intervals [math][0,a_1],[a_1,a_2],\ldots,[a_{r-1},a_r][/math]. So, the quantity [math]\delta(G)[/math] to be computed will be 1 plus the sum of [math]2r[/math] geometric progressions, 2 for each interval.
In practice now, the numbers [math]c_k[/math] are as follows:
Now by separating the positive and negative terms in the above formula of [math]\delta(G)[/math], we have indeed [math]2r[/math] geometric progressions to be summed, as follows:
Now by performing all the sums, we obtain the following formula:
By looking now at the general term, we get the formula in the statement.
Let us go back now to the general defect formula in Theorem 7.14. By putting it together with the various results above, we obtain:
For a finite abelian group [math]G[/math], decomposed as [math]G=\times_pG_p[/math], we have
Indeed, we know from Theorem 7.14 that we have:
The result follows then from Proposition 7.16 and Proposition 7.19.
As a first illustration, we can recover in this way the formula in Theorem 7.17. Indeed, assuming that [math]N=p_1^{a_1}\ldots p_s^{a_s}[/math] is the decomposition of [math]N[/math] into prime factors, we have:
As a second illustration, for the group [math]G=\mathbb Z_{p^{a_1}}\times\mathbb Z_{p^{a_2}}[/math] with [math]a_1\leq a_2[/math] we obtain:
Finally, let us mention that for general non-abelian groups, there does not seem to be any reasonable algebraic formula for the quantity [math]\delta(G)[/math]. As an example, consider the dihedral group [math]D_N[/math], consisting of [math]N[/math] symmetries and [math]N[/math] rotations. We have:
Now remember the formula for [math]\mathbb Z_N[/math] established above, namely:
It is quite clear that the [math]N/2[/math] factor can not be incorporated in any nice way, and so, as indicated above, the quantity [math]\delta(G)[/math] remains something quite complicated.
7d. Explicit deformation
Let us discuss now, following the paper of Nicoara and White [5], the key fact that for the Fourier matrices the defect is “attained”, in the sense that the deformations at order 0 are true deformations, at order [math]\infty[/math]. This is something quite surprising, and non-trivial. Let us begin with some generalities. We first recall that we have:
The unitary matrices [math]U\in U_N[/math] around [math]1[/math] are of the form
This is something well-known. Indeed, assuming that a matrix [math]A[/math] is antihermitian, [math]A=-A^*[/math], the matrix [math]U=e^A[/math] follows to be unitary:
As for the converse, this follows either by using a dimension argument, which shows that the space of antihermitian matrices is the correct one, or by diagonalizing [math]U[/math].
Now back to the Hadamard matrices, we will need to rewrite a part of the basic theory of the defect, using deformations of type [math]t\to U_tH[/math]. First, we have:
Assume that [math]H\in M_N(\mathbb C)[/math] is Hadamard, let [math]A\in M_N(\mathbb C)[/math] be antihermitian, and consider the matrix [math]UH[/math], where [math]U=e^{tA}[/math], with [math]t\in\mathbb R[/math].
- [math]UH[/math] is Hadamard when, for any [math]p,q[/math]:
[[math]] |\sum_{rs}H_{rq}\bar{H}_{sq}(e^{tA})_{pr}(e^{-tA})_{sp}|=1 [[/math]]
- [math]UH[/math] is Hadamard at order [math]0[/math] when, for any [math]p,q[/math]:
[[math]] |(AH)_{pq}|=1 [[/math]]
We already know that [math]UH[/math] is unitary, so we must find the conditions which guarantee that we have [math]UH\in M_N(\mathbb T)[/math], in general, and then at order 0.
(1) We have the following computation, valid for any unitary [math]U[/math]:
Now with [math]U=e^{tA}[/math] as in the statement, we obtain:
Thus, we are led to the conclusion in the statement.
(2) The derivative of the function computed above, taken at [math]0[/math], is as follows:
Thus, we are led to the conclusion in the statement.
In the Fourier matrix case we can go beyond this, and we have:
Given a Fourier matrix [math]F_G\in M_G(\mathbb C)[/math], and an antihermitian matrix [math]A\in M_G(\mathbb C)[/math], the matrix [math]H=UF_G[/math], where [math]U=e^{tA}[/math] with [math]t\in\mathbb R[/math], is Hadamard when
According to the formula in the proof of Theorem 7.22 (1), we have:
By setting [math]n=r-s[/math], can write this formula in the following way:
Since this quantity must be 1 for any [math]q[/math], we must have:
On the other hand, we have the following computation:
Thus, we are led to the conclusion in the statement.
Following Nicoara-White [5], let us construct now the deformations of [math]F_G[/math]. The result here, which came a long time after the original defect paper of Tadej-\.Zyczkowski [1], and even more time after the early computations of Karabegov [3], appearing somewhat as a total surprise, puzzling all known experts at that time, is as follows:
Let [math]G[/math] be a finite abelian group, and for any [math]g,h\in G[/math], let us set:
The proof of this result, from [5], is quite long and technical, based on the Fourier computation from Proposition 7.23, the idea being as follows:
(1) First of all, an elementary algebraic study shows that when [math](g,h)\in G^2[/math] range in some suitable cosets, coming from the proof of Theorem 7.14, the various matrices [math]B=B^{gh}[/math] constructed above are distinct, the matrices [math]A=i(B+B^t)[/math] and [math]A'=B-B^t[/math] are linearly independent, and the number of such matrices equals the defect of [math]F_G[/math].
(2) It is also standard to check that each [math]B=(B_{pq})[/math] is a partial isometry, and that [math]B^k,B^{*k}[/math] are given by simple formulae. With this ingredients in hand, the Hadamard property follows from the Fourier computation from the proof of Proposition 7.23. Indeed, we can compute the exponentials there, and eventually use the binomial formula.
(3) Finally, the matrices in the statement can be shown to be non-equivalent, and this is something more technical, for which we refer to [5]. With this last ingredient in hand, a comparison with Theorem 7.14 shows that the defect of [math]F_G[/math] is indeed attained, in the sense that all order 0 deformations are actually true deformations. See [5].
Finally, let us mention that the paper of Nicoara-White [5] was written in terms of subfactor-theoretic commuting squares, which is a quite technical operator algebra notion, and with a larger class of commuting squares being actually under investigation.
We will discuss a bit the relation between Hadamard matrices and commuting squares in chapter 14 below, but in what regards the Nicoara-White theorem, which is the main known theorem regarding the geometry of the complex Hadamard matrices, this definitely remains something to be learned, from their paper [5] and their follow-up papers, which are quite technical, and that we would like however to warmly recommend here.
General references
Banica, Teo (2024). "Invitation to Hadamard matrices". arXiv:1910.06911 [math.CO].
References
- 1.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 W. Tadej and K. \.Zyczkowski, Defect of a unitary matrix, Linear Algebra Appl. 429 (2008), 447--481.
- F. Szöllősi, Parametrizing complex Hadamard matrices, European J. Combin. 29 (2008), 1219--1234.
- 3.0 3.1 A. Karabegov, The reconstruction of a unitary matrix from the moduli of its elements and symbols on a finite phase space (1989).
- R. Nicoara, A finiteness result for commuting squares of matrix algebras, J. Operator Theory 55 (2006), 295--310.
- 5.0 5.1 5.2 5.3 5.4 5.5 5.6 5.7 R. Nicoara and J. White, Analytic deformations of group commuting squares and complex Hadamard matrices, J. Funct. Anal. 272 (2017), 3486--3505.