Quick search Find article
Quick search
Find article
J. Phys. A: Math. Gen. 39 No 4 (27 January 2006) L77-L84
doi:10.1088/0305-4470/39/4/L01
PII: S0305-4470(06)08570-2

LETTER TO THE EDITOR

Diagonal approximation of the form factor of the unitary group

G Berkolaiko

Department of Mathematics, Texas A&M University, College Station, TX 77840-3368, USA

Email: Gregory.Berkolaiko@math.tamu.edu

Received 21 September 2005, in final form 15 November 2005
Published 11 January 2006

Abstract. The form factor of the unitary group U(N) endowed with the Haar measure characterizes the correlations within the spectrum of a typical unitary matrix. It can be decomposed into a sum over pairs of `periodic orbits', where by periodic orbit we understand any sequence of matrix indices. From here the diagonal approximation can be defined in the usual fashion as a sum only over pairs of identical orbits. We prove that as we take the dimension N to infinity, the diagonal approximation becomes `exact', that is converges to the full form factor in the interval τ  in [0, 1].

PACS numbers: 05.45.Mt, 02.10.De, 02.50.Ga, 05.10.Gg, 02.70.Hm

1. Introduction

One of the central questions in the quantum theory of classically chaotic systems is whether the fluctuations in the spectra follow the predictions of random matrix theory (RMT). The spectral form factor, which is the Fourier transform of the two-point correlation function, is often used to characterize these fluctuations.

The advantage of the form factor is that it has a convenient expansion in terms of pairs of periodic orbits of the classical system. This expansion is an application of trace formulae (see [1] for a review) and has been a starting point for many investigations. The progress of understanding the role of periodic orbits in the universal behaviour of the form factor of different systems is marked by such milestones as the diagonal approximation [2], the first off-diagonal contribution [3, 4] and, most recently, an outline of the complete expansion [5]. Quantum graphs were also used [6] as a mathematical toy model with a simple-to-understand periodic orbit structure and an exact trace formula.

In a recent paper, Degli Esposti and Knauf [7] proposed using the ultimate mathematical toy model for the random matrix conjecture - the random matrix theory itself. The authors argued that if one can perform a step-by-step expansion of the RMT form factor in terms of families of periodic orbits, it would greatly facilitate analogous expansions for other systems.

Degli Esposti and Knauf [7] studied the diagonal approximation to the form factor of one of the random matrix ensembles, the circular unitary ensemble (CUE). Their results, however, depend on a combinatorial conjecture, which is still unproven. The purpose of this letter is to present an alternative proof for their main result; our proof is unconditional and is significantly simpler. Our proof is based on the observation that the diagonal approximation can be expressed as a trace of certain doubly stochastic matrix. The asymptotic properties of the diagonal approximation can then be related to the spectral gap of the doubly stochastic matrix, which was previously studied by the author in [8].

The structure of the letter is as follows. In section 2, we introduce the circular unitary ensemble and the form factor. We follow by reviewing the main result of [7] and stating our main result in section 3. The relevant properties of doubly stochastic matrices are reviewed in section 4. Here, we also prove our main result which follows quite simply from a technical lemma on the asymptotics of the second largest eigenvalue. This lemma is proved in section 5 and we conclude by comparing our results with those of [7] in section 6.

2. The circular unitary ensemble and its form factor

The circular unitary ensemble is the unitary group U(N) equipped with the Haar measure:

Definition 1. CUE(N) is defined as the ensemble of all unitary N × N matrices endowed with the probability measure that is invariant under every automorphism

Equation (1)

where V and W are any two N × N unitary matrices.

The form factor is a function describing the statistical properties of the spectrum of U. It is defined as the Fourier transform of the two-point spectral correlation function, R2(x),

Equation (2)

where exp(2πiphij) are the eigenvalues of U and the averaging is performed over the CUE(N). The form factor of the unitary ensemble is thus given by

Equation (3)

The well-known formula for KN(t) (see, e.g., [9, 10]) is

Equation (4)

This formula can, in particular, be derived by expanding the trace in (3),

Equation (5)

where the convention it + 1  =  i1 is assumed here and throughout the letter. The above expression is analogous to the trace-formula expansions of the form factor in other systems, with i  =  (i1,  ..., it) and j  =  (j1,  ..., jt) playing the role of the periodic orbits. It can be shown (see equation (8)) that only the terms with j being a permutation of i contribute to the form factor, i.e.

Equation (6)

where St is the group of all t-permutations (the symmetric group). This step is analogous to only considering pairs of orbits with a small action difference. The approximation to (6) that restricts the double sum even further, to pairs of periodic orbits that are equal (up to a cyclic shift), is called the diagonal approximation

Equation (7)

where the factor t comes from the number of possible shifts of an orbit. It is natural to expect the diagonal approximation to give the dominant contribution to KN since it selects only those pairs of orbits i and j that produce a positive product in (6) for all choices of U. In this letter, we compare the form factor KN(t) with its diagonal approximation.

The present letter was inspired by a study by Degli Esposti and Knauf [7]. They proved that, assuming validity of a certain combinatorial conjecture, the diagonal approximation K1,N converges to t/N uniformly in t/N as N → ∞. The convergence is proved in a closed interval which is bounded away from zero. The combinatorial conjecture requires a more thorough introduction and will be delayed to a later section.

Here, we propose an alternative approach which exploits the fact that the sum in K1,N is the trace of a doubly stochastic matrix. Our proof is unconditional, i.e. it does not rely on the conjecture, and the interval of convergence is extended.

3. Averages over the unitary group and the main result

The averages of products of the elements of matrices U have been studied by various authors [11-14]. Their main result is

Equation (8)

where St is the symmetric group of permutations of the set {1,  ..., t}, δnk is the Kronecker delta and the coefficient VN - 1π) depends only on the lengths of cycles in the cycle expansion of σ - 1π.

To introduce the results of [7], we need the following definitions.

Definition 2. Let σ  in St and i  in {1, 2,  ..., t}. Then a cycle of i with respect to σ is the (finite) set \{\sigma^k(i) \colon k\in{\bb N}\} . The cycle length of i is the number of elements in the cycle of i. The rank |σ| of a permutation σ is the number of disjoint cycles induced by σ. By |σ ∨ ρ| we denote the number of disjoint sets of the form \{\rho^n\sigma^k(i) \colon k,n\in{\bb N}\} . By τ we will denote the circular permutation (1, 2,  ..., t). If σ  in St, σ +  will denote τ - 1στ.

Conjecture 1 (Degli Esposti and Knauf [7]). There exists a constant C1 such that for all t\leq N \in {\bb N}

Equation (9)

Theorem 1 (Degli Esposti and Knauf [7]). Assuming conjecture 1 is true, the form factor KN is approximated by the diagonal contribution in the following sense:

For all epsilon > 0

Equation (10)

uniformly in τ  =  t/N  in [epsilon, (1  -  epsilon)e/C1].

Our main result is a version of the above theorem that drops the dependence on the conjecture and extends the interval of convergence.

Theorem 2. For all epsilon > 0

Equation (11)

uniformly in τ  =  t/N  in [epsilon, epsilon - 1]. Thus, uniformly in τ  =  t/N  in [epsilon, 1],

Equation (12)

4. Doubly stochastic matrices and the proof of theorem 2

Definition 3. An entrywise non-negative N × N matrix M is called doubly stochastic if

Equation (13)

The set of all such N × N matrices we denote by DS(N). DS(N) is a monoid with respect to the matrix multiplication, in particular if A and B are doubly stochastic then so is AB.

It follows directly from the definition that the vector e  =  (1/N,  ..., 1/N) is both a left and a right eigenvector of any matrix from DS(N). The corresponding eigenvalue 1 is the largest (by modulus) one. The following observation shows the importance of the asymptotics of the second largest eigenvalue.

If U is unitary then the matrix M defined by

Equation (14)

is doubly stochastic. In fact, M is a so-called unistochastic matrix [15].

Definition 4. An N × N matrix M is called unistochastic if there exists a unitary matrix U satisfying (14). The set of unistochastic matrices is a proper subset of DS(N) when N > 2.

If M is given by (14) then

Equation (15)

and, therefore, the diagonal approximation K1,N is equal to

Equation (16)

where the averaging is performed with respect to the measure induced on unistochastic matrices by the correspondence (14). The averages with respect to this measure were discussed in [8, 16]. In [8], it was speculated that the second largest eigenvalue of such random matrices M decays, on average, as N - 1/2. If this is indeed so, we immediately get

Equation (17)

for t > 2, which would prove theorem 2. On the other hand, [16] conjectured that the difference langleTr MtrangleN  -  1 behaves like N1 - t, which is also consistent with theorem 2. While the conjectures of [8, 16] are still unproven, we can use the technique of [8] to achieve the same result without directly calculating the eigenvalues of M.

Since the eigenvalues of M lie in the unit circle, it is intuitively clear that Tr Mt  =  ∑λti should be a decreasing function in t, at least `on average'. It is therefore tempting to show that langleTr MtrangleN → 1 for some fixed t as N → ∞ and then conclude that this property still holds if we send t to infinity as well. Indeed, this would work if the eigenvalues of M were positive real numbers, which is generically not the case for matrices M. To circumvent this difficulty, we form a matrix

Equation (18)

which is symmetric, doubly stochastic and positive semidefinite (A ≥ 0). Now we can estimate the eigenvalues of M through the eigenvalues of A.

Proposition 1 ([8]). Second largest eigenvalues of M and A satisfy

Equation (19)

Proof. The square roots of the eigenvalues of A defined by (18) are usually called the singular values of the matrix M. One of their fundamental properties is Weyl's majorization inequality [17], which implies

Equation (20)

Since both M and A are stochastic matrices, λ1(M)  =  λ1(A)  =  1. Inequality (19) follows immediately.        square

Now the idea is to find the power k such that langleλ(A)krangleN decay sufficiently fast. It turns out that k  =  3 is enough.

Lemma 1. The Haar probability measure on the unitary group induces a measure on symmetric doubly stochastic matrices A via (14) and (18). With respect to this measure langleλ2(A)3rangleN  =  O(N - 2).

The proof is rather technical and will be given in a later section. This lemma enables us to give a simple proof of our main result.

Proof of theorem 2. We show that langleTr Mtrangle → 1. If τ ≥ epsilon and N is sufficiently large then t  =  τN ≥ 6, leading to

\fl |\langle\Tr{\bf M}^t\rangle - 1| =
\left|\left\langle \sum_{k=2}^N \lambda_k({\bf M})^t\right\rangle\right| \leq \sum_{k=2}^N
|\langle\lambda_k({\bf M})^t\rangle|
\leq \sum_{k=2}^N \langle |\lambda_k({\bf M})|^t \rangle\\
\fl \hphantom{|\langle\Tr{\bf M}^t\rangle - 1|}\leq (N-1) \langle |\lambda_2({\bf M})|^t \rangle
\leq (N-1) \langle |\lambda_2({\bf M})|^6 \rangle\\
\fl \hphantom{|\langle\Tr{\bf M}^t\rangle - 1|}\leq (N-1) \langle |\lambda_2({\bf A})|^3 \rangle =
O(N^{-1}).

Now we use equation (16) to obtain the sought after conclusion.        square

5. Proof of lemma 1

We start with the trace of A3,

Equation (21)

since all eigenvalues of A are non-negative real numbers. Thus, we can estimate 0 ≤ λ2(A)3 ≤ Tr A3  -  1. Writing out the trace we get

\Tr{\bf A}^3 = \Tr({\bf M}^T{\bf M}{\bf M}^T{\bf M}{\bf M}^T{\bf M}) \\
\hphantom{\Tr{\bf A}^3}= \sum_{i,j,k,l,m,n=1}^N ({\bf M}^T)_{i,j}({\bf M})_{j,k}({\bf M}^T)_{k,l}({\bf M})_{l,m}({\bf M}^T)_{m,n}({\bf M})_{n,i} \\
\hphantom{\Tr{\bf A}^3}= \sum_{i,j,k,l,m,n=1}^N
|U_{j,i}U_{j,k}U_{l,k}U_{l,m}U_{n,m}U_{n,i}|^2.

Now we average over the unitary group,

Equation (22)

At this point, we can apply equation (8) with αk  =  ak and βk  =  bk with 1 ≤ k ≤ 6. The indices are a  =  (j, j, l, l, n, n) and b  =  (i, k, k, m, m, i). We denote

Equation (23)

To cut down the number of the coefficients W that need to be calculated, we list some of the properties arising from the symmetries of (8).

(i)  

Application of a permutation to both a and b leaves W invariant. Due to the special form of a and b, the only eligible permutations are shifts:

Equation (24)

(ii)  

We can switch around a and b. After that we should apply a right shift to bring them back to the special form: a '  =  (i, i, k, k, m, m) and b '  =  (n, j, j, l, l, n). Thus,

Equation (25)

Note that the lower indices on the right-hand side are shifted from their original order.

(iii)  

The coefficients W do not depend on the individual values of j, l and n, but only on whether they are pairwise distinct. That is, if

Equation (26)

then

Equation (27)

The same applies to i, k and m.

Using these properties we can list all possible values of W. If {j, l, n} are pairwise distinct, Wj,l,ni,k,m  =  W1,2,3i,k,m and there are N(N  -  1)(N  -  2) ways to select such j, l, n. If l  =  n but j is different, Wj,l,ni,k,m  =  W1,2,2i,k,m and there are N(N  -  1) way to choose j and n; the cases j  =  ln and j  =  nl are analogous. Finally, when j  =  l  =  n, Wj,l,ni,k,m  =  W1,1,1i,k,m and there are N ways to choose the value of j. The dependence on indices {i, k, m} can be treated in the same way. Table 1 lists all contributing terms together with the number of times they occur in expansion (22). The values of W were computed using a program which utilizes formula (8) and the data for VN from [11].

Table 1. Factors contributing to langleTr A3rangle. We use notation D  =  (N  +  5)(N  +  4)(N  +  3)(N  +  2)(N  +  1)N. The remaining terms can be obtained using Wj,l,ni,k,m  =  Wi,k,mn,j,l. Whenever the first column lists three terms, the last column gives the number of occurrences of each of those terms.
Contributing terms Notation Value Number of occurrences
W1,2,31,2,3 w33 \frac{N^5+6N^4+5N^3-20N^2-4N+32}{D(N-2)(N^2-1)(N-1)N} N2(N  -  1)2(N  -  2)2
W1,2,21,2,3, W2,1,21,2,3, W2,2,11,2,3 w23 \frac{2(N^2+5N+8)}{D(N-1)N} N2(N  -  1)2(N  -  2)
W1,2,21,2,2, W2,1,22,1,2, W2,2,12,2,1 w22a \frac{6(N^2+3N+6)}{D(N-1)N} N2(N  -  1)2
W1,2,22,1,2, W2,1,22,2,1, W2,2,11,2,2 w22b \frac{6(N^2+3N+6)}{D(N-1)N} N2(N  -  1)2
W1,2,22,2,1, W2,1,21,2,2, W2,2,12,1,2 w22c \frac{8(N+2)(N+1)}{D(N-1)N} N2(N  -  1)2
W1,1,11,2,3 w13 \frac{8}{D} N2(N  -  1)(N  -  2)
W1,1,11,2,2, W1,1,12,1,2, W1,1,12,2,1 w12 \frac{48}{D} N2(N  -  1)
W1,1,11,1,1 w11 \frac{720}{D} N2

Thus, we get

\fl\langle\Tr{\bf A}^3\rangle_N =
N^2(N-1)^2(N-2)^2 w_{33} + 2\times3N^2(N-1)^2(N-2) w_{23}\\ +\,
3N^2(N-1)^2 w_{22a} + 3N^2(N-1)^2 w_{22b} + 3 N^2(N-1)^2 w_{22c}\\
+\, 2\times N^2(N-1)(N-2) w_{13} + 2\times 3N^2(N-1) w_{12} + N^2
w_{11},

where the factors of 2 arise due to Wj,l,ni,k,m  =  Wi,k,mn,j,l. The result is

Equation (28)

Equation (29)

which, together with inequality λ2(A)3 ≤ Tr A3  -  1, finishes the proof.

6. Discussion

We presented an alternative and complete proof of the main result of [7]. However, the main importance of the results of [7] lies in the method of their derivation and in establishing a foothold in understanding the combinatorics of the form factor expansion. Below we formulate two conjectures which can be studied using the methods of [7].

The first conjecture is rephrasing an observation made in [8] regarding the asymptotics of the eigenvalues of unistochastic matrices M.

Conjecture 2. For any j ≥ 2 and k langlej(M)|krangle  =  O(N - k/2), where the averaging is performed with respect to the measure induced by (14).

The second conjecture is a strengthening of our result which would follow if conjecture 2 were proved. It concerns the speed of convergence of the diagonal approximation and was first mentioned in [16].

Conjecture 3. For any τ  in (0, ∞)

Equation (30)

faster than any power of N.

Regarding possible extensions of our results, we expect our method to be directly applicable to other random matrix ensembles, such as circular orthogonal ensemble (COE). This is an ensemble of random unitary symmetric matrices S, with the measure induced by the Haar measure on U(N) via the factorization

Equation (31)

We expect the matrix M (now symmetric) obtained from such S to also satisfy (30). The only significant difference from CUE is that the diagonal approximation in COE no longer recovers the full form factor (up to t  =  N) but just a leading (for small t/N) contribution to it. The proof can either proceed via factorization (31) or, more simply, by using the averages with respect to COE computed in [14].

Acknowledgments

The author is grateful to B Winn and a referee for many suggested improvements to the letter.

References
[1] 
Cohen D, Primack H and Smilansky U 1998 Ann. Phys. 264 108-70 (Preprint chao-dyn/9708017) 
CrossRefPreprint
[2] 
Berry M V 1985 Proc. R. Soc. A 400 229-51 
CrossRef
[3] 
Sieber M 2002 J. Phys. A: Math. Gen. 35 L613-9 
IOPscience
[4] 
Sieber M and Richter K 2001 Phys. Scr. T 90 128 
IOPscience
[5] 
Muller S, Heusler S, Braun P, Haake F and Altland A 2005 Phys. Rev. E 72 046207 (Preprint nlin.CD/050352) 
CrossRefPreprint
[6] 
Berkolaiko G, Schanz H and Whitney R S 2003 J. Phys. A: Math. Gen. 36 8373-92 (Preprint nlin.CD/0205014) 
IOPsciencePreprint
[7] 
Degli Esposti M and Knauf A 2004 J. Math. Phys. 45 4956-79 (Preprint cond-mat/0311074) 
CrossRefPreprint
[8] 
Berkolaiko G 2001 J. Phys. A: Math. Gen. 34 L319-26 (Preprint nlin.CD/0104009) 
IOPsciencePreprint
[9] 
Mehta M L 2004 Random Matrices 3rd edn (Amsterdam: Elsevier/Academic)  
[10] 
Haake F 2001 Quantum Signatures of Chaos 2nd edn (Berlin: Springer)  
[11] 
Samuel S 1980 J. Math. Phys. 21 2695-703 
CrossRef
[12] 
Ullah N 1964 Nucl. Phys. 58 65-71 
CrossRef
[13] 
Mello P A 1990 J. Phys. A: Math. Gen. 23 4061-80 
IOPscience
[14] 
Brouwer P W and Beenakker C W J 1996 J. Math. Phys. 37 4904-34 (Preprint cond-mat/9604059) 
CrossRefPreprint
[15] 
Marshall A and Olkin I 1979 Inequalities: Theory of Majorization and Its Applications (San Diego: Academic)  
[16] 
Życzkowski K, Kuś M, Słomczyński W and Sommers H J 2003 J. Phys. A: Math. Gen. 36 3425-50 (Preprint nlin.CD/0112036) 
IOPsciencePreprint
[17] 
Weyl H 1949 Proc. Natl Acad. Sci. USA 35 408-11 
CrossRefPubMed




Please login to access our web services, or create an account if you don't yet have one.

You must have cookies enabled in your web browser to be able to login.

Username
Password

Forgotten your password? Get a new one here.