Pairs of commuting integer matrices
Abstract.
We prove upper and lower bounds on the number of pairs of commuting matrices with integer entries in , as . Our work uses Fourier analysis and leads to an analysis of exponential sums involving matrices over finite fields. These are bounded by combining a stratification result of Fouvry and Katz with a new result about the flatness of the commutator Lie bracket.
2010 Mathematics Subject Classification:
11D45 (11T23, 15B33, 15B36)Contents
1. Introduction
Let denote the scheme of matrices, for an integer . Let denote the closed subscheme . The aim of this note is to count integral points of height , as , on the commuting variety
viewed as an affine variety in . Thus we are interested in the behaviour of the counting function
as , where denotes the maximum modulus of any of the entries in a matrix . It was shown by Motzkin and Taussky in the paragraph after [12, Thm. 6] that is irreducible and it follows from work of Basili [1, Thm. 1.2] that . Furthermore, cannot be linear, since it contains and , for any , but does not always contain the point . In fact, a recursive formula for the degree of has been provided by Knutson and Zinn-Justin [10].
Since is defined by a system of homogeneous quadratic equations with coefficients in , we can view as an affine cone over a non-linear irreducible variety in of dimension , which is definable over . Appealing to the resolution of the dimension growth conjecture by Salberger [14], it therefore follows that
(1.1) |
for any , where the implied constant depends at most on and . Moreover, the can be removed if the variety has degree by [3], which is likely to be the case for by the work in [10]. On the other hand, we have the lower bound
(1.2) |
coming from the special subvariety of scalar matrices . The following is our main result.
Theorem 1.1.
Let . Then
Note that Theorem 1.1 improves on (1.1) for . In fact our exponent is always within of that occurring in the lower bound (1.2), which we expect to be the truth. Our approach uses matrix identities and ideas from harmonic analysis, which will lead us to analyse the fibres of the commutator map
(1.3) |
where we recall that denotes the closed subscheme . In this way we have been led to prove the following result, the analogue of which was proved by Larsen and Lu [11] in the multiplicative setting.
Theorem 1.2.
The map
is flat over the open set .
Theorem 1.2 is equivalent to a pointwise bound on the dimensions of the fibres of the commutator map over . For comparison, earlier work has examined the dimensions of the fibres on average over certain small families of matrices in , such as diagonal matrices (Knutson [9]) or rank-one matrices (Neubauer [13]). Moreover, Young [19] proved a finer conjecture of Knutson [9] concerning the irreducible components of the diagonal commutator scheme. Our new methods may be capable of re-proving, and generalising to other families, some of these earlier results.
Acknowledgements
The authors are very grateful to Alina Ostafe, Matthew Satriano and Igor Shparlinski for drawing their attention to this problem and for useful comments, and to Michael Larsen and Peter Sarnak for their helpful correspondence. While working on this paper the first author was supported by a FWF grant (DOI 10.55776/P36278), the second author by a Sloan Research Fellowship, and the third author by the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Grant Agreement No. 101034413.
2. Matrix exponential sums
We begin by recording the following consequence of Theorem 1.2.
Theorem 2.1.
For all primes , and for all points , we have
Proof assuming Theorem 1.2.
In § 4, we will actually directly prove Theorem 2.1 by induction on for , by symmetry and point counting, before deriving Theorem 1.2 as a consequence. However, a direct proof of Theorem 1.2 by induction on is also possible using algebraic geometry, Grassmannians, symmetry, and dimension counting.
For the purposes of harmonic analysis to come, we observe that over any field , the pairing on is perfect. The usual dot product on is given by , but it will be more convenient below to work with , which can be viewed as the usual dot product after an invertible linear change of variables .
Given , our work will lead us to analyse the complete exponential sum
(2.1) |
for a sufficiently large prime . It follows from the point count of Feit–Fine [4] that
(2.2) |
We seek to find conditions on under which we can show that cancellation occurs in .
We begin by recording the observation that
(2.3) |
Then, by the cyclic property of the trace, we can average over to conclude that
In particular, it is clear that
(2.4) |
for any . We are now ready to prove the following result.
Lemma 2.2.
If or , then
3. Fourier analysis and proof of the main result
In this section we prove Theorem 1.1 via harmonic analysis, by combining work of Fouvry and Katz [5] with our discussion in the previous section on exponential sums over finite fields. We shall assume throughout this section that .
Let be an auxiliary prime in the interval whose size will be optimised later. Then
Consider the function
for any . Identifying with , this has Fourier transform
(3.1) |
where . Moreover it is clear that if . It now follows that
An application of Poisson summation reveals that
on making an obvious change of variables. We have a similar identity for the sum over , all of which leads to the expression
in the notation of (2.1).
Appealing to (3.1), we finally deduce that
If then only contribute to the sum. Thus it follows from (2.2) that
(3.2) |
where
We now appeal to a stratification result by Fouvry and Katz [5, Thm. 1.1] to estimate . This produces subschemes , with for , such that
whenever the reduction modulo of corresponds to vector in , under the isomorphism . It is convenient to put and . Combining this with Lemma 2.2, we therefore deduce that
where denotes the set of with for which the reduction of is non-zero and belongs to the set . Since , it now follows from [2, Lemma 4] that
But then we may conclude that
Since we see that the first term is maximised at the largest value of , while the second term is maximised when is least. This leads to the expression
since .
Returning to (3.2), we have now established that
This is optimised at , which leads to the final bound for recorded in Theorem 1.1.
Remark 3.1.
The idea of choosing an auxiliary modulus and using harmonic analysis has been applied to other varieties in the past, such as by Fujiwara [6, 7], and by Shparlinski and Skorobogatov [15, 16]. Moreover, using finer geometric information, Heath-Brown [8] has given stronger results in many cases by working with a composite modulus . Once combined with Fouvry–Katz stratification [5], the simplest version of Fujiwara’s method gives the following general result.
Let be an affine variety of dimension , cut out by polynomials with integer coefficients. Assume that for all primes and for all non-zero vectors , we have
(3.3) |
for some such that . Then for , we have
(3.4) |
This would improve on the dimension growth bound of Salberger [14] and Vermeulen [18] (when it applies), provided only that and in (3.3). (Note that (3.4) recovers the bound in Theorem 1.1 when , and .)
4. Flatness of the commutator map
In this section we shall prove Theorem 1.2, which we first reduce to Theorem 2.1. For this, we broadly follow [11]*§ 5.
Proof of Theorem 1.2 assuming Theorem 2.1.
Let be the open subscheme of with zero section removed, over . Let be the commuting scheme , over . Let be the commutator map in (1.3), restricted to . Let be the flat locus of over . Then is open by [17, Tag 0399]. Thus is closed. By miracle flatness, Theorem 2.1 implies that for all , the map is flat, whence . By the Lang–Weil estimate, the locally closed variety must be empty. We conclude that is flat, as desired. ∎
The rest of this section is devoted to the proof of Theorem 2.1. Let be any prime. (Eventually, we will assume .) Fix a non-zero . Note that
where
We shall bound via point counting on . We take , with sufficiently large so that and the variety has an irreducible component that is invariant under . It will suffice to estimate the quantity
since then
(4.1) |
by the Lang–Weil estimate for the variety .
Let on . We proceed by noting that
where in the last step we average over using the cyclic property of the trace function. We now exploit the linearity of the equations and in the coordinates of . Since the hyperplane has codimension either or in the linear space , we find that
where . Then, since depends only on the conjugacy class of , we find on averaging over conjugates of that
(4.2) |
where
(4.3) |
The orbit of under the conjugation action of has cardinality
by the orbit-stabiliser theorem and the Lang–Weil estimate, since and . Therefore, if denotes a complete set of representatives for the conjugation action, then breaking (4.2) into orbits gives
(4.4) |
Lemma 4.4 will non-trivially bound . The proof inducts on and uses an auxiliary averaging result, Lemma 4.3. First, we define some useful projection maps. Given integers and a matrix , define block matrices so that
where has rows and columns.
Lemma 4.3 concerns some averages over . First we study averages over a certain subgroup. Let
Then is a vector space such that for all . In particular,
is an abelian subgroup of , where for all .
Lemma 4.1.
Let , not necessarily with trace zero. Suppose . Let . Then the following hold:
-
(1)
We have
-
(2)
We have
-
(3)
We have
Proof.
First we record a general calculation: for we have
(4.5) |
(1): Both sides of (1) are invariant under conjugation of by any element of . Moreover, if is never zero, then (1) is trivial. Thus, after conjugating if necessary, we may assume that . Then, writing , we get
(4.6) | ||||
(4.7) |
by (LABEL:NMN). In particular, by (4.6),
Given a matrix , the condition holds if and only if each row of lies in the left kernel of the matrix . If , then is orthogonal to a non-zero column of . Hence, one of the coordinates of is a linear function of the others. Thus, one of the columns of is uniquely determined by others. Since each column of has entries, it follows that
On the other hand, if , then
(2): As in the proof of (1), we may assume that . Then
(4.8) |
for , by (LABEL:NMN). The map
(4.9) |
is linear in the entries of . By (4.6) and (4.8), we have
If , then the rows of lie in the left kernel of , and the columns of lie in the right kernel of . If and , then it follows that there exist a column and row of such that the entries of are uniquely determined by the entries in . Since , it follows that
(4.10) |
if . On the other hand, if , then
and for all , by (4.6), (4.7), and (4.8). Hence (2) follows.
Henceforth, assume , so that the following simple combinatorial lemma holds.
Lemma 4.2.
Let . Suppose is non-zero. Then for some -element subset .
Proof.
If , say, then either or must be non-zero. By symmetry, it remains to consider the case . Then , since . Thus , because . ∎
Lemma 4.3.
Let , not necessarily with trace zero. Suppose . Let . Then the following hold:
-
(1)
.
-
(2)
.
-
(3)
.
-
(4)
.
-
(5)
.
-
(6)
.
-
(7)
.
Proof.
Although (2) is stronger than (1), and (5) is stronger than both (3) and (4), it turns out that proving (1) first will help in proving (2), and likewise for (3) and (4).
(1): Let be the usual coordinate basis vectors. The condition holds if and only if for all . Thus
Note that is equally likely to be any list of linearly independent vectors. Since the right kernel of has dimension , and the number of lists of linearly independent vectors in is , it follows that
(4): Immediate from (3) with in place of .
(6): This follows from (2) after replacing with , and conjugating by a suitable permutation matrix.
(7): Assume . By the case of (2), we may conjugate to assume that its top left entry is non-zero. Then by Lemma 4.2, there exist diagonal entries of whose total is non-zero. By a further conjugation, we may assume . Therefore, the subvariety of is either empty or of codimension one, since is irreducible. The Lang–Weil bound now implies (7). ∎
We are finally ready to bound the quantity from (4.3).
Lemma 4.4.
Let , not necessarily with trace zero. Let be the radical of the characteristic polynomial of . Then
Proof.
In this proof, we relinquish the previous definition of from the introduction, in order to allow for some convenient notation. We use strong induction on . Since for all , we may assume , or else . Moreover, if or , then the trivial bound suffices. Therefore, we may assume from now on that , , and .
By the Lang–Weil estimate applied to the constructible set
we may assume that such that splits completely in . By conjugation, we may assume that is in rational canonical form, which coincides with Jordan normal form because splits completely. Since , the matrix has at least distinct eigenvalues. After permuting Jordan blocks if necessary, we may assume that
where and , with , such that and share no eigenvalues. Then, in particular,
(4.13) |
Let and . Let
(4.14) |
Since , and by the conjugation-invariance of , we have
(4.15) |
where the last step uses (4.13), (4.14), and the block matrix identity
for . By the inductive hypothesis applied to the two factors of in (4.15), we get
where
We now wish to bound the probabilities . Since
the probabilities for simplify as follows:
By parts (7), (6), (2), and (5) of Lemma 4.3, respectively, we have , , , and . Thus
since . ∎
Plugging Lemma 4.4 into (4.4), we get
But is in bijection with the set of rational canonical forms on , where we recall that the rational canonical form of consists of a partition for each monic irreducible polynomial , such that the action of on is isomorphic to multiplication by on the vector space , where . Thus, by the prime number theorem in and the fact that has only finitely many partitions, we deduce that
for . Summing over , we get
References
- [1] R. Basili, On the irreducibility of varieties of commuting matrices. Pure and Applied Algebra 149 (2000), 107–120.
- [2] T. D. Browning and D. R. Heath-Brown, Rational points on quartic hypersurfaces. J. reine angew. Math. 629 (2009), 37–88.
- [3] W. Castryck, R. Cluckers, P. Dittmann and K. H. Nguyen, The dimension growth conjecture, polynomial in the degree and without logarithmic factors. Algebra & Number Theory 14 (2020), 2261–2294.
- [4] W. Feit and N. J. Fine, Pairs of commuting matrices over a finite field. Duke Math. J. 27 (1960), 91–94.
- [5] E. Fouvry and N. Katz, A general stratification theorem for exponential sums, and applications. J. reine angew. Math. 540 (2001), 115–166.
- [6] M. Fujiwara, Upper bounds for the number of lattice points on hypersurfaces. Number theory and combinatorics (Japan, 1984), 89–96, World Scientific, 1985.
- [7] M. Fujiwara, Distribution of rational points on varieties over finite fields. Mathematika 35 (1988), 155–171.
- [8] D. R. Heath-Brown, The density of rational points on non-singular hypersurfaces. Proc. Indian Acad. Sci. (Math. Sci.) 104 (1994), 13–29.
- [9] A. Knutson, Some schemes related to the commuting variety. J. Algebraic Geom. 14 (2005), 283–294.
- [10] A. Knutson and P. Zinn-Justin, A scheme related to the Brauer loop model. Advances in Math. 214 (2007), 40–77.
- [11] M. Larsen and Z. Lu, Flatness of the commutator map over . IMRN (2021), 5605–5622.
- [12] T. S. Motzkin and O. Taussky, Pairs of matrices with property , II. Trans. Amer. Math. Soc. 80 (1955), 387–401.
- [13] M. G. Neubauer, The variety of pairs of matrices with . Proc. Amer. Math. Soc. 105 (1989), 787–792.
- [14] P. Salberger, Counting rational points on projective varieties. Proc. Lond. Math. Soc. 126 (2023), 1092–1133.
- [15] I. E. Shparlinski and A. N. Skorobogatov, Exponential sums and rational points on complete intersections. Mathematika 37 (1990), 201–208.
- [16] A. N. Skorobogatov, Exponential sums, the geometry of hyperplane sections, and some Diophantine problems. Israel J. Math. 80 (1992), 359–379.
- [17] T. Stacks project authors, The Stacks project. https://stacks.math.columbia.edu, 2024.
- [18] F. Vermeulen, Dimension growth for affine varieties. IMRN (2024), 11464–11483.
- [19] H.-W. V. Young, On matrix pairs with diagonal commutators. J. Algebra 570 (2021), 437–451.