This post is the new research thread for the Polymath7 project to solve the hot spots conjecture for acute-angled triangles, superseding the previous thread; this project had experienced a period of low activity for many months, but has recently picked up again, due both to renewed discussion of the numerical approach to the problem, and also some theoretical advances due to Miyamoto and Siudeja.
On the numerical side, we have decided to focus first on the problem of obtaining validated upper and lower bounds for the second Neumann eigenvalue $\lambda_2$ of a triangle $\Omega = ABC$. Good upper bounds are relatively easy to obtain, simply by computing the Rayleigh quotient of numerically obtained approximate eigenfunctions, but lower bounds are trickier. This paper of Liu and Oishi has some promising approaches.
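To make the "easy" upper bound concrete, here is a minimal Rayleigh-Ritz sketch. It is not the project's code: the function names, the monomial trial space and the normalisation of one vertex to the origin are all choices made for this illustration. It works in plain floating point, so the result is an upper bound only up to rounding error, not a validated one.

```python
import numpy as np
from math import factorial
from scipy.linalg import eigh


def ref_monomial_integral(a, b):
    """Exact integral of s^a * t^b over the reference triangle s, t >= 0, s + t <= 1."""
    if a < 0 or b < 0:
        return 0.0  # only reached when the accompanying coefficient is zero
    return factorial(a) * factorial(b) / factorial(a + b + 2)


def neumann_ritz_values(B, C, degree=4):
    """Ritz values of the Neumann Laplacian on the triangle (0,0), B, C.

    Trial space (an assumption of this sketch): all polynomials of total
    degree <= `degree`, written as monomials in reference coordinates (s, t).
    """
    J = np.array([[B[0], C[0]], [B[1], C[1]]], dtype=float)  # (x, y) = J @ (s, t)
    det_j = abs(np.linalg.det(J))
    K = np.linalg.inv(J.T @ J)  # turns (s, t)-gradients into |grad_x u|^2
    exps = [(a, d - a) for d in range(degree + 1) for a in range(d + 1)]
    n = len(exps)
    mass = np.zeros((n, n))
    stiff = np.zeros((n, n))
    for i, (a, b) in enumerate(exps):
        for j, (c, d) in enumerate(exps):
            mass[i, j] = det_j * ref_monomial_integral(a + c, b + d)
            ss = a * c * ref_monomial_integral(a + c - 2, b + d)
            st = (a * d + b * c) * ref_monomial_integral(a + c - 1, b + d - 1)
            tt = b * d * ref_monomial_integral(a + c, b + d - 2)
            stiff[i, j] = det_j * (K[0, 0] * ss + K[0, 1] * st + K[1, 1] * tt)
    # Generalised symmetric eigenproblem stiff @ c = mu * mass @ c, ascending order.
    return eigh(stiff, mass, eigvals_only=True)


# Right isosceles triangle with legs 1, where lambda_2 = pi^2 is known exactly.
vals = neumann_ritz_values((1.0, 0.0), (0.0, 1.0))
print(vals[1], np.pi**2)
```

Since polynomials lie in $H^1$ and the Neumann condition is natural, the min-max principle makes the second Ritz value an upper bound for $\lambda_2$. The monomial basis becomes badly conditioned as the degree grows, so a serious computation would want an orthogonal basis and interval arithmetic.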
After we get good bounds on the eigenvalue, the next step is to get good control on the eigenfunction; some approaches are summarised in this note of Lior Silberman, mainly based on gluing together exact solutions to the eigenfunction equation in various sectors or disks. Some recent papers of Kwasnicki-Kulczycki, Melenk-Babuska, and Driscoll employ similar methods and may be worth studying further. However, in view of the theoretical advances, the precise control on the eigenfunction that we need may be different from what we had previously been contemplating.
These two papers of Miyamoto introduced a promising new method to theoretically control the behaviour of the second Neumann eigenfunction $u_2$, by taking linear combinations of that eigenfunction with other, more explicit, solutions to the eigenfunction equation $\Delta u = -\lambda_2 u$, restricting that combination to nodal domains, and then computing the Dirichlet energy on each domain. Among other things, these methods can be used to exclude critical points occurring anywhere in the interior or on the edges of the triangle except for those points that are close to one of the vertices; and in this recent preprint of Siudeja, two further partial results on the hot spots conjecture are obtained by a variant of the method:
- The hot spots conjecture is established unconditionally for any acute-angled triangle which has one angle less than or equal to $\pi/6$ (actually a slightly larger region than this is obtained). In particular, the case of very narrow triangles has been resolved (the dark green region in the area below).
- The hot spots conjecture is also established for any acute-angled triangle with the property that the second eigenfunction has no critical points on two of the three edges (excluding vertices).
So if we can develop more techniques to rule out critical points occurring on edges (i.e. to keep eigenfunctions monotone on the edges on which they change sign), we may be able to establish the hot spots conjecture for a further range of triangles. In particular, some hybrid of the Miyamoto method and the numerical techniques we are beginning to discuss may be a promising approach to fully resolve the conjecture. (For instance, the Miyamoto method relies on upper bounds on $\lambda_2$, and these can be obtained numerically.)
The arguments of Miyamoto also allow one to rule out critical points occurring for most of the interior points of a given triangle; it is only the points that are very close to one of the three vertices which we cannot yet rule out by Miyamoto’s methods. (But perhaps they can be ruled out by the numerical methods we are also developing, thus giving a hybrid solution to the conjecture.)
Below the fold I’ll describe some of the theoretical tools used in the above arguments.
Let $\Omega$ be an acute-angled triangle that is not equilateral, and let $\lambda_2 > 0$ be the second Neumann eigenvalue; as discussed in previous posts, we know that this eigenvalue is simple. The method of Miyamoto allows one to control the structure of the second eigenfunction $u_2$ through an analysis of the quadratic form
$$Q(u) := \int_\Omega |\nabla u|^2 - \lambda_2 |u|^2$$
for $u \in H^1(\Omega)$ (we restrict attention here to real-valued functions). From the spectral theorem, we know that this quadratic form is non-negative when $u$ has mean zero, with equality if and only if $u$ is a multiple of $u_2$. This leads to the following consequence:
Lemma 1 Let $u_1, \dots, u_k \in H^1(\Omega)$ have disjoint supports. Then $Q(u_i)$ is non-negative for all but at most one of the $i$. If $k \geq 3$ and none of the $u_i$ vanish identically, we may upgrade “non-negative” in the previous assertion to “strictly positive”.
Proof: Suppose for contradiction that $Q(u_i)$ and $Q(u_j)$ are negative for some distinct $i, j$. If we take $u$ to be a non-trivial linear combination of $u_i, u_j$ which has mean zero, then we see from the disjoint supports of $u_i, u_j$ that $Q(u)$ is also negative, contradicting the non-negativity of $Q$ on mean-zero functions.
Now suppose that $Q(u_i)$ and $Q(u_j)$ are merely non-positive instead of negative. Then the above argument shows that there is a non-trivial linear combination of $u_i, u_j$ that is a non-zero multiple of $u_2$. On the other hand, if $k \geq 3$ and none of the $u_i$ vanish identically, then this linear combination of $u_i, u_j$ will be zero on a set of positive measure, which is impossible for a non-zero multiple of the eigenfunction $u_2$ (which is real analytic).
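For readers who want the two facts used in this proof spelled out, here is a short sketch. The notation is introduced only for this illustration: $0 = \lambda_1 < \lambda_2 < \lambda_3 \leq \dots$ are the Neumann eigenvalues, $\phi_1, \phi_2, \dots$ is an $L^2$-orthonormal eigenbasis with $\phi_1$ constant and $\phi_2$ a multiple of the second eigenfunction, and $Q(u) = \int_\Omega |\nabla u|^2 - \lambda_2 |u|^2$.

```latex
% Spectral expansion: u = \sum_k c_k \phi_k gives
Q(u) = \sum_{k \ge 1} (\lambda_k - \lambda_2)\, c_k^2
     = -\lambda_2\, c_1^2 + \sum_{k \ge 3} (\lambda_k - \lambda_2)\, c_k^2 .
% Mean zero means c_1 = 0, so Q(u) \ge 0, with equality iff c_k = 0 for all k \ge 3
% (this is where simplicity of \lambda_2 is used).

% Disjoint supports: the cross terms in Q vanish, so for real a, b
Q(a u_i + b u_j) = a^2\, Q(u_i) + b^2\, Q(u_j) .
```

The second identity is why a mean-zero combination of two functions with negative $Q$ again has negative $Q$.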
We have a further non-negativity property of :
Lemma 2 Let $u \in H^1(\Omega)$ vanish on two of the three sides of $\Omega$. Then $Q(u) \geq 0$, with equality occurring if and only if $u$ solves the eigenfunction equation $\Delta u = -\lambda_2 u$ and obeys Neumann conditions on the remaining side of $\Omega$.
Proof: Write $\Omega = ABC$. If $u$ vanishes on $AB$ and $BC$ with $Q(u) < 0$, we reflect $u$ across $AC$ and obtain a function $\tilde u$ on the kite $ABCB'$ formed by reflecting $\Omega$ across $AC$, with
$$\int_{ABCB'} |\nabla \tilde u|^2 < \lambda_2 \int_{ABCB'} |\tilde u|^2$$
and so the first Dirichlet eigenvalue of $ABCB'$ is less than $\lambda_2$. But by a result of Friedlander, the first Dirichlet eigenvalue of the convex planar domain $ABCB'$ is at least as large as the third Neumann eigenvalue of that domain. Hence, the symmetric reflection of $u_2$ across $AC$ cannot be the second or third Neumann eigenfunction for $ABCB'$, and so these functions must both be anti-symmetric instead of symmetric across $AC$. But at least one of these anti-symmetric eigenfunctions must change sign on $\Omega$ (as they are orthogonal to each other), and will then have at least four nodal domains, contradicting the Courant nodal line theorem. The second claim of the lemma follows by similar arguments and is omitted.
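This lemma turns out to be particularly useful when applied to the nodal components of a solution to the eigenfunction equation $\Delta u = -\lambda_2 u$:
Corollary 3 Let $u$ be a solution to the eigenfunction equation $\Delta u = -\lambda_2 u$, not necessarily obeying the Neumann boundary condition. Let $\Omega_1, \dots, \Omega_k$ be the nodal domains of $u$ (i.e. the connected components of $\{u \neq 0\}$ in $\Omega$). Then
$$\int_{\partial \Omega_i \cap \partial \Omega} u\, \partial_n u \geq 0 \qquad (1)$$
for all but at most one $i$, where $\partial_n$ is the derivative in the outward normal direction. If $k \geq 3$, then we can make the inequality (1) strict. Finally, (1) holds (with strict inequality) whenever $\partial \Omega_i \cap \partial \Omega$ is contained in one of the three sides of $\Omega$.
Proof: We apply the previous lemmas with $u_i := u 1_{\Omega_i}$, and observe from integration by parts that
$$Q(u_i) = \int_{\partial \Omega_i \cap \partial \Omega} u\, \partial_n u.$$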
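Here is the integration by parts written out, as a sketch that assumes enough regularity of the nodal domain to apply Green's identity. The notation is fixed here for self-containedness: $\Omega_i$ is a nodal domain of a solution $u$ of $\Delta u = -\lambda_2 u$, $\partial_n$ is the outward normal derivative, and $Q(u) = \int_\Omega |\nabla u|^2 - \lambda_2 |u|^2$.

```latex
\begin{aligned}
Q(u 1_{\Omega_i})
 &= \int_{\Omega_i} |\nabla u|^2 - \lambda_2 \int_{\Omega_i} u^2 \\
 &= \int_{\partial \Omega_i} u\, \partial_n u
    - \int_{\Omega_i} u\, \Delta u - \lambda_2 \int_{\Omega_i} u^2 \\
 &= \int_{\partial \Omega_i \cap \partial \Omega} u\, \partial_n u .
\end{aligned}
```

The last step uses $\Delta u = -\lambda_2 u$ to cancel the two interior integrals, and the fact that $u = 0$ on the part of $\partial \Omega_i$ lying inside $\Omega$, so only the portion of the boundary on the triangle itself survives.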
As worked out in previous polymath7 threads, applying this corollary to the Neumann eigenfunction $u_2$ yields that the nodal curve $\{u_2 = 0\}$ is simple and connects two distinct sides of the triangle $\Omega$. However, the new advances of Miyamoto and Siudeja have come from applying this corollary to other solutions to the eigenfunction equation. For instance:
Corollary 4 Let $u$ be a non-trivial solution to the eigenfunction equation $\Delta u = -\lambda_2 u$, not necessarily obeying the Neumann boundary condition. Then the nodal curve $\{u = 0\}$ does not contain any loops.
This leads to a variant of the maximum principle:
Corollary 5 Let $u$ be a solution to the eigenfunction equation $\Delta u = -\lambda_2 u$, not necessarily obeying the Neumann boundary condition. If $u \geq 0$ on $\partial \Omega$, then $u \geq 0$ on $\Omega$.
One particularly nice solution to use in the above corollary is a directional derivative of $u_2$, yielding the following result of Siudeja:
Corollary 6 Suppose that $u_2$ has no critical points on the interior of two of the three sides of the triangle $\Omega$. Then $u_2$ has no critical points in the interior of $\Omega$ either. In particular, the hot spots conjecture is true for this triangle.
Proof: Apply the previous corollary to the derivative of $u_2$ in the direction normal to the third side, to conclude that that derivative does not change sign in the interior of the triangle. But this is incompatible with a critical point in the interior (as can be seen for instance by a Bessel expansion around that point).
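One way to see the last step, sketched here via the mean value property rather than the full Bessel expansion: write $v$ for the directional derivative in question, which solves $\Delta v = -\lambda_2 v$ and vanishes at an interior critical point $p$. The symbols $v$, $p$, $r$ are introduced for this illustration.

```latex
% Mean value property for solutions of \Delta v = -\lambda_2 v in the plane:
\frac{1}{2\pi} \int_0^{2\pi} v\bigl(p + r(\cos\theta, \sin\theta)\bigr)\, d\theta
   = J_0\bigl(\sqrt{\lambda_2}\, r\bigr)\, v(p)
% for every circle of radius r around p contained in the triangle.
% If v(p) = 0 the average over each such circle is zero, so a v of one sign
% would have to vanish on all small circles around p, hence (by analyticity)
% identically.
```

So a single-signed, non-trivial $v$ cannot vanish at an interior point, which is the incompatibility used in the proof.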
Another fruitful solution to use is some linear combination of $u_2$ and another solution $v$ designed to create a degenerate critical point. This gives the following criterion of Miyamoto for excluding critical points at certain locations:
Corollary 7 Let $p$ be an interior point of $\Omega$, and let $v$ be a solution to the eigenfunction equation $\Delta v = -\lambda_2 v$ with $v(p) > 0$, $\nabla v(p) = 0$, and $\partial_n v(x) \leq 0$ for all $x \in \partial \Omega$ (excluding vertices), but $\partial_n v$ is not identically zero on $\partial \Omega$. Then $u_2$ does not have a critical point at $p$.
Proof: Suppose for contradiction that $\nabla u_2(p) = 0$. We first eliminate a degenerate case when $u_2(p) = 0$. In this case the nodal curve of $u_2$ crosses itself at $p$, which by Corollary 4 creates at least four nodal domains, contradicting the Courant nodal line theorem. Thus we may assume without loss of generality that $u_2(p) > 0$. If we subtract a suitable multiple of $v$ from $u_2$ we then get another solution $w$ to the eigenfunction equation with $w(p) = 0$, $\nabla w(p) = 0$, and $\partial_n w \geq 0$ on $\partial \Omega$. Again, from Corollary 4 $w$ has at least four nodal domains, including at least two in which $w$ is negative. But this contradicts Corollary 3.
Miyamoto uses this corollary with $v$ being a radial solution to the eigenfunction equation centered at $p$ to establish the hot spots conjecture for sufficiently round domains, but perhaps one can adapt the method to other solutions $v$ to also cover many cases of critical points inside various triangles.
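As a rough illustration of the radial choice, the sketch below builds $J_0(\sqrt{\lambda}\,|x - p|)$ and its radial derivative, and reports how far from $p$ that derivative keeps a definite (non-positive) sign. The function names and the sample values of $\lambda$ and $p$ are made up for this example. For a convex domain containing $p$, the outward normal derivative on the boundary has the same sign as the radial derivative, so this radius is what limits the method.

```python
import numpy as np
from scipy.special import j0, j1, jn_zeros


def radial_solution(lam, p):
    """Return v(x, y) = J_0(sqrt(lam) * |x - p|) and its radial derivative."""
    k = np.sqrt(lam)
    px, py = p

    def v(x, y):
        return j0(k * np.hypot(x - px, y - py))

    def dv_dr(x, y):
        # d/dr J_0(k r) = -k J_1(k r)
        return -k * j1(k * np.hypot(x - px, y - py))

    return v, dv_dr


def monotone_radius(lam):
    """Largest r such that J_0(sqrt(lam) * r) is non-increasing on [0, r]."""
    first_zero_of_j1 = jn_zeros(1, 1)[0]
    return first_zero_of_j1 / np.sqrt(lam)


# Hypothetical sample values, chosen only to exercise the functions.
v, dv_dr = radial_solution(10.0, (0.3, 0.2))
print(v(0.3, 0.2), monotone_radius(10.0))
```

This is why an upper bound on $\lambda_2$ matters here: the smaller the bound, the larger the disk around $p$ on which the radial solution is monotone.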
Here is a computation building upon Siudeja’s arguments (Corollary 6 in the above post) that may possibly be helpful in strengthening that result.
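Consider an acute triangle ABC, and let $n$ be the inward normal from AB towards C. Then the directional derivative $\partial_n u$ vanishes at AB and at C, as well as at any other critical point of $u$, and solves the eigenfunction equation. Let $F$ be a nodal domain of $\partial_n u$, and consider the quantity
$$\int_F |\nabla \partial_n u|^2 - \lambda_2 |\partial_n u|^2. \qquad (*)$$
This is non-negative for all but at most one of the nodal domains (Lemma 1). On F, this quantity is equal to
$$\int_{\partial F \cap \partial ABC} (\partial_n u)\, \partial_\nu \partial_n u$$
(Corollary 3), where $\partial_\nu$ is the derivative in the outward normal direction on the boundary of the triangle. The set $\partial F \cap \partial ABC$ is the union of various intervals in AB, BC, AC, whose endpoints are boundary critical points of $u$. The portion on AB vanishes because $\partial_n u$ vanishes here. Now consider an interval PQ on AC (with P closer to A than Q) of this set:
$$\int_P^Q (\partial_n u)\, \partial_\nu \partial_n u.$$
We may write $\partial_n = \sin\alpha\, \partial_\tau + \cos\alpha\, \partial_\nu$, where $\partial_\tau$ is the tangential derivative in the direction from A to C, and $\alpha$ is the angle $\angle BAC$. Using the Neumann condition $\partial_\nu u = 0$ (which implies $\partial_\tau \partial_\nu u = 0$), the above integral becomes
$$\sin\alpha \cos\alpha \int_P^Q (\partial_\tau u)\, \partial_{\nu\nu} u.$$
On the other hand, from the eigenfunction equation we have $\partial_{\nu\nu} u = -\lambda_2 u - \partial_{\tau\tau} u$, so this integrates to
$$-\frac{1}{2} \sin\alpha \cos\alpha \left[ \lambda_2 u^2 + (\partial_\tau u)^2 \right]_P^Q.$$
As $P, Q$ are critical points of $u$, $\partial_\tau u$ vanishes there, and so
$$\int_P^Q (\partial_n u)\, \partial_\nu \partial_n u = \frac{\lambda_2}{2} \sin\alpha \cos\alpha \left( u(P)^2 - u(Q)^2 \right).$$
We thus have a relatively simple formula for the quantity (*):
$$\frac{\lambda_2}{2} \sin\alpha \cos\alpha \sum_i \left( u(P_i)^2 - u(Q_i)^2 \right) + \frac{\lambda_2}{2} \sin\beta \cos\beta \sum_j \left( u(P'_j)^2 - u(Q'_j)^2 \right)$$
where $P_i Q_i$ are the intervals of AC on the boundary of F (with $P_i$ closer to A), $P'_j Q'_j$ are the intervals of BC on the boundary of F (with $P'_j$ closer to B), and $\beta$ is the angle $\angle ABC$.
If we consider a nodal domain on which $\partial_n u$ is positive, then we have $u(P_i) \leq u(Q_i)$ and $u(P'_j) \leq u(Q'_j)$. Unfortunately this doesn’t quite give a sign for (*) because $u$ can be both positive and negative. But perhaps we can control how the nodal curve of $u$ interacts with the nodal curve of $\partial_n u$ using this sort of analysis.
Note also that if u changes sign on, say, AC, then it must have an even number of critical points on the interior of AC, since it is a local maximum at A and a local minimum at C or vice versa. So if the hypotheses of Corollary 6 break down, then there are quite a few critical points on the boundary and so there should be quite a few nodal domains for $\partial_n u$, which one can hopefully use to one’s advantage.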
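The two algebraic steps in the edge computation above are easy to get wrong by a sign, so here is a small symbolic sanity check. It is only a sketch. The coordinate frame (A at the origin, AB along the x-axis, C in the upper half-plane) and the name `f` for the restriction of the eigenfunction to the edge AC are assumptions of this illustration.

```python
import sympy as sp

t, lam, alpha = sp.symbols("t lambda alpha", real=True)
f = sp.Function("f")(t)  # eigenfunction restricted to AC, as a function of arclength t

# Step 1: the inward normal n from AB decomposes as sin(alpha)*tau + cos(alpha)*nu.
tau = sp.Matrix([sp.cos(alpha), sp.sin(alpha)])   # unit tangent from A to C
nu = sp.Matrix([-sp.sin(alpha), sp.cos(alpha)])   # outward unit normal on AC
n = sp.Matrix([0, 1])                             # inward normal from AB towards C
frame_residual = (sp.sin(alpha) * tau + sp.cos(alpha) * nu - n).applyfunc(sp.simplify)

# Step 2: with the second normal derivative replaced by -lam*f - f'' (eigenfunction
# equation), the integrand f' * (-lam*f - f'') is an exact derivative.
integrand = f.diff(t) * (-lam * f - f.diff(t, 2))
antiderivative = -(lam * f**2 + f.diff(t) ** 2) / 2
identity_residual = sp.expand(antiderivative.diff(t) - integrand)

print(frame_residual.T, identity_residual)
```

Both residuals vanish identically. At endpoints where the tangential derivative is zero, the antiderivative reduces to $-\lambda f^2/2$, so only the boundary values of the eigenfunction contribute.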
Comment by Terence Tao — August 9, 2013 @ 8:36 pm
Another thought that just occurred to me is that degree theory should be able to relate the number (or at least the parity) of the critical points in the interior with the critical points on the boundary, basically by trying to see how the gradient winds around the origin. If the hot spots conjecture is true, then if one traverses a loop in the triangle ABC that is very close but not quite touching the boundary (so as to avoid all the critical points at vertices and edges), the gradient of the eigenfunction should have a total winding number of zero, because there are no critical points in the interior. This should be equivalent to some assertion about the number of critical points on the boundary at points where the eigenfunction is positive or negative; I’ll try to work this out now (note that we can eliminate the degenerate critical points where the second derivative vanishes or where the solution vanishes).
Comment by Terence Tao — August 9, 2013 @ 11:00 pm
OK, here is what degree theory tells us: assuming that critical points are always non-degenerate in the sense that the Hessian has non-zero determinant, the number of local extrema minus the number of saddle points in the triangle ABC must equal zero, where a critical point on an edge only counts for half, and a critical point at a vertex of angle $\alpha$ only counts for $\alpha/2\pi$. In particular, for a second eigenfunction that is non-zero at the three vertices, all three vertices contribute a net of half of a local extremum, which must therefore be balanced by a net of half of a saddle point. Numerically, this comes from half of a saddle point on the edge connecting the two vertices where the eigenfunction does not change sign, and no other critical points. A local extremum in the interior of the triangle must then be balanced either by a saddle point in the interior, or two half-saddle points on the edges in addition to the half-saddle point that already must necessarily occur on the edge where the eigenfunction does not change sign.
Unfortunately, it’s a bit tricky to figure out which critical points are local extrema and which are saddles; the former occurs when the Hessian determinant $u_{xx} u_{yy} - u_{xy}^2$ is positive and the latter when it is negative, and the eigenfunction equation isn’t of much use in simplifying this constraint. (But on an edge, any positive local minimum or negative local maximum is necessarily a half-saddle point.)
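The bookkeeping in this count is simple enough to mechanise. The sketch below is purely illustrative: the function name, the input encoding and the sample angles are invented for this example. It tallies extrema as $+1$ and saddles as $-1$, with weight $1/2$ on edges and $\alpha/2\pi$ at a vertex of angle $\alpha$.

```python
from fractions import Fraction


def weighted_index_sum(interior, edge, vertices):
    """Weighted count of local extrema minus saddle points.

    interior, edge: lists of +1 (local extremum) or -1 (saddle).
    vertices: list of (sign, angle_over_pi) pairs, with the angle given as a
              Fraction of pi, so a vertex of angle alpha has weight alpha / (2*pi).
    """
    total = Fraction(0)
    total += sum(Fraction(s) for s in interior)
    total += sum(Fraction(s, 2) for s in edge)
    total += sum(Fraction(s) * a / 2 for s, a in vertices)
    return total


# A sample acute triangle with angles pi/3, pi/4, 5*pi/12, extremum at each vertex.
verts = [(1, Fraction(1, 3)), (1, Fraction(1, 4)), (1, Fraction(5, 12))]

print(weighted_index_sum([], [-1], verts))            # expected configuration
print(weighted_index_sum([1], [-1, -1, -1], verts))   # interior extremum + two extra half-saddles
print(weighted_index_sum([1], [-1], verts))           # unbalanced, so not admissible
```

The first two configurations sum to zero, matching the two scenarios described above. The third does not, which is the sense in which an interior extremum forces extra saddles somewhere.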
Comment by Terence Tao — August 9, 2013 @ 11:48 pm
Not much progress to report, unfortunately, but two small additional thoughts:
(a) now that narrow triangles have been dealt with, the other troublesome region of the configuration space is the nearly equilateral triangles (the region near the point H in the above diagram), mainly due to the fact that the second and third Neumann eigenvalue are very close to each other here (so that our previous numerical strategies would break down). Abstractly we know that there is an open region around H where the hot spots conjecture holds, but it may now be worthwhile to try to see how explicitly large of a region we can make here. One nice thing here is that for the perfectly equilateral triangle, all the spectral statistics (e.g. the Neumann eigenvalues, the Dirichlet eigenvalues, the mixed Neumann-Dirichlet eigenvalues when some sides are Neumann and others are Dirichlet) are all explicitly computable through reflection arguments and Fourier analysis, and through Rayleigh quotient arguments one can then get approximate control on these same statistics for nearly equilateral triangles.
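To illustrate the "explicitly computable" remark, here is a quick numerical check of one explicit Neumann eigenfunction on the equilateral triangle with vertices $(0,0)$, $(1,0)$, $(1/2, \sqrt{3}/2)$. The formula is a sum of three plane waves at 120 degrees to each other, with eigenvalue $16\pi^2/9$, which as far as I'm aware is the common value of the second and third Neumann eigenvalues for this triangle. The helper name and the sample points are choices made for this sketch.

```python
import numpy as np
import sympy as sp

x, y = sp.symbols("x y", real=True)
lam = 16 * sp.pi**2 / 9
u = sp.cos(4 * sp.pi * x / 3) + 2 * sp.cos(2 * sp.pi * x / 3) * sp.cos(2 * sp.pi * y / sp.sqrt(3))

# Residual of the eigenfunction equation, and the gradient, as numerical functions.
helmholtz = sp.lambdify((x, y), sp.diff(u, x, 2) + sp.diff(u, y, 2) + lam * u, "numpy")
grad = sp.lambdify((x, y), [sp.diff(u, x), sp.diff(u, y)], "numpy")

A = np.array([0.0, 0.0])
B = np.array([1.0, 0.0])
C = np.array([0.5, np.sqrt(3) / 2])


def max_normal_derivative(P, Q, samples=50):
    """Largest |du/dnu| sampled along the segment PQ."""
    d = Q - P
    normal = np.array([d[1], -d[0]]) / np.linalg.norm(d)  # orientation irrelevant for |.|
    pts = P + np.linspace(0.0, 1.0, samples)[:, None] * d
    gx, gy = grad(pts[:, 0], pts[:, 1])
    return np.max(np.abs(gx * normal[0] + gy * normal[1]))


xs, ys = np.array([0.3, 0.5, 0.6]), np.array([0.1, 0.4, 0.2])
print(np.max(np.abs(helmholtz(xs, ys))))
print([max_normal_derivative(P, Q) for P, Q in [(A, B), (B, C), (C, A)]])
```

All printed quantities should be zero up to rounding. Plugging perturbations of such explicit functions into Rayleigh quotients is one way to get the approximate control for nearly equilateral triangles mentioned above.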
(b) Miyamoto’s method proceeds by starting with the second eigenfunction $u_2$ with a critical point at $p$, and subtracting off of it some multiple of an explicit solution $v$ to the eigenfunction equation with a critical point at $p$ with a definite sign on the normal derivative, to obtain a new solution $w$ with a degenerate critical point and a definite sign on the normal derivative, which can then be used to lead to a contradiction. But it might be possible to get additional information on $u_2$ by finding other ways to line up $u_2$ and $v$ to create a degenerate critical point. For instance, instead of getting the critical points of $u_2$ and $v$ to line up, one could ask instead for the nodal lines of $u_2$ and $v$ to be tangent to each other, as this would also mean that some linear combination of $u_2, v$ has a degenerate critical point (cf. the Lagrange multiplier method). Similarly if one of the two functions has a critical point at some point other than the nodal line of the other. By playing around with various choices for $v$ (e.g. radial examples, whose nodal line is a circle, or cosine examples, whose nodal line is a line) this may provide some geometric constraints on the nodal line which could potentially be useful. (For instance we had a conjecture that the nodal line was convex, which might potentially be attackable by this sort of method.) In conjunction with point (a), we might try all this first for the equilateral triangle, where the nodal lines are explicitly computable and the geometry is all explicit and symmetric.
Comment by Terence Tao — August 13, 2013 @ 8:41 pm
Sorry, I’ve been bogged down by assorted other things and hope to return to posting some code+results within a few weeks.
Your comment (a) reminds me: computing the spectral gap in the neighbourhood of the equilateral triangles, I recall observing that there was a striking difference (in terms of symmetries) between the variation of the second (i.e., first non-zero) eigenvalue with triangle angles, and the variation of the third eigenvalue with triangle angles. Perhaps this is a well-known fact in spectral analysis? Here are (coarse grid) figures of what I mean:
Figure of the second Neumann eigenvalue as a function of triangle angles:
Figure of the third Neumann eigenvalue as a function of triangle angles (sorry, figure title has a typo.)
I wondered if this observation could somehow be used in trying to establish (a)? I used this informally to help me decide how to refine grids in parameter space.
Comment by nilimanigam — September 9, 2013 @ 7:55 pm