The polymath blog

September 10, 2012

Polymath7 research threads 4: the Hot Spots Conjecture

Filed under: hot spots,research — Terence Tao @ 7:28 pm

It’s time for another rollover of the  Polymath7 “Hot Spots” conjecture, as the previous research thread has again become full.

Activity has now focused on a numerical strategy to solve the hot spots conjecture for all acute angle triangles ABC.  In broad terms, the strategy (also outlined in this document) is as follows.   (I’ll focus here on the problem of estimating the eigenfunction; one also needs to simultaneously obtain control on the eigenvalue, but this seems to be to be a somewhat more tractable problem.)

  1. First, observe that as the conjecture is scale invariant, the only relevant parameters for the triangle ABC are the angles \alpha,\beta,\gamma, which of course lie between 0 and \pi/2 and add up to \pi.  We can also order \alpha \leq \beta \leq \gamma, giving a parameter space which is a triangle between the values (\alpha,\beta,\gamma) = (0,\pi/2,\pi/2), (\pi/4,\pi/4,\pi/2), (\pi/3,\pi/3,\pi/3).
  2. The triangles that are too close to the degenerate isosceles triangle {}(0,\pi/2,\pi/2) or the equilateral triangle {}(\pi/3,\pi/3,\pi/3) need to be handled by analytic arguments.  (Preliminary versions of these arguments can be found here and Section 6 of these notes  respectively, but the constants need to be made explicit (and as strong as possible)).
  3. For the remaining parameter space, we will use a sufficiently fine discrete mesh of angles (\alpha,\beta,\gamma); the optimal spacing of this mesh is yet to be determined.
  4. For each triplet of angles in this mesh,  we partition the triangle ABC (possibly after rescaling it to a reference triangle \hat \Omega, such as the unit right-angled triangle) into smaller subtriangles, and approximate the second eigenfunction w (or the rescaled triangle \hat w) by the eigenfunction w_h (or \hat w_h) for a finite element restriction of the eigenvalue problem, in which the function is continuous and piecewise polynomial of low degree (probably linear or quadratic) in each subtriangle; see Section 2.2 of these notes.    With respect to a suitable basis, w_h can be represented by a finite vector u_h.
  5. Using numerical linear algebra methods (such as Lanczos iteration) with interval arithmetic, obtain an approximation \tilde u to u_h, with rigorous bounds on the error between the two.  This gives an approximation to w_h or \hat w_h with rigorous error bounds (initially of L^2 type, but presumably upgradable).
  6. After (somehow) obtaining a rigorous error bound between w and w_h (or \hat w and \hat w_h), conclude that w stays far from its extremum when one is sufficiently far away from the vertices A,B,C of the triangle.
  7. Using L^\infty stability theory of eigenfunctions (see Section 5 of these notes), conclude that w stays far from its extremum even when (\alpha,\beta,\gamma) is not at a mesh point.  Thus, the hot spots conjecture is not violated away from the vertices.  (This argument should also handle the vertex that is neither the maximum nor minimum value for the eigenfunction, leaving only the neighbourhoods of the two extremising vertices to deal with.)
  8. Finally, use an analytic argument (perhaps based on these calculations) to show that the hot spots conjecture is also not violated near an extremising vertex.

This all looks like it should work in principle, but it is a substantial amount of effort; there is probably still some scope to try to simplify the scheme before we really push for implementing it.

About these ads

11 Comments »

  1. It seems that one of the weaker links in the above process, at least at our current level of understanding, is Step 6. If we had L^2 bounds on the residual \Delta w_h - \lambda w_h, then one could get H^2 (and thence L^infty) bounds on the error w-w_h, but with piecewise polynomial approximations that are only continuous at the edges, \Delta w_h is going to have some singular components on the edges of the triangle and so L^2 bounds for the residual are not available.

    On the other hand, if we get really good bounds for Step 6, then maybe these bounds would also work for perturbations of the triangle and we could skip Step 7.

    Comment by Terence Tao — September 10, 2012 @ 7:36 pm | Reply

    • There are two pieces to Step 6.

      part a. Denote w_h as the discrete approximant of w from the finite-dimensional subspace V_h \subset H^1(PQR), where w_h is computed by solving the discrete eigenvalue problem DVP2 exactly.

      part b. Denote v_h as the approximation of w_h, computed using an iterative process for computing eigenfunctions of discrete systems.

      What we need is a rigorous L^2 estimate on the residual \|\Delta v_h -\lambda v_h\|, since what we have in hand is v_h.
      I think one can use inverse and duality estimates to compute the finite element residual \| \Delta w_h -\lambda w_h\|, since w_h is approximating a C^2 function, and can try to post some notes/references.

      If we denote the quantity v_h-w_h = z_h, then we want \Delta z_h - \lambda z_h to be well-controlled. We only have in hand \lambda_h and v_h, so the error estimate we want will have to boot-strap from these.
      Apart from using interval arithmetic, I’m not sure how to proceed on this. Insight into part b. is key.

      Comment by nilimanigam — September 12, 2012 @ 5:11 pm | Reply

      • One possibility is to not work with residuals, but instead with the Rayleigh quotient of w_h (which is finite, because it only requires square integrability of the first derivative of w_h, not square integrability of the second derivative). If this Rayleigh quotient is known to be close to the true second eigenvalue \lambda_2, then w_h is necessarily close in H^1 norm to a true second eigenfunction. On the plus side, this lets us skip Steps 6 and 7 (since the Rayleigh quotient is quite stable wrt perturbation of the triangle) and would be relatively simple to implement.

        On the minus side, it only gives H^1 control on the true eigenfunction, which isn’t quite enough by itself to give L^infty control. But perhaps we can combine it with some elliptic regularity or something. For instance, in any disk in the triangle ABC, one can write the value of the eigenfunction at the centre of the the disk in terms of an integral on the disk involving a Bessel function. So if one has H^1 control on the eigenfunction in that disk, one gets pointwise control at the centre of the disk. If one is near an edge of the triangle, but not near a corner, one can do something similar after first applying a reflection. Things get worse near the corners, but we were planning to use a different argument for that case anyway (Step 8).

        Another minus is that I suspect the dependence of the error on the mesh size is poor (something like the square root of the mesh size, perhaps). And it requires quite precise control on the second eigenvalue. But it may still be the lesser of two evils.

        Comment by Terence Tao — September 14, 2012 @ 5:18 am | Reply

        • Let’s see. For a symmetric positive-definite A, the Lanzcos iteration for the smallest eigenvalue of Ax = \mu x proceeds until a certain error criteria is met. This error criteria is based on the discrete Rayleigh quotient v^TAv ($v$ is assumed to be normalized).

          In our setting, the quantity w_h^T A w_h is the quantity (\nabla I_h w_h, \nabla I_h w_h) + (I_h w_h, I_h w_h), where I am using the notation I_hw_h to be the element of H^1 constructed by using the coefficients w_h in some basis.

          In other words, the algorithm already minimizes the discrete Rayleigh quotient. The nature of the matrix A tells us that the true discrete eigenvalue converges to the true one (for the continuous one) quadratically in the mesh size. The asymptotic constant depends on the true eigenvalue and the spectral gap. Eigenvectors converge quadratically with tesselation size in the H1 norm, eigenvalues converge quartically.

          Backward error analysis tells us that the computed w_h is close to the true w_h for a given matrix size. The asymptotic constant again depends on the spectral gap. http://web.eecs.utk.edu/~dongarra/etemplates/node151.html
          So I think we already have H^1 control, and I should attempt to explicitly string together the constants.

          I suspect you may be right: since the asymptotic constants above for any FIXED triangle PQR depend on the spectral gap, and since this spectral gap varies as we change the angles of PQR, we may have poor control in terms of the angle parameters. But staring at the spectral gap plot
          http://people.math.sfu.ca/~nigam/polymath-figures/SpectralGap-07.jpg
          I am optimistic that away from the equilateral triangle one could actually make the dependence of asymptotic constants precise.

          What’s harder to get a hand on is the residual of the computed (ie, not the finite element, but actually computed) eigenvector in terms of the direct application of the Laplacian.

          Comment by nilimanigam — September 14, 2012 @ 6:45 pm | Reply

        • Here are some brief computations illustrating how an L^2 bound on the error in an approximate eigenfunction can be turned into L^infty bounds, at least if one is away from corners, thus avoiding the need to have to understand the residual.

          More precisely, suppose that we have a true eigenfunction -\Delta u = \lambda u and some discretised approximation \tilde u to this eigenfunction, and that we have somehow obtained a bound \| u - \tilde u \|_{L^2(ABC)} \leq \varepsilon on the L^2 error (e.g. we should be able to get this if the Rayleigh quotient of \tilde u is close to the minimum value of \lambda).

          Now suppose that the triangle ABC contains the disk B(0,R) for some radius R. A bit of messing around with polar coordinates shows that the function

          \overline{u}(r) := \frac{1}{2\pi} \int_0^{2\pi} u(r,\theta)\ d\theta

          is smooth on [0,R], equal to u(0) at the origin, and obeys the Bessel differential equation \partial_{rr} \overline{u}(r) + \frac{1}{r} \partial_r \overline{u}(r) + \lambda \overline{u}(r) = 0

          which can be solved using Bessel functions as

          \overline{u}(r) = J_0(\sqrt{\lambda} r) u(0)

          thus

          \int_0^{2\pi} u(r,\theta)\ d\theta = 2\pi J_0(\sqrt{\lambda} r) u(0)

          for all 0 \leq r \leq R. Integrating this against a test function f(r)\ r dr we get

          \int_0^{2\pi} \int_0^{R} u(r,\theta) f(r)\ r dr d\theta = 2\pi u(0) \int_0^R J_0(\sqrt{\lambda} r) f(r)\ r dr.

          Splitting u into \tilde u and u-\tilde u and using Cauchy-Schwarz to bound the latter term, we obtain

          |\int_0^{2\pi} \int_0^{R} \tilde u(r,\theta) f(r)\ r dr d\theta - 2\pi u(0) \int_0^R J_0(\sqrt{\lambda} r) f(r)\ d dr| \leq \varepsilon (2\pi \int_0^R |f(r)|^2\ r dr)^{1/2}.

          This gives a computable upper and lower bound for u(0). From the Cauchy-Schwarz inequality, we see that the optimal value of f here is f(r) = J_0(\sqrt{\lambda} r), leading to the bound

          |u(0) - A^{-1} \int_0^{2\pi} \int_0^{R} \tilde u(r,\theta) J_0(\sqrt{\lambda} r)\ r dr d\theta| \leq A^{-1/2} \varepsilon (1)

          where

          A := 2\pi \int_0^R |J_0(\sqrt{\lambda} r)|^2\ r dr. (2)

          Of course, this estimate works best when R is large, and we can only fit a big ball B(0,R) inside the triangle ABC if the origin 0 is deep in the interior of ABC. But if instead 0 is near an edge of ABC, but far from the vertices, one can still get a big ball inside the domain after reflecting the triangle across the edge 0 is near (but this multiplies the L^2 norm of the residual by a factor of sqrt(2)). In principle, this gives L^infty control on u everywhere except near the corners. (Of course, one still has to numerically integrate the expressions in (1) and (2), but this looks doable, albeit messy.)

          Comment by Terence Tao — September 20, 2012 @ 10:42 pm | Reply

          • This is really useful!

            Here’s how I was trying to proceed:

            Suppose u is the exact eigenfunction, and R_hu its exact projection onto the finite dimensional subspace V_h. I was trying to compute the precise asymptotic constant c in the sup-norm estimate \|u-R_h u\|_\infty \leq c h^{\mu} (this is the kind of FEM estimate one gets, eg. page 3 of http://people.math.sfu.ca/~nigam/polymath-figures/Validation+Numerics.pdf), but I think your argument above may help with this.

            One then has to estimate \|R_h u- \tilde{u}\|_\infty , where both now live in V_h. I think this can be controlled using the inverse and backward error estimates for eigenvalue problems – the trick is getting the asymptotic constants decently. One then puts these estimates together

            I’m also (still!) trying to debug the code to locate the extrema of the piecewise quadratics on the smaller triangles. This would circumvent the interpolation-to-linears step. The calculations are so very messy (not hard), so I’m going very slowly.

            Comment by Nilima Nigam — September 21, 2012 @ 6:13 pm | Reply

  2. So unfortunately, nothing we discussed panned out. But the following were some ideas considered:

    1) Perhaps it is the case that the nodal line is the image of a line or circle (hyberbolic geodesic?) under the Schwarz–Christoffel mapping. Or if not the image then close to it. Bartlomiej made some computer simulations and in many cases the fit was very close but for the equilateral triangle the fit was a fair bit off.

    2) Terry’s re-proof of a coupling-argument using the vector maximum principle begs the question: Is it possible to reprove the Hot-Spots conjecture for obtuse triangles (the paper of Banuelos and Burdzy) using such a non-probabilistic argument? If so, perhaps the proof could be extended to the non-obtuse case. Of course we can see by computer simulation that the gradient field of certain acute triangles is much nastier… but I think it is still the case that the gradient vectors all lie in a half space? In fact I think the hot-spots conjecture would hold if we could show that the gradient vectors all lie in a (non-convex) cone of angle less than 360 degrees. In summary: Maybe something can be said about the gradient field.

    3) There suggestion of Mihai Pascu involving the nodal line dividing the triangle into star-like domains with respect to hyperbolic geodesics. But I didn’t quite follow/remember… maybe Barlomiej can better explain.

    4) There was the suggestion of Baneulos that we might examine the function \phi_2(x)/\phi_1(x) where \phi_i(x) is the i-th Dirichlet eigenfunction of the triangle. Apparently this ratio has properties similar to the second Neumann eigenfunction.

    That’s all I can remember… hopefully I didn’t miss any.

    Edit: I remembered another interesting comment:

    5) Chris Burdzy pointed out that any argument used has to be one that wouldn’t work on a manifold. He gave the following counter-example: Consider the surface of a long baguette. The hot spots would then be at the
    tips of the baguette”. This is a manifold without boundary but then you can cut a small hole in the baguette to make the surface into a manifold with boundary. If the hole is small enough, the hot spots wouldn’t move much and would therefore not be on the boundary.

    Comment by Chris Evans — September 16, 2012 @ 10:56 pm | Reply

  3. The observation (1) is interesting! let’s ignore the equilateral triangle case for the time being: is the nodal line always a line/circle under the Schwarz-Christoffel map? Is there some simple way to see this?

    Comment by Nilima Nigam — September 17, 2012 @ 10:02 pm | Reply

    • Presumably one can do explicit calculations for the 45-45-90 triangle. If the vertices are (0,0), (1,0), (1,1) then the second eigenfunction is \cos(\pi x) + \cos(\pi y) and the nodal line is the bisecting altitude connecting (1/2,1/2) with (1,0). I guess this transforms to a horocycle or something under Schwarz-Christoffel?

      The other explicit example is the 30-60-90 triangle, but the nodal line there is a bit more complicated. If we use the triangle with vertices (0,0,0), (1,-1,0), (1,-1/2,-1/2), the second eigenfunction is \cos(\pi(x-y)/3) + \cos(\pi(y-z)/ 3) + \cos(\pi(z-x)/3), and I can’t work out the nodal line for that in my head…

      Comment by Terence Tao — September 17, 2012 @ 10:44 pm | Reply

    • Bartlomiej’s numerical simulations suggested conjecture (1) was false for equilateral triangles but true (or perhaps just coincidentally close) for the 30-60-90 triangle. As the 30-60-90 triangle is half the equilateral triangle perhaps there is a nice analytic way to compare the images of lines/circles under the SC-map for the two regions.

      More specifically, does it hold that the image-of-lines/circles for the equilateral triangle are the reflections of those for the 30-60-90 triangle?

      As a separate question: Suppose we could show that the nodal line lies in some specific “epsilon band”. That begs the question of how to prove a claim such as the following:

      Claim: Consider an “almost-arc” with two straight Neumann-sides meeting at an acute angle at the corner A and one wiggly Dirichlet-side such that the distance from any point of the Dirichlet-side to the corner A lies in the interval (R-\epsilon,R + \epsilon). Then for suitably large R and small \epsilon, the maximum of the first eigenfunction should occur at the corner A.
      (Instead of taking an “epsilon band” around a curved arc we might also take one around a straight line and consider an “almost-triangle”)

      Our proofs using coupled-BM/vector-valued maximum principle won’t work if the Dirichlet-side is wiggly and yet intuitively the claim still ought to hold. Of course if \epsilon\to 0 it should hold by continuity, but it ought to hold for \epsilon not too small…

      Comment by Chris Evans — September 18, 2012 @ 4:39 pm | Reply

  4. I’ve been playing around a bit trying to estimate the X in Terry’s notes, in particular the \frac{d^2}{dt^2} A(t) term, where A(t) denotes the area of the triangle. I scanned in some notes at

    http://www.math.missouri.edu/~evanslc/Polymath/Oct26th.pdf

    Looking at the graph of the function in Wolfram Alpha it seems that f(x)=\Gamma(\frac{x}{\pi})^2\sin(x) and its first and second derivatives will blow up when x is close to 0… but that is old news I suppose and we can presumably handle the case where one of the angles is close to 0 by other means.

    Despite its nice structure I didn’t see an easy way to read off the eigenvalues/vectors of the Hessian of F… but such information would yield a bound. In fact, using that our perturbation is restricted to a plane might give us a better bound.

    Comment by Chris Evans — October 26, 2012 @ 8:59 pm | Reply


RSS feed for comments on this post. TrackBack URI

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s

Theme: Customized Rubric. Blog at WordPress.com.

Follow

Get every new post delivered to your Inbox.

Join 135 other followers

%d bloggers like this: