It’s probably time to refresh the previous thread for the “finding primes” project, and to summarise the current state of affairs.
The current goal is to find a deterministic way to locate a prime in an interval in time that breaks the “square root barrier” of
(or more precisely,
). Currently, we have two ways to reach that barrier:
- Assuming the Riemann hypothesis, the largest prime gap in
is of size
. So one can simply test consecutive numbers for primality until one gets a hit (using, say, the AKS algorithm, any number of size z can be tested for primality in time
.
- The second method is due to Odlyzko, and does not require the Riemann hypothesis. There is a contour integration formula that allows one to write the prime counting function
up to error
in terms of an integral involving the Riemann zeta function over an interval of length
, for any
. The latter integral can be computed to the required accuracy in time about
. With this and a binary search it is not difficult to locate an interval of width
that is guaranteed to contain a prime in time
. Optimising by choosing
and using a sieve (or by testing the elements for primality one by one), one can then locate that prime in time
.
Currently we have one promising approach to break the square root barrier, based on the polynomial method, but while individual components of this approach fall underneath the square root barrier, we have not yet been able to get the whole thing below (or even matching) the square root. I will sketch the approach (as far as I understand it) below; right now we are needing some shortcuts (e.g. FFT, fast matrix multiplication, that sort of thing) that can cut the run time further.
– The polynomial method –
The polynomial method begins with the following observation: in order to quickly find a prime in , it suffices to be able to quickly solve the prime decision problem: given a subinterval
of
, decide whether such an interval contains a prime or not. If one can solve this problem in, say,
time, then one can find a prime in this time also by binary search.
Actually, using Odlyzko’s method we can already narrow down to an interval of length with a lot of primes in it in
time, so we only need to break the square root barrier for the decision problem for intervals
of length
or less.
The decision problem is equivalent to determining whether the prime polynomial
(1)
is non-trivial or not, where ranges over primes in the interval
.
Now, the prime polynomial, as it stands, has a high complexity; the only obvious way to compute it is to enumerate all the primes from to
, which could take
time in the worst case. But we can improve matters by working modulo 2; note that as the coefficients of
are either 1 or 0, it suffices to decide whether
is non-trivial modulo 2.
The reason we do this is the observation that if is a natural number, then the number of solutions to the diophantine equation
with
is odd when n is prime, and usually even when n is composite. (There are some rare exceptions to this latter fact, when n contains square factors, but it seems likely that one can deal with these latter cases by Möbius inversion, exploiting the convergence of the sum
.) So, the prime polynomial f modulo 2 is morally equal to the variant polynomial
(2)
So a toy problem would be to decide whether (2) was non-zero modulo 2 or not in time or better.
The reason that (2) is more appealing than (1) is that the primes have disappeared from the problem. Instead, one is computing a sum over a fairly simple region bounded by two hyperbolae and two lines.
The point is now this: if vanishes modulo 2, then it also vanishes modulo
for any low-degree polynomial g (degree
or better), and more generally $\tilde f(x^n)$ vanishes modulo
. Conversely (if one is lucky), if
vanishes modulo
for sufficiently many
, then it should be that
vanishes. So this leads to the following strategy:
- Goal 1: Find a collection of
such that if
vanishes modulo
for all the pairs
, then
vanishes modulo 2.
- Goal 2: Find a way to decide whether
vanishes modulo
for all the required
in time
or better.
One way to achieve Goal 1 is to forget about , and choose the
so that the least common multiple of all the
(modulo 2) cannot divide
. Since
is basically a polynomial of degree
shifted by a monomial, one obvious way to proceed would be to pick more than
polynomials g. But then it looks unlikely that one can beat the square root barrier in Goal 2. Similarly if one varies n as well as g.
On the other hand, we have a partial result in Goal 2: for any fixed n and g, we can compute in time below the square root barrier, e.g. in time
. For instance, setting n=1 and
, we can compute
in this time. Unfortunately a single n,g is not nearly enough to solve Goal 2 yet, so we either need a further advance on Goal 1, or some very efficient way to test non-vanishing of
modulo (2,g) for many pairs (n,g) at a time (e.g. by an FFT type approach).
The partial result is based on the fact that has an arithmetic circuit complexity below the square root level, i.e. it can be expressed in terms of
(say) arithmetic operations. As such, for any low-degree g (say degree
),
can be computed in
time (using fast multiplication for mod g arithmetic if necessary).
Let’s sketch how the circuit complexity result works. Recall that (2) is a sum over the geometric region . Using the geometric series formula, one can convert this sum over
to a sum over what is basically the boundary of
. This boundary has
points, so this shows that
has an arithmetic circuit complexity of
already. But one can do better by using the Farey sequence to represent the discrete hyperbolae that bound
by line segments. The sum over each line segment is basically a quadratic sum of the form
for various coefficients
. It seems that one can factorise this sum as a matrix product and use ideas from the Strassen fast multiplication algorithm to give this a slightly better circuit complexity than the crude bound of
; see these notes; there may also be other approaches to computing this quickly (e.g. FFT).
Where we’re still stuck right now is scaling up this single case of Goal 2 to the more general case we need. Alternatively, we need to strengthen Goal 1 by cutting down substantially the number of pairs (n,g) we need to test…
[...] http://polymathprojects.org/2009/10/27/research-thread-v-determinstic-way-to-find-primes/ Possibly related posts: (automatically generated)You’ll Always Find Your Way Back Home New ClipSOME BEST WAYS 2 FIND NEW MUSICRavelry Finds ~ New Ways to Warm UpNo Way Home: The Rules [...]
Pingback by New thread for deterministic way to find primes « Euclidean Ramsey Theory — October 28, 2009 @ 5:08 pm |
Regarding writing the prime generating function
as a sum
, where the
and
are sparse polynomials (say,
terms each): I had a conversation with a friend of mine this past weekend who is an expert on real algebraic geometry, and who was in town for the FOCS conference here at Georgia Tech. He said that in the
context (the
are polynomials with real coefficients) this problem is a well-studied in terms of trying to generalize Descarte’s Rule of Sign. If a single polynomial has only
terms, Descarte’s rule implies that it can have at most
non-zero real roots, and apparently it is a studied open problem to extend this to a sum of products of two sparse polynomials like we have (but in
). Perhaps there are some techniques that algebraic geometers have developed that would be useful in bounding the number of roots of our analogous polynomials in
, or at least good conjectures. I didn’t get the chance to ask him what is known on this problem, but I will soon, and report back if I discover anything on it…
Incidentally, *he* initiated the discussion of this problem — not me — as it pertained to one of the FOCS talks on arithmetic circuit complexity.
Comment by Ernie Croot — October 28, 2009 @ 6:40 pm |
Oops… I meant to say “bounding the number of roots… among
in
”.
Comment by Ernie Croot — October 28, 2009 @ 7:01 pm |
It seems that the result on Descartes Rule uses special properties of the reals, and probably won’t extend to the complex (or finite field) setting. Here is the problem I asked my friend:
—-
Problem. Suppose that
are all polynomials
having
terms each. Let
be an irreducible polynomial,
is at least
, say. Let
in
such that the order of
Show that
can’t all be
in
.
—-
And here was his response:
One remark about the problem. The fact that there exists any bound at all in terms of m and k in the real case has to do with the fact that we are bounding the number of real roots. Of course, no such bounds exist for complex roots — this makes me suspect that standard tools of algebraic geometry are probably not very useful in this context.
An exponential bound (in m as well as k) in the real case follows from Khovansky’s theory Pfaffian functions (you might want to look at the book “Fewnomials” by Khovansky).
But I do not think it can have applications in the finite field case where an exponential bound is probably obvious.
Comment by Ernie Croot — October 29, 2009 @ 10:19 pm |
I am not sure of this would help but would an approach on de randomizing a probabilistic algorithm for finding primes help? I attended midwest theory day and it seemed it may apply.
http://pages.cs.wisc.edu/~jkinne/research/KvMS_full_manuscript.pdf
Comment by Joshua Herman — December 17, 2009 @ 4:01 am |