It's been known for a long time that the distribution of prime numbers "up to x is asymptotic to x/log(x), where log(x) is the natural logarithm of x"*. In plain English, the probability of numbers being prime decreases by a predictable amount on a gradual slope the larger those numbers become. If the truth be told, this fact is not very interesting - nor very surprising, for that matter.
What is interesting and seems worthy of investigation is the observation that the number of primes occurring for a given quadratic interval up to any observable x is actually increasing absolutely. This doesn't contradict the asymptotic rule above. It simply makes things much more interesting.
It raises an obvious question: do the number of prime numbers increase in absolute terms with the increase of the quadratic interval to infinity? - or at some point does the thinning of prime numbers become greater than the increase in the quadratic interval, so that the absolute number of primes per interval begins to diminish? (If a reader knows the answer to this question let me know!)
The situation looks like this for numbers under 100 million. In the following graph, the vertical axis is logarithmic. Perfect squares grow the fastest, then the quadratic interval between the perfect squares. Then almost parallel with the interval count, the prime count is shown. It is increasing overall in absolute numbers as measured from one quadratic interval to the next. The trend is not linear, but viewed on the log scale, very close to it. However, as the fourth line shows, the ratio of prime numbers to the quadratic interval is slowly falling off, but at an ever more gradual rate.

Download the chart data.
For now, let's assume that the absolute increase in prime numbers per quadratic interval does increase indefinitely even though the rate of increase is continually converging - that is, leveling off. No doubt the question as to whether it ever levels off completely and whether it begins a gradual descent could be solved very quickly by applying sufficient computing power for sufficient time.
Let's now look at the question of how primes and prime factors distribute themselves within each quadratic interval, or rotation. We could call this the "demographics" of natural numbers - demographics being related to "the dynamic balance of a population especially with regard to density and capacity for expansion or decline" (Merriam-Webster).
Is there a consistent recognizable pattern to this distribution that can be grasped by graphing numbers in relation to the quadratic interval, rather than some other arbitrary division of the number line? By segmenting the number line using the perfect square as a regular marker, we are in effect designating the quadratic interval to be a constant. Regardless of whether the interval between perfect squares is 7 or 707 or 7007, we will divide this interval into equal amounts.
The amounts we divide the interval into don't matter so long as they're equal amounts. For convenience, we can use 360º, or we can use divisions based on powers of 2, for example, such as 1/64th or 1/128th. We can visualize the data by looking at the density of the individual data points or by connecting them together and smoothing the line to gain an impression of change within the interval.
The following graph shows the summation of all prime numbers in the first 1000 quadratic intervals, or rotations. It shows the sum of all prime numbers less than 1,000,000 (the perfect square of 1,000) for each 1/360th of the interval. A moving-average (10-unit) trendline is also shown.

Consequences of a constant interval
The quadratic interval on the 999th rotation is 1999 (998001-1000000). Since the X-axis is divided into 360º, for the 1000th rotation each degree represents about 5.5 integers. On the 180th rotation (32400-32761=361), each degree represented 1 integer. On the 50th rotation (2500-2601=101), each integer was represented by 3.56º.
The waveform shown in the graph can also be observed by counting the frequency of primes or by looking at the prime factor distribution of all the composites - for example, the arithmetic mean of the highest prime factor of each composite. Since these statistical methods are cumulative, if the distribution was somehow random it would flatten out over time. This is not the case. In fact, the waveform is quite consistent over time.
The noncumulative distributions are equally interesting. For example, the prime number scatter reveals numerous proximate prime polynomials that appear like elongated constellations in a star chart.

More to come...
|