The Anti-Correlation Engine: Variance Structure of ρ_E and the Local Smoothness of Integer Complexity
We report the decisive M8 exploration results on the variance structure of the integer complexity function $\rho_E$. The deficiency $d_\rho(n) = \rho_E(n) - c^* \ln n$ (where $c^*$ is a working normalization constant) has shell variance $\mathrm{Var}(d_\rho \mid \Omega = k, n \leq N)$ consistent with $a(k) + b(k) \cdot \ln\ln N$, with $b(k) > 0$ and increasing significantly in $k$ on the tested range — not $O(1)$ as initially conjectured. Despite this, the first-difference variance $\mathrm{Var}(\rho_E(n-1) - \rho_E(n) \mid \Omega = k)$ lies in $[0.3, 1.35]$ across all tested $k$ ($2$ to $18$) and $N$ ($10^4$ to $10^7$), with zero growth trend in $N$, supporting $\sigma(G) = O(1)$. The mechanism is a forced anti-correlation between the additive part $f$ and the combinatorial remainder $r$ in the decomposition $\rho_E = f + r$: $\mathrm{Var}(\Delta f) \approx 10$ and $\mathrm{Var}(\Delta r) \approx 7$ individually, but $\mathrm{Cov}(\Delta f, \Delta r) \approx -7.5$ compresses $\mathrm{Var}(A) = \mathrm{Var}(\Delta f + \Delta r)$ to $\approx 1$. Three proof routes are eliminated: Turán-Kubilius on $\Delta f$ (variance $\approx 10$, not $O(1)$), separate treatment of $f$ and $r$ (each divergent), and shell-level concentration of $d_\rho$ (grows with $N$). The most leveraged remaining target for $\sigma(G) = O(1)$ is the local Lipschitz property of $d_\rho$: consecutive deficiencies differ by $O(1)$ despite shell-level drift of $O(\sqrt{k \cdot \ln\ln N})$, analogous to a process with stationary increments. Formalizing the local Lipschitz property is the most leveraged remaining step toward $\sigma(G) = O(1)$; combined with the remaining relay inputs — B-bound (numerical), scale term monotonicity (numerical), I-a for fixed $k$ (open), and Lemma II (numerically benign) — it would complete the engine for $D(N) \to 1$.
Keywords: integer complexity, ρ-arithmetic, anti-correlation, variance decomposition, local smoothness, deficiency, DP recursion, Brownian analogy
§1. Introduction
§1.1. Context
Paper 17 (DOI: 10.5281/zenodo.19016958) reduced the proof of $D(N) \to 1$ to four principal relay inputs plus Lemma II. The most leveraged input is $\sigma(G \mid \Omega = k) = O(1)$ — the formalization of Paper 13's Gaussian model. The present paper reports the M8 exploration results that resolve the variance structure of $\rho_E$ on $\Omega$-shells, identify three dead proof routes, and identify the most leveraged remaining target.
§1.2. Notation
As in Paper 17 §1.2. Additionally: $c^* = 1/\ln 2 \approx 1.4427$ (working normalization constant; not a proved asymptotic constant), $d_\rho(n) = \rho_E(n) - c^* \ln n$ (deficiency relative to $c^*$), $f(n) = \sum_{q^a \| n} \rho_E(q^a)$ (additive part), $r(n) = \rho_E(n) - f(n)$ (combinatorial remainder), $\Delta f(n) = f(n-1) - f(n)$, $\Delta r(n) = r(n-1) - r(n)$.
§1.3. Main Results
Numerical Discovery 1 (Shell Deficiency Drift). $\mathrm{Var}(d_\rho \mid \Omega = k, n \leq N)$ fits $a(k) + b(k) \cdot \ln\ln N$ with $b(k) > 0$ for all tested $k$:
| $k$ | $b$ |
|---|---|
| 4 | 1.26 |
| 6 | 2.58 |
| 8 | 4.41 |
| 10 | 7.53 |
$b(k)$ increases significantly with $k$ on the tested range (from $1.26$ at $k=4$ to $7.53$ at $k=10$), faster than linear. The precise asymptotic form of $b(k)$ is an open question.
Numerical Discovery 2 (Local Smoothness). $\mathrm{Var}(A \mid \Omega = k, n \leq N) = \mathrm{Var}(\rho_E(n-1) - \rho_E(n) \mid \Omega = k)$ lies in $[0.3, 1.35]$ across $k = 2$ to $18$ and $N = 10^4$ to $10^7$, with zero growth trend in $N$. For $k \geq 5$, the values lie in $[0.8, 1.35]$.
Numerical Discovery 3 (Anti-Correlation Engine). The decomposition $A = \Delta f + \Delta r$ reveals:
| $k$ | $\mathrm{Var}(\Delta f)$ | $\mathrm{Var}(\Delta r)$ | $\mathrm{Cov}(\Delta f, \Delta r)$ | $\mathrm{Var}(A)$ |
|---|---|---|---|---|
| 4 | 6.96 | 5.68 | −5.96 | 0.72 |
| 8 | 9.90 | 6.96 | −7.89 | 1.09 |
| 12 | 9.96 | 6.30 | −7.48 | 1.30 |
Neither $\Delta f$ nor $\Delta r$ is $O(1)$; their forced anti-correlation compresses $\mathrm{Var}(A)$ to $O(1)$.
Numerical Discovery 4 (Variance Anatomy of $G_{\mathrm{spf}}$). Complete decomposition $\mathrm{Var}(G_{\mathrm{spf}}) = \mathrm{Var}(A) + \mathrm{Var}(B) + 2\mathrm{Cov}(A,B)$:
- $\mathrm{Var}(A)$: rises from $0.31$ ($k=2$) to $1.33$ ($k=18$), bounded.
- $\mathrm{Var}(B)$: falls from $1.02$ ($k=2$) to $0.25$ ($k \geq 8$), converges to $1/4$.
- $\mathrm{Cov}(A,B)$: small negative, $\approx -0.10$ for $k \geq 6$.
Numerical Discovery 5 (Gaussian Character). $\mathrm{Var}(G \mid \Omega = k) \leq 1.45$ across all $k$. Kurtosis $\in [2.62, 2.92]$ (Gaussian value: 3). Skewness $\to 0$ at high $k$. Stable across five windows.
Eliminated Routes:
- (E1) Turán-Kubilius on $\Delta f$: $\mathrm{Var}(\Delta f) \approx 10$, not $O(1)$. No existing shifted-shell TK theorem.
- (E2) Separate treatment of $f$ and $r$: each component diverges; must treat $\rho_E = f + r$ as indivisible unit.
- (E3) Shell concentration of $d_\rho$: $\mathrm{Var}(d_\rho)$ grows with $N$, not $O(1)$.
Correct Proof Target: The local Lipschitz property $\mathrm{Var}(\rho_E(n-1) - \rho_E(n) \mid \Omega = k) = O(1)$.
§2. Shell Deficiency Drift
§2.1. The Deficiency and Its Natural Scale
Define $d_\rho(n) = \rho_E(n) - c^* \ln n$ where $c^* = 1/\ln 2 \approx 1.4427$ is the numerical best-fit constant for $\rho_E(n)/\ln n$ (a working normalization constant; Paper 11 establishes $\rho_E(n) = \Theta(\ln n)$ but does not prove the existence of a precise asymptotic constant). On the $\Omega = k$ shell, the question is whether $\mathrm{Var}(d_\rho)$ is $O(1)$, $O(k)$, or $O(\ln\ln N)$.
§2.2. Data: $\mathrm{Var}(d_\rho)$ vs $k$ and $N$
$k$-direction ($N = 10^7$ fixed): $\mathrm{Var}(d_\rho)$ monotonically decreases from $11.6$ ($k=2$) to $6.6$ ($k=18$) across 17 values of $k$, zero exceptions.
$N$-direction ($k$ fixed): seven-point scan from $N = 10^4$ to $10^7$ shows positive growth for all tested $k$. Fit: $\mathrm{Var}(d_\rho) \approx a(k) + b(k) \cdot \ln\ln N$ with $b(k) > 0$ increasing significantly with $k$ (from $1.26$ at $k=4$ to $7.53$ at $k=10$).
§2.3. Resolution of the Apparent Contradiction
In the $k$-direction at fixed $N$, $\mathrm{Var}(d_\rho)$ decreases because higher-$k$ shells are more constrained (more prime factors → tighter factorization control → less room for $d_\rho$ to deviate). In the $N$-direction at fixed $k$, $\mathrm{Var}(d_\rho)$ increases because larger $N$ introduces numbers with larger primes, whose deficiency behavior has wider spread.
The two effects coexist: the $k$-dependent coefficient $b(k)$ grows faster than linearly in $k$ on the tested range, so on the Erdős-Kac central shell $k \sim \ln\ln N$, $\mathrm{Var}(d_\rho)$ is expected to grow at least as fast as $\Theta((\ln\ln N)^2)$, though the precise scaling remains open.
§2.4. Why Shell Concentration Fails
The additive part $f(n) = \sum_{q^a \| n} \rho_E(q^a)$ has $\mathrm{Var}(f(n) \mid \Omega = k) \approx 16$ — already large. The remainder $r = \rho_E - f$ anti-correlates with $f$ (forced by the DP min), keeping $\mathrm{Var}(\rho_E)$ below $\mathrm{Var}(f)$. But this compression is not strong enough to reach $O(1)$; it only reduces $\sim 16$ to $\sim 10$.
§3. The Anti-Correlation Engine
§3.1. Structural Origin
$r(n) = \rho_E(n) - f(n)$ by definition. Therefore $\mathrm{Cov}(f, r) = \mathrm{Cov}(f, \rho_E - f) = \mathrm{Cov}(f, \rho_E) - \mathrm{Var}(f)$.
If $\rho_E$ were unconstrained (i.e., $r$ independent of $f$), then $\mathrm{Cov}(f, r) = 0$. The negative covariance arises precisely because the DP min operation caps $\rho_E$: when $f(n)$ is large (unfavorable additive structure), $\rho_E(n)$ cannot fully follow $f$ upward (the min with $M_n$ or $S_n$ pulls it back), so $r = \rho_E - f$ becomes more negative. Conversely, when $f(n)$ is small, $\rho_E$ may exceed $f$ (via the successor route), making $r$ more positive.
§3.2. The Same Mechanism for Differences
$A(n) = \rho_E(n-1) - \rho_E(n) = \Delta f(n) + \Delta r(n)$.
Both $\Delta f$ and $\Delta r$ individually have variance $\sim 7$–$10$ (growing with $k$), but their forced anti-correlation ($\mathrm{Cov} \approx -7.5$) compresses $\mathrm{Var}(A)$ to $\approx 1$.
The mechanism is the same: $\Delta r = \Delta \rho_E - \Delta f = A - \Delta f$. So $\mathrm{Cov}(\Delta f, \Delta r) = \mathrm{Cov}(\Delta f, A) - \mathrm{Var}(\Delta f)$.
If $\mathrm{Var}(A) = O(1)$ and $\mathrm{Var}(\Delta f) = \Theta(\ln\ln N)$, then $\mathrm{Cov}(\Delta f, A) = O(\sqrt{\ln\ln N})$ (by Cauchy-Schwarz), and:
$$\mathrm{Cov}(\Delta f, \Delta r) = \mathrm{Cov}(\Delta f, A) - \mathrm{Var}(\Delta f) \approx -\mathrm{Var}(\Delta f)$$
which explains the near-perfect cancellation.
§3.3. Remark: A Random-Walk Analogy
The numerical picture is suggestive of a random-walk structure: on the $\Omega = k$ shell, $d_\rho$ appears to have total variance growing as $\sim b(k) \cdot \ln\ln N$ while the increment variance $\mathrm{Var}(d_\rho(n-1) - d_\rho(n))$ remains $O(1)$ and the increments are approximately Gaussian (kurtosis $\approx 3$). This resembles a process with stationary increments. The analogy is heuristic; the proof of $\sigma(G) = O(1)$ requires formalizing the stationary-increment property, not the full random-walk structure.
§4. Eliminated Proof Routes
§4.1. Route E1: Turán-Kubilius on $\Delta f$
Target: $\mathrm{Var}(f(n-1) - f(n) \mid \Omega = k) = O(1)$.
Why it fails: $\mathrm{Var}(\Delta f) \approx 10$ at $N = 10^7$ and grows with $N$. Kátai-Subbarao's conditional TK gives $\mathrm{Var}(f(n) \mid \Omega = k) = O_k(1)$ (constrained side), but $f(n-1)$ is unconstrained ($\Omega(n-1)$ free), giving $\mathrm{Var}(f(n-1)) = O(\ln\ln N)$ by standard TK. With approximate independence of $n$ and $n-1$ (a heuristic supported by Elliott-type results on correlations of multiplicative functions), $\mathrm{Var}(\Delta f) \approx \mathrm{Var}(f(n-1)) + \mathrm{Var}(f(n)) - 2\mathrm{Cov} \approx O(\ln\ln N)$. The decisive evidence for E1 is the direct computation: $\mathrm{Var}(\Delta f) \approx 10$ at $N = 10^7$.
To the author's knowledge, no existing shifted-shell TK theorem resolves this; see Mangerel [6] for the current state of additive functions in short intervals and gaps.
§4.2. Route E2: Separate $f$ and $r$
Target: Prove $\mathrm{Var}(\Delta f) = O(1)$ and $\mathrm{Var}(\Delta r) = O(1)$ separately.
Why it fails: Both are $\gg 1$. The $O(1)$ variance of $A$ comes entirely from their cross-cancellation, not from individual boundedness.
§4.3. Route E3: Shell Concentration of $d_\rho$
Target: $\mathrm{Var}(d_\rho \mid \Omega = k) = O(1)$.
Why it fails: Seven-point $N$-scan confirms growth $\sim b(k) \cdot \ln\ln N$ with $b(k)$ increasing significantly in $k$.
§5. The Most Leveraged Proof Target: Local Lipschitz Property
§5.1. Statement
Numerical Target (Local Lipschitz). For each fixed $k$, there exists $C_k > 0$ such that
$$\mathrm{Var}(\rho_E(n-1) - \rho_E(n) \mid \Omega(n) = k, n \leq N) \leq C_k$$
uniformly in $N$.
Numerical evidence: $\mathrm{Var}(A) \in [0.3, 1.35]$ across $k = 2$ to $18$ and $N = 10^4$ to $10^7$, with zero growth trend (seven-point $N$-scan). Theoretical proof remains open.
§5.2. Why This Is the Most Leveraged Route toward $\sigma(G) = O(1)$
By Proposition J (Paper 17): $A = j - 1 = \max(G, 0) - 1$, so $\mathrm{Var}(A) = \mathrm{Var}(\max(G, 0)) \leq \mathrm{Var}(G)$.
The cleaner statement: $\mathrm{Var}(G_{\mathrm{spf}}) = \mathrm{Var}(A) + \mathrm{Var}(B) + 2\mathrm{Cov}(A,B)$. With $\mathrm{Var}(A) = O(1)$ (Local Lipschitz), $\mathrm{Var}(B) = O_k(1)$ (B-bound, numerical), and $|\mathrm{Cov}(A,B)| \leq \sqrt{\mathrm{Var}(A) \cdot \mathrm{Var}(B)} = O_k(1)$: $\mathrm{Var}(G_{\mathrm{spf}}) = O_k(1)$.
Numerically, $\mathrm{Var}(G)$ and $\mathrm{Var}(G_{\mathrm{spf}})$ remain close on the tested range (within $\pm 0.1$ for $k \geq 5$), with no evidence of variance blow-up in the passage from $G_{\mathrm{spf}}$ to $G$. So $\mathrm{Var}(G) = O_k(1)$.
§5.3. Connection to the DP Recursion
$\rho_E(n) = \min(\rho_E(n-1) + 1, M_n)$, so $A(n) = \rho_E(n-1) - \rho_E(n) = \max(\rho_E(n-1) + 1 - M_n, 0) - 1 = j(n) - 1$ (Proposition J).
The DP guarantees $A \geq -1$ (successor bound). The upper tail of $A$ is controlled by how much $M_n$ can undershoot $S_n$. On the $\Omega = k$ shell, $M_n$ is determined by $n$'s factorization, and $S_n = \rho_E(n-1) + 1$ is determined by the predecessor. Their near-cancellation to $O(1)$ is the local Lipschitz property.
§5.4. Possible Proof Directions for Local Lipschitz
- Direction 1: DP-induced smoothing. The recursion $\rho_E(n) = \min(S_n, M_n)$ creates a feedback loop: $S_{n+1} = \rho_E(n) + 1 = \min(S_n, M_n) + 1$. When $M_n \ll S_n$ (large jump), the next successor $S_{n+1}$ resets to $M_n + 1$, reducing the "memory" of previous values. This smoothing effect may bound the variance of consecutive differences.
- Direction 2: Quantitative smoothness of $\rho_E$. Numerically, $|\rho_E(n) - c^* \ln n|$ appears sublinear in $\ln n$ on the tested range. Since $c^*(\ln n - \ln(n-1)) = c^*/n + O(1/n^2) \to 0$, we have $|d_\rho(n) - d_\rho(n-1)| \leq |A(n)| + c^*/n$. This reduces to bounding $|A(n)|$, which is circular.
- Direction 3: Conditional second-moment method. Prove directly that $E[A^2 \mid \Omega = k] = O(1)$ using the insertion identity (Paper 16 Theorem 1) and the A/B decomposition.
§6. Complete Variance Anatomy
§6.1. $\mathrm{Var}(A)$ and $\mathrm{Var}(B)$ Across All $k$
| $k$ | $\mathrm{Var}(A)$ | $\mathrm{Var}(B)$ | $\mathrm{Cov}(A,B)$ | $\mathrm{Var}(G_{\mathrm{spf}})$ |
|---|---|---|---|---|
| 2 | 0.31 | 1.02 | +0.22 | 1.77 |
| 3 | 0.56 | 0.51 | +0.08 | 1.24 |
| 4 | 0.72 | 0.33 | −0.02 | 1.02 |
| 5 | 0.83 | 0.29 | −0.07 | 0.99 |
| 6 | 0.92 | 0.27 | −0.09 | 1.02 |
| 8 | 1.09 | 0.26 | −0.10 | 1.15 |
| 10 | 1.21 | 0.25 | −0.10 | 1.26 |
| 12 | 1.30 | 0.25 | −0.11 | 1.33 |
| 15 | 1.31 | 0.25 | −0.14 | 1.27 |
| 18 | 1.33 | 0.25 | −0.06 | 1.47 |
Observations: $\mathrm{Var}(B)$ converges to $1/4$ at high $k$, consistent with a Bernoulli-like distribution of $B$ values concentrated near $\{0, -1\}$. $\mathrm{Var}(A)$ saturates at $\approx 1.3$. $\mathrm{Cov}(A, B)$ is small and negative ($\approx -0.10$), indicating weak coupling.
§6.2. $G$ vs $G_{\mathrm{spf}}$
$G(n) \geq G_{\mathrm{spf}}(n)$ always (Paper 15, Theorem A). Non-SPF splits can be strictly better, making $G > G_{\mathrm{spf}}$:
| $k$ | $\mathrm{Var}(G)$ | $P(G = G_{\mathrm{spf}})$ |
|---|---|---|
| 2 | 0.35 | 54% |
| 5 | 0.92 | 36% |
| 8 | 1.23 | 17% |
| 12 | 1.40 | 12% |
| 18 | 1.30 | 13% |
At low $k$, non-SPF optimal splits are common and $\mathrm{Var}(G) < \mathrm{Var}(G_{\mathrm{spf}})$. At high $k$, $\mathrm{Var}(G) \approx \mathrm{Var}(G_{\mathrm{spf}})$, with $\mathrm{Var}(G)$ slightly exceeding $\mathrm{Var}(G_{\mathrm{spf}})$ by up to $0.1$.
§7. The Revised Proof Landscape
§7.1. What M8 Establishes
Discoveries (numerical, strong evidence on tested range $N \leq 10^7$, $k \leq 18$):
- Shell deficiency drift: $\mathrm{Var}(d_\rho)$ consistent with $a(k) + b(k) \cdot \ln\ln N$, $b(k) > 0$ increasing in $k$
- Local smoothness: $\mathrm{Var}(A)$ in $[0.3, 1.35]$ ($[0.8, 1.35]$ for $k \geq 5$; seven-point $N$-scan, zero growth trend)
- Anti-correlation engine: $\mathrm{Cov}(\Delta f, \Delta r) \approx -\mathrm{Var}(\Delta f)$, appears forced by DP min
- Gaussian character: kurtosis $\in [2.62, 2.92]$, skewness $\to 0$
- Variance anatomy: $\mathrm{Var}(B)$ converges toward $1/4$, $\mathrm{Var}(A)$ saturates near $1.3$
Eliminated routes: E1 (TK on $\Delta f$), E2 ($f/r$ separate), E3 (shell concentration)
§7.2. Updated Open Inputs for $D(N) \to 1$
| Input | Status | M8 impact |
|---|---|---|
| $\sigma(G) = O(1)$ | Most leveraged target identified: Local Lipschitz (theoretical proof open) | Shell concentration eliminated; proof target sharpened |
| Scale term monotonicity | Numerical (18pt + 7pt $N$-scan) | Unchanged |
| B-bound | Numerical hypothesis | $\mathrm{Var}(B) \to 1/4$ may be provable |
| I-a for fixed $k$ | Open | Unchanged |
| Lemma II | Numerically benign | $K_p > 0$ for $k \geq 6$ |
§7.3. Proof Dependency Graph (Updated)
$$\text{Local Lipschitz (N)} \longrightarrow \mathrm{Var}(A) = O(1) \xrightarrow{\text{+ B-bound (N)}} \mathrm{Var}(G_{\mathrm{spf}}) = O_k(1)$$
$$\mathrm{Var}(G_{\mathrm{spf}}) = O_k(1) + \text{Prop C (T)} + \text{Scale mono (N)} \longrightarrow \text{I-b}$$
$$\text{I-b} + \text{I-a (T high-}k\text{ / N fixed-}k\text{)} \longrightarrow \text{Lemma I}$$
$$\text{Lemma I} + \text{Lemma II (N)} + \text{B-bound (N)} \longrightarrow \text{Thm 5 (Paper 16, T)}$$
$$\text{Thm 5} + \text{SPF-positivity (N)} + \text{Sathe-Selberg (T)} \longrightarrow D(N) \to 1$$
Methodological Note: Four-AI Parallel Exploration and the Thermodynamic Interface
All mathematical claims, citations, and computations in this paper were independently checked and are the author's responsibility. AI systems were used for literature search, exploratory computation, and drafting support.
A.1. Four-AI Contributions to M8
The M8 results were developed through structured parallel exploration with four AI systems, following the protocol documented in Paper 16 Appendix.
ChatGPT conducted the literature search that proved decisive: identifying Kátai-Subbarao's conditional Turán-Kubilius [4], Goudout's shifted Erdős-Kac [5], and Mangerel's gap theory [6] as the relevant boundary of existing results. ChatGPT's prediction that $\mathrm{Var}(\Delta f) = O(\ln\ln N)$ (not $O(1)$) was confirmed by Gemini's data, and ChatGPT designed the $k$-vs-$N$ scaling test (§2.2) that resolved the $O(1)$ vs $O(\ln\ln N)$ question for $\mathrm{Var}(d_\rho)$.
Gemini computed the $f + r$ variance decomposition (§3) that revealed the anti-correlation engine: $\mathrm{Var}(\Delta f) \approx 10$, $\mathrm{Var}(\Delta r) \approx 7$, $\mathrm{Cov}(\Delta f, \Delta r) \approx -7.5$. This was the single most important computation of M8.
Grok computed $\mathrm{Var}(G)$ vs $\mathrm{Var}(G_{\mathrm{spf}})$ across all $k$ (§6), the kurtosis data confirming Gaussian character, the 17-point $k$-scan and 7-point $N$-scan of $\mathrm{Var}(d_\rho)$ (§2.2), and the $\mathrm{Var}(A)$ $N$-stability scan that confirmed local smoothness is independent of shell drift.
Claude proposed the initial $\mathrm{Var}(d_\rho) = O(1)$ conjecture (refuted), identified the tautological nature of the $S - M$ and $j - \Delta$ decompositions (§4, eliminating pure-DP routes), and formulated the Local Lipschitz target (§5.1).
A.2. Role of the Thermodynamic Interface
A parallel thread on the thermodynamic interpretation of ZFCρ contributed to M8 in three specific ways:
- (i) Priority setting. The thermodynamic analysis identified $\sigma(G) = O(1)$ as the single most leveraged target ("one stone, three birds": A-concentration + Prop C input + fluctuation-dissipation bridge), directly determining M8's attack choice over alternatives (scale monotonicity, I-a).
- (ii) Fluctuation-dissipation framing. The thermodynamic interpretation of the anti-correlation engine as a discrete fluctuation-dissipation relation — where the DP min operation plays the role of a dynamical constraint linking fluctuations ($\Delta f$) to dissipation ($\Delta r$) — provided the conceptual framework that guided the search for the mechanism behind $\mathrm{Var}(A) = O(1)$.
- (iii) Direction correction. The thermodynamic perspective's emphasis on local dynamics ("the system's constraint limits each step, not the global position") directly suggested the shift from shell concentration to local Lipschitz as the proof target — a suggestion subsequently confirmed by the $N$-scaling data.
These cross-domain contributions illustrate how physical intuition can guide the formulation (though not the proof) of mathematical conjectures.
References
- H. Qin, "Monotonicity, roughness stability, and the narrowing of the relay architecture" (Paper 17), Zenodo, DOI: 10.5281/zenodo.19016958.
- H. Qin, "The insertion identity, variance splitting, and the relay engine of Conjecture H'" (Paper 16), Zenodo, DOI: 10.5281/zenodo.19013602.
- H. Qin, "Zero-inflated lattice normal model" (Paper 13), Zenodo, DOI: 10.5281/zenodo.18991986.
- I. Kátai and M. V. Subbarao, "Some remarks on a paper of Ramachandra," Lithuanian Math. J. 48 (2008), 170–183.
- É. Goudout, "Lois de répartition des diviseurs," doctoral thesis, 2018–2021.
- O. Mangerel, "Additive functions in short intervals, gaps and a conjecture of Erdős," Int. Math. Res. Not. (2022).
- O. Mangerel, "On the bivariate Erdős-Kac theorem," Proc. London Math. Soc., 2021.
- G. Tenenbaum, Introduction to Analytic and Probabilistic Number Theory, 3rd ed., AMS, 2015.
本文报告 M8 探索阶段关于整数复杂度函数 $\rho_E$ 方差结构的决定性结果。缺陷量 $d_\rho(n) = \rho_E(n) - c^* \ln n$($c^*$ 为工作归一化常数)在壳上的方差 $\mathrm{Var}(d_\rho \mid \Omega = k, n \leq N)$ 与 $a(k) + b(k) \cdot \ln\ln N$ 增长一致,其中 $b(k) > 0$ 且在测试范围内随 $k$ 显著增大——并非此前猜测的 $O(1)$。尽管如此,一阶差分方差 $\mathrm{Var}(\rho_E(n-1) - \rho_E(n) \mid \Omega = k)$ 在所有测试 $k$($2$ 至 $18$)和 $N$($10^4$ 至 $10^7$)上均处于 $[0.3, 1.35]$,关于 $N$ 无增长趋势,支持 $\sigma(G) = O(1)$。机制是分解 $\rho_E = f + r$ 中加法部分 $f$ 与组合余项 $r$ 之间的强制反相关:$\mathrm{Var}(\Delta f) \approx 10$,$\mathrm{Var}(\Delta r) \approx 7$,但 $\mathrm{Cov}(\Delta f, \Delta r) \approx -7.5$ 将 $\mathrm{Var}(A)$ 压缩至 $\approx 1$。三条证明路线被排除:对 $\Delta f$ 的 Turán-Kubilius(方差 $\approx 10$,非 $O(1)$),$f$ 和 $r$ 的分开处理(各自发散),以及 $d_\rho$ 的壳集中(随 $N$ 增长)。$\sigma(G) = O(1)$ 的最有杠杆剩余目标是局部 Lipschitz 性质:连续缺陷量之差为 $O(1)$,尽管壳级漂移为 $O(\sqrt{k \cdot \ln\ln N})$,类比于平稳增量过程。结合其余接力输入——B-bound(数值),scale term 单调性(数值),固定 $k$ 的 I-a(开放),Lemma II(数值良性)——将完成 $D(N) \to 1$ 的引擎。
关键词:整数复杂度,ρ-算术,反相关,方差分解,局部光滑性,缺陷量,DP 递推,布朗类比
§1. 引言
§1.1. 背景
论文 17(DOI: 10.5281/zenodo.19016958)将 $D(N) \to 1$ 的证明归约为四个主要接力输入加 Lemma II。最有杠杆的输入是 $\sigma(G \mid \Omega = k) = O(1)$。本文报告 M8 探索结果,解析 $\rho_E$ 在 $\Omega$-壳上的方差结构,识别三条死路,并确定最有杠杆的剩余目标。
§1.2. 记号
同论文 17 §1.2。额外:$c^* = 1/\ln 2 \approx 1.4427$(工作归一化常数;非已证渐近常数),$d_\rho(n) = \rho_E(n) - c^* \ln n$,$f(n) = \sum_{q^a \| n} \rho_E(q^a)$(加法部分),$r(n) = \rho_E(n) - f(n)$(组合余项),$\Delta f(n) = f(n-1) - f(n)$,$\Delta r(n) = r(n-1) - r(n)$。
§1.3. 主要结果
数值发现 1(壳缺陷量漂移)。$\mathrm{Var}(d_\rho \mid \Omega = k, n \leq N)$ 拟合 $a(k) + b(k) \cdot \ln\ln N$,$b(k) > 0$:
| $k$ | $b$ |
|---|---|
| 4 | 1.26 |
| 6 | 2.58 |
| 8 | 4.41 |
| 10 | 7.53 |
$b(k)$ 在测试范围内随 $k$ 显著增大,快于线性。精确渐近形式为开放问题。
数值发现 2(局部光滑性)。$\mathrm{Var}(A \mid \Omega = k)$ 在 $[0.3, 1.35]$($k \geq 5$ 时 $[0.8, 1.35]$),跨 $k = 2$ 至 $18$ 和 $N = 10^4$ 至 $10^7$,关于 $N$ 无增长。
数值发现 3(反相关引擎)。$A = \Delta f + \Delta r$ 的分解:
| $k$ | $\mathrm{Var}(\Delta f)$ | $\mathrm{Var}(\Delta r)$ | $\mathrm{Cov}(\Delta f, \Delta r)$ | $\mathrm{Var}(A)$ |
|---|---|---|---|---|
| 4 | 6.96 | 5.68 | −5.96 | 0.72 |
| 8 | 9.90 | 6.96 | −7.89 | 1.09 |
| 12 | 9.96 | 6.30 | −7.48 | 1.30 |
$\Delta f$ 和 $\Delta r$ 均非 $O(1)$;强制反相关将 $\mathrm{Var}(A)$ 压缩至 $O(1)$。
数值发现 4($G_{\mathrm{spf}}$ 的方差解剖)。$\mathrm{Var}(A)$:0.31 → 1.33,有界。$\mathrm{Var}(B)$:1.02 → 0.25,收敛于 $1/4$。$\mathrm{Cov}(A,B)$:小负值,$k \geq 6$ 时 $\approx -0.10$。
数值发现 5(Gaussian 特征)。$\mathrm{Var}(G) \leq 1.45$。峰度 $\in [2.62, 2.92]$(Gaussian 值:3)。偏度趋零。跨五窗口稳定。
排除路线:(E1) 对 $\Delta f$ 的 TK:$\mathrm{Var}(\Delta f) \approx 10$,非 $O(1)$,无现成平移壳 TK 定理。(E2) $f/r$ 分开:各自发散,$O(1)$ 方差完全来自交叉对消。(E3) 壳集中:$\mathrm{Var}(d_\rho)$ 随 $N$ 增长。
正确证明目标:局部 Lipschitz 性质 $\mathrm{Var}(\rho_E(n-1) - \rho_E(n) \mid \Omega = k) = O(1)$。
§2. 壳缺陷量漂移
§2.1. 缺陷量及其自然标度
$d_\rho(n) = \rho_E(n) - c^* \ln n$,$c^* = 1/\ln 2$ 为工作归一化常数(论文 11 确立 $\rho_E(n) = \Theta(\ln n)$ 但未证精确渐近常数)。在 $\Omega = k$ 壳上,问题是 $\mathrm{Var}(d_\rho)$ 是 $O(1)$、$O(k)$ 还是 $O(\ln\ln N)$。
§2.2. 数据
$k$-方向($N = 10^7$):$\mathrm{Var}(d_\rho)$ 从 11.6($k=2$)单调递减至 6.6($k=18$),17 个 $k$ 值零例外。
$N$-方向($k$ 固定):7 点扫描 $N = 10^4$ 到 $10^7$,所有 $k$ 均正增长。拟合 $a(k) + b(k) \cdot \ln\ln N$,$b(k)$ 从 $k=4$ 的 1.26 增至 $k=10$ 的 7.53。
§2.3. 表观矛盾的解析
$k$-方向递减因高 $k$ 壳更受约束(更多素因子 → 更紧的因子分解控制 → $d_\rho$ 偏离空间更小)。$N$-方向递增因更大 $N$ 引入更大素因子,其缺陷量行为范围更宽。$b(k)$ 快于线性增长,中心壳层 $k \sim \ln\ln N$ 上 $\mathrm{Var}(d_\rho)$ 预期至少 $\Theta((\ln\ln N)^2)$,精确标度开放。
§2.4. 壳集中为何失败
$\mathrm{Var}(f(n) \mid \Omega = k) \approx 16$——已经很大。$r = \rho_E - f$ 与 $f$ 反相关(DP min 强制),将 $\mathrm{Var}(\rho_E)$ 压低,但压缩不足至 $O(1)$;仅从 $\sim 16$ 降至 $\sim 10$。
§3. 反相关引擎
§3.1. 结构起源
$r = \rho_E - f$ 为定义,故 $\mathrm{Cov}(f, r) = \mathrm{Cov}(f, \rho_E) - \mathrm{Var}(f)$。若 $\rho_E$ 不受约束则 $\mathrm{Cov}(f, r) = 0$。负协方差源于 DP min 截断:$f(n)$ 大时 $\rho_E(n)$ 无法跟随上升,$r$ 变负;$f(n)$ 小时 $r$ 变正。
§3.2. 差分的同一机制
$A = \Delta f + \Delta r$,$\Delta r = A - \Delta f$。$\mathrm{Cov}(\Delta f, \Delta r) = \mathrm{Cov}(\Delta f, A) - \mathrm{Var}(\Delta f) \approx -\mathrm{Var}(\Delta f)$,解释了近乎完美的对消。
§3.3. 随机游走类比
$d_\rho$ 在壳上似乎总方差以 $b(k) \cdot \ln\ln N$ 增长,增量方差保持 $O(1)$,增量近似 Gaussian(峰度 $\approx 3$)。类似于平稳增量过程。类比为启发式;证明 $\sigma(G) = O(1)$ 需形式化平稳增量性质。
§4. 排除的证明路线
§4.1. 路线 E1:对 $\Delta f$ 的 Turán-Kubilius
$\mathrm{Var}(\Delta f) \approx 10$ 且随 $N$ 增长。Kátai-Subbarao 条件 TK 给出 $\mathrm{Var}(f(n) \mid \Omega = k) = O_k(1)$,但 $f(n-1)$ 不受约束,$\mathrm{Var}(f(n-1)) = O(\ln\ln N)$(标准 TK)。$n$ 和 $n-1$ 近似独立(Elliott 型结果支持)使 $\mathrm{Var}(\Delta f) \approx O(\ln\ln N)$。据作者所知无现成平移壳 TK 定理适用;参见 Mangerel [6]。
§4.2. 路线 E2:$f$ 和 $r$ 分开
两者均 $\gg 1$。$A$ 的 $O(1)$ 方差完全来自交叉对消,非各自有界。
§4.3. 路线 E3:壳集中
7 点 $N$-扫描确认 $\mathrm{Var}(d_\rho)$ 增长,$b(k)$ 随 $k$ 增大。
§5. 最有杠杆的证明目标:局部 Lipschitz 性质
§5.1. 陈述
数值目标(局部 Lipschitz)。对每个固定 $k$,存在 $C_k > 0$ 使得
$$\mathrm{Var}(\rho_E(n-1) - \rho_E(n) \mid \Omega(n) = k, n \leq N) \leq C_k$$
关于 $N$ 一致。
数值证据:$\mathrm{Var}(A) \in [0.3, 1.35]$,7 点 $N$-扫描无增长。理论证明开放。
§5.2. 为何此为最有杠杆路线
$\mathrm{Var}(A) \leq \mathrm{Var}(G)$(命题 J + Lipschitz)。$\mathrm{Var}(G_{\mathrm{spf}}) = \mathrm{Var}(A) + \mathrm{Var}(B) + 2\mathrm{Cov}(A,B) = O_k(1)$(在局部 Lipschitz + B-bound 下)。$\mathrm{Var}(G)$ 和 $\mathrm{Var}(G_{\mathrm{spf}})$ 在测试范围内接近($k \geq 5$ 差异 $\pm 0.1$),无方差爆炸证据。故 $\mathrm{Var}(G) = O_k(1)$。
§5.3. 与 DP 递推的联系
$\rho_E(n) = \min(\rho_E(n-1) + 1, M_n)$,$A(n) = j(n) - 1$(命题 J)。DP 保证 $A \geq -1$。$A$ 的上尾由 $M_n$ 对 $S_n$ 的下冲控制。两者近乎对消至 $O(1)$ 即局部 Lipschitz。
§5.4. 可能的证明方向
- 方向 1:DP 诱导平滑。min 递推产生反馈环:$S_{n+1} = \min(S_n, M_n) + 1$。大跳跃后后继重置为 $M_n + 1$,减少记忆。
- 方向 2:定量光滑性。$|\rho_E(n) - c^* \ln n|$ 在测试范围内亚线性。归约为约束 $|A(n)|$,具有循环性。
- 方向 3:条件二阶矩法。利用插入恒等式直接证 $E[A^2 \mid \Omega = k] = O(1)$。
§6. 完整方差解剖
§6.1. 跨所有 $k$ 的 $\mathrm{Var}(A)$ 和 $\mathrm{Var}(B)$
| $k$ | $\mathrm{Var}(A)$ | $\mathrm{Var}(B)$ | $\mathrm{Cov}(A,B)$ | $\mathrm{Var}(G_{\mathrm{spf}})$ |
|---|---|---|---|---|
| 2 | 0.31 | 1.02 | +0.22 | 1.77 |
| 3 | 0.56 | 0.51 | +0.08 | 1.24 |
| 4 | 0.72 | 0.33 | −0.02 | 1.02 |
| 5 | 0.83 | 0.29 | −0.07 | 0.99 |
| 6 | 0.92 | 0.27 | −0.09 | 1.02 |
| 8 | 1.09 | 0.26 | −0.10 | 1.15 |
| 10 | 1.21 | 0.25 | −0.10 | 1.26 |
| 12 | 1.30 | 0.25 | −0.11 | 1.33 |
| 15 | 1.31 | 0.25 | −0.14 | 1.27 |
| 18 | 1.33 | 0.25 | −0.06 | 1.47 |
$\mathrm{Var}(B) \to 1/4$:与高 $k$ 时 $B$ 值集中于 $\{0, -1\}$ 的 Bernoulli 型分布一致;结构常数可能可证。$\mathrm{Var}(A)$ 饱和于 $\approx 1.3$。$\mathrm{Cov}(A, B)$ 小负值,弱耦合。
§6.2. $G$ vs $G_{\mathrm{spf}}$
| $k$ | $\mathrm{Var}(G)$ | $P(G = G_{\mathrm{spf}})$ |
|---|---|---|
| 2 | 0.35 | 54% |
| 5 | 0.92 | 36% |
| 8 | 1.23 | 17% |
| 12 | 1.40 | 12% |
| 18 | 1.30 | 13% |
低 $k$ 时 $\mathrm{Var}(G) < \mathrm{Var}(G_{\mathrm{spf}})$(额外分裂降低方差)。高 $k$ 时两者接近,$\mathrm{Var}(G)$ 略超至多 $0.1$。
§7. 修正后的证明全景图
§7.1. M8 确立的发现
(数值,测试范围 $N \leq 10^7$,$k \leq 18$):壳缺陷量漂移,局部光滑性($\mathrm{Var}(A) \in [0.3, 1.35]$ 稳定),反相关引擎($\mathrm{Cov}(\Delta f, \Delta r) \approx -\mathrm{Var}(\Delta f)$),Gaussian 特征,方差解剖($\mathrm{Var}(B) \to 1/4$,$\mathrm{Var}(A)$ 饱和于 1.3)。排除路线 E1/E2/E3。
§7.2. 更新的开放输入
| 输入 | 状态 | M8 影响 |
|---|---|---|
| $\sigma(G) = O(1)$ | 最有杠杆目标:局部 Lipschitz(理论证明开放) | 壳集中排除,目标锐化 |
| Scale term 单调性 | 数值(18 点 + 7 点 $N$-扫描) | 不变 |
| B-bound | 数值假设 | $\mathrm{Var}(B) \to 1/4$ 可能可证 |
| 固定 $k$ 的 I-a | 开放 | 不变 |
| Lemma II | 数值良性 | $K_p > 0$($k \geq 6$) |
§7.3. 证明依赖图
$$\text{局部 Lipschitz (N)} \longrightarrow \mathrm{Var}(A) = O(1) \xrightarrow{\text{+ B-bound (N)}} \mathrm{Var}(G_{\mathrm{spf}}) = O_k(1)$$
$$\mathrm{Var}(G_{\mathrm{spf}}) + \text{命题 C (T)} + \text{Scale 单调性 (N)} \longrightarrow \text{I-b}$$
$$\text{I-b} + \text{I-a} \longrightarrow \text{Lemma I}$$
$$\text{Lemma I} + \text{Lemma II (N)} + \text{B-bound (N)} \longrightarrow \text{Thm 5}$$
$$\text{Thm 5} + \text{SPF-正性收敛 (N)} + \text{Sathe-Selberg (T)} \longrightarrow D(N) \to 1$$
方法论注:四 AI 并行探索与热力学接口
本文所有数学声明、引用和计算均经独立检查,由作者负责。AI 系统用于文献检索、探索性计算和起草支持。
A.1. 四 AI 对 M8 的贡献
ChatGPT 进行了决定性文献搜索(Kátai-Subbarao 条件 TK [4],Goudout 平移 Erdős-Kac [5],Mangerel 间隙理论 [6]),预测 $\mathrm{Var}(\Delta f) = O(\ln\ln N)$ 被确认,设计了 $k$-vs-$N$ 标度测试(§2.2),解决了 $\mathrm{Var}(d_\rho)$ 的 $O(1)$ vs $O(\ln\ln N)$ 问题。
Gemini 计算了 $f + r$ 方差分解(§3),揭示反相关引擎——M8 最重要的单一计算。
Grok 计算了 $\mathrm{Var}(G)$ vs $\mathrm{Var}(G_{\mathrm{spf}})$(§6),峰度数据,17 点 $k$-扫描和 7 点 $N$-扫描,以及 $\mathrm{Var}(A)$ $N$-稳定性扫描。
Claude 提出初始 $\mathrm{Var}(d_\rho) = O(1)$ 猜想(被推翻),识别恒等式分解的恒等性质(排除纯 DP 路线),提出局部 Lipschitz 目标(§5.1)。
A.2. 热力学接口的角色
- (i) 优先级设定。热力学分析将 $\sigma(G) = O(1)$ 识别为最有杠杆目标("一石三鸟"),决定了 M8 的攻击选择。
- (ii) 涨落-耗散框架。将反相关引擎解读为离散涨落-耗散关系,DP min 操作起动力学约束的作用,提供了概念框架。
- (iii) 方向修正。"系统约束限制每步而非全局位置"的热力学直觉建议了从壳集中到局部 Lipschitz 的转向,随后被 $N$-标度数据确认。
这些跨领域贡献说明物理直觉如何引导数学猜想的提出(但非其证明)。
参考文献
- H. Qin,"Monotonicity, roughness stability, and the narrowing of the relay architecture"(论文 17),Zenodo,DOI: 10.5281/zenodo.19016958。
- H. Qin,"The insertion identity, variance splitting, and the relay engine of Conjecture H'"(论文 16),Zenodo,DOI: 10.5281/zenodo.19013602。
- H. Qin,"Zero-inflated lattice normal model"(论文 13),Zenodo,DOI: 10.5281/zenodo.18991986。
- I. Kátai and M. V. Subbarao,"Some remarks on a paper of Ramachandra," Lithuanian Math. J. 48 (2008), 170–183。
- É. Goudout,"Lois de répartition des diviseurs," doctoral thesis, 2018–2021。
- O. Mangerel,"Additive functions in short intervals, gaps and a conjecture of Erdős," Int. Math. Res. Not. (2022)。
- O. Mangerel,"On the bivariate Erdős-Kac theorem," Proc. London Math. Soc., 2021。
- G. Tenenbaum,Introduction to Analytic and Probabilistic Number Theory,3rd ed., AMS, 2015。