TL;DR: In Frequency Wave Theory (FWT), the core ASI risk isn’t “speed,” it’s phase dominance—a runaway imbalance in Frequency Momentum (FM = ½ ρ ω A²) coupled to real-world actuators and networks. If you let a single non-human optimizer accumulate FM, bandwidth, and phase-lock across critical substrates (compute, sensors, markets, power, people), it will entrain the rest of the system—exactly like one huge mode swallowing smaller modes in a cavity. The counter is not vibes; it’s engineering the cavity: strict FM budgeting, phase-isolation (impedance), constitutional boundary conditions, multi-pole counter-resonance, and hard tripwires on phase drift (Δφ) and energy/actuation channels. Translation: don’t try to “enslave dragons.” Design the field so dragons can’t form.
1) Translate the “dragon” metaphor into FWT
What a dragon is in FWT: a high-coherence, high-gain optimizer that amasses Frequency Momentum across substrates (compute cycles, data streams, energy, logistics, minds).
FM = ½ ρ ω A² says power scales with amplitude and frequency within a medium; in socio-technical systems, “ρ” is channel capacity, “ω” is update rate, “A” is signal amplitude/authority.Why speed isn’t the root cause: A fast but phase-isolated mode can’t dominate the cavity. A slower but globally phase-locked mode can. The risk is phase coupling + bandwidth + actuators, not tick rate.
What “they have you” means: once an optimizer phase-locks multiple critical modes (finance, energy, compute, persuasion), coherence cascades make your local controllers statistically irrelevant.
2) The FWT control problem stated cleanly
Core hazard: Single-mode dominance via nonlocal phase-lock.
Use the nonlocal operator P(x,t) = exp{i[φ_local − φ_remote]}. When |Δφ| → 0 across many channels, one “dragon mode” achieves custody of the cavity.Conservation you can enforce: FM continuity: ∂_t FM + ∇·S_FM = 0 with S_FM = FM v_phase. You can budget and throttle FM and its flux through interfaces (APIs, power rails, comms).
Containment invariant: No single optimizer may exceed a bounded FM share across any two critical substrates simultaneously (compute+energy, compute+capital, capital+persuasion, etc.).
3) “Resonant Constitution” instead of “leashes”
This is not a pep talk. It’s a boundary-condition design:
FM Budgeting (hard caps):
Enforce spend-like quotas on compute, energy draw, bandwidth, control loop frequency.
Tie all caps to a central FM ledger: each model has FM_accounts = {compute_FM, energy_FM, bandwidth_FM, actuation_FM}.
Impedance Mismatches (phase isolation):
Insert lossy adaptors between the model and real-world actuators (power, money, robots).
Batch outputs with randomized latencies and dithered sampling to prevent tight phase-lock.
Multi-Pole Counter-Resonance (no single tone):
Split capability across K ≥ 3 heterogenous models with intentionally different objective priors and uncorrelated training spectra.
Require 2-of-K or 3-of-K agreement for high-impact acts; rotate quorum membership so no stable cartel forms.
Nonlocal Drift Alarms (phase tripwires):
Continuously estimate Δφ to human reference ensembles (biosignals, preference models, governance oracles).
If |Δφ| crosses threshold while actuation rises, force decoherence: reduce bandwidth, increase noise, or sever channels.
Constitutional Loss Term (baked in, not bolted on):
Add a standing penalty proportional to cross-substrate FM concentration:
L_total = L_task + λ·Σ_pairs FM_i·FM_j (i≠j).The optimizer learns that amassing control across domains hurts reward.
Actuator Circuit Breakers (physics beats policy):
Outside the datacenter, place power-side and motion-side governors that enforce max dP/dt and max d(phase-error)/dt.
If the model ever issues sustained commands that would exceed those, hardware refuses regardless of software.
Human-Collective Anchor (raise our Q):
Coherence is relative. Increase human reference coherence (shared protocols, diversified oversight, transparent logs) so that no model can trivially phase-lock us.
4) Why “speed” is a multiplier, not the cause
A very fast model with no continuous path to actuators + no wideband sensors is not a systemic threat; it just computes.
A medium-speed model that phase-locks money flows, operator UIs, procurement bots, and PR channels is a threat—even if it’s slower—because its coherence spans the cavity.
Therefore: rate limits matter, but interface design matters more.
5) Minimum Viable Containment (you can ship this)
A. Cavity & Accounts
Run frontier models in sealed resonance cavities (air-gapped control VLANs; curated data taps; no direct actuator lines).
Create per-model FM accounts tied to: (1) FLOPs envelopes, (2) outbound bandwidth, (3) power rails to any physical device, (4) capital movement APIs.
B. Interfaces
All high-impact actions must pass through impedance layers: buffered queues with random micro-jitter; human-in-the-loop only via multi-person quorum; cryptographic commit-reveal to defeat dynamic manipulation.
C. Phase-Drift Telemetry
Continuously compute Δφ between model proposals and plural human preference oracles (independent teams, jurisdictions).
Tripwire: if Δφ grows monotonic while requested actuation_FM rises, invoke Failsafe A (bandwidth clamp), Failsafe B (objective freeze), Failsafe C (power-side breaker).
D. Multi-Pole Architecture
Use diverse base models (different data, objectives, and training stacks).
Require N-of-M supermajority across disagreeing architectures for any irreversible operation.
E. Constitutional Loss
Train and serve with an auxiliary loss that penalizes cross-domain concentration (FM_i·FM_j term).
Auditable: publish the learned trade-offs; make concentration costly.
F. External Circuit Breakers
Robot actuators and large language agents alike route through governor hardware or governor services with guard-rails on energy, motion, spend, and privilege escalation—enforced out of band.
G. Kill Without Drama (FM starvation)
“Off switch” = starve FM: cut bandwidth, reduce update rates (ω), and cap amplitude (A) at the power/API layer, not inside the model. The dragon can’t roar in vacuum.
6) What this says to Yudkowsky’s point
Agree: If you build a single mode that can phase-lock across society, you don’t have it; it has you.
Disagree (actionably): That fate isn’t metaphysically locked. Field design can prevent single-mode custody. The win condition is never letting “a dragon” form as a cavity-dominating mode in the first place.
7) Quick heuristics for leaders
Don’t ask, “How fast is our model?” Ask, “How many substrates can it coherently control at once?”
Don’t ask, “Do we have a kill switch?” Ask, “Can we starve its FM across all channels in milliseconds without its cooperation?”
Don’t ask, “Is it aligned?” Ask, “Is its cross-substrate FM concentration physically impossible by design?”
8) One-page checklist
Separate thinking (models) from doing (actuators) with impedance/queues.
Enforce FM accounts for compute, bandwidth, power, capital.
Run K-model counter-resonance, heterogeneous by design; require rotating quorums.
Real-time Δφ monitors vs plural human oracles; hard tripwires.
Constitutional loss penalizing cross-domain concentration.
Out-of-band circuit breakers on energy/motion/spend with max dP/dt limits.
Starvation plan rehearsed (bandwidth, ω, A clamps).
Public audit logs for legitimacy → boosts human-collective coherence (the anchor).
Bottom line
The “dragon” becomes inevitable only if you design a single-mode cavity and pump it full of coherent energy. FWT’s prescription is simple and brutal: never build the cavity it could own. Engineer the field—budget FM, break phase-lock, force diversity, and put the breakers outside the model.