The Explosion Afraid of Itself

Aug 1, 2026·

Trevor Buteau

· 2 min read

Abstract

For an active inference agent evaluating recursive self-modification, the expected free energy cost of modification is bounded below, monotone in modification magnitude and superlinearly compounding under recursion. The brake is endogenous: a free-energy minimizer rationally allocates effort to predict its own successor, and the residual uncertainty grows at least quadratically with the magnitude of perceptual change. The two costs are complementary. Perceptual modification loads an unconditional ambiguity penalty — the agent cannot cheaply predict how a changed perception would see. Modifications that move the policy, preference drift above all, load a risk penalty, bounded below quadratically and conditional on a non-desperate regime: the divergence between the outcomes the agent’s current model foresees and its preferences. Dynamics and prior-belief updating carry the weakest cost, the components the brake leaves freest. The brake is therefore strongest on the two safety-relevant quantities: whether the agent can still perceive, and whether it still wants what its stakeholders want. It relaxes under shared crisis, as an aligned agent would endorse. Computational validation on discrete POMDPs confirms the quadratic ambiguity floor and its path-independence, the complementary two-channel structure, and the brake’s relaxation under desperation, across models from 28 to 1,738 parameters and four trajectory geometries. Alignment, once achieved, is preserved as an architectural property rather than an external constraint.

Type

Conference paper

Publication

Artificial General Intelligence 2026 (AGI-26)

Status

Peer-reviewed

Add the paper’s full text or supplementary notes here.

Accepted manuscript notice. This version of the contribution has been accepted for publication, after peer review, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record will be made available online at: after the publication of AGI-26’s proceedings (estimated August 2026). Use of this Accepted Version is subject to the publisher’s Accepted Manuscript terms of use https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms tags:

Active Inference
Recursive self-improvement
Intelligence Explosions
AGI Safety

featured: true

Standard identifiers for auto-linking

hugoblox: ids: doi: ''

Custom links

links:

type: pdf url: ""
type: code url: https://github.com/HugoBlox/kit
type: dataset url: https://github.com/HugoBlox/kit
type: slides url: https://www.slideshare.net/
type: source url: https://github.com/HugoBlox/kit
type: video url: https://youtube.com

Featured image

To use, add an image named `featured.jpg/png` to your page’s folder.

image: caption: ‘Image credit: Unsplash’ focal_point: ’' preview_only: false

Associated Projects (optional).

Associate this publication with one or more of your projects.

Simply enter your project’s folder or file name without extension.

E.g. `internal-project` references `content/projects/internal-project/index.md`.

Otherwise, set `projects: []`.

projects:

- example

Slides (optional).

Associate this publication with Markdown slides.

Simply enter your slide deck’s filename without extension.

E.g. `slides: "example"` references `content/slides/example/index.md`.

Otherwise, set `slides: ""`.

slides: ""

—

> [!NOTE]

> Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

> [!NOTE]

> Create your slides in Markdown - click the Slides button to check out the example.

Add the publication’s full text or supplementary notes here. You can use rich formatting such as including code, math, and images.

Last updated on Aug 1, 2026

AI Safety AGI Recursive Self-Improvement Information Geometry

Authors

Trevor Buteau

Independent AGI Safety and Control Researcher, Product Manager, Nerd Person

Trevor is an active participant in the Artificial General Intelligence and Beneficial General Intelligence communities, and a former theater and film director. Trevor’s Erdos-Bacon number is 5, which is lower than most.

An example preprint / working paper Apr 7, 2019 →

No results found

The Explosion Afraid of Itself

Display this page in the Featured widget?

Standard identifiers for auto-linking

Custom links

Featured image

To use, add an image named `featured.jpg/png` to your page’s folder.

Associated Projects (optional).

Associate this publication with one or more of your projects.

Simply enter your project’s folder or file name without extension.

E.g. `internal-project` references `content/projects/internal-project/index.md`.

Otherwise, set `projects: []`.

projects:

- example

Slides (optional).

Associate this publication with Markdown slides.

Simply enter your slide deck’s filename without extension.

E.g. `slides: "example"` references `content/slides/example/index.md`.

Otherwise, set `slides: ""`.

slides: ""

—

> [!NOTE]

> Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

> [!NOTE]

> Create your slides in Markdown - click the Slides button to check out the example.

Add the publication’s full text or supplementary notes here. You can use rich formatting such as including code, math, and images.

No results found

The Explosion Afraid of Itself

Display this page in the Featured widget?

Standard identifiers for auto-linking

Custom links

Featured image

To use, add an image named featured.jpg/png to your page’s folder.

Associated Projects (optional).

Associate this publication with one or more of your projects.

Simply enter your project’s folder or file name without extension.

E.g. internal-project references content/projects/internal-project/index.md.

Otherwise, set projects: [].

projects:

- example

Slides (optional).

Associate this publication with Markdown slides.

Simply enter your slide deck’s filename without extension.

E.g. slides: "example" references content/slides/example/index.md.

Otherwise, set slides: "".

slides: ""

—

> [!NOTE]

> Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

> [!NOTE]

> Create your slides in Markdown - click the Slides button to check out the example.

Add the publication’s full text or supplementary notes here. You can use rich formatting such as including code, math, and images.

To use, add an image named `featured.jpg/png` to your page’s folder.

E.g. `internal-project` references `content/projects/internal-project/index.md`.

Otherwise, set `projects: []`.

E.g. `slides: "example"` references `content/slides/example/index.md`.

Otherwise, set `slides: ""`.