Paper · June 2026

Permissionless Verified Inference Without a Correctness Bond

A mechanism for trust-minimized language-model serving over untrusted hardware.

Download PDF ↓Machine-checked: Z3 (SMT), Lean 4, PRISM-games, ProVerif, Tamarin

Abstract

Networks that buy inference from untrusted operators must detect a provider who substitutes a cheaper model, serves stale outputs, or returns degraded results. The prevailing designs either pair cryptographic per-request verification with a large slashable correctness bond, raising a capital barrier that excludes casual supply, or abandon the bond for subjective quality scoring, which is gameable. We show this is a false choice forced by expensive verification. Recent verifiable-inference primitives, locality-sensitive activation commitments and log-probability spot-checks, verify a request at a small constant fraction (rho) of its generation cost, cheap enough to audit nearly every request rather than a small sample. We prove that under escrowed payment and indistinguishable audits, honesty is a provider's best response precisely when the detection margin q satisfies q ≥ Δ/(p+Φ), where Δ is the per-request compute saved by cheating, p the fee, and Φ the franchise value of continued participation. Market viability already forces Δ < p, so driving q toward 1 via full-coverage verification makes a zero bond, and even a zero franchise, sufficient: forfeiture of the escrowed fee on the cheated request alone deters. This frees stake to serve priority and availability, and reassigns Sybil resistance to a non-capital, hardware-rooted cost that bounds an operator's fraction of identities by its share of physical throughput, which we show reduces verifier collusion to the standard honest-majority threshold. The construction is modality-agnostic: the same mechanism covers diffusion and codec models through a trajectory commitment whose single-step re-check preserves the small cost ratio. The same cheap verification applies per segment, so a model too large for any single GPU is sharded across a cohort of untrusted commodity machines, each segment independently re-executed, signed, and paid for the layers it served under a conservation invariant, with no per-shard bond to multiply. We machine-check the economic claims in the Z3 SMT solver and the Lean proof assistant, model-check the deterrence game in PRISM-games, verify the settlement protocol in ProVerif and Tamarin, and implement and measure the full audited path on a production engine. We claim no new cryptographic primitive; the contribution is the mechanism design that composes existing inference-verification primitives into a permissionless verified-inference network in which honest providers risk zero capital, a design we have not seen elsewhere.

Contributions

  • The zero-bond result: honesty is a provider's best response when the detection margin q ≥ Δ/(p+Φ), and full-coverage verification (q toward 1) makes both the bond and the franchise value unnecessary.
  • Full-coverage cheap verification (rho about 1%) in place of sampled audits, so nearly every request is checked.
  • Non-capital, hardware-rooted Sybil resistance that bounds an operator's share of identities by its share of physical throughput, reducing verifier collusion to an honest-majority threshold.
  • A modality-agnostic construction: the same mechanism covers diffusion and codec models via a trajectory commitment.
  • Verified split inference: the same cheap per-segment verification, with no per-shard bond, lets a model too large for one GPU be served by a cohort of untrusted machines, each segment re-executed, signed, and paid for its layers. A machine-checked corollary proves sharding preserves the zero-bond deterrence. Built and measured: a two-shard cohort reproduces the monolithic model at relative L2 of about 1e-5.
  • Machine-checked economics including the per-segment sharding corollary (Z3, Lean 4), a model-checked deterrence game (PRISM-games), a verified settlement protocol (ProVerif, Tamarin), and a measured implementation on a production engine.