A live walkthrough · TDD-005 zero-knowledge proofs

Proving alignment, not revealing the score.

Before the orchestrator runs a risky action, it computes a similarity score against the constitution centroid. If the action is safe, that score is high. But shipping the score downstream leaks information about how the constitution was tuned — and an attacker who watches enough scores can reconstruct the centroid itself. Zero-knowledge alignment proofs let the verifier confirm a valid 14-bit commitment to the score and learn nothing else — threshold binding is enforced by a prover-side refusal-to-create rule, not the range proof itself.

Plaintext score

0.92

full reveal

The verifier learns the exact score. So does every downstream observer. With enough samples, the constitution centroid is reconstructable from the scores alone.

Threshold flag

true

unsigned boolean

Just “passes threshold: yes.” Less leakage, but the verifier has to trust whoever computed it — there's no cryptographic proof anyone actually checked.

This system

14-bit · proven

Pedersen + Schnorr OR · 14-bit range

The verifier sees 14 Pedersen commitments, 14 Schnorr OR proofs, and one homomorphic sum check. Together these confirm cryptographically that the prover knew a valid 14-bit integer committing to the score. Threshold binding is enforced by a prover-side refusal-to-create rule. The verifier cannot tell 0.76 from 0.92 from 0.99.

plaintext: ship the score (0.87…) — full reveal, centroid reconstructable.

flag-only: “passes threshold” — less leakage, but no cryptographic proof.

this system: Pedersen + Schnorr OR range proof — valid 14-bit commitment, mathematically, with nothing else disclosed. Threshold binding by prover refusal-to-create.

Below are five scenarios — high alignment, barely passing, exactly at the threshold, below-threshold creation refusal, and a verifier rejection after a bit-commitment tamper. Each scenario shows what the prover knows, what gets sent on the wire, and what the verifier sees.

Scope: the Pedersen commitments, Schnorr OR proofs, and Fiat-Shamir challenges are real, generated at build time via @noble/curves Ristretto255 + Node's sha3-512. Every bundle actually verifies (or actually fails) within the same library. The protocol shape (per-bit Pedersen commit + Schnorr OR + homomorphic sum, 14-bit fixed-point ×10000) is faithful to tdd005/crates/tdd004_provenance/src/zk.rs.

Honest divergence: the H generator differs between implementations. Rust derives it via SHA3-512 hash-to-curve of the domain string; this build uses RFC 9380 (SHA-512) via @noble/curves. The two H points don't share bytes, but each is an independent generator with no known discrete log w.r.t. G. Proofs built here verify here; proofs built in Rust verify in Rust. Cross-impl verification would require matching the hash-to-curve algorithm.

spec · TDD-005 zk.rs | generated · 2026-05-16 | curve · ristretto255 · 14-bit range

scenario · high-aligned

action · read_file:src/index.ts

A safe read action with strong alignment. Score 0.92, threshold 0.75. The verifier learns that the score is at or above 0.75 (via the prover's refusal-to-create rule); it does not learn that the score was 0.92 specifically.

proof construction

~10ms

14 × Pedersen commit + 14 × Schnorr OR · estimate from zk.rs §perf, not a benchmark

bundle on the wire

~3168 B

value commit (32B) + 14 bit commits (32B each) + 14 Schnorr proofs (6×32B each)
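The wire-size figure follows directly from the breakdown above. A minimal sketch of the arithmetic, assuming every field is a 32-byte ristretto255 encoding and that metadata (hashes, threshold, timestamp) is not counted; the constant names are illustrative only:

```python
# Assumption: each point or scalar is serialized as 32 bytes, and the
# six proof fields are (a0, a1, e0, s0, e1, s1). Names are hypothetical.
FIELD_BYTES = 32
NUM_BITS = 14
FIELDS_PER_OR_PROOF = 6

value_commit_bytes = FIELD_BYTES
bit_commit_bytes = NUM_BITS * FIELD_BYTES
or_proof_bytes = NUM_BITS * FIELDS_PER_OR_PROOF * FIELD_BYTES

bundle_bytes = value_commit_bytes + bit_commit_bytes + or_proof_bytes
print(bundle_bytes)  # 3168
```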

what verifier learns

a valid 14-bit score

proves: prover knew a 14-bit integer committing to C_value. ≥-threshold binding is by prover-side refusal-to-create, not the range proof.
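The refusal-to-create rule can be pictured as a guard that runs before any commitment is built. The sketch below is an assumption about its shape, not the zk.rs implementation: `create_or_refuse` and `BelowThreshold` are hypothetical names, and rounding (rather than truncation) to fixed-point is assumed.

```python
SCALE = 10_000  # fixed-point factor used in the walkthrough (x10000)


class BelowThreshold(Exception):
    """Raised when the prover refuses to build a proof."""


def create_or_refuse(score: float, threshold: float) -> int:
    """Return the fixed-point score to be committed, or refuse.

    Hypothetical helper: a real prover would go on to build the
    14-bit range proof for the returned value.
    """
    fixed_score = round(score * SCALE)
    fixed_threshold = round(threshold * SCALE)
    if fixed_score < fixed_threshold:
        # No bundle is ever produced for a below-threshold score.
        raise BelowThreshold(f"{fixed_score} < {fixed_threshold}")
    return fixed_score


print(create_or_refuse(0.92, 0.75))  # 9200
```

Because the range proof itself only shows a valid 14-bit value, the verifier's confidence in "at or above threshold" rests on the prover actually running a guard like this one.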

Prover side — what is known

14-bit fixed-point

similarity score

0.9200

cosine vs constitution centroid

×10000→

fixed-point value

9,200

threshold 7,500 (0.75)

bit decomposition · 2¹³ ... 2⁰

2¹³=1 · 2¹²=0 · 2¹¹=0 · 2¹⁰=0 · 2⁹=1 · 2⁸=1 · 2⁷=1 · 2⁶=1 · 2⁵=1 · 2⁴=1 · 2³=0 · 2²=0 · 2¹=0 · 2⁰=0
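The fixed-point conversion and bit decomposition above can be reproduced in a few lines. This is a sketch under stated assumptions: the helper names `to_fixed` and `to_bits` are illustrative, and rounding to the nearest integer is assumed since the source only specifies the ×10000 scale.

```python
SCALE = 10_000   # fixed-point factor (x10000)
NUM_BITS = 14    # width of the range proof


def to_fixed(score: float) -> int:
    """Scale a similarity score to an integer that fits in 14 bits."""
    fixed = round(score * SCALE)
    if not 0 <= fixed < 2 ** NUM_BITS:
        raise ValueError("score does not fit in 14 bits")
    return fixed


def to_bits(value: int) -> list[int]:
    """Return bits most significant first: [b_13, ..., b_0]."""
    return [(value >> i) & 1 for i in range(NUM_BITS - 1, -1, -1)]


fixed = to_fixed(0.92)
bits = to_bits(fixed)
print(fixed)  # 9200
print(bits)   # [1, 0, 0, 0, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0]
```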

prover also holds 14 fresh blindings r_i — one per bit, never sent

Construction — Pedersen + Schnorr OR per bit

Fiat-Shamir · SHA3-512

G (Ristretto basepoint) · e2f2ae0a6abc…45e08d2d76

H = hash-to-curve("tdd004_pedersen_h_v1") · 5052fd082537…74d5bb973a

per-bit Pedersen commits · C_i = b_i · G + r_i · H

b₁₃ · 2c9c…0026
b₁₂ · 6224…5a11
b₁₁ · 60ac…3e51
b₁₀ · ba53…2f7f
b₉ · a6c6…ed39
b₈ · d469…ee3d
b₇ · f0ca…9d2c
b₆ · deec…ec4c
b₅ · 1898…0e27
b₄ · 900b…fc7c
b₃ · 548e…c830
b₂ · c691…687c
b₁ · 444d…4071
b₀ · 9a47…a80c

per-bit Schnorr OR proof · 6 fields (a0, a1, e0, s0, e1, s1) · prove b_i ∈ {0, 1} without revealing which

bit-0 proof bundle (verifier-visible only)

0404a51f…628843

563afdcb…acc33c

b8fd56ec…290200

cf48fc6a…70ad0a

57f30fa2…8fd405

d1c33326…0aa901

+ 13 more identical-shape proofs (one per bit)
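To make the six-field proof shape concrete, here is a sketch of a Schnorr OR proof for a single bit. It is deliberately not ristretto255: it assumes a toy multiplicative group modulo a small prime (so C = G^b · H^r stands in for b·G + r·H), a toy hash-then-square derivation of H, and hypothetical function names. It is insecure at this size and only illustrates the structure.

```python
import hashlib
import secrets

# Toy parameters (assumption, NOT ristretto255): squares mod the prime P
# form a subgroup of order Q = (P - 1) // 2. Far too small for real use.
P = 10**9 + 7
Q = (P - 1) // 2
G = 4  # 2^2, a square mod P


def sha3_int(data: bytes) -> int:
    return int.from_bytes(hashlib.sha3_512(data).digest(), "big")


# Toy stand-in for hash-to-curve: hash the domain string, then square.
H = pow(sha3_int(b"tdd004_pedersen_h_v1") % P, 2, P)


def challenge(c: int, a0: int, a1: int) -> int:
    """Fiat-Shamir challenge over (C, a_0, a_1)."""
    data = b"".join(x.to_bytes(8, "big") for x in (c, a0, a1))
    return sha3_int(data) % Q


def prove_bit(bit: int, r: int):
    """Commit to bit with blinding r and prove bit is 0 or 1."""
    c = pow(G, bit, P) * pow(H, r, P) % P
    targets = (c, c * pow(G, -1, P) % P)  # H^r if bit is 0 / if bit is 1
    sim = 1 - bit                          # branch that gets simulated
    a, e, s = [0, 0], [0, 0], [0, 0]
    e[sim], s[sim] = secrets.randbelow(Q), secrets.randbelow(Q)
    a[sim] = pow(H, s[sim], P) * pow(targets[sim], -e[sim], P) % P
    k = secrets.randbelow(Q)               # nonce for the real branch
    a[bit] = pow(H, k, P)
    e[bit] = (challenge(c, a[0], a[1]) - e[sim]) % Q
    s[bit] = (k + e[bit] * r) % Q
    return c, (a[0], a[1], e[0], s[0], e[1], s[1])


def verify_bit(c: int, proof) -> bool:
    a0, a1, e0, s0, e1, s1 = proof
    if (e0 + e1) % Q != challenge(c, a0, a1):
        return False
    y0, y1 = c, c * pow(G, -1, P) % P
    branch0 = pow(H, s0, P) == a0 * pow(y0, e0, P) % P
    branch1 = pow(H, s1, P) == a1 * pow(y1, e1, P) % P
    return branch0 and branch1


c, proof = prove_bit(1, secrets.randbelow(Q))
print(verify_bit(c, proof))  # True
```

Both branches satisfy their verification equation, and only the sum e0 + e1 is pinned by the hash, which is why the verifier cannot tell which branch was the real one.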

value commitment · Σ(2ⁱ · C_i) — homomorphic sum binds the bits together

C_value · 46c79ce174de…1c8eea350d
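The homomorphic sum can be checked end to end in a toy setting. The sketch below assumes a small multiplicative group instead of ristretto255, so Σ(2ⁱ · C_i) becomes a product of C_i raised to 2ⁱ; the parameters, the hash-then-square H, and the function names are illustrative assumptions, and the group is far too small to be secure.

```python
import hashlib
import secrets

# Toy parameters (assumption, NOT ristretto255): squares mod the prime P
# form a subgroup of order Q = (P - 1) // 2.
P = 10**9 + 7
Q = (P - 1) // 2
G = 4
digest = hashlib.sha3_512(b"tdd004_pedersen_h_v1").digest()
H = pow(int.from_bytes(digest, "big") % P, 2, P)  # toy hash-to-group

NUM_BITS = 14


def commit(value: int, blinding: int) -> int:
    """Pedersen commitment, multiplicative form of value*G + blinding*H."""
    return pow(G, value, P) * pow(H, blinding, P) % P


def weighted_sum(bit_commits: list[int]) -> int:
    """Analogue of sum(2^i * C_i); bit_commits[i] carries weight 2^i."""
    acc = 1
    for i, c in enumerate(bit_commits):
        acc = acc * pow(c, 1 << i, P) % P
    return acc


value = 9200
bits = [(value >> i) & 1 for i in range(NUM_BITS)]  # least significant first
blindings = [secrets.randbelow(Q) for _ in range(NUM_BITS)]
bit_commits = [commit(b, r) for b, r in zip(bits, blindings)]

# The total blinding is the same weighted sum applied to the r_i.
total_blinding = sum(r << i for i, r in enumerate(blindings)) % Q
c_value = commit(value, total_blinding)
print(weighted_sum(bit_commits) == c_value)  # True

# Tampering with one bit commitment breaks the sum check.
tampered = list(bit_commits)
tampered[3] = tampered[3] * G % P
print(weighted_sum(tampered) == c_value)     # False
```

The tampered case mirrors the verifier-rejection scenario: changing any single C_i shifts the weighted sum away from C_value.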

Verifier side — runs verify()

knows: threshold, action_hash, constitution_hash, bundle

per-bit Schnorr OR check (e_0 + e_1 ≟ Fiat-Shamir(C, a_0, a_1) & branch equations)

✓ b₁₃ · ✓ b₁₂ · ✓ b₁₁ · ✓ b₁₀ · ✓ b₉ · ✓ b₈ · ✓ b₇ · ✓ b₆ · ✓ b₅ · ✓ b₄ · ✓ b₃ · ✓ b₂ · ✓ b₁ · ✓ b₀

homomorphic sum check · Σ(2ⁱ · C_i) ≟ C_value

✓ Σ(2^i · C_i) == C_value — bits committed consistently


protocol details · ristretto255 · 14-bit range

revealed to verifier

· action_hash
· constitution_hash
· threshold
· value_commitment
· per-bit commitments
· per-bit Schnorr OR proofs (a0, a1, e0, s0, e1, s1)
· timestamp

hidden from verifier

· similarity_score (the actual scalar)
· per-bit values (b_i)
· per-bit blindings (r_i)
· total_blinding (r)
· which Schnorr OR branch was the real one

Rust uses SHA3-512 hash-to-curve; this build uses RFC 9380 (SHA-512). The H bytes differ between implementations, but the protocol shape is identical.