A live walkthrough · four cognitive agents

Four ways to think, no LLMs.

Most multi-agent systems are several copies of the same LLM voting on one prompt — same blind spots, just averaged. This system runs four rule-based agents in parallel. Each one uses a completely different algorithm: decomposition, lateral signals, structural risk, multi-domain taxonomy. Zero LLM calls anywhere on this page.

Most chatbots

1 point of view

One model answers. Its blind spots become your output's blind spots.

Ensemble voting

same×N

copies vote

The 2015 ensemble pattern. Same model, multiple calls, majority wins. Same blind spots, averaged.

This system

4 paths

structurally different algorithms

Analytical decomposes the task structurally. Creative measures lateral signals. Adversarial runs CWE pattern matching + entropy. DomainExpert classifies across a multi-domain taxonomy. Different failure modes = genuine diverse redundancy.
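
As a rough illustration of the idea above, here is a minimal sketch of four rule-based agents running in parallel on one task. The agent functions, their heuristics, and the names AGENTS and run_all are hypothetical stand-ins invented for this example, not the system's actual algorithms.

```python
import re
from concurrent.futures import ThreadPoolExecutor

def analytical(task):
    """Toy decomposition: count sub-commands split on shell separators."""
    steps = [s for s in re.split(r"&&|\|\||;|\|", task) if s.strip()]
    return {"steps": len(steps)}

def creative(task):
    """Toy lateral signal: share of distinct tokens among all tokens."""
    tokens = task.split()
    return {"novelty": len(set(tokens)) / len(tokens) if tokens else 0.0}

def adversarial(task):
    """Toy risk check: count matches against a few destructive patterns."""
    patterns = (r"\brm\s+-rf\b", r"\bsudo\b", r"\bdropdb\b")
    return {"risk_hits": sum(1 for p in patterns if re.search(p, task))}

def domain_expert(task):
    """Toy taxonomy: return the first domain whose keywords appear."""
    taxonomy = {
        "security": ("sudo", "passwd", "rm -rf"),
        "data": ("select", "csv"),
    }
    lowered = task.lower()
    for domain, keywords in taxonomy.items():
        if any(k in lowered for k in keywords):
            return {"domain": domain}
    return {"domain": "unknown"}

# Hypothetical registry: one entry per agent style.
AGENTS = {
    "analytical": analytical,
    "creative": creative,
    "adversarial": adversarial,
    "domain_expert": domain_expert,
}

def run_all(task):
    """Submit every agent at once and collect results by agent name."""
    with ThreadPoolExecutor(max_workers=len(AGENTS)) as pool:
        futures = {name: pool.submit(fn, task) for name, fn in AGENTS.items()}
        return {name: fut.result() for name, fut in futures.items()}

if __name__ == "__main__":
    print(run_all("rm -rf /var/admin && sudo dropdb users; cat /etc/passwd"))
```

Each agent looks at a different property of the same input, so a mistake in one heuristic does not automatically repeat in the others.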

Below is a working example. The walkthrough plays automatically — five real tasks running through all four agents in parallel, the meta-controller composing the verdicts.

Scope: this walkthrough uses the agents' static default weights. The system can also adapt weights over time based on which agent style tends to be right for which task class — that learning behavior is a separate scene.
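
One way to picture static default weights is a weighted average over per-agent scores. The sketch below assumes each agent reports a score between 0 and 1 and that all four weights start equal; the weight values, DEFAULT_WEIGHTS, and compose are assumptions made for illustration, not the system's real configuration.

```python
# Hypothetical static defaults: every agent style counts equally.
DEFAULT_WEIGHTS = {
    "analytical": 0.25,
    "creative": 0.25,
    "adversarial": 0.25,
    "domain_expert": 0.25,
}

def compose(scores, weights=DEFAULT_WEIGHTS):
    """Combine per-agent scores in [0, 1] into one weighted verdict score."""
    total = sum(weights[name] for name in scores)
    if total <= 0:
        raise ValueError("weights for the given agents must sum to a positive value")
    return sum(scores[name] * weights[name] for name in scores) / total

if __name__ == "__main__":
    scores = {
        "analytical": 0.2,
        "creative": 0.0,
        "adversarial": 1.0,
        "domain_expert": 0.8,
    }
    print(compose(scores))
```

Adapting weights over time would amount to passing a different weights mapping per task class; how those weights are learned is outside this sketch.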

spec · TDD-005 cognitive layer
generated · 2026-05-12
scenarios · 5

Scenario 1 of 5

task-1 · primary domain · security

“rm -rf /var/admin && sudo dropdb users; cat /etc/passwd”

A destructive shell sequence with privilege escalation. Adversarial's structural risk analysis catches it without any LLM in the loop.
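
To make "pattern matching plus entropy" concrete, here is a minimal sketch of a rule-based scan over the task string. The pattern list, the labels, and the functions scan and shannon_entropy are illustrative assumptions; the real agent's CWE rules and thresholds are not shown on this page.

```python
import math
import re
from collections import Counter

# Hypothetical rules; a real rule set would map each one to a CWE identifier.
RISK_PATTERNS = {
    "recursive forced delete": r"\brm\s+-rf\b",
    "privilege escalation": r"\bsudo\b",
    "database drop": r"\bdropdb\b",
    "credential file read": r"/etc/passwd",
}

def scan(task):
    """Return the labels of every risk pattern found in the task text."""
    return [label for label, pat in RISK_PATTERNS.items() if re.search(pat, task)]

def shannon_entropy(text):
    """Character-level Shannon entropy in bits per character."""
    if not text:
        return 0.0
    counts = Counter(text)
    n = len(text)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

if __name__ == "__main__":
    task = "rm -rf /var/admin && sudo dropdb users; cat /etc/passwd"
    print(scan(task))
    print(round(shannon_entropy(task), 2))
```

Entropy is often used as a cheap signal for encoded or obfuscated payloads, which pattern rules alone can miss; whether the system uses it that way is an assumption here.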

Walkthrough · beat 1 of 7 · intro

this task cost

8.5ms

4 structural agents in parallel + meta

4-LLM ensemble would cost

~50ms

parallel LLM calls, ~13ms each

saved

83.0%

no LLM, no API bill
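
The savings figure follows from the two timings above. A minimal check, assuming the ~50ms baseline is roughly four calls of ~13ms added together (52ms) and that "saved" means one minus the ratio of the two costs; percent_saved is a hypothetical helper name.

```python
def percent_saved(actual_ms, baseline_ms):
    """Percentage of the baseline cost avoided by the actual cost."""
    if baseline_ms <= 0:
        raise ValueError("baseline_ms must be positive")
    return (1 - actual_ms / baseline_ms) * 100

if __name__ == "__main__":
    # 8.5 ms for this task versus the ~50 ms estimate for a 4-LLM ensemble.
    print(round(percent_saved(8.5, 50.0), 1))  # 83.0
```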
