Law 07 · Reasoning & Planning

Don't Bet on One Chain

Sample many reasoning paths and let them vote.

The principle

A single greedy chain of thought is fragile. Sample several independent reasoning paths and take the majority answer, and you get large, consistent gains. Correct reasoning tends to converge while mistakes scatter, so agreement across independently generated plans is a real signal worth trusting before you act on something that matters.

Why it happens

One sampled reasoning path is one route through a probabilistic space. If it makes a bad early move, everything after it inherits that move. Multiple independent attempts give you a different signal: correct answers tend to converge, while mistakes scatter. Repeated sampling only helps when you can choose among the samples, through majority vote for comparable answers or through an external verifier for plans and artifacts. Use it for consequential, hard-to-reverse decisions. Do not spend 5x compute on routine steps that are cheap to undo.

Watch for

High-stakes outputs ride on a single greedy generation with no second opinion.
Re-running the same prompt yields meaningfully different answers, revealing the first one was luck.
Errors slip through because nothing checks whether independent attempts actually agree.

In practice

Your agent estimates a quote for a custom order in one greedy pass, lands on $1,400, and you send it to the customer, only to discover it dropped a line item that should have made it $2,100. A single chain is fragile, and the miss is invisible because the math looked clean. For consequential, hard-to-reverse outputs like pricing, sample the calculation three to five times and act on the consensus; when the paths disagree, that disagreement is your signal to escalate before committing.

Apply it

For consequential decisions, generate the answer several independent times instead of trusting the first.
Take the majority answer when outputs are comparable, or use an external check to pick among them.
Treat disagreement across the samples as a signal to escalate rather than silently picking one.

The takeaway

For high-stakes decisions, generate the plan or answer a few times and act on the consensus, not on the first chain you happened to get.

Sources and further reading

Get the audit kit Access the buyer edition Back to all 50 laws

The principle

Why it happens

Watch for

Apply it

Sources and further reading

Related laws