Law 07 · Reasoning & Planning
Don't Bet on One Chain
Sample many reasoning paths and let them vote.

The principle
A single greedy chain of thought is fragile. Sample several independent reasoning paths and take the majority answer, and you get large, consistent gains in accuracy over trusting any one chain. Correct reasoning tends to converge while mistakes scatter, so agreement across independently generated plans is a real signal worth trusting before you act on something that matters.
Why it happens
One sampled reasoning path is one route through a probabilistic space. If it makes a bad early move, everything after it inherits that move. Independent attempts tend to fail in different ways, so wrong answers spread out while sound reasoning keeps arriving at the same one. The attempts have to be genuinely sampled (nonzero temperature); re-running a greedy decode mostly reproduces the same path and tells you nothing new. Repeated sampling also only helps when you can choose among the samples: by majority vote when answers are directly comparable, or by an external verifier for plans and artifacts. Reserve it for consequential, hard-to-reverse decisions. Do not spend 5x compute on routine steps that are cheap to undo.
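To make the convergence argument concrete, here is a minimal sketch under a deliberately simplified model: each sample is either right or wrong, samples are fully independent, and each is right with the same probability p. The function name and the value p = 0.7 are illustrative assumptions, not measurements.

```python
from math import comb

def majority_correct_probability(p: float, n: int) -> float:
    """Probability that more than half of n independent samples are correct,
    assuming each one is correct with probability p (simplified binary model)."""
    needed = n // 2 + 1  # smallest count that forms a strict majority
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(needed, n + 1))

# Assumed per-sample accuracy of 0.7, chosen only for illustration.
for n in (1, 3, 5):
    print(n, round(majority_correct_probability(0.7, n), 3))
# 1 0.7
# 3 0.784
# 5 0.837
```

Two caveats on reading these numbers. Samples from the same model and prompt are correlated, so real gains are usually smaller than this model suggests. And voting amplifies whatever answer is most likely: with p below 0.5 in this binary model, more samples make the majority worse, not better.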
Watch for
- High-stakes outputs ride on a single greedy generation with no second opinion.
- Re-running the same prompt yields meaningfully different answers, revealing the first one was luck.
- Errors slip through because nothing checks whether independent attempts actually agree.
In practice
Your agent estimates a quote for a custom order in one greedy pass, lands on $1,400, and you send it to the customer, only to discover it dropped a line item that should have made it $2,100. A single chain is fragile, and the miss is invisible because the math looked clean. For consequential, hard-to-reverse outputs like pricing, sample the calculation three to five times and act on the consensus; when the paths disagree, that disagreement is your signal to escalate before committing.
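The quote scenario can be sketched in a few lines. This is a hedged illustration: the function name, the five sampled amounts, and the 0.6 agreement threshold are all assumptions made up for the example.

```python
from collections import Counter

def consensus_quote(quotes, min_agreement=0.6):
    """Return (amount, agreement) when enough samples agree,
    or (None, agreement) to signal that a human should review the quote."""
    if not quotes:
        raise ValueError("need at least one sampled quote")
    amount, votes = Counter(quotes).most_common(1)[0]
    agreement = votes / len(quotes)
    if agreement < min_agreement:
        return None, agreement  # paths disagree: escalate instead of sending
    return amount, agreement

# Hypothetical results of five independent runs of the same quote calculation.
print(consensus_quote([2100, 2100, 1400, 2100, 2100]))  # (2100, 0.8)

# Three runs, three different totals: no amount is returned.
amount, agreement = consensus_quote([1400, 2100, 1750])
print(amount)  # None
```

Consensus is evidence, not proof. If every run drops the same line item, the samples will agree on the wrong total, so a vote complements a check against the order's line items and does not replace it.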
Apply it
- For consequential decisions, generate the answer several independent times instead of trusting the first.
- Take the majority answer when outputs are comparable, or use an external check to pick among them.
- Treat disagreement across the samples as a signal to escalate rather than silently picking one.
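The three steps above can be sketched as one small function. Everything here is hypothetical scaffolding: `generate` stands in for whatever call produces one sampled answer, `verifier` for whatever external check you have, and the defaults of five samples and a 0.6 agreement threshold are assumptions to tune.

```python
from collections import Counter
from typing import Callable, Optional, Tuple, TypeVar

T = TypeVar("T")

def decide(generate: Callable[[], T],
           n: int = 5,
           verifier: Optional[Callable[[T], bool]] = None,
           min_agreement: float = 0.6) -> Tuple[Optional[T], bool]:
    """Sample n candidates and return (choice, escalate)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates = [generate() for _ in range(n)]
    if verifier is not None:
        # Plans and artifacts rarely match exactly, so pick one that passes the check.
        passing = [c for c in candidates if verifier(c)]
        return (passing[0], False) if passing else (None, True)
    # Comparable answers (must be hashable): majority vote with a threshold.
    answer, votes = Counter(candidates).most_common(1)[0]
    if votes / n < min_agreement:
        return None, True  # disagreement is the signal to escalate
    return answer, False

# Stand-ins for a model call: iterators that replay canned outputs.
answers = iter(["approve", "approve", "reject", "approve", "approve"]).__next__
print(decide(answers))  # ('approve', False)

plans = iter([["migrate", "backup"], ["backup", "migrate"]]).__next__
backs_up_first = lambda plan: plan.index("backup") < plan.index("migrate")
print(decide(plans, n=2, verifier=backs_up_first))  # (['backup', 'migrate'], False)
```

Returning an explicit escalate flag, rather than silently falling back to the first candidate, keeps the disagreement visible to whoever calls the function.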
The takeaway
For high-stakes decisions, generate the plan or answer a few times and act on the consensus, not on the first chain you happened to get.