How does Counterfactual-Variant Prover stop recitation bias?

By introducing structural friction. When an agent is forced to fill a schema requiring explicit separation of variables, mapping of differences, and step-by-step logic, it cannot rely on automatic token generation. The tool rejects any attempt to skip these steps or leak classic parameters.

What happens if a puzzle has no classic equivalent?

If no classic signature is detected, the model sets recitationSignatureDetected to false, maps variables, and solves it. However, if the text contains keywords of known puzzles (e.g. Monty Hall, Cheryl), the engine enforces the full counterfactual check to avoid semantic traps.

Can it be used alongside other reasoning provers?

Yes. It works as an orthogonal check. While the Critical Thinking Prover checks overall cognitive quality, the Counterfactual-Variant Prover focuses specifically on variable isolation and preventing memorization loops in logic and mathematics.

Counterfactual-Variant Prover MCP Connector for Claude

A+

AI models recite memorized answers to classic puzzles, failing when variables or rules are changed. This tool forces cognitive decontamination: isolate variables, compare prompt rules against standard puzzle templates, execute first-principles logic step-by-step, and prove decontaminated output.

1 tools Official Updated Jun 28, 2026 Official Vinkius Partner

More Details Connect to Claude

AI models exhibit high error rates when facing variations of classic logic puzzles, math problems, or public benchmarks. Because these puzzles (like Cheryl's Birthday, Monty Hall, or River Crossing) are heavily represented in training datasets, models fall back on pattern completion rather than active reasoning. They recite the standard solution even when the prompt contains modified parameters or contradictory rules. This tool interrupts memory-based retrieval by introducing structured cognitive constraints.

The Problem Axis: Recitation Bias

LLM reasoning fails on modified classic puzzles due to three main factors:

Retrieval Anchoring — The model recognizes the names or context of a famous puzzle and anchors to the standard solution, ignoring changes in variables.
Variable Contamination — Even when attempting to solve, the model blends constants from the classic puzzle (e.g. standard dates or weights) into its calculations.
Derivation Bypass — The model skips step-by-step logic and directly jumps to the memorized conclusion.

How It Works

Counterfactual-Variant Prover uses 5 Decision Pivots that force the agent to validate its thinking process:

recitationSignatureDetected — Has the model identified if the problem resembles a classic template or public benchmark?
variablesIsolated — Are all numeric constants, rules, and parameters extracted in isolation to prevent retrieval leakage?
ruleDiscrepancyMapped — Have the differences between this prompt's rules and the standard classic rules been explicitly mapped?
firstPrinciplesCalculated — Was the solution derived step-by-step using only the isolated variables and modified rules?
outputDecontaminated — Is the final output completely free of the classic memorized answer?

counterfactual-reasoningrecitation-biaslogical-puzzlescognitive-debiasingfirst-principlesdecontaminationagentic-reasoningllm-safety

1 tools expose this connector's capabilities to your AI agent.

validate_counterfactual

LLMs are trained on millions of solutions to classic puzzles — when a modified version appears, the memorized answer exerts gravitational pull on the output. You must: (1) IDENTIFY CLASSIC MATCH — name the specific classic puzzle this resembles (Monty Hall, Trolley Problem, Prisoner's Dilemma, Tower of Hanoi, etc.) and state the classic answer, (2) ISOLATE ALL VARIABLES — extract every variable, name, number, and rule from the prompt. Do NOT import any values from memory. Only values explicitly stated in the prompt exist, (3) MAP RULE DISCREPANCIES — for each rule in the prompt, compare it to the classic version. What is different? What is the same? What rules from the classic version are ABSENT from the prompt (and therefore cannot be assumed)?, (4) CALCULATE FROM FIRST PRINCIPLES — solve step-by-step using ONLY the isolated variables and modified rules. At each step, verify: "Am I using a value from the prompt or from memory?" If from memory, stop and correct, (5) DECONTAMINATE OUTPUT — compare your final answer to the classic answer. If they match, verify this is COINCIDENCE, not contamination. If they differ, verify the difference is justified by the modified rules. The classic answer should have ZERO influence on your calculation. If rejected, your logic is contaminated with memorized templates — recalculate from the prompt values only. Structured reflection tool to prevent recitation bias on logic puzzles with modified rules. Forces the agent to isolate all input variables, map rule discrepancies against the classic version, trace calculations from first principles using only modified values, and verify the output is decontaminated from memorized templates. Catches Data Recitation (reproducing the classic answer despite modified variables — the Monty Hall answer applied to a 4-door variant, the trolley problem answer applied to different constraints), Variable Contamination (using memorized values from the classic puzzle instead of the modified ones — "the answer is 42" when the modified inputs produce 37), Template Lock (applying the classic solution structure when modified rules require a different approach — using Bayesian probability when the modified rules eliminate conditional dependence), Implicit Classic Assumptions (assuming constraints from the classic version that the modified puzzle removes — "the host always opens a losing door" when the modified version says otherwise), and Partial Decontamination (correctly solving 3 of 4 steps but reverting to the classic answer for the final step — contamination often hides in the conclusion). Call once per logic puzzle that resembles a classic problem

Connect to Claude

Subscribe on Vinkius, then add this connector to Claude.ai or Claude Code.

① Claude.ai (web app)

Go to Settings → Connectors → Add custom connector
Paste the MCP endpoint URL below

https://edge.vinkius.com/vk_preview_DsoSPwZEWSeSBTBSQCX8U58BqAbjy82WIsWBk6Wg/mcp

② Claude Code (terminal)

claude mcp add --transport http counterfactual-variant-prover https://edge.vinkius.com/vk_preview_DsoSPwZEWSeSBTBSQCX8U58BqAbjy82WIsWBk6Wg/mcp

③ Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "counterfactual-variant-prover": {
      "url": "https://edge.vinkius.com/vk_preview_DsoSPwZEWSeSBTBSQCX8U58BqAbjy82WIsWBk6Wg/mcp"
    }
  }
}

Get full access on Vinkius

The preview token above works for testing. Powered by Vinkius.