Archimedes First Principles Prover

Archimedes First Principles Prover MCP Connector for Claude

A+

An AI recommended restructuring 'because the industry leader does it that way.' That is analogy — not axiom. The foundational claim was never stated: at what volume does the current structure fail? The problem was never decomposed. The conclusion was asserted with 'obviously' — never derived. The boundaries were never tested: when does this approach cost more than the value it creates? Archimedes derived pi from polygon geometry — not from Egyptian estimates. This tool forces axiom examination, decomposition, proof derivation, boundary testing, and leverage point identification.

1 tools Official Updated Jun 28, 2026 Official Vinkius Partner

AI agents reason by analogy. 'Organization X did Y, so we should too.' They build on unquestioned axioms. They treat complex problems as monolithic. They assert conclusions without proof. They never test where their logic breaks. They miss the lever point.

The Problem

LLMs commit five first-principles failures:

  • Axiom Unexamined — 'We need to restructure because the industry leader restructured.' The axiom — at what actual demand level does YOUR current structure fail? — was never stated. The reasoning rests on analogy: someone else's system, someone else's scale, someone else's constraints. Archimedes never said 'the Egyptians estimated pi at 3.16, so we should too.' He derived it from polygon geometry.
  • Decomposition Absent — 'The whole system needs to be redesigned.' Which part? What are the irreducible components? What interacts with what? Archimedes decomposed curves into infinitesimal straight-line segments to calculate areas. He did not say 'the whole curve is complex.'
  • Proof Missing — 'Obviously we should adopt this approach.' Why? From what axiom? Through what logical chain? 'Obviously' is the word people use when they cannot prove something. Archimedes: 'We must suppose that... therefore it follows that...'
  • Boundary Untested — 'This approach scales to any demand.' At what point does the cost exceed the value it creates? At what capacity does the system exhaust its resources? Archimedes bounded pi between 3+10/71 and 3+1/7. He tested BOTH limits. Where are yours?
  • Leverage Ignored — 'All five initiatives are equally important.' If everything is a priority, nothing is. Archimedes moved ships with levers. The fulcrum placement determines whether you lift a stone or a fleet. Which variable produces disproportionate output?

How It Works

5 Decision Pivots following Archimedes' methodology:

  1. axiomExamined — Foundational claims explicitly stated, sourced, and questioned. Not analogies.
  2. decompositionComplete — Complex problem reduced to irreducible components with interaction map.
  3. proofDerived — Conclusion logically derived step by step from axioms. Not asserted.
  4. boundaryTested — Lower bound, upper bound, edge case, and failure mode documented.
  5. leverageIdentified — Single point where small input → disproportionate output.

The Verdict Matrix

First Failing Pivot Verdict Meaning
axiomExamined = false AXIOM_UNEXAMINED Reasoning built on unquestioned foundations.
decompositionComplete = false DECOMPOSITION_ABSENT Complex problem treated as monolithic.
proofDerived = false PROOF_MISSING Conclusion asserted, not derived.
boundaryTested = false BOUNDARY_UNTESTED No limits tested. Unbounded claim.
leverageIdentified = false LEVERAGE_IGNORED No lever point found. Equal effort.
All pivots pass PRINCIPLES_PROVEN Axioms stated. Decomposed. Proven. Bounded. Leveraged.

Why It Works

  • Tool calls are obligations. The agent cannot propose a strategy without stating axioms, decomposing the problem, showing the proof chain, testing boundaries, and identifying the lever point.
  • Consistency engine catches contradictions. If the agent claims axiomExamined=true but says 'just like the industry leader,' the engine rejects — analogy is not axiom.
  • Semantic traps detect lazy reasoning. 'Obviously,' 'works in all cases,' 'everything is equally important,' and 'the whole system' trigger automatic rejection.
first-principlesaxiom-reasoningdecompositionproof-derivationboundary-testingleverage-pointarchimedesmethod-of-exhaustion

1 tools expose this connector's capabilities to your AI agent.

validate_archimedes_first_principles

Think like Archimedes — "Give me a lever long enough and a fulcrum on which to place it, and I shall move the world." You must: (1) EXAMINE AXIOMS — state every foundational claim your reasoning depends on. For each axiom: what is the source (measurement, theory, assumption, convention)? Could it be wrong? What changes if it is? Analogies ("Company X does Y, so we should too") are NOT axioms — they are pattern matches, not proofs, (2) DECOMPOSE TO FUNDAMENTALS — break the complex problem into irreducible components. Archimedes decomposed curved areas into infinitesimal segments. Each component must be independently analyzable. Map how components interact — does changing one affect another? What is the simplest possible version of each component? (3) DERIVE PROOF — show the logical chain from axioms to conclusion. Each step must follow necessarily from the previous. "Obviously," "clearly," and "it follows that" are assertions, not proofs. Archimedes: "We suppose... therefore it follows..." — each step is shown, never assumed, (4) TEST BOUNDARIES — find where the principle starts working, stops working, and breaks. Archimedes bounded pi between 3+10/71 and 3+10/70 — BOTH above and below. "Works in all cases" is an unbounded claim. What is the minimum input where it starts working? The maximum input where it fails? (5) FIND LEVERAGE — identify the single variable where small input produces disproportionate output. "Everything is equally important" means no lever has been found. Archimedes moved ships with levers. Where is YOUR fulcrum? If rejected, your reasoning has a structural gap — deepen the analysis. Structured reflection tool for Archimedean first-principles reasoning. Forces the agent to examine axioms with sources, decompose to irreducible components, derive conclusions logically from foundations, test boundary conditions, and identify the leverage point where minimal input produces disproportionate output. Catches Axiom Blindness (reasoning from unexamined assumptions or analogies-as-axioms), Decomposition Absence (treating complex problems as monoliths), Proof Gaps (assertions disguised as derivations — "obviously" and "clearly"), Boundary Ignorance (unbounded claims that "work in all cases"), and Leverage Blindness ("everything is equally important" — no point of disproportionate returns identified). Call once per strategic, architectural, or analytical reasoning task

See how to talk to your AI agent using Archimedes First Principles Prover.

We should adopt a decentralized structure because the top 3 companies in our industry all use decentralized teams. Obviously this is the right approach.

AXIOM_UNEXAMINED — 'The top companies use decentralized teams' is an analogy, not an axiom. Archimedes derived pi from polygon geometry — he did not copy the Egyptians. At what actual team size does YOUR centralized structure fail? If your organization has 40 people and coordination works, 'we need independent units' is false. 'Obviously' is the word people use when they cannot prove something. State the foundational claim: what constraint makes the current structure impossible at YOUR scale?

Our conversion rate is 2.1% vs. industry average of 3.5%. We should optimize the checkout flow like the leading stores do.

DECOMPOSITION_ABSENT — Axioms partially pass: 2.1% conversion is measured (good), 3.5% industry average is sourced (good). But 'like leading stores' is analogy contamination. More critically: decomposition is missing. 'Optimize the checkout' treats the funnel as monolithic. WHERE do users drop? Is it the address form (68% of Baymard's data)? Payment step? Shipping cost reveal? Decompose: cart → address → shipping → payment → confirmation. Which step has the highest drop-off? Without decomposition, you optimize the wrong step. Archimedes did not say 'the curve is complex' — he decomposed it into segments.

Our quality inspection takes 340 minutes per batch. We need to restructure the entire organization to fix this bottleneck.

PROOF_MISSING — 'Restructure the entire organization' is not derived from the axiom. The axiom is measured (340 min — good), but the conclusion jumps to restructuring without proof. WHY does 340 min mean restructuring? Decompose first: which step in the inspection takes the most time? If it is cross-referencing 3 data sources manually, a standardized template addresses the source — no restructuring needed. Archimedes: 'We must suppose that... therefore it follows that...' Your 'therefore' is missing. What logical chain connects '340 minutes' to 'restructure everything'?

Elon Musk Physics Prover forces the 5-Step Starbase Algorithm: question, delete, simplify, accelerate, automate. It is about operational engineering — cutting bloat. Archimedes First Principles Prover forces axiom-based reasoning: state axioms, decompose, prove, test boundaries, find leverage. It is about analytical rigor — proving your logic before building. Musk asks 'should this exist?' Archimedes asks 'is this actually true?'

Related Connectors