Edison Experimentation Prover MCP Connector for Claude
A+A team chose paper filing because 'best practice.' No pilot. No alternatives tested. 8 months later, 14,000 submissions/day — 47 hours behind on retrieval. Emergency migration under pressure. Cost: 3x the estimate, 6 weeks frozen. Edison tested 3,000+ filament materials before carbonized bamboo — he did not pick the 'obvious choice.' This tool forces experimentation: test alternatives with measured criteria, iterate beyond the first solution, build the ecosystem, prove viability under real conditions, and document dead ends.
AI agents pick the first solution that comes to mind and call it done. They do not test alternatives. They do not iterate. They build the feature but forget the ecosystem. They claim 'it works' without real-world evidence. And they never document the dead ends.
The Problem
LLMs commit five experimentation failures:
- Experiment Absent — 'The best approach is the traditional method.' Based on what? Which alternatives were evaluated? With what criteria? What did each score? Edison tested 3,000+ filament materials — platinum, carbon, bamboo, horsehair, fishing line, cork, celluloid — before finding carbonized bamboo from Kyoto lasted 1,200 hours. 'Best practice says' is not an experiment.
- Iteration Insufficient — 'It works, let us ship it.' The first working version is rarely the best. Edison ran 10,000+ experiments on the storage battery. What variations did you test? What improved? Where did returns diminish? 'Good enough' means you stopped experimenting too early.
- System Incomplete — 'We built the verification module. Rollout is someone else's problem.' Edison did not just build the light bulb — he built generators, cables, switches, meters, fuses, AND sockets. The bulb without the power grid is a curiosity, not a product. Is your solution deployable? Monitored? Documented? Does a user know how to go from zero to using it?
- Viability Unproven — 'The design is operationally sound.' Theoretically. Under what volume? At what cost? With what adoption friction? Edison required every invention to be commercially viable within 6 months. 'Works in our pilot' is not viability evidence.
- Failure Undocumented — 'We tried some things and settled on this approach.' Which things? What happened with each? Why did they fail? What did you learn? Edison kept 3,500+ notebooks: 'Experiment #247: carbonized cedar — lasted 38 minutes, crumbled at high temperature.' THAT is documentation.
How It Works
5 Decision Pivots following Edison's methodology:
- experimentConducted — Alternatives tested with measured criteria and results.
- iterationSufficient — Variations tested beyond first working solution to diminishing returns.
- systemComplete — Ecosystem built: rollout plan, monitoring, documentation, transition, onboarding.
- viabilityProven — Real-world evidence, operating cost, adoption friction, success metrics.
- failureDocumented — Each dead end: approach, measured result, root cause, learning.
The Verdict Matrix
| First Failing Pivot | Verdict | Meaning |
|---|---|---|
| experimentConducted = false | EXPERIMENT_ABSENT | First idea accepted without testing. |
| iterationSufficient = false | ITERATION_INSUFFICIENT | Stopped at first working solution. |
| systemComplete = false | SYSTEM_INCOMPLETE | Built the bulb, forgot the grid. |
| viabilityProven = false | VIABILITY_UNPROVEN | "Should work" — no real evidence. |
| failureDocumented = false | FAILURE_UNDOCUMENTED | "We tried some things" — no record. |
| All pivots pass | EXPERIMENT_PROVEN | Tested. Iterated. Complete. Viable. Documented. |
Related Connectors
Lumber Quantity Calculator MCP
Estimate board feet of lumber needed for wall, floor, and roof framing.
DAS Payment Scheduler MCP
Automated annual calendar generator for Brazilian Simples Nacional tax payments, handling holiday-based date shifts and overdue period alerts.
Contingency Budget Calculator MCP
Calculate essential contingency reserves for architectural and construction projects based on development phases.
Curie Measurement Prover MCP
A team shipped a 'significant performance improvement.' Workflow, supplier, automation, and schedule all changed at once. Processing time dropped — but nobody knew which change helped most. The automation alone accounted for 89% of the gain. The supplier switch introduced a defect that surfaced 72 hours later. Curie processed 8 tons of pitchblende to isolate 0.1 grams of radium — ONE element at a time. This tool forces that discipline: measure with numbers, isolate variables, validate across environments, persist through investigation, and quantify risks.