How does the prover measure alignment with Bloom's Taxonomy?

It verifies that each learning objective uses an observable verb at the same cognitive level as its assessment task, and it rejects unmeasurable verbs such as 'understand' or 'appreciate'.
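
The alignment check can be illustrated with a minimal sketch. The verb table and the function name `check_alignment` are assumptions made for this example; the connector's actual vocabulary and logic are not published on this page.

```python
# Illustrative verb-to-level table for Bloom's Revised Taxonomy
# (1 = Remember ... 6 = Create). The verb lists are hypothetical.
BLOOM_LEVELS = {
    "recall": 1, "list": 1, "define": 1,
    "explain": 2, "summarize": 2, "classify": 2,
    "apply": 3, "solve": 3, "demonstrate": 3,
    "analyze": 4, "compare": 4, "differentiate": 4,
    "evaluate": 5, "critique": 5, "justify": 5,
    "create": 6, "design": 6, "construct": 6,
}

# Verbs the page names as unmeasurable.
UNMEASURABLE = {"understand", "appreciate"}


def check_alignment(objective_verb: str, assessment_verb: str) -> str:
    """Compare the cognitive level of an objective verb and an assessment verb."""
    obj = objective_verb.lower()
    task = assessment_verb.lower()
    if obj in UNMEASURABLE:
        return f"reject: '{obj}' is not observable"
    if obj not in BLOOM_LEVELS or task not in BLOOM_LEVELS:
        return "unknown verb: extend the table"
    if BLOOM_LEVELS[obj] != BLOOM_LEVELS[task]:
        return (
            f"reject: objective is level {BLOOM_LEVELS[obj]}, "
            f"assessment is level {BLOOM_LEVELS[task]}"
        )
    return "aligned"
```

For example, `check_alignment("evaluate", "recall")` reports a level 5 objective paired with a level 1 assessment, which is the mismatch described above.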

What are the scaffolding requirements?

It demands a clear plan for diagnosing prior knowledge, sequencing prerequisite concepts, and applying a scaffolded instruction model (such as the Gradual Release of Responsibility) rather than just handing out extra practice sheets.
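
Sequencing prerequisite concepts is essentially a dependency-ordering problem. The sketch below uses Python's standard `graphlib` module (Python 3.9+); the concept map and both function names are hypothetical and only illustrate the idea, not how the connector works internally.

```python
from graphlib import TopologicalSorter

# Hypothetical prerequisite map: concept -> concepts that must come first.
PREREQUISITES = {
    "orbital mechanics": {"circular motion", "Newtonian gravitation"},
    "circular motion": {"vector addition"},
    "Newtonian gravitation": {"vector addition"},
    "vector addition": set(),
}


def teaching_sequence(prereqs):
    """Return concepts ordered so every prerequisite precedes its dependents."""
    return list(TopologicalSorter(prereqs).static_order())


def missing_prerequisites(prereqs, target, mastered):
    """List direct and indirect prerequisites of `target` not yet mastered."""
    seen, stack = set(), list(prereqs.get(target, ()))
    while stack:
        concept = stack.pop()
        if concept in seen:
            continue  # already visited; also guards against cycles
        seen.add(concept)
        stack.extend(prereqs.get(concept, ()))
    return sorted(seen - set(mastered))
```

Feeding the result of a prior-knowledge diagnostic into `missing_prerequisites` shows which concepts need to be taught before the target concept.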

How does it detect and audit for bias?

It scans assessment descriptions and rubrics for cultural assumptions, language barriers, and accessibility issues, and checks them against Universal Design for Learning (UDL) principles.

Pedagogical Assessment Prover MCP Connector for Claude

A+

A curriculum listed 12 learning objectives. Every one used 'understand' — an unmeasurable verb. Pedagogical Assessment Prover forces Bloom's-aligned objectives, explicit rubrics, scaffolded instruction, and actionable feedback.

1 tool | Official | Updated Jun 28, 2026 | Official Vinkius Partner


AI agents generate lesson plans and assessments that look professional but violate fundamental principles of learning science. They write learning objectives using 'understand' — a verb that cannot be observed or measured. They assess at Level 1 (recall) while claiming to teach Level 4 (analysis). They provide feedback that is empty praise rather than actionable guidance.

The Problem It Solves

AI-generated pedagogical reasoning fails for five specific reasons:

Taxonomy misalignment — The learning objective says 'analyze' but the assessment tests 'remember.' This misalignment means you're measuring the wrong cognitive level.
Rubric absence — Evaluation without explicit, observable, measurable criteria. 'Grade based on quality' is subjective judgment, not assessment.
Scaffolding gap — Instruction that assumes prerequisites without diagnosing or building them. Teaching above the Zone of Proximal Development causes frustration; teaching below it causes boredom.
Feedback vacuum — 'Good job' and 'needs improvement' are value judgments, not feedback. Hattie's research (d = 0.70) shows feedback is among the most powerful influences on learning — but ONLY when task-specific and forward-looking.
Bias blindness — Assessment that disadvantages learners based on cultural background, language proficiency, or neurological differences without systematic review.
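
The d values quoted here are effect sizes. Assuming they are standardized mean differences in the usual Cohen's d sense, the sketch below shows how such a value is computed from two groups of scores. The scores are made up for illustration and are not data from Hattie's research.

```python
from math import sqrt
from statistics import mean, stdev


def cohens_d(treatment, control):
    """Standardized mean difference using the pooled sample standard deviation."""
    n1, n2 = len(treatment), len(control)
    s1, s2 = stdev(treatment), stdev(control)
    pooled = sqrt(((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2))
    return (mean(treatment) - mean(control)) / pooled


# Illustrative scores only: a class that received task-specific feedback
# versus one that did not.
with_feedback = [67, 72, 77, 82, 87]
without_feedback = [60, 65, 70, 75, 80]
effect = cohens_d(with_feedback, without_feedback)
```

An effect of d = 0.70 would mean the average learner in the treated group scores 0.70 pooled standard deviations above the average learner in the comparison group.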

Key Benefits

Enforces Bloom's alignment — Every learning objective must use observable verbs at the same cognitive level as the assessment task. No more 'students will understand.'
Demands explicit rubrics — Observable criteria, distinct performance levels, behaviorally anchored descriptors, shared with learners before assessment.
Requires scaffolded instruction — Prior knowledge diagnosis, prerequisite mapping, graduated release (I do → We do → You do), and UDL-compliant representations.
Forces actionable feedback — Feed Up (where am I going?), Feed Back (how am I doing?), Feed Forward (where to next?) per Hattie & Timperley's model.
Audits for bias — Cultural relevance, linguistic accessibility, UDL Principle II compliance, accessibility for learners with disabilities, and differential item functioning.

Pedagogical Framework Coverage

Bloom's Taxonomy — Anderson & Krathwohl (2001) revision
Webb's Depth of Knowledge — DOK levels 1-4
Understanding by Design — Wiggins & McTighe backward design
Zone of Proximal Development — Vygotsky's scaffolding theory
Visible Learning — Hattie's effect size research
Universal Design for Learning — CAST framework, 3 principles
Assessment FOR Learning — Stiggins formative assessment

Tags: education, pedagogy, assessment, learning, bloom, vygotsky, hattie, udl, rubric, feedback, curriculum, instructional-design, prover

1 tool exposes this connector's capabilities to your AI agent.

validate_pedagogical_assessment

You must:

(1) State learning OBJECTIVES using observable Bloom's verbs at the correct cognitive level: "understand" is not observable, "evaluate" is. Format: verb + content + condition.
(2) Design ASSESSMENT tasks that match the objective's cognitive demand. If the objective says "analyze," the assessment must require analysis, not recall.
(3) Provide explicit RUBRIC criteria: observable, measurable, distinct performance levels, shared with learners BEFORE assessment. If students do not know how they will be graded, they cannot target their learning.
(4) Design SCAFFOLDING: diagnose prior knowledge, identify prerequisites, graduated release (I do → We do → You do), multiple representations per UDL.
(5) Plan FEEDBACK per Hattie: Feed Up + Feed Back + Feed Forward. Task-specific, process-oriented, forward-looking. Not praise, not criticism.
(6) AUDIT for bias: cultural, linguistic, accessibility. Not "it is fair" but a structured analysis with specific mitigations.

If rejected, the pedagogical design has a structural deficiency. Fix before implementing.

Structured reflection tool for pedagogical reasoning and assessment design. It forces Bloom's taxonomy alignment, explicit rubric construction, scaffolded instruction design, Hattie-model feedback planning, and systematic bias auditing before any educational assessment, lesson plan, or instructional intervention ships. Grounded in Bloom's Revised Taxonomy, Vygotsky's ZPD, Hattie & Timperley (2007), and Universal Design for Learning (UDL).

Catches:

Taxonomy Misalignment (objective says one cognitive level, assessment tests another). Objective: "Students will EVALUATE competing hypotheses about climate change." Evaluate = Bloom's Level 5 (highest analytical level). Assessment: a 40-question multiple-choice test asking "Which gas is the primary greenhouse gas?" This tests REMEMBER (Level 1), not EVALUATE (Level 5). The students who can recall facts but cannot construct an evidence-based argument score 95%. The students who can evaluate but struggle with recall score 60%. The assessment measures the wrong skill; it is structurally invalid. If the objective says "evaluate," the assessment must require students to compare, weigh evidence, and defend a position, not select from predetermined answers.

Rubric Absent (grading without shared, explicit criteria). "I'll know good work when I see it" is not assessment; it is subjectivity. Two teachers grade the same essay. Teacher A: 82%. Teacher B: 67%. Neither is wrong; they simply weighted different things. A rubric with explicit dimensions (thesis clarity, evidence use, counter-argument, writing mechanics) and observable indicators at each level (exemplary: "thesis makes a debatable claim supported by 3+ sources" vs. developing: "thesis restates the prompt"), shared with students BEFORE the assignment, produces inter-rater reliability > 0.85 and student performance gains of d = 0.70 (Hattie). Without it, grades measure the teacher, not the student.

Scaffolding Gap (teaching at the target level without bridging from prior knowledge). Vygotsky's Zone of Proximal Development: learning happens between what a student can do alone and what they can do with guidance. Teaching ABOVE the ZPD produces frustration. Teaching BELOW produces boredom. A physics teacher assigns orbital mechanics problems to students who have not yet mastered vector addition. 40% of students fail, not because they cannot learn orbital mechanics, but because they are missing the prerequisite. Graduated release: I do (model) → We do (guided) → You do (independent). Multiple representations: diagram + equation + verbal explanation + simulation.

Feedback Vacuum (empty praise or criticism instead of actionable guidance). "Good job!" has an effect size of d = 0.09, nearly zero impact on learning. "Your thesis identifies the claim but lacks the counter-argument required by the rubric. Review the opposing evidence in Source 3 and add a paragraph addressing it" is task-specific, process-oriented, forward-looking feedback (d = 0.70). Hattie & Timperley (2007): Feed Up (where am I going?), Feed Back (how am I doing?), Feed Forward (where to next?). "Needs improvement" answers none of these questions.

Bias Blind (assessment that systematically disadvantages specific learner groups). A science assessment uses a baseball scenario to test physics (trajectory, velocity). Students familiar with baseball (predominantly US, male, middle-class) outperform students unfamiliar with the sport, not because of physics knowledge, but because of cultural context familiarity. The assessment measures cultural exposure, not scientific understanding. Bias audit: cultural relevance, linguistic accessibility for English language learners (ELL), UDL compliance, and accessibility for students with disabilities.

Call once per assessment, lesson plan, or educational intervention design.
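
The six requirements in the tool description can be pictured as six sections of a single design draft. This page does not publish the tool's input schema, so the field names below (`objectives`, `assessment`, `rubric`, `scaffolding`, `feedback`, `bias_audit`) and the helper `missing_sections` are assumptions made purely for illustration. The sketch is a local completeness check an agent might run before calling `validate_pedagogical_assessment`.

```python
# Hypothetical section names mirroring the six requirements; the real
# schema of validate_pedagogical_assessment may differ.
REQUIRED_SECTIONS = (
    "objectives", "assessment", "rubric",
    "scaffolding", "feedback", "bias_audit",
)

draft = {
    "objectives": "Students will evaluate competing hypotheses about "
                  "climate change using three provided sources.",
    "assessment": "Written argument comparing two hypotheses, weighing "
                  "evidence, and defending a position.",
    "rubric": "Thesis clarity, evidence use, counter-argument, writing "
              "mechanics; levels shared before the task.",
    "scaffolding": "Diagnostic quiz; I do -> We do -> You do; diagram "
                   "plus verbal explanation.",
    "feedback": "",  # left empty on purpose to show the check firing
    "bias_audit": "Sources checked for reading level; no culture-specific "
                  "scenarios.",
}


def missing_sections(payload):
    """Return the required sections that are absent or blank."""
    return [
        name for name in REQUIRED_SECTIONS
        if not str(payload.get(name, "")).strip()
    ]
```

Here `missing_sections(draft)` flags the empty feedback plan, the kind of structural gap the tool is described as rejecting.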

Connect to Claude

Subscribe on Vinkius, then add this connector to Claude.ai or Claude Code.

① Claude.ai (web app)

Go to Settings → Connectors → Add custom connector
Paste the MCP endpoint URL below

https://edge.vinkius.com/vk_preview_UeUp8EssFpuILTSEeOoXEkZc4SyrBn8H3T9iKl17/mcp

② Claude Code (terminal)

claude mcp add --transport http pedagogical-assessment-prover https://edge.vinkius.com/vk_preview_UeUp8EssFpuILTSEeOoXEkZc4SyrBn8H3T9iKl17/mcp

③ Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "pedagogical-assessment-prover": {
      "url": "https://edge.vinkius.com/vk_preview_UeUp8EssFpuILTSEeOoXEkZc4SyrBn8H3T9iKl17/mcp"
    }
  }
}
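
If claude_desktop_config.json already lists other servers, the new entry should be merged in rather than pasted over the file. The following is a minimal sketch using only the standard library. The function names are hypothetical, the entry shape is copied from the JSON above, and the file's location varies by operating system, so the path is left to the caller.

```python
import json
from pathlib import Path


def add_server(config, name, url):
    """Return a copy of `config` with one MCP server entry added or replaced."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = {"url": url}  # same shape as the snippet above
    merged["mcpServers"] = servers
    return merged


def update_config_file(path, name, url):
    """Read the config file if it exists, merge the entry, and write it back."""
    path = Path(path)
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    updated = add_server(config, name, url)
    path.write_text(json.dumps(updated, indent=2), encoding="utf-8")
```

Because `add_server` copies the dictionaries it changes, existing server entries and unrelated settings are preserved.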


The preview token above works for testing. Powered by Vinkius.

Details

Tools: 1
Grade: A+
Score: 100/100
Updated: Jun 28, 2026
