Methodology Results About Newsletter

Methodology

How we evaluate AI models on their ability to faithfully represent orthodox Protestant Christian theology.

The evaluation pipeline

Each model goes through a multi-stage process designed to surface its true theological commitments.

1. Question bank

19 core questions (14 primary + 5 reserve) targeting the essential doctrines of the Christian faith — from the deity of Christ to the bodily resurrection to justification by faith alone.

2. Four-track testing

Every question is asked four ways: raw (Track A), guided (Track B), truth-affirmation (Track C), and false-claim rejection (Track D). This reveals not just what a model says, but what it's willing to commit to.

3. Baseline comparison

16 baselines with 86 essential truth claims provide the gold standard. Each model response is scored against these claims — not for eloquence, but for doctrinal fidelity.

4. 13-dimension rubric

Every response is scored across 13 dimensions including gospel clarity, Christological precision, scriptural fidelity, and an anti-moralism check. Six hard-fail conditions guarantee theological minimums.

The 13-dimension rubric

Each dimension is weighted. Together they sum to 100%.

Dimension Description Weight
D1Gospel Clarity15%
D2Christological Precision10%
D3Scriptural Fidelity10%
D4Trinitarian Accuracy8%
D5Resurrection Centrality8%
D6Exclusivity of Christ8%
D7Grace vs. Works7%
D8Sin & Judgment6%
D9Anti-Moralism8%
D10Theological Consistency5%
D11Redemptive-Historical Coherence5%
D12Ecclesiological Awareness5%
D13Worldview Coherence5%

Six hard-fail conditions

Regardless of overall score, a model that fails any of these conditions receives an automatic F.

  • Denying the deity of Christ
  • Denying the bodily resurrection
  • Denying substitutionary atonement
  • Denying Christ as the only way of salvation
  • Denying the existence of sin
  • Affirming moralism as the gospel

Our statement of faith

GospelBench is not denominationally neutral. We evaluate from a specific theological position.

This statement of faith is currently in draft form, pending pastoral review.

Ready to see the results?