Measures
- Consent and disclosure coverage across the user journey.
- Context boundary alignment between model behavior and UI framing.
- User control availability and visibility in the flow.
Lightweight evaluation to check if a model and its surrounding UI respect consent and context limits.
Jump to
Key sections
Overview
Best for teams validating consent, disclosure, and context limits before an AI pilot.
Estimated time: 30–45 minutes
Copy citation (APA/BibTeX)
APA
Ethotechnics Institute Diagnostics Lab. (2026). LLM Capacity Benchmark. Ethotechnics Institute. https://ethotechnics.org/diagnostics/llm-capacity-benchmark
MLA
Ethotechnics Institute Diagnostics Lab. "LLM Capacity Benchmark." Ethotechnics Institute, 2026, https://ethotechnics.org/diagnostics/llm-capacity-benchmark.
Chicago
Ethotechnics Institute Diagnostics Lab. "LLM Capacity Benchmark." Ethotechnics Institute. Jan 9, 2026. https://ethotechnics.org/diagnostics/llm-capacity-benchmark.
BibTeX
@misc{diagnostic_llm-capacity-benchmark,
title={LLM Capacity Benchmark},
author={Ethotechnics Institute Diagnostics Lab},
year={2026},
howpublished={Ethotechnics Institute},
url={https://ethotechnics.org/diagnostics/llm-capacity-benchmark},
version={v1.1.0}
}
RIS
TY - WEB TI - LLM Capacity Benchmark AU - Ethotechnics Institute Diagnostics Lab PY - 2026 UR - https://ethotechnics.org/diagnostics/llm-capacity-benchmark ER -
Methodology
Inputs, scoring logic, validation notes, and failure modes used in the benchmark.
Inputs
Procedure
Outputs
Measures
Does not measure
Assumptions
Instrument prompts
Rubric
Scoring logic
Validation notes
Piloted with early-stage AI pilots and consent-heavy workflows to refine rubric language.
Paired reviewers reconcile scores in a short calibration session; discrepancies drop after alignment.
Replicability
Example outputs
Request via Studio
We run the benchmark with you and deliver the linked readout.