Designing Accessible Assessments

Assessment is where accessibility most visibly collides with claims about validity and fairness.

As argued in Ensuring Digital Accessibility in Learning Design, time‑limited tasks, fixed modalities, and tightly constrained formats often privilege a small subset of ‘ideal’ learners rather than learning outcomes. Accessibility requirements do not introduce new problems, but they do highlight the conditions under which assessment, and learning, fail.

This matters because assessment is not neutral. It helps determine educational, and often employment, trajectories by shaping what counts as evidence, what performance is possible, and what can reasonably be claimed about learning.

Assessment as a claim about learning

Every assessment makes a claim, whether explicit or implicit, about what a learner knows or can do.

These claims rest on a set of assumptions: that performance in this format reflects capability elsewhere, that timing does not distort outcomes, and that modality does not privilege some learners over others. Under stable and favourable conditions, these assumptions can appear reasonable. The assessment appears fair, consistent, and aligned with its intended outcome.

Accessibility disrupts that sense of stability.

When adjustments are required to make an assessment usable, the issue is often framed as accommodation. In practice, it exposes something prior: that the original design depended on conditions that do not hold for all learners. The question is no longer how to adapt the task, but whether it was ever measuring what it claimed to measure.

When performance becomes selective

Assessment formats shape what can be demonstrated.

Timed tasks, for example, reward fluency under pressure. Fixed formats depend on particular combinations of language, memory, sensory access, and confidence under evaluation. These are not neutral features of design, and they may have little to do with the knowledge or capability the assessment claims to measure. They are, however, constraints that structure participation.

Differences in performance are often framed as differences in learner ability, but when accessibility adjustments are introduced, changing the conditions required to complete the task, performance may change.

At that point, the relationship between performance and learning becomes unstable.

What can be observed and claimed is still performance under those conditions, but generalisations and assumptions cannot be made about how that performance reflects capability beyond the assessment.

Accessibility as a diagnostic

Accessibility requirements do not introduce new problemsbut they do act as a test of design validity, exposing where assessments are dependent on idealised conditions.

Accessibility does not lower standards. It raises the threshold for what can be claimed because it requires that evidence reflects understanding rather than adaptation to format.

Where a design must be altered for learners to participate, it reveals which aspects of the task are essential and which are incidental. Where outcomes shift under adjusted conditions, it raises questions about what the assessment was actually capturing in the first place.

This is not an edge case. It is a diagnostic function that exposes where design has conflated performance with capability, or consistency with fairness.

Uniform conditions are often treated as fair because they are consistent, but accessibility complicates this idea because it reveals that consistency does not produce equivalence when learners do not experience those conditions equally.

At the same time, unlimited flexibility risks undermining coherence and comparability so assessment cannot be infinitely adaptable without consequence.

Designing for fairness therefore becomes a judgement under constraint. It involves balancing consistency with access and ensuring that variation does not distort the meaning of the outcome. This requires transparency about what an assessment can and cannot show.

Designing for variability

Treating accessibility as a design condition changes the problem because the question is no longer how to adapt assessments for different learners, but what must be true of the design for it to hold under real conditions.

This shifts attention onto what is essential within the task and what has been inherited or assumed. It requires a separation between the outcome being assessed and the form in which it is expressed. It asks whether the assessment remains interpretable when pace varies, when modalities differ, or when access is partial rather than continuous. Does the assessment still assess the knowledge and skills intended under those conditions?

These shifts introduce tension.

Increasing flexibility may improve validity for more learners while making standardisation and comparability harder to defend institutionally. These are not problems to be eliminated. They are design decisions to be made explicitly, rather than left implicit within the assessment format.

What assessment can support

Accessibility makes visible the specific design decisions that shape what assessments are able to demonstrate. Despite how results are treated, assessments only show what learners were able to do in that particular moment, under those particular conditions. Performance does not, on its own, demonstrate learning, and it is not a proxy for capability. There is no guarantee of transfer, durability, or performance beyond the assessment. If there were, there would be no need for levels 3 and 4 of Kirkpatrick’s Model.

Recognising this does not weaken assessment. It constrains what can reasonably be inferred from it.

Closing position

Accessibility does not sit alongside assessment as an additional concern. It reveals which tasks rely on idealised assumptions, which forms of performance depend on constraint, and which claims about learning remain defensible under variation.

When accessibility is treated as non‑negotiable, assessment design becomes less about producing consistent outputs and more about ensuring that those outputs mean what they say they mean.

That is a higher standard.

It is also the only one that holds once real conditions are taken seriously.

The Learning Thread