What Is Accessibility Metadata in EPUB, and Why Do Most Files Fail Without It?
Accessibility metadata in EPUB is structured information, embedded using schema.org vocabulary and reflected in ONIX product records, that describes how accessible a digital publication actually is, covering things like navigation, alternative text, and reading order. Most EPUB files that pass WCAG 2.1 AA conformance checks still fail accessibility audits because the metadata describing that conformance is missing, incomplete, or never makes it into the file’s package.opf or the retailer’s ONIX feed.
Key Takeaways
- A file can be technically WCAG-conformant and still fail an accessibility audit if it lacks the schema.org accessibility metadata that declares this conformance.
- ONIX accessibility metadata (codelists 195, 196, and 197) is what retailers, libraries, and aggregators use to surface accessible titles to readers who need them.
- ithout proper metadata, accessible EPUBs are effectively undiscoverable to users searching for accessible content, screen reader users, and procurement systems checking compliance.
- This is a high-impact, low-competition area: most accessibility content online focuses on the EPUB file itself, not the metadata layer that makes accessibility discoverable and auditable.
- BISG (Book Industry Study Group) maintains the primary guidance for implementing this metadata correctly across both EPUB and ONIX.
Why Does an Accessible EPUB Still Fail Compliance Audits?
An EPUB can pass every technical accessibility check, alt text present, reading order correct, navigation landmarks in place, and still fail a compliance audit for one reason: the file doesn’t say it’s accessible in a machine-readable way.
Compliance audits, particularly under the European Accessibility Act (EAA) and ADA Title II, increasingly check for two things, not one:
- Does the EPUB meet WCAG 2.1 AA requirements at the content and code level?
- Does the EPUB declare this conformance through standardized accessibility metadata that downstream systems, retailers, and libraries can read?
A file can satisfy the first condition entirely and fail the second. Auditors and procurement teams often rely on metadata fields to verify compliance at scale, rather than manually inspecting every file. If the metadata is absent, the file is frequently flagged as non-compliant by default, regardless of its actual internal accessibility.
What Is Schema.org Accessibility Metadata, and Where Does It Go in an EPUB?
Schema.org accessibility metadata is a set of standardized properties, originally developed through a collaboration between the DAISY Consortium, the EPUB accessibility community, and schema.org, that describe the accessibility features of a digital publication in a way search engines, retailers, and assistive technologies can read.
The core properties include:
Schema.org Property
What It Describes
Example Value
accessibilityFeature
Specific accessibility features present
alternativeText, readingOrder, tableOfContents, MathML
accessibilityHazard
Potential hazards for users with certain conditions
noFlashingHazard, noMotionSimulationHazard
accessibilitySummary
Human-readable summary of accessibility for the title
Free text description
accessibilityAPI
Accessibility APIs supported
ARIA
accessibilityControl
How the content can be controlled
fullKeyboardControl, fullMouseControl
conformsTo
Formal conformance standard
EPUB Accessibility 1.1, WCAG 2.1 Level AA
In the EPUB file itself, this metadata belongs in the package.opf file, within the <metadata> element, using the schema: namespace. This is distinct from, but related to, the EPUB Accessibility 1.1 specification’s own conformance and certification metadata requirements.
The most commonly missing property in real-world EPUB files is accessibilitySummary. Even files with correctly tagged accessibilityFeature values frequently omit the human-readable summary, which is often the field retailers and library systems prioritize for display to end users.
How Does ONIX Accessibility Metadata Differ From EPUB Metadata?
ONIX (ONline Information eXchange) is the metadata standard publishers use to communicate product information to retailers, distributors, and libraries, separately from the EPUB file itself. ONIX accessibility metadata carries the same underlying information as the schema.org properties inside the EPUB, but in a format retailers’ systems are built to ingest at the catalog level.
Key differences:
Aspect
EPUB (Schema.org) Metadata
ONIX Metadata
Location
Embedded inside the EPUB file (package.opf)
Sent separately as part of the product record/feed
Primary audience
Reading systems, assistive technology, the file itself
Retailers, distributors, library systems, aggregators
Used for
Determining how the content behaves for the reader
Determining how the title is displayed and filtered in a catalog
Relevant codelists
N/A (uses schema.org vocabulary)
ONIX Codelist 196 (Accessibility feature), Codelist 195 (Accessibility hazard), Codelist 197 (Accessibility API)
Common failure point
Missing accessibilitySummary
Accessibility fields present in EPUB but never mapped to the ONIX feed sent to retailers
This is the gap that causes the most real-world compliance failures. A publisher may correctly tag the EPUB file but never update their ONIX feed to reflect it, meaning the title appears as “accessibility unknown” or “no information provided” on retailer platforms, libraries, and aggregator sites, even though the underlying file is fully accessible.
Why Does Accessibility Metadata Affect EPUB Discoverability?
EPUB discoverability for accessible titles depends almost entirely on metadata, not on the file’s actual internal accessibility, because discovery happens at the catalog level, before a reader ever opens the file.
Three discoverability impacts of missing or incomplete accessibility metadata:
- Library and institutional procurement systems filter by metadata. Many library systems, particularly those serving K-12 and academic institutions under ADA Title II obligations, allow filtering or flagging titles by accessibility conformance. A title without this metadata is often excluded from “accessible” results entirely, regardless of its actual conformance.
- Retailer accessibility labels depend on ONIX fields. Major retailers display accessibility information (such as “screen reader friendly” or accessibility feature badges) based on ONIX accessibility codelists. Without this data in the feed, no label appears, and the title is effectively invisible to readers specifically searching for accessible content.
- Search engines and AI-driven discovery tools increasingly surface accessibility information. As schema.org metadata is also used by search engines for structured data, properly tagged accessibility metadata can influence how a title appears in search results and AI-generated summaries, an emerging factor as generative search becomes more prevalent.
What Does a Complete Accessibility Metadata Checklist Look Like?
A complete accessibility metadata implementation, covering both the EPUB file and the ONIX feed, should include the following:
Inside the EPUB (package.opf):
- [ ] accessibilityFeature values listed for all applicable features (alt text, structural navigation, MathML, etc.)
- [ ] accessibilityHazard values declared, including “none” hazards explicitly (e.g., noFlashingHazard)
- [ ] accessibilitySummary written in clear, human-readable language
- [ ] accessibilityAPI declared (typically ARIA)
- [ ] accessibilityControl values present
- [ ] conformsTo referencing the correct standard (EPUB Accessibility 1.1, WCAG 2.1 AA)
In the ONIX feed:
- [ ] Codelist 196 (Accessibility feature) populated to match EPUB metadata
- [ ] Codelist 195 (Accessibility hazard) populated
- [ ] Codelist 197 (Accessibility API) populated
- [ ] Accessibility summary text included in the relevant ONIX text field
- [ ] Conformance certification details included where applicable (e.g., EPUB Accessibility 1.1 conformance statement)
Process checks:
- [ ] Metadata updated whenever the EPUB file itself is revised
- [ ] EPUB metadata and ONIX feed kept in sync as part of a single QC step, not handled separately
- [ ] Metadata validated against current BISG guidance, since codelists and recommended fields are periodically updated
How Can Publishers Fix This Without Re-Auditing Every Title?
Publishers with large backlists don’t need to re-run a full accessibility audit on every title to fix metadata gaps. A more efficient approach:
- Audit metadata first, separately from file-level accessibility. Many backlist titles may already meet WCAG requirements internally but simply lack declared metadata. A metadata-only audit is faster and often resolves the majority of “non-compliant” flags.
- Build a single metadata template per accessibility conformance level. Since most titles within a given production workflow share the same accessibility features (same navigation structure, same alt text practices), a standardized metadata block can often be applied across many titles with minimal per-title customization.
- Synchronize EPUB and ONIX updates in one pass. Treat the EPUB package.opf update and the ONIX feed update as a single task per title, rather than two separate projects, to avoid the most common failure point: EPUB updated, ONIX not.
- Validate against current BISG and EPUB Accessibility 1.1 guidance, as both the schema.org accessibility vocabulary and ONIX codelists have been revised in recent years, and older implementations may use deprecated field names or values.
Frequently Asked Questions
Is accessibility metadata legally required under the European Accessibility Act? The EAA requires that digital publications meet WCAG 2.1 AA conformance, and accessibility metadata is the primary mechanism by which this conformance is communicated and verified at scale. While the legal requirement centers on the content’s actual accessibility, metadata is increasingly treated as essential evidence of compliance during audits and procurement reviews.
Can a publisher add accessibility metadata without modifying the EPUB’s content or layout? Yes. Accessibility metadata is added to the package.opf file’s metadata section and does not require changes to the EPUB’s reading content, layout, or styling. It is a metadata-only update, which is one reason it is often overlooked, since it has no visible effect within the reading application itself.
What is the difference between EPUB Accessibility 1.0 and 1.1, and does it affect metadata? EPUB Accessibility 1.1 refined and clarified several metadata requirements compared to 1.0, including how conformance and certification are declared. Files tagged against the 1.0 specification may use older property structures that should be reviewed against current 1.1 guidance to ensure ongoing compatibility with audit tools.
How do retailers and libraries actually use ONIX accessibility codelists? Retailers and library systems ingest ONIX product feeds and use the accessibility codelist values, such as Codelist 196 for accessibility features, to populate filters, badges, and search facets. A title with no values in these fields typically appears with no accessibility information at all, rather than being assumed accessible or inaccessible.
How Wordium Can Help
Wordium’s accessibility services cover the full metadata layer, not just file-level remediation. This includes auditing existing EPUB files for schema.org accessibility metadata completeness, reviewing and updating ONIX accessibility feeds to match, and building standardized metadata templates for backlist titles to avoid re-auditing every file individually.
If your accessible titles aren’t appearing as accessible to retailers, libraries, or procurement systems, the issue is often metadata, not the file itself. Contact Wordium’s accessibility team for a metadata audit.
Related Wordium Services: Accessibility (WCAG/EPUB) | eBook Creation (EPUB3) | Digital Conversion (XML, HTML, PDF)
Sources
- Book Industry Study Group (BISG) — Accessibility Metadata guidance and resources
- EPUB Accessibility 1.1 specification (W3C)
- Schema.org Accessibility Properties documentation
- EDItEUR — ONIX for Books accessibility codelists
Author: Wordium Editorial Team Word count: ~1,800
Comments
Post a Comment