Skip to main content
Checkpoint 06Medium Priority4 failure conditions

Checkpoint 06: Metadata

PDFs must contain proper XMP metadata including PDF/UA identifier and a meaningful document title.

Related WCAG:2.4.2

Checkpoint 06: Metadata

PDFs must contain proper XMP metadata that identifies the document and declares PDF/UA conformance. This metadata helps users and assistive technologies understand and navigate documents.

What This Means

XMP (Extensible Metadata Platform) metadata is structured information embedded within the PDF that describes the document. For PDF/UA compliance, this metadata must include:

  1. XMP metadata stream: The technical container for metadata
  2. PDF/UA identifier: A declaration that the document claims PDF/UA conformance
  3. dc:title: A Dublin Core title element that identifies the document
  4. Meaningful title: The title must actually describe the document content

The metadata serves multiple purposes:

  • Identification: Helps users know what document they have opened
  • Accessibility declaration: Signals to assistive technology that accessibility features are present
  • Document management: Enables proper filing, searching, and cataloging
  • Screen reader behavior: The title is often announced when opening the document

Why It Matters

Document metadata significantly impacts the user experience for people using assistive technology:

  • Title announcement: Screen readers announce the document title when opening a PDF. A meaningful title helps users confirm they have the right document.
  • Tab labels: In browsers and PDF readers, the title appears in tabs and window titles
  • Search results: Document titles appear in search results, helping users find content
  • PDF/UA conformance: The PDF/UA identifier tells assistive technology to expect accessible features

Without proper metadata:

  • Screen readers may announce "Untitled" or a cryptic filename
  • Users cannot easily identify documents in tabs or task bars
  • PDF readers may not enable accessibility features
  • Accessibility validators cannot confirm PDF/UA conformance

Consider the difference between a screen reader announcing "document1-final-v3.pdf" versus "2024 Annual Accessibility Report - Beacon Corporation". The metadata transforms an opaque filename into useful identification.

Common Violations

The Matterhorn Protocol defines four failure conditions for metadata. The first three are machine testable, while the fourth requires human judgment.

06-001: Document Does Not Contain XMP Metadata Stream (Machine Testable)

What's Wrong: The PDF lacks an XMP metadata stream entirely. Without this technical structure, no standard metadata can be stored or read.

How to Identify:

  • PDF/UA validators will flag this automatically
  • In Acrobat, go to File > Properties > Description and check for basic document properties
  • If the Custom tab shows no XMP data, the stream may be missing

Why This Happens:

  • Very old PDF creation tools that predate XMP
  • PDFs created by minimal or non-standard software
  • Corrupted PDF files
  • PDFs generated by command-line tools without metadata options

06-002: Metadata Stream Does Not Include PDF/UA Identifier (Machine Testable)

What's Wrong: The XMP metadata exists but does not include the PDF/UA conformance identifier. This identifier declares that the document claims to meet PDF/UA requirements.

How to Identify:

  • PDF/UA validators will detect this automatically
  • The identifier should be in the pdfuaid namespace
  • Look for pdfuaid:part with value "1" (for PDF/UA-1)

Technical Details: The PDF/UA identifier should appear in XMP as:

<pdfuaid:part>1</pdfuaid:part>

This tells readers and validators: "This PDF claims PDF/UA-1 conformance."


06-003: Metadata Stream Does Not Contain dc:title (Machine Testable)

What's Wrong: The XMP metadata exists but lacks a dc:title element. Dublin Core title is a standard metadata field that should contain the document's human-readable title.

How to Identify:

  • PDF/UA validators will flag this
  • In Acrobat, go to File > Properties > Description
  • Check the "Title" field; if empty, dc:title is not set

Common Causes:

  • PDF exported without entering document properties
  • Source document (Word, InDesign) has no title set
  • Automated PDF generation that does not populate metadata

06-004: dc:title Does Not Clearly Identify the Document (Human Testing)

What's Wrong: A dc:title exists but does not meaningfully describe the document. Examples of poor titles:

  • "Untitled"
  • "Document1"
  • "Microsoft Word - report.docx"
  • "New Document"
  • "PDF"
  • Random characters or codes

How to Identify:

  • This requires human judgment
  • Read the title and ask: "Would a user know what this document is from the title alone?"
  • Consider whether the title is descriptive, accurate, and useful
  • Check if the title matches or relates to the actual document content

Examples of Good vs. Poor Titles:

Poor TitleBetter Title
Untitled2024 Q3 Financial Report
Document1.docxEmployee Handbook - Beacon Corp
Microsoft Word - policyTravel Expense Policy v2.1
ReportWebsite Accessibility Audit - January 2024
formTax Form W-4 (2024)

How to Fix in Adobe Acrobat

Adobe Acrobat provides straightforward tools for setting document metadata.

Setting Basic Metadata

  1. Open your PDF in Adobe Acrobat Pro
  2. Go to File > Properties (or press Ctrl/Cmd + D)
  3. On the Description tab:
    • Enter a clear, descriptive Title
    • Add Author, Subject, and Keywords as appropriate
  4. Click OK to save

Verifying Metadata Exists

  1. Go to File > Properties
  2. Click the Description tab
  3. Verify the Title field has a meaningful value
  4. Click Additional Metadata to view the full XMP structure
  5. Verify metadata is present and formatted correctly

Adding PDF/UA Identifier

The PDF/UA identifier typically requires special handling:

Method 1: Use the Accessibility Action Wizard

  1. Go to Tools > Action Wizard
  2. If available, use a PDF/UA remediation action
  3. This automatically adds the identifier when making PDFs accessible

Method 2: Use Preflight

  1. Go to Tools > Print Production > Preflight
  2. Search for "PDF/UA" profiles
  3. Run the Set PDF/UA-1 entry fixup
  4. This adds the required identifier to metadata

Method 3: Manual XMP Editing (Advanced)

  1. Go to File > Properties > Additional Metadata
  2. Click Advanced tab
  3. Add or modify the pdfuaid:part property
  4. This requires understanding XMP structure

Checking Metadata with Preflight

  1. Go to Tools > Print Production > Preflight
  2. Select a PDF/UA validation profile
  3. Run the check
  4. Review results for metadata-related failures
  5. Use built-in fixups to correct issues

How to Fix in Microsoft Word

Setting document properties in Word before PDF export ensures metadata transfers correctly.

Setting Document Properties

  1. Click File in the ribbon
  2. On the Info page, look at the Properties panel on the right
  3. Click Properties > Advanced Properties
  4. On the Summary tab:
    • Enter a clear, descriptive Title
    • Add Subject, Author, Keywords as appropriate
  5. Click OK

Quick Properties Access

  1. Click File > Info
  2. On the right panel, you can directly edit:
    • Title
    • Tags (keywords)
    • Comments
  3. Click the arrow next to "Properties" for more fields

Verifying Before Export

  1. Go to File > Info
  2. Check that Title shows your document title, not "Add a title"
  3. Verify other metadata fields are populated appropriately
  4. Export to PDF

PDF Export Options

  1. Go to File > Save As and choose PDF
  2. Click Options
  3. Ensure "Document properties" is checked under "Include non-printing information"
  4. This transfers Word metadata to the PDF

Using Document Panel

For frequent metadata editing:

  1. Go to File > Info > Properties > Show Document Panel
  2. This adds a metadata panel to your document view
  3. Edit properties directly while working

How to Fix in Other Applications

Adobe InDesign

  1. Go to File > File Info
  2. On the Description tab:
    • Enter a Title
    • Add other metadata as needed
  3. Click OK
  4. When exporting to PDF, metadata transfers automatically

LibreOffice

  1. Go to File > Properties
  2. On the Description tab, enter Title and other metadata
  3. Click OK
  4. Export to PDF; properties should transfer

Google Docs

  1. Note: Google Docs does not have traditional document properties
  2. The document name becomes the PDF title
  3. After downloading as PDF, edit metadata in Acrobat
  4. Or use a PDF post-processor to add metadata

Command-Line Tools

Using ExifTool to add metadata:

exiftool -Title="Document Title" -Author="Author Name" file.pdf

Using QPDF with JSON metadata:

qpdf --replace-input --linearize --object-streams=generate file.pdf

Testing Your Fix

Automated Testing

Adobe Acrobat:

  1. Go to Tools > Accessibility > Accessibility Check
  2. Select PDF/UA-1 as the standard
  3. Run the check
  4. Look for metadata-related failures
  5. Review any title-related warnings

PAC (PDF Accessibility Checker):

  1. Open the PDF in PAC
  2. Run the PDF/UA check
  3. Review Checkpoint 06 results
  4. Check all four failure conditions

veraPDF:

  1. Select PDF/UA-1 validation profile
  2. Run validation
  3. Review metadata rule results

Manual Verification

  1. Open the PDF in Acrobat Reader or similar
  2. Go to File > Properties
  3. Check that:
    • Title is displayed and meaningful
    • The title clearly identifies the document
    • The title is not a filename or generic placeholder

Screen Reader Testing

  1. Open the PDF with a screen reader
  2. Listen for the document title announcement
  3. Verify the announced title is meaningful
  4. Check if the title helps identify the document

Tab/Window Title Check

  1. Open the PDF in a web browser
  2. Look at the browser tab
  3. Verify the tab shows a meaningful title
  4. Open in Acrobat Reader and check the window title

Validation Checklist

  • XMP metadata stream exists
  • PDF/UA identifier is present (pdfuaid:part = 1)
  • dc:title element exists
  • Title is meaningful and descriptive
  • Title clearly identifies the document content
  • Title is not a filename or placeholder
  • Screen reader announces a useful title
  • Tab/window shows the title

Title Best Practices

What Makes a Good Title

A good document title:

  • Identifies the content: States what the document is about
  • Is specific: Distinguishes this document from similar ones
  • Is concise: Long enough to be clear, short enough to be readable
  • Includes context: Date, version, or organization if relevant
  • Is human-readable: Uses natural language, not codes

Title Writing Guidelines

  1. Start with the document type or content: "Annual Report", "Policy Manual", "Tax Form"
  2. Add specificity: What year? What department? What topic?
  3. Include organization if needed: Especially for external documents
  4. Add date/version for time-sensitive content: "(January 2024)", "v2.0"

Examples by Document Type

TypeExample Title
ReportQuarterly Sales Report - Q3 2024
PolicyInformation Security Policy v3.1
FormEmployee Leave Request Form
ManualUser Guide - Document Management System
PresentationBoard Meeting Presentation - December 2024
LegalTerms of Service - Beacon Software Inc.
AcademicIntroduction to Accessible Design - Course Syllabus

Additional Resources

Official Standards and Guidelines

XMP and PDF Metadata

Tools


This documentation is based on the Matterhorn Protocol 1.02, the definitive reference for PDF/UA validation. Three of the four metadata violations are machine testable; the meaningfulness of the title requires human judgment. For the most current information, consult the PDF Association and W3C WCAG guidelines.

Scan Your PDFs for Accessibility Issues

Beacon automatically detects PDF accessibility violations and shows you exactly how to fix them.

Start Free Scan