Network Working Group E. Taft
Request for Comments: 3778 J. Pravetz
Category: Informational S. Zilles
L. Masinter
Adobe Systems
May 2004
The application/pdf Media Type
Status of this Memo
This memo provides information for the Internet community. It does
not specify an Internet standard of any kind. Distribution of this
memo is unlimited.
Copyright Notice
Copyright (C) The Internet Society (2004). All Rights Reserved.
Abstract
PDF, the 'Portable Document Format', is a general document
representation language that has been in use for document exchange on
the Internet since 1993. This document provides an overview of the
PDF format, explains the mechanisms for digital signatures and
encryption within PDF files, and updates the media type registration
of 'application/pdf'.
Table of Contents
1. Introduction. . . . . . . . . . . . . . . . . . . . . . . . . 2
2. History . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
3. Fragment Identifiers. . . . . . . . . . . . . . . . . . . . . 3
4. Encryption. . . . . . . . . . . . . . . . . . . . . . . . . . 4
5. Digital Signatures. . . . . . . . . . . . . . . . . . . . . . 5
6. PDF implementations . . . . . . . . . . . . . . . . . . . . . 8
7. Security Considerations . . . . . . . . . . . . . . . . . . . 8
8. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 9
9. References. . . . . . . . . . . . . . . . . . . . . . . . . .
9.1. Normative References. . . . . . . . . . . . . . . . . . 10
9.2. Informative References. . . . . . . . . . . . . . . . . 10
10. Authors' Addresses. . . . . . . . . . . . . . . . . . . . . . 12
11. Full Copyright Statement. . . . . . . . . . . . . . . . . . . 14
Taft, et al. Informational [Page 1]
RFC 3778 The application/pdf Media Type May 2004
1. Introduction
This document is intended to provide updated information on the
registration of the MIME Media Type "application/pdf", with
particular focus on the features that help mitigate security
concerns. This document refers to features documented in the PDF
References versions 1 [1], 1.3 [2], 1.4 [3] and 1.5 [4], as updated
by errata [5].
PDF is used widely in the Internet Community. Since PDF was
introduced in 1993, it has grown to be a widely-used format for
capturing and exchanging formatted documents electronically, across
the Web, via e-mail, and, for that matter, virtually every other
document exchange mechanism.
PDF represents formatted documents. These documents may be
structured or simple. They may contain text, images, graphics, and
other multimedia content, such as video and audio. There is support
for annotations, metadata, hypertext links, and bookmarks.
PDF supports encryption and digital signatures in the document. The
encryption capability is also combined with access control
information in a way that is intended to manage the uses that a
recipient can make of a document.
PDF usage is specified in other international standards. ISO 15930-
1:2001 PDF/X [16] has been adopted as the exchange standard for
electronic documents within the Prepress community. PDF/X is a
profile of PDF that references the PDF Reference, Third edition [2],
as the source specification.
Another profile of PDF, known as PDF/A [17], is being developed for
use as an international standard as an electronic document file
format for long-term preservation. Following the work on PDF/X, the
activity is joint work between NPES (The Association for Suppliers of
Printing, Publishing and Converting Technologies) and AIIM
International (the Association for Information and Image Management,
International). AIIM is the secretariat for ISO/TC 171 SC2, Document
Imaging Applications.
PDF usage is widespread enough for 'application/pdf' to be used in
other IETF specifications. RFC 2346 [15] describes how to better
structure PDF files for international exchange of documents where
different paper sizes are used; HTTP byte range retrieval is
illustrated using application/pdf (RFC 2616 [14], Section 19.2); RFC
3297 [13] illustrates how PDF can be sent to a recipient that
identifies his ability to accept the PDF using content negotiation.
Taft, et al. Informational [Page 2]
RFC 3778 The application/pdf Media Type May 2004
2. History
PDF was originally envisioned as a way to communicate and view
printed information electronically across a wide variety of machine
configurations, operating systems, and communication networks in a
reliable manner.
PDF relies on the same imaging model as the PostScript page
description language to render complex text, images, and graphics in
a device and resolution-independent manner, bringing this feature to
the screen as well as the printer. To improve performance for
interactive viewing, PDF defines a more structured format than that
used by most PostScript language programs. PDF also includes
objects, such as hypertext links and annotations, that are not part
of the page itself, but are useful for building collections of
related documents and for reviewing and commenting on documents.
The application/pdf media type was first registered in 1993 by Paul
Lindner for use by the gopher protocol; the registration was
subsequently updated in 1994 by Steve Zilles.
3. Fragment Identifiers
The handling of fragment identifiers [6] is currently defined in
Adobe Technical Note 5428 [7]. This section summarizes that
material.
A fragment identifier consists of one or more PDF-open parameters in
a single URL, separated by the ampersand (&) or pound (#) character.
Each parameter implies an action to be performed and the value to be
used for that action. Actions are processed and executed from left
to right as they appear in the character string that makes up the
fragment identifier.
The PDF-open parameters allow the specification of a particular page
or named destination to open. Named destinations are similar to the
"anchors" used in HTML or the IDs used in XML. Once the target is
specified, the view of the page in which it occurs can be specified,
either by specifying the position of a viewing rectangle and its
scale or size coordinates or by specifying a view relative to the
viewing window in which the chosen page is to be presented.
Taft, et al. Informational [Page 3]
RFC 3778 The application/pdf Media Type May 2004
The list of PDF-open parameters and the action they imply is:
nameddest=
Open to a specified named destination (which includes a view).
page=
Open the specified (physical) page.
zoom=,,
Set the and scrolling factors. , and are
measured from the top left corner of the page, independent of the
size of the page. The pair and are optional but both
must appear if present.
view=,
Set the view to show some specified portion of the page or its
bounding box; keywords are defined by Table 8.2 of the PDF
Reference, version 1.5. The value is required for some
of the keywords and not allowed for others.
viewrect=,,,
As with the zoom parameter, set the scale and scrolling factors,
but using an explicit width and height instead of a scale
percentage.
highlight=,