Inside PDF
Lecture @21C3
The Portable Document Format
A Short Introduction
Maik Musall <maik@musall.de> CCC Erlangen
Inside PDF Lecture @21C3 The Portable Document Format A Short - - PowerPoint PPT Presentation
Inside PDF Lecture @21C3 The Portable Document Format A Short Introduction Maik Musall <maik@musall.de> CCC Erlangen Overview History of PDF and it's relation to PostScript Licenses and legal issues File format syntax and
Lecture @21C3
A Short Introduction
Maik Musall <maik@musall.de> CCC Erlangen
Inside PDF Lecture @21C3 <maik@musall.de> Page 2
PDF 1.0 (1993) to 1.4
PDF 1.5, a contribution to storage and bandwidth
PDF 1.6, the 3D bloat
PDF/X (ISO standards)
PDF/A (upcoming ISO)
Inside PDF Lecture @21C3 <maik@musall.de> Page 3
turing-complete). PDF is not.
to all contained objects.
JavaScript, open actions etc.)
free use, but prohibit cloning the format.
Inside PDF Lecture @21C3 <maik@musall.de> Page 4
at the end.
may be appended at the end, leading to several body, xref and trailer sections.
all ASCII, if needed.
Generation is possible.
Inside PDF Lecture @21C3 <maik@musall.de> Page 5
Header Body XRef Table Trailer
%PDF-1.4 (some chars >0x80) Stream of objects like 24 0 obj <</Pages 22 0 R /Type /Catalog>> endobj xref 0 2 0000000234 00000 n trailer <<...>> startxref 1214 %%EOF
true false
0 1 5.4 -.002
(Hello World) <4D617465>
/Type /Pages
[ obj obj obj ]
<</Key1 val1 /Key2 val2>>
<<...>> stream...endstream
null
665 0 R
Inside PDF Lecture @21C3 <maik@musall.de> Page 6
selectively since PDF 1.5)
no longer in use)
access restriction settings.
Inside PDF Lecture @21C3 <maik@musall.de> Page 7
bookmarks), Logical structure, ...
child nodes (e.g. physical page dimensions).
* Header, XRef table, trailer * /Catalog, /Pages, one /Page, page content stream
Inside PDF Lecture @21C3 <maik@musall.de> Page 8
implementations can be reused (so does ghostscript)
images, and vector graphics
between user space and devices with different resolutions.
tems are de- scribed using transformation matrices
Inside PDF Lecture @21C3 <maik@musall.de> Page 9
changes in CTM, color, overprint, clipping, line patterns, transparency (PDF 1.4) etc.
a few others, grouped in Device-, CIE- and special color space groups
the document.
Inside PDF Lecture @21C3 <maik@musall.de> Page 10
scaling, leading, font name, font size, rendering mode, rise and knockout.
program), Type3 (arbitrary graphics operators), TrueType, CIDFonts.
complete font program
different styles, Symbol, ZapfDingbats) are considered built-in and required by every PDF processor to provide
Inside PDF Lecture @21C3 <maik@musall.de> Page 11
color space, compression.
ASCII85, LZW, Flate, RLE, CCITT, JBIG2 (PDF 1.4), DCT (Jpeg), JPX (Jpeg2000, PDF 1.5), Crypt (PDF 1.5).
a low-resolution image for fast screen viewing and a very-high-resolution image for printing.
Inside PDF Lecture @21C3 <maik@musall.de> Page 12
PDF 1.4).
and 3D objects (PDF 1.6).
Inside PDF Lecture @21C3 <maik@musall.de> Page 13
format
Platform) technology.
closer to realization.“ (Eric Miller, W3C Semantic Web Activity Lead)
Inside PDF Lecture @21C3 <maik@musall.de> Page 14
everything, but quite expensive
variants available. Good for creation, processing limited to copying whole pages.
flaws with PDF 1.5/1.4 hybrid updates a few months ago, but quickly developing, gcj compatible
implementation transforming XML to PDF
domain (creation, conversion, split/concat etc.)
Inside PDF Lecture @21C3 <maik@musall.de> Page 15
http://partners.adobe.com/asn/techresources.jsp
http://www.pdfx.info
http://www.planetpdf.com
http://www.pdf-tools.com
http://www.pdfzone.com
news://comp.text.pdf
Inside PDF Lecture @21C3 <maik@musall.de> Page 16
Inside PDF Lecture @21C3 <maik@musall.de> Page 17