PDFspy

PDFspy is the ultimate “get info” utility for your PDF documents. It can extract a comprehensive list of attributes from a PDF file into an XML-based format.

New features and enhancements including:

Support for PDF 1.7/ISO 32000 (Acrobat 9, X, DC)
Element now shows CMYK separations that are actually used by text and vector elements
New element that shows the number of shading objects in PDF file
Restored output being written to stdout if -o option not used, recommend using -quiet option when writing to stdout
Fixed calculation of page labels
Improved text extraction algorithm
Calculates color simulation values for ICCBased, Separation and DeviceN colorspaces
Improved Unicode, ISO Latin and AdobePDF character set support

Some examples of the many types of information PDFspy can extract:

Page information (count, size, boxes)
Fonts usage (name, type, embedding & subset status, use of Unicode)
Colorspaces used (alternates, separation names, index bases)
Images (size, resolution, compression, colorspace)
Use of transparency, smooth shadings and patterns
Presence (or absence) of hidden text and optional content/layers
Hyperlinks (size, location and destination)
Annotations (size, location, type, contents, colors)
PDF/X compliance (including output intent details)
Metadata (info dictionary & XMP)
Security and Encryption settings

Example uses:

Asset management system: extract page count, metadata, font & image information
Document management: determine text or image only documents, extract comments
Preflight: extract information about colorspaces, compression & font types
Developers: easily examine the structure of complex PDF documents

ORDER PDF Spy