Difference between revisions of "PDF"

From ForensicsWiki

Revision as of 22:42, 5 October 2008

The Portable Document Format (PDF) is a document format created by Adobe and published as the open standard ISO 32000-1 in 2008. PDF files are widely available on the web.

Format

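Each file begins with the string %PDF- followed by a version number (for example, %PDF-1.4). The file ends with the marker %%EOF, but a single file can contain several %%EOF markers, for instance one for each incremental update appended to it. This often confuses file carvers such as foremost that search for footers.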
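
The header and footer behaviour described above can be illustrated with a minimal Python sketch. It assumes the candidate data is already in memory as bytes and that the last %%EOF after the header is the true end of the file; that assumption holds for a single PDF with incremental updates, but not for a disk image that contains several PDFs. The function names find_all and pdf_extent are hypothetical and do not come from any existing tool.

```python
from typing import List, Optional, Tuple

HEADER = b"%PDF-"      # start-of-file signature
EOF_MARKER = b"%%EOF"  # end marker; may occur more than once


def find_all(data: bytes, needle: bytes) -> List[int]:
    """Return the byte offset of every occurrence of needle in data."""
    offsets = []
    pos = data.find(needle)
    while pos != -1:
        offsets.append(pos)
        pos = data.find(needle, pos + 1)
    return offsets


def pdf_extent(data: bytes) -> Optional[Tuple[int, int]]:
    """Return (start, end) from the first header to the LAST %%EOF after it.

    Assumption: data holds at most one PDF. Stopping at the first
    %%EOF instead would truncate a file with incremental updates.
    """
    start = data.find(HEADER)
    if start == -1:
        return None
    eofs = [off for off in find_all(data, EOF_MARKER) if off > start]
    if not eofs:
        return None
    return start, eofs[-1] + len(EOF_MARKER)
```

A carver that stops at the first footer would recover only the original revision of the document; taking the last marker keeps the later updates as well.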

Metadata

Some or all of the metadata can be extracted with pdfinfo, a utility that is part of the xpdf package.
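
As a rough sketch of how pdfinfo could be scripted, the Python below runs the tool and splits its "Key: value" output lines into a dictionary. It assumes pdfinfo is installed and on the PATH; the function names parse_pdfinfo_output and pdfinfo_metadata are hypothetical, and the fields actually reported vary from file to file.

```python
import subprocess
from typing import Dict


def parse_pdfinfo_output(text: str) -> Dict[str, str]:
    """Split pdfinfo's 'Key:   value' lines into a dictionary."""
    fields = {}
    for line in text.splitlines():
        # Split on the first colon only, so values that contain
        # colons (such as timestamps) are kept intact.
        key, sep, value = line.partition(":")
        if sep and key.strip():
            fields[key.strip()] = value.strip()
    return fields


def pdfinfo_metadata(path: str) -> Dict[str, str]:
    """Run pdfinfo on path (assumes pdfinfo is on the PATH)."""
    result = subprocess.run(
        ["pdfinfo", path], capture_output=True, text=True, check=True
    )
    return parse_pdfinfo_output(result.stdout)
```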

Embedded Objects

You can use pdfimages, part of the xpdf package, to extract all of the images from a PDF file and write each one to its own file.
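
A minimal sketch of driving pdfimages from Python follows. It assumes pdfimages is installed and on the PATH, and it uses the -j option so that JPEG images are written out as JPEG files. The function names and the default prefix "img" are hypothetical choices made for this example.

```python
import subprocess
from pathlib import Path
from typing import List


def pdfimages_command(pdf_path: str, out_dir: str, prefix: str = "img") -> List[str]:
    """Build the pdfimages command line.

    pdfimages takes the PDF file and an "image root"; it writes
    numbered files whose names begin with that root.
    """
    image_root = str(Path(out_dir) / prefix)
    return ["pdfimages", "-j", pdf_path, image_root]


def extract_images(pdf_path: str, out_dir: str, prefix: str = "img") -> List[Path]:
    """Run pdfimages (assumed to be on the PATH) and list what it wrote."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(pdfimages_command(pdf_path, out_dir, prefix), check=True)
    return sorted(Path(out_dir).glob(prefix + "-*"))
```

Writing the images into a separate, initially empty directory keeps the extracted files apart from the evidence and makes them easy to hash afterwards.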


External Links

See Also

Tools:Document Metadata Extraction