Difference between revisions of "Word Document (DOCX)"

From ForensicsWiki

Revision as of 11:20, 31 January 2009

DOCX is the default document format for Microsoft Word 2007 and later.

DOCX should not be confused with DOC, the binary format used by earlier versions of Microsoft Word.

Container Format

DOCX consists of a ZIP archive containing XML and binaries. Content can be analysed without modifying the original file by unzipping it (e.g. in WinZip) and examining the contents of the archive.
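
As an alternative to a GUI tool, the archive members can be listed programmatically without extracting anything. The following is a minimal sketch using Python's standard zipfile module; the function name list_docx_members is hypothetical and chosen only for illustration.

```python
import sys
import zipfile


def list_docx_members(source):
    """Return (name, size, CRC-32, modified) for each archive member.

    `source` may be a path or a binary file-like object. The archive is
    opened read-only, so the evidence file itself is not altered.
    """
    with zipfile.ZipFile(source, "r") as archive:
        return [
            (info.filename, info.file_size, f"{info.CRC:08x}", info.date_time)
            for info in archive.infolist()
        ]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Print one tab-separated line per member of the file named on the command line.
    for name, size, crc, modified in list_docx_members(sys.argv[1]):
        print(f"{name}\t{size}\t{crc}\t{modified}")
```

The size, CRC-32 and timestamp come from the ZIP central directory, so they describe what the archive records about each member rather than anything verified against the member's data.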

Relationship to OOXML

For most purposes OOXML (Office Open XML) may be considered a subset of DOCX: DOCX contains additional features, such as OLE serialization.

Documentation on OOXML may provide a guide to analysing a DOCX file.
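
As an illustration of using the OOXML documentation as a guide, the sketch below reads paragraph text from the main document part. It assumes the part is stored at word/document.xml (the conventional location in files written by Word, though not guaranteed) and uses the WordprocessingML namespace from the OOXML specification; the function name docx_paragraphs is hypothetical.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace, as defined in the OOXML specification.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_paragraphs(source, part="word/document.xml"):
    """Return the text of each w:p element found in the given part.

    `part` defaults to the conventional main document location; this is an
    assumption and may not hold for every file.
    """
    with zipfile.ZipFile(source, "r") as archive:
        # Note: the standard library XML parser is not hardened against
        # maliciously constructed XML; treat untrusted files with care.
        root = ET.fromstring(archive.read(part))

    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # Text runs live in w:t elements; join them in document order.
        paragraphs.append(
            "".join(t.text or "" for t in para.iter(f"{{{W_NS}}}t"))
        )
    return paragraphs
```

This only recovers visible run text. Tracked changes, comments, embedded objects and metadata are held in other elements and parts, which the OOXML documentation describes.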

External Links