Difference between revisions of "Word Document (DOCX)"

From Forensics Wiki
Jump to: navigation, search
m
Line 1: Line 1:
 
DOCX is the file format for Microsoft Office 2007 and later.
 
DOCX is the file format for Microsoft Office 2007 and later.
  
DOCX should not be confused with [[DOC]], the format used by earlier versions of Microsoft Office
+
DOCX should not be confused with [[DOC]], the format used by earlier versions of Microsoft Office.
  
 
= Container Format =
 
= Container Format =
  
DOCX consists of a [[ZIP]] file containing XML and binaries. Content can be analysed without modification by unzipping the file (Eg, in WinZIP) and analysing the contents of the archive.
+
DOCX consists of a [[ZIP]] file containing [[XML]] and binaries. Content can be analysed without modification by unzipping the file (e.g. in WinZIP) and analysing the contents of the archive.
  
 
= Relationship to OOXML =
 
= Relationship to OOXML =

Revision as of 15:00, 7 November 2008

DOCX is the file format for Microsoft Office 2007 and later.

DOCX should not be confused with DOC, the format used by earlier versions of Microsoft Office.

Container Format

DOCX consists of a ZIP file containing XML and binaries. Content can be analysed without modification by unzipping the file (e.g. in WinZIP) and analysing the contents of the archive.

Relationship to OOXML

For most purposes OOXML may be considered a subset of DOCX (DOCX contains additional features, like OLE serialization).

Documentation on OOXML may provide a guide to analysing a DOCX file.