Difference between pages "Word Document (DOCX)" and "Google Chrome"

From ForensicsWiki
(Difference between pages)
 
(External Links)
 
Line 1: Line 1:
DOCX is the file format for Microsoft Office 2007 and later.  
+
Google Chrome is a [[Web Browser|web browser]] developed by Google Inc.
  
DOCX should not be confused with [[DOC]], the format used by earlier versions of Microsoft Office.
+
== Configuration ==
 +
The Google Chrome configuration can be found in the '''Preferences''' file.
  
= Container Format =
+
On Linux
 +
<pre>
 +
/home/$USER/.config/google-chrome/Default/Preferences
 +
</pre>
  
DOCX is written in an XML format, which consists of a [[ZIP archive]] file containing [[XML]] and binaries. Content can be analysed without modification by unzipping the file (e.g. in WinZIP) and analysing the contents of the archive.
+
On Mac OS X
 +
<pre>
 +
/Users/$USER/Library/Application Support/Google/Chrome/Default/Preferences
 +
</pre>
  
The file _rels/.rels contains information about the structure of the document.  It contains paths to the metadata information as well as the main XML document that contains the content of the document itself.
+
On Windows XP
 +
<pre>
 +
C:\Documents and Settings\%USERNAME%\Local Settings\Application Data\Google\Chrome\User Data\Default\Preferences
 +
</pre>
  
Metadata information are usually stored in the folder docProps.  Two or more XML files are stored inside that folder, app.xml that stores metadata information extracted from the Word application itself and core.xml that stores metadata from the document itself, such as the author name, last time it was printed, etc.
+
On Windows Vista and later
 +
<pre>
 +
C:\Users\%USERNAME%\AppData\Local\Google\Chrome\User Data\Default\Preferences
 +
</pre>
  
Another folder contains the actual content of the document, in a Word document, or an .docx document the folder's name is word.  A XML file called document.xml is the main document, containing most of the content of the document itself.
+
Or for '''Chromium'''
  
= Relationship to OOXML =
+
On Linux
 +
<pre>
 +
/home/$USER/.config/chromium/Default/Preferences
 +
</pre>
  
Office Open XML is an open XML standard developed by Microsoft for word processing documents, spreadsheets, presentations and charts. The OOXML standard was submitted to the ISO for approval.  After initially being rejected over technical concerns, the ISO approved a modified version as ISO/IEC 29500:2008. Microsoft intended to use the OOXML standard for its Office suite. However, Office does not support the standard that the ISO approved, it only supports the standard that was originally rejected by the ISO[http://arstechnica.com/microsoft/news/2010/04/iso-ooxml-convener-microsofts-format-heading-for-failure.ars]. As of Office 2010, Microsoft has still not brought its software into compliance with the standard.
+
On Mac OS X
 +
<pre>
 +
/Users/$USER/Library/Application Support/Chromium/Default/Preferences
 +
</pre>
  
For most purposes OOXML may be considered a subset of DOCX (DOCX contains additional features, like OLE serialization).
+
On Windows XP
 +
<pre>
 +
C:\Documents and Settings\%USERNAME%\Local Settings\Application Data\Chromium\User Data\Default\Preferences
 +
</pre>
 +
 
 +
On Windows Vista and later
 +
<pre>
 +
C:\Users\%USERNAME%\AppData\Local\Chromium\User Data\Default\Preferences
 +
</pre>
 +
 
 +
=== Plugins ===
 +
 
 +
Information about plugins can be found in the "plugins" section of the Preferences file.
  
Documentation on OOXML may provide a guide to analysing a DOCX file.
+
=== DNS Prefetching ===
  
= Metadata =
+
DNS is prefetched for related sites, e.g. links on the page.
 +
This behavior is controlled by the setting "Predict network actions to improve page load performance", which is enabled by default.
  
== Content types ==
+
If enabled the Preferences file contains:
 
<pre>
 
<pre>
[Content_Types].xml
+
  "dns_prefetching": {
 +
      "enabled": true,
 
</pre>
 
</pre>
  
 +
If disabled the Preferences file contains:
 
<pre>
 
<pre>
&lt;?xml version="1.0" encoding="UTF-8" standalone="yes"?&gt;
+
  "dns_prefetching": {
&lt;Types xmlns="http://schemas.openxmlformats.org/package/2006/content-types"&gt;
+
      "enabled": false,
&lt;Default Extension="emf" ContentType="image/x-emf"/&gt;
+
&lt;Default Extension="rels" ContentType="application/vnd.openxmlformats-package.relationships+xml"/&gt;
+
&lt;Default Extension="xml" ContentType="application/xml"/&gt;
+
&lt;Override PartName="/word/document.xml" ContentType="application/vnd.openxmlformats-officedocument.wordprocessingml.document.main+xml"/&gt;
+
&lt;Override PartName="/word/styles.xml" ContentType="application/vnd.openxmlformats-officedocument.wordprocessingml.styles+xml"/&gt;
+
&lt;Override PartName="/word/stylesWithEffects.xml" ContentType="application/vnd.ms-word.stylesWithEffects+xml"/&gt;
+
&lt;Override PartName="/word/settings.xml" ContentType="application/vnd.openxmlformats-officedocument.wordprocessingml.settings+xml"/&gt;
+
&lt;Override PartName="/word/webSettings.xml" ContentType="application/vnd.openxmlformats-officedocument.wordprocessingml.webSettings+xml"/&gt;
+
&lt;Override PartName="/word/fontTable.xml" ContentType="application/vnd.openxmlformats-officedocument.wordprocessingml.fontTable+xml"/&gt;
+
&lt;Override PartName="/word/theme/theme1.xml" ContentType="application/vnd.openxmlformats-officedocument.theme+xml"/&gt;
+
&lt;Override PartName="/docProps/core.xml" ContentType="application/vnd.openxmlformats-package.core-properties+xml"/&gt;
+
&lt;Override PartName="/docProps/app.xml" ContentType="application/vnd.openxmlformats-officedocument.extended-properties+xml"/&gt;
+
&lt;/Types&gt;
+
 
</pre>
 
</pre>
  
== Document properties - core ==
+
== Start-up DNS queries ==
 +
 
 +
When Chrome starts, it queries several non-existent hostnames, each consisting of 10 random characters, e.g.
 
<pre>
 
<pre>
docProps/core.xml
+
ttrgoiknff.mydomain.com
 +
bxjhgftsyu.mydomain.com
 +
yokjbjiagd.mydomain.com
 
</pre>
 
</pre>
  
 +
This is used to determine if your ISP is hijacking NXDOMAIN results [http://www.google.com/support/forum/p/Chrome/thread?tid=3511015c72a7b314&hl=en].
 +
 +
== Disk Cache ==
 +
The Google Chrome disk cache can be found in:
 +
 +
On Linux
 
<pre>
 
<pre>
&lt;?xml version="1.0" encoding="UTF-8" standalone="yes"?&gt;
+
/home/$USER/.config/google-chrome/Default/Application Cache/Cache/
&lt;cp:coreProperties
+
    xmlns:cp="http://schemas.openxmlformats.org/package/2006/metadata/core-properties"
+
    xmlns:dc="http://purl.org/dc/elements/1.1/"
+
    xmlns:dcterms="http://purl.org/dc/terms/"
+
    xmlns:dcmitype="http://purl.org/dc/dcmitype/"
+
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"&gt;
+
&lt;dc:creator&gt;User 1&lt;/dc:creator&gt;
+
&lt;cp:lastModifiedBy&gt;User 2&lt;/cp:lastModifiedBy&gt;
+
&lt;cp:revision&gt;3&lt;/cp:revision&gt;
+
&lt;dcterms:created xsi:type="dcterms:W3CDTF"&gt;2012-11-07T23:29:00Z&lt;/dcterms:created&gt;
+
&lt;dcterms:modified xsi:type="dcterms:W3CDTF"&gt;2013-08-25T22:18:00Z&lt;/dcterms:modified&gt;
+
&lt;/cp:coreProperties&gt;
+
 
</pre>
 
</pre>
  
== Document properties - extended: application ==
+
On Mac OS X
 
<pre>
 
<pre>
docProps/app.xml
+
/Users/$USER/Library/Caches/Google/Chrome/Default/Cache/
 
</pre>
 
</pre>
  
 +
On Windows XP
 
<pre>
 
<pre>
&lt;?xml version="1.0" encoding="UTF-8" standalone="yes"?&gt;
+
C:\Documents and Settings\%USERNAME%\Local Settings\Application Data\Google\Chrome\User Data\Cache\
&lt;Properties
+
    xmlns="http://schemas.openxmlformats.org/officeDocument/2006/extended-properties"
+
    xmlns:vt="http://schemas.openxmlformats.org/officeDocument/2006/docPropsVTypes"&gt;
+
&lt;Template&gt;Normal.dotm&lt;/Template&gt;
+
    &lt;TotalTime&gt;1385&lt;/TotalTime&gt;
+
    &lt;Pages&gt;1&lt;/Pages&gt;
+
    &lt;Words&gt;2&lt;/Words&gt;
+
    &lt;Characters&gt;13&lt;/Characters&gt;
+
    &lt;Application&gt;Microsoft Office Word&lt;/Application&gt;
+
    &lt;DocSecurity&gt;0&lt;/DocSecurity&gt;
+
    &lt;Lines&gt;1&lt;/Lines&gt;
+
    &lt;Paragraphs&gt;1&lt;/Paragraphs&gt;
+
    &lt;ScaleCrop&gt;false&lt;/ScaleCrop&gt;
+
    &lt;HeadingPairs&gt;
+
        &lt;vt:vector size="2" baseType="variant"&gt;
+
            &lt;vt:variant&gt;
+
                &lt;vt:lpstr&gt;Title&lt;/vt:lpstr&gt;
+
            &lt;/vt:variant&gt;
+
            &lt;vt:variant&gt;
+
                &lt;vt:i4&gt;1&lt;/vt:i4&gt;
+
            &lt;/vt:variant&gt;
+
        &lt;/vt:vector&gt;
+
    &lt;/HeadingPairs&gt;
+
    &lt;TitlesOfParts&gt;
+
        &lt;vt:vector size="1" baseType="lpstr"&gt;
+
            &lt;vt:lpstr&gt;&lt;/vt:lpstr&gt;
+
        &lt;/vt:vector&gt;
+
    &lt;/TitlesOfParts&gt;
+
    &lt;Company&gt;&lt;/Company&gt;
+
    &lt;LinksUpToDate&gt;false&lt;/LinksUpToDate&gt;
+
    &lt;CharactersWithSpaces&gt;14&lt;/CharactersWithSpaces&gt;
+
    &lt;SharedDoc&gt;false&lt;/SharedDoc&gt;
+
    &lt;HyperlinksChanged&gt;false&lt;/HyperlinksChanged&gt;
+
    &lt;AppVersion&gt;14.0000&lt;/AppVersion&gt;
+
&lt;/Properties&gt;
+
 
</pre>
 
</pre>
  
= External Links =
+
On Windows Vista and later
 +
<pre>
 +
C:\Users\%USERNAME%\AppData\Local\Google\Chrome\User Data\Default\Cache\
 +
</pre>
 +
 
 +
The Chrome Cache contains different files with the following file names:
 +
* index
 +
* data_#, where # is a decimal digit.
 +
* f_######, where each # is a hexadecimal digit.
 +
 
 +
For more info see Chrome developers site [http://www.chromium.org/developers/design-documents/network-stack/disk-cache].
 +
 
 +
== History ==
 +
Chrome stores the history of visited sites in a file named '''History'''. This file uses the [[SQLite database format]].
 +
 
 +
The '''History''' file can be found in the same location as the '''Preferences''' file.
 +
 
 +
There is also an '''Archived History''' file, which holds records older than those in the '''History''' file.
 +
Note that the '''Archived History''' only contains visits.
 +
 
 +
=== Timestamps ===
 +
The '''History''' file uses different timestamp formats.
 +
 
 +
==== visits.visit_time ====
 +
 
 +
The '''visits.visit_time''' is the number of microseconds since January 1, 1601 UTC.
 +
 
 +
Some Python code to do the conversion into human readable format:
 +
<pre>
 +
import datetime
# timestamp: visits.visit_time, microseconds since 1601-01-01 UTC
date_string = (datetime.datetime(1601, 1, 1)
               + datetime.timedelta(microseconds=timestamp))
 +
</pre>
 +
 
 +
Note that this timestamp is not the same as a Windows FILETIME, which counts 100-nanosecond intervals since January 1, 1601 UTC.
 +
 
 +
==== downloads.start_time ====
 +
 
 +
The '''downloads.start_time''' is the number of seconds since January 1, 1970 UTC.
 +
 
 +
Some Python code to do the conversion into human readable format:
 +
<pre>
 +
import datetime
# timestamp: downloads.start_time, seconds since 1970-01-01 UTC
date_string = (datetime.datetime(1970, 1, 1)
               + datetime.timedelta(seconds=timestamp))
 +
</pre>
 +
 
 +
=== Example queries ===
 +
Some example queries:
 +
 
 +
To get an overview of the visited sites:
 +
<pre>
 +
SELECT datetime(((visits.visit_time/1000000)-11644473600), 'unixepoch'), urls.url, urls.title FROM urls, visits WHERE urls.id = visits.url;
 +
</pre>
 +
 
 +
Note that the visit_time conversion loses precision: the integer division by 1000000 discards the sub-second part.
 +
 
 +
To get an overview of the downloaded files:
 +
<pre>
 +
SELECT datetime(downloads.start_time, 'unixepoch'), downloads.url, downloads.full_path, downloads.received_bytes, downloads.total_bytes FROM downloads;
 +
</pre>
 +
 
 +
How download information is stored in the database varies between Chrome versions. As of version 26:
 +
<pre>
 +
SELECT datetime(((downloads.start_time/1000000)-11644473600), 'unixepoch'), downloads.target_path, downloads_url_chains.url, downloads.received_bytes, downloads.total_bytes
 +
FROM downloads, downloads_url_chains WHERE downloads.id = downloads_url_chains.id;
 +
</pre>
 +
 
 +
== See Also ==
 +
 
 +
* [[SQLite database format]]
 +
 
 +
== External Links ==
 +
* [http://en.wikipedia.org/wiki/Google_Chrome Wikipedia article on Google Chrome]
 +
* [http://www.chromium.org/user-experience/user-data-directory The Chromium Projects - User Data Directory]
 +
* [http://www.chromium.org/developers/design-documents/network-stack/disk-cache Chrome Disk Cache]
 +
* [http://www.google.com/support/forum/p/Chrome/thread?tid=3511015c72a7b314&hl=en Chrome support forum article random 10 character hostnames on startup]
 +
* [http://www.useragentstring.com/pages/Chrome/ Chrome User Agent strings]
 +
* [http://computer-forensics.sans.org/blog/2010/01/21/google-chrome-forensics/ Google Chrome Forensics] by [[Kristinn Guðjónsson]], January 21, 2010
 +
* [http://linuxsleuthing.blogspot.ch/2013/02/cashing-in-on-google-chrome-cache.html?m=1 Cashing in on the Google Chrome Cache], [[John Lehr]], February 24, 2013
 +
* [http://www.obsidianforensics.com/blog/history-index-files-removed-from-chrome/ History Index files removed from Chrome v30], by Ryan Benson, October 2, 2013
 +
* [https://hindsight-internet-history.googlecode.com/files/Evolution_of_Chrome_Databases.png Evolution of Chrome Databases], by Ryan Benson, October 2, 2013
  
* [http://msdn.microsoft.com/en-us/library/aa338205.aspx Introducing the Office (2007) Open XML File Formats], by [[Microsoft]], May 2006
+
== Tools ==
* [http://dublincore.org/documents/2012/06/14/dcmi-terms/?v=elements# DCMI Metadata Terms]
+
=== Open Source ===
* [http://www.simson.net/clips/academic/2009.IEEE.DOCX.pdf The new XML Office Document Files: Implications For Forensics], [[Simson L. Garfinkel]] and James Migletz
+
* [https://code.google.com/p/hindsight-internet-history/ hindsight-internet-history]
* [http://blog.kiddaland.net/2009/06/office-2007-metadata/ Perl script that displays metadata information that is extracted from an OpenXML document], by [[Kristinn Gudjonsson]], June 2009
+
* [http://blog.kiddaland.net/2009/07/antiword-for-office-2007/ Perl script that displays the content of a Docx document, similar to Antiword], by [[Kristinn Gudjonsson]], July 2009
+
* [http://computer-forensics.sans.org/blog/2009/07/10/office-2007-metadata/ Office 2007 Metadata], by [[Kristinn Gudjonsson]], July 10, 2009
+
  
[[Category:File Formats]]
+
[[Category:Applications]]
 +
[[Category:Web Browsers]]

Revision as of 06:06, 4 December 2013

Google Chrome is a web browser developed by Google Inc.

Configuration

The Google Chrome configuration can be found in the Preferences file.

On Linux

/home/$USER/.config/google-chrome/Default/Preferences

On Mac OS X

/Users/$USER/Library/Application Support/Google/Chrome/Default/Preferences

On Windows XP

C:\Documents and Settings\%USERNAME%\Local Settings\Application Data\Google\Chrome\User Data\Default\Preferences

On Windows Vista and later

C:\Users\%USERNAME%\AppData\Local\Google\Chrome\User Data\Default\Preferences

Or for Chromium

On Linux

/home/$USER/.config/chromium/Default/Preferences

On Mac OS X

/Users/$USER/Library/Application Support/Chromium/Default/Preferences

On Windows XP

C:\Documents and Settings\%USERNAME%\Local Settings\Application Data\Chromium\User Data\Default\Preferences

On Windows Vista and later

C:\Users\%USERNAME%\AppData\Local\Chromium\User Data\Default\Preferences
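
The locations listed above can be collected into a small lookup table. The following is a minimal sketch, assuming the Google Chrome paths exactly as given in this article; the platform labels and the function name are hypothetical and chosen only for illustration.

```python
from pathlib import PurePosixPath, PureWindowsPath

# Profile directories copied from the Google Chrome locations listed above.
# The dictionary keys are hypothetical labels, not names used by Chrome.
CHROME_PROFILE_DIRS = {
    "linux": "/home/{user}/.config/google-chrome/Default",
    "macos": "/Users/{user}/Library/Application Support/Google/Chrome/Default",
    "windows_xp": r"C:\Documents and Settings\{user}\Local Settings\Application Data\Google\Chrome\User Data\Default",
    "windows_vista": r"C:\Users\{user}\AppData\Local\Google\Chrome\User Data\Default",
}


def preferences_path(platform, user):
    """Return the expected Preferences path for a platform label and user name."""
    profile = CHROME_PROFILE_DIRS[platform].format(user=user)
    # Use the path flavour that matches the target system, not the analysis host.
    path_type = PureWindowsPath if platform.startswith("windows") else PurePosixPath
    return str(path_type(profile) / "Preferences")


print(preferences_path("linux", "alice"))
```

The pure path classes are used so that Windows paths can be built on a non-Windows analysis machine without touching the file system.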

Plugins

Information about plugins can be found in the "plugins" section of the Preferences file.

DNS Prefetching

DNS is prefetched for related sites, e.g. links on the page. This behavior is controlled by the setting "Predict network actions to improve page load performance", which is enabled by default.

If enabled the Preferences file contains:

   "dns_prefetching": {
      "enabled": true,

If disabled the Preferences file contains:

   "dns_prefetching": {
      "enabled": false,

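The Preferences file is JSON, so the setting shown above can be read programmatically. A minimal sketch, assuming the "dns_prefetching" key sits at the top level of the file as in the excerpts above; the function name is hypothetical.

```python
import json


def dns_prefetching_enabled(preferences_text):
    """Return the value of dns_prefetching.enabled, or None if the key is absent."""
    prefs = json.loads(preferences_text)
    section = prefs.get("dns_prefetching")
    if not isinstance(section, dict):
        return None
    return section.get("enabled")


# Stand-in for the contents of a Preferences file.
sample = '{"dns_prefetching": {"enabled": false}}'
print(dns_prefetching_enabled(sample))
```

Returning None when the key is missing keeps "not recorded" distinct from "disabled", which may matter when comparing profiles created by different Chrome versions.
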
Start-up DNS queries

When Chrome starts, it queries several non-existent hostnames, each consisting of 10 random characters, e.g.

ttrgoiknff.mydomain.com
bxjhgftsyu.mydomain.com
yokjbjiagd.mydomain.com

This is used to determine if your ISP is hijacking NXDOMAIN results [1].
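
When reviewing DNS logs, such start-up queries can be flagged with a simple pattern match. The sketch below is a heuristic only: it assumes the first label is exactly 10 lowercase ASCII letters, which matches the examples above but is not confirmed by the source as the exact character set. The function name is hypothetical.

```python
import re

# Heuristic: a first label of exactly 10 lowercase ASCII letters,
# modelled on the example hostnames above (an assumption, not a specification).
_PROBE_LABEL = re.compile(r"[a-z]{10}")


def looks_like_startup_probe(hostname):
    """Return True if the first DNS label has the shape of a start-up probe."""
    first_label = hostname.split(".", 1)[0]
    return _PROBE_LABEL.fullmatch(first_label) is not None


for name in ("ttrgoiknff.mydomain.com", "www.mydomain.com"):
    print(name, looks_like_startup_probe(name))
```

Legitimate hostnames that happen to have a 10-letter first label (for example "postmaster") also match, so hits should be treated as candidates rather than proof.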

Disk Cache

The Google Chrome disk cache can be found in:

On Linux

/home/$USER/.config/google-chrome/Default/Application Cache/Cache/

On Mac OS X

/Users/$USER/Library/Caches/Google/Chrome/Default/Cache/

On Windows XP

C:\Documents and Settings\%USERNAME%\Local Settings\Application Data\Google\Chrome\User Data\Cache\

On Windows Vista and later

C:\Users\%USERNAME%\AppData\Local\Google\Chrome\User Data\Default\Cache\

The Chrome Cache contains different files with the following file names:

  • index
  • data_#, where # is a decimal digit.
  • f_######, where each # is a hexadecimal digit.

For more info see Chrome developers site [2].
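
The naming scheme above can be used to sort the files found in a cache directory. A minimal sketch, assuming the three patterns exactly as listed; the category labels returned by the function are hypothetical.

```python
import re

_DATA_FILE = re.compile(r"data_[0-9]")            # data_#  (one decimal digit)
_STREAM_FILE = re.compile(r"f_[0-9a-fA-F]{6}")    # f_######  (six hexadecimal digits)


def classify_cache_file(name):
    """Map a cache file name to a hypothetical category label."""
    if name == "index":
        return "index"
    if _DATA_FILE.fullmatch(name):
        return "data"
    if _STREAM_FILE.fullmatch(name):
        return "separate"
    return "unknown"


for name in ("index", "data_1", "f_00a3bc", "History"):
    print(name, classify_cache_file(name))
```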

History

Chrome stores the history of visited sites in a file named History. This file uses the SQLite database format.

The History file can be found in the same location as the Preferences file.

There is also an Archived History file, which holds records older than those in the History file. Note that the Archived History only contains visits.

Timestamps

The History file uses different timestamp formats.

visits.visit_time

The visits.visit_time is the number of microseconds since January 1, 1601 UTC.

Some Python code to do the conversion into human readable format:

import datetime
# timestamp: visits.visit_time, microseconds since 1601-01-01 UTC
date_string = (datetime.datetime(1601, 1, 1)
               + datetime.timedelta(microseconds=timestamp))

Note that this timestamp is not the same as a Windows FILETIME, which counts 100-nanosecond intervals since January 1, 1601 UTC.
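
Because both formats share the 1601 epoch and differ only in unit, one converts to the other by a factor of 10. The following sketch illustrates the relationship; the function names are hypothetical.

```python
import datetime

EPOCH_1601 = datetime.datetime(1601, 1, 1)


def chrome_time_to_datetime(microseconds):
    """Convert a visits.visit_time value (microseconds since 1601) to a datetime."""
    return EPOCH_1601 + datetime.timedelta(microseconds=microseconds)


def filetime_to_datetime(ticks_100ns):
    """Convert a Windows FILETIME (100 ns ticks since 1601) to a datetime."""
    # Ten 100 ns ticks make one microsecond; the remainder is dropped because
    # datetime cannot represent anything finer than a microsecond.
    return EPOCH_1601 + datetime.timedelta(microseconds=ticks_100ns // 10)


def chrome_time_to_filetime(microseconds):
    """Express a Chrome timestamp in FILETIME units."""
    return microseconds * 10


# 11644473600 seconds separate 1601-01-01 from 1970-01-01.
unix_epoch_chrome = 11644473600 * 1000000
print(chrome_time_to_datetime(unix_epoch_chrome))
print(filetime_to_datetime(chrome_time_to_filetime(unix_epoch_chrome)))
```

Feeding a Chrome timestamp to a FILETIME decoder without the factor of 10 yields a date far too early, which is a quick way to spot the mix-up.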

downloads.start_time

The downloads.start_time is the number of seconds since January 1, 1970 UTC.

Some Python code to do the conversion into human readable format:

import datetime
# timestamp: downloads.start_time, seconds since 1970-01-01 UTC
date_string = (datetime.datetime(1970, 1, 1)
               + datetime.timedelta(seconds=timestamp))

Example queries

Some example queries:

To get an overview of the visited sites:

SELECT datetime(((visits.visit_time/1000000)-11644473600), 'unixepoch'), urls.url, urls.title FROM urls, visits WHERE urls.id = visits.url;

Note that the visit_time conversion loses precision: the integer division by 1000000 discards the sub-second part.
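
If the sub-second part matters, the raw value can be selected and converted outside SQL. The sketch below uses Python's sqlite3 module against an in-memory stand-in for the History file; the table definitions are a hypothetical minimum containing only the columns used by the query above, not the full Chrome schema, and the sample row is invented for illustration.

```python
import datetime
import sqlite3

EPOCH_1601 = datetime.datetime(1601, 1, 1)


def list_visits(connection):
    """Return (visit datetime, url, title) tuples with microsecond precision."""
    rows = connection.execute(
        "SELECT visits.visit_time, urls.url, urls.title "
        "FROM urls, visits WHERE urls.id = visits.url "
        "ORDER BY visits.visit_time"
    )
    return [
        (EPOCH_1601 + datetime.timedelta(microseconds=visit_time), url, title)
        for visit_time, url, title in rows
    ]


# Minimal stand-in for a History file: only the columns the query needs.
connection = sqlite3.connect(":memory:")
connection.execute("CREATE TABLE urls (id INTEGER PRIMARY KEY, url TEXT, title TEXT)")
connection.execute("CREATE TABLE visits (id INTEGER PRIMARY KEY, url INTEGER, visit_time INTEGER)")
connection.execute("INSERT INTO urls VALUES (1, 'http://example.com/', 'Example')")
connection.execute("INSERT INTO visits VALUES (1, 1, 13030000000123456)")

visits = list_visits(connection)
for when, url, title in visits:
    print(when.isoformat(), url, title)
```

For a real examination, the same function could be pointed at a copy of the History file by passing its path to sqlite3.connect instead of ":memory:".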

To get an overview of the downloaded files:

SELECT datetime(downloads.start_time, 'unixepoch'), downloads.url, downloads.full_path, downloads.received_bytes, downloads.total_bytes FROM downloads;

How download information is stored in the database varies between Chrome versions. As of version 26:

SELECT datetime(((downloads.start_time/1000000)-11644473600), 'unixepoch'), downloads.target_path, downloads_url_chains.url, downloads.received_bytes, downloads.total_bytes
FROM downloads, downloads_url_chains WHERE downloads.id = downloads_url_chains.id;
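
A script that must handle both layouts can pick the query by inspecting the database. The sketch below assumes that the presence of a downloads_url_chains table distinguishes the newer layout, which is inferred from the second query above rather than stated by the source; the function and constant names are hypothetical.

```python
import sqlite3

OLD_DOWNLOADS_QUERY = (
    "SELECT datetime(downloads.start_time, 'unixepoch'), downloads.url, "
    "downloads.full_path, downloads.received_bytes, downloads.total_bytes "
    "FROM downloads"
)

NEW_DOWNLOADS_QUERY = (
    "SELECT datetime(((downloads.start_time/1000000)-11644473600), 'unixepoch'), "
    "downloads.target_path, downloads_url_chains.url, "
    "downloads.received_bytes, downloads.total_bytes "
    "FROM downloads, downloads_url_chains "
    "WHERE downloads.id = downloads_url_chains.id"
)


def has_table(connection, table_name):
    """Return True if the database contains a table with the given name."""
    row = connection.execute(
        "SELECT 1 FROM sqlite_master WHERE type = 'table' AND name = ?",
        (table_name,),
    ).fetchone()
    return row is not None


def downloads_query(connection):
    """Choose the downloads query that matches the database layout."""
    # Assumption: downloads_url_chains exists only in the newer layout.
    if has_table(connection, "downloads_url_chains"):
        return NEW_DOWNLOADS_QUERY
    return OLD_DOWNLOADS_QUERY
```

Checking for the table avoids having to know the exact Chrome version that produced the file.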

See Also

External Links

Tools

Open Source