Difference between pages "Data Mining" and "OLE Compound File"

From ForensicsWiki
Line 1: Line 1:
Right now this is just a list of resources that will be useful for people doing forensic data mining and machine learning.
+
The '''OLE Compound File (OLECF)''' is used in other file formats as its underlying container file.
 +
It allows data to be stored in multiple streams.  
  
==Open Source Software==
+
The OLECF is also known as:
* [http://www.cs.waikato.ac.nz/ml/weka/ Weka] data mining toolkit - java, has programmatic and GUI interface.  
+
* Compound Binary File (current name used by [[Microsoft]])
* Ping He has created an [http://code.google.com/p/fc45 Open Source C4.5 implementation in C]
+
* Compound Document File (name used by [[OpenOffice]])
* [http://mloss.org Machine Learning Open Source Software] - a page hosting many open source machine learning tools and libraries.  
+
* OLE2 file
* [http://lucene.apache.org/mahout/ Apache Mahout]: goal is to "build scalable, Apache licensed machine learning libraries" (java). also includes a focus on using [http://hadoop.apache.org/core/ hadoop].
+
 
 +
== MIME types ==
 +
 
 +
Because the OLECF by itself is just a container, it does not have a MIME type of its own.
 +
A MIME type assigned to an OLECF refers to its contents.
 +
 
 +
== File signature ==
 +
 
 +
The OLECF has the following file signature:
 +
hexadecimal: d0 cf 11 e0 a1 b1 1a e1
 +
 
 +
The OLECF has no distinct footer.
 +
 
 +
== Contents ==
 +
 
 +
The OLECF uses a FAT-like file system: multiple allocation tables define which blocks are assigned to each stream.
 +
It uses a directory structure to define the name of the streams.
 +
 
 +
The OLECF is used to store:
 +
* [[Microsoft Office]] 97-2003 documents:
 +
** [[Word Document (DOC)]]
 +
** [[Excel Spreadsheet (XLS)]]
 +
** [[Powerpoint Presentation (PPT)]]
 +
* [[Thumbs.db]]
 +
* StickyNotes.snt
 +
 
 +
== See also ==
 +
 
 +
[[Media:Compdocfileformat.pdf|Microsoft Compound Document File Format]] (This is actually the OpenOffice specification)
 +
 
 +
[http://download.microsoft.com/download/0/B/E/0BE8BDD7-E5E8-422A-ABFD-4342ED7AD886/WindowsCompoundBinaryFileFormatSpecification.pdf Compound Binary File Specification by Microsoft]
 +
 
 +
Be warned: this file contains at least one error; the directory entry name length is a size in bytes, not in characters.
 +
 
 +
[[Category:File Formats]]

Revision as of 06:31, 19 November 2010

The OLE Compound File (OLECF) is used in other file formats as its underlying container file. It allows data to be stored in multiple streams.

The OLECF is also known as:

  • Compound Binary File (current name used by Microsoft)
  • Compound Document File (name used by OpenOffice)
  • OLE2 file

MIME types

Because the OLECF by itself is just a container, it does not have a MIME type of its own. A MIME type assigned to an OLECF refers to its contents.

File signature

The OLECF has the following file signature (hexadecimal): d0 cf 11 e0 a1 b1 1a e1
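
As a minimal sketch, the signature check can be written in Python as follows. The function names are hypothetical, and the check assumes the signature sits at the very start of the file (offset 0), which this page does not state explicitly.

```python
# The 8-byte OLECF signature listed above.
OLECF_SIGNATURE = bytes.fromhex("d0cf11e0a1b11ae1")


def has_olecf_signature(data: bytes) -> bool:
    """Return True if data begins with the OLECF signature.

    Assumption: the signature is located at offset 0.
    """
    return data[:len(OLECF_SIGNATURE)] == OLECF_SIGNATURE


def file_has_olecf_signature(path: str) -> bool:
    """Read only the first 8 bytes of a file and test them."""
    with open(path, "rb") as handle:
        return has_olecf_signature(handle.read(len(OLECF_SIGNATURE)))
```

A match only identifies the container. Because the signature is shared by every format built on the OLECF, it says nothing about whether the contents are a Word document, a Thumbs.db file, or something else.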

The OLECF has no distinct footer.

Contents

The OLECF uses a FAT-like file system: multiple allocation tables define which blocks are assigned to each stream, and a directory structure defines the names of the streams.
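
The idea of an allocation table can be illustrated with a short Python sketch. It works on an in-memory list instead of a real file: entry N holds the number of the block that follows block N in the same stream. The end-of-chain value 0xFFFFFFFE is taken from Microsoft's specification and should be treated as an assumption here, as should the function name; locating and parsing the tables in an actual file is not shown.

```python
# Assumed end-of-chain marker (value from Microsoft's specification).
END_OF_CHAIN = 0xFFFFFFFE


def follow_chain(allocation_table, first_block):
    """Return the block numbers of one stream, in storage order.

    allocation_table[n] is the block that follows block n.
    Raises ValueError on a loop or an out-of-range block number,
    both of which indicate a corrupt table.
    """
    chain = []
    seen = set()
    block = first_block
    while block != END_OF_CHAIN:
        if block in seen or not 0 <= block < len(allocation_table):
            raise ValueError(f"corrupt chain at block {block}")
        seen.add(block)
        chain.append(block)
        block = allocation_table[block]
    return chain
```

The loop and range checks matter for forensic work, where damaged or deliberately malformed files are common and an unchecked chain walk could run forever.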

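The OLECF is used to store:

  • Microsoft Office 97-2003 documents:
      ◦ Word Document (DOC)
      ◦ Excel Spreadsheet (XLS)
      ◦ Powerpoint Presentation (PPT)
  • Thumbs.db
  • StickyNotes.snt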

See also

Microsoft Compound Document File Format (This is actually the OpenOffice specification)

Compound Binary File Specification by Microsoft

Be warned: this file contains at least one error; the directory entry name length is a size in bytes, not in characters.
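
The practical effect of that warning can be shown with a small Python sketch. The layout used here is an assumption based on Microsoft's specification and is not described on this page: a 64-byte UTF-16 little-endian name buffer at offset 0 of the directory entry, followed at offset 64 by a 16-bit little-endian name length that counts bytes and includes the 2-byte terminator. The function name is hypothetical.

```python
import struct

NAME_BUFFER_SIZE = 64  # assumed size of the name buffer in bytes


def decode_entry_name(entry: bytes) -> str:
    """Decode the stream name from a raw directory entry.

    The stored length is treated as a size in bytes (not characters)
    that includes the terminating 2-byte null character.
    """
    if len(entry) < NAME_BUFFER_SIZE + 2:
        raise ValueError("directory entry too short")
    (name_size,) = struct.unpack_from("<H", entry, NAME_BUFFER_SIZE)
    if name_size > NAME_BUFFER_SIZE or name_size % 2:
        raise ValueError(f"invalid name size: {name_size}")
    if name_size < 2:
        return ""  # unused entry: no name stored
    # Drop the 2-byte terminator before decoding.
    return entry[:name_size - 2].decode("utf-16-le")
```

Reading the length as a character count would double the number of bytes consumed, so a 10-character name with a stored length of 22 would be decoded with trailing null characters appended.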