Difference between pages "NetworkMiner" and "Document Metadata Extraction"

From ForensicsWiki
(Difference between pages)
Jump to: navigation, search
m
 
(Images)
 
Line 1: Line 1:
{{Infobox_Software |
+
Here are tools that will extract metadata from document files.
  name = NetworkMiner |
+
  maintainer = Erik Hjelmvik |
+
  os = {{Windows}} |
+
  genre = Network forensics |
+
  license = {{GPL}} |
+
  website = [http://networkminer.wiki.sourceforge.net/NetworkMiner networkminer.wiki.sourceforge.net/NetworkMiner] |
+
}}
+
  
[http://networkminer.wiki.sourceforge.net/NetworkMiner NetworkMiner] is a Network Forensic Analysis Tool (NFAT) for Windows. NetworkMiner can be used as a passive network [[sniffer]]/packet capturing tool in order to detect operating systems, sessions, hostnames, open ports etc. without putting any traffic on the network. NetworkMiner can also parse PCAP files for off-line analysis and to regenerate/reassemble transmitted files and certificates from PCAP files.
+
=Office Files=
  
The purpose of NetworkMiner is to collect data (such as forensic evidence) about hosts on the network rather than to collect data regarding the traffic on the network. The main view is host centric (information grouped per host) rather than packet centric (information showed as a list of packets/frames).
+
; [[antiword]]
 +
: http://www.winfield.demon.nl/
  
NetworkMiner performs [[OS fingerprinting]] based on TCP SYN and SYN+ACK packet by using [[OS fingerprinting]] databases from p0f (by Michal Zalewski) and Ettercap (by Alberto Ornaghi and Marco Valleri). NetworkMiner can also perform [[OS fingerprinting]] based on DHCP packets (which usually are broadcast packets) by making use of the Satori (by Eric Kollmann) [[OS fingerprinting]] database from FingerBank. NetworkMiner also uses the MAC-vendor list from Nmap (by Fyodor).
+
; [[catdoc]]
 +
: http://www.45.free.net/~vitus/software/catdoc/
  
NetworkMiner can extract files and certificates transferred over the network by parsing a PCAP file or by sniffing traffic directly from the network. This is a neat function that can be used to extract and [http://networkminer.wiki.sourceforge.net/save+media+files save media files] (such as audio or video files) which are streamed across a network. Supported protocols for file extraction are FTP, HTTP and SMB.
+
; [[laola]]
 +
: http://user.cs.tu-berlin.de/~schwartz/pmh/index.html
  
User credentials (usernames and passwords) for supported protocols are extracted by NetworkMiner and displayed under the "Credentials" tab. Please be considerate when displaying the contents of this tab to the public.
+
; [[word2x]]
 +
: http://word2x.sourceforge.net/
  
Another very useful feature is that the user can [http://networkminer.wiki.sourceforge.net/Keyword+Search search sniffed or stored data for keywords]. NetworkMiner allows the user to insert arbitrary string or byte-patterns that shall be searched for with the keyword search functionality.
+
; [[wvWare]]
 +
: http://wvware.sourceforge.net/
 +
: Extracts metadata from various [[Microsoft]] Word files ([[doc]]). Can also convert doc files to other formats such as HTML or plain text.
  
Version 0.84 (and newer) of NetworkMiner support [http://networkminer.wiki.sourceforge.net/WiFi+Sniffing sniffing and parsing of WLAN (IEEE 802.11) traffic]. NetworkMiner does however currently only support WiFi sniffing with AirPcap adapters.
+
; [[Outside In]]
 +
: http://www.oracle.com/technology/products/content-management/oit/oit_all.html
 +
: Originally developed by Stellant, supports hundreds of file types.
  
A feature which is planned to be included in future versions of NetworkMiner is to use statistical methods to do protocol identification (protocol fingerprinting) of a TCP session or UDP data. This means that instead of looking at the port number to guess which protocol is used on top of the TCP/UDP packet NetworkMiner will identify the correct protocol based on the TCP/UDP packet content. This way NetworkMiner will be able to identify protocols even if the service is run on a non-standard port. Richard Bejtlich calls this type of functionality [http://taosecurity.blogspot.com/2006/09/port-independent-protocol.html "Port Independent Protocol Identification" (PIPI)].
+
; [[FI Tools]]
 +
: http://forensicinnovations.com/
 +
: More than 100 file types.
  
[[Category:Network Forensics]]
+
=PDF Files=
 +
 
 +
; [[xpdf]]
 +
: http://www.foolabs.com/xpdf/
 +
: [[pdfinfo]] (part of the [[xpdf]] package) displays some metadata of [[PDF]] files.
 +
 
 +
 
 +
(See [[PDF]])
 +
 
 +
=Images=
 +
 
 +
; [[jhead]]
 +
: http://www.sentex.net/~mwandel/jhead/
 +
: Displays or modifies [[Exif]] data in [[JPEG]] files.
 +
 
 +
; [[vinetto]]
 +
: http://vinetto.sourceforge.net/
 +
: Examines [[Thumbs.db]] files.
 +
 
 +
;[[libexif]]
 +
: http://sourceforge.net/projects/libexif EXIF tag Parsing Library
 +
 
 +
; [[Adroit Photo Forensics]]
 +
: http://digital-assembly.com/products/adroit-photo-forensics/
 +
: Displays meta data and uses date and camera meta-data for grouping, timelines etc.
 +
 
 +
; Exif Viewer
 +
: http://araskin.webs.com/exif/exif.html
 +
: Add-on for Firefox and Thunderbird that displays various [[JPEG]]/JPG metadata in local and remote images.
 +
 
 +
; exiftags
 +
: http://johnst.org/sw/exiftags/
 +
: open source utility to parse and edit [[exif]] data in [[JPEG]] images. Found in many Debian based distributions.
 +
 
 +
; exifprobe
 +
: http://www.virtual-cafe.com/~dhh/tools.d/exifprobe.d/exifprobe.html
 +
: Open source utility that reads [[exif]] data in [[JPEG]] and some "RAW" image formats. Found in many Debian based distributions.
 +
 
 +
=General=
 +
These general-purpose programs frequently work when the special-purpose programs fail, but they generally provide less detailed information.
 +
 
 +
; [[Metadata Extraction Tool]]
 +
: "Developed by the National Library of New Zealand to programmatically extract preservation metadata from a range of file formats like PDF documents, image files, sound files Microsoft office documents, and many others."
 +
: http://meta-extractor.sourceforge.net/
 +
 
 +
; [[Metadata Assistant]]
 +
: http://www.payneconsulting.com/products/metadataent/
 +
 
 +
; [[hachoir|hachoir-metadata]]
 +
: Extraction tool, part of '''[[Hachoir]]''' project
 +
 
 +
; [[file]]
 +
: The UNIX '''file''' program can extract some metadata
 +
 
 +
; [[GNU libextractor]]
 +
: http://gnunet.org/libextractor/ The libextractor library is a plugable system for extracting metadata
 +
 
 +
; [[Directory Lister Pro]]
 +
: Directory Lister Pro is a Windows tool which creates listings of files from selected directories on hard disks, CD-ROMs, DVD-ROMs, floppies, USB storages and network shares. Listing can be in HTML, text or CSV format (for easy import to Excel). Listing can contain standard file information like file name, extension, type, owner and date created, but especially for forensic analysis file meta data can be extracted from various formats: 1) executable file information (EXE, DLL, OCX) like file version, description, company, product name. 2) multimedia properties (MP3, AVI, WAV, JPG, GIF, BMP, MKV, MKA, MPEG) like track, title, artist, album, genre, video format, bits per pixel, frames per second, audio format, bits per channel. 3) Microsoft Office files (DOC, DOCX, XLS, XLSX, PPT, PPTX) like document title, author, keywords, word count. For each file and folder it is also possible to obtain its CRC32, MD5, SHA-1 and Whirlpool hash sum. Extensive number of options allows to completely customize the visual look of the output. Filter on file name, date, size or attributes can be applied so it is possible to limit the files listed.
 +
: http://www.krksoft.com
 +
 
 +
[[Category:Tools]]

Revision as of 10:23, 9 April 2010

Here are tools that will extract metadata from document files.

Office Files

antiword
http://www.winfield.demon.nl/
catdoc
http://www.45.free.net/~vitus/software/catdoc/
laola
http://user.cs.tu-berlin.de/~schwartz/pmh/index.html
word2x
http://word2x.sourceforge.net/
wvWare
http://wvware.sourceforge.net/
Extracts metadata from various Microsoft Word files (doc). Can also convert doc files to other formats such as HTML or plain text.
Outside In
http://www.oracle.com/technology/products/content-management/oit/oit_all.html
Originally developed by Stellant, supports hundreds of file types.
FI Tools
http://forensicinnovations.com/
More than 100 file types.

PDF Files

xpdf
http://www.foolabs.com/xpdf/
pdfinfo (part of the xpdf package) displays some metadata of PDF files.


(See PDF)

Images

jhead
http://www.sentex.net/~mwandel/jhead/
Displays or modifies Exif data in JPEG files.
vinetto
http://vinetto.sourceforge.net/
Examines Thumbs.db files.
libexif
http://sourceforge.net/projects/libexif EXIF tag Parsing Library
Adroit Photo Forensics
http://digital-assembly.com/products/adroit-photo-forensics/
Displays meta data and uses date and camera meta-data for grouping, timelines etc.
Exif Viewer
http://araskin.webs.com/exif/exif.html
Add-on for Firefox and Thunderbird that displays various JPEG/JPG metadata in local and remote images.
exiftags
http://johnst.org/sw/exiftags/
open source utility to parse and edit exif data in JPEG images. Found in many Debian based distributions.
exifprobe
http://www.virtual-cafe.com/~dhh/tools.d/exifprobe.d/exifprobe.html
Open source utility that reads exif data in JPEG and some "RAW" image formats. Found in many Debian based distributions.

General

These general-purpose programs frequently work when the special-purpose programs fail, but they generally provide less detailed information.

Metadata Extraction Tool
"Developed by the National Library of New Zealand to programmatically extract preservation metadata from a range of file formats like PDF documents, image files, sound files Microsoft office documents, and many others."
http://meta-extractor.sourceforge.net/
Metadata Assistant
http://www.payneconsulting.com/products/metadataent/
hachoir-metadata
Extraction tool, part of Hachoir project
file
The UNIX file program can extract some metadata
GNU libextractor
http://gnunet.org/libextractor/ The libextractor library is a plugable system for extracting metadata
Directory Lister Pro
Directory Lister Pro is a Windows tool which creates listings of files from selected directories on hard disks, CD-ROMs, DVD-ROMs, floppies, USB storages and network shares. Listing can be in HTML, text or CSV format (for easy import to Excel). Listing can contain standard file information like file name, extension, type, owner and date created, but especially for forensic analysis file meta data can be extracted from various formats: 1) executable file information (EXE, DLL, OCX) like file version, description, company, product name. 2) multimedia properties (MP3, AVI, WAV, JPG, GIF, BMP, MKV, MKA, MPEG) like track, title, artist, album, genre, video format, bits per pixel, frames per second, audio format, bits per channel. 3) Microsoft Office files (DOC, DOCX, XLS, XLSX, PPT, PPTX) like document title, author, keywords, word count. For each file and folder it is also possible to obtain its CRC32, MD5, SHA-1 and Whirlpool hash sum. Extensive number of options allows to completely customize the visual look of the output. Filter on file name, date, size or attributes can be applied so it is possible to limit the files listed.
http://www.krksoft.com