Difference between revisions of "National Software Reference Library"

From ForensicsWiki
Jump to: navigation, search
m (Typo.)
(Added new header, better links and text)
Line 1: Line 1:
The '''National Software Reference Library''' (NSRL) provides sets of known good [[hash]]es.  
+
The '''National Software Reference Library''' (NSRL) consists of sets of known 'traceable' hashes produced by the [[National Institute of Standards and Technology]] (NIST). Useful for [[Data Reduction|data reduction]], it can be used to eliminate or highlight files from examination. The NSRL creates what they call a "Reference Data Set" or RDS. The RDS does not indicate if a file is known good or bad, only that it is known.
 +
 
 +
Although quite large, the NSRL is distributed online can be downloaded from the [http://www.nsrl.nist.gov/Downloads.htm NSRL website]. The most recent release was version 2.15 in December 2006.
  
 
== NSRL File Format ==
 
== NSRL File Format ==
  
=== Version 1.5 ===
+
Each RDS consists of several files, but the hashes are stored in <tt>NSRLFile.txt</tt>. These files have a header followed by many hash records. The header denotes the columns in each file. (See the External Links for the complete specification). RDS files can be used directly with programs like [[md5deep]], [[Forensic Toolkit|FTK]], and [[EnCase]].
 +
 
 +
The file format has changed slightly over time. The latest version was dated 7 Feb 2007:
  
 
=== Version 2.0 ===
 
=== Version 2.0 ===
  
Starting in version 2.0, the NSRL added the hashes to the start of the line and dropped the [[MD4]] value. Lines have the following format:
+
Starting in version 2.0, the NSRL moved the hashes to the start of each line and dropped the [[MD4]] hash. The file header:
 +
 
 +
<pre>"SHA-1","MD5","CRC32","FileName","FileSize","ProductCode","OpSystemCode","SpecialCode"</pre>
 +
 
 +
=== Version 1.5 ===
 +
 
 +
Information on the older header version is kept here so that programs can read older files. The file header:
 +
 
 +
<pre>"SHA-1","FileName","FileSize","ProductCode","OpSystemCode","MD4","MD5","CRC32","SpecialCode"</pre>
  
"SHA1","MD5","CRC32","filename","filesize","productcode","operating-system-code","specialcode"
+
<tt>OpSystemCode</tt> refers to the operating system code. The <tt>SpecialCode</tt> is a single character that can be used to mark records. A normal file has a blank value here. An <tt>M</tt> in this field denotes a malicious file. 
  
 
== External Links ==
 
== External Links ==
  
 
* [http://www.nsrl.nist.gov/ NSRL website]
 
* [http://www.nsrl.nist.gov/ NSRL website]
 +
* [http://www.nsrl.nist.gov/documents/Data-Formats-of-the-NSRL-Reference-Data-Set-14.pdf NSRL RDS Data File Format] - Describes the format of the hash files

Revision as of 23:47, 26 February 2007

The National Software Reference Library (NSRL) consists of sets of known 'traceable' hashes produced by the National Institute of Standards and Technology (NIST). Useful for data reduction, it can be used to eliminate or highlight files from examination. The NSRL creates what they call a "Reference Data Set" or RDS. The RDS does not indicate if a file is known good or bad, only that it is known.

Although quite large, the NSRL is distributed online can be downloaded from the NSRL website. The most recent release was version 2.15 in December 2006.

NSRL File Format

Each RDS consists of several files, but the hashes are stored in NSRLFile.txt. These files have a header followed by many hash records. The header denotes the columns in each file. (See the External Links for the complete specification). RDS files can be used directly with programs like md5deep, FTK, and EnCase.

The file format has changed slightly over time. The latest version was dated 7 Feb 2007:

Version 2.0

Starting in version 2.0, the NSRL moved the hashes to the start of each line and dropped the MD4 hash. The file header:

"SHA-1","MD5","CRC32","FileName","FileSize","ProductCode","OpSystemCode","SpecialCode"

Version 1.5

Information on the older header version is kept here so that programs can read older files. The file header:

"SHA-1","FileName","FileSize","ProductCode","OpSystemCode","MD4","MD5","CRC32","SpecialCode"

OpSystemCode refers to the operating system code. The SpecialCode is a single character that can be used to mark records. A normal file has a blank value here. An M in this field denotes a malicious file.

External Links