DragonFly On-Line Manual Pages

Search: Section:  


silk(7)                         SiLK Tool Suite                        silk(7)

NAME

SiLK - the System for Internet-Level Knowledge

DESCRIPTION

SiLK is a collection of traffic analysis tools developed by the CERT Network Situational Awareness Team (CERT NetSA) to facilitate security analysis of large networks. The SiLK tool suite supports the efficient collection, storage, and analysis of network flow data, enabling network security analysts to rapidly query large historical traffic data sets. SiLK is ideally suited for analyzing traffic on the backbone or border of a large, distributed enterprise or mid-sized ISP. A SiLK installation consists of two categories of applications: the analysis suite and the packing system. Analysis suite The SiLK analysis suite is a collection of command-line tools for processing SiLK Flow records created by the SiLK packing system. These tools read binary files containing SiLK Flow records and partition, sort, and count these records. The most important analysis tool is rrwwffiilltteerr(1), an application for querying the central data repository for SiLK Flow records that satisfy a set of filtering options. The tools are intended to be combined in various ways to perform an analysis task. A typical analysis uses UNIX pipes and intermediate data files to share data between invocations of tools. The tools, configuration files, and plug-in modules that make up the analysis tools are listed below, roughly grouped by functionality. Filtering, sorting, and display rrwwffiilltteerr(1) partitions SiLK Flow records into one or more 'pass' and/or 'fail' output streams. rwfilter is the primary tool for pulling flows from the data store. ssiillkk..ccoonnff(5) is the configuration file naming the Classes, Types, and Sensors available at your installation. rrwwssoorrtt(1) sorts SiLK Flow records using a user-specified key comprised of record attributes, and writes the records to the named output path or to the standard output. Users can define new key fields using plug- ins written in C or PySiLK. rrwwccuutt(1) prints the attributes of SiLK Flow records in a delimited, columnar, human-readable format. Users can define new printable attributes using plug-ins written in C or PySiLK. SiLK Python Extension ppyyssiillkk(3). PySiLK, the SiLK Python extension, allows one to read, manipulate, and write SiLK Flow records, IPsets, and Bags from within Python. PySiLK may be used in a stand-alone Python program or to write plug-ins for several SiLK applications. This document describes the objects, methods, and functions that PySiLK provides. The next entry describes using PySiLK from within a plug-in. ssiillkkppyytthhoonn(3). The SiLK Python plug-in provides a way to use PySiLK to define new partitioning rules for rrwwffiilltteerr(1), new key fields for rrwwccuutt(1), rrwwggrroouupp(1), and rrwwssoorrtt(1), and new key or value fields for rrwwssttaattss(1) and rrwwuunniiqq(1). Counting and statistics rrwwuunniiqq(1) bins (groups) SiLK Flow records by a user-specified key comprised of record attributes and prints the total byte, packet, and/or flow counts for each bin. rwuniq can also print distinct source IP and destination IP counts. Users can define new key fields and value fields using plug-ins written in C or PySiLK. rrwwccoouunntt(1) summarizes SiLK Flow records across time, producing textual output with counts of bytes, packets, and flow records for each time bin. rrwwssttaattss(1) summarizes SiLK Flow records by a user-specified key comprised of record attributes, computes values from the flow records that match each key, sorts the results by the value to generate a Top-N or Bottom-N list, and prints the results. Users can define new key fields and value fields using plug-ins written in C or PySiLK. rrwwttoottaall(1) summarizes SiLK Flow records by a specified key and prints the sum of the byte, packet, and flow counts for flows matching the key. rrwwaaddddrrccoouunntt(1) summarizes SiLK flow records by the source or destination IP and prints the byte, packet, and flow counts for each IP. IPset, Bag, and Prefix Map manipulation rrwwsseett(1) reads SiLK Flow records and generates binary IPset file(s) containing the source IP addresses or destination IP addresses seen on the flow records. rrwwsseettbbuuiilldd(1) reads (textual) IP addresses in dotted-quad or CIDR notation from an input file or from the standard input and writes a binary IPset file. rrwwsseettccaatt(1) prints the contents of a binary IPset file as text. Additional information about the IPset file can be printed. rrwwsseettttooooll(1) performs union, intersection, difference, and sampling functions on the input IPset files, generating a new IPset file. rrwwsseettmmeemmbbeerr(1) determines whether the IP address specified on the command line is contained in an IPset. rrwwbbaagg(1) reads SiLK Flow records and builds binary Bag(s) containing key-count pairs. An example is a Bag containing the sum of the byte counts for each source port seen on the flow records. rrwwbbaaggbbuuiilldd(1) creates a binary Bag file from a binary IPset file or from a textual input file. rrwwbbaaggccaatt(1) prints binary Bag files as text. rrwwbbaaggttooooll(1) performs operations (e.g., addition, subtraction) on binary Bag files and produces a new Bag file. rrwwppmmaappbbuuiilldd(1) reads textual input and creates a binary prefix map file for use with the Address Type (aaddddrrttyyppee(3)) and Prefix Map (ppmmaappffiilltteerr(3)) utilities. rrwwppmmaappccaatt(1) prints information about a prefix map file as text. By default, prints each IP range in the prefix map and its label. rrwwppmmaappllooookkuupp(1) finds information about specific IP address(es) or protocol/port pair(s) in a binary prefix map file and prints the result as text. rrwwiippaaiimmppoorrtt(1) imports a SiLK IPset, Bag, or Prefix Map file into the IP Address Association (IPA <http://tools.netsa.cert.org/ipa/>) library. rrwwiippaaeexxppoorrtt(1) exports a set of IP addresses from the IP Address Association (IPA) library to a SiLK IPset, Bag, or Prefix Map. IP and port labeling files aaddddrrttyyppee(3). The Address Type file provides a way to map an IPv4 address to an integer denoting the IP as internal, external, or non- routable. ccccffiilltteerr(3). The Country Code file provides a mapping from an IPv4 address to two-letter, lowercase abbreviation of the country what that IP address is located. The abbreviations used by the Country Code utility are those used by the Root-Zone Whois Index (see for example <http://www.iana.org/cctld/cctld-whois.htm>). ppmmaappffiilltteerr(3). Prefix map files provide a way to map field values to string labels based on a user-defined map file. The map file is created by rrwwppmmaappbbuuiilldd(1). Run time plug-ins fflloowwrraattee(3). The flowrate plug-in, which must be loaded explicitly, adds switches and fields to compute packets/second, bytes/second, bytes/packet, payload-bytes, and payload-bytes/second. iinntt--eexxtt--ffiieellddss(3). The internal/external plug-in makes available fields containing internal and external IPs and ports (int-ip, ext-ip, int-port, and ext-port). iippaaffiilltteerr(3). The IPA (IP Association) plug-in works with rwfilter to partition flows based on data in an IPA data store. rwfilter will automatically load this plug-in if it is available. The plug-in requires that SiLK be compiled with IPA support (<http://tools.netsa.cert.org/ipa/>). ssiillkk--pplluuggiinn(3) describes how to create and compile a new SiLK plug-in using C. Record grouping and masking rrwwggrroouupp(1) groups SiLK flow records by a user-specified key comprised of record attributes, labels the records with a group ID that is stored in the next-hop IP field, and writes the resulting flows to the specified output path or to the standard output. rwgroup requires that its input is sorted. rrwwmmaattcchh(1) matches (mates) records as queries and responses and marks mated records with an ID that is stored in the next-hop IP field. rwmatch requires that its input is sorted. rrwwnneettmmaasskk(1) reads SiLK Flow records, zeroes the least significant bits of the source-, destination-, and/or next-hop-IP address(es), and writes the resulting records to the named output path or to the standard output. Packet and external flow-format processing rrwwpp22yyaaff22ssiillkk(1) converts a packet capture (ppccaapp(3)) file---such as a file produced by ttccppdduummpp(1)---to a single file of SiLK Flow records. rwp2yaf2silk assumes that the yyaaff(1) (<http://tools.netsa.cert.org/yaf/>) and rrwwiippffiixx22ssiillkk(1) commands are available on your system as it is a simple Perl wrapper around those commands. rrwwiippffiixx22ssiillkk(1) converts a stream of IPFIX (Internet Protocol Flow Information eXport) records to the SiLK Flow record format. rrwwssiillkk22iippffiixx(1) converts a stream of SiLK Flow records to an IPFIX (Internet Protocol Flow Information eXport) format. rrwwppccuutt(1) reads a packet capture file and print its contents in a textual form similar to that produced by rwcut. rrwwppddeedduuppee(1) detects and eliminates duplicate records from multiple packet capture input files. See also rrwwddeedduuppee(1). rrwwppmmaattcchh(1) filters a packet capture file by writing only packets whose five-tuple and timestamp match corresponding records in a SiLK Flow file. rrwwppttooffllooww(1) reads a packet capture file and generates a SiLK Flow record for every packet. rrwwppdduu22ssiillkk(1) creates a stream of SiLK Flow records from a file containing NetFlow v5 PDU records. Scan detection rrwwssccaann(1) attempts to detect scanning activity from SiLK Flow records. rwscan can produce files that can be loaded into a database and queried with rwscanquery. rrwwssccaannqquueerryy(1) queries the scan database which has been populated from database load files generated by rwscan. Flow utilities rrwwccaatt(1) reads SiLK Flow records from the files named on the command line, or from the standard input when no files are provided, and writes the SiLK records to the specified output file or to the standard output if it is not connected to a terminal. rrwwaappppeenndd(1) appends the SiLK Flow records contained in the second through final file name arguments to the records contained in the first file name argument. rrwwccoommbbiinnee(1) reads SiLK Flow records from files named on the command line or from the standard input. For records where the attributes field contains the flow timed-out flag, rwcombine attempts to find the record with the corresponding continuation flag set and combine those records into a single flow. rwcombine writes the results to the named output file or to the standard output. rrwwccoommppaarree(1) determines whether two SiLK Flow files contain the same flow records. rrwwddeedduuppee(1) reads SiLK Flow records from files named on the command line or from the standard input and writes the records to the named output path or to the standard output, removing any duplicate flow records. Note that rwdedupe will reorder the records as part of its processing. rrwwrraannddoommiizzeeiipp(1) generates a new SiLK Flow file by substituting a pseudo-random IP address for the source and destination IP addresses in given input file. rrwwrreeccggeenneerraattoorr(1) generates SiLK Flow records using a pseudo-random number generator; these records can be used to test SiLK applications. rrwwsspplliitt(1) reads SiLK Flow records and generates a set of sub-files from the input. The sub-files can be limited by flow-, byte-, or packet-counts, or by unique IP count. In addition, the sub-file may contain all the flows or only a sample of them. rrwwsswwaappbbyytteess(1) generates a new SiLK Flow file by changing the byte order of the records in a given input SiLK Flow file. Utilities rrwwffiilleeiinnffoo(1) prints information (type, version, etc.) about a SiLK Flow, IPset, Bag, or Prefix Map file. rrwwssiitteeiinnffoo(1) prints information about the sensors, classes, and types specified in the ssiillkk..ccoonnff(5) file. rrwwttuucc(1) generates SiLK flow records from textual input; the input should be in a form similar to what rrwwccuutt(1) generates. rrwwffgglloobb(1) prints to the standard output the list of files that rwfilter would normally process for a given set of file selection switches. nnuumm22ddoott(1) reads delimited text from the standard input, converts integer values in the specified column(s) (default first column) to dotted-decimal IP address, and prints the result to the standard output. rrwwggeeooiipp22ccccmmaapp(1) reads the MaxMind GeoIP database and creates the country code mapping file that can be used by SiLK (see ccccffiilltteerr(3)). rrwwiiddssqquueerryy(1) invokes rwfilter to find flow records matching Snort signatures. rrwwrreessoollvvee(1) reads delimited text from the standard input, attempts to resolve the IP addresses in the specified column(s) to host names, and prints the result to the standard output. ssiillkk__ccoonnffiigg(1) prints information about how SiLK was compiled; this information can be used to compile and link other files and programs against the SiLK header files and libraries. Deprecated tools mmaappssiidd(1) maps between sensor names and sensor IDs using the values specified in the ssiillkk..ccoonnff(5) file. mapsid is deprecated as of SiLK 3.0.0, and it will be removed in the SiLK 4.0 release. This functionality is available in rrwwssiitteeiinnffoo(1). rrwwgguueessss(8) reads a file containing NetFlow v5 PDU records and prints the SNMP interfaces that are used most often and the number of records seen for each interface. rwguess is deprecated as of SiLK 3.8.3, and it will be removed in the SiLK 4.0 release. Similar functionality is available using a combination of rrwwppdduu22ssiillkk(1), rrwwssttaattss(1), and rrwwuunniiqq(1). rrwwiipp22cccc(1) maps a (textual) list of IP addresses to their country code. rwip2cc is deprecated as of SiLK 3.0.0, and it will be removed in the SiLK 4.0 release. This functionality is available in rrwwppmmaappllooookkuupp(1). Packing system The SiLK Packing System is comprised of daemon applications that collect flow records (IPFIX flows from yyaaff(1) or NetFlow v5 or v9 PDUs from a router), convert the records to the SiLK flow format, categorize the flows as incoming or outgoing, and write the records to their final destination in binary flat files for use by the analysis suite. Files are organized in a time-based directory hierarchy with files covering each hour at the leaves. The tools, configuration files, and plug-ins that comprise the SiLK Packing System are: fflloowwccaapp(8) listens to flow generators (devices which produce network flow data) and stores the data in temporary files prior to transferring the files to a remote machine for processing by rwflowpack. rrwwfflloowwppaacckk(8) reads flow data either directly from a flow generator or from files generated by flowcap, converts the data to the SiLK flow record format, categorizes the flow records according to rules loaded from a packing-logic plug-in, and writes the records either to hourly flat-files organized in a time-based directory structure or to files for transfer to a remote machine for processing by rwflowappend. rrwwfflloowwaappppeenndd(8) watches a directory for files containing small numbers of SiLK flow records and appends those records to hourly files organized in a time-based directory tree. rrwwsseennddeerr(8) watches an incoming directory for files, moves the files into a processing directory, and transfers the files to one or more rwreceiver processes. Either rwsender or rwreceiver may act as the server (i.e., listen for incoming network connections) with the other acting as the client. rrwwrreecceeiivveerr(8) accepts files transferred from one or more rwsender processes and stores them in a destination directory. Either rwsender or rwreceiver may act as the server with the other acting as the client. rrwwppoolllleexxeecc(8) monitors a directory for incoming files and runs a user- specified command on each file. rrwwppaacckkcchheecckkeerr(8) reads SiLK Flow records and checks for unusual patterns that may indicate data file corruption. sseennssoorr..ccoonnff(5) is a configuration file for sensors and probes used by rwflowpack and flowcap. ppaacckkllooggiicc--ttwwoowwaayy(3) is one of the plug-ins available that describe a set of rules (the packing-logic) that rwflowpack may use when categorizing flow records as incoming or output. ppaacckkllooggiicc--ggeenneerriicc(3) is one of the plug-ins available that describe a set of rules (the packing-logic) that rwflowpack may use when categorizing flow records as incoming or output.

ENVIRONMENT

The following environment variables affect the tools in the SiLK tool suite. SILK_CONFIG_FILE This environment variable contains the location of the site configuration file, ssiillkk..ccoonnff(5). This variable has precedence over all methods of finding the site file except for the --site-config-file switch on an application. For additional locations where site configuration file may reside, see the "FILES" section. SILK_DATA_ROOTDIR This variable gives the root of directory tree where the data store of SiLK Flow files is maintained, overriding the location that is compiled into the tools (/data). The rrwwffiilltteerr(1) and rrwwffgglloobb(1) tools use this value when selecting which flow files to process unless the user passes the --data-rootdir switch to the application. In addition, the SiLK tools will search for the site configuration file, silk.conf, in this directory. SILK_PATH This environment variable gives the root of the directory tree where the tools are installed. As part of its search for configuration files and plug-ins, a SiLK application may use this variable. See the "FILES" section for details. SILK_IPV6_POLICY For tools that implement the --ipv6-policy switch, this environment variable is used as the value for that switch when it is not provided. SILK_IP_FORMAT For tools that implement the --ip-format switch, this environment variable is used as the value for that switch when it is not provided. Since SiLK 3.11.0. SILK_TIMESTAMP_FORMAT For tools that implement the --timestamp-format switch, this environment variable is used as the value for that switch when it is not provided. Since SiLK 3.11.0. SILK_IPSET_RECORD_VERSION For the IPset family of tools, this environment variable is used as the default value for the --record-version switch when the switch is not provided on the command line. SILK_CLOBBER The SiLK tools normally refuse to overwrite existing files. Setting SILK_CLOBBER to a non-empty value removes this restriction. SILK_PAGER When set to a non-empty string, most of the applications that produce textual output (e.g., rrwwccuutt(1)) will automatically invoke this program to display their output a screen at a time. If set to an empty string, no paging of the output is performed. The --pager switch on the application overrides this value. PAGER The applications that support paging their output will use the value in this environment variable when SILK_PAGER is not set. SILK_TMPDIR This variable is used by tools that write temporary files (e.g., rrwwssoorrtt(1)) as the directory in which to store those files. SILK_TMPDIR overrides the value of TMPDIR. TMPDIR When set and SILK_TMPDIR is not set, temporary files are created in this directory. PYTHONPATH The Python modules and library files required to use PySiLK from rrwwffiilltteerr(1), rrwwccuutt(1), rrwwssoorrtt(1), and rrwwuunniiqq(1) as well as from Python itself are installed under SiLK's installation tree by default. It may be necessary to set or modify the PYTHONPATH environment variable so Python can find these files. For information on using the PySiLK module, see ssiillkkppyytthhoonn(3) as well as the SiLK in Python handbook. PYTHONVERBOSE If the SiLK Python extension or plug-in fails to load, setting this environment variable to a non-empty string may help you debug the issue. SILK_PYTHON_TRACEBACK When set, Python plug-ins will output traceback information on Python errors to the standard error. SILK_RWFILTER_THREADS The number of threads rwfilter uses while reading input files or files selected from the data store. SILK_COUNTRY_CODES This environment variable allows the user to specify the country code mapping file that the SiLK tools use. The value may be a complete path or a file relative to SILK_PATH. See the "FILES" section for standard locations of this file. SILK_ADDRESS_TYPES This environment variable allows the user to specify the address types mapping file that the SiLK tools use. The value may be a complete path or a file relative to SILK_PATH. See the "FILES" section for standard locations of this file. RWRECEIVER_TLS_PASSWORD, RWSENDER_TLS_PASSWORD Used by rrwwrreecceeiivveerr(8) and rrwwsseennddeerr(8), this variable specifies the password to use to decrypt the PKCS#12 file specified in the --tls-pkcs12 switch. TZ When a SiLK installation is built to use the local timezone (to determine if this is the case, check the "Timezone support" value in the output from the --version switch on most SiLK applications), the value of the TZ environment variable determines the timezone in which timestamps are displayed and parsed. If the TZ environment variable is not set, the default timezone is used. Setting TZ to 0 or to the empty string causes timestamps to be displayed in and parsed as UTC. The value of the TZ environment variable is ignored when the SiLK installation uses utc. For system information on the TZ variable, see ttzzsseett(3) or eennvviirroonn(7). SILK_ICMP_SPORT_HANDLER Modifies how "buggy" ICMP SiLK flow records are handled. ICMP type and code are normally encoded in the destination port field. Prior to SiLK 3.4.0, a bug existed when processing IPFIX bi-flow ICMP records where the type and code of the second records were stored in the source port. SiLK 3.4.0 attempts to work-around this bad encoding by modifying the buggy ICMP SiLK Flow records as they are initially read. However, the change in SiLK 3.4.0 removes a previous work-around designed to fix issues with SiLK Flow records collected prior to SiLK 0.8.0 that originated as NetFlow v5 PDUs from some types of Cisco routers. The ICMP records from these Cisco routers encoded the type and code in the source port, but the bytes were swapped from the normal encoding. When the SILK_ICMP_SPORT_HANDLER environment variable is set to "none", all work-arounds for buggy ICMP records are disabled and the source and destination ports remain unchanged. SILK_LOGSTATS_RWFILTER When set to a non-empty value, rrwwffiilltteerr(1) will treat the value as the path to a program to execute with information about this rwfilter invocation. Its purpose is to provide the SiLK administrator with information on how the SiLK tool set is being used. SILK_LOGSTATS This environment variable is currently an alias for SILK_LOGSTATS_RWFILTER. The ability to log invocations may be extended to other SiLK tools in future releases. SILK_LOGSTATS_DEBUG If the environment variable is set to a non-empty value, rwfilter will print messages to the standard error about the SILK_LOGSTATS value being used and either the reason why the value cannot be used or the arguments to the external program being executed. SILK_PLUGIN_DEBUG When set to a non-empty value, an application that supports plug- ins will print status messages to the standard error as it tries to open each of its plug-ins. SILK_TEMPFILE_DEBUG When set to 1, the library that manages temporary files for rwdedupe, rwsort, rwstats, and rwuniq prints debugging messages to the standard error as it creates, re-opens, and removes temporary files. SILK_UNIQUE_DEBUG When set to 1, the binning engine used by rwstats and rwuniq prints debugging messages to the standard error.

FILES

${SILK_ADDRESS_TYPES} ${SILK_PATH}/share/silk/address_types.pmap ${SILK_PATH}/share/address_types.pmap /usr/local/share/silk/address_types.pmap /usr/local/share/address_types.pmap Possible locations for the address types mapping file. ${SILK_CONFIG_FILE} ROOT_DIRECTORY/silk.conf ${SILK_PATH}/share/silk/silk.conf ${SILK_PATH}/share/silk.conf /usr/local/share/silk/silk.conf /usr/local/share/silk.conf Possible locations for the SiLK site configuration file which are checked when the --site-config-file switch is not provided, where ROOT_DIRECTORY/ is the directory rwfilter is using as the root of the data repository. ${SILK_COUNTRY_CODES} ${SILK_PATH}/share/silk/country_codes.pmap ${SILK_PATH}/share/country_codes.pmap /usr/local/share/silk/country_codes.pmap /usr/local/share/country_codes.pmap Possible locations for the country code mapping file. ${SILK_DATA_ROOTDIR}/ /data/ Locations for the root directory of the data repository. Some applications provide a command line switch to set this value. ${SILK_PATH}/lib64/silk/ ${SILK_PATH}/lib64/ ${SILK_PATH}/lib/silk/ ${SILK_PATH}/lib/ /usr/local/lib64/silk/ /usr/local/lib64/ /usr/local/lib/silk/ /usr/local/lib/ Directories that a SiLK application checks when attempting to load a plug-in. ${SILK_TMPDIR}/ ${TMPDIR}/ /tmp/ Directory in which to create temporary files.

SEE ALSO

Analysts' Handbook: Using SiLK for Network Traffic Analysis, The SiLK Reference Guide, SiLK Installation Handbook, <http://tools.netsa.cert.org/silk/> SiLK 3.11.0.1 2016-02-19 silk(7)

Search: Section: