World Wide Web (WWW) Traffic Analysis Tools
Computer Science Department
Virginia Polytechnic and State University
Blacksburg, VA 24061-0106
For the purposes of conducting analyses of WWW traffic at Virginia Tech, we have developed
several tools which can:
- Convert CERN-common Log files to a form suitable for the
CHITRA
analysis package,
- Collect WWW traffic directly from an ethernet backbone by filtering and processing
output from tcpdump, producing a CERN-common Log file..or an extended log file.
- Anonymize CERN-common Log files by mapping client IP and user-name fields to
unique integers.
Tools currently available:
- Additions for V1.1 (New as of 27-Jun-96)
- Map integers are now broken by the number of "parts" to an IP or URL.
- You can now map Client IP, Server IP and the relative URL.
- Map integers are consistent between Client IP and Server IP "parts".
- You can save the integer mapping in order to maintain map consistency between log files.
- You can define Client and Server IP substrings that you consider "LOCAL".
- Additions for V1.2 (New as of 15-Oct-97)
- Selecting a "-level" of "URL" now also maps any query string to a unique integer.
Here is the help for CLFmunge.pl:
USAGE: CLFmunge.pl [-h -use <dir> -local "list" -level "list"]
CLFmunge.pl maps sensitive client info from a Common Log Format server or
proxy log to a unique unrevertable integer.
-h prints this message
-use <directory> gets and saves all map information for commonality
between multiple log files.
-level "comma-sep list" DEFAULT 'CIP' which fields to map
Choices: 'CIP' Client IP address
'SIP' Server IP address
'URL' Relative path and filename in URL
NOTE: UID is ALWAYS mapped if it exists.
-local "comma-sep list" domain names for clients and servers in this
will be mapped to the key phrase 'local'.
NOTE: DO NOT start substitution strings with "."
Formal argument is a Common Log Format file.
The output file is .mng
Go to the home page for the
Chitra project
SUCCEED
The development of these tools was supported, in part, by a
National Science Foundation grant CCR-9211342 and the National Science
Foundation SUCCEED Coalition Cooperative Agreement No. EID-9109853).SUCCEED
is a coalition of eight schools and colleges working to enhance engineering
education for the twenty-first century.
Authors of WWW tools in Chitra:
Please send inquiries and commments to
succeed-people@vtopus.cs.vt.edu.
Number of accesses:
(since Feb 8, 1996).
Last Modified: 27-June-1996