"Linux Gazette...making Linux just a little more fun!"
 
  
 Gathering Usage Stats
 
   
Intro
Here in the Linux Laboratory at Northern Michigan University, we have quite
a few users and quite a few computers for them to use. It is important
for laboratoies like us to quantify usage. This
data can be used to justify expansion of a computer laboratory, describe
who is actually using the machines, which machines are being used,
or just satisfy simple curiosity.
Being the curious type, I sat down to write a program that would gather
usage information. The information I wanted includes:
- 
How much time each user spends online.
- 
How much time each computer spends being used.
- 
How often the computer is up.
- 
User total usage time divided by weeks (to see long term trends).
- 
User total usage time divided by day for the last couple of days (to see
current trends).
Methodology
My first thought was to just stick my head in at odd times and count users.
But for such a strategy to work, I would have to count users at various
times in the day, including times I might not otherwise be inclined to
visit the lab (like early mornings). Further, I would miss users
using the lab remotely, over the internet.
My second thought was to use the "w" command. This command reads
a log file (normally /var/log/wtmp) and produces a line of output for every
logon event in the past, describing who was logged on and for how long.
My hope was that a summary of this information would provide the usage
statistics I was looking for. Unfortunately, this command does not
produce foolproof output. If the machine crashes while someone is
logged on, then "w" will sometimes produce the wrong total time online.
Even worse, if a person is logged on but idle, this idle time still counts
as usage as computed by "w".
Counting idle time was unacceptable to me. We have several
users with computers in their offices, and they are essentially logged
on 24 hours per day 7 days per week. Their usage is nowhere near
this level (yes, even college professors go to sleep!)
Luckily , there was an alternative to "w". The easiest way to
find out who is currently logged onto a computer is to use finger, a program
designed for just this purpose. The command "finger @hostname"
will describe who is logged on to "hostname", and how long since they actually
typed a command (i.e. finger knows their idle time).
Finger produces a header line, and the one line for every person logged
on. Eliminating the users with a high idle time will provide a list
of users who are using the computer at any given moment. A log file
of such lists, gathered at regular intervals, will describe usage over
the time the log file was gathered.
There is an important statistical assumption here. We assume that
a set of entries will accurately describe usage over the whole time period,
not just the precise moments when those entries occur. For
this assumption to be valid the entries should be gathered at regular intervals.
Defining Usage
The other complicated issue is to define usage. Often a single computer
will have several users logged on simultaneously, and often a single user
will be logged on to multiple computers at once (as I am now). It
becomes important to carefully define usage in these cases. I adopted
the following definitions.
- 
A computer is in use if and only if there is at least one user using that
computer.
- 
A user is logged on if and only if the user is logged onto at least one
computer.
- 
A computer is up if and only if it responds to the finger command at all,
and is otherwise down. Note that a computer that is currently running
Windows will NOT respond, and will therefore be counted as down (which
makes sense to me!).
Given these definition, it becomes important not double count users where
they are logged in more than once, and to not double count computers when
they have more than one user. Correct programming eliminates these
double countings (see the source code below).
The Log file
The log file contains a series of records, each one of which is a description
of the results of running finger on the set of hosts. The size of
each entry is minimized, since many entries will be gathered yet the log
file should remain modest in size. The top of each entry contains the date
and time the entry was gathered, which is important for gathering time
and date based statistics. The log file entry below shows that it
is 11 45 in the evening on 10/11/97, and that I am the only one logged
in besides root. Root and I are using the computers ogaa and ogimaa.
Also shown is that the computer nigig is down, since it is not listed at
all.
| Date 97 10 11 23 45 Host ogimaa 1
 Host bine 0
 Host gaag 0
 Host makwa 0
 Host mooz 0
 Host zagime 0
 Host ogaa 1
 Host euclid 0
 Host euler 0
 Host fermat 0
 User randy
 User root
 Total 2 users
 | 
The Program
The program is named fingersummarize, since its job is to summarize a set
of results from the finger command. It is written in Perl, since
Perl offers wonderful support for associative arrays (where the usage stats
are stored) and working with strings (from the log file and the output
of finger).
There are two basic tasks of fingersummarize. These functions could
easily be done with two separate programs, but I find it easier to have
one program with options rather than two executables.
- 
It should gather finger results, and store them in a log file. (fingersummarize
-probe)
- 
It should read the log file and produce the usage statistics. (fingersummarize
-print)
Fingersummarize can be installed easily. Just follow the instructions
below.
- 
Copy the executable to someplace on your system, such as /usr/local/bin.
      cp /tmp/fingersummarize /usr/local/bin;
chmod 755 /usr/local/bin/fingersummarize
- 
Edit the top of the executable so that fingersummarize will probe your
machines instead of mine. This should be very easy to do.
             
vi /usr/local/bin/fingersummarize
- 
Make a blank log file and put that log file somewhere. Often /var/log/fingersummarize
is a reasonable place.
      echo > /var/log/fingersummarize;
chmod 600 /var/log/fingersummarize
- 
Install a line in cron so that fingersummarize will run in probe mode at
regular intervals. Below is the line I use, which runs fingersummarize
every fifteen minutes for every hour.
              
0,15,30,45 * * * * /usr/local/bin/fingersummarize -probe >> /var/log/fingersummarizelog
That's it. Now, whenever you want to see a current summary of the
usage data, just run
            
fingersummarize -print < /var/log/fingersummarizelog
Example Output
Here is some sample output. A current example for my lab can he had
at http://euclid.nmu.edu/fingerprobe.txt
. The executable itself can be had at http://euclid.nmu.edu/~randy/Papers/fingerprobe
. Note that the total number of hours computers were in use (12.8
hours/week) exceeds the total number of hours that people were using computers
(10.8hours/week). This just means there were times that some person
was using more than one computer at a time. Also, note that the useage
spikes at 10am, since a particular class sometimes meets in the lab at
10am.
| Stats by user User    Total   Usage   Hours
 Name    Observ. Percent /Day
 abasosh 47      4      
0.42
 agdgdfg 54      4.6    
0.49
 arnelso 7       0.6    
0.06
 bparton 2       0.1    
0.01
 bob     28      2.4    
0.25
 brandk  101     8.7    
0.92
 btsumda 37      3.2    
0.33
 chgijs  1       0      
0
 clntudp 1       0      
0
 daepke  2       0.1    
0.01
 dan     93      8      
0.84
 dfliter 17      1.4    
0.15
 gclas   43      3.7    
0.39
 goofy   15      1.3    
0.13
 gypsy   2       0.1    
0.01
 jadsjhf 2       0.1    
0.01
 jbsdjh  2       0.1    
0.01
 jdefgg  2       0.1    
0.01
 jeffpat 6       0.5    
0.05
 jpaulin 7       0.6    
0.06
 jstyle  4       0.3    
0.03
 jstamo  17      1.4    
0.15
 jwilpin 37      3.2    
0.33
 jwilpou 79      6.8    
0.72
 kangol  39      3.3    
0.35
 matt    58      5      
0.52
 mhgihjj 8       0.6    
0.07
 randy   187     16.2   
1.7
 rbush   2       0.1    
0.01
 root    22      1.9    
0.2
 rpijj   2       0.1    
0.01
 sbeyne  17      1.4    
0.15
 sdajani 1       0      
0
 sdalma  28      2.4    
0.25
 ship    1       0      
0
 skinny  48      4.1    
0.43
 stacey  2       0.1    
0.01
 tbutler 35      3      
0.31
 tmarsha 5       0.4    
0.04
 tpauls  34      2.9    
0.31
 vladami 30      2.6    
0.27
 xetroni 26      2.2    
0.23
 ---------------------------------
 Overall 1151           
10.24
 
 Stats by Host
Host    Total   Percent Percent Hours
 Name    Observ. Up      
Busy   /Day
 bine    131     100%   
4.9%    1.194
 euclid  152     100%   
5.7%    1.386
 euler   7       89.3%  
0.2%    0.068
 fermat  52      100%   
2.1%    0.506
 gaag    202     36.5%  
7.6%    1.842
 maang   118     100%   
4.4%    1.076
 makwa   77      100%   
2.9%    0.702
 mooz    92      100%   
3.4%    0.839
 nigig   81      100%   
3%      0.738
 ogaa    48      100%   
1.8%    0.437
 ogimaa  374     100%   
14.2%   3.411
 waabooz 28      100%   
1%      0.255
 zagime  38      100%   
1.4%    0.346
 ------------------------
 Overall 2551     94.2%  4.1%   
12.807
 | Stats by the Week Week           
User
 Starting        Hours
 97 10 04        74.5705816481128
 97 09 28        55.9130434782609
 97 09 21        64.7
 97 09 14        113.023956442831
 
 Last Two Weeks
Day      User
 Hours
 97 10 11 7.05882352941176
 97 10 10 16.75
 97 10 09 4.25
 97 10 08 1.5
 97 10 07 5.25
 97 10 06 8.25
 97 10 05 13.8947368421053
 97 10 04 17.6170212765957
 97 10 03 9.91304347826087
 97 10 02 0.75
 97 10 01 1
 97 09 31 12
 97 09 30 9.75
 97 09 29 12.75
 
 Stats by the Hour
Hour    Avg Users
 00      0.151
 01      0.163
 02      0.151
 03      0.053
 04      0.036
 06      0.027
 07      0.055
 08      0.175
 09      0.75
 10      1.398
 11      1.171
 12      0.972
 13      0.814
 14      0.775
 15      0.778
 16      0.607
 17      0.526
 18      0.459
 19      0.455
 20      0.232
 21      0.321
 22      0.339
 23      0.196
 
 | 
 
  
Copyright © 1998, Randy Appleton
 
Published in Issue 25 of Linux Gazette, February 1998
 
  
![[ TABLE OF CONTENTS ]](../gx/indexnew.gif) 
![[ FRONT PAGE ]](../gx/homenew.gif) 
 
