Electronic Review of Computer Books

[ ERCB Home | New | Feature | Brief | DDJ | Letters | Links ]

[an error occurred while processing this directive]

Vital Statistics

Title Web Site Stats: Tracking Hits and Analyzing Traffic
Author Rick Stout
Publisher Osborne McGraw-Hill
Berkeley, CA
http://www.osborne.com/
Copyright 1997
ISBN 0-07-882236-X
Pages 298 (plus CD-ROM)
Price $29.95


Web Site Stats

As the number of commercial and advertising-supported web sites continues to grow, the importance of techniques to gain insight into who is visiting those sites increases as well. Rick Stout's Web Site Stats is an excellent introductory text for webmasters who want to make sense of the data site visitors leave behind.

Stout begins with the basics, showing the reader what kind of logs Web server software packages maintain, how the logs relate to each other, and what format inconsistencies to look out for when reconciling data from the various files. While he admits that much of the information in this section is elementary, the author correctly points out that reviewing the basic files (and common "consolidated" log formats which bring together the most relevant information from the raw logs) is an essential first step to making sense of the data at hand.

With the basics out of the way, Stout walks the user through a sample analysis, pointing out some of the tricks and traps inherent in analyzing incomplete records. Two potential problem areas are tracks left by users of multi-user machines and those of visitors who connect to an Internet Service Provider which assigns dynamic IP addresses to its SLIP (Serial Link Interface Protocol) and PPP (Point-to- Point Protocol) customers. The former case confounds analysis as multiple users could visit the same pages repetitively but appear as a single user coming back many times, while the latter circumstance makes a single user's visits appear as if several different individuals were viewing the site. There's no way around these difficulties, but if the logs are small enough sorting by various elements of the log file (like the IP address of the machine making the connection or the referring page) can allow webmasters to judge the likelihood of whether a set of visits represents interest from one or more users.

The latter half of the book compares the merits of log file analysis software packages, third-party auditing companies, and ad-tracking services. I can't comment on the accuracy of Stout's assessments of the packages and analysis services directly as I didn't load all of the sample software included on the book's CD, but crosstables in Appendix A listing the features of the packages he reviewed (which included several freeware and shareware programs) seem quite complete. Software and services are upgraded constantly, but Stout has established a site at http://www.websitestats.com to supplement the information in the book and keep readers current on developments in the area.

Web Site Stats is a good book, especially for relatively inexperienced webmasters or individuals who would like an overview of the means available for examining their log files, but Stout does give short shrift to users' privacy concerns regarding "cookies," files some web servers write directly onto users' hard drives. The case Stout cites is one where a Java or JavaScript programmer wrote a program to determine a user's e-mail address, embedded the information in a cookie, and was thereby able to track the user every time he or she returned to the site (not to mention potentially adding the address to lists used for unsolicited commercial e-mail). While Stout is absolutely correct, from a technical standpoint, that the problem is with Java and not cookies, in my opinion he does not sensitize his readers to the growing concern over online privacy.

Despite this drawback, Web Site Stats provides a solid foundation for Webmasters who want to get a clearer picture of who is visiting their sites, whether out of curiosity or to justify advertising dollars.

-- Curtis D. Frye (cfrye@teleport.com)


Quick Rating

Readability Star Star Star Star
Originality Star Star Star
Organization Star Star Star
Accuracy Star Star Star
Consistency Star Star Star Star
Depth Star Star Star Star Star
Timeliness Star Star Star Star HalfStar
Editing Star Star Star Star
Design Star Star Star
Overall Value Star Star Star Star

Explanation of ERCB rating scale: No stars = unacceptable, 1 Star = marginal, 2 Stars = average, 3 Stars = above average, 4 Stars = exceptional, 5 Stars = stellar.


Copyright © 1997 Electronic Review of Computer Books
Created 10/16/97 / Last modified 10/16/97 / webmaster@ercb.com