Page 1 of 10
February 1998
Software Review:
Stat::Fit
Distribution fitting software makes simulation more
attractive, viable in many applications
By James C. Benneyan
Stat::Fit is a probability distribution fitting software package designed to
help users more easily test the fit of hypothesized statistical models to
empirical data and, ultimately, to identify the best candidate distribution
for a given scenario. Developed by Geer Mountain Software, the primary
intended users are simulation analysts who, by the nature of their work,
frequently need to determine appropriate distributions for any number of
random events or activities. Several nice features also make Stat::Fit
helpful for other possible uses, including basic data analysis and as a
teaching tool.
While a few similar products exist [1-4] a number of things make
Stat::Fit particularly appealing, not the least of which are its flexibility,
user-friendly GUI interface, and several secondary capabilities which fall
in the bells-and-whistles category. The obvious value of such a tool, used
in conjunction with commercial or special-purpose simulation programs,
is to largely free the analyst from the burden of testing and verifying
appropriate model inputs that otherwise can require sufficient time and
statistical background. In the past a few goodness-of-fit programs
therefore have been bundled with simulation software so as to automate
these "front-end activities as much as possible and allow a user to focus
more on other issues.
Overview
System requirements to run Stat::Fit are minimal by today's personal
computer standards, requiring "at least a 386 IBM compatible PC
running MS Windows 3.1 or higher, 4 MB of RAM and 6 MB of
available hard drive space. At the present time, Stat::Fit is not available
for the Macintosh. Software installation and self-familiarization are a
breeze. I intentionally ignored the user manual and stampeded on to
loading and executing Stat::Fit (my usual modus operandi, although here
conveniently claiming a valid excuse).
Stat::Fit has all the standard capabilities one would expect, including the
ability to import a data file, calculate several summary statistics, plot
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 2 of 10
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 3 of 10
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 4 of 10
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 5 of 10
the mouse, as shown in Figure 3, a user can instantly see the change in
shape, viewed either in PDF or CDF form dependent upon preference.
(While a similar feature can be found elsewhere [2], I have found the
Distribution Viewer to be a bit more user-friendly and visually
appealing.) Such a tool alone is well-worth the investment for anyone
who in the past has iteratively plotted PDFs by hand, in spreadsheets, or
in other software (or even generated large data sets and then constructed
histograms of these).
Educators in particular may also find this feature useful, especially those
who traditionally cover this material via hardcopy, transparencies or
cumbersome software. I have found this to be a much more dynamic way
to illustrate in classroom settings the various shapes that different PDFs
can assume, with very positive feedback from students as well. A
random number generator menu also can be used to explore any of the
listed PDFs or to export simulated data to a file for use elsewhere.
Stat::Fit also has an interesting "repopulate feature for continuous
random variables that have been rounded to integers as an artifact of data
collection, with the idea being to increase estimator accuracy by
replacing the truncated decimal portions of each datum with randomly
generated values based on the hypothesized pdf. A data file also can be
edited, transformed and algebraically manipulated in several manners,
with most open windows being immediately recalculated (again a very
nice teaching tool).
All of the above features offer a good deal of user customization,
including the abilities to set many analysis and format defaults, to
designate a default directory for saving and retrieving files, and to insert
bookmarks and free-form text annotations in the help facility once
information of interest is located. Other conveniences include the ability
to filter a data set in several ways, general overall flexibility in options
and graph formats, and a button for calculating a table of maximum
likelihood or method-of-moments estimates for selected distributions
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 6 of 10
(without running the fit routines), perhaps necessary for some other
need. Analysis results (including graphs) can be stored in a "project file,
saving the user from starting completely over each time a data file is
analyzed (although some formatting can be lost).
My Personal Wish List
In the course of evaluating this software it was only natural to develop a
few ideas as to what would make it even better. While I was pleasantly
surprised to find my list of grievances small, here are my top four
wishes:
1. Distribution Viewer:
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 7 of 10
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 8 of 10
Vendor Comments
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 9 of 10
References
1. Swain, J.J., Venkatraman S., Wilson, J.R. (1988), "Distribution
Selection and Validation, Journal of Statistical Computation and
Simulation, Vol. 29, pp. 271-297.
2. Sugiyama, S.O., Chow, J.W. (1997), "@Risk, RiskView, and
BestFit, OR/MS Today, April 1997, pp. 64-66.
3. Vincent, S.G., Law, A.M. (1992), "UniFit II: Total Support for
Simulation Input Modeling, Proceedings of the 1992 Winter
Simulation Conference, pp. 371-376.
4. Gottfried, B.S. (1993), "Use of Computer Graphics in Fitting
Statistical Distribution Functions to Data Representing Random
Events, Simulation, April 1993, pp. 281-286.
5. Cochran, W.G. (1952), "The X2 Test of Goodness of Fit, Annals of
Mathematical Statistics, Vol. 28, pp. 315-345.
6. Kolmogorov, A. (1941), "Confidence Limits for an Unknown
Distribution Function, Annals of Mathematical Statistics, Vol. 12, pp.
461-465.
7. Anderson, T.W., Darling, D.A. (1954), "A Test of Goodness of Fit,
Journal of the American Statistical Association, Vol. 49, pp. 765-769.
8. Cheng, R.C.H. (1994), "Selecting Input Models, Proceedings of
the 1994 Winter Simulation Conference, pp. 184-191.
9. D'Agnostino, R.B., Stephens, M.A., eds (1986), "Goodness-of-Fit
Techniques, New York: Marcel Dekker.
10. Dumonceau, R., Antle, C.E. (1973), "Discrimination Between the
Lognormal and Weibull Distributions, Technometrics, Vol. 15, pp.
923-926.
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009
Page 10 of 10
11. Hinz, P., Gurland, P. (1970), "A Test of Fit for the Negative
Binomial and Other Contagious Distributions,Journal of the American
Statistical Association, Vol. 65, pp. 887-903.
12. Neyman, J. (1937), "Smooth Test for Goodness of Fit,
Skandinavisk Aktuarietidskrift, Vol. 20, pp. 150-199.
James C. Benneyan is a professor of industrial engineering and
operations research at Northeastern University in Boston where he
teaches and researches in the areas of industrial statistics,
probability theory, statistical quality control and computer simulation.
For more information, put the number 5 in the appropriate
space on the
Reader Service Form
z Table of Contents
z OR/MS Today Home Page
OR/MS Today copyright 1998 by the Institute for Operations Research and the
Management Sciences. All rights reserved.
Lionheart Publishing, Inc.
506 Roswell Street, Suite 220, Marietta, GA 30060, USA
Phone: 770-431-0867 | Fax: 770-432-6969
E-mail: lpi@lionhrtpub.com
URL: http://www.lionhrtpub.com
Web Site Copyright 1998 by Lionheart Publishing, Inc. All rights reserved.
http://lionhrtpub.com/orms/orms-2-98/swr.html
11/17/2009