Dept. of Earth Sciences, University College London, Gower Street, London WC1E 6BT, United Kingdom
2
Emeritus Scientist, Geological Survey of Canada, 601 Booth St., Ottawa, Ontario, K1A 0E8, Canada
*Corresponding author: (e-mail: r.howarth@ucl.ac.uk)
ABSTRACT: The Imperial College of Science and Technology, a constituent college
of the University of London in the 1960s, had the good fortune to be one of the first
colleges in the United Kingdom to have access to digital computing facilities. This
review traces the history of the application of computing in the Geochemical
Prospecting Research Centre and its successor, the Applied Geochemistry Research
Group, as computing moved from being a frontier research area to becoming a
commonplace tool. The three principal areas in which it was involved comprised: the
quality control, and thereby assurance, of analytical data; the production of
pioneering atlases of regional geochemical variation in Northern Ireland (1973) and
England and Wales (1978); and the application of methods introduced by workers
in pattern-recognition and statistics to the interpretation of land-based and marine
regional geochemical data.
KEYWORDS: computers, computing, applied geochemistry, history of geochemistry, history of
statistics, history of cartography, regional mapping, spatial filters, geochemical atlas, SC4020,
LGP2703, multi-element maps, data transformation, factor analysis, cluster analysis, discriminant
analysis, ridge regression, Kleiner-Hartigan trees, robust statistics, quality assurance
The Geochemical Prospecting Research Centre (GPRC) was
established in 1954, under the direction of Professor John
Stuart Webb (19202007), in the Mining Geology section of
the Royal School of Mines (RSM), Imperial College of Science
and Technology (ICST), London. Initial studies were concerned with mineral prospecting using soil and drainage sampling in Northern Rhodesia (Zambia), Uganda, Sierra Leone,
Bechuanaland (Botswana), Tanganyika (Tanzania), British
North Borneo (Sabah, East Malaysia), Burma (Myanmar) and
the Federation of Malaya (West Malaysia), and extended in the
1960s to Southern Rhodesia (Zimbabwe), the Philippine
Republic, Borneo (now divided between Malaysia and
Indonesia), Fiji, East Africa, Australia, and the United
Kingdom. By 1960, its studies had broadened into regional
geochemistry, based on the analysis of stream sediments. In
1963, Webb initiated the first of a series of investigations
concerning the relationship between regional geochemistry and
agricultural problems in livestock in Eire (Webb 1964; Webb &
Atkinson 1965). The application of geochemistry to marine
mineral exploration began in 1964 (Tooms 1967). Consequently, by 1963, the Centres name was changed to the
Applied Geochemistry Research Group (AGRG) to reflect the
increasing breadth of its applications.
The work of the GPRC and AGRG was underpinned by
developments in two complementary spheres: methods and
instrumentation for chemical analysis (discussed in the paper by
Michael Thompson (2010) and computing (Fig. 1). The latter
facilitated: (i) statistical quality-assurance in the analytical labGeochemistry: Exploration, Environment, Analysis, Vol. 10 2010, pp. 289315
DOI 10.1144/1467-7873/09-238
290
Fig. 2. Portion of a point-symbol map of the concentration of cold-extractable copper (ppm) in the <80-mesh stream sediment fraction over the Namwala concession area, Zambia. Concentration
isolines (red) were interpolated by eye and hand-drawn. (from Webb et al. 1964a, Map I; original is 30 91 cm in size).
292
293
Fig. 3. Comparison of (left) cubic trend-surface and (right) rolling mean (moving-average) maps for nickel concentrations (ppm) in bedrock,
soil, and the <80-mesh stream sediment fraction over eastern Sierra Leone. Percentage values in each map are a measure of goodness of fit.
Nichol & Webb (1967, fig. 4; redrawn from Garrett 1966).
294
295
Fig. 5. (a) IBM7090 computation time (min) and (b) Calcomp pen
plotter time (min) to plot maps of values at 100 to 1000 uniformly
randomly distributed locations, in 1969. Redrawn from Khaleelee
(1969, figs 17, 18).
296
297
298
299
vectors, the cos coefficient (Imbrie 1963; for which significance levels were given by Howarth 1977). Imbrie (1963) called
this approach Q-mode analysis, in contrast to calculation of
the more familiar correlation coefficient matrix between the p
variables (which he called R-mode analysis). In a conventional
R-mode factor analysis, the first step is to compute the k
uncorrelated principal components, then to retain the first p of
these and rotate them (e.g. using the varimax criterion) to
obtain the final solution. In the Q-mode approach, each of the
k factors corresponds to an actual sample of extreme composition. These are end-members, and the entire data set is thus
represented in terms of the relative contributions of these
end-members, the factor scores increasing from zero to unity
as a samples composition approaches more exactly that of one
of the end-member reference vectors. This approach was first
applied at Northwestern to compositional analysis of variations
in carbonate sediments (Imbrie & Purdy 1962) and to lithostratigraphy (Krumbein & Imbrie 1963). The availability of a
computer program for the IBM 7094/1401 computer system
(Manson & Imbrie 1964) enabled Q-mode factor analysis to be
taken up by Garrett (1967), who was able to successfully apply
it to interpretation of the geochemistry of his regional stream
sediment data from Sierra Leone (Garrett 1966, 142148;
Nichol & Webb 1967; Nichol et al. 1969) (Fig. 16). However,
because of memory-size limitations, Q-mode analysis was at
this time subject to the serious restriction of handling a
maximum of 100 samples.
The relatively small number of samples capable of being
analysed by Q-mode factor analysis program proved a powerful
limitation where the large data sets involved in regional geochemistry were concerned. On completion of his thesis, Garrett
visited Northwestern University on a post-doctoral research
300
301
Fig. 10. Chromalin trial prints (Jan. 1976) of the LGP2703 laser-plotted percentile-based maps for zinc concentrations (ppm) in the <80-mesh
stream sediment fraction over England and Wales: (a) unsmoothed; (b) moving-average smoothed and gap-filled (originals are 27 40 cm).
302
303
304
305
sx 1d , 0
ln sxd, = 0
x . 0,
306
Fig. 16. Q-mode factor analysis maps for the geochemistry of the <80-mesh stream sediment fraction, Nimini Hills, eastern Sierra Leone: (upper
left) vector 1, (upper right) vector 2, (lower left) vector 3, end-members shown by solid dot in each case; (lower right) communality. Garrett
(1966, fig. 52; original is 18 23 cm).
307
Data displays
The arrival of the interactive statistical package MINITAB
(Ryner et al. 1976) on the Colleges distributed terminal system
enabled routine data analysis to be used by both staff and
students in AGRG and, because it embodied much of the
recent thinking on graphics-based Exploratory Data Analysis
(Tukey 1977), box-plots, quantile-quantile plots and other
graphical displays were soon taken up in AGRG work (Earle
1982; Howarth 1984; Turner 1986). Earle (1982, 168183)
developed a program (GIRAF) for the interactive dissection of
probability plots into constituent sub-populations. Turner
(1986, 166179) showed the utility of multivariate probability
plots, based on the cube-root of the Mahalanobis distance
(Healy 1968; Campbell 1979) for detection of multivariate
outliers.
Use of two new multivariate graphics to portray multielement sample compositions for the purpose of comparison
were extensively investigated by Turner (1986), using the
MorayBuchan data set: (i) Chernoff faces (Chernoff 1973),
which assigns features of the human face (e.g. position/style of
eyes, eyebrows, nose, mouth) to different variables to make
308
Fig. 18. Empirical discriminant classification map of the geochemistry of Pb, Ga, V, Mo, Cu, Zn, Ti, Ni, Co, Mn, Cr and Fe2O3 in the <80-mesh
stream sediment fraction over Denbighshire, UK. Training areas are for five lithologies are boxed in; samples assigned to unknown group are
shown solid, of these samples 62% were related to known mineralized areas. (Castillo-Muoz & Howarth 1976, fig. 6).
309
310
Thompson (1976), and Thompson (1978, 1981, 1983), developed an alternative approach, based on duplicate analysis of
randomised splits of routine field samples in which it was
311
312
by the wider geochemical and chemical community (e.g. Analytical Methods Committee 2002) and their approach continues
to be extended in scope (e.g. Stanley 2006; Stanley & Lawie
2008).
In other applications, simulation and regression techniques
have been applied to evaluation of matrix correction and
interference effects (Howarth 1973d; Thompson et al. 1979)
and to the comparison of analytical accuracy between analytical
methods (Thompson 1982). More recently, robust ANOVA
has been used to determine the magnitude of analytical variance
in relation to other sources of variance in geochemical data
(Ramsey et al. 1992).
When John Webb initiated the pioneering series of multielement multi-purpose geochemical atlases in the mid-1960s,
there was inevitable trade-off between the analytical method
used, expected analytical precision, and rapidity of turn-round;
this was not what traditional geochemists were used to, and the
matter proved controversial. AGRG staff had to justify this
new approach (Howarth & Lowenstein 1971, 1972; Webb &
Thompson 1977; Webb et al. 1978). Even today, despite
considerable advances in analytical methods, such a fitness-forpurpose approach to analysis requires explanation (Thompson
& Fearn 1996; Fearn et al. 2002).
Looking back now, it is probably impossible for younger
geochemists to realise just how difficult it was, not only to
implement many of the statistical techniques, where we were
breaking new ground in applied geochemistry, but to convince
potential users of the utility of the results. In a broader
perspective, Garrett et al. (2008) reviewed the development of
international geochemical mapping to date; it is pleasing to
think that AGRG pioneered many of the methods that subsequently became adopted.
The development and implementation of the computer-based
methods over the years described here was enabled by many bodies.
We principally have to thank the Department of Scientific and
Industrial Research and its successor, the Natural Environment
Research Council in Britain for their support to AGRG over many
years; other contributions have come from the Anglo American
Corporation (South Africa) Ltd.; the Institute of Geological
Sciences/British Geological Survey; Roan Selection Trust Technical
Services; Sierra Leone Geological Survey; Ministerio de Economia,
Industria y Comercio de Costa Rica; Wolfson Foundation; and the
U.S. Department of Energy, National Aeronautics and Space
Administration, and Rome Air Development Center (New York).
We are grateful to them all for their assistance, whether through
research contracts, support for studentships, or other help. The
authors are most grateful to the Editor, Gwendy Hall, and the
Association of Applied Geochemists for their assistance with the
funding of the colour illustrations in this paper.
REFERENCES
A, L.H. 1954. The log-normal distribution of the elements (A fundamental law of geochemistry and its subsidiary). Geochemica et Cosmochimica
Acta, 5, 4973; 6, 121131.
A, J. 1982. The statistical analysis of compositional data. Journal of the
Royal Statistical Society, B44, 139177.
A, J. 1986. The Statistical Analysis of Compositional Data. Chapman and
Hall. London and New York.
A, H. 1978a. A method of bivariate interpolation and smooth surface
fitting for irregularly distributed data points. ACM Transactions on Mathematical
Software, 4, 148159.
A, H. 1978b. Algorithm 526. Bivariate interpolation and smooth surface
fitting for irregularly distributed data points. ACM Transactions on Mathematical
Software, 4, 160164.
A, H. 1979. Remark on Algorithm 526. ACM Transactions on Mathematical
Software, 5, 242243.
A, P. & K, W.C. 1962. Secondary trend components in the Top
Ashdown Pebble Bed: A case history. Journal of Geology, 70, 507538.
ANALYTICAL METHODS COMMITTEE 2002. A simple fitness-forpurpose control chart based on duplicate results obtained from routine test
materials. Analytical Methods Committee Technical Brief no. 9, at the website:
www.rsc.org/Membership/Networking/InterestGroups/Analytical/
AMC/TechnicalBriefs.asp
A, D.F., B, P.J., H, F.R., H, P.J., R, W.H. &
T, J.W. 1972. Robust estimates of location. Survey and advances. Princeton
University Press, Princeton, NJ.
A, J.D., R, B.G. & T, I. 2008. National scale
estimation of potentially harmful elements background concentrations in
topsoil using parent material classified soil:stream sediment relationships.
Applied Geochemistry, 23, 25962611.
A-B, A. 1971. Provincial and regional geochemical studies in Zambia.
Unpublished PhD thesis, University of London, UK.
A-B, A. & N, I. 1970. Regional geochemical reconnaissance and the location of metallogenic provinces. Economic Geology, 65,
312330.
B, G.H. & H, D.J. 1965. ISODATA, a novel method of data analysis
and pattern classification. Stanford Research Institute, Menlo Park, CA.
Research Report, AD-699616, April 1965.
B, P. & D, B.R. 1955. Numerical weather map analysis. Tellus,
7, 1660.
B, A. 1943. On a measure of divergence between two
statistical populations defined by their probability distributions. Bulletin of
the Calcutta Mathematical Society, 35, 99109.
B, G.E.P. & C, D.R. 1964. An analysis of transformations. Journal of the
Royal Statistical Society, B26, 211252.
B G S. 1991. Regional geochemistry of the East Grampians
area. British Geological Survey, Keyworth, Nottingham.
B, A., M-F, G. & P-G, V. (eds) 2006.
Compositional Data Analysis in the Geosciences. From Theory to Practice. Geological Society, London, Special Publication, 264.
C, N.A. 1979. Canonical variate analysis: Some practical aspects. Unpublished PhD thesis, University of London, UK.
C, N.A. 1980. Robust procedures in multivariate analysis. I. Robust
covariance estimation. Applied Statistics, 29, 231237.
C, N.A. 1982. Statistical treatment of geochemical data. In: S,
R.E. (ed.) Geochemical Exploration in Deeply Weathered Terrain. CSIRO
Institute of Energy and Earth Resources, Floreat Park, WA, 141144.
C-M , R. 1973. Application of discriminant and cluster analysis to regional
geochemical surveys. Unpublished PhD thesis, University of London, UK.
C-M , R. & H, R.J. 1976. Application of the empirical
discriminant function to regional geochemical data from the United
Kingdom. Bulletin of the Geological Society of America, 87, 15671581.
C, P.E. 1998. A history of modern computing. MIT Press, Cambridge, MS.
C, F. 1971. Ratio Correlation. A Manual for Students of Petrology and
Geochemistry. The University of Chicago Press, Chicago and London.
C, H. 1973. The use of faces to represent points in K-dimensional
space graphically. Journal of the American Statistical Association. 68, 361368.
C, R. & K, P.G. 1976. Digital image processing on a microfilm
plotter. Unpublished report, University of London Computer Centre,
London.
C, R.N. 1986. A statistical appraisal of regional geochemical data from the
south-west Pacific for mineral exploration. Unpublished PhD thesis, University of
London, UK.
C, R.N. & C, D.S. 1987. A statistical evaluation of geochemical
data in regard to bedrock and placer mineral exploration in the S.W.
Pacific. Marine Mining, 6, 205221.
C, C.A.U. 1954. Statistical estimation of the accuracy of assaying.
Transactions of the Institution of Mining & Metallurgy, London, 63, 551563.
C, D.A. 1974. Application of multivariate methods to regional geochemistry: the
evaluation of a new technique. Unpublished MSc thesis, University of London,
UK.
D, M. 1977. Geostatistical ore reserve estimation. Elsevier, Amsterdam.
D, P.R. 1983. Geochemical applications of ridge regression for tinmineralised
granitoids. Unpublished MSc thesis, University of London, UK.
D, J.C. & S, R.J. 1973. Statistics and data analysis in geology. John
Wiley & Sons, New York.
D, W.J. & M, F.J. 1957. Introduction to Statistical Analysis. 2nd edition.
McGraw-Hill Book Co, New York.
D, J.R.V. 1975. Variability in some stream sediment geochemical data from
Australia. Unpublished PhD thesis, University of London, UK.
E, S.A.M. 1982. Geological interpretation of the geochemistry of stream sediments,
waters and soils in the Bristol district, with particular reference to the Mendip Hills,
Somerset. Unpublished PhD thesis, University of London, UK.
313
314
315