Abstract---- The present study focuses on treating the missing Recently Subramani and Ponnuswamy (1989) have
values in the split- plot design. Three methods have been used to discussed the non-iterative least squares estimation of
treat the missing values: Coons, Haseman and Gaylor, and missing values in experimental designs and presented
Rubin method. To make preference among these methods some randomized block designs and latin square designs[7]. Bhatra
statistical measurements have been used, which are absolute
error (AE), mean squares error (MSE) and Akaike information
(2013) studied the estimation of m missing observations by
criterion (AIC). From the practical work it is concluded that: specifying the positions and by not positions of the missing
In the case of one missing value was obtained the same values are presented in case of a randomize block design [1].
estimates for missing value. As in cases of two and three missing Three methods have been used to treat the missing values:
values show that the best method for estimating missing values Coons, Haseman and Gaylor, and Rubin method. To make
is Coons method. preference among these methods some statistical
measurements have been used, which are absolute error (AE)
Keywords-- Split- Plot Design, Estimating Missing Values, mean square error (MSE) and Akaike information
Mean Squares Error, Akaike Information Criterion. criterion (AIC).
.
II. SPLIT- PLOT DESIGN
I. INTRODUCTION
www.ijcit.com 337
International Journal of Computer and Information Technology (ISSN: 2279 – 0764)
Volume 05 – Issue 03, May 2016
𝑌.𝑗2.. 𝑌…2
𝑆𝑆𝐵 = − (6)
𝑎𝑟 𝑎𝑏𝑟 A. Coons Method
Coons (1957) was used analysis of covariance model to
Sum square of interaction effect AB: analyze the experiments with missing values .the technique
employs the computational procedures of a covariance
𝑌𝑖𝑗2 . 𝑌…2
𝑆𝑆𝐴𝐵 = − − 𝑆𝑆𝐴 − 𝑆𝑆𝐵 (7) analysis using a dummying X covariance as follows:
𝑟 𝑎𝑏𝑟 In the case of one missing value:
To estimate of missing value by covariance analysis
Sum square of error B:
conducting the following steps [9], [5]:
1) Consider the original data as the dependent
𝑆𝑆𝐸(𝑏) = 𝑆𝑆𝑇 − 𝑆𝑆𝑏𝑙𝑜𝑐𝑘 − 𝑆𝑆𝐴 − 𝑆𝑆𝐵 − 𝑆𝑆𝐴𝐵 − 𝑆𝑆𝐸(𝑎) (8)
variable y of the covariance analysis and inset the
Each has an associated degree of freedom. Mean squares are value of zero in the cell which has the missing
defined as sums of squares divided by degrees of freedom, observation.
the analysis of variance as shown in table(1). 2) Define a variable x where:
𝑋=0 𝑖𝑓 𝑌 ≠ 0
𝑋 = −𝑛 𝑖𝑓 𝑌 = 0
Where: n is the total number of observation in the
experiment including the missing value.
3) Carry out the analysis of covariance.
4) Compute the estimate of the regression coefficient:
𝐸𝑋𝑌
𝛽𝐸 = (9)
𝐸𝑋𝑋
And multiply by n to estimate the missing value:
𝑋 = 𝑛𝛽𝐸 (10)
www.ijcit.com 338
International Journal of Computer and Information Technology (ISSN: 2279 – 0764)
Volume 05 – Issue 03, May 2016
X = −PR−1 (14)
𝐸𝑋1 𝑋1 𝛽1𝐸 + 𝐸𝑋1 𝑋2 𝛽2𝐸 + ⋯ + 𝐸𝑋1 𝑋𝑚 𝛽𝑚𝐸 = 𝐸𝑋1 𝑌
.
. (11) Where:
. 𝑃, 𝑋 = Vector (1 × 𝑚).
𝐸𝑋𝑚 𝑋1 𝛽1𝐸 + 𝐸𝑋𝑚 𝑋2 𝛽2𝐸 + ⋯ + 𝐸𝑋𝑚 𝑋𝑚 𝛽𝑚𝐸 = 𝐸𝑋𝑚 𝑌 𝑅 = 𝑁𝑜𝑛 − 𝑠𝑖𝑛𝑔𝑢𝑙𝑎𝑟 𝑚𝑎𝑡𝑟𝑖𝑥 𝑚 × 𝑚 .
𝑌𝑖𝑗 . 𝑌.𝑗𝑘 𝑌.𝑗 .
We estimate the missing values by the following formula: 𝑒𝑖𝑗𝑘 = 𝑌𝑖𝑗𝑘 − − + (15)
𝑏 𝑟 𝑏𝑟
1
B. Haseman and Gaylar Method 𝑟𝑘𝑘 = (17)
𝑏𝑟
www.ijcit.com 339
International Journal of Computer and Information Technology (ISSN: 2279 – 0764)
Volume 05 – Issue 03, May 2016
And Akaike information criterion (AIC) is a measure of the FIGURE 1: HISTOGRAM FOR DATA
relative quality of statistical methods for a given set of data,
is calculated as follows: Histogram of y
Normal
Mean 38.74
4
AIC n ln 2 2(k 1) (19)
StDev
N
15.31
24
Where: 3
2
Frequency
: Mean square of error.
2
𝑘 : Number of variables in the model.
𝑛 : Total number of observations.
1
A. Data Description The above chart in figure (1), explained that the data
Data on height (cm) of eucalyptus plants from a field trial
experiment distributed normal distribution, and to test the
under split- plot design with two treatments, three blocks
homogeneity the hypothesis is given by:
given in (Jayaraman). Let A denoted the main- plot factor
(pit size) and B, the sub plot factor (fertilizer treatments), 𝐻0 : 𝑎𝑙𝑙 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒𝑠 𝑎𝑟𝑒 𝑒𝑞𝑢𝑎𝑙
then the resulting data is as follows [8]: (20)
𝐻1 : 𝑎𝑡 𝑙𝑒𝑎𝑠𝑡 𝑜𝑛𝑒 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒 𝑛𝑜𝑡 𝑒𝑞𝑢𝑎𝑙𝑒
Missing values in the experiment are not missing originally,
but I assumed it was missing. The value of Bartlett's test equal to (2.57) with p- value
(0.92), for test the p- value is greater than the value of level
TABLE 2: DATA EXPERIMENT of significant ( 0.05 ), this means cannot reject the null
hypothesis and there is no problem of homogeneity of
Blocks Total
A B I II III
variance.
Carry out the analysis of variance which is given in table (3):
𝑏1 25.38 61.35 37.00 123.73
𝒂𝟏 𝑏2 46.56 66.73 28.00 141.29
𝑏3 66.22 35.70 35.7 137.62 TABLE 3: ANOVA FOR DATA EXPERERIMENT.
𝑏4 30.68 58.96 21.58 111.22
(F.Tab.)
Total 168.84 222.74 122.28 513.86
(F.Cal 𝜶=
(S.O.V.) (d.f.) (S.S.) (M.S.)
) 𝟎. 𝟎𝟓
𝑏1 19.26 55.8 57.6 132.66
𝒂𝟐 𝑏2 19.96 33.96 31.7 85.62 Blocks 2 1938.5 969.25
𝑏3 22.22 58.4 51.96 132.6 A 1 228.35 228.35
𝑏4 16.82 45.6 26.55 88.97 2 1161.34 580.67 0.39 4.75
Error(a)
78.26 193.76 167.83 439.85
Total 3 487.82 162.61 1.01 3.49
𝑩
247.1 416.5 290.11 953.71 3 388.21 129.40 0.81 3.49
𝐴𝐵
12 1928.15 160.68
𝐸𝑟𝑟𝑜𝑟(𝑏)
Before analyzing the data, should be verified from the Total 23 6132.37
distribution of the data. To test the normal property was used
the histogram, as shown in figure (1) as follows.
Three methods have been used to treat the missing
values: Coons, Haseman and Gaylor, and Rubin method. To
make preference among these methods some statistical
measurements have been used, which are absolute error
(AE), mean squares error (MSE) and Akaike information
criterion (AIC).
www.ijcit.com 340
International Journal of Computer and Information Technology (ISSN: 2279 – 0764)
Volume 05 – Issue 03, May 2016
B. Estimating Missing Value TABLE 5: THE ANALYSIS AFTER ESTIMATING MISSING VALUE
www.ijcit.com 341
International Journal of Computer and Information Technology (ISSN: 2279 – 0764)
Volume 05 – Issue 03, May 2016
www.ijcit.com 342
International Journal of Computer and Information Technology (ISSN: 2279 – 0764)
Volume 05 – Issue 03, May 2016
42.84 18.51
Missing values are estimated by equation (12): H&G 33.19 24.41 191.46 136.112
28.93 6.71
𝑋1 53.19
𝑋2 = 45.15 42.87 18.48
Rubin 33.23 24.37 191.36 136.099
𝑋3 41.60 28.97 6.75
𝑋1 −1
6 1 1 319.14
𝑋2 = 1 6 1 270.90 The results of the study of estimating missing values are
𝑋3 1 1 6 249.61 summarized and tabulated in tables (5, 7, and 9)) which
contain the MSE, AE and AIC, we have observed that:
𝑋1 42.84
𝑋2 = 33.19 1. In the case of one missing value was obtained the same
𝑋3 28.93 estimates for missing value.
3. Rubin Method 2. The results of application in cases of two and three missing
By equation (16) and (17), we get values show that the best method for estimating missing
values is Coons method, because it is minimum mean squares
156.01 74.56 447.13 error, minimum absolute mean square error and minimum
𝑝1 = 0 − − + = −26.595
4 3 12
110.23 75.06 360.03 Akiakes information criterion.
𝑝2 = 0 − − + = −22.575 3. Increase the number of missing values leads to increased
4 3 12
difference between estimated values given by different
56.04 110.38 360.03
𝑝3 = 0 − − + = −20.8 methods.
4 3 12
1 1 1 REFERENCES
𝒓𝒌𝒌 = 1 − − + = 0.5
4 3 12
[1] B. N.Ch. Charyulu and T. Dharamyadav, Estimation of Missing
1 Observation in Randomized Block Design, International Journal
𝒓𝒌𝒌 = = 0.083
12 of Technology and Engineering Science, Vol. 1, No. 6,
2013,pp618-621.
𝑝 = (−26.595 −22.575 −20.8) [2] D. B. Rubin, A non- Iterative Algorithm for Least Square
Estimation of Missing Values in any Analysis of Variance Design,
Journal of Applied Statistic, 21, 1972, pp 136-141.
0.5 0.083 0.083 [3] F. Yates, Analysis of Replicated Experiment When the Field
𝑅= 0.5 0.083 Results are incomplete, Journal of Experimental Agriculture, 1, 2,
0.5 1933, Pp. 129-142.
Missing values are estimated by equation (14): [4] G. E. Boyhan, Agricultural Statistical Data Analysis Using Stata,
Taylor and Francis group, Boca Raton, London, New York, 2013.
𝑋1 42.87 [5] I. Coons, The Analysis of Covariance as a Missing Plot
𝑋2 = 33.23 Technique, Journal of Biometrics, Vol.13, No. 3, 1957, pp 387-
𝑋3 28.97 405.
www.ijcit.com 343
International Journal of Computer and Information Technology (ISSN: 2279 – 0764)
Volume 05 – Issue 03, May 2016
www.ijcit.com 344