Exercise 2 - Assignment
Part 2A
Data and Analysis:
Summary Statistics for the given data is tabulated below.
Roll No 1811142
n 142
k 7840
Confidence Coefficient for
91%
µ
Confidence Coefficient for
97.50%
π
Confidence Coefficient for
89%
ø
Parameter Chosen Annual Family Income
Proportion Definition % of people whose family income greater than 2
Population Statistics
µ= 2.81
σ= 4.67
π= 100.00%
σ2= 21.82
p 0.535
No of success 3238
Confidence coefficients
Coeff.
Estimator Probability
Value Z or χ value
X-bar 91 0.955 Z= 1.695398
p 97.5 0.9875 Z= 2.241403
χ2α/2 = 169.9218
S 89 0.945 χ21-α/2
= 116.149
X-bar SE(X-bar) p SE(p) S.D
2.835373 0.237939 0.542254 0.041809 4.620566
3.035754 0.254755 0.56338 0.041621 5.088713 Table for estimators X-bar, S, and pi
2.404803 0.201807 0.492958 0.041955 3.611172
2.555254 0.214432 0.507042 0.041955 3.640362
Roll No 1811142 (even) so we
estimate the expected value of the
2.447563 0.205395 0.43662 0.041621 4.936556
standard deviation (first ten and Last
2.442303 0.204953 0.56338 0.041621 3.065447 ten values are taken)
2.240176 0.187991 0.514085 0.041942 3.136674
2.77107 0.232543 0.535211 0.041855 5.170197
3.577972 0.300257 0.633803 0.040429 6.035988
3.342606 0.280505 0.556338 0.041692 5.597776
3.046613 0.255666 0.549296 0.041755 5.501008
3.212824 0.269614 0.598592 0.041135 5.385541
3.227261 0.270826 0.570423 0.041541 5.971404
2.493993 0.209291 0.528169 0.041892 3.682939
2.460021 0.20644 0.471831 0.041892 5.011198
2.403887 0.20173 0.443662 0.041692 3.924131
3.061669 0.25693 0.549296 0.041755 6.082693
2.840915 0.238404 0.542254 0.041809 4.171057
3.148352 0.264204 0.577465 0.041452 5.212921
3.293148 0.276355 0.598592 0.041135 5.236656
Mean
no of success 5410
Probability 0.690051
Proportion
no of success 7620
Probability 0.971939
Variance
no of success 2197
Probability 0.28023
S.D
no of success 4156
Probability 0.530102
Frequency Frequency
50
0
100
150
200
250
300
50
0
100
150
200
300
350
250
STEP-4(A)
1.714615173 1.64306338
2.151554288 1.820579786
2.588493403 1.998096191
3.025432518 2.175612596
3.462371632 2.353129001
3.899310747 2.530645407
4.336249862 2.708161812
4.773188977 2.885678217
5.210128092 3.063194622
Bin
Bin
5.647067206 3.240711028
6.084006321 3.418227433
6.520945436 3.595743838
6.957884551 3.773260243
Following are the distributions plotted for each parameter:
7.394823665 3.950776649
7.83176278 4.128293054
8.268701895 4.305809459
8.70564101 4.483325864
Distribution of Sample Mean
9.142580125 4.66084227
Frequency
Frequency Frequency
50
0
0
100
150
200
250
300
350
400
100
200
300
400
500
600
2.939905193 0.387323944
7.798387969 0.404529449
12.65687075 0.421734955
17.51535352 0.438940461
22.3738363 0.456145967
27.23231907 0.473351472
32.09080185 0.490556978
36.94928463 0.507762484
41.8077674 0.52496799
Bin
Bin
46.66625018 0.542173496
51.52473296 0.559379001
56.38321573 0.576584507
Frequency
Frequency
From the histograms, we can very well infer that the histograms of the variables follow the Normal
Sample
Summary Statistics Population Value Error %Error
4(B) Value
Mean 2.805511163 2.79691911 0.01 0%
Standard Deviation 4.671305059 4.514063811 0.16 3%
Variance 21.82109095 21.62798163 0.19 1%
Proportion 0.535472135 0.536427494 0.00 0%
From the above table, the difference percentage is not too much, therefore we infer that sample
mean, variance, S.D and proportion are the unbiased estimator of the population parameters i.e.
population mean, variance, S.D and proportion.
STEP-5(B)
We infer the following things:
At 91% of C.I, 69% of the samples contain the population mean within the confidence interval
limits
At 97.5% of C.I, 97.19% of the samples contain the population mean within the confidence
interval limits
At 89% of C.I, 28.02% of the samples contain the population mean within the confidence interval
limits
From the above, we can conclude that the sanctity of confidence co-efficient is maintained.
Part 2B- 1
For working this part, we are taking one row as the sample data set.
HYPOTHESIS OF MEAN
Ho μ=2
Ha μ≠2
Two tailed test
x bar 2.835373239
μ 2.805511163
s 4.620566469
n 142
Standard Deviation 4.620566469
Standard error 0.387749325
α 10%
α/2 5%
Test Statistics 0.077013872
p - value 93.86%
Z< -1.644853627
Z> 1.644853627
x bar< 2.16772028
x bar> 3.443302045
Power Test
True Mean P(type II error) - beta Power = ( 1- beta)
2.85 0.102232517 0.897767483
3.14 0.223124600 0.7768754
Since p-value is greater than alpha we fail to reject the Null Hypothesis at 5% significance level
HYPOTHESIS OF PROPORTION
Ho π >= 0.40
Ha π < 0.40
Left tail test
π 0.542253521
π0 0.4
1-p0 0.6
n 142
α 10%
Test Statistics 3.460202985
p - value 99.97%
Z value -1.281551566
Since p-value is greater than alpha we fail to reject the Null Hypothesis at 5% significance level
HYPOTHESIS OF VARIANCE
Ho σ02 <= 28
Ha σ02 > 28
Right tail test
n-1 141
2
S 21.34963449
σ0 2
28
α 10%
Chi Squared Distribution
Test Statistics 107.5106594
p - value 1.63%
Since p-value is greater than alpha we fail to reject the Null Hypothesis at 5% significance level
Part 2B- 2
Two samples are taken from the population based on the family income.
Sample 1 is taken for those where the family income is greater than 2.
Sample 2 is taken for those where the family income is not greater than 2.
Since p-value is greater than alpha we fail to reject the Null Hypothesis at 5% significance level.
Part 2B- 3
Ambulance Fire
Total time 10484.39 9131.95
Count 936.00 936.00
Avearge time 11.20127137 9.756356838
SD 3.577255601 2.78536801
p 0.172008547 0.241452991
In order to check whether the fire truck arrives at the scene on average more than 1 min faster than
the ambulance
The null hypothesis is rejected if fire truck does arrive more than 1 min later than the ambulance
Ho µ1-µ2 <= 1
Ha µ1-µ2 > 1
S1 (Ambulance) 2.07
S2 (Fire) 1.32
Mean1 - Mean 2 1.44
Reject the null
Pooled std dev 1.74
hypothesis
Sigma p 0.08
Taking confidence interval of 95%
z-value 1.96
z X sigma p 0.16
LL 1.29
UL 1.6
The confidence interval falls between 1.29 and 1.6 which is greater than 1
To check the frequency of arrival within 8 minutes is greater for the fire department
Ho π1 - π2 <=0
Ha π1 - π2 >0
Data
p1 (within 8 mins) 0.19
p2 0.27
Diff in p 0.08 Reject the null hypothesis
Std error 0.02
Taking confidence interval of 95%
z-value 1.96
z * Std error 0.04
LL 0.04
UL 0.12
Since both conditions are satisfied, Training of fire personnel should be done.