I.
1. Descriptive Statistics
Descriptive statistics is the term given to the analysis of data that helps describe, show or summarize data
in a meaningful way such that, for example, patterns might emerge from the data. Descriptive statistics do
not, however, allow us to make conclusions beyond the data we have analyzed or reach conclusions
regarding any hypotheses we might have made. They are simply a way to describe our data.
2. Inferential Statistics
Inferential statistics are techniques that allow us to use these samples to make generalizations about the
populations from which the samples were drawn. It is, therefore, important that the sample accurately
represents the population. The process of achieving this is called sampling. Inferential statistics arise out
of the fact that sampling naturally incurs sampling error and thus a sample is not expected to perfectly
represent the population.
II.
1. Mode
Is a statistical term that refers to the most frequently occurring number found in a set of numbers. The
mode is found by collecting and organizing data in order to count the frequency of each result. The result
with the highest number of occurrences is the mode of the set.
Example: in {4, 3, 9, 4, 4, 5, 9, 4, 7} the Mode is 4 (it occurs most often).
2. Median
Is a simple measure of central tendency. To find the median, we arrange the observations in order from
smallest to largest value. If there is an odd number of observations, the median is the middle value. If
there is an even number of observations, the median is the average of the two middle values.
Example: find the Median of {14, 23, 11, 16, 15, 4, 27}.
Put them in order: {4, 11, 14, 15, 16, 23, 27}
The middle number is 15, so the median is 15.
3. Percentile
Is a statistical term describing a division of observations into four defined intervals based upon the values
of the data and how they compare to the entire set of observations.
Example: 8, 8, 3, 4, 5, 6, 6, 7, 3, 1
Put them in order: 1, 3, 3, 4, 5, 6, 6, 7, 8, 8
Q1 = 3
Q2 = 5.5
Q3 = 7
5. Range
6. Interquartile Range
7. Standard Deviation
Is a statistic that measures the dispersion of a dataset relative to its mean and is calculated as the square
root of the variance.
Example: Calculate the sample standard deviation of the length of the crystals.
1. Calculate the mean of the data. Add up all the numbers and divide by the total number of data
points.
(9+2+5+4+12+7+8+11+9+3+7+4+12+5+4+10+9+6+9+4) / 20 = 140/20 = 7
2. Subtract the mean from each data point (or the other way around, if you prefer... you will be
squaring this number, so it does not matter if it is positive or negative).
(9 - 7)2 = (2)2 = 4
(2 - 7)2 = (-5)2 = 25
(5 - 7)2 = (-2)2 = 4
(4 - 7)2 = (-3)2 = 9
(12 - 7)2 = (5)2 = 25
(7 - 7)2 = (0)2 = 0
(8 - 7)2 = (1)2 = 1
(11 - 7)2 = (4)22 = 16
(9 - 7)2 = (2)2 = 4
(3 - 7)2 = (-4)22 = 16
(7 - 7)2 = (0)2 = 0
(4 - 7)2 = (-3)2 = 9
(12 - 7)2 = (5)2 = 25
(5 - 7)2 = (-2)2 = 4
(4 - 7)2 = (-3)2 = 9
(10 - 7)2 = (3)2 = 9
(9 - 7)2 = (2)2 = 4
(6 - 7)2 = (-1)2 = 1
(9 - 7)2 = (2)2 = 4
(4 - 7)2 = (-3)22 = 9
4. The population standard deviation is the square root of the variance. Use a calculator to obtain this
number.
(9.368)1/2 = 3.061
III. Sources
https://statistics.laerd.com/statistical-guides/descriptive-inferential-statistics.php
https://www.investopedia.com/terms/m/mode.asp
https://stattrek.com/statistics/dictionary.aspx?definition=median
https://www.mathsisfun.com/
https://www.mathsisfun.com/definitions/range-statistics-.html
https://stattrek.com/statistics/dictionary.aspx?definition=interquartile%20range
https://www.thoughtco.com/sample-standard-deviation-problem-609528