Graphical display of the frequency table can also be achieved through a frequency polygon. To create
a frequency polygon the intervals are labeled on the X-axis and the Y axis represents the height of a
point in the middle of the interval. The points are then joined are connected to the X-axis and thus a
polygon is formed. So, frequency polygon is a graph that is obtained by connecting the middle points
of the intervals. We can create a frequency polygon from a histogram also. If the middle top points of
the bars of the histogram are joined, a frequency polygon is formed.
Frequency polygon and histogram fulfills the same purpose. However, the former one is useful in
comparison of different datasets. In addition to that frequency polygon can be used to display
cumulative frequency distributions.
The following table is the frequency table of the marks obtained by 50 students in the pre-test
examination.
Table 1. Frequency Distribution of the marks obtained by 50 students in the pre-test examination.
Cumulative
Class
frequency
Frequency
Boundaries
(Less than
type)
30.5-40.5
40.5-50.5
14
20
50.5-60.5
20
40
60.5-70.5
47
70.5-80.5
Total
50
50
The labels of the X-axis are the midpoints of the class intervals. So the first label on the X-axis will be
35.5, next 45.5, followed by 55.5, 65.5 and lastly 75.5. The corresponding frequencies are then
considered to create the frequency polygon. The shape of the distribution can be determined from the
created frequency polygon. The frequency polygon is shown in the following figure.
Fig 1: Frequency polygon of the distribution of the marks obtained by 50 students in the pre-test
examination.
From the above figure we can observe that the curve is asymmetric and is right skewed.
Fig2: Cumulative Frequency polygon of the marks obtained by 50 students in the pre-test
examination.
The following is an example of dice where the distribution of observed frequencies and the distribution
of expected frequencies are compared for different scores of two dice. The frequency curves of the two
distributions are used for comparison.
The observed curve overlaps expected curve. The expected curve is smooth while the observed curve
is not smooth.
Also cumulative frequency polygon can also be plotted in the same graph. The following figure shows
such plot. The marks of two papers are compared throughcumulative frequency polygon.
Statistical help and online statistics help provided by us will thus help you to learn the proper use and
various aspects of statistics.
The term population will vary widely with its application. Examples could be any of
the following proper subsets: animals; primates; human beings; homo sapiens; U.S.
citizens; who are attending Andrews University, as graduate students, in the School of
Education, as Masters students, female, last name starting with S, who web registered.
Sample: A sample is a portion of a population selected for further analysis.
How samples are obtained or types of sampling will be studied in lesson 7. Most any
of the examples above for population could serve as a sample for the next higher level
data set.
Parameter: A parameter is a characteristic of the whole population.
Statistic: A statistic is a characteristic of a sample, presumably measurable.
This course will avoid complex models utilizing complicated mathematics. You will
need to be familiar with, however, the fundamental arithmetic operations, elementary
algebra, and some basic symbolism.
An interesting subset of the natural numbers generated by addition are called
Triangular Numbers. These are so called because these are the total number of dots, if
we arrange the dots in a triangle with one additional dot in each layer.
In general we write:
where
mathematicians use the capital Greek letter (sigma) to represent summation. Your
teacher has a particular fondness for this symbol since the first computer he had much
access to had that nickname.
There are three important rules for using the summation operator:
1. Since multiplication distributes over addition, the sum of a constant times a set
of numbers is the same as the constant times the sum of the set of numbers.
Example: Cx1 + Cx2 + ... + Cxn = C(x1 + x2 + ...+ xn)
2. The sum of a series of constants is the same as N times the constant,
where N represents how many constants there are.
Example: 4 + 4 + 4 + 4 + 4 = 5 4 = 20.
3. Since addition is commutative, the total sum of two or more scores for several
individuals can be achieved either by summing the scores separately and then
combining them or by summing an individual's scores and then combining
them.
Example: Joe got scores of 500 and 550 for his verbal and quantitative SAT
scores whereas Jim got scores of 520 and 510, respectively. 500 + 550 + 520 +
510 = 1050 + 1030 = 500 + 520 + 550 + 510 = 1020 + 1060 = 2090.
In addition to the operations of addition, subtraction, multiplication, and division,
several other arithmetic operators often appear. Exponentiation and absolute
value are two such. Also, various symbols of inclusion (parentheses, brackets, braces,
vincula) are used.
Exponentiation is a general term which includes squaring (12 2=144), cubing (63=216),
and square roots (16= (16)=4. When the square root symbol (surd and symbol of
inclusion, in recent history a vinculum, but historically parentheses) is used, we
general (although not quite always) mean only the positive square root.
The absolute value operator indicates the distance (always non-negative) a number is
from the origin (zero). The symbol used is a vertical line on either side of the operand.
Thus, if x>0, then |x|=x, if x<0, then |x|=-x, and ifx=0, |x|=0. (x2)=|x|.
There is a proscribed order for arithmetic operations to be performed.
Example: If we write 4 5 + 3 it is conventional to multiply the 4 and 5 together
before adding the 3 and thus obtain 23. Some calculators are algebraic and handle this
appropriately, others do not.
Parentheses and other symbols of inclusion are used to modify the normal order of
operations. We say these symbols of inclusion have the highest priority or precidence.
Exponentiation is done next. There is confusion when exponents are stacked which
we will not deal with here except to say computer scientists tend to do it from left to
right while mathematicians know that is wrong.
Multiplication and Division are done next, in order, from left to right.
Addition and Subtraction are done next, in order, from left to right.
A mnemonic such as Please Eat Miss Daisy's Apple Sauce can be useful for
remembering the proper order of operation.
Unauthorized personnel are giving wrong information that the public will take
as truth. A possibility is a company sponsoring a statistics research to prove that
their company is better.
Visual aids may be correct, but emphasize different aspects. Specific examples
include graphs which don't start at zero thus exaggerating small differences and
charts which misuse area to represent proportions. Often a chart will use a
symbol which is both twice as long and twice as high to represent something
twice as much. The area, in this case however, is four times as much!
Precise statisitics or parameters may incorrectly convey a sense of high
accuracy.
Misleading or unclear percentages are often used.
Statistics are often abused. Many examples could be added, (even books have been
written) but it will be more instructive and fun to find them on your own.
Types of Data
A dictionary defines data as facts or figures from which conclusions may be drawn.
Thus, technically, it is a collective, or plural noun. Some recent dictionaries
acknowledge popular usage of the word data with a singular verb. However we intend
to adhere to the traditional "English" teacher mentality in our grammar usagesorry
if "data are" just doesn't sound quite right! (My mother and step-mother were both
English teachers, so clearly no offense is intended above.) Datum is the singular form
of the noun data. Data can be classified as either numeric or nonnumeric. Specific
terms are used as follows:
1.
Qualitative data are nonnumeric.
2. {Poor, Fair, Good, Better, Best}, colors (ignoring any physical causes), and
types of material {straw, sticks, bricks} are examples of qualitative data.
3. Qualitative data are often termed catagorical data. Some books use the
terms individual and variable to reference the objects and characteristics
described by a set of data. They also stress the importance of exact definitions
of these variables, including what units they are recorded in. The reason the
data were collected is also important.
4.
Quantitative data are numeric.
Discrete data are numeric data that have a finite number of possible values.
Continuous data have infinite possibilities: 1.4, 1.41, 1.414, 1.4142, 1.141421...
3.
Ordinal: Ordinal data have order, but the interval between measurements is not
meaningful.
5.
Interval: Interval data have meaningful intervals between measurements, but there is no
true starting point (zero).
7.
Ratio: Ratio data have the highest level of measurement. Ratios between measurements as
well as intervals are meaningful because there is a starting point (zero).
Nominal comes from the Latin root nomen meaning name. Nomenclature,
nominative, and nominee are related words. Gender is nominal. (Gender is something
you are born with, whereas sex is something you should get a license for.)
Example 1: Colors
To most people, the colors: black, brown, red, orange, yellow, green, blue, violet, gray,
and white are just names of colors.
To an electronics student familiar with color-coded resistors, this data is in ascending
order and thus represents at least ordinal data.
To a physicist, the colors: red, orange, yellow, green, blue, and violet correspond to
specific wavelengths of light and would be an example of ratio data.
Example 2: Temperatures
What level of measurement a temperature is depends on which temperature scale is
used.