RESEARCH INSTRUMENT
JEFFERSON S. VALDEZ
MAEd Educ.Mgt.
VALIDITY
the degree to which a test or measuring
instrument measures what it intends to measure.
it has to do with soundness, (what the test or
questionnaire measure its effectiveness)
degree to which a test measures what it really
purports to measure
TYPES OF VALIDITY
Content validity
Concurrent Validity
Criterion related Validity
Construct Validity
Predictive Validity
1. CONTENT VALIDITY
the degree to which an instrument measures an
intended content area.
the degree to which the test represents the
essence, topics, and the test is designed to
measure.
reported in terms of non-numerical data unlike
the other types of validity.
Development of a Table of
Specifications (TOS)
A detailed T0S includes areas or concepts,
objectives, number of items and percentage or
proportion of items in each area.
It is advisable to make a 50 to 100% allowance in
the construction of items.
Item Writing
At this stage, the creative talent of the writer
should decide what type of items are supposed to
construct: the type of instrument, scoring
techniques.etc.
The quality of the test items, therefore, depends
on the considerable extent upon the researchers
ability to produce ideas and translate into items
and satisfy the TOS.
2.CONCURRENT VALIDITY
The degree to which the test agrees or correlates with a
criterion set up as an acceptable measure. The criterion
is always available at the time of testing.
Correspondence of one measure of a phenomenon with
another of the same construct (administered at the same
time)
Two tools are used to measure the same concept and
then a correlation analysis is performed. The tool which
is already demonstrated to be valid is the gold standard
with which the other measure correlate.
Example:
A researcher wishes to validate a Biology
achievement test he has constructed. He
administers this test to a group of Biology
students. The result of this test is correlated an
acceptable Biology test which has been
previously proven as valid. If the correlation is
high, the Biology test he has constructed is
valid.
Example:
If the criterion set for professionalism is in
nursing is belonging to nursing organizations
and reading nursing journals, then couldnt we
count memberships and subscriptions to come
up with a professionalism score?
4. PREDICTIVE VALIDITY
It is determined by showing how well
predictions made from the test are confirmed by
evidence gather at some subsequent time.
The criterion measure against this type of
validity is important because the outcome of the
subjects is predicted.
The ability of one measure to predict another
future measure of the same concept.
Example:
The researcher wants to estimate how well a
student maybe able to do in graduate school
courses on the basis of how well he has done on
test he took in undergraduate courses.
The criterion measure against which the test scores are
validated and obtained are available after a long period
of interval.
Example:
If IQ predicts SAT and SAT predicts QPA, then
shouldnt IQ predict QPA (we could skip SATs
for admission decisions.
If scores on parenthood readiness scale indicate
levels of integrity, trust, intimacy and identity
couldnt this test be used to predict successful
achievement of the developmental tasks of
adulthood?
5. CONSTRUCT VALIDITY
Sometimes called as concept validity
It is the extent to which a test measures the
theoretical construct or trait. This involves such
as those of understanding, appreciation and
interpretation of data.
Example:
A researcher wishes to establish the validity
of an IQ (Intelligence Quotient) using Weschler
Adult Intelligence Scale (WAIS). He
hypothesizes that students with high IQ also
have high achievement and those with low IQ,
low achievement. He therefore administers both
WAIS and achievement tests to two groups of
students with high and low IQ respectively. If
the results show the same with the hypothesis,
the test is valid.
RELIABILITY
test is dependable, self consistent and stable
It is concerned with the consistency of the
responses from moment to moment.
The instrument yields the same results over
repeated measures and subjects
Test-retest method
Parallel-forms method
Split-half method
Internal consistency method
Formula:
rs
1 6ED2
------------------
N3 N
Where: rs = Spearman Rho
ED2 = sum of the squared differences between ranks
N = total number of cases
Interpretation of Correlation
Coefficient Value
Correlation value (r)
0.00
- 0.20
Interpretation
Negligible
0.21 - 0.40
Low / slight
0.41 - 0.70
Marked /moderate
0.71 - 0.90
High
0.91 - 0.99
Very High
1.00
Perfect
Example:
The item, Convert 7,000 grams to kilograms in
Form A is parallel to Convert 7 kilograms to
grams in Form B. Moreover, these forms should
have approximately the same average and
variability of scores.
Form A : I am able to tell my partner how I feel.
Form B: My partner tries to understand my
feelings.
Assessment of Depression
Version A : During the past 4 weeks, I have felt
downhearted:
1- everyday, 2 some days, 3- never
Version B: During the past 4 weeks, I have felt
downhearted:
1- never, 2 some days, 3- everyday
Assessment of loneliness
Version A: How often the past months have you felt alone in the
world?
1- everyday
2- some days
3- occasionally
4- never
Version B: During the past 4 weeks, how often have you felt a
sense of lineliness? 1- all of the time
2- sometimes
3- from time to time
4- never
Equivalent or Non-equivalent
rewording?
Version 1: When your boss blames you for something you
did not do, how often do you stick up for yourself?
1 always
2- sometimes
3- never
Version 2: When presented with difficult professional
situations where a superior censures you an act for
which you are not responsible, how frequently do you
respond in assertive way?
1 always
2- sometimes
3- never
Formula:
2 (rht)
rwt
= -----------1 + rht
Where :
Interpretation of Correlation
Coefficient Value
Correlation value (r)
0.00
- 0.20
Interpretation
Negligible
0.21 - 0.40
Low / slight
0.41 - 0.70
Marked /moderate
0.71 - 0.90
High
0.91 - 0.99
Very High
1.00
Perfect
Formula:
N
SD2 - Epiqi
rxx = ----------------------N -1
SD2
Where:
SD2
= number of items
= the variance on the scores on test defined as E (X Xm);
----------------
N -1
p iq i
= product of the proportion passing or failing in item i
pi= proportion of individuals passing
qi = proportion of individuals failing (q i = 1- pi)
Interpretation of Correlation
Coefficient Value
Correlation value (r)
0.00
- 0.20
Interpretation
Negligible
0.21 - 0.40
Low / slight
0.41 - 0.70
Marked /moderate
0.71 - 0.90
High
0.91 - 0.99
Very High
1.00
Perfect