PENYELIDIKAN PENDIDIKAN
ALAT UKUR PENYELIDIKAN
Minggu 11
OBJEKTIF
Memahami maksud alat pengukuran dalam
penyelidikan.
Menggunakan
alat
pengukuran
dalam
penyelidikan pendidikan.
1.
Skala/Inventori
2.
Soal Selidik
3.
Lain-lain
4.
Ujian
Satu
set
rangsangan
(soalan
atau
kenyataan) untuk mendapatkan respons
subjek berasaskan pemarkahan.
Bagi mengukur penguasaan pelajar dalam
sesuatu mata pelajaran, bidang, kemahiran,
kecerdasan atau bakat.
Tahap penguasaan pelajar dikuantifikasikan
dengan memberi markah kepada jawapan
yang diberi.
4
Ujian
Jenis-jenis Ujian
Ujian Pencapaian.
Ujian Kecerdasan (mental, emosi &
rohani).
Ujian Prestasi.
Ujian Kecekapan.
Ujian Bakat.
Ujian Personaliti, Sikap dan Minat.
Ujian Pencapaian
Ujian Kecerdasan
Ujian Prestasi
Ujian Kecekapan
11
Ujian Bakat
Digunakan untuk membuat telahan
tentang kebolehan pelajar belajar pada
masa hadapan
Bakat merujuk kepada apa yang
pelajar boleh belajar
Contoh: Ujian Bakat Bermain Bola
Sepak, & Ujian Bakat Seni
12
Ujian Personaliti
14
15
Soal Selidik
16
Soal Selidik
3.
4.
5.
6.
7.
Temu Bual
20
Temubual
Kekuatan:
+ kerjasama, interaksi secara langsung dan aktif.
+ Mudah suai, mengikut situasi subjek.
+ Hubungan lebih erat, jawapan lebih tepat.
Kelemahan:
-
21
Pemerhatian
22
Pemerhatian
Jenis Pemerhatian
Pemerhatian turut serta
1.
24
2.
Pembinaan Instrumen
26
1.
2.
3.
4.
5.
6.
7.
8.
9.
27
Soal Selidik
Digunakan bagi mendapatkan maklumat.
Ciri-ciri soal selidik yang baik:
2.
3.
4.
5.
Sub-Indeks (Pemikiran
kreatif)
Keaslian
2.
Kelancaran
3.
Penghuraian
Jumlah item
20
Bil.
Item
7
30
KESAHAN (VALIDITY)
31
Based on Internal
Structure
Kesahan Gagasan
Based on
Relations to Other
Variables
Based on content
Kesahan Kriteria
Criterion-referenced (scores are a
predictor of an outcome or criterion
they are expected to predict)
Concurrent
Evidence
Construct
(determination of the
significance, meaning,
purpose, and use of the
scores)
Predictive
Evidence
Kesahan Kandungan
Content (representative of
all possible questions that
could be asked)
Content validation is usually carried
out by experts
Evidence Based on
KesahanContent
Kandungan
(Content Validity)
Content
Content validity
validity
is concerned with a tests ability to include or
represent all of the content of a particular construct.
The question 1 + 1 = ___ may be a valid basic
addition question. Would it represent all of the
content that makes up the study of mathematics? It
may be included on a scale of intelligence, but does it
represent all of intelligence? The answer to these
questions is obviously no. To develop a valid test of
intelligence, not only must there be questions on
math, but also questions on verbal reasoning,
analytical ability, and every other aspect of the
construct we call intelligence. There is no easy way to
determine content validity aside from expert opinion.
Content
Content validity
validity
1.
2.
3.
Content
Content validity
validity
Sebelum sesuatu instrumen itu dikatakan
mempunyai kesahan kandungan, lima syarat ini
perlu dipenuhi:
1.Bidang
Internal Structure
Concurrent
Concurrent Validity
Validity
Concurrent Validity refers to a measurement devices ability to vary
directly with a measure of the same construct or indirectly with a
measure of an opposite construct . It allows you to show that your
test is valid by comparing it with an already valid test. Administering
the focal test and criterion test at approximately the same point in time
(i.e., concurrently) and then correlating the two set of scores. If the two
sets of scores highly correlated, you have concurrent evidence.
e.g.
A new test of adult intelligence, for example, would have concurrent
validity if it had a high positive correlation with the Wechsler Adult
Intelligence Scale since the Wechsler is an accepted measure of the
construct we call intelligence. An obvious concern relates to the validity
of the test against which you are comparing your test. Some
assumptions must be made because there are many who argue the
Wechsler scales, for example, are not good measures of intelligence.
Predictive
Predictive Validity
Validity
Obtain predictive evidence of validity by measuring your
participants at one point in time on your test and then, at a future
time, measuring them on the criterion measure.
Take more time and effort than concurrent evidence, but it can
provide superior evidence that your test does what you want it to
do.
In order for a test to be a valid screening device for some
future behavior, it must have predictive validity. The SAT is
used by college screening committees as one way to predict
college grades. The GMAT is used to predict success in
business school. And the LSAT is used as a means to predict
law school performance. The main concern with these, and
many other predictive measures is predictive validity because
without it, they would be worthless
Kebolehpercayaan
Test-retest
Reliability
Internal
Consistency
Reliability
Equivalent
Forms
Reliability
Test-Retest
Reliability
Merujuk kepada ketekalan atau stabiliti markah
ujian jika dilakukan pada masa yang berbeza.
Contoh:
Ujian diberikan kepada 100 individu untuk satu masa dan diulangi
pada masa berlainan. Dua set markah ini dikorelasikan. Sekiranya
individu memperoleh markah tertinggi dalam ujian 1 juga
memperolehi markah tertinggi dalam ujian 2, begitu juga individu
yang mendapat markah terendah dalam ujian 1 juga mendapat
markah terendah dalam ujian, maka dikatakan mempunyai korelasi
yang tinggi. Oleh itu soalan ujian tersebut mempunyai
kebolehpercayaan yang tinggi.
Equivalent Forms
Reliability
Internal Consistency
Reliability
Split-half reliability
Coefficient alpha
Lee Cronbach 1951) developed coefficient alpha.. Alpha
Cronbach
Coefficient alpha tells you the degree to which the items
are interrelated.
Rule of thumb:
At a minimum, greater than or equal to .07 for research
purposes and somewhat greater than that value (e.g. .
09) for clinical testing purposes.
Sekiranya pekali
kebolehpercayaan
rendah???
Meningkatkan
Kebolehpercayaan
Garis panduan
Untuk kajian ujian rintis
baharu,
lakukan dua
kali ujian rintis.
Train
researchers to
collect
observational
data
Develop
standard written
procedures for
administering an
instrument
Obtain
permission to
collect and
use public
documents
Procedures for
Administering the Data
Collection
Respect individuals and sites
during data gathering (ethics)
lp
a
du
i
v
i
Ind
ip
c
i
rt
n ts
Institutional or
organizational
(e.g., school
district)
Si
te
-s
se pec
co if
sc n ic
ho da (e .
ol ry g.,
)
Parents of
participants who are
not considered adults
Kesahan
Kesahan Gagasan
Kesahan Kandungan
Kesahan Kriteria
61
Kesahan Kandungan
Kesahan Gagasan
Kesahan Kriteria
KEBOLEH PERCAYAAN
(RELIABILITY)
65
Kebolehpercayaan
Cara menentukan
kebolehpercayaan alat ukur
Indeks Kebolehpercayaan
Merupakan
pekali
yang
menunjukkan
ketekalan sesuatu alat ukur.
Ditentukan
dengan menggunakan pekali
korelasi antara dua set respons (markah) yang
diperolehi hasil daripada gerak balas subjek
terhadap alat ukur.
Kaedah menguji kebolehpercayaan:
1. Kaedah Bahagi Dua (Split-half)
2. Kaedah Uji dan Uji Kembali (Test-retest)
3. Kaedah Bentuk Setara (Equavalent Form)
67
69
Pekali
alpha
berasaskan
kepada sisihan piawai alat ukur
dan sisihan piawai butir yang
berlainan
n S S
n 1
S
2
70
71
3.
Kaedah
Bentuk
(Equavalent Forms)
Setara
Meningkatkan
Kebolehpercayaan
Ujian Rintis
74
75
2.
3.
Pengumpulan Data
77
SEKIAN
TERIMA KASIH
78