A. Reliability
Reliability refers to the consistency or stability of test scores, that is, the consistency
of scores obtained by the same persons when retested with the identical test or with an
equivalent form of the test (Anastasi:27). In other words, a test cannot measure anything
well unless it measures consistently.
For example : (1)
(2)
(3)
(4)
Reliability can also be estimated from a single administration of one form of the test.
In this case we are concerned with inter-item consistency, as determined by the
proportion of persons who pass and the proportion who do not pass each item.
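The single-administration estimate described above matches the Kuder-Richardson approach (formula 20), although the source does not name it. The sketch below assumes that formula is what is meant, that every item is scored 1 (pass) or 0 (fail), and that the population variance of total scores is used. The function name `kr20` and the sample data are illustrative only.

```python
from statistics import pvariance


def kr20(responses):
    """Estimate inter-item consistency from one test administration.

    responses: one list per examinee, each holding 0/1 item scores.
    Assumes the Kuder-Richardson formula 20 with population variance.
    """
    n_people = len(responses)
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("at least two items are needed")

    # Variance of the examinees' total scores.
    totals = [sum(person) for person in responses]
    var_total = pvariance(totals)
    if var_total == 0:
        raise ValueError("total scores do not vary")

    # Sum of p * q over items, where p is the proportion passing
    # the item and q = 1 - p is the proportion not passing it.
    pq_sum = 0.0
    for i in range(n_items):
        p = sum(person[i] for person in responses) / n_people
        pq_sum += p * (1 - p)

    return (n_items / (n_items - 1)) * (1 - pq_sum / var_total)


# Hypothetical data: four examinees, three items.
sample = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(kr20(sample))  # prints 0.75
```

A value closer to 1 suggests that the items rank examinees consistently. With so few examinees and items, the figure here only illustrates the arithmetic.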
Besides knowing the methods used to estimate reliability, it is necessary to know the
factors affecting the reliability of a test. The factors (Heston:155) are as follows:
a) The extent of the sample of material selected for testing. The larger the sample,
the greater the probability that the test as a whole is reliable, since a larger test
allows a wider field to be covered.
b) The administration of the test. Is the same test administered to different groups
under different conditions or at different times?
c) Test instructions. Are the various tasks expected of the testee made clear to all
candidates in the rubrics?
d) Personal factors such as motivation and illness.
e) Scoring the test. This is influenced by the type of test: objective or subjective.
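Factor (a) in the list above, that a larger sample of material tends to give a more reliable test, can be illustrated with the Spearman-Brown formula. The source does not mention this formula. It is offered here only as a sketch, and it assumes that the added items are of the same kind and quality as the existing ones. The function and parameter names are illustrative.

```python
def spearman_brown(reliability, length_factor):
    """Predict reliability after changing test length.

    reliability: reliability estimate of the current test (0 to 1).
    length_factor: new length divided by current length
                   (2 means the test is doubled).
    Assumes added items are comparable to the existing ones.
    """
    if length_factor <= 0:
        raise ValueError("length_factor must be positive")
    return (length_factor * reliability) / (
        1 + (length_factor - 1) * reliability
    )


# Hypothetical case: a test with reliability 0.60 is doubled in length.
print(round(spearman_brown(0.60, 2), 2))
```

Under these assumptions, doubling a test whose reliability is 0.60 gives a predicted reliability of roughly 0.75, which is the sense in which a larger sample of material raises reliability.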
If students are expected to be able to choose the right word based on the context, then the
test should be constructed so that they must choose the right word based on the context. A
test item might, for instance, be as follows:
The students ____ just gone.
a. has
b. have
c. had
Face Validity
We conclude this brief survey of some common varieties of validation with what is
probably the most frequently employed type of all, face validity. Here we mean
simply the way the test looks to the examinees, test administrators, educators, and the
like.
Obviously, this is not validity in the technical sense, and face validity can never be
permitted to take the place of empirical validation or of the kind of authoritative analysis
of content referred to above.
2. Empirical Validity
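Empirical validity is indicated by how strongly scores on the test correlate with a
second, independent criterion measure. The higher the correlation between the two
measures, the stronger the evidence of validity. There are two general kinds of empirical
validity, concurrent validity and predictive validity.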
Concurrent validity
Concurrent validity is a kind of validity related to experience, that is, to results that
are already available. Thus, to determine whether a summative test is valid, the daily test
scores or the scores on a previous summative test can be used as the criterion. This is
carried out by comparing the test with another test (which is what is meant by
experience). The test is considered valid if its results are in line with the set criterion.
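The comparison between a test and its criterion is commonly summarised with a correlation coefficient. The sketch below computes Pearson's r by hand. The choice of Pearson's r, the function name, and the score lists are assumptions made for illustration and do not come from the source.

```python
from math import sqrt


def pearson_r(test_scores, criterion_scores):
    """Pearson correlation between test scores and criterion scores."""
    n = len(test_scores)
    if n != len(criterion_scores) or n < 2:
        raise ValueError("need two equally long lists of at least 2 scores")

    mean_x = sum(test_scores) / n
    mean_y = sum(criterion_scores) / n

    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(test_scores, criterion_scores))
    sxx = sum((x - mean_x) ** 2 for x in test_scores)
    syy = sum((y - mean_y) ** 2 for y in criterion_scores)
    if sxx == 0 or syy == 0:
        raise ValueError("scores must vary in both lists")

    return sxy / sqrt(sxx * syy)


# Hypothetical scores for the same six students.
summative = [78, 65, 90, 55, 82, 70]   # the test being validated
daily_avg = [75, 60, 88, 58, 85, 72]   # the criterion already in hand
print(round(pearson_r(summative, daily_avg), 2))
```

A coefficient near 1 indicates that the new test ranks students much as the criterion does. The same calculation applies to predictive validity when the criterion is a later measure of performance.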
Predictive validity
Predictive validity is the extent to which a test can be used to make
predictions about future performance.
C. Practicality
A third characteristic of a good test is its practicality or usability. Even a valid and
reliable test is of little value if it is not practical and usable. Thus in the preparation
of a new test or the adoption of an existing one, we must keep in mind a number of very
practical considerations:
1. Economy
Economy includes both time and money. Testing can be expensive. We must take
into account the cost per copy and whether or not the test books are reusable. Again,
several administrators and/or scorers may be needed, and the more personnel who must
be involved, the more costly the process becomes. In writing or selecting a test, we
should certainly pay some attention to how long the administering and scoring of it
will take.
2. Ease of administration and scoring
Another consideration of test usability involves the ease with which the test can be
administered. The test administrator can perform his tasks quickly and efficiently if
full, clear directions are provided. Scoring procedures also have a significant effect on
the practicality of a given instrument. We need to know the number of
examinees involved, whether the test must be scored subjectively or is objective in
nature, the answer sheet used, and whether the tests will be scored by machine or by
hand.
3. Ease of interpretation
If a standard test is being adopted, it is important that we examine and take into
account the data which the publisher provides and whether there is an up-to-date test
manual that gives clear information about test reliability and validity and about
norms for appropriate reference groups. However, we need to have some general
guidance as to the meaning of test scores to begin with.
In short, all the things above influence the quality of a good test.
D. Objectivity
As we all know, objective means free of personal elements. In this connection, a test
can be said to be objective and to possess objectivity if it is constructed and
administered in accordance with what actually exists. The content or material of the test
is drawn from the material or lessons that have been given previously and is in line with
the objectives that have been set (Anas Sudijono, 2005: 96). In other words, a test is said
to have objectivity if no subjective factor influences its administration, especially in
the scoring system. In relation to reliability, objectivity places more emphasis on
consistency in the scoring system, whereas reliability places more emphasis on
consistency in the test results.
The factors that affect objectivity are as follows:
a. Form of the test
A test in essay form gives the scorer many opportunities to score in his or her own
way. This shows that using an essay test allows the scorer's subjectivity to enter
the scoring.
b. Scorer
With an essay test, the scorer's subjectivity can enter more freely and influence the
scores awarded. Factors that can influence subjectivity in scoring include the scorer's
impression of the test taker (halo effect), handwriting, language, the time at which the
scoring is carried out, and so on.
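One simple way to see how much scorer subjectivity enters an essay test is to have two scorers mark the same scripts and compare their scores. The sketch below is an illustration only. The source does not prescribe any such check, and the function name, the two measures reported, and the scores are all assumptions.

```python
def scorer_agreement(scores_a, scores_b):
    """Compare two scorers' marks on the same set of essay scripts.

    Returns (exact_agreement_rate, mean_absolute_gap).
    """
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("need two equally long, non-empty score lists")

    gaps = [abs(a - b) for a, b in zip(scores_a, scores_b)]
    exact_rate = sum(1 for g in gaps if g == 0) / len(gaps)
    mean_gap = sum(gaps) / len(gaps)
    return exact_rate, mean_gap


# Hypothetical marks given by two scorers to the same four essays.
scorer_1 = [70, 80, 90, 60]
scorer_2 = [70, 75, 90, 70]
print(scorer_agreement(scorer_1, scorer_2))  # prints (0.5, 3.75)
```

A low agreement rate or a large average gap suggests that scoring depends heavily on who does it, which is the situation the two factors above describe.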
Language Testing
Criteria of a Good Test: Concept and Types
BY: