Lecture 8
Joy Robbins
TODAY’S SESSION
2
ASSESSMENT:
INTRODUCTORY DISCUSSION
What’s the point of assessing writing?
How have your teachers at school and university
assessed your writing in your 1st and 2nd languages?
Do you think there was any point in assessing you?
Why (not)?
In what ways have the scores and grades you have
received on your writing (in L1 and L2) helped you
improve your writing?
If you are an experienced language teacher, what do
you feel are your greatest challenges in evaluating
student writing? If you aren’t an experienced
teacher, what makes you nervous about assessing
student writing? Why?
3
5
RELIABILITY
‘reliability refers to the consistency with which a
sample of student writing is assigned the same rank or
score after multiple ratings by trained evaluators’
(Ferris & Hedgcock 1998: 230)
For example:
if we’re marking an essay out of 20, the test will be far
more reliable if 2 markers both award an essay the
same grade (or more or less the same grade), say 16 or
17. However, if 1 marker awards 10 and the other
awards 15, the test isn’t reliable.
7
FACE VALIDITY
Face validity refers to how acceptable and
credible a test is to its users (Alderson et al 1995)
9
2 APPROACHES TO SCORING WRITING
There are 2 main ways of scoring writing tests,
the holistic approach and the analytic
approach
10
HOLISTIC SCORING
Holistic scoring means that the assessor assesses
the text generally, rather than focusing on 2 or 3
specific aspects
The idea is that the assessor quickly reads
through a text, gets a global impression, and
awards a grade accordingly
The holistic approach is supposed to respond to
the writing positively, rather than negatively
focusing on the things the writer has failed to do
12
HOLISTIC SCORING: ADVANTAGES
Quick and easy, because there are few categories
for the teacher to choose from
13
HOLISTIC SCORING: DISADVANTAGES
Holistic scoring can’t provide the writing teacher
with diagnostic information about students’
writing, because it doesn’t focus on tangible aspects
of writing (e.g. organization, grammar, etc.)
16
ANALYTIC WRITING ASSESSMENT: AN
EXAMPLE
17
ANALYTIC SCORING: ADVANTAGES
Analytic schemes provide learners with much more
meaningful feedback than holistic schemes. Teachers
can hand students’ essays back with the criteria (e.g.
marks out of 10 for organization, spelling, etc.) circled
which the writing was awarded
Analytic schemes can be designed to reflect the
priorities of the writing course. So, for instance, if you
have stressed the value of good organization on your
course, you can weight the analytic criteria so that
organization is worth 60% of the marks
Because assessors are assessing specific criteria, it’s
easier to train them than assessors who are using
holistic schemes (Cohen 1994; McNamara 1996;
Omaggio Hadley 1993; Weir 1990)
Analytic assessment is more dependable than holistic 18
assessment (Jonsson & Svingby, 2007: 135)
ANALYTIC SCORING: DISADVANTAGES
23
MULTIPLE TRAIT SCORING:
ADVANTAGES
24
MULTIPLE TRAIT SCORING:
DISADVANTAGES
Can be extremely time consuming to design
specific assessment criteria for each type of
writing (Perkins 1983)
26
APPLICATION AND DISCUSSION (CONTD.)
In a pairs or groups, compare your score
and comments with those of your
colleagues.
On what points did you agree or disagree?
Why?
If you disagreed, try to arrive at a consensus
evaluation of the essay.
After identifying the sources of your
agreement and disagreement, formulate a
list of future suggestions for using holistic
scoring rubrics. (Ferris & Hedgcock 1998:
261)
27
REFERENCES
Alderson JC et al (1995) Language Test Construction and Evaluation.
Cambridge: Cambridge University Press.
Brindley G (2001) Assessment. In R. Carter & D. Nunan (eds.), The
Cambridge Guide to Teaching English to Speakers of Other Languages.
Cambridge: Cambridge University Press, pp.137-143.
Cohen A (1994) Assessing Language Ability in the Classroom (2nd ed.). Boston:
Heinle & Heinle.
Ferris D & Hedgcock JS (1998) Teaching ESL Composition: Purpose,
Process, and Practice. Mahwah: Lawrence Erlbaum.
Jonsson, A., & Svingby, G. (2007). The use of scoring rubrics: Reliability,
validity and educational consequences. Educational Research Review, 2(2),
130-144.
Lumley T & McNamara T (1995) Rater characteristics and rater bias:
implications for training. Language Testing 12: 54-71.
McNamara T (1996) Measuring Second Language Performance. London:
Longman.
Omaggio Hadley A (1994) Teaching Languages in Context (2nd ed.). Boston:
Heinle & Heinle.
Perkins K (1983) On the use of composition scoring techniques, objective
measures, and objective tests to evaluate ESL writing ability. TESOL
Quarterly 17: 651-671.
28
Weir CJ (1990) Communicative Language Testing. New York: Prentice Hall.
THIS WEEK’S READING
Chapters 5 and 6 of:
Ferris D & Hedgcock JS (2005) Teaching
ESL Composition: Purpose, Process, and
Practice. Mahwah: Lawrence Erlbaum.
Min H-T (2005) Training students to
become successful peer reviewers. System
33: 293-308.
29
HOMEWORK TASK
Use the analytic scoring scale to grade the pre-
sessional piece of writing you graded
holistically earlier today…