Learning
5
LEARNING OUTCOMES
By the end of this topic, you should be able to:
1.
Define learning;
2.
3.
4.
5.
X INTRODUCTION
From the day we are born till the day we die, we never stop learning. First, we
learn how to crawl, and then walk. Hopefully by the time we die, we should
have learnt how to live and love.
In this topic, we will explore several kinds of learning. One type is learning to
associate one event with another. Learning involves the formation of concepts,
theories, ideas and other mental abstractions.
5.1
WHAT IS LEARNING?
As children and even as adults, we learn many things by just observing and
imitating people. We also learn through association. Learning an association
between a stimulus (e.g. expensive toys) and a response (e.g. happiness) is called
conditioning.
There are many ways in which we have been conditioned without us even being
aware of it. Who conditions us? Our parents, teachers, the media and society. We
have been conditioned by the media to consume just like how dogs are trained to
TOPIC 5 LEARNING W
91
sit and shake hands. We have been conditioned to associate material things
with success and happiness. Some of us realise that we have been conditioned
and work on de-conditioning ourselves as we learn what works best for us.
How do you define learning? Learning takes place whenever experience or
practice results in a relatively permanent change in behaviour or behavioural
potential (Gerrig and Zimbardo, 2008).
Learning might consist of changes in immediate behaviour. For example, you
may change your brush stroke techniques immediately after your art teacher
suggests that you try something new. Learning can also consist of changes that
can potentially be initiated but do not occur because the opportunity never
arises. For example, you may learn how to hold a golf club the proper way by
watching a golfer on television but the opportunity to do so does not arise.
Although learning is relatively permanent, learnt behaviour is not necessarily
permanent, and can possibly be unlearnt (Huffman, Vernoy and Vernoy, 1997).
Learning is a result of practice or experience we can learn by actively repeating
behaviour or simply from passively experiencing it (observing behaviour).
ACTIVITY 5.1
1.
2.
3.
5.1.1
Classical Conditioning
92
TOPIC 5 LEARNING
had learnt that the bell was a signal that they were going to be fed. If you have
any pets at home, you would have noticed that as well.
There are four basic elements in classical conditioning, as shown in Figure 5.1:
(a) The unconditioned stimulus (UCS);
(b) The unconditioned response (UCR);
(c) The conditioned stimulus (CS); and
(d) The conditioned response (CR).
The unconditioned stimulus (UCS) is an event that automatically triggers a reflex
response, which is the unconditioned response (UCR). In Pavlovs experiment,
food in the mouth was the US, and salivation was the UR (Weiten, 2001).
The conditioned stimulus (CS) is an event that is repeatedly paired with the US.
In Pavlovs experiment, the ringing of the bell was the CS. The learnt reaction
salivating at the sound of the bell is the conditioned response (CR).
TOPIC 5 LEARNING W
93
Let us look at the basic principles of classical conditioning as shown in Table 5.1.
Table 5.1: Basic Principles of Classical Conditioning
Basic Principles of Classical
Conditioning
Description
Stimulus generalisation
Stimulus discrimination
Extinction
Spontaneous recovery
Higher-order conditioning
(b)
Spacing of Pairings
If the CS and UCS pairings follow each other too closely or too far apart,
learning the association is slower. If the spacing is moderate, learning
occurs faster.
94
TOPIC 5 LEARNING
SELF-CHECK 5.1
1.
What are the two steps that should be taken in order to make it
easier to establish a classically conditioned response?
2.
3.
5.1.2
Operant Conditioning
While Pavlov was busy with his dogs, another psychologist by the name of
Edward Lee Thorndike was studying how cats learn. Thorndike confined a
hungry cat in a simple wooden cage he called the puzzle box, and placed some
food right outside the box where the cat could still see and smell it (Refer to
Figure 5.3). To get the food, the cat had to figure out how to open the latch on the
box door, a process Thorndike timed. Initially, it took the cat some time before it
could figure out how to open the latch. However, in each subsequent trial, it took
less and less time. Eventually, the cat could escape from the box in almost no
time at all. Thorndike was a pioneer of this type of learning, which involves
responding in a certain way because of the consequences it brings. This form of
learning is known as operant conditioning. Thorndikes law of effect summarises
the influences of consequences. Law of effect refers to Thorndikes theory that
behaviour which is consistently rewarded will be recorded as learnt behaviour,
and behaviours which have unpleasant outcomes will be removed.
TOPIC 5 LEARNING W
95
There are three basic principles of operant conditioning, as shown in Figure 5.4.
Reinforcement
Reinforcement refers to delivering a stimulus immediately or shortly after a
response, which results in increasing the chances of that response occurring
again in the future. Reinforcement is divided into four categories:
(i)
Primary Reinforcers
Primary reinforcers are also sometimes known as unconditioned
reinforcers because they are naturally occurring reinforcers that have
developed in response to our basic needs. Primary reinforcers do not
96
TOPIC 5 LEARNING
Secondary Reinforcers
On the other hand, secondary reinforcers must be paired with a
stimulus that functions as a reinforcer. For example, if money is
paired with the sound of water running, running water then becomes
the secondary reinforcer.
Punishment
Punishment refers to any event which decreases the probability of
behaviour occurring again. Punishment is different from negative
reinforcement. Negative reinforcement strengthens behaviour by removing
something unpleasant from the environment. In contrast, punishment adds
something unpleasant to the environment to weaken behaviour. For
example, if you were to receive a heavy fine for littering, the behaviour of
littering would be weakened the next time. However, as we can observe
from real life, punishment does not always work. Why? In order for
punishment to be effective, it has to be swift meaning punishment should
not be delayed and it should be put into action immediately after the
unwanted behaviour has taken place.
Punishment should also be consistent. If you punish your child for hitting
his sister, but not for hitting the dog, then he is most likely to do it again. In
some cases, even the mere threat of punishment is enough to remove
unwanted behaviour. This is called avoidance training. How effective is
punishment? Punishment does not encourage people to unlearn behaviours
or teach them how to behave better. Rather, if the threat of punishment is
removed, the negative behaviour is likely to occur again. Furthermore,
punishment also induces negative feelings which can hinder the learning of
new, more desirable behaviours. For example, if a child is punished for
TOPIC 5 LEARNING W
97
giving the wrong answer, the child may become very frustrated and unsure
of himself. Besides that, punishment makes people angry, and angry people
tend to be more aggressive.
(c)
Schedules of Reinforcement
Schedules of reinforcement refer to the rule in operant conditioning for
determining when and how often reinforcers will be delivered. There are
two types of schedules of reinforcement (Refer to Figure 5.4).
(i)
Continuous Reinforcement
Continuous reinforcement results in behaviour that will persist for a
shorter time compared to behaviour learnt by partial reinforcement.
When people only receive reinforcements occasionally, they learn not
to expect reinforcement after every response; instead they respond
continuously hoping that eventually they will receive the desired
outcome (Morris & Maisto, 2001). Vending machines demonstrate the
effect of continuous reinforcement efficiently. Each time you put
money into a vending machine, you expect to get something in return
for your money. When you do not get anything in return for your
money, you do not put your money into the vending machine again.
However, if you put your money into a casino slot machine, and
receive nothing in return, you are still most likely to keep putting
money into the machine hoping that you will receive something the
next time. The casino slot machine demonstrates the concept of partial
reinforcement.
(ii)
Partial Reinforcement
Partial reinforcements are either fixed or variable. Table 5.2 explains
fixed ratio and variable ratio schedules; and fixed interval and
variable interval schedules by providing examples of these schedules
of reinforcement in everyday life.
98
TOPIC 5 LEARNING
Example
Emitted behaviour
There is one major difference between classical conditioning and operant
conditioning. In classical conditioning, a response is automatically triggered
by some stimulus. For example, food in the mouth automatically triggers
salivation. In contrast, operant conditioning involves voluntary behaviour
TOPIC 5 LEARNING W
99
(Morris & Maisto, 2001). For example, Thorndikes cat voluntarily attempted
to open the box door. This is called operant behaviour because it involves
operating on the environment.
Consequence
Consequences which increase the chances that a behaviour will be repeated
are called reinforcers. For example, Thorndikes cat gained freedom and food
for escaping from the puzzle box. In contrast, consequences which decrease
the chances that a behaviour will be repeated are called punishers.
Thorndike found that the influences of consequences can essentially follow
two paths. First, when a certain behaviour is rewarded (reinforced), the
behaviour is most likely to be performed again. Whereas, when a certain
behaviour brings about a negative effect such as punishment, the behaviour
is most likely to be suppressed. This is often referred to as the principle of
reinforcement by modern psychologists. It is not always easy to tell whether a
particular consequence will be reinforcing or punishing. We must wait to see
if the consequence increases or decreases the behaviour that comes before.
For example, you might assume that chocolate is always reinforcing for
children, but some children may not be motivated by this reward, so for them
it is not reinforcing. Hence, we must be careful when identifying
consequences as reinforcers or punishers.
Table 5.3 demonstrates the differences and similarities between classical
conditioning (CC) and operant conditioning (OC).
Table 5.3: Comparison between Classical Conditioning and Operant Conditioning
Similarities
Differences
100 X
TOPIC 5 LEARNING
ACTIVITY 5.2
1.
2.
5.2
COGNITIVE LEARNING
not solve this problem by using the trial and error strategy like Thorndikes cat,
but rather seemed to think about the problem for a period of weeks. Then, in a
flash of insight, the chimpanzee picked up the bamboo sticks and connected
them by inserting one end into the hollow end of another. With this extension,
the chimpanzee was able to reach the banana.
Edward Tolman went on to say that in order for learning to occur, the response
does not necessarily have to be reinforced, contrary to what most learning
theorists/behaviourists believed at that time. He found that a significant amount
of learning consists of latent learning, as he called it. Latent learning occurs in the
absence of any reward and the knowledge learnt remains hidden until some day
in the future when it can be finally retrieved and used.
In short, cognitive learning is the basis for many modern educational approaches
such as constructivism and problem-based learning.
5.3
OBSERVATIONAL LEARNING
(b)
(c)
102 X
TOPIC 5 LEARNING
This is the difference between learning and performance. We can learn without
any change in external behaviour that shows that we have learnt. However,
whether or not we perform depends on our motivation.
The kind of consequences associated with the observed behaviour (the rewards
or punishments) is an important motivation for acting. These consequences
(rewards or punishments) do not necessarily have to happen to the observer.
They may simply occur to the other people the observer is watching. This is
called vicarious reinforcement or vicarious punishment because the
consequences are not directly experienced by the observer; rather they are
experienced through other people. For example, if you see your father drinking
alcohol and enjoying himself, you are experiencing vicarious reinforcement of
drinking and you are much more likely to imitate it.
One of the pioneers of social learning theory is Albert Bandura, who refers to his
perspective as social cognitive theory. Bandura (1986) demonstrated in his
experiment that people can learn behaviour without being reinforced directly for
it, and that learning a behaviour and performing it are not the same thing. Three
groups of preschool children watched a film in which an adult starts hitting a
doll after it refuses to obey him. The film ended differently for the children in
each of the three groups. Those in the model-rewarded condition saw the adult
who had hit the doll being rewarded with lots of goodies and praise by a second
adult (vicarious reinforcement). Those in the model-punished condition saw the
second adult scolding and beating the adult who had hit the doll (vicarious
punishment). Finally, those in the no-consequences condition saw nothing
happen to the model as a result of his aggressive behaviour. Immediately after
watching the film, the children were escorted into another room where they
found the same doll. Each child played alone for 10 minutes while being
observed behind a one-way mirror. Much of the behaviour the children
displayed was what had been reinforced on the adult in the film.
This study teaches us an important lesson regarding how we should not provide
a model of aggression for children to follow unintentionally. Imagine you want
your child to stop hitting his sibling. You might resort to hitting the child as
punishment in order to discourage this kind of behaviour. However, hitting the
child also shows that hitting is an effective method of getting your way.
Therefore, hitting not only provides a model of aggression; it also provides
vicarious reinforcement. Both you and your child will be happier if the
punishment for hitting was not a similar form of aggression, and if the child is
also rewarded for showing kindness to others.
SELF-CHECK 5.2
1.
2.
Classical conditioning
Cognitive learning
Conditioned response
Conditioned stimulus
Latent learning
Neutral stimulus
Observational learning
Punishment
Reinforcement
Unconditioned stimulus
Unconditioned response
Vicarious conditioning
104 X
TOPIC 5 LEARNING
Davis, S. F., & Palladino, J. J. (2000). Psychology (3rd ed.). Upper Saddle River,
New Jersey: Prentice Hall.
Gazzaniga, M. S., & Heatherton, T. F. (2003). Psychological science: The mind,
brain and behaviour. London: W.W Norton & Company.
Gerrig, R. J., & Zimbardo, P. G. (2008). Psychology and life (18th ed.). Boston:
Pearson Education.
Huffman, K., Vernoy, M., & Vernoy, J. (1997). Psychology in action (4th ed.). John
Wiley & Sons Inc.
Morris, C. G., & Maisto, A. A. (2001). Understanding psychology (5th ed.).
Prentice Hall: Upper Saddle River, New Jersey.
Chapter 5: Learning
Weiten, W. (2001). Psychology themes & variations (5th ed.). Wadsworth/
Thomson Learning.