Readings
Instructor notes
Learning activities
Web resources
Driscoll, Chapter 2
Optional:
Gredler, Chapter 5
Optional reading:
Munson & Crosbie, 1998. Effects of response cost on computerized program
instruction. Psychological Record, Spring 98, Vol. 48, Issue
2.
This article is
to be skimmed, not read for detail. (See the last paragraph of the Instructor
Notes section for an explanation.) To access this article online, go
to the IUB Libraries
Web site and connect to the Academic Search FULLTEXT Elite database.
(There is also a link to the Libraries site from the P540 Web Resources
page.)
Back
to Top
Important ideas leading
up to Skinner
Skinnerian behaviorism
had important antecedents, including:
- Associationism,
and Ebbinghaus's experiments with memorizing lists of nonsense syllables.
Why did Ebbinghaus conduct his learning experiments with nonsense
syllables? And, in retrospect, why was that perhaps not such a great
idea?
- Thorndike's interest
in the association between the environment and action (or behavior),
rather than just the association between ideas.
- Pavlov's "discovery"
of classical conditioning, where a previously "neutral"
event (e.g., ringing a bell), when paired with a biological stimulus
(e.g., food) can come to elicit the same physiological response (e.g.,
salivation).
Some concepts associated
with the classical conditioning paradigm:
- Higher-order
conditioning: when a conditioned stimulus is paired with another
previously neutral stimulus, it too can acquire the ability to elicit
a response. So, if Pavlov's dogs were conditioned to salivate when
he rang a bell, then he began flashing a bright light at the same
time, eventually the light alone might come to elicit the same response.
- Extinction:
when the conditioned stimulus is presented over a sufficient period
of time without the unconditioned stimulus, the conditioned stimulus
eventually loses its ability to elicit the response. (i.e., ring that
bell enough times without presenting food, and the dog will stop salivating.)
- Counter-conditioning:
Changing the response to a conditioned stimulus by pairing it with
a different stimulus. In the case of Baby Albert, you might pair a
pleasant stimulus with the white rat, just in case extinction alone
doesn't work. Or, in the case of Pavlov's dogs, you might pair an
appetite-suppressing stimulus with the conditioned stimulus.
Back
to Top
Skinner's operant conditioning
Watson (1913) introduced
the notion of behaviorism: that psychology should be concerned only
with the objective data of behavior, since we can't really know what's
going on in the mind. B. F. Skinner followed Watson's lead in emphasizing
behavior, rather than thoughts, feelings, attitudes, etc., as the "proper"
subject matter of psychology. The interesting difference in Skinner's
view is that he was less concerned about the relationship between environmental
stimuli and responses; rather, he was mainly interested in behavior
(or a response) and its consequences. Attempts by psychologists to link
all behaviors to stimuli from the environment had become very cumbersome,
and required postulating a lot of intervening mental processes. Just
imagine starting only with stimuli that elicit physiological responses
(heat, loud noises, hunger, etc.), and trying to account for complex
human behavior like building a cathedral or writing a poem!
Skinner said that
the environment provides us with "cues" that serve as antecedents for
behavior; i.e., they set the conditions for it to occur. But it's the
results or consequences of our behavior which make that behavior more
or less likely to occur in the future, and so it was these results that
were more interesting to him.
Thus, Skinner distinguished
two classes of behavior: respondent and operant.
Respondent behavior is the kind we perform automatically in the presence
of some stimulus from the environment. Operant behavior is "emitted"
without necessarily following any particular stimulus. For example,
babies grasp at things and babble, just because that's what babies do.
To Skinner, it would be the consequences of their grasping or babbling
behavior that are important.
According to Skinner,
the basic "unit" of behavior analysis is a discriminative stimulus,
followed by an operant response, followed by a contingent
stimulus. This is commonly diagrammed something like this:
S (discrim) -->
R --> S (conting)
Remember that the
discriminative stimulus sets the stage for the response, but does not
automatically elicit it in the way that an object moving toward your
eye can elicit a blink. For example, robins peck for worms when they're
on the ground, but not when they're flying. So, the ground is a discriminative
stimulus for pecking behavior. The pecking behavior (the R in our diagram)
may be followed by getting a worm. If so, the worm is a new stimulus,
which is contingent on the pecking behavior, and which may serve to
reinforce that behavior.
Humans, too, are
very sensitive to discriminative stimuli. We know that very different
sets of behaviors are appropriate for different settings. For example,
the behaviors that are appropriate at a football game are quite different
from those that would be appropriate at an opera. While Skinner had
some interest in the role of discriminative stimuli in setting the stage
for behavior, he was more interested in the consequences of behavior;
that is, in the response-contingent stimulus that follows a response.
Back
to Top
Consequences of a response
Skinner noted that
the consequences of a response (the contingent stimulus) can be either
satisfying or unsatisfying to the person or organism making the response.
When the consequence is satisfying, it is said to reinforce the
behavior; that is, to make it more likely to occur in the future. When
the response is not satisfying, or aversive, it is said to punish
the behavior; that is, to make it less likely to occur in the future.
It's important to remember that we can't always guess what consequence
will be satisfying to someone. The final test of whether a consequence
is reinforcing to someone is whether or not it increases the behavior
of interest. Suppose, for example, that we try to get a child to go
to bed on time by reading to her for half an hour each night. No matter
how much we believe that the child enjoys being read to, if this does
not increase her going-to-bed-on-time behavior, then, in Skinnerian
terms, we must conclude that this was not a sufficient reinforcer for
her. Or (and this may be the case), that our reading reinforces several
delay tactics and makes them more likely to occur again.
Operant conditioning
would be a fairly simple concept if it were concerned only with positive
and negative consequences of a behavior or response. But it's a little
more complicated than that. Skinner reasoned that we could think not
only about situations in which a satisfying or aversive outcome was
presented following a response, but also about situations in
which it was removed following the response.
Suppose, for example,
that every time your dog sniffs around on top of the kitchen stove,
he finds a chicken leg or something equally tasty to eat. It is likely
that his stove-sniffing behavior would increase, because it has been
reinforced by finding food there. Now suppose that you get tired of
this and you stop leaving food on top of the stove where the dog can
reach it. This is a situation where a satisfying outcome is removed,
rather than presented, following the response. And the likely consequence
(you hope) is that the stove-sniffing behavior would decrease, through
the process of extinction.
Here's another example.
Suppose every time a student asks a question in class, the teacher responds
in a curt or abrupt way. This may function as a punishing consequence
which decreases the question-asking behavior. Then, a new teacher who
is more open to questions takes over the class. The punishing or aversive
consequence has been removed, so the student may increase the question-asking
behavior.
If we "cross" the
two possible functions of a contingent stimulus (satisfying or aversive)
with the two possible conditions of presenting or removing the stimulus
after the response, then we get the table with four cells that you see
on page 39 of the text.
To analyze behavior
a la Skinner, it is important to understand these four possible situations.
In particular, note that both positive reinforcement and negative reinforcement
(the upper left and lower right cells in Driscoll's table) serve to
increase or strengthen behavior. Positive reinforcement does
so by presenting a satisfying outcome after the response; negative reinforcement
does so by removing an aversive outcome after the response. Also, both
punishment, and response cost, timeout, and extinction (the upper right
and lower left cells) serve to decrease or weaken behavior. Punishment
does so by presenting an aversive consequence following the behavior.
The others do so by removing a satisfying consequence following the
behavior. Note: presenting an aversive outcome is sometimes referred
to as presentation punishment, while removing a satisfying outcome is
referred to as removal punishment.
Let's see how
well you understand the four different consequences which can be applied
after a behavior. Test yourself below by trying to decide which example
is presentation punishment, removal punishment, positive reinforcement,
and negative reinforcement.
Your dog, upon
hearing classical music, begins howling and your quiet moment is broken.
Now, you no longer listen to classical music. [click
for answer]
You are at work
and point out to your boss an egregious spelling error in a memo he/she
writes. Your performance review later that day is particularly harsh.
You no longer point out your boss' spelling errors. [click
for answer]
You wake up every
morning and place your feet on the cold tile floor. As you find this
unpleasant, one morning you put on your slippers and your feet are not
cold. Your behavior of putting on slippers increases. [click
for answer]
You decide you
would like some quiet time before going to bed each night. You turn
on a classical CD and dim the lights. You wake up the next morning feeling
refreshed so you do it again the next night. [click
for answer]
Back to Top
Learning new behaviors
and maintaining established ones
So far, we've considered
only behaviors that the person (or animal or organism) already knew
how to perform. The only issue was whether the frequency of the behavior
would be increased or decreased. But how does behaviorism account for
learning to perform new behaviors? In your text, four processes are
suggested:
- Shaping
involves the reinforcement of successively closer approximations
of some target behavior. For example, if you want to teach your
dog to fetch the newspaper, you might begin by rewarding him when
he approaches the door, then only when he goes out onto the sidewalk,
then only when he approaches the newspaper, then only when he picks
it up, then, finally, only when he performs the target behavior
of bringing the paper to you.
- Chaining
teaches complex behavior by reinforcing the performance of simpler
behaviors which are then strung together in the proper sequence
to make up the more complex behavior. For example, if I want to
teach someone to assemble an electronic component from a kit, I
might first teach the individual steps of assembly (with appropriate
reinforcement after the completion of each task), then reinforce
the learner only when she can assemble a particular module, then
finally only when she can assemble the entire component.
- Discrimination
learning is the process of learning to discriminate between
settings in which a particular behavior will or will not be reinforced.
For example, if I'm learning to work simple arithmetic problems,
I must know to add in the presence of a plus sign (a discriminative
stimulus), but not to add in the presence of a minus sign.
- Fading
is the gradual withdrawal of prompts or cues (discriminative stimuli)
that guide the performance of a complex behavior. For example, when
my friend was first learning to play chords on a guitar, he needed
lots of prompts, such as diagrams of the chord, a picture of someone
with their fingers in the proper position, and a teacher who actually
placed his fingers in the proper position. Later, he needed fewer
and fewer prompts to be able to form the chord correctly.
To this point,
we have assumed that a behavior is reinforced every time it is performed.
However, behaviorists found that this need not be the case. Once a
behavior has been established, reinforcement "schedules" other than
one reinforcement for every performance turn out to be more effective
at maintaining high response rates. Be sure you are familiar with
fixed/variable ratio and interval schedules. Check your understanding
below:
Suppose a teacher
takes her class to the library to work on an assignment. She gets
called away to monitor another teacher's class. As she leaves she
tells the students:
I'll be back every
5 minutes to check on you. You'd better be good.
[click
for answer]
You never know
when I'll be back to check on you; it could be 5 minutes, it would be
10 minutes. You'd better be good. [click
for answer]
When you have
completed the first 5 questions, send someone to get me and I'll let
you know what to do next. You'd better be good. [click
for answer]
When
I come back I'm going to check to see how many questions you've completed.
I expect you to get as many done as you can. You'd better be good. [click
for answer]
Which schedule
would lead to the fastest learning? Which would be most resistant
to extinction? Overall, which would you recommend to this teacher?
Back
to Top
Behaviorism and education
One of the most
lasting influences of behaviorism in education is in the use of principles
of behavior management. Many teachers have used ideas such as "time
out" and token economies to reward on-task classroom behavior and
discourage disruptive behaviors.
Behaviorism also
had a pervasive influence on instruction and instructional design,
although that influence has faded considerably over the past two decades.
Contemporary computer-assisted instruction is a direct descendant
of teaching machines and programmed instruction, both of which were
attempts to create a technology of teaching in accordance with behaviorist
learning principles. Instruction created according to behaviorist
principles (especially programmed instruction, early computer-based
instruction, and some contemporary computer-based tutorials and drill-and-practice)
typically emphasized these characteristics:
- Emphasis
on specific, observable learning outcomes. The target behaviors
needed to be specified so that the instructor or designer could
provide appropriate reinforcement when they were achieved.
- Individualized
learning. Each learner worked independently, at his/her own
pace. This permitted "reinforcement" to occur at the moment the
learner completed a particular objective.
- Frequent
reinforcement. The material to be learned was broken into very
small "chunks", usually somewhere between a paragraph
and a page of text. This was so that the learner could be reinforced
frequently while working through the material.
- "Knowledge
of correct results" as a reinforcer. The usual pattern of this
kind of instruction was to present a small amount of reading or
a problem, then to pose a question. In behaviorist terms, the reading
or problem was a discriminative stimulus; when the student answered
the question, they were emitting a response; and reinforcement that
followed the response was finding out whether they had answered
the question properly. Typically, these should be administered frequently
and should cover little new content so that the odds of producing
an incorrect answer to the discriminative stimulus (the Q) would
be decreased (that would reinforce the wrong answer).
- Fading of
prompts. In more sophisticated behaviorist instruction, the
learner would be provided with less and less guidance or prompting
as he/she proceeded through the material.
One of the reasons
that this kind of instruction has largely disappeared is that it emphasized
lower-order skills such as memorization. It is very difficult to teach
higher thinking skills such as synthesis and evaluation in this instructional
format.
However, to paraphrase
Mark Twain, reports of the demise of behaviorism as the basis for
instructional design are greatly exaggerated. The optional reading
listed for this unit is an example of a very contemporary research
study that is squarely in the behaviorist tradition.
Back
to Top
2.1 Discussion
on Behaviorism (in
Oncourse)
Facilitator (name
of student here) to post by Tuesday, June 4
Initial posts by Sunday, June 9, responses by Thursday, June 13
Wrapping (name of student here) to be completed by Friday, June 14
The following
is taken from Ormerod (2000) but should make a great opener for discussion.
Reinforcement
is often an effective way to improve students' classroom behavior.
But it has been criticized on many counts. Your colleagues at work
have expressed the following concerns about reinforcement. Some of
the concerns about reinforcement are probably "bogus" while
others are more legitimate. How would you respond to your colleagues
when they express these concerns? In other words, which concerns do
you think are "bogus" and which might be substantiated?
Why?
1. Reinforcement
develops dependence on concrete rewards for appropriate behavior.
2. Extrinsic reinforcement may encourage students to accomplish a
task in a minimally acceptable way rather than in a maximally beneficial
manner.
3. Changing a problem behavior doesn't change its underlying cause.
4. Reinforcement is bribery.
5. Extrinsic reinforcement of a behavior may undermine any intrinsically
reinforcing value that the activity has for students.
6. Reinforcing one student for good behavior after bad behavior teaches
other students to be bad to get reinforcement of their own when they
are good.
Our whole class
discussion task is to create a unified response to our colleagues
to address their concerns. The wrapper(s) will be responsible for
creating drafts, getting feedback and posting the class synthesis
by Friday, June 14.
2.2 Thought
activity: A plan for changing behavior
To be completed
by Sunday, June 16
With your group or on your own, consider the following scenario:
You have a five year-old daughter who suddenly refuses to go to bed
on time. She comes up with the classic excuses, needs a story, a song,
a kiss, a glass of water, to use the bathroom…then when all appeals
are exhausted, moves into the toughest thing for a parent to defend
against: monsters!
You suspect that
she is not really frightened and that this is just another delay tactic.
You even give her a water pistol to shoot the monsters, but this just
results in damp posters on her walls. You need to find a way to get
your child to go to bed on time.
Your task for
this thought activity is to draw on behaviorist principles learned
in Chapter 2 to create a plan for modifying your child's behavior.
Some things
to consider:
1. Observable
and measurable objectives are a key principle of behaviorism. What
evidence would you accept that the change had been made?
2. What
type of consequences would work in this situation - presentation
punishment, removal punishment, positive reinforcement, negative
reinforcement?
3. What schedules
of reinforcement would you use? Why?
4.
Could you modify any of the discriminating stimuli? How?
5. Don't forget - she is your daughter, so think about the realistic
implications of your ideas.
How
this thought activity will be assessed:
1.
Please limit your plan to 3-4 pages
2. Support
your plan with behaviorism principles from the readings. Make sure
you stick to only behavioral principles; don't let an unobservable
phenomena (e.g., thoughts, feelings) into your description of behaviorist
principles you wish to apply.
3. Make
sure its clear in your writing WHY something will work according
to behaviorism.
4. Make
reference to principles and pages in your readings to document your
plan.
5. Both processes and outcomes will be considered in the assessment.
2.3 Reflections
To be completed by Sunday, June 17
The purpose of
the reflection is for you to think about what you have learned through
this experience.
If you completed
this activity individually, please submit the answer to these
questions to your instructor along with your unit product:
1. Why did you choose to work individually on this activity?
2. How did your understanding of the learning theory change through
this activity?
If you completed
this activity as a group, please answer the following questions
individually and submit to the instructor. Please be honest. No group
experience is without challenges and frustrations. Reflecting on the
challenges of the group experience is just as important as celebrating
the positive achievements. Being honest will help us as instructors
give better guidance to teams collaborating online in the future.
1. Evaluate the contribution of EACH of your project team members,
including yourself, on a scale from 1 to 5. Refer to the descriptions
below as you make your ratings.
0 = team member
made no visible contributions to the project OR made significant
and sustained negative contributions to the project
1 = team member made minimal contributions to the overall project
2 = team member made uneven contributions to the project - some
positive, some negative
3 = team member made reasonable contributions to the project
4 = team member made significant and sustained positive contributions
to the project
5 = team member made significant and sustained positive contributions
to the project AND supported every member of the group by actively
bringing out the best in others.
2. Briefly describe
your group's approach to completing this thought activity.
3. Briefly describe
your individual contribution and each team member's contributions
to the activity.
4. How did your understanding of the learning theory change through
this activity?
Back
to Top
The Web Resources page has links to several
sites about behaviorism, including information about John Watson and
B.F. Skinner, the fathers of behaviorism.
Back
to Top
Comments: joalexan@indiana.edu