Evaluative Summary of article on
Randomized Block Design
Submitted by Tsai-Feng Wang
1. Background Information
Author: Gupta, S.
Title: Impulsivity/sociability and reinforcement in verbal operant conditioning:
a replication.
Source: Personality and Individual Differences.
Year: 1990, 11(6), 585-589.
2. Abstract
The purpose of this study was to examine the verbal operant conditioning (dependent
variable) in relation to impulsivity/sociability (three levels: high, moderate
and low) and reinforcement conditions (two levels: good (rewarding reinforcer)
and poor (punishing reinforcer)). The author stated that this is a 3 x 2 randomized
block design (however, I think this a 3 x 2 fixed-effect generalized randomized
blocking design because the author tested interaction effects in the ANOVA
tables). Previous similar study (Gupta and Nagpal, 1978) was conducted by
using different reinforcers (one verbal and one nonverbal reinforcers for
rewarding condition, and one nonverbal reinforcer for punishing condition).
In this study, the author used only verbal reinforcers for both rewarding
and punishing conditions). Participants were 240 female students from undergraduate
and postgraduate schools. 120 subjects joined in the one experiment with impulsivity
variable, and 120 subjects joined another one with sociability variable. Dependent
variable (verbal operant conditioning) was measured by one instrument: stimulus
material using Taffel's sentence construction technique. Two separate ANOVA
tables were completed on each of the two research questions (one is related
to the impulsivity, and another is sociability). The results of ANOVA with
impulsivity variable indicated that the effect of impulsivity (p = .02), reinforcement
(p = .05), and interaction between impulsivity and reinforcement (p < .01)
were found significant. The results of ANOVA with sociability variable indicated
that the effect of sociability (p < .01) and interaction between sociability
and reinforcement (p < .01) were found significant. However, the effect
of reinforcement (p = ?) was not found to be significant.
3. Null hypothesis, (or p) - level and sample size per group.
In this study, two null hypotheses for this 3 x 2 randomized block design
(RB-2) in two separate experiments are as follows:
(A). For the study with the impulsivity variable:
(1) Test for the effect of reinforcement treatment:
Ho1: j = 0 for all j. There is no effect caused by the two types of reinforcement,
which means that there is no verbal operant conditioning scores difference
for subjects either with rewarding condition or punishing one.
H11: j 0 for some j. There is verbal operant conditioning scores difference
caused by the different types of reinforcement.
(2) Test for the blocking variable (impulsivity):
Ho1 : z = 0 for all z. There is no effect caused by the three levels of impulsivity,
which means that there is no verbal operant conditioning scores difference
for subjects with high, moderate, or low level of impulsivity.
H11 : z 0 for some z. There is verbal operant conditioning scores difference
caused by the different levels of impulsivity.
(B). For the study with the sociability variable:
(1). Test for the effect of reinforcement treatment:
Ho1: j = 0 for all j. There is no effect caused by the two types of reinforcement,
which means that there is no verbal operant conditioning scores difference
for subjects either with rewarding condition or punishing one.
H11: j 0 for some j. There is verbal operant conditioning scores difference
caused by the different types of reinforcement.
(2) Test for the blocking variable (sociability):
Ho1 : z = 0 for all z. There is no effect caused by the three levels of sociability,
which means that there is no verbal operant conditioning scores difference
for subjects with high, moderate, or low level of sociability.
H11 : z 0 for some z. There is verbal operant conditioning scores difference
caused by the different levels of sociability.
The level was not stated throughout the article. The p level was reported
in the ANOVA table. In the experiment with impulsivity variable, a significant
F-ratio was report in the reinforcement effect at p = .05 level, in the impulsivity
effect at p = .02 levels, and in the interaction effect between reinforcement
and impulsivity at p < .01. In the experiment with sociability variable,
a significant F-ratio was report in the reinforcement effect at p < .01
levels, in the sociability effect at p < .01 levels.
The total number of participants was 240, which was divided into two parts
of experiments with 120 subjects in each one. Six groups were formed in this
3 x 2 randomized block design (RB-2). Therefore, there were 20 subjects in
each group. The author used equal sample size (n=20 subjects) in each group
and in each experiment.
4. Independent and dependent variables.
There were two fixed independent variables, the blocking and treatment variables,
and one dependent variable in the 3 x 2 randomized block design (RB-2).
The blocking variable:
In experiment 1: impulsivity, with three levels of scores: high, moderate,
and low. In experiment 2: sociability, with three levels of scores: high,
moderate, and low.
The treatment variable: Reinforcement, with two levels: good (rewarding condition)
and poor (punishing condition).
The dependent variable: Verbal operant conditioning scores measured by the
stimulus material using Taffel's sentence construction technique.
5. Instrument, briefly comment on its reliability and validity.
Three instruments were used in this study. For the purpose of sample selection,
Hindi version of the Eysenck Personality Inventory (EPI) was used. In order
to measure the scores of the dependent variable (verbal operant conditioning),
the author used the stimulus material using Taffel's sentence construction
technique. Also, there is a questionnaire used for testing the awareness of
the connection between subjects' behavior and the reinforcement shortly after
the treatment experiment. None of these three instruments were given the evidence
of validity and reliability. We don't know what these instruments are, and
how readers can be convinced about their consistency and accuracy related
with the author's intention on different purposes in this study.
6. Experimental Procedure
240 subjects were selected from the preliminary testing (Hindi version of
the Eysenck Personality Inventory (EPI) in order to block subjects according
to three levels of impulsivity and sociability) among 2000 female volunteers.
Two experiments were conducted with 120 subjects in each experiment. Every
subject was tested individually by the stimulus material using Taffel's sentence
construction technique, which presenting a card at a time and the total card
number is 100. In the stage of the fist 20 cards, the experimenter simply
wrote down what the subject's choice in pronounce in the nature condition.
In the stage of the next 60 cards, it is a treatment condition that the experimenter
spoke "good" or "poor" where "I' or "WE"
was reinforced. In the stage of last 20 cards, the reinforcement was removed.
As a result, frequency in using different pronouns from each card in these
three stages will be recorded and calculated toward the final conditioning
scores. Shortly after the experiment, a questionnaire used for testing the
awareness of the connection between subjects' behavior and the reinforcement
was conducted for each subject. If any subject were aware of the connection,
another new subject would replace her. There were 15 replacements happened
in this study.
7. Statistical analysis and conclusion.
Two ANOVA analyses were conducted separately on two experimental interests,
one is related with impulsivity variable and another one is related with sociability
variable. In the experiment with impulsivity variable, the effect of impulsivity
and reinforcement were revealed to have significant effects, where F(2, 114)
= 4.58, p = .02 and F(1, 114) = 4.38, p = .05, respectively. Also, the interaction
effect between impulsivity and reinforcement was found to be significant (F(2,
114) = 56.39, p < .01). It was found that subjects in the high level of
impulsivity had larger conditioning scores with "good"(rewarding
reinforcer) than with "poor"(punishing reinforcer) whereas subjects
in the low level of impulsivity had larger conditioning scores with "poor"
than with "good". In the experiment with sociability, the effect
of sociability was revealed to have a significant effect (F(2,114) = 5.57,
p < .01). Also, the interaction effect between sociability and reinforcement
was found to be significant (F(2, 114) = 19.69, p <.01). However, the effect
of reinforcement was not found to be significant. It was found that subjects
in the high level of sociability had larger conditioning scores with "good"(rewarding
reinforcer) than with "poor"(punishing reinforcer) whereas subjects
in the low level of sociability had larger conditioning scores with "poor"
than with "good".
8. If you were the researcher, how would you improve the study?
1. Although the author stated that this is a 3 x 2 randomized block design;
however, I think this is a 3 x 2 fixed-effect generalized randomized blocking
design because the author tested interaction effects (which were also the
author's intention) in the ANOVA tables. Therefore, this is a GRB-2 rather
RB-2 design. The null hypothesis about interaction effect in the GRB-2 design
should be added after previous GB-2 design in Part 3 as follows:
(A). For the study with the impulsivity variable:
(3) Test for the interaction effect between impulsivity and reinforcement:
Ho1: ( ) jz = 0 for all j and z. There is no interaction effect between the
two types of reinforcement and three levels of impulsivity. That is, a special
combination of the treatment and blocking factors, e.g. rewarding condition
and low level of impulsivity, does not affect the consistency of verbal operant
conditioning scores when compared to other combinations of the treatment and
blocking factors.
H11: ( ) jz 0 for some j and z. Interaction effect exists between the two
types of reinforcement and three levels of impulsivity in the verbal operant
conditioning scores difference.
(B). For the study with the sociability variable:
(3) Test for the interaction effect between sociability and reinforcement:
Ho1: ( ) jz = 0 for all j and z. There is no interaction effect between the
two types of reinforcement and three levels of sociability. That is, a special
combination of the treatment and blocking factors, e.g. rewarding condition
and high level of sociability, does not affect the consistency of verbal operant
conditioning scores when compared to other combinations of the treatment and
blocking factors.
H11: ( ) jz 0 for some j and z. Interaction effect exists between the two
types of reinforcement and three levels of sociability in the verbal operant
conditioning scores difference.
2. The way the author conducted the interaction comparison is quiet doubtful.
I don't really understand how he found his conclusions about the interaction
effects between reinforcement and impulsivity/sociability. He just stated
that he used Newman- Keuls procedure to test the interaction variance. How
come? This method is supposed to test the differences among the levels of
main effects. However, there were just two levels of treatment (reinforcement),
and blocking variable (impulsivity/sociability) is not our interests (although
it has three levels), we don't need to conduct a post-hoc procedure in this
study. When interaction effect is significant, and we are interested in the
interaction effect, contrast-contrast interaction (in this case, three interaction
contrasts in each experiment) comparison procedure should be used and presented.
It is even clearer for readers if the author could show interaction graphs
in the study.
3. Usually when we use RB or GRB design, we treat the blocking variables are
nuisance variables. Blocking is used to remove the effects of nuisance variables.
Also, in GRB design, we are not really interested in the interaction effect
between blocking variables and treatment variables. In this study, the author
did care about the significant results of the blocking variable and the interaction
effect of the blocking variable and treatment variable and treated these results
as important findings. From the viewpoint of GRB design issue, we should ignore
the significant results of the blocking variable and the interaction effect
of the blocking variable and treatment variable, because they are not our
interests. However, if we are interested in the interaction effect, we should
use another factorial design for your research intention and purpose.
According to author's purpose (author is both interested in impulsivity/sociability
and reinforcement) for this study, and the author also treated impulsivity
and sociability as main effects (see Results part in the article, p587), it
is really make me feel confused why he described that he used RB design in
his study. The variables in this study were not defined well. If I were the
researcher, I would clarify the independent variables as a treatment variable
or blocking one, and how these variables are related to the dependent variable
in order to make sure that I use right experimental design and statistical
analysis.
4. We can find that all sample are from female. The author didn't explain
why he only chose female as samples (because of convenient sample or his intention?).
It is problematic issue, when we generalize these results to the public without
mentioning they are suitable for female population. Therefore, I think the
title of this study should be either changed as Impulsivity/sociability and
reinforcement in verbal operant conditioning for female, or discuss and explain
it in the conclusion part of the paper when talked about the generalization
of the research results.
5. Statistical assumption should be stated in the paper in order to make sure
that statistical results are unbiased. For example, for the ANOVA method in
this study, there are three important assumptions: (1). Normality- the samples
for each of the groups are assumed to come from normal populations. (2). Homogeneity
of Variance- this is the assumption that the populations from which the data
came all have the same variance, regardless of whether or not their means
are equal. (3). Independence- this is the assumption that the errors associated
with each observation are independent. Because this study is a fixed model
(model 1), we don't need to use MSinteraction as the error term (homogeneity
of population within-cell variation is required when we use MSwithin as a
error term). Therefore, the sphericity assumption is not required in this
study (see Kirk in p.306).
6. ANOVA table (in Table 4) didn't provide what is the p-value in the effect
of reinforcement. We cannot understand how author judged that it was non-significant
in the reinforcement variable. Was the p-value too high or just slightly past
.05? I would present the p-value no matter what the value is in order to inform
readers to judge with the real raw data.
7. The evidence of instrument validity and reliability should be provided.
The author used three different instruments for different purposes. They were
Hindi version of the Eysenck Personality Inventory (EPI), the stimulus material
using Taffel's sentence construction technique and the questionnaire used
for testing the awareness of the connection between subjects' behavior and
the reinforcement. The construct validity of evidence of EPI should be provided
in order to make sure EPI was exactly used to measure the construct of impulsivity/sociability.
Also, the test-retest reliability coefficient of EPI should be proved its
consistency. I will also provide the evidence of the predictive-criterion
validity for the stimulus material and the questionnaire in order to prove
the accuracy of these two instruments for the purposes of this study. The
evidence of the test-retest and split-half reliability should be stated for
the stimulus material and questionnaire, respectively, in order to inform
readers their consistency when using these two instruments.
8. Although the ANOVA table was given, and the effects of impulsivity, reinforcement,
and interaction were tested to be significant, the information is not sufficient
to the readers. In order to understand the differences were large enough to
be useful in the real world, effect size (and 2) should provide to give the
evidence practical significance. After I did the calculations (an example
from the experiment with impulsivity variable):
2 =(2-1)(4.38-1) / (2-1)(4.38-1)+ (20*2*30) = .027 (large association)
2 = (3-1)(4.58-1) / (3-1)(4.58-1) + (20*2*30) = .056 (small association)
( )2 = (2-1)(3-1)(56.39-1) / (2-1)(3-1)(56.39-1) + (20*2*30) = .480 (large
association)
f = squr(.027/1-.027) = .167 (small effect size)
f = squr( .056-/1-.056) = .251 (small effect size)
f ( ) = aqur(.480/1-.480) = 1.332 (large effect size)
I found that only interaction effect had a large effect size in relation to
the verbal operant conditioning scores. Although effects of reinforcement
and impulsivity reached significance, its small effect size can't be counted
on any usefulness in relation to the verbal operant conditioning scores.
9. I found that the conclusion part in this study was quite weak. No limitation,
recommendation or further explanation described in this study. Although the
author stated the different finding comparing the previous study, he didn't
make any suggestion or application for this new finding. What is the possible
explanation for this different result? The caffeine example in the discussion
part was not adequate and totally was not related with the purpose of the
study. If I were the researcher, I would like to provide a clear discussion
in the final section about why the interaction between impulsivity and reinforcement
would potentially affect one's verbal operant conditioning scores? How can
this finding be used in a practical way? Also, I would present what the limitations
in this study were and give some suggestions for further studies.