Evaluative Summary of Articles on
Split-Plot Factorial (SPF- pq ) Design
Submitted by: Xiaojing Yang

 

1. Background Information
Author: Batson, Daniel C., Shannon Early and Giovanni Salvarani
Title: Perspective Taking: Imagining How Another Feels Versus Imagining How You Would Feel
Source: Personality and Social Psychology Bulletin (PSPB), Vol. 23, No.7, pp 741-758.
Year: 1997

2. Abstract
Literature review showed that the current empathy literature took it for granted that the psychological process and empathic emotion triggered by adopting two different perspectives (imagine-other vs. imagine-self) are the same. The authors wanted to demonstrate in their article (the first research purpose) that it was not the case and hypothesized that adopting an imagine-other perspective can trigger empathy to the person in need whereas adopting an imagine-self perspective can trigger both empathy to the person in need and distress. The second research purpose is the demonstration of and solution to a current methodological problem in empathy literature. The current procedure of measuring distress does not distinguish between self-oriented distress and other-oriented distress (a feeling of distress for others). The authors argued that the latter form of distress should belong to empathy rather than the direct personal distress and such confusion is a threat to content validity of distress measures. The authors conducted two separate two-way mixed-factor ANOVAs (3*2) to address the above two questions. 60 undergraduates participated the experiment although 4 were excluded from further statistical analysis because of their suspicion of authenticity of experimental manipulation. They were assigned to three treatment conditions and were asked to listen to a bogus pilot radio interview with a young woman in serious need and then were required to indicate their emotional response. While listening to the radio, they were asked to take different perspectives dependent on the treatment (imagine-other, imagine-self and objective). The results showed support to both hypotheses. Specifically, imagining how the person in need feels in the need situation produced empathy whereas imagining how a subject himself or herself would feel if he or she were in the need situation produced both empathy and personal stress. What is more, the existent measurement of distress did confuse two types of distress - direct personal distress versus distress for others.

3. Null Hypotheses, alpha (or p) level, and sample size per group
First all of all, I want to claim that I do not think the authors used a correct design (to be discussed later in this question) to answer either of their research questions. Second, the authors did not provide specific information regarding the dependent variable in either of the research questions (to be discussed in question 4). Virtually the two sets of statistical analysis and designs face similar problems. The research hypotheses listed as follows were based on the assumption that the authors were correct and that the authors used empathy index as dependent variable in the first question and direct distress index as dependent variable in the second research question.
The first set of null hypotheses (related to the first research question: adopting an imagine-other perspective can trigger empathy to the person in need whereas adopting an imagine-self perspective can trigger both empathy to the person in need and distress):
H01: The specific perspective the subjects take (imagine-self perspective, imagine-other perspective or objective perspective) when they listen to the taped radio interview with a person in need will not influence the degree to which the subjects experience empathic emotion (their scores on empathy index).
H01: j = 0 for all j (in this case j= 1, 2, 3)

H02: The type of vicarious emotion evoked (empathy or distress) will not influence subject's scores on empathy index (the degree to which the subjects experience empathic emotion).
H02: k= 0 for all k (in this case k= 1, 2)

H03: There will be no influence on the degree to which the subjects experience empathic emotion associated with the joint effects of the specific perspective the subjects adopt while listening to the taped radio interview and the type of vicarious emotion evoked. In other words, the difference between subjects' experienced empathic emotions due to the different perspective they adopt should not be dependent on the specific types of vicarious emotions evoked.
H03: ( ) jk = 0 for all j and k (in this case j= 1, 2, 3 and k= 1, 2)
Actually I believe there are problems associated with the second and third null hypothesis. The authors treated perspective taking as a between-subject variable and the type of emotion evoked as a within-subject variable. It is known from prior discussion that the first research purpose is to study the relationship between perspective taking and the types and degrees of vicarious emotion evoked. Therefore, the type of vicarious emotion evoked should be the dependent variable. However, the authors treated it as an independent variable. Besides, the second hypothesis is of no theoretical interest at all. Without carrying out statistical analysis, we will definitely expect a significant main effect of the type of vicarious emotions evoked on the degree to which the subjects experience empathic emotions. Another way to explain this main effect is that when the emotions evoked after listening to the tape is empathy, the subjects will score high on empathy index, which is of common sense and not at all worth studying. Since it is not correct to specify type of vicarious emotions evoked as independent variable, it makes no sense to interpret the interaction effect between the two independent variables.
The second set of null hypotheses (related to the second research question to further decompose distress: the type of distress evoked when subjects adopt the imagine-other perspective should be other-oriented and belong to empathic distress, whereas the type of distress evoked when subjects adopt the imagine-self perspective should be a mix of other-oriented distress and direct distress):
H04: The specific perspective the subjects take (imagine-self perspective, imagine-other perspective or objective perspective) when they listen to the taped radio interview with a person in need will not influence the degree to which the subjects experience direct personal distress (their scores on direct distress index).
H04: j = 0 for all j (in this case j= 1, 2, 3)

H05: The type of distress evoked (direct or for Katie-the person in need) will not influence subject's scores on the direct distress index (the degree to which the subjects experience direct personal distress).
H05: k= 0 for all k (in this case k= 1, 2)

H06: There will be no influence on the degree to which the subjects experience direct personal distress associated with the joint effects of the specific perspective the subjects adopt while listening to the taped radio interview and the type of distress evoked. In other words, the difference between subjects' experienced direct personal distress due to different perspectives they adopt should not be dependent on the specific types of distress evoked.
H06: ( ) jk = 0 for all j and k (in this case j= 1, 2, 3 and k= 1, 2)
This set of hypothesis faces the same problem as the first set of hypothesis. Namely, I think the fifth and sixth hypotheses are problematic. The authors treated the type of distress evoked as within-subject variable. It can be drawn that the second research purpose is to study the relationship between perspective taking and the types and degrees of distress evoked. Therefore, the type of distress (direct vs. for Katie) evoked should be the dependent variable. However, the authors treated it as independent variable. Besides, the fifth hypothesis is of no theoretical interest at all. It is basically saying that when the distress evoked after listening to the tape is self-oriented (person direct stress) the subjects will score high on direct distress index, which is of common sense and not at all worth studying. Since it is not correct to specify type of distress evoked as independent variable, it makes no sense to interpret the interaction effect between the two independent variables.
Pre-set alpha/ p-value: The alpha was not stated in either the research design or in the analysis section of the article. However, the authors did report p-level associated with each effect. The p values of significant effects range from 0.0001 to 0.05.
Sample size per group: The participants of this experiment are 60. The authors stated that in each cell (3 levels of perspective-taking), there are 10 males and 10 females. The authors mentioned that 4 data (all male) were excluded from further analysis because their doubt about the authenticity of the taped interview. (2 from the imagine-other condition, 1 from each of the other conditions) Therefore, the sample size is 56 (N=56).
It is also worth mentioning that the authors originally used a randomized block procedure to assign the subjects. Although it is the only place in the article mentioning this assignment procedure, it can be inferred that the authors might originally want to utilize a randomized block design and control the blocking variable of sex (actually it makes sense to expect an sex effect on empathy emotion theoretically). However, since later statistical analysis failed to show support of this hypothesis (pp754), they had to give up the randomized block design. (All the above analyses were not stated in the article besides the mentioning of "randomized block procedure" to assign subjects and the insignificant sex effect). The authors should make clear this process since it may cause confusion to the readers when they try to figure out the underlying theoretical reasons to utilize this "randomized block procedure" to assign subjects.
The final sample size per cell is shown as follows (perspective-taking as between-subject variable, types of vicarious emotions evoked as within-subject variable):
Imagine-other Imagine-self Objective
Empathy 18 19 19
Distress

However, in later analysis, the authors did not state how they solved the problem of unequality of cell sample size.

4. Independent variable and dependent variables
Although I think the authors are erroneous in terms of their basic design form, for the sake of analysis, let us first assume the authors were right. In the first mixed- factor ANOVA, the independent variables are perspective taking and types of vicarious emotions evoked. The dependent variable is the empathy index.
Perspective-taking was treated as a fixed and between-subject independent variable. The first level is objective perspective. In this level, subjects were required to view the person in need from an objective and detached perspective and were discouraged from imagining from the target person's perspective. The second level is imagine-other perspective. In this level, the subjects were required to imagine how the target person feels in the need situation. The third level is imagine-self perspective. In this level, the subjects were required to imagine how they themselves would feel if they were in the target person's situation.
According to the authors, the second independent variable is the type of vicarious emotion evoked, which was treated by the authors as a fixed and within-subject independent variable. The first level of this variable is empathy, which reflects an other-oriented emotional response congruent with the perceived plight of the person in need. The second level of this variable is distress, which is defined as a self-oriented aversive emotional response and taps more direct feelings of discomfort evoked by witnessing the plight of the other. Just as I have discussed before, I think that the authors were erroneous to treat the variable as an independent variable.
As I have mentioned before, the authors did not specify clearly which was considered as dependent variable. I think there are three possibilities of dependent variable. The first possibility is overall emotional reaction index (my label, the authors did not state it), which is the average of subject's scores on the total 26-item emotional reactions scale. The second possibility is empathy index, which is the average of the 6 items concerning empathy among the overall 26 items. The third possibility is the average of the 8 items concerning distress among the overall 26 items. The authors did not tell the readers which the above three alternatives is the one they used as dependent variable.
The second so-called "mixed"-factor ANOVA faces the same problems. Consistent with the analysis of first ANOVA, let us first assume the authors were right.
The first independent variable, perspective-taking, is the same as the first ANOVA. The second independent variable, type of distress, is a fixed and within- subject variable. It has two levels. The first level of this independent variable is direct distress, which refers to the kind of distress that is directly experienced by people after they hear the plight of the person in need. The second level is empathic distress, which refers to other-oriented emotional response congruent with the perceived welfare of another.
The dependent variable again faces the same problem as the first ANOVA. I think there are two possibilities of dependent variable. The subjects were asked to indicate the degree they personally directly experienced each of these types of response for four of the distress adjectives rated previously (distressed, upset, troubled and grieved) and the degree to which they feel each of the same four distress adjectives for the person in need. The scores were averaged to form a direct stress index and empathic stress index. It is not clear that which of the two indexes the authors used as dependent variables.
Again, I think the authors were erroneous to treat types of distress evoked as an independent variable. According to author's research hypotheses, the type of distress should be reflections of the manipulations of perspective taking.

5. Instrument, comment on its reliability and validity.
The first instrument/questionnaire the authors used was to measure the emotional reaction to Katie's need. After listening to the tape, participants completed the reaction questionnaire. Part 1 of the questionnaire listed 26 adjectives describing different emotional states. For each item, the subjects indicated on a 7-point likert scale to indicate how much they had experienced that emotion while listening to the tape. The list included the six items concerning empathy. It also included eight items concerning distress. However, the authors did not mention in their article the remaining 12 items. Part 2 of the reaction questionnaire was designed to assess the nature of distress reported in Part 1. Participants were asked to indicate on a 9 point likert scale the degree to which they experienced each of these types of response for four of the distress items rated in Part 1: distress, upset, troubled and grieved. They also were required to indicate the degree to which they feel the same types of distress for Katie (the person in need).
In part 1, the six items related to empathy were averaged to form an empathy index and the eight items related to distress were averaged to form a distress index. In part 2 of the research questionnaire, the four items related to distress for self (direct distress) were averaged to form a direct distress index and the four items related to distress for Katie were averaged to form a distress-for-Katie index.
I would take issue with the authors here because I think items of part 2 are subject to demand characteristics. It would be very easy for the subjects to guess the purpose of the researchers. For example, when the subjects have just read in the instructions that they are required to think from the perspective of Katie (how she will feel in such situation) and later the subjects were required to indicate how much distress for the self vs. for Katie, it is very easy for the subjects to identify the research hypotheses of the researchers and therefore they tend to give a high score on the distress-for-Katie items. The direction of the effect of demand characteristics on the dependent variable is the same as the research hypothesis, which causes potential confounding effects to the conclusion of the researchers.
The authors provided reliability for the four indexes. The Cronbach's alphas for the four indexes-empathy, distress, direct distress and distress- for -Katie are: 0.85, 0.93, 0.97 and 0.95, respectively, which are all beyond traditional rules-of-thumb 0.7.
However, the authors did not provide any evidence regarding validity of the four indexes. Information such as content validity, criterion-related validity, construct validity (convergent and discriminant) validity were not provided.
The authors also used what they called "ancillary measures" in order to investigate the effect of manipulations of perspective taking. They also required the subjects to indicate how interesting and worthwhile the radio program was in order to test the believability of the radio interviews. Since there is only one item associated with each question, no information regarding reliability and validity were provided, which are acceptable.

6. Experimental Procedure
Sixty psychology undergraduates participated the experiment. They were recruited and were told that they were involved in part of a pilot test of new programming ideas for the local university public radio station. According to the authors, they used a "randomized block procedure" to assign 10 women and 10 men to each of three perspective-taking conditions. As we know, the randomized block procedure is utilized only when the researchers suspect that there might be a nuisance variable (blocking variable) that they want to control for because of the potential confounding effect of this nuisance variable on dependent variable. However, the authors did not mention the potential effect of gender on the dependent variable (empathy vs. personal distress) at all, let alone providing any theoretical support to this potential effect.
4 additional data (2 from the imagine-other condition; 1 from each of the other conditions) were excluded because the subjects doubt about the credibility of the manipulation.
In the instructions contained in a folder that the subjects received upon arrival (the experimenter had no idea what specific contents were included), the participants were told to take a particular perspective, dependent on which condition they were assigned to - imagine other, imagine self and objective. In the imagine-other condition, the subjects were told that they should imagine how the person being interviewed feels about what has happened and how it has affected his or her life. In the imagine-self condition, the subjects were told to imagine how they themselves would feel if they were experiencing what has happened to the person being interviewed and how this experience would affect their life. In the objective condition, the subjects were told to be as objective as possible about what has happened to the person interviewed and how it has affected his or her life.
After the instructions, participants were left alone and listened to the radio interview with Katie Bank, whose parents were both killed recently in a car accident and who were trying her best to finish college and take care of her surviving brother and sister.
After listening the tape, the subjects were asked to complete a questionnaire, which were consistent of two parts. The 26 items contained in the first part of questionnaire were designed to measure the emotional reactions of the subjects. The 8 items contained in the second part of the questionnaire were designed to assess the nature of any distress reported in part 1. The subjects also completed some ancillary measure designed to assess the effect of manipulation and authenticity of the radio interview. In the end, the experimenter returned and conducted a full debriefing.

7 Statistical analysis and conclusion
In order to test their first hypothesis that adopting an imagine-other perspective will lead to empathy and adopting an imagine-self perspective will lead to both distress and empathy, the authors conducted a 2 (type of vicarious emotion: empathy, distress) * 3 (perspective-taking condition: objective, imagine other, imagine self) mixed-factor ANOVA with repeated measures on the first factor.
Just I have discussed above, there are two major drawbacks with this data analysis procedure. The first problem is that the authors should not treat the types of vicarious emotion evoked as independent variable. From a theoretical point of view, the types of vicarious emotion evoked should be a result (dependent variable) of the manipulation of another independent variable-perspective taking. From an operationalization (manipulation) point of view, the manipulation of one factor should not tap into the construct domain of the other factor. As we know, the types of vicarious emotion evoked are reflections of the manipulation of perspective taking. Therefore, we can expect that the types of vicarious emotions should be highly correlated to perspective taking. In other words, the manipulation of perspective taking will not only tap into the construct the manipulation ought to influence, it also taps into types of vicarious emotions evoked, the other factor. The contamination of manipulation may call the validity of researchers' conclusion into question. The second problem is that the authors did not provide any information regarding which is the dependent variable-empathy index, distress index or overall emotion index.
For the sake of analysis, let us assume that the authors use empathy index as dependent variable . According to the authors, there was a main effect for types of vicarious emotions on the dependent variable, the empathy index, F (1,57) = 22.21, p <.005. This main effect is so obvious that we do not need to carry out any statistical analysis to test it and it is of no theoretical interests at all. The main effect for perspective taking on empathy index was significant, F(2, 57) = 8.10, p<.001. This main effect is of theoretical interest and it tells us that the perspective the subjects take determines the degree to which they feel empathy. However, in order to test which perspective can evoke more empathy, post-hoc analysis should be conducted. The interaction effect of types of vicarious emotion and perspective taking is significant, F (2,57)= 3.24, p<. 05 However, just I have discussed in question 3, since the types of vicarious emotion should not be treated as an independent variable, it makes no sense to interpret this interaction effect.
The authors also conducted a few analyses to prove their research hypothesis. The authors conducted a planed comparison and the results indicated participants in the imagine conditions reported feeling more empathy than did participants in the objective condition, t (57) = 2.76, p <0.01. In order to test the hypothesis that imagine-self perspective would evoke greater distress than the objective and the imagine-other perspectives, the authors conducted planned contrasts and the results indicated that participants in the imagine-self condition reported feeling more distress than did participants in the objective and imagine-other conditions, t (57) = 4.22, p<.0005, with no reliable difference between the objective and imagine-other conditions, t<1.47. What is more, in the imagine-other condition, empathy was significantly higher (M=4.96) than distress (M= 3.99), t (19) = 3.97, p<0.001. In the imagine-self condition, both empathy (M=5.36) and distress (M=2.50) were high, with no reliable difference, t (19) <0.64. Actually I think these procedures are enough to prove their hypotheses. In other words, the authors do not need to go to the mixed-factor ANOVA analysis, which turns out to be erroneous any way. With this set of analysis, we are able to know that empathy was associated with taking an imagine-other perspective, whereas adopting an imagine-self will lead to both distress and empathy.
Although the authors did report p-level associated with the significant results, they failed to report the p-levels associated with the insignificant results.

In order to test their second hypothesis that the nature of distress experienced when adopting an imagine-other perspective will be other-oriented and the nature of distress experienced when adopting an imagine-self perspective will be a mix of self-oriented and other-oriented distress, the authors conducted a 2 (type of distress: direct vs. Katie) * 3 (perspective-taking condition: objective, imagine other, imagine self) mixed-factor ANOVA with repeated measures on the first factor.
The problems with the utilization of mixed-factor analysis here are almost the same as those in the first analysis so they are omitted here to avoid redundancy. For the sake of analysis, let us assume that the authors use direct distress index as dependent variable . According to the authors, there was a main effect for types of distress on the dependent variable, the direct distress index, F (1,57) = 98.73, p <.0001. This main effect is again so obvious that we do not need to carry out any statistical analysis to test it and it is of no theoretical interests at all. I would say that it would be extremely bizarre if the main effect of types of distress (direct distress vs. distress-for-Katie) on direct distress score did not turn out to be significant. The main effect for perspective taking on direct distress index was significant, F (2, 57) = 9.98, p<.0005. However, just as I have discussed above, the results are potentially confounded with demand characteristics. The interaction effect of types of distress and perspective taking is significant, F (2,57) = 3.76, p< .03. Since it is erroneous for the authors to treat type of distress evoked as independent variable, to interpret this interaction effect is not appropriate and makes no sense.
The authors also conducted a few analyses to prove their research hypothesis. The authors conducted a planed comparison and the results indicated participants in the imagine conditions reported feeling more distress for Katie than did participants in the objective condition, t (57) = 3.16, p <0.01. In order to test the hypothesis that imagine-self perspective would evoke greater direct distress than the objective and the imagine-other perspectives, the authors conducted planned contrasts and the results indicated that participants in the imagine-self condition reported feeling more direct distress than did participants in the objective and imagine-other conditions, t (57) = 4.50, p<0.005, with no reliable difference between the objective and imagine-other conditions, t<1.0. What is more, participants in both the imagine-other and the imagine-self conditions reported more distress for Katie (Ms = 6.96 and 7.44, respectively) than direct distress (Ms= 3.45 and 5.69, respectively), ts (19) = 7.55 and 3.00, respectively, ps >.0005 and .01, respectively and this difference was significantly greater in the former condition than the latter, F (1,38) = 5.76, p < 0.03. It seems that the set of analysis here is enough to prove the research hypotheses. In other words, the authors do not need to go to the mixed-factor ANOVA analysis, which turns out to be erroneous any way. With this set of analysis, we are able to know that empathy was associated with taking an imagine-other perspective, whereas adopting an imagine-self will lead to both distress and empathy.
Although the authors did report p-level associated with the significant results, they failed to report the p-levels associated with the insignificant results.

8. If you were the researcher, how would you improve the study? Be specific.
I would consider the following modifications:
(1) In terms of the basic form of design, I would certainly give up mixed factor ANOVA design. As discussed before, this design form is erroneous in that the authors treated the dependent variable as a within-subject variable. I think one-way ANOVA and independent t-tests will be sufficient to solve both of the research questions, as what the authors did also in their analysis.
(2) I would definitely show which variable was considered as dependent variable instead of letting the reader guess which is the most appropriate one of all the available alternatives (empathy index, distress index and overall emotion reactions related to the first research question; direct distress index and empathic distress index related to the second research question.)
(3) In terms of sampling procedures, I would specify why I first used a "randomized block procedure". Just I have analyzed above, I would tell the readers that I would expect gender as possible nuisance variable and wish to control it. Although later results did not support this assumption, which may lead me to give up "randomized block design", I should provide such analysis in front of the readers rather than make them confused.
(4) Since 4 data were missing, which resulted in unequal sample size, I would address the issue how I dealt with the problem of unequal size.
(5) I would state the alpha level in the introduction of research design, which will make the associated p-value interpretable.
(6) I would check the basic assumptions of ANOVA. Specifically, I would provide information regarding (a) normality of dependent variable (whether the distributions of emotion indexes are normal); (b) homogeneity of errors (to avoid heteroskedasticity); (c) independence of errors (the errors associated with each observation should not be correlated with each other) Actually, if the authors were correct in using mixed-factor design, they should check the sphericity assumption. However, since a mixed factor design is not appropriate here, it makes no sense to check this assumption.
(7) I would provide information regarding the effect size of the results. Although we can judge from F-test that the hypothesized relationships are significant, we have no idea how strong the hypothesized influences are. It is possible that although the factor can produce a significant influence on the dependent variable, the influence is nominal.