Publications



This web page is outdated and you should see an up-to-date list of publications here: http://nd.edu/~kkelley/site/Publications.html

Contact me via
email if you have would like a reprint, preprint, or have questions/comments
about any of my work (including what is currently under review or in the works) .


Peer Reviewed Methodological Publications

Kelley, K. & Maxwell, S.E. (in press). Delineating the average rate of change.
        Journal of Educational and Behavioral Statistics.
        Preprint; Abstract; BibTeX Citation


Yuan, K-H., Kouros, C. D., & Kelley, K. (in press). Diagnosis for covariance structure
          models by analyzing the path. Structural Equation Modeling.

Kelley, K., Lai, K., & Wu, P-J. (2008). Using R for data analysis: A best practice for
          research. In J. Osbourne (Ed.), Best Practices in Quantitative Methods (pp. 535--572).
          Newbury Park, CA: Sage.


Kelley, K. & Maxwell, S. E. (2008). Power and accuracy for omnibus and targeted
          effects: Issues of sample size planning with applications to multiple regression. In P.
          Alasuuta, J. Brannen, & L. Bickman (Eds.), Handbook of Social Research Methods
         
(pp. 166--192)Newbury Park, CA: Sage.

Maxwell, S. E., Kelley, K., & Rausch, J. R. (2008). Sample size planning for statistical
        power and accuracy in parameter estimation. Annual Review of Psychology, 59, 537-563.
        Full text

Kelley, K. (2007). Sample size planning for the coefficient of variation: Accuracy in parameter
          estimation via narrow confidence intervals.
 Behavior Research Methods, 39, 755-766.
   
      Full text; Abstract; BibTeX Citation

Kelley, K. (2007). Methods for the Behavioral, Educational, and Social Science: An R Package
         Behavior Research Methods, 39, 979-984.
         Full text; Abstract; BibTeX Citation


Kelley, K., Light, R. L, & Agarwal, R. (2007). Trended cosinor change model for analyzing
         hemodynamic rhythm patterns in hemodialysis patients. Hypertension, 50(1), 143-150.
         Full text; Abstract; BibTeX Citation

Kelley, K. (2007). Confidence intervals for standardized effect sizes: Theory, application, and implementation.
         Journal of Statistical Software, 20(8), 1-24.
         Full text; Abstract; BibTeX Citation

Kelley, K., & Rausch, J. R. (2006). Sample size planning for the standardized mean difference: Accuracy
          in parameter estimation via narrow confidence intervals. Psychological Methods, 11(4), 363--385.
          Full text; Abstract; BibTeX Citation; Figure 3 as it should be (The distinction between the power curves was lost during production).

Kelley, K. (2005). The effects of nonnormal distributions on confidence intervals around the standardized mean difference:
          Bootstrap and parametric confidence intervals. Educational and Psychological Measurement65(1), 51-69.
          Full text; Abstract; BibTeX Citation
.

Kelley, K. (2004). Assessing the assumption of symmetric proximity measures in the context of multidimensional
          scaling. Journal of Applied Measurement, 5(4), 419-429.
          
Full text; Abstract; BibTeX Citation.

Kelley, K. & Maxwell, S.E. (2003). Sample size for multiple regression: Obtaining regression coefficients that are accurate,
          not simply significant. Psychological Methods8(3), 305-321.
          
Full text; Abstract; BibTeX Citation.

Kelley, K., Maxwell, S.E., & Rausch, J.R. (2003). Obtaining power or obtaining precision: Delineating methods of
          sample-size planning. Evaluation and the Health Professions26(3), 258--287.
           
Full text; Abstract; BibTeX Citation.

Rausch, J. R., Maxwell, S.E., & Kelley, K. (2003). 
Analytic methods for questions pertaining to a randomized pretest,
          posttest, follow-up design. Journal of Clinical Child and Adolescent Psychology32(3), 467--486.
         
Full text; Abstract; BibTeX Citation.


Selected Peer Reviewed Substantive Publications (Published or In Press)

Kelley, K., Aricak, O. T., Light, R. L, & Agarwal, R. (2007). Proteinuria is a determinant of quality of life
          in diabetic nephropathy: Modeling lagged effects with path analysis. American Journal of Nephrology, 27(5), 488-494.
          Full text; Abstract; BibTeX Citation

Anderson, J. A. Wright, E. R, Kooreman, H. E. & Kelley, K. (in press). Patterns of
          clinical change among young people served in a system of care: The impact of age, ethnicity,
          referral source, and sex. Journal of Emotional and Behavioral Disorders.

Stright, A. D., Gallagher, K. C., & Kelley, K. (in press). Infant temperament moderates
         relations between maternal parenting in early childhood and children's adjustment in first
         grade, Child Development.


Software: Methods for the Behavioral, Educational, and Social Sciences, the 
MBESS R Package

Kelley, K. (2006; 2007). MBESS [computer software and manual].
          Retrievable from http://cran.r-project.org/src/contrib/Descriptions/MBESS.html.

         
          MBESS R Package Help File
         
          See the Journal of Statistical Software (2007) and Behavior Research Methods (in press) articles
          where MBESS was peer reviewed.
Although MBESS was originally an acronym for Methods for the Behavioral,
          Educational, and Social Sciences, it quickly became more general and used beyond the behavioral, educational,
          and social sciences. Therefore, MBESS is now an simply













Abstracts for Published Methodological Works


Proteinuria is a determinant of quality of life in diabetic nephropathy: Modeling lagged effects with path analysis
(Kelley, Aricak, Light, & Agarwal, 2007)


Background: Diabetic nephropathy with overt proteinuria often progresses relentlessly to end-stage renal disease (ESRD). Material and Methods: To answer the question whether it is impaired glomerular filtration rate (GFR) or its precursor proteinuria which is more related with multiple domains of health related quality of life (HRQOL), we measured GFR and proteinuria in 44 patients with type 2 diabetes and overt nephropathy and repeated the measurements after 4 months. 38 patients with ESRD due to diabetic nephropathy served as a control group. We used path analysis to examine the association of baseline proteinuria and GFR with baseline and subsequent HRQOL scales. Results: Compared to patients with ESRD, patients with non-dialysis CKD had Kidney Disease Burden (KDB) that was, on a sale from 0 to 100, 19.8 better (95% CI 6.9 – 32.8) (p=0.003). Mental component score (MCS) did not differ and physical component score (PCS) was worse in non-dialysis CKD patients by 8.5 (p<0.001). Proteinuria at baseline was a predictor of PCS, MCS and KDB score at 4 months, suggesting a lagged effect on proteinuria on HRQOL after controlling for the autoregressive effects. GFR did not have a significant impact on HRQOL. One log unit increase in proteinuria was associated with 3.8 (p=0.011) fall in PCS, 3.3 (p=0.043) fall in MCS and 10.6 (p=0.006) fall in KDB. Conclusion: In patients with advanced diabetic nephropathy, we found that proteinuria has a lagged and profound effect on multiple domains of HRQOL.


Trended cosinor change model for analyzing hemodynamic rhythm patterns in hemodialysis patients (Kelley, Light, & Agarwal, 2007).

To describe circadian blood pressure (BP) patterns and linear interdialytic changes, a model was developed to describe
simultaneously both the straight line change and oscillatory variation in BP and heart rate over an interdialytic interval in 
hemodialysis patients. Using this trended cosinor model, we simultaneously compared the impact of mean level of BP, 
linear changes over the interdialytic interval, and oscillatory changes in BP and its relationship with antihypertensive drug 
use. Neither a straight-line change model nor the cosinor model adequately described the BP variability in 12 750 BP 
measurements from 136 chronic stable hemodialysis patients. A combination of the 2 models that allowed for the oscillatory 
rhythmic pattern in BP variation to have an upward trend in the interdialytic period most accurately described the data. Time 
elapsed since the end of dialysis demonstrated a better model fit compared with the less meaningful clock time. More 
antihypertensive medication use was associated with increasing mean systolic, diastolic, and pulse pressure. Although the 
rate of change was blunted with increasing antihypertensive drug use, the impact on oscillatory change was U-shaped for 
systolic BP, direct for diastolic BP, and inverse for pulse pressure. A trended cosinor model better describes the change in 
BP in the interdialytic interval in hemodialysis patients, especially when time elapsed is measured from the end of dialysis. 
Antihypertensive drugs, though associated with higher average BP, are associated with blunted rate of change in BP over time.



Confidence intervals for standardized effect sizes: Theory, application, and implementation (Kelley, 2007).

The behavioral, educational, and social sciences are undergoing a paradigmatic shift in methodology, from disciplines 
that focus on the dichotomous outcome of null hypothesis significance tests to disciplines that report and interpret 
effect sizes and their corresponding confidence intervals. Due to the arbitrariness of many measurement instruments 
used in the behavioral, educational, and social sciences, some of the most widely reported effect sizes are
standardized. Although forming confidence intervals for standardized effect sizes can be very beneficial, such confidence
interval procedures are generally difficult to implement because they depend on noncentral  t, F, and chi-square distributions. 
At present, no main-stream statistical package provides exact confidence intervals for standardized effects without the use 
of specialized programming scripts. Methods for the Behavioral, Educational, and Social Sciences (MBESS) is an R package 
that has routines for calculating confidence intervals for noncentral t, F, and chi-square distributions, which are then used in
the calculation of exact confidence intervals for standardized effect sizes by using the confidence interval transformation and
inversion principles. The present article discusses the way in which confidence intervals are formed for standardized effect
sizes and illustrates how such confidence intervals can be easily formed using MBESS in R.


Sample size planning for the standardized mean difference: Accuracy in
parameter estimation via narrow confidence intervals (Kelley & Rausch, 2006).

Methods for planning sample size (SS) for the standardized mean difference so that a narrow confidence interval (CI)
can be obtained via the accuracy in parameter estimation (AIPE) approach are developed. One method plans SS so 
that the expected width of the CI is sufficiently narrow. A modification adjusts the SS so that the obtained CI is no wider
than desired with some specified degree of certainty (e.g., 99% certain the 95% CI will be no wider than
omega). The
rationale of the AIPE approach to SS planning is given, as is a discussion of the analytic approach to CI formation for
the population standardized mean difference. Tables with values of necessary SS are provided. The freely available
Methods for the Behavioral, Educational, and Social Sciences (K. Kelley, 2006a) R (R Development Core Team, 2006)
software package easily implements the methods discussed.

The Effects of Nonnormal Distributions on Confidence Intervals Around the Standardized 
Mean Difference: Bootstrap and Parametric Confidence Intervals  (Kelley, 2005)

The standardized group mean difference, Cohen’s d, is among the most commonly used and intuitively appealing
effect sizes for group comparisons. However, reporting this point estimate alone does not reflect the extent to which
sampling error may have led to an obtained value. A confidence interval expresses the uncertainty that exists between
d and the population value, {delta}, it represents. A set of Monte Carlo simulations was conducted to examine the integrity
of a noncentral approach analogous to that given by Steiger and Fouladi, as well as two bootstrap approaches in
situations in which the normality assumption is violated. Because d is positively biased, a procedure given by Hedges
and Olkin is outlined, such that an unbiased estimate of {delta} can be obtained. The bias-corrected and accelerated bootstrap
confidence interval using the unbiased estimate of {delta} is proposed and recommended for general use, especially in cases
in which the assumption of normality may be violated.

Assessing the assumption of symmetric proximity measures in the context of
multidimensional scaling (Kelley, 2004)

Applications of multidimensional scaling often make the assumption of symmetry for the population matrix of proximity
measures. Although the likelihood of such an assumption holding true varies from one area of research to another, formal
assessment of such an assumption has received little attention. The present article develops a nonparametric procedure
that can be used in a confirmatory fashion or in an exploratory fashion in order to probabilistically assess the assumption
of population symmetry for the proximity measures in a multidimensional scaling context. The proposed procedure makes
use of the bootstrap technique and alleviates the assumptions of parametric statistical procedures. Computer code for R
and S-Plus is included in an appendix.


Sample size for multiple regression: Obtaining regression coefficients that are 
accurate, not simply significant (Kelley & Maxwell, 2003)

An approach to sample size planning for multiple regression is presented that emphasizes accuracy in parameter
estimation (AIPE). The AIPE approach yields precise estimates of population parameters by providing necessary
sample sizes in order for the likely widths of confidence intervals to be sufficiently narrow. One AIPE method yields
a sample size such that the expected width of the confidence interval around the standardized population regression
coefficient is equal to the width specified. An enhanced formulation ensures, with some stipulated probability, that the
width of the confidence interval will be no larger than the width specified. Issues involving standardized regression
coefficients and random predictors are discussed, as are the philosophical differences between AIPE and the power
analytic approaches to sample size planning.

Obtaining Power or Obtaining Precision: Delineating Methods of Sample-Size 
Planning
Kelley, Maxwell, & Rausch (2003)

Sample-size planning historically has been approached from a power analytic perspective in order to have some
reasonable probability of correctly rejecting the null hypothesis. Another approach that is not as well-known is one
that emphasizes accuracy in parameter estimation (AIPE). From the AIPE perspective, sample size is chosen such
that the expected width of a confidence interval will be sufficiently narrow. The rationales of both approaches are delineated
and two procedures are given for estimating the sample size from the AIPE perspective for a two-group mean comparison.
One method yields the required sample size, such that the expected width of the computed confidence interval will be the 
value specified. A modification allows for a defined degree of probabilistic assurance that the width of the computed
confidence interval will be no larger than specified. The authors emphasize that the correct conceptualization of sample-size
planning depends on the research questions and particular goals of the study.


Analytic methods for questions pertaining to a randomized pretest, posttest, 
follow-up design (Rausch, Maxwell, & Kelley, 2003)

Delineates 5 questions regarding group differences that are likely to be of interest to researchers within the framework
of a randomized pretest, posttest, follow-up (PPF) design. These 5 questions are examined from a methodological
perspective by comparing and discussing analysis of variance (ANOVA) and analysis of covariance (ANCOVA) methods
and briefly discussing hierarchical linear modeling (HLM) for these questions. This article demonstrates that the pretest
should be utilized as a covariate in the model rather than as a level of the time factor or as part of the dependent variable
within the analysis of group differences. It is also demonstrated that how the posttest and the follow-up are utilized in the
analysis of group differences is determined by the specific question asked by the researcher.