HOME JOURNALS CONTACT

Asian Journal of Applied Sciences

Year: 2012 | Volume: 5 | Issue: 6 | Page No.: 371-383
DOI: 10.3923/ajaps.2012.371.383
Sample Size Determination for Repeated Measurements using Three Possible Methods of Analysis, POST, CHANGE and ANCOVA, under Various Covariance Structures for Two Groups
V. Chandrasekaran, G. Gopal and W. Basilea

Abstract: Many authors have studied the determination of sample size in longitudinal studies under different situations. A statistical methodology for determining the minimum sample size required for within-subjects repeated-measures design, applying Hotelling’s T2 analysis under the general covariance structure exists. This procedure is extended to between-subjects repeated measures design when there are two treatment groups, assuming the correlation structure among the repeated measures are autoregressive or compound symmetry. Three possible methods of analysis POST, CHANGE and ANCOVA are used to determine the sample size under the assumption that the correlation structure is compound symmetry. In this paper, we extend the existing method to obtain minimum sample size assuming that the correlation structure is first order autoregressive or a random effects model.

Fulltext PDF Fulltext HTML

How to cite this article
V. Chandrasekaran, G. Gopal and W. Basilea, 2012. Sample Size Determination for Repeated Measurements using Three Possible Methods of Analysis, POST, CHANGE and ANCOVA, under Various Covariance Structures for Two Groups. Asian Journal of Applied Sciences, 5: 371-383.

Keywords: first order auto regression, compound symmetry, summary statistics, Sample size determination and random effects

INTRODUCTION

The determination of sample size is an important step before embarking on a research study. A sample that is too small will not possess the required statistical power to identify significant changes when they truly exist, on the other hand, sample sizes larger than required will increase the cost for the research project. Under these circumstances, it becomes imperative to determine the required minimum sample size prior to start of the study with appropriate statistical significance (Machin and Campbell, 1987).

When the dependent variable is a continuous one and is measured at a single time point, the estimation procedures for sample size determination are well founded. The corresponding situation for repeated measurement experiments has not been completely developed.

The absence of autocorrelation of the error terms and zero co-variance between the explanatory variables and the error terms are the basic assumptions in the classical linear regression model. In case of violation of the above two assumptions, various methods for estimating the parameters have been suggested by Olaomi and Ifederu (2008). Their results show that when auto correlation is at ρ = 0.4 and when the exponential regressor is significantly correlated at 5% with the auto error terms the Ordinary Least Squares is preferred.

Typically, in a longitudinal study, a correlation structure is assumed among the set of repeated measures captured at well-defined junctures within an individual. The correlation structure Compound Symmetry (CS) is commonly used, as it is very simple and assumes that the variance at all time points are the same and the correlation between two time points is the same. Fleiss (1986) argues that Compound Symmetry (CS) correlation structure will rarely be satisfied in a repeated measurements experiment. Snedecor and Cochran (1980) feel that this assumption is too restrictive for many real valued situations.

An alternative correlation structure is the First Order Autoregressive, AR(1), which assumes that the correlation between adjacent time-points is the same and the correlation decreases by the power of the number of time intervals between the measures.

The Random-Effects (RE) model provides the basic modeling framework for much of the clinical trials.

The panel data can be approached from a set of similar Randomized Controlled Trials making use of pooling methods to find the efficacy of pharmaceutical regimens. Further, Random Effect model is one of the meaningful techniques when individual studies results are combined. This is a more realistic model as it allows for variation in the true effect across the studies (Antoine et al., 2007). The use of random effects allows the longitudinal data to work with Autoregressive (AR) process (Nath and Bhattacharjee, 2011).

It is useful for analyzing longitudinal clinical trials in which there is a sizable number of missing observations, either due to missed visits, loss to follow-up, or death (Albert, 1999). Ware (1985) while discussing about the goodness of fit, mentions that the Random Effects (RE) and Auto Regressive (AR) models introduce strong assumptions about covariance patterns, especially when the number of observations on each subject is large. Hence, in this study we derive an expression for the sample size requirements for repeated measurements experiments under the auto regressive model of order 1 (AR(1)) and Random Effects (RE) model.

Three different formulations of interferon beta (IFNβ) an immunomodulatory agent was compared using random effects in a longitudinal study (Nikfar et al., 2010). It has been shown that the asymptotic relative efficiency is more compared to other correlation structures under the assumption of auto regressive correlation structure (Akanda et al., 2005).

For sample size calculation in longitudinal studies, two additional quantities are required, namely (1) the number of repeated observation per subject and (2) the correlation among repeated observations which can be estimated from a previous longitudinal study. In this study, we consider the case where the subjects are allocated at random to two treatment groups and measurements on every individual is taken repeatedly at well-defined time points during the course of study.

There are various summary measures in longitudinal study settings that help to bridge the gap between complex statistical methods and inadequate statistical usage. They are Mean, Area Under Curve (AUC), Maximum (Minimum) value, Time to Maximum (Minimum), Regression co-efficient and change between first and last value (Matthews et al., 1990) and also summarizing data through a piecewise linear growth curve model (Chandrasekaran et al., 2005). Though the choices of summary measures are wide, the main objective in many clinical trials is to determine the average response to treatment over time. This provides three possible methods of analysis as identified by Frison and Pocock (1992). They are:

Post-treatment means (POST): A simple analysis using the mean for each patient’s post-treatment measurements as the summary measure
Mean changes (CHANGE): A simple analysis of each patient’s difference between mean of post-treatment measurements and mean of baseline measurements, the latter often consisting of just a single baseline value per patient

Analysis of covariance (ANCOVA): Between-patient variations in baseline measurements are taken into account, by using the mean baseline measurement for each patient as covariate in a linear model for treatment comparison of post-treatment means.

As observed by Frison and Pocock (1992), ANCOVA has a smaller variance as compared to POST and CHANGE. Hence, in situations where the main objective is to determine the average response to a particular treatment over time, in comparison to another treatment or placebo, ANCOVA is superior to both, ignoring the pre-treatment data and simply subtracting the post-treatment value from the baseline value for each individual. Also, as pointed out by Senn (1994), ANCOVA is the best of the three methods considered and the only one which produces unbiased estimators in the presence of chance observed imbalance. It does this whether or not the baselines are subject to measurement error. Hence, in this study, sample size determination has been considered in situations where ANCOVA is the method of analysis to summarize the repeated measures data in order to determine the response of a particular treatment over time.

A SIMPLE MODEL AND AN EXAMPLE

In the sample size determination for the model for two treatment groups, let us assume that a randomized clinical trial has two treatment groups (i = A or B) with ni subjects per group and suppose all patients have p pre-treatment and q post-treatment visits. The quantitative measurement y is observed at every visit for every patient. We adopt the simple model:

where, i = A, B; s = 1,2, …,ni; t = 1,2, …, p, p+1, …,p+q; μit is the underlying true mean response for treatment i at time t. As a result of randomization we can assume μAt = μBt for the pre-treatment visit and eist is the individual sth patients ‘error’ around the mean response μit.

Let ∑ = {σtl } be the covariance matrix for all pairs of measurement times t, l. For simplicity we assume this is the same for both the treatments. It is helpful to define three sub matrices:

and:

That is:

The variance formulate for three approaches are:

Post-treatment means (POST):

Mean change (CHANGE):

Analysis of covariance (ANCOVA):

For a randomized clinical trial all three estimates of the mean treatment difference have the same expected value (APost-BPost). It can be seen that among the above variance formulae, the common sample size adjustment is:

Let the alternative hypothesis be Apost-BPost = δ. In the conventional approach to power calculation, we define α and β as the type I and type II errors for ANCOVA. It is convenient to assume that sample sizes are sufficiently large and that the normal approximation to the t- distribution can be applied. In that case, for two equals sized treatment groups of size n:

(1)

where, post, mix and pre are the respective means of the q2, qxp and p2 components of the three sub-matrices ∑post, mix and ∑pre and f(α,β) = [z(α/2)+z(β)]2, z(y) is the standardized normal deviate exceeded with probability y and it is to be noted that σ is the within-patients standard deviation. The expression for the sample size formula when using the correlation matrix is:

(2)

where post, mix and pre are the respective means of the three correlation sub-matrices of q2, qxp and p2 components. Using the above expression, Frison and Pocock (1992) derived the sample size under the CS model to be:

where, the mean of the correlation matrix Ψpre is:

Mean of the correlation matrix Ψpost is:

and, Mean of the correlation matrix Ψmix = ρ.

On substitution of these quantities in (2), one gets the required sample size n, under the assumption of CS covariance structure. The same method is adopted for the determination of the sample size under the random effects first order auto regressive and random effects models.

The covariance matrix for random effects model (RE): Let yij = αij+uij, where yij is the observation on the ith individual on occasion j, αi is the individual effect, βj is the occasion effect and uij is random error. However, the αi are randomly selected from a distribution of individual effects and constitute a random effect. Then the model becomes:

and the generalization of this in matrix form is:

Here, the bi are random effects, yielding a combined contribution to yi via the design matrix Zi and β is a vector of fixed effects. Such models are sometimes called two-stage models because the ei, refer to within-individual variation (stage 1) and the bi refer to between-individual variation (stage 2). Models with this general form have now become popular. An early expositions to this type of modeling is form the work of Laird and Ware (1982). We can generalize this supposing that E(bi) = 0, E(ei) = 0, var(bi) = Bi, var(ei) = Ei and that bi and ei are independent.

Then:

where, bi and ei are multivariate normal.

The correlation matrix under the Random effects model is:

where, the matrix Z has (p+q) rows and I columns of treatment groups (two) and φ = i Xi is the dispersion matrix. For example, the covariance matrix for 7x7 (7 repeated value) the matrix of ∑pre are:

The mean of ∑pre are:

Matrix of ∑mix:

The mean of ∑mix are:

Matrix of ∑post:

Mean of ∑post is:

The general form of σ2t = φ11+t φ12+t(φ12+i φ22)+σ2; i = 1, 2,…, p+q and σij = (φ11+ i φ12)+ j (φ12+i φ22); i, j = 1, 2, …, p+q; i≠j.

On substitution of these quantities in (1) we get the sample size n, under the assumption of random effects.

Correlation structure for first order autoregressive model (AR(1): Assume that the correlation matrix has a first-order autoregressive structure, which is consistent with the observation that the correlations exhibit the ρ, ρ2, ρ3 pattern required under the AR(1) structure. From the covariance matrix, the correlation matrix is obtained by removing the constant variance, σ2. Under the auto regressive model of order 1 for pre treatment p and post treatment q, the correlation matrix is given as:

Hence, we arrive at the following:

Mean of the correlation matrix Ψpre is:

Mean of the correlation matrix Ψpost is:

Mean of the correlation matrix Ψmix is:

Substitution of these quantities in (2), we get the required sample size n, under the assumption of first order auto regressive covariance structure.

RESULT AND DISCUSSION

Table 1-4 provide the minimum number of subjects required in each treatment group, assuming equal allocation, under the CS and AR(1) correlation model, for a one sided test at α and β for Δ = .1 (.1).1, ρ = 0 (.1).9 for p (pre-treatment) = 1 and 3 and q (post-treatment) = 3 (2)9. The greatest difficulty lies in choosing an appropriate values for Δ and σ but this problem applies to any power calculation with quantitative data expressed as means. Note that σ here is the between-patients standard deviation.

Table 1: Sample required in each treatment group, assuming equal allocation, for the AR(1) correlation structure (pre-treatment p = 1), for a two-sided test at α = 0.05 and β = 0.20 for selected values of p (No. of post-treatment repeated), ρ and Δ = (σ/δ)

Table 2: Sample required in each treatment group, assuming equal allocation, for the AR(1) correlation structure (pre-treatment p = 0), for a two-sided test at α = 0.05 and β = 0.20 for selected values of p (No. of post-treatment repeated), ρ and Δ = (σ/δ)

The sample size under covariance structure in CS is less when ρ is large. In AR (1) the sample size increases with increase in post-treatment. Besides, when Δ = 0.2 irrespective of whether the pre-treatment is taken, the sample size generally tends to start from around 200, leave alone the number of post-treatment visits (q); also when Δ = 0.6 the sample size usually begins with 25. When ρ = 0.3 with the same Δ = 0.2 and p = 3, the sample size decreases in both (AR(1) and CS models. When p = 3 and Δ = 0.6 in AR(1) with ρ = 0.3 and ρ = 0.9 in CS model, the sample size decreases.

Table 3: Sample required in each treatment group, assuming equal allocation, for the CS correlation structure (pre-treatment p = 1), for a two-sided test at α 0.05 and β = 0.20 for selected values of p (No. of post-treatment repeated), ρ and Δ = (σ/δ)

Frison and Pocock (1992) have observed that in simple designs where there is only one post treatment reading and no pre-treatment reading, the POST analysis requires around n = 100 in each group. Increasing the number of post-treatment reading has some effect on decreasing n but with no use of pre-treatment reading n remains at around 75 even with q = 8. The CHANGE analysis with p = 1 pre-treatment (a two-sample t-test comparing mean change) leads to a required n around 60 for q = 1 post-treatment measurement, which can be reduced to n<40 if q is increased to 4 or more post-treatment measurements.

Table 4: Sample required in each treatment group, assuming equal allocation, for the CS correlation structure (pre-treatment p = 0), for a two-sided test at α 0.05 and β = 0.20 for selected values of p (No. of post-treatment repeated), ρ and Δ = (σ/δ)

The superiority of ANCOVA is illustrated by a further fall in sample size. For instance, with p = 1 and q>4 we can reduce n to below 30 if ANCOVA is used.

It is often possible to have more than one pre-treatment visit in a repeated measures design (all pre-treatment visits occurring before randomization) and here we consider the improved efficiency for both ANCOVA and CHANGE. Of course the time lapses between pre-treatment measurements may affect the correlation structure. The more widespread use of such power calculation formulae may add greatly to a sensible choice of n, p and q in repeated measures designs. While the compound symmetry assumption is unlikely to be true, it is often not wildly off the mark and so use of these simple formulae should give an adequate estimate of order of magnitude for n. A plausible value of ρ in the range 0.5 to 0.75 should usually be appropriate. Perhaps the greatest difficulty lies in choosing an appropriate value for δ/σ but this problem applies to any power calculation with quantitative data expressed as means. Note that σ here is the between-patient standard deviation, although the ANCOVA and CHANGE analyses are within-patient in essence (Frison and Pocock, 1992).

In most practical circumstances the bias of ANCOVA should be small. Having more than one pre-treatment measurement will reduce this bias further, since it is approximately proportional to 1/p. In many repeated measures trials the analysis is complicated by missing values and patient withdrawals. In summary statistics approach it is relatively easy to define sensible criteria for coping with occasional missing values. However, patient withdrawals provoke greater problems since they are often associated with informative censoring (Wu and Bailey, 1989).

In the summary statistics approach to analysis of repeated measures we confirm the well known result that ANCOVA is superior to both ignoring pre-treatment reading and simply subtracting pre-treatment readings for each individual (Fleiss, 1986).

It is noted that the parameters will not be unbiased when there is correlation among responses. In this respect, Nugraha (2011) has proved that the MLE on mixed logit model is precise in estimating the parameters in a discrete model like GEE. When there are many parameters in the model, the concept of subsetting has been used to eliminate the parameters which are redundant and close to zero (Ojo et al., 2008).

Maximum likelihood estimates of the covariance parameters, listed in the order; Φ11, Φ21, Φ22 and σ2. Note that neither Φ21 nor Φ22 is statistically different from zero. If Φ21 and Φ22 are both zero, then the random effects covariance structure ZΦZ’+I reduces to the compound symmetry structure (Dixon, 1992).

CONCLUSION

It has been observed that for repeated measures design, ANCOVA is the best of the three summary methods considered-POST, CHANGE and ANCOVA. It gives the most unbiased estimator in the presence of chance observed imbalance. In many longitudinal studies, the covariance structure either assumes the Auto Regressive or Random Effect (RE) structure. Hence, the procedure to estimate sample size for such kind of longitudinal studies with the covariance structure of the outcome variables as AR or RE structure is very much useful to the practitioners.

REFERENCES

  • Akanda, M.A.S., M. Khanam and M.A. Islam, 2005. Comparison of goodness-of-fit tests for GEE modeling with binary responses to diabetes mellitus. J. Applied Sci., 5: 1214-1218.
    CrossRef    Direct Link    


  • Albert, P.S., 1999. Tutorial in Biostatistics longitudinal data analysis (repeated measures) in clinical trials. Stat. Med., 18: 1707-1732.


  • Antoine, R., A. Sahai and P. Chami, 2007. Meta analysis of panel data generated by a set of randomized controlled trials. J. Applied Sci., 7: 951-957.
    CrossRef    Direct Link    


  • Chandrasekaran, V., G. Gopal and A. Thomas, 2005. Summarizing data through a piecewise linear growth curve model. Stat. Med., 24: 1139-1151.
    CrossRef    Direct Link    


  • Dixon, W.J., 1992. Statistical Software Manual: To Accompany. University of California Press, Oxford, UK., Pages: 1504


  • Fleiss, J.L., 1986. The Design and Analysis of Clinical Experiments. Wiley, New York


  • Frison, L. and S.J. Pocock, 1992. Repeated measures in clinical trials: Analysis using mean summary statistics and its implication for design. Stat. Med., 11: 1685-1704.
    PubMed    


  • Laird, N.M. and J.H. Ware, 1982. Random-effects models for longitudinal data. Biometrics, 38: 963-974.
    CrossRef    Direct Link    


  • Machin, D. and M.J. Campbell, 1987. Statistical Tables for the Design of Clinical Trials. Blackwell Scientific Publications, UK


  • Matthews, J.N.S., D.G. Altman, M.J. Campbell and P. Royston, 1990. Analysis of serial measurements in medical research. Brit. Med. J., 300: 230-235.
    PubMed    Direct Link    


  • Nath, D.C. and A. Bhattacharjee, 2011. A bayesian approach for autoregressive models in longitudinal data analysis: An application to type 2 diabetes drug comparison. Asian J. Applied Sci., 4: 640-648.
    CrossRef    


  • Nikfar, S., R. Rahimi and M. Abdollahi, 2010. A systematic review on the efficacy of interferon β in relapsing remitting multiple sclerosis: Comparison of different formulations. Int. J. Pharmacol., 6: 638-644.
    CrossRef    Direct Link    


  • Nugraha, J., 2011. Mixed logit model on multivariate binary response using maximum likelihood estimator and generalized estimating equations. Asian J. Math. Stat., 4: 113-127.
    CrossRef    Direct Link    


  • Ojo, J.F., T.O. Olatayo and O.O. Alabi, 2008. Forecasting in subsets autoregressive models and autoprojective models. Asian J. Scientific Res., 1: 481-491.
    CrossRef    Direct Link    


  • Olaomi, J.O. and A. Ifederu, 2008. Understanding estimators of linear regression model with AR(1) error which are correlated with exponential regressor. Asian J. Math. Stat., 1: 14-23.
    CrossRef    Direct Link    


  • Senn, S., 1994. Letter to the editor. Stat. Med., 13: 197-198.


  • Snedecor, G.W. and W.G. Cochran, 1980. Statistical Methods. 7th Edn., Iowa State University Press, Iowa, USA., ISBN-10: 0813815606, Pages: 507
    Direct Link    


  • Ware, J.H., 1985. Linear models for the analysis of longitudinal studies. Am. Stat., 39: 95-101.


  • Wu, M.C. and K.R. Bailey, 1989. Estimation and comparison of changes in the presence of informative right censoring: Conditional linear model. Biometrics, 45: 939-955.
    PubMed    

  • © Science Alert. All Rights Reserved