On Standardizing within-person effects: Potential problems of global standardization

By Lijuan Wang, Qian Zhang, Scott E Maxwell and C S Bergeman

Annotated Paper

Abstract

Person-mean centering has been recommended for disaggregating between-person and within-person effects when modeling time-varying predictors. Multilevel modeling textbooks recommended global standardization for standardizing fixed effects. An aim of this study is to evaluate whether and when person-mean centering followed by global standardization can accurately estimate fixed-effects within-person relations (the estimand of interest in this study) in multilevel modeling. We analytically derived that global standardization generally yields inconsistent (asymptotically biased) estimates for the estimand when between-person differences in within-person standard deviations exist and the average within-person relation is nonzero. Alternatively, a person-mean-SD standardization (P-S) approach yields consistent estimates. Our simulation results further revealed (1) how misleading the results from global standardization were under various circumstances and (2) the P-S approach had accurate estimates and satisfactory coverage rates of fixed-effects within-person relations when the number of occasions is 30 or more (in many conditions, performance was satisfactory with 10 or 20 occasions). A daily diary data example, focused on emotional complexity, was used to empirically illustrate the approaches. Researchers should choose standardization approaches based on theoretical considerations and should clearly describe the purpose and procedure of standardization in research articles.

Primer

Issues about raw units

  • The interpretation of a raw within-person coefficient depends on the scale of two time-varying variables (so not meaningful if the scale itself is not meaningful)

  • Raw coefficients are generally not good effect size measures because they are not scale/unit invariant

  • Raw coefficient usually is less useful for comparing the strengths of within-person relations across different pairs of time-varying variables.

Formulation

Set-up

Given variable X X and YY, the true within-person correlation is

ρw,i=Ei[(XitμXi)(YitμYi)i]σXiσYiμρw=E[ρw,i]\begin{align*} \rho_{w,i} &=\frac{E_i\left[(X_{it}-\mu_{X_i})(Y_{it}-\mu_{Y_i})|i\right]}{\sigma_{X_i}\sigma_{Y_i}} \\ \mu_{\rho w} &= E[\rho_{w,i}] \end{align*}
  • μXi&μYi\mu_{X_i} \& \mu_{Y_i}: Population within-person means

  • σXi&σYi\sigma_{X_i} \& \sigma_{Y_i}: Individual within-person standard deviations

  • μρw\mu_{\rho w}: is the average within-person correlation, generally the true parameter of interests.

Consistent Estimators

Let rw,i=t(xtixi)(ytiyi)Tsxisyir_{w,i} =\frac{\sum_t(x_{ti}-x_i)(y_{ti}-y_i)}{T*s_{xi}*s_{yi}} be the person-specific correlation estimators using the sample mean and within-person sample standard deviation, then generally we have the following multilevel models:

rw,i=ρw,i+eitρw,i=μρw+u1i\begin{align*} r_{w,i} &= \rho_{w,i} + e_{it} \\ \rho_{w,i} &= \mu_{\rho w} + u_{1i} \end{align*}

So the estimator is a combination of true correlation and error, and true correlation is a combination of population mean and then person-specific variations.

Person-mean centering followed by global standardization formulation

The basic multilevel model (C1) is

yit=γ0iC1+γ1iC1(xitxi)+eitC1γ0iC1=γ00C1+γ01C1xi+u0iC1γ1iC1=γ10C1+u1iC1\begin{align*} y_{it} &= \gamma_{0i}^{C1} + \gamma_{1i}^{C1}(x_{it}-x_i) + e_{it}^{C1} \\ \gamma_{0i}^{C1} &= \gamma_{00}^{C1} + \gamma_{01}^{C1}x_i + u_{0i}^{C1} \\ \gamma_{1i}^{C1} &= \gamma_{10}^{C1} + u_{1i}^{C1} \end{align*}
  • Without allowing variation to within-person effect u1iC1u_{1i}^{C1}, the estimation of within-person effect yielded inflated Type 1 error (false positive).

  • Including the term when there are no between-person variations of the within-person effect, the model has a higher non-convergence rate.

  • The person-mean centering is also conducted for the outcome variable implicitly. We can see that if we move the γ0iC1\gamma_{0i}^{C1} to the other side.

A typical multilevel model with explicitly mean centering (C2): Notice the outcome variable is mean-centered with a sample instead of a latent estimate, and PC annotes as person-specific centering.

yitPC=γ1iC2xitPC+eitC2γ1iC2=γ10C2+u1iC2yitPC=yityixitPC=xityi\begin{align*} y_{it}^{PC} &= \gamma_{1i}^{C2}*x_{it}^{PC}+e_{it}^{C2} \\ \gamma_{1i}^{C2} &= \gamma_{10}^{C2} + u_{1i}^{C2}\\ y_{it}^{PC} &=y_{it} - y_i\\ x_{it}^{PC} &=x_{it} - y_i\\ \end{align*}
  • Empirically, the estimated within-person mean γ10C2\gamma_{10}^{C2} often has high reliability even when the number of time points is as low as 5 or 10.

The estimators are then standardized using global standard deviations. We have

  • G1 method: γ^10G1=γ^10C1scxscy\hat{\gamma}_{10}^{G1} = \hat{\gamma}_{10}^{C1}*\frac{s_{cx}}{s_{cy}} (using the person-mean centered IV and DVs)

  • G2 method: γ^10G2=γ^10C1scxsy\hat{\gamma}_{10}^{G2} = \hat{\gamma}_{10}^{C1}*\frac{s_{cx}}{s_{y}}(using the person-mean centered IV and raw DV sd)

  • G3 method: γ^10G3=γ^10C2scxscy\hat{\gamma}_{10}^{G3} = \hat{\gamma}_{10}^{C2}*\frac{s_{cx}}{s_{cy}}(using the person-mean centered IV and DVs with explicitly mean centering)

The issue is that between-person difference (variance) in within-person variance is involved in all three estimators, so they are all inconsistent estimators of μρw\mu_{\rho w}

  • In G1 and G3, the estimator is a linear combination of the average with-person SD and variance in within-person SDs

  • In G2, the estimator is a linear combination of the average of within-person SDs, variance in within-person SDs, and variance in within-person means

Within-person standardization

P-S method: the standard mixed-effect formulation is:

yitPS=γ1iPSxitPS+eitPSγ1iPS=γ10+u1iPSyitPS=(yityi)syisyi=t(yityi)2Ti1\begin{align*} y_{it}^{PS} &= \gamma_{1i}^{PS}x_{it}^{PS}+e_{it}^{PS} \\ \gamma_{1i}^{PS} &= \gamma_{10} + u_{1i}^{PS}\\ y_{it}^{PS} &= \frac{(y_{it} - y_i)}{s_{yi}}\\ s_{yi} &= \sqrt{\frac{\sum_t (y_{it}-y_i)^2}{T_i-1}} \end{align*}
  • yiy_i is the sample mean

  • syis_{yi} is the sample standard deviation

EB method: An alternative within-person standardization is to first estimate γ1iC1\gamma_{1i}^{C1} with Bayesian regression, then standardize at the person level using the observed individual within-person standard deviation.

Derivation Result

Mean: Let μXi=E[Xiti]\mu_{Xi}=E[X_{it}|i]. The between-person population covariance matrix is

[σμX2σμX,μYσμX,μYσμY2]\begin{bmatrix} \sigma_{\mu_X}^2 & \sigma_{\mu_X, \mu_Y}\\ \sigma_{\mu_X, \mu_Y} & \sigma_{\mu_Y}^2\\ \end{bmatrix}
  • σμX2\sigma^2_{\mu_X} quantifies between-person differences in the person means of XX

  • ρb=σμX,μYσμxσμy\rho_b=\frac{\sigma_{\mu_X, \mu_Y}}{\sigma_{\mu_x}*\sigma_{\mu_y}} quantifies the population between-person correlation between XX and YY

Variability: Let σXi\sigma_{X_i} be individual ii's population intraindividual standard deviation in XX. Let E[σX]=μσxE[\sigma_{X}]=\mu_{\sigma_x}. The covariance matrix of the person SD variable is

[σσX2σσX,σYσσX,σYσσY2 ]\begin{bmatrix} \sigma_{\sigma_X}^2 & \sigma_{\sigma_X, \sigma_Y} \\ \sigma_{\sigma_X, \sigma_Y} & \sigma_{\sigma_Y}^2\ \end{bmatrix}

Findings:

  1. ρX,Y\rho_{X,Y}, the population correlation, is a weighted average of the b/t person correlation ρb\rho_b and the average w/i person correlation μρw\mu_{\rho w}. So itreflects neither

  2. person-mean centering successfully removes the ρb\rho_b from the population correlation. But the correlation still not equal to μρw\mu_{\rho w} when it is not 0.

  3. Person-scaled population correlation is equal to μρw\mu_{\rho w}

  4. Global standardization (G1, G2) generally yields inconsistent estimates of the average within-person correlation μρw\mu_{\rho w} when it is not 0 and there are between-person differences in the w/i person standard deviations of one or both of the time-varying variables.

Simulation Study and Result

  1. if μρw=0\mu_{\rho w}=0, regardless of whether there are between-person differences in persons SDs of XX and YY, the global scaling and person scaling approach all yield consistent results with good coverage rate.

  2. If μρw=0.5\mu_{\rho w}=0.5, global scaling approach yields inconstent results. The EB method yield results with ignorable bias when T>20T> 20, and became better when more time points in terms of bias and coverage rates .

  3. When μρw0\mu_{\rho w}\neq 0, the performance of the G1 and G2 method depends on whether there are between-person differences in person SDs.

Other Minor points

  • Researcher have found that substantial between-person difference in within-person standard deviations existed in psychological and behavioral variables such as mood, motor performance, and stress.

  • Between-person differences in intraindividual variability (i.e., between-person diff in within-person sd) have been found to be predictive of important outcomes.

Last updated