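The weather keeps swinging between cold and warm, so it is hard to dress for it.

Carrying a coat feels like overkill, thin clothes seem too cold... so I told myself to just tough it out, and that attitude seems to have given me a cold. ^^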

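Around this time, I think most of you are at about the first round of thesis review.

Some people leave the work untouched the whole time and then rush through it at the last minute; others wrestle with the textbook on their own and email me when it doesn't work out. Occasionally, someone whose analyst went silent and disappeared contacts me to redo the whole thing.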

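I don't know blog marketing, and I don't really know how to get to the top of Naver's search results. All I know is that you write posts with tags, and that's it. When I occasionally search for "thesis statistics" myself, I see many people offering this service, and, perhaps as a kind of economy of scale, posts from a few very active people fill several pages of results.

It has been almost five years since I started doing statistics through this blog, and the number of providers has clearly grown a lot since then. As a result, clients naturally seem to shop around for, what should I call it, value for money: high quality for the price.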

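As anyone who runs statistics knows, if the data are the same, the values come out the same no matter who runs the analysis. What probably differs a little is the interpretation. The design, however, is a different matter. The analysis itself may be the same, but how the analysis is designed has a large effect on the overall results.

If a study needs to produce value B through analysis A, but instead produces value A through analysis B, that is a real problem.

With luck, the thesis may get published without either the client or the reviewing professor knowing whether the values are right or wrong. But a degree is close to a once-in-a-lifetime thing, and I don't think you should produce a thesis that would be embarrassing when others read it.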

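So I always stress this to the people who email me. If cost is your only concern, many others work more cheaply than I do, so feel free to look around. There are surely people who do better work than I do for a lower price. But nothing in this world comes free that easily. I used to compare options whenever I bought something and pick the cheapest one, but that did not always turn out well.

Comparing the purchase of knowledge work with buying a product may be awkward, but there seems to be a reason for everything.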

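Today I met someone who had designed a moderated regression, but the analyst had sent mediated regression results written up as moderated regression. On top of that, the person I met believed it was moderated regression and asked me to explain it. Naturally I am not revealing who this person is, and I am not speaking ill of them. I am posting this only after getting their permission to write about it so that it might help others.

When I explained it step by step from the beginning, they spent more time blaming themselves and calling themselves pathetic than taking in what I was saying.

People outsource statistics because they don't know it. So whoever you hire, be sure to ask for an interpretation, or at least an explanation, that you can understand. You are not paying a small amount, so you should get the result you want.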

I was writing my diary before bed and ended up rambling. If you have any questions, email me anytime: chsoo.lee@gmail.com


Q1. How can a mediation effect be tested?


A.  Mediation analysis uses the estimates and standard errors from the following regression equations (MacKinnon, 1994):

Y = c X + e1
M = a X + e2
Y = c' X + b M + e3

The three equations correspond, in order, to these conditions:
-The independent variable (X) causes the outcome variable (Y).
-The independent variable (X) causes the mediator variable (M).
-The mediator (M) causes the outcome variable (Y) when controlling for the independent variable (X). This condition must hold for mediation.


1> Full Mediation: If the effect of X on Y is zero when the mediator is included (c' = 0), there is evidence for mediation (Judd & Kenny, 1981a, 1981b).

2> Partial Mediation: If the effect of X on Y is reduced but not eliminated when the mediator is included (c' < c), the effect is only partially mediated.
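
As a rough illustration of the three regressions above, here is a minimal Python sketch on simulated data. The data-generating values (0.5, 0.4, 0.2), the sample size, and the helper names `simple_slope` and `two_predictor_slopes` are assumptions made for this example only; they do not come from MacKinnon (1994).

```python
import random

def mean(values):
    return sum(values) / len(values)

def cov(u, v):
    """Sample covariance of two equal-length lists."""
    mu, mv = mean(u), mean(v)
    return sum((ui - mu) * (vi - mv) for ui, vi in zip(u, v)) / (len(u) - 1)

def simple_slope(x, y):
    """OLS slope of y on x (intercept included)."""
    return cov(x, y) / cov(x, x)

def two_predictor_slopes(x, m, y):
    """OLS slopes of y on x and m (intercept included): returns (c_prime, b)."""
    sxx, smm, sxm = cov(x, x), cov(m, m), cov(x, m)
    sxy, smy = cov(x, y), cov(m, y)
    det = sxx * smm - sxm ** 2
    c_prime = (smm * sxy - sxm * smy) / det
    b = (sxx * smy - sxm * sxy) / det
    return c_prime, b

# Simulated data (hypothetical): X -> M -> Y plus a small direct path X -> Y.
random.seed(0)
n = 500
x = [random.gauss(0, 1) for _ in range(n)]
m = [0.5 * xi + random.gauss(0, 1) for xi in x]
y = [0.2 * xi + 0.4 * mi + random.gauss(0, 1) for xi, mi in zip(x, m)]

c = simple_slope(x, y)                      # Y = c X + e1
a = simple_slope(x, m)                      # M = a X + e2
c_prime, b = two_predictor_slopes(x, m, y)  # Y = c' X + b M + e3

print(f"c = {c:.3f}, a = {a:.3f}, b = {b:.3f}, c' = {c_prime:.3f}")
```

If c' is close to zero while a and b are not, the pattern points toward full mediation; if c' is merely smaller than c, toward partial mediation. Whether a coefficient really differs from zero still needs a significance test.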


Q2. How is the significance of the mediation effect tested?


A.  To calculate the significance of the mediated effect, divide the mediated effect by its standard error (MacKinnon & Dwyer, 1993). The regression coefficients (a, b, c, and c' from above) and their standard errors (sec, sea, seb, and sec') come from the output of the regressions above.

-Sobel Test: Divide the mediated effect (a*b) by its standard error.

The result is a z-score.


The formula for this standard error (seab) of the mediated effect (a*b) is below (Sobel 1982, 1986).
seab = SQRT(a^2*seb^2 + b^2*sea^2)


Details may be found in:

-Sobel, M. E. (1982). Asymptotic confidence intervals for indirect effects in structural equation models. In S. Leinhardt (Ed.), Sociological Methodology 1982 (pp. 290-312). Washington, DC: American Sociological Association.

-Sobel, M. E. (1986). Some new results on indirect effects and their standard errors in covariance structure models. In N. Tuma (Ed.), Sociological Methodology 1986 (pp. 159-186). Washington, DC: American Sociological Association.

Note that there is evidence that the z-score of the mediated effect (zab) is not normally distributed. There are also alternative methods to test the significance of the mediated effect.


Q3. In a mediation model, how are the total effect, the direct effect, and the mediated (indirect) effect calculated?


A. Using the regression coefficients from the models above, the components of a mediation model are:

1> Total effect = a*b + c'    

The total effect is the sum of the direct and indirect effects of X on the outcome (Y).

2> Direct effect = c'  

The direct effect is the effect of X on Y when taking the mediator into account.

3> Mediated(Indirect) effect = a*b  

The mediated effect is also called the indirect effect. This is because it is the part of the model that indirectly affects the outcome through the mediator.
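
The link between these components and the first regression (Y = c X + e1) can be written out as a short derivation. This is a sketch that assumes linear models fitted by ordinary least squares on the same cases; the identity is not guaranteed for other models such as logistic regression.

```latex
\begin{align*}
Y &= c'X + bM + e_3 \\
  &= c'X + b\,(aX + e_2) + e_3 \\
  &= (c' + ab)\,X + (b\,e_2 + e_3)
\end{align*}
% Comparing with Y = cX + e_1:
\[ c = c' + ab \quad\Longrightarrow\quad ab = c - c' \]
```

So the total effect a*b + c' is the coefficient c from the regression of Y on X alone, and the mediated effect can be obtained either as a*b or as c - c'.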


Q4. What is the difference between a moderation (interaction) effect and a mediation effect?


A.  Mediation implies a causal sequence among three variables, X to M to Y (the independent variable causes the mediator and the mediator causes the dependent variable).  For example, an intervention may change social norms, and this change in social norms in turn prevents smoking. An interaction means that the effect of X on Y depends on the level of a third variable. No causal sequence is implied by interaction.  For example, an intervention may be successful for males but not for females, which is an interaction effect.
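
To make the contrast concrete, here is a minimal Python sketch of the interaction example (an intervention that works for males but not for females). The data are simulated and the effect sizes are assumptions for illustration. With a binary moderator, the interaction coefficient in a model with an X-by-group product term equals the difference between the two within-group slopes, which is what the code computes.

```python
import random

def slope(x, y):
    """OLS slope of y on x (intercept included)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx

# Simulated data (hypothetical): the intervention lowers the outcome by 1.0
# for males and has no effect for females.
random.seed(1)
rows = []
for _ in range(2000):
    male = random.random() < 0.5
    treated = random.choice([0, 1])
    effect = -1.0 if male else 0.0
    outcome = effect * treated + random.gauss(0, 1)
    rows.append((male, treated, outcome))

slope_male = slope([t for g, t, o in rows if g], [o for g, t, o in rows if g])
slope_female = slope([t for g, t, o in rows if not g], [o for g, t, o in rows if not g])

# Difference in slopes = interaction (moderation) effect of sex on the
# intervention-outcome relationship.
interaction = slope_male - slope_female
print(f"male slope = {slope_male:.3f}, female slope = {slope_female:.3f}, "
      f"interaction = {interaction:.3f}")
```

Nothing in this model says that sex lies on a causal path between the intervention and the outcome; it only says the size of the effect differs by group. That is the difference from the X to M to Y chain of mediation.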


Q5. Are there any references where I can learn more about mediation effects?


A.  Some good background references include:  

-Baron, R.M. & Kenny, D.A. (1986). The moderator-mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology, 51, 1173-1182.

-Judd, C. M., & Kenny, D. A. (1981a). Estimating the effects of social interventions. New York: Cambridge University Press.

-Judd, C.M. & Kenny, D.A. (1981b). Process analysis: Estimating mediation in treatment evaluations. Evaluation Review, 5, 602-619.

-MacKinnon, D.P. (1994). Analysis of mediating variables in prevention and intervention research. In A. Cazares and L. A. Beatty, Scientific methods in prevention research. NIDA Research Monograph 139. DHHS Pub. No. 94-3631. Washington, DC: U.S. Govt. Print. Office, pp. 127-153. 

-MacKinnon, D.P. & Dwyer, J.H. (1993). Estimating mediated effects in prevention studies. Evaluation Review, 17, 144-158. 

[Source: http://www.public.asu.edu/~davidpm/ripl/q&a.htm]


If that still doesn't work for you, try this page:


Calculation for the Sobel Test

An interactive calculation tool for mediation tests

Kristopher J. Preacher
University of Kansas
Geoffrey J. Leonardelli
University of Toronto

UPDATE: The Sobel test works well only in large samples. We recommend using this test only if the user has no access to raw data. If you have the raw data, bootstrapping offers a much better alternative that imposes no distributional assumptions. Consult Preacher and Hayes (2004, 2008) for details and easy-to-use macros that run the necessary regression analyses for you:

Preacher, K. J., & Hayes, A. F. (2008). Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models. Behavior Research Methods, 40, 879-891.

Preacher, K. J., & Hayes, A. F. (2004). SPSS and SAS procedures for estimating indirect effects in simple mediation models. Behavior Research Methods, Instruments, & Computers, 36(4), 717-731.
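
The bootstrap idea mentioned in the update can be sketched in a few lines of Python. This is a plain percentile bootstrap written for illustration, not the Preacher and Hayes macros; the simulated data, the number of resamples, and the function names are assumptions.

```python
import random

def cov(u, v):
    """Sample covariance of two equal-length lists."""
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    return sum((ui - mu) * (vi - mv) for ui, vi in zip(u, v)) / (len(u) - 1)

def indirect_effect(x, m, y):
    """a*b: a is the slope of M on X, b the slope of Y on M controlling for X."""
    sxx, smm, sxm = cov(x, x), cov(m, m), cov(x, m)
    sxy, smy = cov(x, y), cov(m, y)
    a = sxm / sxx
    b = (sxx * smy - sxm * sxy) / (sxx * smm - sxm ** 2)
    return a * b

def bootstrap_ci(x, m, y, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the indirect effect."""
    rng = random.Random(seed)
    n = len(x)
    estimates = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # resample cases with replacement
        estimates.append(indirect_effect([x[i] for i in idx],
                                         [m[i] for i in idx],
                                         [y[i] for i in idx]))
    estimates.sort()
    k = int(n_boot * alpha / 2)  # number of estimates cut from each tail
    return estimates[k], estimates[n_boot - 1 - k]

# Simulated raw data (hypothetical values).
random.seed(42)
x = [random.gauss(0, 1) for _ in range(300)]
m = [0.5 * xi + random.gauss(0, 1) for xi in x]
y = [0.4 * mi + random.gauss(0, 1) for mi in m]

lower, upper = bootstrap_ci(x, m, y)
print(f"indirect effect = {indirect_effect(x, m, y):.3f}, "
      f"95% CI = [{lower:.3f}, {upper:.3f}]")
```

If the interval excludes zero, that is taken as evidence of an indirect effect, without assuming that a*b is normally distributed.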


Mediation effects
A variable may be considered a mediator to the extent to which it carries the influence of a given independent variable (IV) to a given dependent variable (DV). Generally speaking, mediation can be said to occur when (1) the IV significantly affects the mediator, (2) the IV significantly affects the DV in the absence of the mediator, (3) the mediator has a significant unique effect on the DV, and (4) the effect of the IV on the DV shrinks upon the addition of the mediator to the model. These criteria can be used to informally judge whether or not mediation is occurring, but MacKinnon & Dwyer (1993) and MacKinnon, Warsi, & Dwyer (1995) have popularized statistically based methods by which mediation may be formally assessed.

Purpose of Sobel test
To test whether a mediator carries the influence of an IV to a DV.

A friendly warning
Blind use of this application without a proper understanding of mediation or the logic behind these tests will lead to erroneous conclusions. Please consult the references before proceeding.

An illustration of mediation
a, b, and c' are path coefficients. Values in parentheses are standard errors of those path coefficients.

Description of numbers needed
a = raw (unstandardized) regression coefficient for the association between IV and mediator.
sa = standard error of a.
b = raw coefficient for the association between the mediator and the DV (when the IV is also a predictor of the DV).
sb = standard error of b.

To get numbers

  1. Run a regression analysis with the IV predicting the mediator. This will give a and sa.
  2. Run a regression analysis with the IV and mediator predicting the DV. This will give b and sb. Note that sa and sb should never be negative.
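
The two regressions above can be sketched in Python without a statistics package. The simulated data and the function names `path_a` and `path_b` are assumptions for illustration; in practice these four numbers would be read off your statistics package's regression output.

```python
import math
import random

def centered(values):
    mean = sum(values) / len(values)
    return [v - mean for v in values]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def path_a(x, m):
    """Regress M on X: return the slope a and its standard error sa."""
    xc, mc = centered(x), centered(m)
    sxx = dot(xc, xc)
    a = dot(xc, mc) / sxx
    sse = sum((mi - a * xi) ** 2 for xi, mi in zip(xc, mc))
    sa = math.sqrt(sse / (len(x) - 2) / sxx)  # residual df = n - 2
    return a, sa

def path_b(x, m, y):
    """Regress Y on X and M: return the coefficient b on M and its standard error sb."""
    xc, mc, yc = centered(x), centered(m), centered(y)
    sxx, smm, sxm = dot(xc, xc), dot(mc, mc), dot(xc, mc)
    sxy, smy = dot(xc, yc), dot(mc, yc)
    det = sxx * smm - sxm ** 2
    c_prime = (smm * sxy - sxm * smy) / det
    b = (sxx * smy - sxm * sxy) / det
    sse = sum((yi - c_prime * xi - b * mi) ** 2 for xi, mi, yi in zip(xc, mc, yc))
    sb = math.sqrt(sse / (len(x) - 3) * sxx / det)  # residual df = n - 3
    return b, sb

# Simulated data (hypothetical values).
random.seed(7)
iv = [random.gauss(0, 1) for _ in range(200)]
mediator = [0.5 * v + random.gauss(0, 1) for v in iv]
dv = [0.4 * w + 0.2 * v + random.gauss(0, 1) for v, w in zip(iv, mediator)]

a, sa = path_a(iv, mediator)
b, sb = path_b(iv, mediator, dv)
print(f"a = {a:.3f} (sa = {sa:.3f}), b = {b:.3f} (sb = {sb:.3f})")
```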

To conduct the Sobel test
Details can be found in Baron and Kenny (1986), Sobel (1982), Goodman (1960), and MacKinnon, Warsi, and Dwyer (1995). On the original page, you enter a, b, sa, and sb into input cells and the program calculates the critical ratio as a test of whether the indirect effect of the IV on the DV via the mediator is significantly different from zero. It reports a test statistic and p-value for the Sobel, Aroian, and Goodman tests.

Alternatively, on the original page you can enter ta and tb, where ta and tb are the t-test statistics for the difference between the a and b coefficients and zero (ta = a/sa, tb = b/sb). Results should be identical to the first test, except for error due to rounding.
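
Why the two inputs give the same result can be seen by dividing the numerator and denominator of the Sobel ratio by sa*sb. The derivation below is a sketch that takes ta = a/sa and tb = b/sb and assumes both standard errors are positive; the Aroian and Goodman forms follow in the same way.

```latex
\[
z_{\text{Sobel}}
  = \frac{ab}{\sqrt{b^2 s_a^2 + a^2 s_b^2}}
  = \frac{(a/s_a)(b/s_b)}{\sqrt{(b/s_b)^2 + (a/s_a)^2}}
  = \frac{t_a t_b}{\sqrt{t_a^2 + t_b^2}}
\]
\[
z_{\text{Aroian}} = \frac{t_a t_b}{\sqrt{t_a^2 + t_b^2 + 1}},
\qquad
z_{\text{Goodman}} = \frac{t_a t_b}{\sqrt{t_a^2 + t_b^2 - 1}}
\]
```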

The reported p-values (rounded to 8 decimal places) are drawn from the unit normal distribution under the assumption of a two-tailed z-test of the hypothesis that the mediated effect equals zero in the population. +/- 1.96 are the critical values of the test ratio which contain the central 95% of the unit normal distribution.
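
For readers without access to the calculator, the two-tailed p-value described above can be reproduced with the Python standard library. This is a minimal sketch; the function name `two_tailed_p` is an assumption for illustration.

```python
import math

def two_tailed_p(z):
    """Two-tailed p-value for a z statistic under the unit normal distribution."""
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)), the area in both tails.
    return math.erfc(abs(z) / math.sqrt(2))

# At the critical value |z| = 1.96 the p-value sits just under 0.05.
print(round(two_tailed_p(1.96), 8))
```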

We should note that there are three principal versions of the "Sobel test" - one that adds the third denominator term (Aroian, 1944/1947 - this is the version popularized by Baron & Kenny as the Sobel test), one that subtracts it (Goodman, 1960), and one that does not include it at all. We stress that researchers should consult MacKinnon, Lockwood, Hoffman, West, and Sheets (2002), as well as sources cited therein, before attempting to interpret the results of any of these tests. Researchers should consult Krull & MacKinnon (1999) before attempting to apply the Sobel test to parameter estimates obtained from multilevel modeling.

Formulae for the tests provided here were drawn from MacKinnon & Dwyer (1993) and from MacKinnon, Warsi, & Dwyer (1995):

Sobel test equation
z-value = a*b/SQRT(b^2*sa^2 + a^2*sb^2)

Aroian test equation
z-value = a*b/SQRT(b^2*sa^2 + a^2*sb^2 + sa^2*sb^2)

Goodman test equation
z-value = a*b/SQRT(b^2*sa^2 + a^2*sb^2 - sa^2*sb^2)

The Sobel test equation omits the third term of the variance estimate in the denominator. We recommend using the Aroian version of the Sobel test suggested in Baron and Kenny (1986) because it does not make the unnecessary assumption that the product of sa and sb is vanishingly small. The Goodman version of the test subtracts the third term for an unbiased estimate of the variance of the mediated effect, but this can sometimes have the unfortunate effect of yielding a negative variance estimate.

The Sobel test and the Aroian test seemed to perform best in a Monte Carlo study (MacKinnon, Warsi, & Dwyer, 1995), and converge closely with sample sizes greater than 50 or so.
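
The three test equations can be sketched as one small Python function. The function name and the input values below are hypothetical, chosen only to show how the three ratios relate; returning NaN when the Goodman variance estimate is not positive is a design choice of this sketch, not something the original calculator is known to do.

```python
import math

def mediation_z_tests(a, sa, b, sb):
    """Return the Sobel, Aroian, and Goodman z-values for the indirect effect a*b."""
    base = b ** 2 * sa ** 2 + a ** 2 * sb ** 2  # first two variance terms
    third = sa ** 2 * sb ** 2                   # third term: added, subtracted, or omitted

    def ratio(variance):
        # The Goodman variance estimate can be zero or negative; report NaN then.
        return a * b / math.sqrt(variance) if variance > 0 else float("nan")

    return {
        "sobel": ratio(base),
        "aroian": ratio(base + third),
        "goodman": ratio(base - third),
    }

# Hypothetical coefficients and standard errors.
z = mediation_z_tests(a=0.5, sa=0.1, b=0.4, sb=0.1)
for name, value in z.items():
    print(f"{name}: z = {value:.4f}")
```

For a positive a*b, the Aroian value is always the smallest of the three because its denominator is the largest, which makes it the most conservative version.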


Aroian, L. A. (1944/1947). The probability function of the product of two normally distributed variables. Annals of Mathematical Statistics, 18, 265-271.

Baron, R. M., & Kenny, D. A. (1986). The moderator-mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology, 51, 1173-1182.

Goodman, L. A. (1960). On the exact variance of products. Journal of the American Statistical Association, 55, 708-713.

Hoyle, R. H., & Kenny, D. A. (1999). Sample size, reliability, and tests of statistical mediation. In R. Hoyle (Ed.) Statistical Strategies for Small Sample Research. Thousand Oaks, CA: Sage Publications.

Krull, J. L., & MacKinnon, D. P. (1999). Multilevel mediation modeling in group-based intervention studies. Evaluation Review, 23(4), 418-444.

MacKinnon, D. P., & Dwyer, J. H. (1993). Estimating mediated effects in prevention studies. Evaluation Review, 17, 144-158.

MacKinnon, D. P., Lockwood, C. M., Hoffman, J. M., West, S. G., & Sheets, V. (2002). A comparison of methods to test mediation and other intervening variable effects. Psychological Methods, 7, 83-104.

MacKinnon, D. P., Warsi, G., & Dwyer, J. H. (1995). A simulation study of mediated effect measures. Multivariate Behavioral Research, 30(1), 41-62.

MacKinnon, D. P., Warsi, G., & Dwyer, J. H. (1995). "A simulation study of mediated effect measures:" Erratum. Multivariate Behavioral Research, 30(3), ii.

Preacher, K. J., & Hayes, A. F. (2004). SPSS and SAS procedures for estimating indirect effects in simple mediation models. Behavior Research Methods, Instruments, & Computers, 36(4), 717-731.

Shrout, P. E., & Bolger, N. (2002). Mediation in experimental and nonexperimental studies: New procedures and recommendations. Psychological Methods, 7(4), 422-445.

Sobel, M. E. (1982). Asymptotic confidence intervals for indirect effects in structural equation models. In S. Leinhardt (Ed.), Sociological methodology 1982 (pp. 290-312). San Francisco: Jossey-Bass.


We wish to thank David MacKinnon and David Kenny for advice which made this interactive web page possible. Comments and criticisms are welcome.

Originally posted March, 2001. All material on these pages not otherwise credited is ©2003 by Kristopher J. Preacher. This page was last updated on 8/10/06.

