Congeneric reliability

From testwiki
Revision as of 19:54, 28 April 2024 by imported>Bernanke's Crossbow (top: [2]?)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Broader

In statistical models applied to psychometrics, congeneric reliability ρC ("rho C")[1] a single-administration test score reliability (i.e., the reliability of persons over items holding occasion fixed) coefficient, commonly referred to as composite reliability, construct reliability, and coefficient omega. ρC is a structural equation model (SEM)-based reliability coefficients and is obtained from on a unidimensional model. ρC is the second most commonly used reliability factor after tau-equivalent reliability(ρT; also known as Cronbach's alpha), and is often recommended as its alternative.

History and names

A quantity similar (but not mathematically equivalent) to congeneric reliability first appears in the appendix to McDonald's 1970 paper on factor analysis, labeled θ.[2] In McDonald's work, the new quantity is primarily a mathematical convenience: a well-behaved intermediate that separates two values.[3][4] Seemingly unaware of McDonald's work, Jöreskog first analyzed a quantity equivalent to congeneric reliability in a paper the following year.[4][5] Jöreskog defined congeneric reliability (now labeled ρ) with coordinate-free notation,[5] and three years later, Werts gave the modern, coordinatized formula for the same.[6] Both of the latter two papers named the new quantity simply "reliability".[5][6] The modern name originates with Jöreskog's name for the model whence he derived ρC: a "congeneric model".[1][7][8]

Applied statisticians have subsequently coined many names for ρC. "Composite reliability" emphasizes that ρC measures the statistical reliability of composite scores.[1][9] As psychology calls "constructs" any latent characteristics only measurable through composite scores,[10] ρC has also been called "construct reliability".[11] Following McDonald's more recent expository work on testing theory, some SEM-based reliability coefficients, including congeneric reliability, are referred to as "reliability coefficient ω", often without a definition.[1][12][13]

Formula and calculation

Congeneric measurement model

Congeneric reliability applies to datasets of vectors: each row Template:Mvar in the dataset is a list Template:Math of numerical scores corresponding to one individual. The congeneric model supposes that there is a single underlying property ("factor") of the individual Template:Mvar, such that each numerical score Template:Math is a noisy measurement of Template:Mvar. Moreover, that the relationship between Template:Mvar and Template:Mvar is approximately linear: there exist (non-random) vectors Template:Math and Template:Math such that Xi=λiF+μi+Ei, where Template:Math is a statistically independent noise term.[5]

In this context, Template:Math is often referred to as the factor loading on item Template:Mvar.

Because Template:Math and Template:Math are free parameters, the model exhibits affine invariance, and Template:Mvar may be normalized to mean Template:Math and variance Template:Math without loss of generality. The fraction of variance explained in item Template:Math by Template:Mvar is then simply ρi=λi2λi2+𝕍[Ei]. More generally, given any covector Template:Mvar, the proportion of variance in Template:Math explained by Template:Mvar is ρ=(wλ)2(wλ)2+𝔼[(wE)2], which is maximized when Template:Math.[5]

Template:Math is this proportion of explained variance in the case where Template:Math (all components of Template:Mvar equally important): ρC=(i=1kλi)2(i=1kλi)2+i=1kσEi2

Example

Fitted/implied covariance matrix
X1 X2 X3 X4
X1 10.00
X2 4.42 11.00
X3 4.98 5.71 12.00
X4 6.98 7.99 9.01 13.00
Σ 124.23=Σdiagonal+2×Σsubdiagonal

These are the estimates of the factor loadings and errors:

Factor loadings and errors
λ^i σ^ei2
X1 1.96 6.13
X2 2.25 5.92
X3 2.53 5.56
X4 3.55 .37
Σ 10.30 18.01
Σ2 106.22
ρ^C=(i=1kλ^i)2σ^X2=106.22124.23=.8550
ρ^C=(i=1kλ^i)2(i=1kλ^i)2+i=1kσ^ei2=106.22106.22+18.01=.8550

Compare this value with the value of applying tau-equivalent reliability to the same data.

Tau-equivalent reliability (ρT), which has traditionally been called "Cronbach's α", assumes that all factor loadings are equal (i.e. λ1=λ2=...=λk). In reality, this is rarely the case and, thus, it systematically underestimates the reliability. In contrast, congeneric reliability (ρC) explicitly acknowledges the existence of different factor loadings. According to Bagozzi & Yi (1988), ρC should have a value of at least around 0.6.[14] Often, higher values are desirable. However, such values should not be misunderstood as strict cutoff boundaries between "good" and "bad".[15] Moreover, ρC values close to 1 might indicate that items are too similar. Another property of a "good" measurement model besides reliability is construct validity.

A related coefficient is average variance extracted.

References

  1. 1.0 1.1 1.2 1.3 Cho, E. (2016). Making reliability reliable: A systematic approach to reliability coefficients. Organizational Research Methods, 19(4), 651–682. https://doi.org/10.1177/1094428116656239
  2. Although McDonald, R. P. (1985). Factor analysis and related methods. Hillsdale, NJ: Lawrence Erlbaum and (1999). Test theory. Mahwah, NJ: Lawrence Erlbaum claim that Template:Harvnb invented congeneric reliability, there is a subtle difference between the formula given there and the modern one. As discussed in Template:Harvnb, McDonald's denominator totals observed covariances, but the modern definition divides by the sum of fitted covariances.
  3. Template:Wikicite
  4. 4.0 4.1 Template:Wikicite
  5. 5.0 5.1 5.2 5.3 5.4 Jöreskog, K. G. (1971). Statistical analysis of sets of congeneric tests. Psychometrika, 36(2), 109–133. https://doi.org/10.1007/BF02291393
  6. 6.0 6.1 Werts, C. E., Linn, R. L., & Jöreskog, K. G. (1974). Intraclass reliability estimates: Testing structural assumptions. Educational and Psychological Measurement, 34, 25–33. Template:Doi
  7. Graham, J. M. (2006). Congeneric and (Essentially) Tau-Equivalent Estimates of Score Reliability What They Are and How to Use Them. Educational and Psychological Measurement, 66(6), 930–944. https://doi.org/10.1177/0013164406288165
  8. Lucke, J. F. (2005). “Rassling the Hog”: The Influence of Correlated Item Error on Internal Consistency, Classical Reliability, and Congeneric Reliability. Applied Psychological Measurement, 29(2), 106–125. https://doi.org/10.1177/0146621604272739
  9. Werts, C. E., Rock, D. R., Linn, R. L., & Jöreskog, K. G. (1978). A general method of estimating the reliability of a composite. Educational and Psychological Measurement, 38(4), 933–938. https://doi.org/10.1177/001316447803800412
  10. Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52(4), 281–302. https://doi.org/10.1037/h0040957
  11. Hair, J. F., Babin, B. J., Anderson, R. E., & Black, W. C. (2018). Multivariate data analysis (8th ed.). Cengage.
  12. Padilla, M. (2019). A Primer on Reliability via Coefficient Alpha and Omega. Archives of Psychology, 3(8), Article 8. https://doi.org/10.31296/aop.v3i8.125
  13. Revelle, W., & Zinbarg, R. E. (2009). Coefficients alpha, beta, omega, and the glb: Comments on Sijtsma. Psychometrika, 74(1), 145–154. https://doi.org/10.1007/s11336-008-9102-z
  14. Bagozzi & Yi (1988), https://dx.doi.org/10.1177/009207038801600107
  15. Guide & Ketokivi (2015), https://dx.doi.org/10.1016/S0272-6963(15)00056-X
  • RelCalc, tools to calculate congeneric reliability and other coefficients.
  • Handbook of Management Scales, Wikibook that contains management related measurement models, their indicators and often congeneric reliability.