Rasch model estimation

Estimation of a Rasch model refers to the procedures used to estimate the person and item parameters of the Rasch model from matrices of response data. The most common approaches are forms of maximum likelihood estimation, such as joint and conditional maximum likelihood estimation. Joint maximum likelihood (JML) equations are efficient, but inconsistent for a finite number of items, whereas conditional maximum likelihood (CML) equations give consistent and unbiased item estimates. Person estimates are generally thought to be biased, although weighted likelihood estimation of person parameters reduces the bias.

Rasch model

The Rasch model for dichotomous data takes the form:

$$\Pr\{X_{ni}=1\} = \frac{\exp(\beta_n - \delta_i)}{1 + \exp(\beta_n - \delta_i)},$$

where $\beta_n$ is the ability of person $n$ and $\delta_i$ is the difficulty of item $i$.
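As an illustration, the model equation can be evaluated directly. The following is a minimal Python sketch; the function name `rasch_probability` is hypothetical and chosen here only for illustration. It branches on the sign of the difference between ability and difficulty to avoid overflow in the exponential.

```python
import math


def rasch_probability(beta: float, delta: float) -> float:
    """Probability of a correct response under the dichotomous Rasch model.

    beta is the person ability and delta is the item difficulty,
    both on the same logit scale.
    """
    z = beta - delta
    if z >= 0:
        # Equivalent form exp(z) / (1 + exp(z)) rewritten to avoid overflow.
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


# A person whose ability equals the item difficulty succeeds half the time.
print(rasch_probability(0.0, 0.0))  # 0.5
# Higher ability relative to difficulty raises the probability.
print(rasch_probability(1.0, 0.0))
```

Only the difference between ability and difficulty enters the probability, so adding the same constant to every ability and every difficulty leaves the model unchanged.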

Joint maximum likelihood

Let $x_{ni}$ denote the observed response for person $n$ on item $i$. The probability of the observed data matrix, which is the product of the probabilities of the individual responses, is given by the likelihood function

$$\Lambda = \frac{\prod_{n}\prod_{i}\exp(x_{ni}(\beta_n - \delta_i))}{\prod_{n}\prod_{i}(1 + \exp(\beta_n - \delta_i))}.$$

The log-likelihood function is then

$$\log \Lambda = \sum_{n=1}^{N}\beta_n r_n - \sum_{i=1}^{I}\delta_i s_i - \sum_{n=1}^{N}\sum_{i=1}^{I}\log(1 + \exp(\beta_n - \delta_i))$$

where $r_n = \sum_{i=1}^{I} x_{ni}$ is the total raw score for person $n$, $s_i = \sum_{n=1}^{N} x_{ni}$ is the total raw score for item $i$, $N$ is the total number of persons and $I$ is the total number of items.
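The log-likelihood depends on the data only through the raw scores of the persons and items. The sketch below, in Python, computes it from those scores and checks the result against a direct sum of the log-probabilities of the individual responses. The function names and the small data matrix are hypothetical and serve only as an illustration.

```python
import math


def log_likelihood(x, beta, delta):
    """Joint log-likelihood computed from person and item raw scores."""
    r = [sum(row) for row in x]          # raw score of each person
    s = [sum(col) for col in zip(*x)]    # raw score of each item
    person_term = sum(b * r_n for b, r_n in zip(beta, r))
    item_term = sum(d * s_i for d, s_i in zip(delta, s))
    norm_term = sum(math.log1p(math.exp(b - d)) for b in beta for d in delta)
    return person_term - item_term - norm_term


def log_likelihood_direct(x, beta, delta):
    """Same quantity as a sum of log-probabilities of each response."""
    total = 0.0
    for n, b in enumerate(beta):
        for i, d in enumerate(delta):
            p = math.exp(b - d) / (1.0 + math.exp(b - d))
            total += math.log(p) if x[n][i] == 1 else math.log(1.0 - p)
    return total


# Hypothetical responses: rows are persons, columns are items.
x = [[1, 0, 1],
     [0, 0, 1],
     [1, 1, 0]]
beta = [0.5, -0.3, 1.0]
delta = [-0.2, 0.8, 0.0]

print(log_likelihood(x, beta, delta))
print(log_likelihood_direct(x, beta, delta))  # agrees up to rounding
```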

Solution equations are obtained by taking partial derivatives with respect to $\delta_i$ and $\beta_n$ and setting the result equal to 0. The JML solution equations are:

$$s_i = \sum_{n=1}^{N} p_{ni}, \quad i = 1, \dots, I$$
$$r_n = \sum_{i=1}^{I} p_{ni}, \quad n = 1, \dots, N$$

where $p_{ni} = \exp(\beta_n - \delta_i)/(1 + \exp(\beta_n - \delta_i))$.

The resulting estimates are biased, and no finite estimates exist for persons with a score of 0 (no correct responses) or with 100% correct responses (a perfect score). The same holds for items with extreme scores: no finite estimates exist for these either. The bias is due to a well-known effect described by Kiefer & Wolfowitz (1956). The item estimates are inflated by a factor of approximately $I/(I-1)$, and a more accurate (less biased) estimate of each $\delta_i$ is obtained by multiplying the estimates by $(I-1)/I$.
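The correction amounts to a single rescaling of the item estimates. A minimal Python sketch follows; the function name is hypothetical, and it assumes the JML item estimates are already centred at zero, so that rescaling does not move the origin of the scale.

```python
def correct_jml_item_bias(delta_jml):
    """Rescale JML item estimates by (I - 1) / I, with I the number of items.

    Assumes the estimates are centred at zero.
    """
    n_items = len(delta_jml)
    if n_items < 2:
        raise ValueError("at least two items are required")
    factor = (n_items - 1) / n_items
    return [factor * d for d in delta_jml]


# With two items the correction factor is 1/2.
print(correct_jml_item_bias([1.0, -1.0]))  # [0.5, -0.5]
```

The factor approaches 1 as the number of items grows, so the correction matters most for short tests.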

Conditional maximum likelihood

The conditional likelihood function is defined as

$$\Lambda = \prod_{n}\Pr\{(x_{ni}) \mid r_n\} = \frac{\exp\left(-\sum_{i} s_i \delta_i\right)}{\prod_{n}\gamma_{r}}$$

in which

$$\gamma_r = \sum_{(x) \mid r}\exp\left(-\sum_{i} x_{ni}\delta_i\right)$$

is the elementary symmetric function of order r, which represents the sum over all combinations of r items. For example, in the case of three items,

$$\gamma_2 = \exp(-\delta_1 - \delta_2) + \exp(-\delta_1 - \delta_3) + \exp(-\delta_2 - \delta_3).$$
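Summing over all combinations explicitly becomes expensive as the number of items grows, so the elementary symmetric functions are usually built up one item at a time. The Python sketch below uses such a recursion, which is one common choice and not necessarily the method used in the sources cited here, and then evaluates the logarithm of the conditional likelihood defined above. The function names and data are hypothetical.

```python
import math


def elementary_symmetric(delta):
    """Return [gamma_0, ..., gamma_I] for item difficulties delta.

    Items are added one at a time; gamma[r] gains the combinations
    that include the new item, namely exp(-delta_i) * gamma[r - 1].
    """
    gamma = [1.0] + [0.0] * len(delta)
    for k, d in enumerate(delta, start=1):
        eps = math.exp(-d)
        # Descend over r so gamma[r - 1] still excludes the new item.
        for r in range(k, 0, -1):
            gamma[r] += eps * gamma[r - 1]
    return gamma


def conditional_log_likelihood(x, delta):
    """Log of the conditional likelihood given the person raw scores."""
    gamma = elementary_symmetric(delta)
    s = [sum(col) for col in zip(*x)]    # item raw scores
    r = [sum(row) for row in x]          # person raw scores
    numerator = -sum(s_i * d for s_i, d in zip(s, delta))
    return numerator - sum(math.log(gamma[r_n]) for r_n in r)


delta = [0.3, -0.5, 0.2]
gamma = elementary_symmetric(delta)
# gamma[2] matches the three-item example written out above.
print(gamma[2])
print(conditional_log_likelihood([[1, 0, 1], [0, 1, 0]], delta))
```

The person parameters do not appear anywhere in this computation, which is what allows the item parameters to be estimated without them.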

Details can be found in the chapters by von Davier (2016) for the dichotomous Rasch model and von Davier & Rost (1995) for the polytomous Rasch model.

Estimation algorithms

Some form of iterative algorithm, such as an expectation-maximization algorithm, is used to estimate the parameters of Rasch models. Algorithms for implementing maximum likelihood estimation commonly employ Newton–Raphson iterations to solve the solution equations obtained by setting the partial derivatives of the log-likelihood function equal to 0. Convergence criteria determine when the iterations cease. For example, the criterion might be that the item estimates change by less than a certain value, such as 0.001, between one iteration and the next for all items.
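To illustrate Newton–Raphson iteration with a convergence criterion, the Python sketch below estimates a single person parameter while treating the item difficulties as known, which is an assumption made here to keep the example small. The iterations stop once the estimate changes by less than 0.001. The function name `estimate_ability` is hypothetical.

```python
import math


def estimate_ability(raw_score, delta, tol=0.001, max_iter=100):
    """Newton-Raphson estimate of one ability for fixed item difficulties.

    Solves raw_score = sum_i p_i(beta) and returns (beta, iterations).
    """
    n_items = len(delta)
    if not 0 < raw_score < n_items:
        raise ValueError("extreme scores have no finite estimate")
    beta = 0.0
    for iteration in range(1, max_iter + 1):
        p = [1.0 / (1.0 + math.exp(-(beta - d))) for d in delta]
        gradient = raw_score - sum(p)               # first derivative
        information = sum(q * (1.0 - q) for q in p)  # minus second derivative
        step = gradient / information
        beta += step
        # Convergence criterion: stop when the change is below tol.
        if abs(step) < tol:
            return beta, iteration
    raise RuntimeError("no convergence within max_iter iterations")


beta_hat, n_iter = estimate_ability(2, [-1.0, 0.0, 1.0])
print(beta_hat, n_iter)
```

A tighter tolerance costs more iterations but, because Newton–Raphson converges quickly near the solution, usually only a few.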
