P r(X, Z) =
X
X
⋆
P r(X, Z, X
⋆
)
=
X
X
⋆
P r(X | Z, X
⋆
)P r(Z | X
⋆
)P r(X
⋆
)
=
X
X
⋆
P r(X | X
⋆
)P r(Z | X
⋆
)P r(X
⋆
) given the assumption
Therefore, the matrix form of the above formula is
M
XZ
= M
X|X
⋆
M
′
ZX
⋆
.
Assumption 2: M
ZX
⋆
is a full rank matrix (invertible), which can be tested. Then we have
M
−1
XZ
= M
′
−1
ZX
⋆
M
−1
X|X
⋆
M
XY Z
M
−1
XZ
= M
X|X
⋆
D
Y |X
⋆
M
′
ZX
⋆
M
′
−1
ZX
⋆
M
−1
X|X
⋆
= M
X|X
⋆
D
Y |X
⋆
M
−1
X|X
⋆
= BΛB
−1
,
where Λ and B can be obtained using eigendecomposition.
Computational tricks
• In practice, Y need to be fixed, i.e., Y = y.
• We use control variable W to ensure that Assumption 1 holds.
• One need make reasonable assumptions, e.g., P r(y | X
⋆
= 1, w) > P r(y | X
⋆
= 0, w), to re-rank
diagonal elements in Λ and match them with true values.
• Elements in the same column in B need to be normalized to sum 1 as the columns of matrix B
are probabilistic distributions. An alternative way to constrain parameters is to use extremum
estimator:
ˆ
θ = arg min
θ
c
M
XyZ|w
c
M
−1
XZ|w
− M
X|X
⋆
,w
D
y |X
⋆
,w
M
−1
X|X
⋆
,w
s.t. (i) θ
i
∈ [0, 1] ∀i
(ii) P r(y | X
⋆
, w) is a monotonic function of X
⋆
(iii) P r(X = x | X
⋆
= i, w) = P r(X = x | X
⋆
= j, w) ∀ i = j
where
c
M
XyZ|w
j,k
=
n
X
i=1
1
(X
i
= x
j
, Y
i
= y, Z
i
= z
k
)
1
(W
i
= w) /
n
X
i=1
1
(W
i
= w)
c
M
XZ|w
j,k
=
n
X
i=1
1
(X
i
= x
j
, Z
i
= z
k
)
1
(W
i
= w) /
n
X
i=1
1
(W
i
= w)
References
Hu, Yingyao (2008). “Identification and estimation of nonlinear models with misclassification error
using instrumental variables: A general solution”. In: Journal of Econometrics 144(1), pp. 27–61.
2