... function of the CRFs into that of the MCE criterion:g(y, x, λ) = log p(y|x; λ) ∝ λ · F (y, x) (11)Basically, CRF training with the MCE criterionoptimizes Eq. 9 with Eq. 11 after the selection of an ... of the different feature set, as de-scribed in Sec. 5.2. However, MCE-F showed thebetter performance of 85.29 compared with (Mc-Callum and Li, 2003) of 84.04, which used theMAP training of ... mis-classification. By using d(), the minimization of the error rate can be rewritten as the minimization of the sum of 0-1 (step) losses of the given training data. That is, arg minλLλwhereLλ=kδ(d(y∗k,...