... description of the two new algorithms for Viterbi training and stochastic EM training In addition, we implement all three algorithms, i.e the new algorithms for Viterbi training and stochastic EM training ... , Π s ( X )) and E i (y , L, M) = E iq (y , X , Π s ( X )) , Proof: The proof for this theorem is very similar to the proof of theorem for Viterbi training and therefore omitted The key differences ... for Viterbi training, Baum-Welch training and stochastic EM training for the three different models For each model, we implemented each of the three training methods using the linear-memory algorithms...