... identifying these situations. Our results show a significant improvement over the baseline and illustrate that both lower -level acoustic features and higher -level dialogue features can af- fect the ... threshold. The four thresholds used for the four pmisrecs% features are -2 ,-3 ,-4 ,-5 , and were chosen by hand from the entire dataset to be infor- mative. The dialogue efficiency features measure ... classification model learned from the training data. To evaluate these results, the error rates of the learned classification models are estimated using the resampling method of cross-validation...