... POS, and WSPP, where CW stands for content word lemmata, WS for all lemmata, POS for POS information, and PP for POS and punctuation in- formation. In the CL-experiments, we did not control for ... so grave for the LPE experiments because of the ceiling effect and the small size of the complete data set, therefore, we did not rerun the corresponding experiments. Furthermore, the number ... lem- mata for LPE and 5440 lemmata and punctuation marks for CL. We then determined the relevance of each of these lemmata for a given classifica- tion task by their gain ratio (Yang and Pedersen,...