... future work. Semantic classes were not examined, because defining, building, fine-tuning, and maintaining such word lists can be an arduous task (cf., e.g., Klavans and Kan, 1998), which ... size, incorporating information gain into the distance measure leads to a clear decrease in performance. Overall performance: Unsurprisingly, performance in terms of precision and recall ... stop word list. They are significantly less frequent in academic texts and categories E, L, NH, and P, and more frequent in fiction, NL, and R. Again, all differences are at or below...