... each other ina variety of ways, including their topic, the read-ing level of their intended audience, and their in-tended purpose (eg, to instruct, to inform, to ex-press an opinion, to summarize, ... There is no one “right set”.3 Genre in the Penn TreeBankAlthough the files in the Penn TreeBank (PTB)lack any classificatory meta-data, leading the PTB to be treated as a single homogeneous collectionof ... meta-data in the PTB files1, Ilooked at line-level patterns in the 2159 files thatmake up the Penn Discourse TreeBank subset of the PTB, and then manually confirmed the texttypes I found.2 The resulting...