... the 1996 NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... n-gram Language Modeling, or PRLM (Zissman, 1996) . Orthographic forms of language, ranging from Latin alphabet to Cyrillic script to Chinese charac-ters, are far more unique to the language ... bounds of written language. All of this makes the identification of spoken language based on pho-netic units much more challenging than the identi-fication of written language. In fact, the...