Scientific report: "Yet Another Language Identifier" (pdf)

... the same methods used on 1 MB of data. 3.2 Language Diversity The increasing number of languages recognised by the system decreases language diversity. This may be another reason for the observed drop in ... [Figure labels: Language Families, Languages, Class 1, Class 2] Figure 1: Language diversity on Wikipedia. Languages are sorted according to their text corpus size. The first 52...

Uploaded: 24/03/2014, 03:20

Document, scientific report: "Yet Another Word Alignment Tool" (docx)

... matrix correspond to the words of the sentence in one language and the columns to the words of that sentence’s translation into the other language. Marks in the matrix’s cells indicate whether ... pages 20–23, Columbus, June 2008. © 2008 Association for Computational Linguistics Yawat: Yet Another Word Alignment Tool Ulrich Germann University of Toronto germann@cs.toronto.edu Abstract...

Uploaded: 20/02/2014, 09:20

Document, scientific report: "PANEL NATURAL LANGUAGE AND DATABASES" (pdf)

... PANEL NATURAL LANGUAGE AND DATABASES, AGAIN Karen Sparck Jones Computer Laboratory, University of Cambridge Corn Exchange Street, Cambridge CB2 3QG, England INTRODUCTION Natural Language and ... on solving all the problems of language and knowledge processing at once. More importantly, the task provides a hard, rather than soft, test environment for a language processor: the...

Uploaded: 21/02/2014, 20:20

Scientific report: "PLANNING NATURAL LANGUAGE EXPRESSIONS REFERRING" (pdf)

... NATURAL LANGUAGE REFERRING EXPRESSIONS Douglas E. Appelt SRI International Menlo Park, California ABSTRACT This paper describes how a language-planning system can produce natural-language ... able to the speaker. I. INTRODUCTION One of the most important constituent processes of natural-language generation is the production of referring expressions, which occur in almost...

Uploaded: 17/03/2014, 19:21

Document, scientific report: "How spoken language corpora can refine current speech motor training methodologies" (pptx)

... methodologies of speech and language therapy. In this paper, we present a novel approach for constructing speech motor exercises, based on linguistic knowledge extracted from spoken language corpora. ... the syllabic inventory of the spoken language. Besides the inventory of spoken syllables, we are interested in the distribution of syllables across the language. 3.1 Syllable f...

Uploaded: 20/02/2014, 04:20

Document, scientific report: "Incremental Syntactic Language Models for Phrase-based Translation" (pptx)

... Jelinek. 2000. Structured language modeling. Computer Speech and Language, 14(4):283–332. Stanley F. Chen and Joshua Goodman. 1998. An empirical study of smoothing techniques for language modeling. ... 172–181. Matt Post and Daniel Gildea. 2009. Language modeling with tree substitution grammars. In NIPS workshop on Grammar Induction, Representation of Language, and Language Lea...

Uploaded: 20/02/2014, 04:20

Document, scientific report: "The Natural Language Toolkit" (docx)

... USA Abstract The Natural Language Toolkit is a suite of program modules, data sets and tutorials supporting research and teaching in computational linguistics and natural language processing. NLTK ... pages 69–72, Sydney, July 2006. © 2006 Association for Computational Linguistics NLTK: The Natural Language Toolkit Steven Bird Department of Computer Science and Software Engineering U...

Uploaded: 20/02/2014, 12:20

Document, scientific report: "Discriminative Syntactic Language Modeling for Speech Recognition" (pdf)

... predominant approach within language modeling for speech recognition has been to use an n-gram language model, within the “source-channel” or “noisy-channel” paradigm. The language model assigns ... value that reflects the relative importance of the language model; β is typically chosen by optimization on held-out data. In an n-gram language model, a Markov assumption is made,...

Uploaded: 20/02/2014, 15:20

Document, scientific report: "A Phonotactic Language Model for Spoken Language Identification" (pptx)

... the 1996 NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... n-gram Language Modeling, or PRLM (Zissman, 1996). Orthographic forms of language, ranging from Latin alphabet to Cyrillic script to Chinese characters, are far more u...

Uploaded: 20/02/2014, 15:20

Document, scientific report: "A Declarative Language for Implementing Dynamic Programs" (pptx)

... + 46 inference rules), enabling research that he would not have been willing to undertake in another language. 6 Related Work This project tries to synthesize much folk wisdom. For NLP algorithms, ... U.S.A. {jason,eerat,nasmith}@cs.jhu.edu Abstract We present the first version of a new declarative programming language. Dyna has many uses but was designed especially for rapid developm...

Uploaded: 20/02/2014, 16:20
