Scientific report: "Syntactic Stylometry for Deception Detection" pptx

... the Association for Computational Linguistics, pages 171–175, Jeju, Republic of Korea, 8-14 July 2012. © 2012 Association for Computational Linguistics Syntactic Stylometry for Deception Detection Song ... relied only on shallow lexico-syntactic patterns. This paper investigates syntactic stylometry for deception detection, adding a somewhat unconventional angle to prior...

Uploaded: 07/03/2014, 18:20

5 441 0
Scientific report: "Syntactic Annotations for the Google Books Ngram Corpus" ppt

... words, resulting in somewhat less interesting examples for some POS tags. More typical examples might be big for adjectives, quickly for adverbs or read for verbs. As suggested in Petrov et al. (2012), ... היה Table 2: The two most common words for some POS tags in the new Google Books NGram Corpus for all languages. ngrams that span page boundaries. Instead, we perform tokenizat...

Uploaded: 23/03/2014, 14:20

6 395 0
Document - Scientific report: "Asynchronous Binarization for Synchronous Grammars" pptx

... Transforming the entire forest to n-ary form is intractable, however, because the number of hyperedges would be exponential in n. Instead, we include only the top k n-ary backtraces for each forest ... Collapsing Binarization To facilitate a change in binarization, we transform the translation forest into n-ary form. In the n-ary forest, each hyperedge corresponds to an original gramm...

Uploaded: 20/02/2014, 09:20

4 278 0
Scientific report: "Research Methodology for Machine Translation" pptx

... Equivalents. If the new form is a form of a word in the old glossary, the English equivalents previously used are carried forward. If no form of the word has occurred before, the linguist assigns ... different occurrences of the same form. Word: a form that represents a set of forms differing only in inflection. For example, "...

Uploaded: 07/03/2014, 18:20

8 281 0
Scientific report: "Compositional Semantics for Linguistic Formalisms" pptx

... for linguistic formalisms. For lack of space, some proofs are omitted; an extended version is available as a technical report (Wintner, 1999). 2 Grammar semantics Viewing grammars as formal ... of grammatical formalisms. For the following discussion fix a particular grammar G = (n,E,A~). Definition 5. Let Tc : Z → Z be a transformation on sets of items, where for every...

Uploaded: 08/03/2014, 06:20

8 305 0
Scientific report: "COMPUTING MACHINES FOR LANGUAGE TRANSLATION" pptx

... number of standard devices are available for coding the incoming text for insertion into the machine and, after the translation process is completed, for decoding and printing the translation ... its original form, the dissection scheme for endings and compounds can be tried; if this fails, the word can be printed through without alteration. Several methods are available for...

Uploaded: 16/03/2014, 19:20

6 360 0
Scientific report: "Phrase Clustering for Discriminative Learning" pptx

... paradigm for parallel programming (Dean and Ghemawat, 2004). The most straightforward MapReduce implementation of K-Means would be to have mappers perform Step ii and reducers perform Step ... have already collected enough data for the phrase. More data for the same phrase will not necessarily tell us anything more about it. There are two benefits for such an upper limit. Fir...

Uploaded: 23/03/2014, 16:21

9 355 0
Scientific report: "Clustering Adjectives for Class Acquisition" pptx

... previous lexical knowledge. Therefore, information on derivational morphology was not used at this stage (but it was for analysis; see Section 5.3). The values for each feature were set as true ... belonging to a formal register. The corpus had previously been automatically tagged and hand-corrected, with information on lemma, part-of-speech and other morphological information (follow...

Uploaded: 31/03/2014, 20:20

8 325 0
Document - Scientific report: "A Framework for Syntactic Translation" docx

... S1, which contains all of the information that is in the input sentence. The part of the information that is implicit in the sentence (tense, voice, and so forth) is made explicit in S1. ... certain German verb form into English, it is necessary to understand the German verb form as part of a complex of features of German structure including possibly other verb forms within t...

Uploaded: 19/02/2014, 19:20

7 509 1
Document - Scientific report: "Syntactic Phrase Reordering for English-to-Arabic Statistical Machine Translation" pptx

... model orders are 4-grams for the baseline system and 6-grams for the segmented Arabic. The average sentence length is 33 for English, 25 for non-segmented Arabic and 36 for segmented Arabic. To ... data size, we use a trigram LM for the baseline, and a 4-gram for segmented Arabic. In this case, the average sentence length is 9 for English, 8 for Arabic, and 10 for segm...

Uploaded: 22/02/2014, 02:20

8 377 0