Báo cáo khoa học: "A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization" potx

Báo cáo khoa học: "A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization" potx

Báo cáo khoa học: "A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization" potx

... both used as features in Chinese text process- ing tasks, but no systematic comparison or analysis of their values as features for Chinese text categorization has been re- ported heretofore. We ... Linguistics A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization Jingyang L...

Ngày tải lên: 08/03/2014, 02:21

8 493 0
Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output" doc

Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output" doc

... informa- tion, and then an item for each menu containing: • The name of the menu • A list of menu items, containing: – Display name – Internal name (used in annotation file, and internally in BLAST) – ... especially in combination with a part -of- speech analysis (Popovi ´ c et al., 2006). Human evaluation is also often quantitative, for instance in the form of estimates...

Ngày tải lên: 07/03/2014, 22:20

6 479 0
Báo cáo khoa học: "A System for Semantic Analysis of Chemical Compound Names" pdf

Báo cáo khoa học: "A System for Semantic Analysis of Chemical Compound Names" pdf

... these tasks. Krauthammer and Nenadic (2004) divide the identification task into the subtasks of term recognition (marking the interesting words in a text) , term classification (classifying them ... ver- tices and its set of edges. Therefore, the domain of a graph consists of a set of possible vertices, in our case for the atoms, and possible edges, in our case for the...

Ngày tải lên: 08/03/2014, 01:20

9 479 0
Báo cáo khoa học: "A Comparison of Document, Sentence, and Term Event Spaces" potx

Báo cáo khoa học: "A Comparison of Document, Sentence, and Term Event Spaces" potx

... ab- stracts, and the full -text IDF (see section 4.4). 4.4 Abstract vs full text comparison Although abstracts are often easier to obtain, the availability of full -text documents continues to increase. ... This comparison reflects a previous analysis comprising a random sample of 193 words from a 50 million word corpus of 85,432 news articles (Church and Gale 19...

Ngày tải lên: 17/03/2014, 04:20

8 355 0
Báo cáo khoa học: "A Comparison of Loopy Belief Propagation and Dual Decomposition for Integrated CCG Supertagging and Parsing" potx

Báo cáo khoa học: "A Comparison of Loopy Belief Propagation and Dual Decomposition for Integrated CCG Supertagging and Parsing" potx

... Computational Linguistics. J. R. Finkel, C. D. Manning, and A. Y. Ng. 2006. Solv- ing the problem of cascading errors: Approximate Bayesian inference for linguistic annotation pipelines. In Proc. of EMNLP. J. ... Forest Reranking: Discriminative pars- ing with Non-Local Features. In Proceedings of ACL- 08: HLT. W. Jiang, L. Huang, Q. Liu, and Y. L ¨ u. 2008. A cas- caded linear...

Ngày tải lên: 23/03/2014, 16:20

11 395 0
Báo cáo khoa học: "A Comparison of Head Transducers and Transfer for a Limited Domain Translation Application" pptx

Báo cáo khoa học: "A Comparison of Head Transducers and Transfer for a Limited Domain Translation Application" pptx

... translation. In the case of text translation for publishing, it is reasonable to adopt economic measures of the Fei Xia Department of Computer and Information Science University of Pennsylvania ... additional source of counts used in the trans- fer system was an unsupervised training method in which 13000 training utterances were translated from English to Chinese,...

Ngày tải lên: 24/03/2014, 03:21

6 324 0
Báo cáo khoa học: "A comparison of clausal coordinate ellipsis in Estonian and German: Remarkably similar elision rules allow a language-independent ellipsis-generation module" pot

Báo cáo khoa học: "A comparison of clausal coordinate ellipsis in Estonian and German: Remarkably similar elision rules allow a language-independent ellipsis-generation module" pot

... Nor do we deal with recasts of clausal coordina- tions as coordinate NPs (e.g., John likes skating and Peter likes skiing becoming John and Peter like skating and ski- ing, respectively). Presumably, ... and Pseudogapping because they involve the generation of pro-forms instead of, or in addi- tion to, the ellipsis proper. For example, John laughed, and Mary did, too—a c...

Ngày tải lên: 31/03/2014, 20:20

4 322 0
Tài liệu Báo cáo khoa học: "A Comparison of Alternative Parse Tree Paths for Labeling Semantic Roles" ppt

Tài liệu Báo cáo khoa học: "A Comparison of Alternative Parse Tree Paths for Labeling Semantic Roles" ppt

... Aligning arguments to parse trees nodes in a training / testing corpus We began our investigation by creating a training and testing corpus of 400 sentences each contain- ing an inflection of ... ate:eat,V,i↓He:he,N,s Minipar B: A second parse tree path encoding was generated from Minipar parses that relaxes some of the constraints used in Minpar A. In- stead of using...

Ngày tải lên: 20/02/2014, 12:20

8 520 0
Tài liệu Báo cáo khoa học: "A SPEECH-FIRST MODEL FOR REPAIR DETECTION AND CORRECTION" docx

Tài liệu Báo cáo khoa học: "A SPEECH-FIRST MODEL FOR REPAIR DETECTION AND CORRECTION" docx

... found cases of 'lengthened' intonational phrases in repair intervals, as illustrated in the single-phrase reparandum in (8), where the corresponding fluent ver- sion of the reparandum ... acoustic-phonetic and prosodic analysis of a cor- pus of repairs in spontaneous speech, indicating that reparanda offsets end in word fragments, usually of (in- tende...

Ngày tải lên: 20/02/2014, 21:20

8 502 0
Tài liệu Báo cáo khoa học: "A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora" doc

Tài liệu Báo cáo khoa học: "A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora" doc

... better initializing basis for EM methods. It has also shown promise for finding noun phrases in English and Chinese, as well as finding new Chinese words which were not tokenized by a Chinese ... such as follows: • finding Chinese words: Chinese texts do not have word boundaries such as space in English, therefore our text was tokenized into words by a stat...

Ngày tải lên: 20/02/2014, 22:20

8 427 0
w