Scientific report: "Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study"

... Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study Hwee Tou Ng, Bin Wang, Yee Seng Chan Department of Computer Science National ... of senses before and after sense lumping is 5.07 and 3.52 respectively. After sense lumping, we trained a WSD classifier for each noun w, by using the lumped senses in the manually sense-tagged ... transl...

Uploaded: 08/03/2014, 04:22

Scientific report: "Learning Expressive Models for Word Sense Disambiguation"

... using one's feet” and “to direct or control”. WSD can be useful for many applications, including information retrieval, information extraction and machine translation. Sense ambiguity has ... (Resnik and Yarowsky, 1997). For example, in machine translation, WSD, or translation disambiguation, is responsible for identifying the correct translation for an ambiguous sour...

Uploaded: 08/03/2014, 02:21

Document: Scientific report: "REPRESENTATION OF TEXTS FOR INFORMATION RETRIEVAL"

... OF TEXTS FOR INFORMATION RETRIEVAL N.J. Belkin, B.G. Michell, and D.G. Kuehner University of Western Ontario The representation of whole texts is a major concern of the field known as information ... the following: a. A user, recognizing an information need, presents to an IR mechanism (i.e., a collection of texts, with a set of associated activities for representing, stor-...

Uploaded: 21/02/2014, 20:20

Scientific report: "Exploiting Web Redundancy for Answer Validation"

... answer. As an example, (Harabagiu and Maiorano, 1999) describes answer validation as an abductive inference process, where an answer is valid with respect to a question if an explanation for it, ... the question words influence the appearance of answer words. Therefore, we introduce additional linguistic techniques for pattern and query formulation, such as keyword extraction, an...

Uploaded: 08/03/2014, 07:20

Document: Scientific report: "Conditional Random Fields for Word Hyphenation"

... for Word Hyphenation Nikolaos Trogkanis Computer Science and Engineering University of California, San Diego La Jolla, California 92093-0404 tronikos@gmail.com Charles Elkan Computer Science and ... conditional random fields. We create new training sets for English and Dutch from the CELEX European lexical resource, and achieve error rates for English of less than 0.1% for correc...

Uploaded: 20/02/2014, 04:20

Document: Scientific report: "Head-Driven Parsing for Word Lattices"

... Journal treebank and lattice corpora show word error rates competitive with the standard n-gram language model while extracting additional structural information useful for speech understanding. 1 ... training. 1 corpus are annotated with trigram scores trained using a 20 thousand word vocabulary and 40 million word training sample. The word lattices have a unique start and end...

Uploaded: 20/02/2014, 15:21

Scientific report: "Soft Syntactic Constraints for Word Alignment through Discriminative Training"

... Similarly, any index x ∉ [i, k] is external to T[i,k]. An invalid span is any span for which our provided tree [Figure 3: Illustration of invalid spans.] [j  , j] and [j, ... alignment be the complete structure that connects two parallel sentences, and a link be one of the word-to-word connections that make up an alignment. All word alignment meth...

Uploaded: 08/03/2014, 02:21

Scientific report: "A STOCHASTIC PROCESS FOR WORD FREQUENCY DISTRIBUTIONS"

... the hum- [Table i: Spearman rank correlation analysis of the neighborhood density and frequency effects for empirical and theoretical words of length 4. Headers: Dutch, Mand., Mand Simon, dens., freq.] ... function words excluded, and charts the lexical similarity effects of the subset of words with length 4 by means of boxplots. These show the mean (dotted line), the median, the upper and lowe...

Uploaded: 08/03/2014, 07:20

Scientific report: "AMBIGUITY RESOLUTION IN THE HUMAN SYNTACTIC PARSER: AN EXPERIMENTAL STUDY"

... natural form of a parser which utilizes abandonment would be an IPA model. The construction of more than one analysis for an ambiguity would trigger the parser to throw out the analyses and wait ... other sort can be called strong parallelism, in which the possible analyses can stay active and be expanded as new input is received. If further input is inconsistent with any of the...

Uploaded: 08/03/2014, 18:20

Document: Scientific report: "Using Confidence Bands for Parallel Texts Alignment"

... the European Union, where texts must be translated daily into eleven languages, or even in the U.S.A. where Spanish and English speaking communities are intermingled. Parallel texts (texts that ... varies according to language similarity. For instance, on average, it is higher for Portuguese–Spanish than for Portuguese–English. These words end up being mainly numbers and names....

Uploaded: 20/02/2014, 18:20
