acquiring lexical generalizations from corpora

Tài liệu Báo cáo khoa học: "Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations" pdf

... percentages are the aver- age of the judges' individual classifications. 399 Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations Maria Lapata School of Cognitive ... threshold values varied from frame to flame but not from verb to verb and were determined by taking into account for each frame its overall frame frequency which was es- timated from the COMLEX subcategorization ... alternating verbs from large balanced corpora by using partial- parsing methods and taxonomic information, and discuss how corpus data can be used to quantify linguistic generalizations. ...

Ngày tải lên: 20/02/2014, 19:20

8 484 0

Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx

... automatic acquisition of lexical information from large repositories of unannotated text (such as the web, corpora of published text, etc.) is starting to produce large scale lexical resources ... paper describes a novel system for acquiring adjectival subcategorization frames (SCFs) and associated frequency information from English corpus data. The system incorporates a decision-tree classiﬁer ... frames from untagged text. In Meet- ing of the Association for Computational Linguistics, pages 209–214. E. J. Briscoe and J. Carroll. 1997. Automatic Extraction of Subcategorization from Corpora. ...

Ngày tải lên: 08/03/2014, 04:22

8 390 0

Báo cáo khoa học: "Acquiring a Lexicon from Unsegmented Speech" potx

Ngày tải lên: 08/03/2014, 07:20

3 315 0

Báo cáo khoa học: "A System for Large-Scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora" pot

Ngày tải lên: 17/03/2014, 04:20

8 551 0

Báo cáo khoa học: "Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II Treebank" potx

Ngày tải lên: 17/03/2014, 06:20

8 405 0

Báo cáo khoa học: "Improving the Accuracy of Subcategorizations Acquired from Corpora" pdf

Ngày tải lên: 17/03/2014, 06:20

6 317 0

Báo cáo khoa học: "AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA" doc

Ngày tải lên: 23/03/2014, 20:20

8 342 0

Báo cáo khoa học: "Deriving Generalized Knowledge from Corpora using WordNet Abstraction" pdf

Ngày tải lên: 24/03/2014, 03:20

9 257 0

Báo cáo khoa học: "ACQUIRING DISAMBIGUATION RULES FROM TEXT" docx

Ngày tải lên: 31/03/2014, 18:20

8 239 0

Free From Corporate America A Tactical Guide to Success on Your Own Terms By Jon Reed pptx

Ngày tải lên: 27/06/2014, 23:20

129 247 0

A STUDY ON LEXICAL COHESION IN VIETNAMESE AND ENGLISH CORPORATE ADVERTISINGS

... CHAPTER 3: A COMPARATIVE STUDY ON LEXICAL COHESIVE DEVICES IN ENGLISH AND VIETNAMESE CORPORATE ADVERTISEMENTS 1. General picture of lexical cohesive devices in Corporate advertisements Understanding ... clearly and in details. However, in Vietnamese corporate advertisements, the copywriters hardly notice the equivalent lexical items that are rendered from Vietnamese into English or vice versus. ... description of lexical cohesion features in English - figure out how these devices are used in texts - make comparative analysis of lexical cohesion between English and Vietnamese corporate advertisements...

Ngày tải lên: 29/01/2014, 10:43

41 1,1K 2

Tài liệu Báo cáo khoa học: "Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques" doc

... Singapore, 4 August 2009. c 2009 ACL and AFNLP Extracting Comparative Sentences from Korean Text Documents Us- ing Comparative Lexical Patterns and Machine Learning Techniques Seon Yang Department ... comparative sentences from text documents. This paper first investigates many comparative sentences referring to pre- vious studies and then defines a set of comparative keywords from them. A sentence ... to eliminate non- comparative sentences only from comparative sentence candidates with a CKL2 keyword. 4 Eliminating Non-comparative Sen- tences from the Candidates 3 As you can see in...

Ngày tải lên: 20/02/2014, 09:20

4 536 0

Tài liệu Báo cáo khoa học: "Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora" ppt

... (Cucerzan and Yarowsky, 1999) and (Collins and Singer, 1999) present algorithms to obtain NEs from untagged corpora. However, they focus on the classiﬁcation stage of already segmented entities, and ... feature vector from this example in the following manner: First, we split both words into all possible substrings of up to size two: We build a feature vector by coupling substrings from the two ... Computational Linguistics Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora Alexandre Klementiev Dan Roth Dept. of Computer Science University of Illinois Urbana,...

Ngày tải lên: 20/02/2014, 12:20

8 392 0

Tài liệu Báo cáo khoa học: "Mining metalinguistic activity in corpora to create lexical resources using Information Extraction techniques: the MOP system" doc

... information from free-text has been successfully carried out in the past (Hearst, 1999; Manning, 1993), automatically extracting lexical resources (including terminologi- cal definitions) from text ... information from a machine-readable dictionary. 3 Locating metalinguistic information in text: two approaches When implementingan IE application to mine metalinguistic information from text, ... tackle is how to obtain a reliable set of can- didate sentences from free text for input into the next phases of extraction. From our initial corpus analysis we selected 44 patterns that showed...

Ngày tải lên: 20/02/2014, 15:20

8 459 0

Tài liệu Báo cáo khoa học: "Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval" pptx

... Japanese-English language pair, especially if involving the comparable corpora. Re-scoring through the Comparable Corpora Comparable corpora could be considered for the disambiguation of translation ... comparable corpora- based techniques, re- spectively compared to the hybrid two-stages comparable corpora and linguistics-based pruning. The proposed approach based on bi-directional comparable corpora ... TR2-007. P. Fung. 2000. A Statistical View of Bilingual Lexi- con Extraction: From Parallel Corpora to Non-Parallel Corpora. In Jean Veronis, Ed. Parallel Text Process- ing. G. Grefenstette. 1999....

Ngày tải lên: 20/02/2014, 16:20

4 377 0

Tài liệu Báo cáo khoa học: "INSIDE-OUTSIDE REESTIMATION FROM PARTIALLY BRACKETED CORPORA" ppt

... ( (from SF0) (to San Francisco))))).) GR (Tell ((me (((about the) public) transportation)) ( (from SF0) ((to San) (Francisco .))))) GB ((Tell (me (about (((the public) transportation) ( (from ... corpus, the inside prob- abilities of longer spans of c are computed from INSIDE-OUTSIDE REESTIMATION FROM PARTIALLY BRACKETED CORPORA Fernando Pereira 2D-447, AT~zT Bell Laboratories PO Box ... inferred from raw text. In addition, the number of iterations needed to reach a good grammar can be reduced; in extreme cases, a good solution is found from parsed text but not from raw text....

Ngày tải lên: 20/02/2014, 21:20

8 285 0

Tài liệu Báo cáo khoa học: "A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora" doc

... nouns or proper nouns is converted from their positions in the text into a vector. 3. Match pairs of positional difference vectors~ giving scores. All vectors from English and Chinese are matched ... dim(V2) 240 A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Pascale Fung Computer Science Department Columbia University New York, NY ... in the texts. For every word pair from this lexicon, we had ob- tained a DTW score and a DTW path. If we plot the points on the DTW paths of all word pairs from the lexicon, we get a graph...

Ngày tải lên: 20/02/2014, 22:20

8 427 0

Tài liệu Báo cáo khoa học: "Creating a Multilingual Collocation Dictionary from Large Text Corpora" docx

... Data from Bilingual Texts. In Pro- ceedings of the First International Lexical Acquisition Workshop, Detroit. Church, K., Gale, W., Hanks, P., and Hindle, D. (1991). Using Statistics in Lexical ... linguistic analysis. The originality of our approach comes from the fact that collocations are not extracted from raw texts, but rather from syntactically parsed texts. The linguistic analysis ... textual corpora from the World Trade Organisation (WTO), which consist in parallel documents in three languages: English, French and Spanish. All the examples given in this paper are taken from...

Ngày tải lên: 22/02/2014, 02:20

4 479 0

Tài liệu Báo cáo khoa học: "Effect of Cross-Language IR in Bilingual Lexicon Acquisition from Comparable Corpora" pot

... translation knowledge acquisition from WWW news sites, this paper studies issues on the effect of cross-language retrieval of relevant texts in bilingual lexicon acquisition from comparable corpora. We experimentally ... parallel/comparative corpora. However, the sizes as well as the domain of existing parallel/comparative corpora are lim- ited, while it is very expensive to manually col- lect parallel/comparative corpora. ... translation knowledge acquisition from parallel/comparative corpora, various kinds of translation knowledge are acquired. Within this framework of translation knowledge acquisition from WWW news sites, this...

Ngày tải lên: 22/02/2014, 02:20

8 477 0

Báo cáo khoa học: "Prototyping virtual instructors from human-human corpora" pdf

... this paper we presented a novel algorithm for rapidly prototyping virtual instructors from human- human corpora without manual annotation. Using our algorithm and the GIVE corpus we have gener- ated ... sum, this paper presents a novel way of automatically prototyping task-oriented virtual agents from corpora who are able to effectively and natu- rally help a user complete a task in a virtual ... world. References Sudeep Gandhe and David Traum. 2007. Creating spo- ken dialogue characters from corpora without annota- tions. In Proceedings of Interspeech, Belgium. Andrew Gargett, Konstantina...

Ngày tải lên: 07/03/2014, 22:20

6 220 0