Báo cáo khoa học: "Linguistic Knowledge Acquisition from Parsing Failures" pot

Báo cáo khoa học: "Linguistic Knowledge Acquisition from Parsing Failures" pot

Báo cáo khoa học: "Linguistic Knowledge Acquisition from Parsing Failures" pot

... $un-ichi Tsujii, et al. Linguistic knowledge acquisition from corpora. In Proc. of 2nd FGNLP, pages 61-81, UMIST, 1992. 231 Linguistic Knowledge Acquisition from Parsing Failures Masaki KIYONO* ... of Hypotheses 4.1 Hypothesizing Grammar Rules from Parsing Failures When the parser fails to analyse a sentence, the grammar rule hypothesizing program (shortly GRHP) inve...

Ngày tải lên: 24/03/2014, 05:21

10 279 0
Tài liệu Báo cáo khoa học: "Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval" pptx

Tài liệu Báo cáo khoa học: "Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval" pptx

... on bilingual ter- minology acquisition and disambiguation from com- parable corpora (Sadat et al., 2003) is described as follows: - Bilingual terminology acquisition from source language to target ... the application to knowledge acquisition, such as bilin- gual terminology. In addition, non-aligned com- parable corpora have been given a special inter- est in bilingual terminology...

Ngày tải lên: 20/02/2014, 16:20

4 377 0
Báo cáo khoa học: "Learning Common Grammar from Multilingual Corpus" potx

Báo cáo khoa học: "Learning Common Grammar from Multilingual Corpus" potx

... languages from non-parallel multilingual corpora in an unsupervised fashion. For this purpose, we assume a generative model for multilingual corpora, where each sentence is generated from a language ... borrowing from nearby languages, and 3) the innate abilities of humans (Chomsky, 1965). We assume hidden commonalities in syntax across languages, and try to extract a common grammar fr...

Ngày tải lên: 07/03/2014, 22:20

5 326 0
Báo cáo khoa học: "Acquiring a Lexicon from Unsegmented Speech" potx

Báo cáo khoa học: "Acquiring a Lexicon from Unsegmented Speech" potx

... 02139, USA cgdemarc@ai.mit.edu Abstract We present work-in-progress on the ma- chine acquisition of a lexicon from sen- tences that are each an unsegmented phone sequence paired with a primitive ... the problem for child language acquisition and computer speech recognition. 1 Introduction We are interested in how a lexicon of discrete words can be acquired from continuous s...

Ngày tải lên: 08/03/2014, 07:20

3 315 0
Báo cáo khoa học: "Automating the Acquisition of Bilingual Terminology" potx

Báo cáo khoa học: "Automating the Acquisition of Bilingual Terminology" potx

... automate the acquisition of the necessary lexical knowledge, viz. determining which nouns are likely to take PP complements, but our corpus is too small for this type of knowledge acquisition. ... removed from the collection because the trans- lation of some occurrences of these terms turned out to be incorrect, very indirect, simply missing from the text, or because they suff...

Ngày tải lên: 09/03/2014, 01:20

7 297 0
Báo cáo khoa học: "Building Emotion Lexicon from Weblog Corpora" potx

Báo cáo khoa học: "Building Emotion Lexicon from Weblog Corpora" potx

... Blog from January to July, 2006, spanning a period of 212 days. In total, 336,161 bloggers’ articles were col- lected. Each blogger posts 16 articles on average. We used the articles from ... and emotions using weblog corpora. A collocation model is proposed to learn emotion lexicons from weblog articles. Emotion classification at sentence level is experimented by using the mined...

Ngày tải lên: 17/03/2014, 04:20

4 302 0
Báo cáo khoa học: "Extracting Hypernym Pairs from the Web" potx

Báo cáo khoa học: "Extracting Hypernym Pairs from the Web" potx

... hierarchies from evidence collected from con- junctions. Pantel, Ravichandran and Hovy (2004) learned syntactic patterns for identifying hypernym relations and combined these with clusters built from ... hypernym relations from the web. We compare our approach with hypernym ex- traction from morphological clues and from large text corpora. We show that the abun- dance of available...

Ngày tải lên: 17/03/2014, 04:20

4 395 0
Báo cáo khoa học: "Extracting Social Networks from Literary Fiction" pot

Báo cáo khoa học: "Extracting Social Networks from Literary Fiction" pot

... the following cate- gories: authorial (novels from the major canoni- 140 cal authors of the period), historical (novels from each decade), generic (from the major sub-genres of nineteenth-century ... the texts from Project Gutenberg. All told, these texts total more than 10 million words. We assembled this representative corpus in or- der to test two hypotheses, which are derived from...

Ngày tải lên: 23/03/2014, 16:20

10 336 0
Báo cáo khoa học: "Learning Bilingual Lexicons from Monolingual Corpora" pot

Báo cáo khoa học: "Learning Bilingual Lexicons from Monolingual Corpora" pot

... Ohio, USA, June 2008. c 2008 Association for Computational Linguistics Learning Bilingual Lexicons from Monolingual Corpora Aria Haghighi, Percy Liang, Taylor Berg-Kirkpatrick and Dan Klein Computer ... aria42,pliang,tberg,klein }@cs.berkeley.edu Abstract We present a method for learning bilingual translation lexicons from monolingual cor- pora. Word types in each language are charac- te...

Ngày tải lên: 31/03/2014, 00:20

9 300 0
Báo cáo khoa học: "Weakly-Supervised Acquisition of Open-Domain Classes and Class Attributes from Web Documents and Query Logs" pot

Báo cáo khoa học: "Weakly-Supervised Acquisition of Open-Domain Classes and Class Attributes from Web Documents and Query Logs" pot

... necessarily labeled, from unstructured text. The extraction proceeds either iteratively by starting from a few seed ex- traction rules (Collins and Singer, 1999), or by mining named entities from comparable ... more difficult to identify in text. 4.2 Acquisition of Class Attributes Previous work on the automatic acquisition of at- tributes for open-domain classes from text is less...

Ngày tải lên: 08/03/2014, 01:20

9 447 0
w