Báo cáo khoa học: "An Algorithm for Unsupervised Transliteration Mining with an Application to Word Alignment" ppt

Báo cáo khoa học: "An Algorithm for Unsupervised Transliteration Mining with an Application to Word Alignment" ppt

Báo cáo khoa học: "An Algorithm for Unsupervised Transliteration Mining with an Application to Word Alignment" ppt

... transliteration mining algorithm on three tasks: transliteration mining from Wikipedia InterLanguage Links, transliteration mining from parallel corpora, and word alignment using a word aligner with a transliteration ... like- lihood of the non -transliteration pairs. Instead we want to optimize the transliteration performance for test data. Secondly, it is easy...
Ngày tải lên : 23/03/2014, 16:20
  • 10
  • 320
  • 0
Tài liệu Báo cáo khoa học: "Machine Learning for Coreference Resolution: From Local Classification to Global Ranking" ppt

Tài liệu Báo cáo khoa học: "Machine Learning for Coreference Resolution: From Local Classification to Global Ranking" ppt

... in- stances than McCarthy and Lehnert’s. Specifically, a positive instance is created for each anaphoric NP, NP , and its closest antecedent, NP ; and a negative instance is created for NP paired with ... Learning to Rank Candidate Partitions We train an SVM-based ranker for ranking candidate partitions by means of Joachims’ (2002) SVM package, with all the parameters set to th...
Ngày tải lên : 20/02/2014, 15:20
  • 8
  • 518
  • 1
Báo cáo khoa học: "Ensemble Methods for Unsupervised WSD" doc

Báo cáo khoa học: "Ensemble Methods for Unsupervised WSD" doc

... Oxford Collocations, the Longman Language Activator, and collocation web sites). Each collocation is mapped to the WordNet sense inventory in a semi-automatic manner (Navigli, 2005) and transformed ... outperforming any single method in every frequency band and that the rank-based ensemble consistently outperforms Similarity and SSI in all bands. Although Similar- ity has an advantage ove...
Ngày tải lên : 08/03/2014, 02:21
  • 8
  • 343
  • 0
Báo cáo khoa học: "Clustering Clauses for High-Level Relation Detection: An Information-theoretic Approach" pdf

Báo cáo khoa học: "Clustering Clauses for High-Level Relation Detection: An Information-theoretic Approach" pdf

... since many nouns can only play a certain part in the clause (for instance, many verbs can- not have an inanimate entity as their subject). The number of instances of patterns we found for the anchored ... statistics and dependencies within the se ntence, correlates with a purely semantic similarity as represented by the WordNet struc- ture, and cannot be attributed to chance. Figure...
Ngày tải lên : 08/03/2014, 02:21
  • 8
  • 261
  • 0
Báo cáo khoa học: "Minimized Models for Unsupervised Part-of-Speech Tagging" pot

Báo cáo khoa học: "Minimized Models for Unsupervised Part-of-Speech Tagging" pot

... Bayesian framework and show how EM can be used to learn good POS taggers for Hebrew and English, when provided with good initial conditions. They use language specific information (like word contexts, syntax ... In such cases, any word not appearing in the dictionary will be treated as an unknown word, and can be labeled with any of the tags from given tagset (i.e., for every...
Ngày tải lên : 17/03/2014, 01:20
  • 9
  • 375
  • 0
Báo cáo khoa học: "A Framework for Unsupervised Natural Language Morphology Induction" docx

Báo cáo khoa học: "A Framework for Unsupervised Natural Language Morphology Induction" docx

... score the potential morphological variants with a semantic distance. Word forms with small seman- tic distance are proposed as morphological variants of one anther. Goldsmith (2001), by searching ... morphological analysis of the language(s) at hand. The task of a morphological analyzer is to identify the lexeme, citation form, or inflection class of surface word forms in a l...
Ngày tải lên : 17/03/2014, 06:20
  • 6
  • 500
  • 0
Báo cáo khoa học: "Statistical Models for Unsupervised Prepositional Phrase Attachment" pdf

Báo cáo khoa học: "Statistical Models for Unsupervised Prepositional Phrase Attachment" pdf

... occurs within K words to the left of p • No verb occurs within K words to the left of p • n2 is the first noun that occurs within K words to the right of p • No verb occurs between p and n2 ... difficult to port to other languages because they re- quire resources that are expensive to construct or simply nonexistent in other languages. We present an unsupervised algo...
Ngày tải lên : 17/03/2014, 07:20
  • 7
  • 333
  • 0
Báo cáo khoa học: "Annealing Techniques for Unsupervised Statistical Language Learning" ppt

Báo cáo khoa học: "Annealing Techniques for Unsupervised Statistical Language Learning" ppt

... increased variance of the per- formance of any algorithm so ten small corpora were not enough to determine whether to expect an improvement from DA more often than not. 4.3 Mixing labeled and unlabeled ... but is too lengthy to give here; it is a straightforward extension of that given by Neal and Hinton for EM. It is clear that the value of β allows us to manip- ulate the relati...
Ngày tải lên : 23/03/2014, 19:20
  • 8
  • 242
  • 0
Tài liệu Báo cáo khoa học: Structural basis for cyclodextrin recognition by Thermoactinomyces vulgaris cyclo⁄maltodextrin-binding protein ppt

Tài liệu Báo cáo khoa học: Structural basis for cyclodextrin recognition by Thermoactinomyces vulgaris cyclo⁄maltodextrin-binding protein ppt

... maltooligo- saccharides, such as maltose, maltotriose and malto- tetraose [17], and this form is capable of interacting with the MalFGK 2 sugar-transporter complex. In contrast, the open form ... found to form the same contacts with TvuCMBP in Mol-A–D. The rmsd between Mol-A and Mol-B, Mol-A and Mol-C, Mol-A and Mol-D are 0.77, 0.94, and 0.74 A ˚ , respect- ively, for all atoms, and 0...
Ngày tải lên : 19/02/2014, 00:20
  • 12
  • 540
  • 0
Tài liệu Báo cáo khoa học: "Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules" doc

Tài liệu Báo cáo khoa học: "Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules" doc

... Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules Qin Gao and Stephan Vogel Language Technologies Institute, Carnegie Mellon University 5000 Forbes Avenue Pittsburgh, ... corpus. We used an SVM classifier with features derived from standard phrase based translation models and bilingual lan- guage models to identify high quality sentence pairs,...
Ngày tải lên : 20/02/2014, 04:20
  • 5
  • 416
  • 0

Xem thêm