... j-th model feature indexed as in the baseline system. 3.5 Integration with Baseline Translation System Since the obtained translation entries for abbreviations have the same format as the regular ... corpora by treating the Chinese translations obtained in Step-2 as full-form phrases; • Step-4: induce translation entries for Chinese abbreviations by using their full-form ph...
Uploaded: 20/02/2014, 09:20
... supervised/semi-supervised information for learning. We propose a flexible generative model for transliteration mining usable for both unsupervised and semi-supervised learning. Previous work on ... transliteration mining. In Proceedings of the 2010 Named Entities Workshop, Uppsala, Sweden. Haizhou Li, Zhang Min, and Su Jian. 2004. A joint source-channel model for machine...
Uploaded: 19/02/2014, 19:20
Scientific report: "An Unsupervised Model for Joint Phrase Alignment and Extraction" ppt
... DeNero and Dan Klein. 2010. Discriminative modeling of extraction sets for machine translation. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages ... training regimen up to Model 4, and combine alignments with grow-diag-final-and. For the proposed models, we train for 100 iterations, and use the final sample acquired at the end of...
Uploaded: 20/02/2014, 04:20
Scientific report: "Minimum Cut Model for Spoken Lecture Segmentation" ppt
... Advances in domain independent linear text segmentation. In Proceedings of the NAACL, 26–33. K. W. Church. 1993. Char_align: A program for aligning parallel texts at the character level. In Proceedings ... lectures is used for estimating the optimal word block length for representing nodes, the threshold distances for discarding node edges, the number of uniform chunks for es...
Uploaded: 20/02/2014, 11:21
Scientific report: "Paraphrasing Using Given and New Information in a Question-Answer System" docx
... questions. The remaining elements are given information. They represent information assumed by the questioner to be true of the database domain. This labeling of information within the question ... Look for a division of the computing facility.* In question (D), information belonging to each of the three categories occurred in the question. If one of these types of informati...
Uploaded: 21/02/2014, 20:20
Scientific report: "REPRESENTATION OF TEXTS FOR INFORMATION RETRIEVAL" pdf
... modifications in improving the original representations. Reference 1. Belkin, N.J., Brooks, H.M., and Oddy, R.N. 1979. Representation and classification of knowledge and information for use in interactive ... "setting of context; summarizing; concept foregrounding; and stylistic variation. Textual characteristics which correspond with these aspects include discourse-ini...
Uploaded: 21/02/2014, 20:20
Scientific report: "A HARDWARE ALGORITHM FOR HIGH SPEED MORPHEME EXTRACTION AND ITS IMPLEMENTATION" pptx
... on the machine in linear time for the number of candidates, while conventional sequential algorithms are implemented in combinational time. 1 INTRODUCTION Recent advancement in natural ... as follows. • Main procedure Step 1: Load the top N characters from the input text into the character registers in the shift register block. Sub-Strings Key String for Multiple...
Uploaded: 21/02/2014, 20:20
Scientific report: "A Shallow Model of Backchannel Continuers in Spoken Dialogue" potx
... below 400ms, and increasing the threshold value in increments of 100ms. Table 2 shows the values for the highest performing models. The model that only inserts continuers in pauses over 900 ... passed up. The models were run on previously unseen test data, the results of which can be seen in Table 5. All models improved on the training models. The baseline model was the worst...
Uploaded: 22/02/2014, 02:20
Scientific report: "Syntactic Phrase Reordering for English-to-Arabic Statistical Machine Translation" pptx
... Proc. of ACL. Jenny Rose Finkel, Trond Grenager, and Christopher Manning. 2005. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In Proc. of ACL. Nizar ... Yonggang Deng. 2007. Joint Morphological-Lexical Language Modeling for Machine Translation. In Proc. of NAACL HLT. Kristina Toutanova, Dan Klein, Christopher Manning, and Yoram Singer. 200...
Uploaded: 22/02/2014, 02:20
Scientific report: "Bilingual Sense Similarity for Statistical Machine Translation" ppt
... Toolkit for Parsing-based Machine Translation. In: Proceedings of the WMT. March. Athens, Greece. D. Lin. 1998. Automatic retrieval and clustering of similar words. In: Proceedings of COLING/ACL-98. ... Selection for SMT. In: Goutte et al (ed.), Learning Machine Translation. MIT Press. K. Gimpel and N. A. Smith. 2008. Rich Source-Side Context for Statistical Machine Tran...
Uploaded: 20/02/2014, 04:20