Báo cáo khoa học: "Maximum Expected BLEU Training of Phrase and Lexicon Translation Models" pptx
... phrase- based system usually include LM, reordering model, word and phrase counts, and phrase and lexicon translation models. Given the focus of this paper, we review only the phrase and lexicon ... with the baseline, training phrase or lexicon models alone gives a gain of 0.7 and 0.5 BLEU points, respectively, on the test set. For a full training o...
Ngày tải lên: 23/03/2014, 14:20
... parent of x j . T = {(x t , y t )} T t=1 denotes the training data. We follow the edge based factorization method of Eisner (1996) and define the score of a dependency tree as the sum of the score of ... between a child and its parent. These features take the form of a POS trigram: the POS of the parent, of the child, and of a word in between, for all words linearly b...
Ngày tải lên: 20/02/2014, 15:20
... tri -training and self -training is near- significant (p <0.0150). It seems that tri -training with disagreement is a competitive technique in terms of accuracy. The main advantage of tri- training ... until none of c i changes 16: apply majority vote over c i Figure 1: Tri -training (Li and Zhou, 2005). 3.1 Tri -training with disagreement We introduce a possible improvemen...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty" potx
... for sec- tions 0-18 of the WSJ corpus). For the features, we used unigrams and bigrams of neighboring words, prefixes and suffixes of the current word, and some characteristics of the word. We also ... from subsets of the training data and updates the parameters in an online fashion. This learning framework is attractive because it often requires much less training time in...
Ngày tải lên: 17/03/2014, 01:20
Báo cáo khoa học: "Maximum Entropy Based Restoration of Arabic Diacritics" ppt
... integrate and make effective use of diverse types of information; the model we propose inte- grates a wide array of lexical, segment- based and part -of- speech tag features. The combination of these ... based on the context and their knowledge of the grammar and the lexicon of Arabic. However, a text without diacritics becomes a source of confu- sion for beginning read...
Ngày tải lên: 17/03/2014, 04:20
Báo cáo khoa học: "Maximum Entropy Model Learning of the Translation Rules" pot
... properties of a language model. The expected value of f with respected to iS(x, y) is defined such as: p(f) = p(x,y)f(x,y) (z) x,y Thus training data are summarized as the expected value of feature ... part -of- speech for English and 49 part -of- speech for Japanese). We tried to learn the translation rules from English to Japanese. We had two ex- periments: one of...
Ngày tải lên: 23/03/2014, 19:20
Tài liệu Báo cáo khoa học: "Cross-Domain Co-Extraction of Sentiment and Topic Lexicons" pdf
... set of nodes represents topic words, including new topic candi- dates and words in the lexicon C, and the other set of nodes represents sentiment words, including new sentiment candidates and ... topic words with the final scores, and add them to lexicons B and C. Update S 1 (w i ) and S 3 (w j ) accordingly. 8: end for 9: return Expanded lexicons B and C. 5.2 Graph Con...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: Mapping the functional domains of human transcobalamin using monoclonal antibodies pptx
... involved in the binding of Cbl [13,14]. Assembly of two domains achieved the composite structure of the ligand-binding site and built the compatible interface between IF and its receptor [14]. ... 3C4 and 2-2 ⁄ 3C4, where binding of the second mAb of the pair was inhibited by < 20%. Interaction of mAbs with the C-terminal domain of human TC was determined using peptides...
Ngày tải lên: 20/02/2014, 01:20
Tài liệu Báo cáo khoa học: Unraveling the catalytic mechanism of lactoperoxidase and myeloperoxidase A reflection on some controversial features Elena Ghibaudi and Enzo Laurenti docx
... features of the LPO and MPO catalytic cycle, such as the existence of Compound I and Compound II isomers and the identification of their spectroscopic properties. After addressing each of these ... each of these questions, we will propose a new hypothesis that describes an integrated vision of the catalytic mechanism of MPO and LPO. A brief survey of the absorption feat...
Ngày tải lên: 20/02/2014, 02:21
Tài liệu Báo cáo khoa học: "Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval" pptx
... associated to each translation alternative. Selection of a phrase will modify the ranked list of phrases and will provide an access to documents related to the phrase. 3 Experiments and Evaluations ... to re-score the set of translation candidates related to the source terms. Sequences of all possible combinations are con- structed between elements of sets of highly ra...
Ngày tải lên: 20/02/2014, 16:20