Báo cáo khoa học: "Measure Word Generation for English-Chinese SMT Systems" ppt
... identify head words for sub-trees. For the bilingual corpus, we also per- form word alignment to get correspondences be- tween source and target words. Then, the colloca- tion between measure words ... target information during the measure word generation process. We do not integrate our measure word generation module into the SMT decoder since there is only little target...
Ngày tải lên: 08/03/2014, 01:20
... preferences. The generation process performs template-based generation for simple responses and updates the sys- tem’s model of the user’s intentions after generation. The text planner is used for more ... Users can ask for information about restaurants, such as phone numbers, addresses, and reviews. For example, a user might circle three restaurants as in Figure 3 and say phone...
Ngày tải lên: 08/03/2014, 07:20
... β dimensions of words’ representation vectors. We additionally introduce a bias b w for each word to capture differences in over- all word frequencies. The energy assigned to a word w given these ... com- pare our model’s word representations with several bag of words weighting methods, and alternative ap- proaches to word vector induction. 4.1 Word Representation Learning We in...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "Bilingual Sense Similarity for Statistical Machine Translation" ppt
... work (more details in Section 7) on applying word sense disambiguation (WSD) techniques in SMT for translation selection. However, WSD techniques for SMT do so indirectly, using source-side context ... 2009. Semantic Roles for SMT: A Hybrid Two-Pass Model. In: Proceedings of NAACL/HLT, Boulder, CO. D. Yuret and M. A. Yatbaz. 2009. The Noisy Channel Model for Unsupervised...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "Minimum Cut Model for Spoken Lecture Segmentation" ppt
... three lectures is- used for estimating the optimal word block length for representing nodes, the threshold distances for discarding node edges, the number of uniform chunks for estimating tf-idf ... segmentation performance. In contrast to previous approaches, the homogeneity of a seg- ment is determined not only by the similarity of its words, but also by their relation to words in...
Ngày tải lên: 20/02/2014, 11:21
Tài liệu Báo cáo khoa học: "Discriminative Word Alignment with Conditional Random Fields" ppt
... model many-to-one word alignments, where each source word is aligned with zero or one target words, and therefore each target word can be aligned with many source words. Each source word is labelled ... Section 2 presents CRFs for word alignment, describing their form and their inference techniques. The features of our model are presented in Section 3, and experimental results fo...
Ngày tải lên: 20/02/2014, 11:21
Tài liệu Báo cáo khoa học: " A Declarative Language for Implementing Dynamic Programs∗" pptx
... sentence and grammar by asserting values for certain items. If the input is John loves Mary, the user should assert values of 1 for word( John,0,1), word( loves,1,2), word( Mary,2,3), and end(3). If the ... acquire more over time: we in- tend for it to generalize and encapsulate best practices, and serve as a testbed for new practices. Dyna is now be- ing used for parsing, machin...
Ngày tải lên: 20/02/2014, 16:20
Tài liệu Báo cáo khoa học: "Using Confidence Bands for Parallel Texts Alignment" pptx
... intercept (the value of y when x is 0), substituting x for the Portuguese word position. For Table 3, the ex- pected word position for the word I at pt word position 3877 is 0.9165 × 3877 + 141.65 = ... brackets). For average size texts (e.g. the Written Ques- tions), these words account for about 5% of the total (about 3k words / text). This number varies according to langu...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "AN INDEXING TECHNIQUE FOR IMPLEMENTING COMMAND RELATIONS" ppt
... requierments for the formalism used. Hence, the indexing technique has a wide spectrum of applications for testing command relations in syntactic analysis. Futhermore, this method can also be used for ... special cases of definitions 1.1 and 1.2 for the property of being a set of maximal projections. Before I formulate the general command definition in a formal way, I will now...
Ngày tải lên: 22/02/2014, 10:20
Báo cáo khoa học: "Hypothesis Mixture Decoding for Statistical Machine Translation" ppt
... end for 12: end for 13: for each hypothesis do 14: compute HM decoding features for 15: add to 16: end for 17: for ... BTG-based HM Decoding 1: for each component model do 2: output the search space for the input 3: end for 4: for to do 5: for all s...
Ngày tải lên: 07/03/2014, 22:20