Tài liệu Báo cáo khoa học: "Word Alignment with Synonym Regularization" doc

Tài liệu Báo cáo khoa học: "Word Alignment with Synonym Regularization" doc

Tài liệu Báo cáo khoa học: "Word Alignment with Synonym Regularization" doc

... model. 3.2 Word Alignment with Synonym Regularization In this section, we extend the bilingual genera- tive model (HM-BiTAM) with our synonym pair model. Our expectation is that synonym pairs 138 Figure ... words included in synonym pairs and to enable us to incorporate synonym information ef- fectively into word alignment models. To con- firm the effect of the synonym pair...

Ngày tải lên: 20/02/2014, 04:20

5 471 2
Tài liệu Báo cáo khoa học: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

Tài liệu Báo cáo khoa học: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

... proposes an approach to im- prove word alignment for languages with scarce resources using bilingual corpora of other language pairs. To perform word alignment between languages L1 and L2, ... Based on these two additional corpora and with L3 as the pivot language, we build a word alignment model for L1 and L2. This approach can build a word alignment model for two languages...

Ngày tải lên: 20/02/2014, 12:20

8 359 0
Tài liệu Báo cáo khoa học: "Word representations: A simple and general method for semi-supervised learning" doc

Tài liệu Báo cáo khoa học: "Word representations: A simple and general method for semi-supervised learning" doc

... words (14K sentences, 946 documents), the test set contains 46K words (3.5K sentences, 231 documents), and the development set contains 51K words (3.3K sentences, 216 documents). We also evaluated ... of 160 million word tokens with a vocabulary size W of 70K word types. There are 2·W types of context (columns): The first or second W are counted if the word c occurs within a window of 10 to...

Ngày tải lên: 20/02/2014, 04:20

11 688 0
Tài liệu Báo cáo khoa học: "Word to Sentence Level Emotion Tagging for Bengali Blogs" doc

Tài liệu Báo cáo khoa học: "Word to Sentence Level Emotion Tagging for Bengali Blogs" doc

... task. Each feature value is boolean in nature, with discrete value for intensity feature at the word level.  POS information: We are interested with the verb, noun, adjective and adverb words ... 2006). (Yang et al., 2007) has used Yahoo! Kimo Blog corpora containing emoticons associated with textual keywords to build emotion lexicons. (Chen et al., 2007) has experimented the e...

Ngày tải lên: 20/02/2014, 09:20

4 429 0
Tài liệu Báo cáo khoa học: "Word Vectors and Two Kinds of Similarity" pptx

Tài liệu Báo cáo khoa học: "Word Vectors and Two Kinds of Similarity" pptx

... standard multiple-choice synonym test. Each item of a synonym test con- sisted of a stem word and five alternative words from which the test-taker was asked to choose one with the most similar meaning ... 800 900 1000 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 LSA (synonym) LSA (antonym) DIC (synonym) DIC (antonym) Number of Dimensions Correct Rate Figure 2: Synonym versus antonym judgment were...

Ngày tải lên: 20/02/2014, 12:20

8 473 0
Tài liệu Báo cáo khoa học: "Word Order in German: A Formal Dependency Grammar Using a Topological Hierarchy" pptx

Tài liệu Báo cáo khoa học: "Word Order in German: A Formal Dependency Grammar Using a Topological Hierarchy" pptx

... rules with the dependency tree of Fig. 1 and show how we describe phenomena such as scram- bling and (partial) VP fronting. 2.4 Non-embedded construction and “scrambling” Let us start with cases without ... marked with an accent on the first syllable of the radical). Finally, the existence of this ambiguity is also confirmed by the contrast between full infini- tives (with zu) and bare...

Ngày tải lên: 20/02/2014, 18:20

8 575 0
Tài liệu Báo cáo khoa học: " Word Translation Disambiguation Using Bilingual Bootstrapping" doc

Tài liệu Báo cáo khoa học: " Word Translation Disambiguation Using Bilingual Bootstrapping" doc

... Suppose that the classifier with respect to ‘plant’ has two decisions (denoted as A and B in Figure 5). Further suppose that the classifiers with estimate )|( )( teP E ε with MLE using ε L as ... 53.5 55.6 54.1 62.7 Figure 6: Learning curves with ‘interest’ Figure 7: Learning curves with ‘line’ α Figure 8: Accuracies of BB with different α Table 4: Accuracie...

Ngày tải lên: 20/02/2014, 21:20

9 480 0
Tài liệu Báo cáo khoa học: "WORD, PHRASE AND SENTENCE" pptx

Tài liệu Báo cáo khoa học: "WORD, PHRASE AND SENTENCE" pptx

... much can be accomplished with vocabulary analysis, with keyword scanning and statistical treatment of text and with semantic analysis at the single sentence level. Yet, with regard to most of ... The final study is an experiment with a sentence level translator applied to a large German-English translation task. These two studies are primarily concerned with analysis of lang...

Ngày tải lên: 21/02/2014, 20:20

2 381 0
Tài liệu Báo cáo khoa học: "WORD AND OBJECT IN DISEASE DESCRIPTIONS" doc

Tài liệu Báo cáo khoa học: "WORD AND OBJECT IN DISEASE DESCRIPTIONS" doc

... this large sample of text (some 333,000 word occurrences), within the context of diseases. An interesting early result was the ease with which many medical terms could be algorithmically separated ... single disease definition. (Table i lists the words at the top of the frequency list together with the number of occurrences.) Assisted by the facilities of the TMuNIX operating sys-...

Ngày tải lên: 21/02/2014, 20:20

4 527 0
Tài liệu Báo cáo khoa học: "word stress from spelling" ppt

Tài liệu Báo cáo khoa học: "word stress from spelling" ppt

... principled framework for dealing with these long distance dependencies. Stress assignment will be formulated in terms of Waltz' style constraint propagation with four sources of constraints: ... the unknown words are proper flOUfl-q. The difficulty with pmpor nouns h demonstrated by the table below which compares the Brown Corpus with the surnames in the Kansas City Teleph...

Ngày tải lên: 21/02/2014, 20:20

8 503 0
w