Scientific report: "Using Similarity Scoring To Improve the Bilingual Dictionary for Word Alignment" doc
... example, if the maximum number is 2, then a word can align to 0, 1, or 2 words in the parallel sentence. In other settings, we enforced a minimum score in the bilingual dictionary for a link to be ... score of the merged cluster is the average similarity score of the -word cluster, averaged with the similarity scores between the single word and all wor...
Uploaded: 08/03/2014, 07:20
... and shuffled to produce a mutant library, the members of which were then monitored for their ability to confer increased TMP resistance when fused to DHFR. The genes corresponding to resistant ... close to the N-terminus of the fragment) it lies between the start of the fragment and the predicted start of the domain (Fig. 1). From the round 3 mutants, three...
Uploaded: 23/03/2014, 07:20
... tagger to label the result snippets and then transfer the tags to the queries, producing a set of noisy labeled queries. These labeled queries are then added to the training data and the tagger ... strategies for selecting which annotation to transfer and find that using the result that was clicked by the user gives comparable performance to using just the top r...
Uploaded: 16/03/2014, 20:20
Scientific report: "Using Anaphora Resolution to Improve Opinion Target Identification in Movie Reviews" docx
... They are gathered and to be presented in the context of one particular entity (=movie). The context or topic under which it occurs is therefore typically clear to the reader and is therefore not ... we will refer to the configurations using these extensions with the numbers attributed to them above. 5 Experimental Work To integrate AR in the OM algorithm, we add the...
Uploaded: 17/03/2014, 00:20
Scientific report: "Using Deep Morphology to Improve Automatic Error Detection in Arabic Handwriting Recognition" pot
... by the HR system for a given segment scan. The conf is defined here as the ratio of the number of hypotheses in the N-best list that the word appears in to the total number of hypotheses. These ... among each other is called the lexeme. A lemma is a particular word form used to represent the lexeme word set – a citation form that stands in for the class (Haba...
Uploaded: 17/03/2014, 00:20
Scientific report: "Using comparable corpora to solve problems difficult for human translators" pptx
... properties, nevertheless words in the similarity class tend to follow the POS of the original word, because of the similarity of their contexts of use. Furthermore, dictionaries also tend to translate words using ... distributionally words can contain words irrelevant to the source word, we filter them to produce a more reliable similarity class S(s 0 ) using the...
Uploaded: 31/03/2014, 01:20
Document: Scientific report: "Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification" doc
... w i that occur in the review d are set to d i , their frequency in d. The subsequent k dimensions that correspond to the top ranked based entries for the review d are weighted according to their ranking ... co-occur with the features in the feature vector for the ele- ment v. If there are no features that co-occur with both u and v, then the relatedness reaches its m...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Using adaptor grammars to identify synergies in the unsupervised acquisition of linguistic structure" docx
... The category Word is adapted, which means that the grammar learns the words that occur in the training corpus. We present our adap- Sentence → Words Words → Word Words → Word Words Word → Phonemes Phonemes ... subtree in the parses of the other strings in the training corpus. A final accept-reject step corrects for the difference in the probability of the sam-...
Uploaded: 20/02/2014, 09:20
Document: Scientific report: "Using linguistic principles to recover empty categories" ppt
... However, if the task is to insert empty nodes into a tree, then the method leads both to false positives and to false negatives. Suppose for example that the sentence When do you expect to finish? ... categories) for the insertion of a labeled empty category into the tree (and/or string), and the term resolution for the coindexation of the empty category w...
Uploaded: 20/02/2014, 16:20
... characteristics. The length of the sentences ranged between three words and 32 words. The median sentence length was 12 words, and the mean was 13.8 words. Table 2 shows the aggregated outcomes for the ... hand-bracketed parses to examine both the internal and external performance of a grammar checker. The internal performance refers to the behavior of...
Uploaded: 20/02/2014, 21:20