Tài liệu Báo cáo khoa học: "Improving Probabilistic Latent Semantic Analysis with Principal Component Analysis" ppt
... Improving Probabilistic Latent Semantic Analysis with Principal Component Analysis Ayman Farahat Palo Alto Research Center 3333 Coyote Hill ... 94304 chen@fxpal.com Abstract Probabilistic Latent Semantic Analysis (PLSA) models have been shown to pro- vide a better model for capturing poly- semy and synonymy than Latent S eman- tic Analysis (LSA). ... Bishop (1999)...
Ngày tải lên: 22/02/2014, 02:20
... carried out without mimic and known as the damage group; the experi- ment carried out without the mimic, ascorbate, and ferrous sulfate was known as the control group. Biological analysis of ... were higher than with 6-SeCD. In the investigation of GPX mimics, Wilson’s disele- nides are successful [11]. As shown in Scheme 4, there are two processes (oxidation and reduction with thiols)...
Ngày tải lên: 19/02/2014, 02:20
... rows report the cross-pair similarity carried out with Eq. 6 with (Synt Trees with placeholders) and without (Only Synt Trees) augmenting the trees with placehold- ers, respectively. Each column ... After substituting 3 with b and 2 with a , we can detect if T 1 and T 3 share the bold subtree S → NP 2 VP 3 . As such subtree is shared also by H 1 and H 3 , the words within the pair...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "A Logic-based Semantic Approach to Recognizing Textual Entailment" ppt
... family relations, awards, etc. 5 Semantic Calculus The Semantic Calculus axioms combine two se- mantic relations identified within a text fragment and increase the semantic connectivity of the text ... con- sider chains with an IS-A relation followed by a HYPONYMY link ( ). Similarly, the system rejected chains with more than one HYPONYMY relations. Al- though these relations link se...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Improving Word Representations via Global Context and Multiple Word Prototypes" pdf
... neural language models. 1 1 Introduction Vector-space models (VSM) represent word mean- ings with vectors that capture semantic and syntac- tic information of words. These representations can be used to ... accounts for homonymy and polysemy by learning mul- tiple embeddings per word. We introduce a new dataset with human judgments on pairs of words in sentential context, and evaluate o...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: "Improving Statistical Machine Translation with Monolingual Collocation" pdf
... monolingual sentence, i denotes the number of words that are aligned with i w . Since a word never collocates with itself, the alignment set is denoted as }&],1[|),{( ialiaiA ii . ... 2010. c 2010 Association for Computational Linguistics Improving Statistical Machine Translation with Monolingual Collocation Zhanyi Liu 1 , Haifeng Wang 2 , Hua Wu 2 , Sheng Li 1 1...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "Improving Chinese Semantic Role Labeling with Rich Syntactic Features" ppt
... information of sub-trees in a given parse. With help of these new features, our sys- tem achieves 93.49 F-measure with hand-crafted parses. Comparison with the best reported results, 92.0 (Xue, ... frame+w v +w h , and w v +cct. 4 Experiments and Analysis 4.1 Experimental Setting To facilitate comparison with previous work, we use CPB 1.0 and CTB 5.0, the same data set- ting with...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "A probabilistic generative model for an intermediate constituency-dependency representation" pptx
... PCFG-reranker (together with the lower and the upper bound), with the increase of the number of best candidates. 3.5 Results Table 2 reports the results we obtain when re- ranking with our model an ... WFM) specified in equations (2-4) and explained in the main text. 3 A probabilistic Model for TDS This section describes the probabilistic generative model which was implemented in ord...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: Improving Classification of Medical Assertions in Clinical Notes" pdf
... instances with that label as positive instances and instances with any other label as negative instanc- es. The final class label is assigned by choosing the class that was assigned with the ... its performance with our original system. 4.1 Data The training set includes 349 clinical notes, with 11,967 assertions of medical problems. The test set includes 477 texts with 18...
Ngày tải lên: 20/02/2014, 05:20
Tài liệu Báo cáo khoa học: "Improving Automatic Speech Recognition for Lectures through Transformation-based Rules Learned from Minimal Data" ppt
... you ⇓ Output all rules for replacing the incorrect ASR sequence with the correct text, using the entire sequence (a) or splices (b), with or without surrounding anchors: (a) the okay one and / ok why ... how the transcripts improve, words with lower information content (e.g., a lower tf.idf score) are corrected more often and with more improvement than words with higher information...
Ngày tải lên: 20/02/2014, 07:20