domain similarity for parsing

Scientific report: "Effective Measures of Domain Similarity for Parsing" pdf

Uploaded: 30/03/2014, 21:20


Document: Scientific report: "Bilingual Sense Similarity for Statistical Machine Translation" ppt

... analogous similarity metric for γ. They range from 0 to 1. These two metrics both evaluate the similarity for two vectors in the same language, so using cosine distance to compute the similarity ... performance over the baseline. The Alg2 cosine similarity function got 0.7 BLEU-score (p<0.01) improvement over the baseline for NIST 2006 test set, and a 0.5 BLEU-score (p<0.05) for ... of the Association for Computational Linguistics, pages 834–843, Uppsala, Sweden, 11-16 July 2010. © 2010 Association for Computational Linguistics Bilingual Sense Similarity for Statistical Machine...

Uploaded: 20/02/2014, 04:20


Document: Scientific report: "A Unified Syntactic Model for Parsing Fluent and Disfluent Speech∗" ppt

... any information about repair is stripped from the input, including partial words, repair symbols 3, and interruption point information. While an integrated system for processing and parsing ... results from Hale et al. RCT results are on the right-corner transformed grammar (transformed back to flat treebank-style trees for scoring purposes). CYK and TAG lines show relevant results from ... syntactic information to find repairs, and thus may have access to some of this information about where interruptions occur, this experiment is intended to evaluate the use of the right corner transform...

Uploaded: 20/02/2014, 09:20


Document: Scientific report: "Keyword Extraction using Term-Domain Interdependence for Dictation of Radio News" ppt

... any other domains, domain_j seems to be the domain of unit~. The system selects the domain which is the largest of all similarities in N of domains as the domain of the unit (formula (6)) ... vectors. 5.3 Domain identification experiment The system selects suitable domain of each unit for keyword extraction. Table I shows the results of domain identification. We conducted domain identification ... kinds of domains, i.e. 141 domains and 9 large domains. We also compared the results and the result using previous method (Suzuki et al., 1997). For comparison, we selected 5 domains which...

Uploaded: 20/02/2014, 18:20


Document: Scientific report: "Statistical Decision-Tree Models for Parsing*" ppt

... Statistical Pattern Recognition. Doctoral dissertation. Stanford University, Stanford, California. 283 Statistical Decision-Tree Models for Parsing* David M. Magerman Bolt Beranek and Newman ... sentence length for Wall Street Journal experiments. 5 Conclusion Regardless of what techniques are used for parsing disambiguation, one thing is clear: if a particular piece of information is ... and 7 illustrate the performance of SPATTER as a function of sentence length. SPATTER's performance degrades slowly for sentences up to around 28 words, and performs more poorly and more...

Uploaded: 20/02/2014, 22:20


Document: Scientific report: "Text-to-text Semantic Similarity for Automatic Short Answer Grading" pdf

... corpus-based measures of similarity perform comparably when used for the task of short answer grading. However, since the corpus-based measures can be improved by accounting for domain and corpus ... unsupervised techniques for the task of automatic short answer grading. We compare a number of knowledge-based and corpus-based measures of text similarity, evaluate the effect of domain and size on ... improve the performance of the system by integrating automatic feedback from the student answers. Overall, our system significantly and consistently outperforms other unsupervised methods for short...

Uploaded: 22/02/2014, 02:20


Scientific report: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation" pdf

... gathered from parallel texts and the evaluation data for the two SENSEVAL tasks. This gave a set of 6 nouns for SENSEVAL-2 and 9 nouns for SENSEVAL-3. For each noun, we gathered a maximum of 500 parallel ... we use for our experiments before presenting our experimental results. Next, we propose using the well calibrated probabilities of logistic regression to estimate the sense priors, and perform ... improves performance. For example, row 1 of Table 4 shows that adjusting the predictions of multiclass naive Bayes classifiers by sense priors estimated by logistic regression (NB-EM) performs significantly...

Uploaded: 08/03/2014, 02:21


Scientific report: "Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification" potx

... target domain labeled instances. We chose this number since we believe it to be a reasonable amount for a single engineer to label with minimal effort. For reasons of space, for each target domain dom ... different domains, and annotating corpora for every possible domain of interest is impractical. We investigate domain adaptation for sentiment classifiers, focusing on online reviews for different ... domains of discourse makes it an ideal candidate for domain adaptation. This work addressed two important questions of domain adaptation. First, we showed that for a given source and target domain, ...

Uploaded: 08/03/2014, 02:21


Scientific report: "Syntactic Features and Word Similarity for Supervised Metonymy Resolution" pot

... [Figure 1: Context reduction and similarity levels] draw ... 1997; Stern, 1931)). In a place-for-people pattern, a place stands for any persons/organisations associated with it, e.g., for sports teams in (2), (3), and (4), and for the government in (7). 4 (7) ... a place-for-event pattern, a location name refers to an event that occurred there (e.g., using the word Vietnam for the Vietnam war). In a place-for-product pattern a place stands for a product...

Uploaded: 08/03/2014, 04:22


Medical report: Identification of residues in the PXR ligand binding domain critical for species specific and constitutive activation docx

... designed: 5′-TGAGATGTGCCAGCTGAGGTTCA-3′ for I282Q (forward), 5′-CAACGCCCAGCATACCCAGCAGT-3′ for Q404H (forward), 5′-CAACGCCCAGGCAACCCAGCAGT-3′ for Q404A (forward), 5′-TGAACCTCAGCTGGCACATCTA-3′ for I282Q (reverse), ... were obtained by Transformer Site-directed mutagenesis Kit (Clontech). The following primers were used: 5′-TCGAGCTGTGTATACTGAGATTCA-3′ for Q285I, 5′-TCAATGCTCAGCAGACCCAGCGGC-3′ for H407Q, 5′-TCAATGCTCAGGCCACCCAGCGGC-3′ ... luciferase activity. All experiments were performed at least three times in duplicates and luciferase activity was normalized for alkaline phosphatase activity. For curve fitting and EC50 calculations, XLFIT version...

Uploaded: 08/03/2014, 16:20


Scientific report: "Aligning Medical Domain Ontologies for Clinical Query Extraction" potx

... in a specific domain (medicine) and as we are not domain experts, we are in lack of domain knowledge. This missing domain knowledge shall be acquired from external resources, for example UMLS. ... (c) is it normal or is it abnormal? Therefore, when a radiologist looks for information, his search queries most likely contain terms from various information sources that provide this kind ... noun) Using a transformation rule of the form, 82 Ontologies (OBO) 5 framework. The OBO consortium establishes a set of principles to which the biomedical ontologies shall conform to for purposes...

Uploaded: 08/03/2014, 21:20


Scientific report: "Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars" doc

... Association for Computational Linguistics Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars Zhenghua Li, Ting Liu∗, Wanxiang Che Research Center for Social Computing and Information ... punctuation. We adopt Dan Bikel’s randomized parsing evaluation comparator for significance test (Noreen, 1989). 7 For all models used in current work (POS tagging and parsing), we adopt averaged perceptron ... algorithm for projective dependency parsing. In Proceedings of the 8th International Workshop on Parsing Technologies (IWPT), pages 149–160. Eric W. Noreen. 1989. Computer-intensive methods for testing...

Uploaded: 16/03/2014, 19:20


Scientific report: "Domain Adaptation for Machine Translation by Mining Unseen Words" doc

... gains in performance when moving from Parliament domain to News domain. 3 Data Our source domain is European Parliament proceedings (http://www.statmt.org/europarl/). We use three target domains: ... methods for mining dictionaries from comparable corpora to the domain adaptation setting, by “bootstrapping” them based on known translations from the source domain. (3) Develop methods for integrating ... establish baseline performance for the domains. In these experiments, we built a translation model based only on the Parliament proceedings. We then tune it using the small amount of target-domain tuning...

Uploaded: 17/03/2014, 00:20


Scientific report: "Exploiting Heterogeneous Treebanks for Parsing" pptx

... that the use of probability information from the parser for tree conversion helps target grammar parsing. 4.3 Using Unlabeled Data for Parsing Recent studies on parsing indicate that the use ... treebanks with same grammar formalism for domain adaptation of parsers. Roark and Bachiani (2003) presented count merging and model interpolation techniques for domain adaptation of parsers. ... Parsing Through grammar formalism conversion, we have successfully turned the problem of using heterogeneous treebanks for parsing into the problem of parsing on homogeneous treebanks. Before using converted...

Uploaded: 17/03/2014, 01:20


Scientific report: "Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation" potx

... (Figure 1): one for the live “instant translation” user interface, one for demonstrating the different components of the system and algorithmic visualizations, and one designated for technical ... and David Chiang. 2005. Better k-best parsing. In Proceedings of the International Workshop on Parsing Technologies. 27 We will rely on 3 workstations: one for the instant translation demo, ... Open source toolkit for statistical machine translation. In Proceedings of the ACL-2007 Demo and Poster Sessions. Zhifei Li and Sanjeev Khudanpur. 2008. A scalable decoder for parsing-based machine...

Uploaded: 17/03/2014, 02:20


Scientific report: "Construction of Domain Dictionary for Fundamental Vocabulary" pdf

... Dong, 2006) and WordNet provide domain information for Chinese and English, but there has been no domain resource for Japanese that are publicly available. 8 Domain dictionary construction methods ... Preparing keywords for each domain (§3.1). 2 Associating JFWs with domains (§3.2). 3 Reassociating JFWs with NODOMAIN (§3.3). 4 Manual correction (§3.5). 3.1 Preparing Keywords for each Domain About ... NODOMAIN was prepared for those words that do not belong to any particular domain. As for the latter issue, you might use keyword extraction techniques; identifying words that represent a domain...

Uploaded: 17/03/2014, 04:20
