employing topic models for pattern based semantic class discovery

Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

... organizing them into semantic classes. For example, {red, white, black…} is a semantic class consisting of color instances. A popular way for semantic class dis- covery is pattern- based approach, ... related to multiple topics in some topic models (e.g., pLSI and LDA). Topic modeling Semantic class construction word item (word or phrase) document RASC topic semantic class Table 3. ... 459–467, Suntec, Singapore, 2-7 August 2009. c 2009 ACL and AFNLP Employing Topic Models for Pattern- based Semantic Class Discovery Huibin Zhang 1* Mingjie Zhu 2* Shuming Shi 3 Ji-Rong Wen 3 ...

Ngày tải lên: 08/03/2014, 00:20

9 398 0
Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

... relevant transla- tions based on topic- specific contexts, where topics are induced in an unsupervised way using topic models; this can be thought of as inducing subcorpora for adaptation with- out ... probability under topic 1, topic 2, etc., or F 2 : What is the probability under the most probable topic, second most, etc. A model using F 1 learns whether a specific topic is useful for translation, ... utilize informa- tion from any topic distribution if it helps us cre- ate topic relevant translations. F 2 is useful for dy- namic adaptation, where the adapted feature weight changes based on...

Ngày tải lên: 19/02/2014, 19:20

5 532 0
Tài liệu Báo cáo khoa học: "Incremental Syntactic Language Models for Phrase-based Translation" pptx

Tài liệu Báo cáo khoa học: "Incremental Syntactic Language Models for Phrase-based Translation" pptx

... Association for Computational Linguistics, pages 620–631, Portland, Oregon, June 19-24, 2011. c 2011 Association for Computational Linguistics Incremental Syntactic Language Models for Phrase -based ... are therefore a good mechanism for incorporat- ing syntax into phrase -based translation. We give a formal definition of one such linear- time syntactic language model, detail its re- lation to phrase -based ... has effectively used n-gram word sequence models as language models. Modern phrase -based translation using large scale n-gram language models generally performs well in terms of lexical choice,...

Ngày tải lên: 20/02/2014, 04:20

12 511 0
Báo cáo khoa học: "HAL-based Cascaded Model for Variable-Length Semantic Pattern Induction from Psychiatry Web Resources" pdf

Báo cáo khoa học: "HAL-based Cascaded Model for Variable-Length Semantic Pattern Induction from Psychiatry Web Resources" pdf

... in- formation indicates that the k words are more likely to form a semantic pattern of length k. Here the length k also ranges from 2 to 4. For each k, we compute the mutual information for ... each of the better semantic patterns. After the initial set for a particular length is created, each semantic pattern and the seed pattern are represented in the HAL space for further computing ... +1 Cascaded Induction Process Induced Patterns Relevant Patterns Figure 1. Framework for variable-length seman- tic pattern induction. semantically related to the seed pattern. According to the distance...

Ngày tải lên: 08/03/2014, 02:21

8 376 0
Tài liệu Báo cáo khoa học: "Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding" doc

Tài liệu Báo cáo khoa học: "Combining Lexical Semantic Resources with Question & Answer Archives for Translation-Based Answer Finding" doc

... encode some form of semantic knowledge. The semantic similarity and relatedness of words has traditionally been assessed through corpus- based and knowledge -based measures. Corpus- based measures ... determined empirically based on MAP and R-Prec). The performance gaps between the translation -based models and the baseline models are statistically significant, except for those marked with a ... terms “mosquito” (for question 2) and “form” (for question 3), but only their inflected forms “mosquitoes” and “formed”. 6 Conclusion and Future Work We have presented three datasets for training sta- tistical...

Ngày tải lên: 20/02/2014, 07:20

9 527 0
Tài liệu Báo cáo khoa học: "A FrameNet-based Semantic Role Labeler for Swedish" pdf

Tài liệu Báo cáo khoa học: "A FrameNet-based Semantic Role Labeler for Swedish" pdf

... the final classi- fiers consisted of one million training instances for the start classifier, 500,000 for the end classifier, and 272,000 for the role classifier. The features used by the classifiers ... assume that the frame was known a priori. We used the available semantic roles for all senses of the target word as features for the classifier. On a test set from FrameNet, we estimated that the ... SRL system for Swedish text. Like most previous systems, it consists of two parts: a FE bracketer and a classifier that assigns semantic roles to FEs. Both parts are implemented as SVM classifiers...

Ngày tải lên: 20/02/2014, 12:20

8 470 0
Tài liệu Báo cáo khoa học: "Statistical phrase-based models for interactive computer-assisted translation" pdf

Tài liệu Báo cáo khoa học: "Statistical phrase-based models for interactive computer-assisted translation" pdf

... framework, real-time is re- quired. 4 Phrase -based models The usual statistical translation models can be classified as single-word based alignment models. Models of this kind assume that an input ... Association for Computational Linguistics Statistical phrase -based models for interactive computer-assisted translation Jes ´ us Tom ´ as and Francisco Casacuberta Instituto Tecnol ´ ogico de Inform ´ atica Universidad ... framework (Och et al., 2003). Phrase -based models have proved to be very ad- equate statistical models for MT (Tom ´ as et al., 2005). In this work, the use of these models has been extended to interactive...

Ngày tải lên: 20/02/2014, 12:20

7 308 0
Tài liệu Báo cáo khoa học: "Towards History-based Grammars: Using Richer Models for Probabilistic Parsing*" docx

Tài liệu Báo cáo khoa học: "Towards History-based Grammars: Using Richer Models for Probabilistic Parsing*" docx

... describe a technique for estimat- ing the parameters for this model using decision trees. The history -based grammar model provides a mechanism for taking advantage of contextual information from ... contextual information used by their mod- els is because of the difficulty in estimating very rich probabilistic models of context. In this work, we present a model, the history -based grammar ... h Grammar and parse tree for aabb. dexed the non-terminal (NT) nodes of the tree with this leftmost order. We denote by ~- the sen- tential form obtained just before we expand node i. Hence,...

Ngày tải lên: 20/02/2014, 21:20

7 372 0
Tài liệu Báo cáo khoa học: "Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages" docx

Tài liệu Báo cáo khoa học: "Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages" docx

... (Zitouni et al., 2003) for language modeling and by (Gildea, 2001) for semantic role labeling. (Resnik et al., 2001) used backoff trans- lation lexicons for cross-language information re- trieval. ... re- trieval. More recently, (Xi and Hwa, 2005) have used backoff models for combining in-domain and 42 Phrase -Based Backoff Models for Machine Translation of Highly Inflected Languages Mei Yang Department ... accordingly. The phrase table will thus include entries for phrases based on full word forms as well as for their stemmed and/or split counterparts. For each entry with decomposed morphological i i...

Ngày tải lên: 22/02/2014, 02:20

8 379 0
Báo cáo khoa học: "Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs" pdf

Báo cáo khoa học: "Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs" pdf

... developed for automatic semantic class identification, under the rubrics of lexical acquisition, hyponym acquisition, semantic lexicon induction, semantic class learn- ing, and web -based information ... just one doubly-anchored pattern to identify candidate instances for a semantic class: < ;class name> such as < ;class member> and * This pattern has two variables: the name of the se- mantic class to ... both the name of the semantic class as well as a class member. For example, the pattern “CARS such as FORD and *” will extract automobiles, and the pattern “PRESIDENTS such as FORD and *” will extract presidents....

Ngày tải lên: 17/03/2014, 02:20

9 340 0
Báo cáo khoa học: "Exploring Distributional Similarity Based Models for Query Spelling Correction" docx

Báo cáo khoa học: "Exploring Distributional Similarity Based Models for Query Spelling Correction" docx

... Table 2. Performance results for different models Table 2 details the performance scores for the experiments, which shows that both of the two distributional similarity -based models boost ... methods for error model estimation are all based on the similarity between the character strings of q i and c i as described in 3.1. Here we describe a distributional similarity- based method for ... Similarity -Based Mod- els for Query Spelling Correction 3.1 Motivation Most of the previous work on spelling correction concentrates on the problem of designing better error models based on...

Ngày tải lên: 17/03/2014, 04:20

8 309 0
Báo cáo khoa học: "Integrating Pattern-based and Distributional Similarity Methods for Lexical Entailment Acquisition" doc

Báo cáo khoa học: "Integrating Pattern-based and Distributional Similarity Methods for Lexical Entailment Acquisition" doc

... pairs for the input term by pattern- based and distribu- tional similarity methods (Section 3.2); 2: Constructing a feature set for all candidates based on pattern- based and distributional in- formation ... features, suitable for supervised classification. To this end we developed a novel feature set based on both pattern- based and distributional data. To obtain pattern statistics for each pair, ... found in such patterns. In addition to their use for learning lexi- cal semantic relations, patterns were commonly used to learn instances of concrete semantic rela- tions for Information Extraction...

Ngày tải lên: 17/03/2014, 04:20

8 356 0
Báo cáo khoa học: "Bilingual Topic AdMixture Models for Word Alignment" ppt

Báo cáo khoa học: "Bilingual Topic AdMixture Models for Word Alignment" ppt

... of a topic. Given a specific topic- weight vector θ d for a document-pair, each sentence-pair draws its conditionally independent topics from a mixture of topics. This generative process, for a document-pair ... on topical translation models concern mainly explicit logical representations of semantics for machine translation. This include knowledge -based (Nyberg and Mitamura, 1992) and interlingua -based ... success. We propose a new statistical formalism: Bilingual Topic AdMixture model, or BiTAM, to facilitate topic- based word alignment in SMT. Variants of admixture models have appeared in population...

Ngày tải lên: 17/03/2014, 04:20

8 354 0
Báo cáo khoa học: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings" doc

Báo cáo khoa học: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings" doc

... long-term information. In this paper the best performing measures from (Pucher, 2005), which outperform baseline models on word prediction for conversational tele- phone speech are used for Automatic ... dependencies. Therefore (Bellegarda, 2000) proposed combining n-gram language mod- els, which are effective for predicting local de- pendencies, with Latent Semantic Analysis (LSA) based models for covering ... WordNet -based models can be used for rescoring of ‘real’ N-best lists in a difficult task. 1.1 Word prediction by semantic similarity The standard n-gram approach in language mod- eling for speech...

Ngày tải lên: 17/03/2014, 04:20

4 204 0
Báo cáo khoa học: "A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms" potx

Báo cáo khoa học: "A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms" potx

... information is used, it is unclear what semantic information is derived by the rules themselves, and what seman- tic information is available because of unifications with the original semantics. ... category. For example, here is the rule for sen- tences: s(Form, GO-G, Store)/quant(Q,X,R,S) > s(Form, GO-G, [qterm(Q,X,R) JStore])/S. The term quant (C~, X, R, S) represents a quantified formula ... vp(nonfinite,[np(_)/you])/S. s(Form)/S > Subj, vp(Fona,[Subj])/S. (2) vp(Form,Subcat)/S > vp(Form,[Compl[Subcat])/S, Compl. (3) vp(Form,[Subj])/S > vp(Forl,[Subj])/VP, adv(VP)/S. vp(finite,[np(_)/O,np(3-sing)/S])/love(S,O)...

Ngày tải lên: 17/03/2014, 20:20

11 273 0
w