a discriminative language model with pseudonegative samples

Báo cáo khoa học: "A Discriminative Language Model with Pseudo-Negative Samples" pptx

Báo cáo khoa học: "A Discriminative Language Model with Pseudo-Negative Samples" pptx

... Discriminative Language Model with Pseudo-Negative samples We propose a novel discriminative language model; a Discriminative Language Model with Pseudo- Negative samples (DLM-PN). In this model, pseudo-negative ... dis- criminative language models can achieve more accurate discrimination because they can employ overlapping features and non- local information. However, discriminative language models have been used ... and not all relevant features can be included. A discriminative language model (DLM) assigns a score to a sentence , measuring the correct- ness of a sentence in terms of grammar and prag- matics,...

Ngày tải lên: 08/03/2014, 02:21

8 315 0
Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx

Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx

... compared to similar research on L1 English by Heilman et al. (2008). Using more complex syntactic fea- tures, they obtained an adjacent accuracy of 52% with a PO model, and 45% with a MLR model. However, ... significantly with difficulty. Then, we add two NLP-oriented features, as described below: a statistical language model and a measure of tense difficulty. 4.1 The language model The lexical difficulty of a ... Belgium thomas.francois@uclouvain.be Abstract Reading is known to be an essential task in language learning, but finding the ap- propriate text for every learner is far from easy. In this context, automatic...

Ngày tải lên: 08/03/2014, 21:20

9 514 0
Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

... NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... Recognition Evaluation (LRE) data. The database was intended to establish a baseline of performance capability for language recognition of conversational tele- phone speech. The database contains recorded ... 2003. Acoustic, Pho- netic and Discriminative Approaches to Automatic language recognition, In Proc. of Eurospeech Masahide Sugiyama. 1991. Automatic language recog- nition using acoustic features ,...

Ngày tải lên: 20/02/2014, 15:20

8 437 0
Tài liệu Báo cáo khoa học: "A Structured Language Model" ppt

Tài liệu Báo cáo khoa học: "A Structured Language Model" ppt

... Proceedings of the Human Language Technology Workshop, 272-277. ARPA. Raymond Lau, Ronald Rosenfeld, and Salim Roukos. 1993. Trigger-based language models: a maximum entropy approach. In Proceedings ... University, Baltimore, MD. Frederick Jelinek, John Lafferty, David M. Mager- man, Robert Mercer, Adwait Ratnaparkhi, Salim Roukos. 1994. Decision Tree Parsing using a Hid- den Derivational Model. ... 1, Nk at position k in the sentence. To ensure a proper probabilistic model we have to make sure that (1) and (2) are well defined con- ditional probabilities and that the model halts with...

Ngày tải lên: 22/02/2014, 03:20

3 342 0
Báo cáo khoa học: "A Flexible Stand-Off Data Model with Query Language for Multi-Level Annotation" ppt

Báo cáo khoa học: "A Flexible Stand-Off Data Model with Query Language for Multi-Level Annotation" ppt

... with a capital letter. Markables are the carriers of the actual annota- tion information. They can be queried by means of string matching and by means of attribute-value combinations. A markable ... nominal attributes can have one of a (user-defined) closed set of possible values. The data model also supports associative relations between markables: Markable set relations associate arbitrarily many markables with ... the ACL Interactive Poster and Demonstration Sessions, pages 109–112, Ann Arbor, June 2005. c 2005 Association for Computational Linguistics A Flexible Stand-Off Data Model with Query Language for...

Ngày tải lên: 08/03/2014, 04:22

4 348 0
Báo cáo khoa học: "Robust Approach to Abbreviating Terms: A Discriminative Latent Variable Model with Global Information" docx

Báo cáo khoa học: "Robust Approach to Abbreviating Terms: A Discriminative Latent Variable Model with Global Information" docx

... Ao and Toshihisa Takagi. 2005. ALICE: An algorithm to extract abbreviations from MEDLINE. Journal of the American Medical Informatics Asso- ciation, 12(5):576–586. June A. Barrett and Mandalay ... forms as the training/evaluation data. The evaluation metrics used in the abbreviation generation are exact-match accuracy (hereinafter accuracy), including top-1 accuracy, top-2 accu- racy, and ... cross validation. We prepared six state-of-the-art abbreviation recognizers as baselines: Schwartz and Hearst’s method (SH) (2003), SaRAD (Adar, 2004), AL- ICE (Ao and Takagi, 2005), Chang and Sch ă utzes method...

Ngày tải lên: 17/03/2014, 01:20

9 389 0
Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

... signif- icantly. Bear in mind that Charniak et al. (2003) in- tegrated Charniak’s language model with the syntax- based translation model Yamada and Knight pro- posed (2001) to rescore a tree-to-string ... Stochastic analysis of lexical and semantic enhanced structural language model. The 8th International Colloquium on Grammatical Inference (ICGI), 97-111. K. Yamada and K. Knight. 2001. A syntax-based ... (EMNLP), 858-867. E. Charniak. 2001. Immediate-head parsing for language models. The 39th Annual Conference on Association of Computational Linguistics (ACL), 124-131. E. Charniak, K. Knight and K. Yamada. 2003....

Ngày tải lên: 20/02/2014, 04:20

10 568 0
Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

... and Linda C. Bauman Peto. 1995. A hierarchical Dirichlet language model. Natural Lan- guage Engineering, 1(3):1–19. Y.W. Teh. 2006. A hierarchical Bayesian language model based on Pitman-Yor processes. ... n-grams: C(ab) − C(ab∗). A( ab) = max(1, K(C(ab) − C(ab∗))) A different K constant is chosen for each n-gram order. Using this formulation as an interpolated 5- gram language model gives a cross ... Speech and Language. R. Kneser and H. Ney. 1995. Improved backing-off for m-gram language modeling. In International Confer- ence on Acoustics, Speech, and Signal Processing. David J. C. Mackay and...

Ngày tải lên: 20/02/2014, 09:20

4 425 1
Tài liệu Báo cáo khoa học: "A Succinct N-gram Language Model" ppt

Tài liệu Báo cáo khoa học: "A Succinct N-gram Language Model" ppt

... com- pression tasks achieved a significant com- pression rate without any loss. 1 Introduction There has been an increase in available N -gram data and a large amount of web-scaled N-gram data has been ... the ACL-IJCNLP 2009 Conference Short Papers, pages 341–344, Suntec, Singapore, 4 August 2009. c 2009 ACL and AFNLP A Succinct N-gram Language Model Taro Watanabe Hajime Tsukada Hideki Isozaki NTT ... Communication Science Laboratories 2-4 Hikaridai Seika-cho Soraku-gun Kyoto 619-0237 Japan {taro,tsukada,isozaki}@cslab.kecl.ntt.co.jp Abstract Efficient processing of tera-scale text data is an important...

Ngày tải lên: 20/02/2014, 09:20

4 458 0
Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

... Distortion models for statistical machine translation. In ACL. D. Chiang. 2005. A hierarchical phrase-based model for statis- tical machine translation. In ACL. M. Collins. 2000. Discriminative reranking ... inference and train- ing of context-rich syntactic translation models. In ACL. P. Koehn. 2004. Pharaoh: A beam search decoder for phrase- based statistical machine translation models. In AMTA. R. ... 2002. Discriminative training and max- imum entropy models for statistical machine translation. In ACL. F. J. Och and H. Ney. 2004. The alignment template approach to statistical machine translation....

Ngày tải lên: 20/02/2014, 12:20

8 404 0
w