... and applied naive Bayes and decision tree to it. Their accuracy results are worse than (Blaheta and Charniak, 2000). Neither (Blaheta and Charniak, 2000) nor (Lintean and Rus, 2007a; Lintean and ... binary annotations can again be treated as pseudo function tags and the proposed tree annotator can be readily applied to this problem. As an example, the top half of Figure 3 contains an Arabic ... Chinese TreeBank: Phrase structure annotation of a large corpus. Natural Language Engineering, 11(2):207–238. Kenji Yamada and Kevin Knight. 2001. A syntax-based statistical translation model....
Uploaded: 07/03/2014, 22:20
... system learns this as a non-transliteration but it is wrongly annotated as a transliteration in the gold standard. Arabic nouns have an article “al” attached to them which is translated in English as ... International Language Resources and Evaluation (LREC’10), Valletta, Malta. Sittichai Jiampojamarn, Kenneth Dwyer, Shane Bergsma, Aditya Bhargava, Qing Dou, Mi-Young Kim, and Grzegorz Kondrak. ... uses Hidden Markov Models (Nabende, 2010; Darwish, 2010; Jiampojamarn et al., 2010), Finite State Automata (Noeman and Madkour, 2010) and Bayesian learning (Kahki et al., 2011) to learn transliteration pairs...
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "Sense-based Interpretation of Logical Metonymy Using a Statistical Method" pdf
... interpretation was considered correct if it made sense in some imaginary context. Lapata and Lascarides (2003) extend Utiyama’s approach to interpretation of logical metonymies containing aspectual ... Thomson Avenue Cambridge CB3 0FD, UK Ekaterina.Shutova@cl.cam.ac.uk Abstract The use of figurative language is ubiquitous in natural language texts and it is a serious bottleneck in automatic text ... understanding. We address the problem of interpretation of logical metonymy, using a statistical method. Our approach originates from that of Lapata and Lascarides (2003), which generates a list...
Uploaded: 20/02/2014, 09:20
Document: Scientific report: "A Statistical Analysis of Morphemes in Japanese Terminology" docx
... because usually a smaller sample may well include more 'central' terms. We may need further study concerning the status of the available terminological corpora. 5.2 Statistical ... frequencies and on related Markovian models of discourse." In: Jakobson, R. (ed.) Structure of Language and its Mathematical Aspects. Rhode Island: American Mathematical Society. ... incorporate the lognormal 'law' (Carrol, 1967), the inverse Gauss-Poisson 'law' (Sichel, 1986), Zipf's 'law' (Zipf, 1935) and Yule-Simon 'law' (Simon,...
Uploaded: 20/02/2014, 18:20
Scientific report: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc
... DOM tree alignments, there is substantial research focusing on syntactic tree alignment model for machine translation. For example, (Wu 1997; Alshawi, Bangalore, and Douglas, 2000; Yamada and ... grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3). Yamada K. and K. Knight. 2001. A Syntax Based Statistical Translation Model. In Proceedings of 39th Annual ... location holding more parallel data. This approach is based on our observation that parallel pages share similar structures holding parallel content, and parallel hyperlinks refer to new parallel...
Uploaded: 08/03/2014, 02:21
Scientific report: "A Statistical Parser for Czech*" ppt
... of a morphological analysis program, and also with the single one of those tags that a statistical POS tagging program had predicted to be the correct tag (Hajič and Hladká, 1998). Table ... A Statistical Parser for Czech* Michael Collins AT&T Labs-Research, Shannon Laboratory, 180 Park Avenue, Florham Park, NJ 07932 mcollins@research.att.com Jan Hajič Institute ... Other Slavic languages (such as Polish, Russian, Slovak, Slovene, Serbo-Croatian, Ukrainian) also show these characteristics. Many European languages exhibit FWO and HI phenomena to a lesser...
Uploaded: 08/03/2014, 06:20
Scientific report: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx
... poems as outliers). 4 Selection of lexical and syntactic variables Any text classification task requires an object (here a text) to be parameterised into variables, whether qualitative or quantitative. ... suggests that there is no particular order to the CEFR levels. From a practical perspective, things are not so clear. Traditional approaches have usually viewed difficulty as an interval scale and applied ... correlation coefficient, prediction accuracy as defined by Tan et al. (2005), and adjacent accuracy. Adjacent accuracy is defined by Heilman et al. (2008) as “the proportion of predictions that...
Uploaded: 08/03/2014, 21:20
Scientific report: "A Statistical Spoken Dialogue System using Complex User Goals and Value Directed Compression" pptx
... our knowledge such a combination of goals with different attribute values cannot be straightforwardly handled by comparable state-of-the-art statistical SDSs which appear in the literature. Crook and Lemon ... dialogue action (e.g. offer a restaurant, ask for clarification). Recent research in statistical SDSs has successfully addressed aspects of these problems through the application of Partially ... systems computationally tractable. Work in dialogue system evaluation, e.g. Walker et al. (2004) and Lemon et al. (2006), shows that real user goals are generally sets of items, rather than a single...
Uploaded: 08/03/2014, 21:20
Women’s Health in Atlantic Canada: A Statistical Portrait pptx
... CANADA: A STATISTICAL PORTRAIT 34 31.4% higher than the Canadian average. The cancer death rate for Nova Scotia women is 13% above the national average. Nova Scotia and New Brunswick have the ... revealing. A particularly high percentage of Nova Scotia women record high blood pressure (more than one in five), 80% above the national average, and WOMEN’S HEALTH IN ATLANTIC CANADA: A STATISTICAL ... specifically in health, care-giving and social services, volunteering in WOMEN’S HEALTH IN ATLANTIC CANADA: A STATISTICAL PORTRAIT 20 prose literacy are also higher than those of males for all age...
Uploaded: 14/03/2014, 12:20
On Estimating the Size of a Statistical Audit potx
... the situation; the math is the same. We assume that we are in an adversarial situation, where an adversary may have corrupted some of the objects. For example, the adversary might have tampered ... Stenger from the National Election Data Archive Project is now also available; there is also a nice associated audit size calculation utility on a web site [7]. Stanislevic [11] also examines the ... voting precincts may have different numbers of voters. This complicates matters considerably. Stanislevic [11] has a good approach to handling this situation. 13 On Estimating the Size of a Statistical Audit Ronald...
Uploaded: 15/03/2014, 20:20
Scientific report: "A Statistical Model for Lost Language Decipherment" pptx
... co-occurrence analysis operate over large corpora, which are typically unavailable for a lost language. Finally, Knight and Yamada (1999) and Knight et al. (2006) describe a computational HMM-based ... Cunchillos, Juan-Pablo Vita, and Jose-Ángel Zamora. 2002. Ugaritic data bank. CD-ROM. Gregoria del Olo Lete and Joaquín Sanmartín. 2004. A Dictionary of the Ugaritic Language in the Alphabetic ... structural sparsity constraints on character-level mappings. We assume that an accurate alphabetic mapping between related languages will be sparse in the following way: each letter will map to a...
Uploaded: 17/03/2014, 00:20
Scientific report: "Modeling Human Sentence Processing Data with a Statistical Parts-of-Speech Tagger" ppt
... the tagger with that in a theoretically more powerful model trained on the same data, such as an incremental statistical parser (Wang et al., 2004; Roark, 2001). In so doing we can find the places ... Schütze. Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, Massachusetts, 1999. B. Roark. Probabilistic top-down parsing and language modeling. Computational Linguistics, ... probability re-ranking. That is, the tagger initially favors the main-verb interpretation for the ambiguous -ed form, and later it makes a repair when the ambiguity is resolved as a past-participle. The...
Uploaded: 17/03/2014, 04:20
Scientific report: "A Part of Speech Estimation Method for Japanese Unknown Words using a Statistical Model of Morphology and Context" pptx
... Table 3: Examples of common character bigrams for each part of speech in the infrequent words character type sequence kanji katakana katakana-kanji kanji-hiragana hiragana kanji-katakana ... kanji-hiragana hiragana kanji-katakana katakana-symbol-katakana number kanji-hiragana-kanji alphabet kanji-hiragana-kanji-hiragana hiragana-kanji percent 45.1% 11.4% 6.5% 5.6% ... different types of characters other than punctuation marks: kanji, hiragana, katakana, Roman alphabet, and Arabic numeral. Kanji, which means 'Chinese character', is used for both...
Uploaded: 23/03/2014, 19:20
Scientific report: "A Statistical Machine Translation Model Based on a Synthetic Synchronous Grammar" docx
Uploaded: 31/03/2014, 00:20
Scientific report: "A Statistical Model for Domain-Independent Text Segmentation" pot
Uploaded: 31/03/2014, 04:20
ballentine l.a. statistical interpretation of quantum mechanics
Uploaded: 24/04/2014, 17:16
Chemistry report: "Research Article A Decision-Tree-Based Algorithm for Speech/Music Classification and Segmentation" pot
Uploaded: 21/06/2014, 20:20
Chemistry report: "Research Article A Statistical Multiresolution Approach for Face Recognition Using Structural Hidden Markov Models" pptx
Uploaded: 22/06/2014, 06:20