... important subtask for many natural language processing applications, such as partial parsing, information retrieval and machine translation. A baseNP is a simple noun phrase that does not contain other ... pp.218-224. COLING-ACL’98 Lance A. Ramshaw and Michael P. Marcus ( In Press). Text chunking using transformation-based learning. In Natural Language Processing Using Very large Corpora. Kluwer. Originally appeared in ... Treebank II, and the definition of baseNP is the same as Ramshaw’s, Table 1 summarizes the average performance on both baseNP tagging and POS tagging, each section of the whole Penn Treebank was...
Ngày tải lên: 08/03/2014, 05:20
... represented by a bag-of-word. Among the words, there is a topic term Avatar (t 1 ) occurring twice, i.e. Avatar in A and Avatar in C, and two senti- ment words comfortable (o 1 ) and favorite (o 2 ) ... 4.1.1 Benchmark Datasets Our experiments are based on the Chinese benchmark dataset, COAE08 (Zhao et al., 2008). COAE dataset is the benchmark data set for the opinion retrieval track in the ... Vital < 性能 不 > Performance No 1373 fortable (o 1 ) are also regarded as relevant opi- nion mistakenly, creating a false positive. In re- ality comfortable (o 1 ) describes “the seats...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "A Joint Statistical Model for Simultaneous Word Spacing and Spelling Error Correction for Korean" pdf
... needs a word dictionary and takes long time for searching many character combinations. 61 4.2 Experiment Results and Analyses We used two separate Eumjeol n-grams as lan- guage models for experiments. ... be divided into statistical algorithms and rule-based algorithms. Statistical algorithms generally use character n- gram (Eojeol 1 or Eumjeol 2 n-gram in Korean) (Kang and Woo, 2001; Kwon, ... single Jaso tran- 3 Jaso is a Korean character. 4 ‘Transition’ means the correct character is changed to other character due to some causes, such as typographical errors. sition case (나와욧Æ나와요...
Ngày tải lên: 20/02/2014, 12:20
báo cáo hóa học:" Research Article A Systematic Development Methodology for Mixed-Mode Behavioral Models of In-Vehicle " docx
Ngày tải lên: 21/06/2014, 20:20
A methodology for validation of integrated systems models with an application to coastal-zone management in south-west sulawesi
... river basin management and/or ecosystem-based river basin management (Nakamura, 2003). Embedded in these approaches are the concepts of participatory management and adaptive management (Miser and ... actions. 1.2.3. Integrated management and policy analysis Integrated management Rapid changes of objectives and methodological approaches towards the management of natural resources and ... criterion. A performance criterion defines what aspect of the model we want to examine and what references are used for this examination. For example, a certain performance criterion was drafted as...
Ngày tải lên: 06/11/2012, 10:35
Tài liệu Báo cáo khoa học: "A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining" pptx
... system learns this as a non-transliteration but it is wrongly annotated as a transliteration in the gold standard. Arabic nouns have an article “al” attached to them which is translated in English as ... International Language Resources and Evaluation (LREC’10), Val- letta, Malta. Sittichai Jiampojamarn, Kenneth Dwyer, Shane Bergsma, Aditya Bhargava, Qing Dou, Mi-Young Kim, and Grzegorz Kondrak. ... non-transliterations by N. 3.2 Implementation Details We use the Forward-Backward algorithm to estimate the counts of multigrams. The algorithm has a for- ward variable α and a backward variable...
Ngày tải lên: 19/02/2014, 19:20
Tài liệu Báo cáo khoa học: "A Phrase-based Statistical Model for SMS Text Normalization" ppt
... a consensus translation technique to bootstrap parallel data using off-the-shelf translation sys- tems for training a hierarchical statistical transla- tion model for general domain instant ... normalization as a translation problem from the SMS language to the English language 1 and we propose to adapt a phrase-based statistical MT model for the task. Evaluation by 5-fold cross validation ... SMS normalization. 2.3 SMS Normalization versus Text Para- phrasing Problem Others may regard SMS normalization as a para- phrasing problem. Broadly speaking, paraphrases capture core aspects...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "A Localized Prediction Model for Statistical Machine Translation" ppt
... block-based model for statis- tical machine translation. A block is a pair of phrases which are translations of each other. For example, Fig. 1 shows an Arabic-English translation example that uses blocks. ... Koehn, Franz-Josef Och, and Daniel Marcu. 2003. Statistical Phrase-Based Translation. In Proc. of the HLT-NAACL 2003 conference, pages 127–133, Edmonton, Canada, May. J. Lafferty, A. McCallum, and ... Annual Conf. of the Association for Computa- tional Linguistics (ACL 02), pages 311–318, Philadel- phia, PA, July. Charles Schafer and David Yarowsky. 2003. Statistical Machine Translation Using Coercive...
Ngày tải lên: 20/02/2014, 15:20
Tài liệu Báo cáo khoa học: "A Unified Framework for Automatic Evaluation using N-gram Co-Occurrence Statistics" pptx
... evaluation metrics are able to closely approximate human evaluations for various applications. Given an application app and an evaluation guideline package eval, the faithfulness/compactness ... separately evaluated. Each version was evaluated by a human evaluator, with no reference answer available. For this evaluation 115 test questions were used, and the human evaluator was asked ... same family of metrics explain best the variations obtained with human evaluations, according to the application being evaluated (Machine Translation, Automatic Summarization, and Automatic...
Ngày tải lên: 20/02/2014, 16:20
Báo cáo khoa học: "A Corpus for Modeling Morpho-Syntactic Agreement in Arabic: Gender, Number and Rationality" docx
... Agreement in Arabic: Gender, Number and Rationality Sarah Alkuhlani and Nizar Habash Center for Computational Learning Systems Columbia University {salkuhlani,habash}@ccls.columbia.edu Abstract We ... a Large-Scale Annotated Arabic Corpus. In NEMLAR Conference on Arabic Language Resources and Tools, pages 102–109, Cairo, Egypt. Yuval Marton, Nizar Habash, and Owen Rambow. 2011. Improving Arabic ... Rambow, Yuval Marton, Tim Buckwalter, Otakar Smrž, Reem Faraj, and May Ahmar for helpful discussions and feedback. We also would like to especially thank Ahmed El Kholy and Jamila El-Gizuli for...
Ngày tải lên: 07/03/2014, 22:20
Báo cáo khoa học: "A Statistical Parser for Czech*" ppt
... A Statistical Parser for Czech* Michael Collins AT&T Labs-Research, Shannon Laboratory, 180 Park Avenue, Florham Park, NJ 07932 mcollins@research, att.com Jan Haj i~. Institute ... of a morphological analy- sis program, and also with the single one of those tags that a statistical POS tagging program had predicted to be the correct tag (Haji~ and Hladka, 1998). Table ... morphological analyzer. The PDT also contains machine-assigned tags and lemmas for each word (using a tagger de- scribed in (Haji~ and Hladka, 1998)). For evaluation purposes, the PDT has been...
Ngày tải lên: 08/03/2014, 06:20
Biostatistics A Methodology for the Health Sciences Second Edition pot
... evaluate data from a study statistically forces an investigator to sharpen the focus of the study. It makes one translate intuitive ideas into an analytical model capable of generating data that ... 3.3. A qualitative variable has values that are intrinsically nonnumerical (cate- gorical). As suggested earlier, the values of a qualitative variable can always be put into numerical form. The ... first two authors and add the new authors in alphabetical sequence. This second edition adds a chapter on randomized trials and another on longitudinal data analysis. Substantial changes have been made...
Ngày tải lên: 15/03/2014, 04:20
Báo cáo khoa học: "A Statistical Model for Lost Language Decipherment" pptx
... Cunchillos, Juan-Pablo Vita, and Jose- ´ Angel Zamora. 2002. Ugaritic data bank. CD- ROM. Gregoria del Olo Lete and Joaqu ´ ın Sanmart ´ ın. 2004. A Dictionary of the Ugaritic Language in the Alpha- betic ... morphological segmentation was carried out with the guidance of a standard Ugaritic grammar (Schniedewind and Hunt, 2007). Although Ugaritic is an inflectional rather than agglutinative language, in ... this research has similar goals, it typically builds on information or resources unavailable for ancient texts, such as comparable corpora, a seed lexi- con, and cognate information (Fung and McKe- own,...
Ngày tải lên: 17/03/2014, 00:20
A Database and Evaluation Methodology for Optical Flow pdf
... common approach is to build image pyramids by repeated blurring and downsampling (Lucas and Kanade 1981; Glazer et al. 1983;Burtetal.1983; Enkelman 1986; Anandan 1989; Black and Anandan 1996; Battiti ... equations are linear in du and dv and solved using a sparse linear solver. The estimates of u and v are then updated appropriately and the next iteration applied. One disadvantage of variational algorithms ... the data and prior terms through the introduction of two sets of flow parameters, say (u data ,v data ) for the data term and (u prior ,v prior ) for the prior: E Global = E Data (u data ,v data )...
Ngày tải lên: 17/03/2014, 00:20
A Portfolio-Analysis Tool for Missile Defense (PAT-MD) - Methodology and User docx
Ngày tải lên: 23/03/2014, 02:20
Selling Financial Products - A proven methodology for increasing sales of banking and financial services doc
... Rabobank ■ Rand Merchant Bank (SA) ■ Rating Agency Malaysia ■ Raiffeisen International and RZB ■ Saudi Arabian Monetary Agency ■ Shell ■ Société Générale ■ Standard Chartered Group ■ State Bank ... techniques after all these years” Selling Project Finance Services – Asian bank ■ ABSA ■ Alpha Bank ■ Axa Investment Managers ■ Bank BPH SA ■ Bank of America ■ Bank of China ■ Bank of Kuwait and the ... the Middle East ■ Bank Pekao SA ■ Bank Zachodni WBK SA ■ BBVA Group ■ BNP Paribas ■ Calyon ■ Central Bank of Kuwait ■ Caixa Geral de Depositos ■ China International Capital Corporation ■ Citigroup ■...
Ngày tải lên: 23/03/2014, 11:20
Báo cáo khoa học: "Automatic Story Segmentation using a Bayesian Decision Framework for Statistical Models of Lexical Chain Features" pdf
... terms and locating instances of time where the count of chain starts and ends (boun- dary strength) achieves local maxima. Chan et al. (2007) enhanced this approach through statistical modeling ... (4) 4 Modeling of Lexical Chain Features 4.1 Chain starts and ends We follow (Chan et al. 2007) to model the lexi- cal chain starts and ends at a story boundary with a statistical distribution. ... consideration and statistically modeled. 2 Experimental Setup Experiments are conducted using data from the TDT-2 Voice of America Mandarin broadcast. In particular, we only use the data from...
Ngày tải lên: 23/03/2014, 17:20
Báo cáo khoa học: "A Statistical Model for Domain-Independent Text Segmentation" pot
Ngày tải lên: 31/03/2014, 04:20
Báo cáo khoa học: "A Polynomial-Time Algorithm for Statistical Machine Translation" pot
Ngày tải lên: 31/03/2014, 06:20