... availability of rich resources will be increasingly critical to QA performance. While on-line resources such as the Web, WordNet, gazetteers, and encyclopedias are becoming more prevalent, no system-independent study ... a database containing geographical, political, and economi- cal profiles of all the countries in the world. We also analyzed two additional data sources contain- ing astrono...
Ngày tải lên: 31/03/2014, 03:20
... K (2 in the experiment) characters before and after an error-block in the Error-String, am found in the Similar- String, take out the string (denoted C) between A and B in 1 For detecting errors ... string (Error-String) that comprises an error-block and each M (5 in the experiment) character before and after the error-block out of the input string, and using this string (E...
Ngày tải lên: 20/02/2014, 18:20
Tài liệu Báo cáo khoa học: "Efficient Parsing for Bilexical Context-Free Grammar sand Head Automaton Grammars*" pptx
... exploited in parsing formalisms other than rewriting systems. The authors have developed an O(nT)-time parsing algorithm for bilexicalized tree adjoining grammars (Schabes, 1992), improving the ... by a CFG in CNF. In this paper we adopt the following conven- tions: a, b, c, d denote symbols in VT, w, x, y de- note strings in V~, and a, ~, denote strings in (VN t_J VT)*....
Ngày tải lên: 20/02/2014, 19:20
Tài liệu Báo cáo khoa học: "A PROGRAM FOR ALIGNING SENTENCES IN BILINGUAL CORPORA" docx
... constructing a probabilistic dictionary (Table 3) for use in aligning words in machine translation (Brown et al., 1990), or for constructing a bilingual concordance (Table 4) for use in lexicography ... French According to our survey, 1988 sales of mineral water and soft drinks were much higher than in 1987, reflecting the growing popularity of these products. Cola drin...
Ngày tải lên: 20/02/2014, 21:20
Báo cáo khoa học: A role for serglycin proteoglycan in granular retention and processing of mast cell secretory granule components ppt
... stained with May–Gru ¨ nwald (Merck, Sol- lentuna, Sweden) for 15 min. After being washed with water, the slides were stained with 5% Giemsa (Merck) in water for 10 min. TEM Cells were fixed for ... differenti- ation in interleukin (IL)-3-containing medium were analyzed for total intracellular b-hexosaminidase activity. Results are expressed as percentages, where the b-hexosa...
Ngày tải lên: 07/03/2014, 11:20
Báo cáo khoa học: "An Account for Compound Prepositions in Farsi" docx
... Another point to be mentioned is a delicate semantic difference between the meaning of these nouns in other constructions and in combination with prepositions. For example “dalil” in following ... this way tagging prepositions and parsing texts in Natural Language Processing is defined in a proper manner. 1 Introduction Prepositions have very versatile functions in...
Ngày tải lên: 08/03/2014, 02:21
Báo cáo khoa học: "Computer Backup for Field Work in Phonology" pdf
... materials with interesting results. To begin with, some simplifying assumptions were made. Phonetic data were treated as a linear string by simply ignoring their very real grouping properties. ... in Lower California and around the Colorado River delta. His list of around 600 words collected in Paipai was prepared for process- ing. Phonetic symbols were transliterated into str...
Ngày tải lên: 16/03/2014, 19:20
Báo cáo khoa học: "A System for Detecting Subgroups in Online Discussions" pptx
... Computational Linguistics, pages 138–147, Uppsala, Sweden, July. Andrea Esuli and Fabrizio Sebastiani. 2006. Sentiword- net: A publicly available lexical resource for opinion mining. In In Proceedings ... command line interface. For example, the user can specify which clustering algorithm should be used. To facilitate using the system for research pur- poses, the system comes with...
Ngày tải lên: 16/03/2014, 20:20
Báo cáo khoa học: "Distributional Representations for Handling Sparsity in Supervised Sequence-Labeling" pptx
... the increased performance by the HMM- smoothed model on the rare-word subset con- tributes in part to an increase in performance on the overall dataset of 1% for tagging and 3% for chunking. In ... gasolines on newer engines.” In a common dataset for NP chunking, the word “re- formulated” never appears in the training data, but appears four times in the test set as part of t...
Ngày tải lên: 17/03/2014, 01:20
Báo cáo khoa học: "SOFTWARE TOOLS FOR THE ENVIRONMENT OF A COMPUTER AIDED TRANSLATION SYSTEM" pptx
... debugging tool for linguistic applications ; - as a tool for converting the lexical base into a new form (for instance, loading it into a conventional data base). It is possible to imagine ... distinguish certain information contained in a linguistic application data base. VISULEX is intended to facilitate the comprehension and development of coded dictionaries which may be h...
Ngày tải lên: 17/03/2014, 19:21