Báo cáo khoa học: "Automatic Identification of Word Translations from Unrelated English and German Corpora" pot
... Automatic Identification of Word Translations from Unrelated English and German Corpora Reinhard Rapp University of Mainz, FASK D-76711 Germersheim, Germany rapp @usun2.fask.uni-mainz.de ... in terms of corpus frequencies: kl~ = frequency of common occurrence of word A and word B kl2 = corpus frequency of word A - kll k21 = corpus frequency of wo...
Ngày tải lên: 08/03/2014, 06:20
... defined the 3 classes of c listed in Table 1. The identification task separates pro and con candidate sentences (CR and PR in Table 1) from sentences irrelevant to either of them (NR). The ... This work differs in important ways from studies in (Hu and Liu, 2004) and (Popescu and Etzioni, 2005). These approaches extract features 483 of products and identify se...
Ngày tải lên: 20/02/2014, 12:20
... mutual information of a collocation is the log- arithm of the ratio between the probability of the collocation and the probability of events A, B, and C co-occur if we assume B and C are conditionally ... modifier), where head and modifier are words in the input sentence and type is the type of the dependency relation. For example, (la) is an example dependency tre...
Ngày tải lên: 31/03/2014, 04:20
Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx
... related to the seed term from the candidates. Our conditions of “x is closely related to s” is: (1) x is a broader or narrower term of s; or (2) relation degree between x and s is high enough, i.e., ... terms from the corpus by using Naka- gawa’s method. These extracted terms be- come the candidates for the final step. The final step, filtering step, removes inappro- priate terms from...
Ngày tải lên: 20/02/2014, 16:20
Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc
... Table 2: Efficiency of verb detection for each of the five SFs, as tested on 2.6 million words of the Wall Street Journal and controlled by the Penn Treehank's hand-verified tagging ... custom, hand-generated lists of subcategorization frames (e.g., Hindle, 1983), or published, hand- generated lists like the Ozford Advanced Learner's Dictionary of Contemporary E...
Ngày tải lên: 20/02/2014, 21:20
Báo cáo khoa học: "Automatic Compilation of Travel Information from Automatically Identified Travel Blogs" doc
... into two steps: (1) identification of travel blogs and (2) extraction of travel information from them. We explain these steps in Sections 3.1 and 3.2. 3.1 Identification of Travel Blogs Blog ... identifica- tion of travel blogs, and (2) extraction of travel information from blogs. We reported on them in Sections 4.1 and 4.2. 4.1 Identification of Travel Bl...
Ngày tải lên: 08/03/2014, 01:20
Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx
... University of Edinburgh Laboratory for Foundations of Computer Science. state -of- art statistical systems and for improving the portability of these systems between domains. One type of lexical ... yielded systems for English (Car- roll and Rooth, 1998; Briscoe and Carroll, 1997; Ko- rhonen, 2002) capable of detecting comprehensive sets of SCFs with promising accuracy an...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "Automatic Acquisition of Script Knowledge from a Text Collection" docx
... Automatic Acquisition of Script Knowledge from a Text Collection Toshiaki Fujiki Hidetsugu Nanba Interdisciplinary Graduate School of Graduate School of Science and Engineering Information ... occurring in time order from a Japanese text collec- tion and then chose those that were typi- cal of certain situations by ranking these sequences (pairs) in terms of the fre- que...
Ngày tải lên: 31/03/2014, 20:20
Báo cáo khoa học: Structural studies of thymidine kinases from Bacillus anthracis and Bacillus cereus provide insights into quaternary structure and conformational changes upon substrate binding pot
... a 3-A ˚ dissociation of subunits interacting by a1. The base of dTTP is inserted between the a1-helix of two subunits and is stacked between the rings of Phe18 and Phe34 from the adjacent subunit ... case of Ba-TK, several hydrogen bonds form between side- chain atoms and main-chain atoms of the neighboring helices: NH1 of Arg27 is hydrogen-bonded to main- chain oxygen...
Ngày tải lên: 23/03/2014, 09:21
Báo cáo khoa học: "A comparison of clausal coordinate ellipsis in Estonian and German: Remarkably similar elision rules allow a language-independent ellipsis-generation module" pot
... the form of a non-finite complement 3 For lack of space, here we cannot go into aspects of word- order variation (both Estonian and German are languages with relatively free word order). ... framework by Kempen (2009) and its implementation for German and Dutch in ELLEIPO, the elision process is guided by constraints on lemma- and wordform-identity constraints...
Ngày tải lên: 31/03/2014, 20:20