... deterministic parser for Chinese constituency parsing. In our approach, which is based on the shift-reduce parser for English reported in (Sagae and Lavie, 2005), the parsing task is transformed into ... dependency structures. For lexicalization, we used the same head-finding rules reported in (Bikel, 2004). With this additional information, reduce actions are now in the form of Redu...
Uploaded: 20/02/2014, 12:20
... matrix 7 ) defines a uniform distribution (all pi# equal), we immediately have that the expected neighborhood density for length rnl is identical for all targets Yt, while for length m~ > ... (1975), have been put forward, all of which have Zipf's law as some special or limiting form. Unrelated to Zipf's law is the lognormal hypothesis, advanced for word frequ...
Uploaded: 08/03/2014, 07:20
Scientific report: "A STOCHASTIC APPROACH TO SENTENCE PARSING" pptx
... paper to use statistics as a device for reducing ambiguities. In other words, we propose a scheme for grammatical inference as defined by [Fu], a stochastic augmentation of a given grammar; ... preparation of semantic and pragmatic constraints in the form of usual semantic network, for example, should be done by human experts for each specific domain. This paper first in...
Uploaded: 08/03/2014, 18:20
Scientific report: "A STOCHASTIC FINITE-STATE WORD-SEGMENTATION ALGORITHM FOR CHINESE" docx
... A STOCHASTIC FINITE-STATE WORD-SEGMENTATION ALGORITHM FOR CHINESE Richard Sproat Chilin Shih William Gale AT&T Bell ... Abstract We present a stochastic finite-state model for segmenting Chinese text into dictionary entries and productively derived words, and providing pronunciations for these words; the ... 'Zhou Enlai'. 3. Transliterated Foreign N...
Uploaded: 17/03/2014, 09:20
Scientific report: "A Stochastic Finite-State Morphological Parser for Turkish" doc
... distribution over all word forms. For example, we need probability estimates for unigrams to rank misspelling suggestions for spelling correction. None of the previous studies for Turkish have addressed ... convenient for a morphological parser as a word generator/analyzer to also output a probability estimate for a word generated/analyzed. In this work, we build such a sto...
Uploaded: 23/03/2014, 17:20
Scientific report: "A Stochastic Language Model using Dependency and Its Improvement by Word Clustering" ppt
... Search Algorithm The stochastic context-free grammar used for syntactic analysis consists of rewriting rules (see formula (3)) in Chomsky normal form (Hopcroft and Ullman, 1979) except for ... POS-based one. 2 Stochastic Language Model based on Dependency In this section, we propose a stochastic language model based on dependency. Formally this model is based on a sto...
Uploaded: 31/03/2014, 04:20
Document: Scientific report: A novel electron transport system for thermostable CYP175A1 from Thermus thermophilus HB27 doc
... Therefore, these results clearly indicate that the electron transport system for CYP175A1 is composed of Fdx, FNR, and NADPH (Fig. 4B). All quantitative analyses were performed at 65 °C for 2 ... via OFOR and Fdx to CYP119 [13,14]. Interestingly, the electron transport system for CYP175A1 did not utilize OFOR, although the T. thermophilus HB27 genome contains the genes encoding OFOR...
Uploaded: 18/02/2014, 08:20
Document: Scientific report: "A New Dataset and Method for Automatically Grading ESOL Texts" pdf
... of the Association for Computational Linguistics, pages 180–189, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics A New Dataset and Method for Automatically ... analyses of the performance of individual systems, as yet there is no publically available shared dataset for training and testing such systems and comparing their performance. As...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "A Syntax-Driven Bracketing Model for Phrase-Based Translation" pptx
... then 8: Update bracketing instances for index j 9: end if 10: end if 11: end for 12: for each j ∈ c do 13: := ∪ {bracketing instances from j} 14: end for 15: Output: bracketing instances ... k ). Figure 1 shows the algorithm to extract bracketing instances. Lines 3-11 find all potential bracketing instances for each (i, j, k) ∈ c but only keep 4 bracketing instances for each...
Uploaded: 20/02/2014, 07:20
Document: Scientific report: "A Novel Feature-based Approach to Chinese Entity Relation Extraction" ppt
... Feature-based approaches transform the context of two entities into a linear vector of carefully selected linguistic features, varying from entity semantic information to lexical and syntactic ... relation hierarchy and co-reference information. Experiments on the ACE 2005 data set show that the positional structure feature can provide stronger support for Chinese relation extraction....
Uploaded: 20/02/2014, 09:20