... deterministic parser for Chinese constituency parsing. In our approach, which is based on the shift-reduce parser for English reported in (Sagae and Lavie, 2005), the parsing task is transformed into ... SVM parser is 2-13 times faster than state-of-the-art parsers, while producing more accurate results. Our Maxent and DTree parsers run at speeds 40-270 times faster than state-o...
Uploaded: 20/02/2014, 12:20
... predicate (VP henceforth). Suppose also that the primary operations of the grammar are putting constituents together. Could the minimal parser for such a grammar account for the minimal pair ... rule for type-raising (see e.g. Dowty 1988) can cause difficulties for the parsing scheme advocated here (Hepple 1987) and is therefore assumed to apply in the lexicon. So a proper name...
Uploaded: 17/03/2014, 09:20
Scientific report: "A Cascaded Finite-State Parser for German" pot
... cascaded finite-state parser (Abney, 1997). For the tagging approach, the effects of choosing different representations of dependency tuples are investigated. Performance of the finite-state parser ... A Cascaded Finite-State Parser for German Michael Schiehlen Institute for Computational Linguistics, University of Stuttgart, Azenbergstr. ... Deterministic parsers re...
Uploaded: 17/03/2014, 22:20
Scientific report: "A Cascaded Finite-State Parser for Syntactic Analysis of Swedish" potx
... is performed using finite-state recognizers based on trigger words, typical contexts, and typical predicates associated with the entities. The performance of the NE recognition for Swedish ... performance of the parser partly depends on the output of the tagger and the rest of the preprocessing software. Our way of dealing with how "correct" the performance of the...
Uploaded: 17/03/2014, 22:20
Scientific report: "A Stochastic Finite-State Morphological Parser for Turkish" doc
... convenient for a morphological parser as a word generator/analyzer to also output a probability estimate for a word generated/analyzed. In this work, we build such a stochastic morphological parser for ... applications for evaluation. 1 The stochastic morphological parser is available for research purposes at http://www.cmpe.boun.edu.tr/~hasim 2 Language Resources...
Uploaded: 23/03/2014, 17:20
... matrix 7 ) defines a uniform distribution (all pi# equal), we immediately have that the expected neighborhood density for length rnl is identical for all targets Yt, while for length m~ > ... (1975), have been put forward, all of which have Zipf's law as some special or limiting form. Unrelated to Zipf's law is the lognormal hypothesis, advanced for word fre- qu...
Uploaded: 08/03/2014, 07:20
... Greibach normal form, were translated into a form which is favorable to our method. 2118 rules of the original rules were rewritten as 5241 rules in Chomsky normal form. B. Parser A bottom-up ... bottom-up context-free parser based on Cocke-Kasami-Young algorithm was developed especially for this purpose. Special emphasis was put on the design of the parser to get better pe...
Uploaded: 08/03/2014, 18:20
Scientific report: "A Language-Independent Unsupervised Model for Morphological Segmentation" pot
... application. Performance improvements due to morphological information have been reported for example in MT, information retrieval, and speech recognition. For the latter task, morphological segmentations ... systems for morphological segmentation with respect to CELEX manual morphological annotation. Rule-based systems are currently the most common approach to morphologi...
Uploaded: 17/03/2014, 04:20
Scientific report: "A Freely Available Morphological Analyzer, Disambiguator and Context Sensitive Lemmatizer for German" pdf
... compact as it only stores the base form for each word together with its inflection class. Therefore, the complete morphological information for 324,000 word forms takes less than 2 Megabytes of ... lemmata for each word form. Secondly, the tagger determines the grammatical categories of the word forms. If, for any of the lemmata, the inflected form corresponding to the word...
Uploaded: 17/03/2014, 07:20
... Abstract We present a stochastic finite-state model for segmenting Chinese text into dictionary entries and productively derived words, and providing pronunciations for these words; the ... 'Zhou Enlai'. 3. Transliterated Foreign Names: bu4-lang3-shi4-wei2-ke4 'Brunswick'. We present a stochastic finite-state model for segmenting Ch...
Uploaded: 17/03/2014, 09:20