Báo cáo khoa học: "Cross-Language Text Classification using Structural Correspondence Learning" pot
... ap- proach to cross-language text classification based on structural correspondence learning. Section 5 presents experimental results in the context of cross-language sentiment classification. 2 Related ... 11-16 July 2010. c 2010 Association for Computational Linguistics Cross-Language Text Classification using Structural Correspondence Learning Peter Prettenhofer and Benno...
Ngày tải lên: 23/03/2014, 16:20
... most of these methods are quite effective in tra- ditional text classification applications. In past serval years, hierarchical text classification has become an active research topic in database ... Section 4 gives experimental analysis. Section 5 concludes the paper. 2 Hierarchical Text Classification In text classification, the documents are often rep- resented with vector space...
Ngày tải lên: 20/02/2014, 05:20
... Chae and A. Nenkova. 2009. Predicting the fluency of text with shallow structural features: case stud- ies of machine translation and human-written text. In Proceedings of the 12th Conference of the ... vandalism detection is a user ed- its classification problem addressed by a yearly competition (since 2010) in conjunction with the CLEF conference (Potthast et al., 2010; Potthast and H...
Ngày tải lên: 22/02/2014, 03:20
Báo cáo khoa học: "Phrase Linguistic Classification and Generalization for Improving Statistical Machine Translation" docx
... English POS-tagging using freely-available TnT tagger (Brants, 2000), and lemmatization using wnmorph, included in the WordNet pack- age (Miller et al., 1991). • Spanish POS-tagging using FreeLing ... respectively. 1 The terms ’base form’ or ’lemma’ will be used equivalently in this text. 67 2 Morphosyntactic classification of translation units State-of-the-art SMT systems use a log-...
Ngày tải lên: 17/03/2014, 06:20
Báo cáo khoa học: "A Practical Classification of Multiword Expressions" pdf
... much too com- plicated for simple multiword expressions, such as (12). 4 Previous Classifications There are numerous classifications available in lin- guistic literature, and we considered three of ... contain a finite verb phrase, but possibly not many phraseologisms that are extremely common in texts and speech, and do not constitute proper sentences from the point of view of the grammar. ......
Ngày tải lên: 31/03/2014, 01:20
Báo cáo khoa học: "Unsupervised Argument Identification for Semantic Role Labeling" potx
... ARGID stage and the entire data (perhaps with other features) for the classification stage. Better performance is achieved on the classification, where state- of-the-art supervised approaches achieve ... verb, and supervised syntac- tic tools. Using VerbNet along with the output of a rule-based chunker (in 2004) and a supervised syntactic parser (in 2005), they spot instances in the corpus...
Ngày tải lên: 17/03/2014, 01:20
Tài liệu Báo cáo khoa học: "Predicate Argument Structure Analysis using Transformation-based Learning" pdf
... PropBank (Xue, 2008) in Chinese, the GDA Corpus (Hashida, 2005), Kyoto Text Corpus Ver.4.0 (Kawahara et al., 2002), and the NAIST Text Corpus (Iida et al., 2007) in Japanese. The construction of such ... structure, our graph expression of it, and our improved method. The results of experiments using the NAIST Text Corpus, which is our target corpus, are reported in Section 3, and our...
Ngày tải lên: 20/02/2014, 04:20
Báo cáo khoa học: "Historical Change in Language Using Monte Carlo Techniques" potx
... selection of hypotheses is free. The system, which can be viewed as performing Monte Carlo simu- lations of group, language change, has been successfully tested in several computer runs using an ... models might yield information about hy- potheses common to successful simulations or about the mutual incompatibility of certain combinations of hypotheses. Another use of the program wo...
Ngày tải lên: 07/03/2014, 18:20
Báo cáo khoa học: "Making Tree Kernels practical for Natural Language Learning" potx
... classification properties of diverse tree kernels show that kernel combinations al- ways improve the traditional methods. Ex- periments with Support Vector Machines on the predicate argument classification task ... derived that the best λ (see Section 2.2) were 1 and 0.4, respec- tively. The classification performance was eval- uated using the F 1 measure 3 for the single argu- ments and...
Ngày tải lên: 08/03/2014, 21:20
Báo cáo khoa học: "Machine Translation System Combination using ITG-based Alignments∗" pot
... steps: • Computation of all pairwise alignments be- tween system hypotheses (either using ITGs or tercom); for each pair, one of the hypotheses plays the role of the “reference”. • Selection of a system ... of several MT system outputs. use this alignment to hypothesize a set of new, com- posite translations, and select the best composite hy- pothesis from this set. The alignment step is di...
Ngày tải lên: 31/03/2014, 00:20