Báo cáo khoa học: "A Practical Classification of Multiword Expressions" pdf
... the units in each of the classes. 2 An NLP Taxonomy of Multiword Expressions At this stage of work, our taxonomy is composed of two groups of multiword expressions. The first one consists of units that ... Proceedings of the ACL 2007 Student Research Workshop, pages 19–24, Prague, June 2007. c 2007 Association for Computational Linguistics A Practical Classification of...
Ngày tải lên: 31/03/2014, 01:20
... paper. 2 Hierarchical Text Classification In text classification, the documents are often rep- resented with vector space model (VSM) (Salton et al., 1975). Following (Cai and Hofmann, 2007), we incorporate ... evaluations have shown that most of these methods are quite effective in tra- ditional text classification applications. In past serval years, hierarchical text classification has...
Ngày tải lên: 20/02/2014, 05:20
... expected clusters. Each of the two clusters corre- sponds to one of the senses of palm, and the words closest to the geometric centers of the clusters should be good descriptors of each sense. However, ... three main di- mensions of the context matrices. 1 Introduction The topic of this paper is word sense induction, that is the automatic discovery of the possible sens...
Ngày tải lên: 20/02/2014, 16:20
Tài liệu Báo cáo khoa học: "User Edits Classification Using Document Revision Histories" pptx
... over different representations of user edits, comparison of part -of- speech tags and named entities, and a set of adap- tive features extracted from large amounts of unlabeled user edits. Applied ... & 149440273 pre (“Original Society of Teachers of the Alexander Technique (est. 1958).”) post (“Original and largest professional Society of Teachers of the Alexander Techniq...
Ngày tải lên: 22/02/2014, 03:20
Báo cáo khoa học: "A Practical Solution to the Problem of Automatic Part-of-Speech Induction from Text" pdf
... Automatic Part -of- Speech Induction from Text Reinhard Rapp University of Mainz, FASK D-76711 Germersheim, Germany rapp@mail.fask.uni-mainz.de Abstract The problem of part -of- speech induction ... prob- lem of data sparseness. 1 Introduction Whereas most previous statistical work concerning parts of speech has been on tagging, this paper deals with part -of- speech i...
Ngày tải lên: 08/03/2014, 04:22
Báo cáo khoa học: "A Practical Comparison of Parsing Strategies" docx
... Methods of Comparison Software instrumentation was used to measure the following: the CPU time; the number of phrases (instantiations of grammar rules) proposed by the parser; the number of these ... efficiency transcends the issue of mere practicality. At slow-to-average parsing rates, the cost of verifying linguistic theories on a large, general sample of natural langu...
Ngày tải lên: 08/03/2014, 18:20
Báo cáo khoa học: "A Practical Nonmonotonic Theory for Reasoning about Speech Acts" pot
... prerequisite to a theory of the way agents un- derstand speech acts is a theory of how their be- liefs and intentions are revised as a consequence of events. This process of attitude revision ... fect of informing that is true every time a declara- tive sentence is uttered. If one's general theory of the world and of rational behavior were sufficiently strong and de...
Ngày tải lên: 08/03/2014, 18:20
Báo cáo khoa học: "Phrase Linguistic Classification and Generalization for Improving Statistical Machine Translation" docx
... organization of the paper is as follows. Sec- tion 2 describes the rationale of this classification strategy, discussing the advantages and difficulties of such an approach. Section 3 gives details of the ... equivalently in this text. 67 2 Morphosyntactic classification of translation units State -of- the-art SMT systems use a log-linear com- bination of models to decide the bes...
Ngày tải lên: 17/03/2014, 06:20
Báo cáo khoa học: " A Practical Korean Question Answering Framework for Restricted Domain" pptx
... because of the generalized linguistic information without decreasing the performance of the question analyzer. One of possible defects of using such linguis- tic information is the loss of the ... robustness is focused the question analysis. Instead of using a technique for deep understanding of the question, the ques- tion analysis component of K-QARD tries to ex- tract only...
Ngày tải lên: 31/03/2014, 01:20
Tài liệu Báo cáo khoa học: A strategy for discovery of cancer glyco-biomarkers in serum using newly developed technologies for glycoproteomics ppt
... previously demonstrated the application of this method to the determination of the glycan structure of a form of AFP [10]. However, identification of the details of a glycan structural change on a ... methylesteri- fication of sialic acid moieties. All spectra were obtained in the positive ion mode using MALDI– quadrupole ion trap (QIT)-TOF MS. A strategy for discovery of cancer...
Ngày tải lên: 16/02/2014, 08:20