... document. Without a great deal of linguistic analysis, it is possible to create summaries for a wide range of documents. Unfortunately, extracts are of- ten documents of low readability and text ... sentence compression systems which falls short of attaining grammaticality levels of human output. For ex- ample, Clarke and Lapata (2008) evaluate a range of state -of- the-art comp...
Ngày tải lên: 16/03/2014, 23:20
Báo cáo khoa học: "Automatic Generation of Information-seeking Questions Using Concept Clusters" ppt
... Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pages 93–96, Suntec, Singapore, 4 August 2009. c 2009 ACL and AFNLP Automatic Generation of Information-seeking Questions Using Concept Clusters Shuguang ... information needs of the user, information-seeking dialogue should take advantage of the inherent grouping of the answers. Several methods have been inves...
Ngày tải lên: 23/03/2014, 17:20
... because of its hierarchical nature where information can be obtained for topics of various granularity unlike (Mishne et al., 2005) where there is no concept of topics at all. 5 Application of Domain ... pertinent questions, etc. The {topic→information} index requires iden- tification of the topic for each call to make use of information available in the model. Below we show ex...
Ngày tải lên: 31/03/2014, 01:20
Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx
... application of the method is auto- matic or semi-automatic compilation of a glossary or technical-term dictionary for a certain domain. Re- cursive application of the method enables to collect a list of ... Web by using search engines and produces a dozen technical terms that are closely related to the seed word. 2 System Figure 1 shows the configuration of the system. The system c...
Ngày tải lên: 20/02/2014, 16:20
Báo cáo khoa học: "Automatic Induction of a CCG Grammar for Turkish" pptx
... re- strictive form of free order CCG. Both Hoffman and Baldridge ignore morphology and treat the inflected forms as different words. The rest of this section contains an overview of the underlying ... review of the relevant work (1.2). In Section 2, the properties of the data are explained. Section 3 then gives a brief sketch of the algorithm used to induce a CCG lexicon, with some...
Ngày tải lên: 17/03/2014, 06:20
Báo cáo khoa học: "Automatic Prediction of Cognate Orthography Using Support Vector Machines" potx
... kind of regularity and that they can be exploited in order to draw a net of implicit rules by means of a machine learning approach. Section 2 deals with previous work done on the field of cognate ... which became known under the name of edit distance (ED). A good case in point of a practical application of ED is represented by the studies in the field of lexicon acquis...
Ngày tải lên: 31/03/2014, 01:20
Báo cáo khoa học: "Unsupervised Discovery of Generic Relationships Using Pattern Clusters and its Evaluation by Automatically Generated SAT Analogy Questions" pot
... type of semantic resource is that of concepts (represented by sets of lexical items) and their inter-relationships. While there is rel- atively good agreement as to what concepts are and which concepts ... Generated SAT Analogy Questions Dmitry Davidov ICNC Hebrew University of Jerusalem dmitry@alice.nc.huji.ac.il Ari Rappoport Institute of Computer Science Hebrew University of...
Ngày tải lên: 23/03/2014, 17:20
Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf
... (possible) medical conditions. The importance of the task of negation and spec- ulation (a.k.a. hedge) detection is attested by a num- ber of research initiatives. The creation of the Bio- Scope corpus (Vincze ... Statistics of the BioScope corpus. The 2nd and 3d columns show the total number of cues within the datasets; the 4th and 5th columns show the percentage of negated and...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt
... ex- amples of the previous section. From the point of view of bag -of- word methods, the pairs (T 1 , H 1 ) and (T 1 , H 2 ) have both the same intra-pair simi- larity since the sentences of T 1 and ... rules that describe a non trivial set of entailment cases. The experiments with the data sets of the RTE 2005 challenge show an improvement of 4.4% over the state -of- the-art me...
Ngày tải lên: 20/02/2014, 12:20
Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx
... there are two sentences in each of the 454 (1) kono software-no riten-ha hayaku ugoku koto this software-POST advantage-POS T quickly run to The advantage of this software is to run quickly. (2) ... the polarity of words There are some works that discuss learning the po- larity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity...
Ngày tải lên: 20/02/2014, 12:20