Scientific report: "Ranking Class Labels Using Query Sessions"
... quality of the query-based, re-ranked lists of class labels, relative to alternative ranking methods using only document-based counts. 2 Instance Class Ranking via Query Logs Ranking Hypotheses: ... query sessions for ranking class labels in extracted IsA repositories. It shows that query sessions produce better-ranked class labels than isolated queries do. A ta...
Uploaded: 30/03/2014, 21:20
... challenge, we pursue an unsupervised self-training approach. We train a classifier on a corpus that is automatically labeled using association information. Self-training approaches usually include ... identifying clusters of entities that belong to the same named entity (NE) class. Determining common membership in an NE class like person is an easier task than determining coreferen...
Uploaded: 07/03/2014, 22:20
... in the training and test corpus of SENSEVAL-1, we make a parsing using Apple Pie Parser (Sekine, 1996) and additional vertices using some rules automatically. If the resulted parsing includes ... information retrieval, and so on (Ide and Véronis, 1998). Most of previous supervised methods can be classified into two major ones; approach based on association, and approach based on sel...
Uploaded: 08/03/2014, 04:22
Scientific report: "Spoken Dialogue Management Using Probabilistic Reasoning"
... Spoken Dialogue Management Using Probabilistic Reasoning Nicholas Roy and Joelle Pineau and Sebastian Thrun Robotics Institute Carnegie ... Institute Carnegie Mellon University Pittsburgh, PA 15213 Abstract Spoken dialogue managers have benefited from using stochastic planners such as Markov Decision Processes (MDPs). However, so far, MDPs do not ... that this model is hand-crafted. The...
Uploaded: 08/03/2014, 05:20
Scientific report: "Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters"
... word class. The performance and model size of class N-grams strongly depend on the definition of word classes. In fact, the performance of class N-grams based on the part-of-speech (POS) word class ... effective word class definitions are required for high performance in class N-grams. In this paper, the Multi-Class assignment is proposed for effective word class definitions....
Uploaded: 31/03/2014, 04:20
Document: Scientific report: "Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation"
... and method. • TRG: The models were trained using only the real-error corpus (baseline). • SRC: Trained using only the pseudo-error corpus. • ALL: Trained using the real-error and pseudo-error corpora ... task is hindered by the difficulty of collecting large error corpora. We tackle this problem by using pseudo-error sentences generated automatically. Furthermore, we apply domain...
Uploaded: 19/02/2014, 19:20
Document: Scientific report: "Identifying Text Polarity Using Random Walks"
... unsupervised setting where a handful of seeds is used to define the two polarity classes. The method is experimentally tested using a manually labeled set of positive and negative words. It outperforms ... is closely related to the partially labeled classification with random walks approach in (Szummer and Jaakkola, 2002) and the semi-supervised learning using harmonic functions approac...
Uploaded: 20/02/2014, 04:20
Document: Scientific report: "Automatic Headline Generation using Character Cross-Correlation"
... a system that creates a headline for an English newspaper story using linguistically-motivated heuristics to choose a potential headline. Jin and Hauptmann (2002) proposed a probabilistic ... “and he wrote it” is compared with the word “ﺐﺘﻛ” “he wrote” using the EWM method the resulting score will be 0, but when using the CCC method it will be 0.667. The CCC method comes f...
Uploaded: 20/02/2014, 05:20
Document: Scientific report: "Generating research websites using summarisation techniques"
... profile): ‘automatic classification for information retrieval’, ‘intelligent automatic information retrieval’, ‘information retrieval test collections’, ‘information retrieval system’, ‘automatic classification’, ... stylesheets are often considered inappropriate for diverse organisations. Research summary pages using stylesheets can offer alternative methods of information access and brow...
Uploaded: 20/02/2014, 09:20
Document: Scientific report: "Japanese Dependency Parsing Using Co-occurrence Information and a Combination of Case Elements"
... indicates a latent semantic class of co-occurrence (hidden class). Probabilistic parameters P(n|z), P(r, v|z), and P(z) in Equation (9) can be estimated by using the EM algorithm. In our ... best parse and next-best parses. While our reranking model using generation probability is quite simple, we can easily verify our hypothesis that the two proposed probabilities have an eff...
Uploaded: 20/02/2014, 12:20