an empirical study of the behavior of active learning for word sense disambiguation

Báo cáo khoa học: "Domain Adaptation with Active Learning for Word Sense Disambiguation" pdf

Báo cáo khoa học: "Domain Adaptation with Active Learning for Word Sense Disambiguation" pdf

Ngày tải lên : 08/03/2014, 02:21
... each of the 21 nouns. The sense with the highest estimated sense prior is taken as the predominant sense of the noun. For the set of 12 nouns where the predominant 54 Proceedings of the 45th Annual ... predominant sense of the noun interest in the BC part of the DSO corpus has the meaning “a sense of concern with and curiosity about some- one or something”. In the WSJ part of the DSO cor- pus, the ... one of the reasons for the drop in accuracy is the dif- ference in sense priors (i.e., the proportions of the different senses of a word) between BC and WSJ. When the authors assumed they knew the...
  • 8
  • 363
  • 0
Báo cáo khoa học: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query" pdf

Báo cáo khoa học: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query" pdf

Ngày tải lên : 23/03/2014, 17:20
... L., and Palmer, M. 2006. An Empirical Study of the Behavior of Active Learning for Word Sense Disambiguation, Proc. of the main conference on Human Language Tech- nology Conference of the ... num- ber of its senses, the number of its data instances, the number of feature, and the percentage of positive sense instances for each data set. Assigning the correct labels of data instances ... for word sense disambiguation (WSD) in the do- main of web queries, where a complete set of ambiguous word senses are unknown. In this paper, we present a combination of active learning and...
  • 4
  • 441
  • 1
Báo cáo khoa học: "Assessing the Costs of Sampling Methods in Active Learning for Annotation" potx

Báo cáo khoa học: "Assessing the Costs of Sampling Methods in Active Learning for Annotation" potx

Ngày tải lên : 31/03/2014, 00:20
... accuracy) and cost of annotating a sentence depend not only on properties of the sentence but also on the order in which the items are annotated. Therefore, when evaluating the performance of an AL ... less human effort. Annotation cost is project dependent. For in- stance, annotators may be paid for the number of an- notations they produce or by the hour. In the context of parse tree annotation, ... employed to reduce the costs of corpus annotation (Engelson and Dagan, 1996; Ringger et al., 2007; Tomanek et al., 2007). With the assistance of AL, the role of the human oracle is either to label...
  • 4
  • 363
  • 0
Báo cáo khoa học: "Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study" potx

Báo cáo khoa học: "Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study" potx

Ngày tải lên : 08/03/2014, 04:22
... which lists seven senses for the noun channel. Two senses are lumped together if they are translated in the same way in Chinese. For example, sense 1 and 7 of channel are both translated as “频道” ... Evaluating Word Sense Disambiguation Systems (SENSEVAL-2), pages 1-5. Gerard Escudero, Lluis Marquez, and German Rigau. 2000. An empirical study of the domain dependence of supervised word sense disambiguation ... classifier to determine the most probable sense of w. of the senses of some nouns. For instance, no oc- currences of sense 5 of the noun circuit (racing circuit, a racetrack for automobile races)...
  • 8
  • 380
  • 0
Báo cáo khoa học: "A Method for Word Sense Disambiguation of Unrestricted Text" potx

Báo cáo khoa học: "A Method for Word Sense Disambiguation of Unrestricted Text" potx

Ngày tải lên : 08/03/2014, 06:20
... each sense of one of the words. Pick one of the words, say W2, and using WordNet, form a similarity list for each sense of that word. For this, use the words from the synset of each sense and ... one of these queries, we get the number of hits for each sense i of W2 and this provides a ranking of the m senses of W2 as they relate with 1411. Example The types of query that can be formed ... summation of the conceptual densities be- tween the sense i of the word X and all the senses of the words Y. The results are shown in the tables below where the conceptual den- sity calculated for...
  • 7
  • 378
  • 0
Graph drawing aesthetics and the comprehension of UML class diagrams: an empirical study pptx

Graph drawing aesthetics and the comprehension of UML class diagrams: an empirical study pptx

Ngày tải lên : 07/03/2014, 17:20
... own performance on an alternative layout. The practice diagrams and the randomisation of the order of presentation of the experimental diagrams for each subject helped counter the learning effect ... related to the nature of the task and the form of the experimental materials. Students said that they found the diagrams easier to understand if, when reading from top to bottom, the order of the classes ... 1994): they take as input a relational graph structure of objects and the relationships between them, and produce a visual representation of the information in diagrammatic form. The designers of these...
  • 9
  • 712
  • 0
Báo cáo khoa học: "An Empirical Study of the Influence of Argument Conciseness on Argument Effectiveness" docx

Báo cáo khoa học: "An Empirical Study of the Influence of Argument Conciseness on Argument Effectiveness" docx

Ngày tải lên : 08/03/2014, 05:20
... houses in the Hot List 3 is randomly assigned to one of the three conditions. Then, the subject interacts with the evaluation framework and at the end of the interaction measures of the argument ... Figure 1 for a simple value tree in the real estate domain). The arcs of the tree are weighted to represent the importance of the value of an objective in contributing to the value of its parent ... explicitly questioning the user at the end of the interaction about the rationale for her decision (Olso and Zanna 1991). This can provide valuable information on what aspects of the argument were...
  • 8
  • 402
  • 0
Báo cáo khoa học: "An Empirical Study of Active Learning with Support Vector Machines for Japanese Word Segmentation" pptx

Báo cáo khoa học: "An Empirical Study of Active Learning with Support Vector Machines for Japanese Word Segmentation" pptx

Ngày tải lên : 17/03/2014, 08:20
... A set of the attributes of , and is used to predict the label of the . The set consists of twenty attributes: ten for the char- acter type ( , , , , , , , , , ), and an- other ten for the character ... 500 and 1000. However, they have concluded that a larger pool is better than a smaller one because the final accuracy of the former is higher than that of the latter. 6 The variance of a set of ... An Empirical Study of Active Learning with Support Vector Machines for Japanese Word Segmentation Manabu Sassano Fujitsu Laboratories Ltd. 4-1-1, Kamikodanaka, Nakahara-ku, Kawasaki...
  • 8
  • 553
  • 0
Tài liệu Báo cáo khoa học: "An Empirical Study of Information Synthesis Tasks" doc

Tài liệu Báo cáo khoa học: "An Empirical Study of Information Synthesis Tasks" doc

Ngày tải lên : 20/02/2014, 15:20
... Pro- cessing Conference and the 1st Conference of the North American Chapter of the Association for Computational Linguistics, Seattle, WA, April. An Empirical Study of Information Synthesis Tasks Enrique ... selected the Spanish CLEF 2001-2003 news collection testbed (Peters et al., 2002), be- cause Spanish is the native language of the subjects recruited for the manual generation of reports. Out of the ... substantially bet- ter than ROUGE for a relevant class of topics. Section 3 describes these metrics and the experi- mental design to compare them; in Section 4, we an- alyze the outcome of the...
  • 8
  • 425
  • 0
Báo cáo khoa học: "An Empirical Study of Chinese Chunking" docx

Báo cáo khoa học: "An Empirical Study of Chinese Chunking" docx

Ngày tải lên : 08/03/2014, 02:21
... we compare the performance of the state -of- the- art ma- chine learning models. Then we propose two approaches in order to improve the performance of Chinese chunking. 1) We propose an approach ... bi-grams of words in an n window. • POS: uni-gram and bi-grams of POS in an n window. • WORD+ POS: Both the features of WORD and POS. where n is a predefined number to denote window size. For instance, ... WORD+ POS, ”P” refers to POS. We can see from the figure that WORD+ POS yielded bet- ter performance than POS in the most cases. How- ever, when the size of training data was small, the performance...
  • 8
  • 486
  • 0
accounting ethics and its important role for reduction of accounting fraud an empirical study in hanoi

accounting ethics and its important role for reduction of accounting fraud an empirical study in hanoi

Ngày tải lên : 13/03/2014, 14:20
... This reason can be considered causes of the financial crisis, and the subsequent events of corporate collapses and accounting fraud. An organization lacks of transparency in their financial report ... In the United State, after the bankruptcy of Enron and Worldcom, the - 40 - reason can bring a not good sufficient for them to present all the view of accountants in Hanoi. In additional, the ... mainly on the accounting information and financial statements. Therefore it is the responsibility of the accounting professionals to stick to the code of ethics, i.e. this is the role of ethics...
  • 44
  • 501
  • 0

Xem thêm