bootstrapping named entity recognition by means of active machine learning

Tài liệu Báo cáo khoa học: "Inducing Gazetteers for Named Entity Recognition by Large-scale Clustering of Dependency Relations" ppt

Tài liệu Báo cáo khoa học: "Inducing Gazetteers for Named Entity Recognition by Large-scale Clustering of Dependency Relations" ppt

... English. 408 Proceedings of ACL-08: HLT, pages 407–415, Columbus, Ohio, USA, June 2008. c 2008 Association for Computational Linguistics Inducing Gazetteers for Named Entity Recognition by Large-scale Clustering of ... applications in speech recognition. Proceedings of the IEEE, 77(2):257–286. E. Riloff and R. Jones. 1999. Learning dictionaries for information extraction by multi-level bootstrapping. In 16th ... of a gazetteer and its effect. We think this is one of the important directions of future research. Parallelization has recently regained attention in the machine learning community because of...

Ngày tải lên: 20/02/2014, 09:20

9 429 0
Tài liệu Báo cáo khoa học: "Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition" pdf

Tài liệu Báo cáo khoa học: "Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition" pdf

... label of a named entity is “O”, which indicates a non -named entity. For 98.0% of the named entities in the training data of the shared task in the 2004 JNLPBA, the label of the preced- ing entity ... End- Word” capture the tendency of the length of a named entity. “Count feature” captures the ten- dency for named entities to appear repeatedly in the same sentence. “Preceding Entity and Prev Word” are ... N is the length of sentence and K is the size of label set. And that of training in first order semi-CRFs is O(K 2 LN). The increase of the cost is used to transfer non-adjacent entity information. To...

Ngày tải lên: 20/02/2014, 12:20

8 527 0
Báo cáo khoa học: "Joint Inference of Named Entity Recognition and Normalization for Tweets" doc

Báo cáo khoa học: "Joint Inference of Named Entity Recognition and Normalization for Tweets" doc

... number of organizations that are incorrectly labeled as PERSON by S BR , are now correctly recognized by our method. 532 by recognition errors. Another challenge of NEN is the dearth of information ... misconceptions in named entity recognition. In CoNLL, pages 147–155. Alan Ritter, Sam Clark, Mausam, and Oren Etzioni. 2011. Named entity recognition in tweets: An ex- perimental study. In Proceedings of the ... nature of tweets, there are rich variations of named enti- ties in them. According to our investigation on the data set provided by Liu et al. (2011), every named entity in tweets has an average of...

Ngày tải lên: 07/03/2014, 18:20

10 444 0
Báo cáo khoa học: "Incorporating speech recognition confidence into discriminative named entity recognition of speech data" ppt

Báo cáo khoa học: "Incorporating speech recognition confidence into discriminative named entity recognition of speech data" ppt

... 1999. A Maximum Entropy Ap- proach to Named Entity Recognition. Ph.D. thesis, New York University. Hai Leong Chieu and Hwee Tou Ng. 2003. Named en- tity recognition with a maximum entropy approach. In ... NER NER is a kind of chunking problem that can be solved by classifying words into NE classes that consist of name categories and such chunk- ing states as PERSON-BEGIN (the beginning of a person’s ... Conclusion We proposed a method for NER of speech data that incorporates ASR confidence as a feature of discriminative NER, where the NER model 623 by a set of binary values, the same as with an SVM-based...

Ngày tải lên: 17/03/2014, 04:20

8 311 0
Tài liệu Báo cáo khoa học: "Mining Wiki Resources for Multilingual Named Entity Recognition" pdf

Tài liệu Báo cáo khoa học: "Mining Wiki Resources for Multilingual Named Entity Recognition" pdf

... this paper, we describe a system by which the multilingual characteristics of Wikipedia can be utilized to annotate a large corpus of text with Named Entity Recognition (NER) tags requiring ... detail the process by which we use the Category structure inherent to Wikipedia to determine the named entity type of a proposed entity. We further describe the methods by which English language ... trained on up to 40,000 words of human-annotated newswire. 1 Introduction Named Entity Recognition (NER) has long been a major task of natural language processing. Most of the research in the field...

Ngày tải lên: 20/02/2014, 09:20

9 429 1
Tài liệu Báo cáo khoa học: "Acceptability Prediction by Means of Grammaticality Quantification" doc

Tài liệu Báo cáo khoa học: "Acceptability Prediction by Means of Grammaticality Quantification" doc

... adequacy of the PG grammatical- ity indices to the measurements was investigated by means of resultant analysis. We adapted the parameters of the model in order to arrive at a good fit based on half of ... grammaticality of the input. In other words, instead of deciding on the grammaticality of the input, we can give an indica- tion of its grammaticality, quantified on the basis of the description of the ... Const NP {Det, AP, N, Pro} (set of possible constituents of NP) In PG, each category of the grammar is de- scribed with a set of properties. A grammar is then made of a set of properties. Parsing an...

Ngày tải lên: 20/02/2014, 11:21

8 303 0
Tài liệu Báo cáo khoa học: "The Multilingual Named Entity Recognition Framework" docx

Tài liệu Báo cáo khoa học: "The Multilingual Named Entity Recognition Framework" docx

... this kind of systems, a set of rules is automatically learned and revised by an expert. An alternative can be the dynamic extension of an existing set of core rules previously defined by the expert, ... entropy approach for named entity recognition. PhD Thesis, New York University. Collins M. and Singer Y. (1999) Unsupervised models for named entity classification. In Proceedings of EMNLP/WVLC, 1999, ... language technology is not much developed for most of them. This has a big consequence for named entity recognition: for certain languages like most of the European languages, we benefit from already...

Ngày tải lên: 22/02/2014, 02:20

4 279 0
Pricing Portfolio Credit Derivatives by Means of Evolutionary Algorithms doc

Pricing Portfolio Credit Derivatives by Means of Evolutionary Algorithms doc

... assistance of the Deutsche Forschungsgemeinschaft by funding my research at the University of Tübingen, and of the Stiftung Landesbank Baden-Württemberg by supporting the publication of this dissertation. ... the exchange of credit risk is an interesting means of risk management, as long as it allows for maintenance of the client relationship. Eliminating the credit risk of a client by simply selling ... approach, which incorporates dependence by means of copula func- tions, allows the modeling of the dependence structure to be separated from the modeling of individual defaults. Li (2000) introduced...

Ngày tải lên: 07/03/2014, 19:20

176 378 0
Báo cáo khoa học: "Japanese Named Entity Recognition based on a Simple Rule Generator and Decision Tree Learning" pdf

Báo cáo khoa học: "Japanese Named Entity Recognition based on a Simple Rule Generator and Decision Tree Learning" pdf

... memt, January. Manabu Sassano and Takehito Utsuro. 2000. Named entity chunking techniques in supervised learning for Japanese named entity recognition. In Proceed- ings of the International Conference on Computa- tional ... tree learning for classification of a noun phrase by assuming that named entities are noun phrases. Gallippi (1996) employs hundreds of hand-crafted templates as features for decision tree learning. ... rule is refined by decision tree learning. By applying the refined recognition rules to a new document, we get NE candidates. Then, non- overlapping candidates are selected by a kind of longest match...

Ngày tải lên: 08/03/2014, 05:20

8 530 0
Báo cáo khoa học: "The Multilingual Named Entity Recognition Framework" ppt

Báo cáo khoa học: "The Multilingual Named Entity Recognition Framework" ppt

... resources and tools for named entity recognition. A team of computational linguist students develops this The members of the INaLCO Named Entity Group are: A. Acoulon, C. Avaux, L. Beroff-Beneat-, A. ... this kind of systems, a set of rules is automatically learned and revised by an expert. An alternative can be the dynamic extension of an existing set of core rules previously defined by the expert, ... interesting classification of named entity recognition systems. • Manually created rule-based systems. In this kind of system, developers initially elaborate a set of patterns that will be applied...

Ngày tải lên: 08/03/2014, 21:20

4 283 0
Báo cáo khoa học: "A Framework for Unifying Named Entity Recognition and Disambiguation Extraction Tools" pot

Báo cáo khoa học: "A Framework for Unifying Named Entity Recognition and Disambiguation Extraction Tools" pot

... the comparison of the perfor- mance of these services as well as their pos- sible combination. We address this problem by proposing NERD, a framework which unifies 10 popular named entity extractors available ... extract the list of Named Entity, their classification and the URIs that dis- ambiguate these entities. The main purpose of this interface is to enable a human user to assess the quality of the extraction ... Evaluating Named Entity Recognition Tools in the Web of Data. 10 th International Semantic Web Conference (ISWC’11), Demo Session, Bonn, Ger- many. Rizzo G. and Troncy R. 2011. NERD: Evaluat- ing Named...

Ngày tải lên: 08/03/2014, 21:20

4 466 0
Báo cáo khoa học: " Named Entity Recognition using an HMM-based Chunk Tagger" pptx

Báo cáo khoa học: " Named Entity Recognition using an HMM-based Chunk Tagger" pptx

... Proceedings of the 40th Annual Meeting of the Association for attractive in that it is trainable and adaptable and the maintenance of a machine- learning system is much cheaper than that of a rule-based ... the performance of a machine- learning system is always poorer than that of a rule-based one by about 2% [Chinchor95b] [Chinchor98b]. This may be because current machine- learning approaches ... gazetteers: lists of names of persons, organizations, locations and other kinds of named entities. This sub-feature can be determined by finding a match in the gazetteer of the corresponding...

Ngày tải lên: 17/03/2014, 08:20

8 473 1
Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

... Ng Department of Computer Science School of Computing National University of Singapore 3 Science Drive 2 Singapore 117543 nght@comp.nus.edu.sg Abstract This paper describes how a machine- learning named entity ... the named entity task consists of labeling named entities with the classes PERSON, ORGANIZATION, LOCA- TION, DATE, TIME, MONEY, and PERCENT. We conducted experiments on upper case named entity recognition, ... work on un- supervised learning for mixed case named entity recognition (Collins and Singer, 1999; Cucerzan and Yarowsky, 1999). Collins and Singer (1999) investigated named entity classification...

Ngày tải lên: 17/03/2014, 08:20

8 285 0
Báo cáo khoa học: "Named Entity Recognition for Catalan Using Spanish Resources" potx

Báo cáo khoa học: "Named Entity Recognition for Catalan Using Spanish Resources" potx

... Language-Independent Named Entity Recognition. In Proceedings of CoNLL-2002, pages 155-158. Taipei, Taiwan. E. Tjong Kim Sang. 2002b. Memory-Based Named Entity Recognition. In Proceedings of CoNLL-2002, pages ... Sassano. 2002. Learning with Multiple Stacking for Named Entity Recognition. In Proceedings of CoNLL-2002, pages 191-194. Taipei, Taiwan. R. Weischedel. 1995. BBN: Description of the PLUM System ... Barcelona Icarreras,lluism,padroWsi.upc.es Abstract This work studies Named Entity Recog- nition (NER) for Catalan without mak- ing use of annotated resources of this language. The approach presented is based on machine learning techniques and...

Ngày tải lên: 17/03/2014, 22:20

8 288 0
Báo cáo khoa học: "Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia" potx

Báo cáo khoa học: "Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia" potx

... source and target entity by computing Levenshtein and other distance metrics between the source entity and the closest transliteration of the target (out of a 10-best list of transliterations). ... foreign candidate entity strings (sequences of tokens) and best corre- sponding English candidate entities. The candidate English entities are defined by the union of entities proposed by the Wiki-based ... followed the approach of Richman and Schone (2008) to derive named entity annotations of both English and foreign phrases in Wikipedia, using Wikipedia metadata. The following sources of in- formation...

Ngày tải lên: 23/03/2014, 14:20

9 333 0

Bạn có muốn tìm thêm với từ khóa:

w