... another cluster as they refer to another person, the Basketball Player Michael Jordan. To a human, named entity disambiguation is usually not a difficult task as he can make decisions depending ... semantic relatedness measure for named entity disambiguation. Because the key problem of named entity disambiguation is to measure the similarity between name observations, we integrate ... it to the named entity list extracted using the OpenCalais API, which contains more than 30 types of named entities, such as Person, Organization and Award; to find whether a N-gram is a...
Uploaded: 16/03/2014, 23:20
... modest recall. As a result, large quantities of NE instances are automatically acquired. An automatically annotated NE corpus can then be constructed by extracting the tagged instances plus ... recall. Then, these rules are applied to a large raw corpus to automatically generate a tagged corpus. Finally, an HMM-based NE tagger is trained using this corpus. There is no iterative learning ... containsDigitAndAlpha, containsDigitAndDash, containsDigitAndSlash, containsDigitAndComma, containsDigitAndPeriod, otherNum, allCaps, capPeriod, initCap, lowerCase, other. 6 Benchmarking and...
Uploaded: 20/02/2014, 16:20
Scientific report: "A Framework for Unifying Named Entity Recognition and Disambiguation Extraction Tools" pot
... namely: authentication, scraping, extraction, ontology mapping, store, statistics and web. The authentication enables to log in with an OpenID provider and subsequently attaches all analysis ... document that will be analyzed and optionally an identification of the user for recording and sharing the analysis. 2 Framework NERD is a web application plugged on top of various NLP tools. Its architecture ... order to extract its main textual content. Starting from the raw text, it drives one or several tools to extract the list of Named Entity, their classification and the URIs that disambiguate...
Uploaded: 08/03/2014, 21:20
Scientific report: "Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx
... easily applicable. This way of teaching a weaker classifier can also be used in other domains, where the task is to infer , and an abundance of unlabeled data is available. If one possesses a ... Case NER The features we used can be divided into 2 classes: local and global. Local features are features that are based on neighboring tokens, as well as the token itself. Global features are ... set to 0. Case and Zone: If the token starts with a capital letter (initCaps), then an additional feature (initCaps, zone) is set to 1. If it is made up of all capital letters, then (allCaps,...
Uploaded: 17/03/2014, 08:20
a simple method to synthesize nanowires titanium dioxide from layered titanate particles
... Among them, nanoscale TiO2 is particularly interesting because they have large surface area, leading to a higher potential of application in environment purification, gas sensor, and photovoltaic ... method to synthesize nanowires titanium dioxide from layered titanate particles Mingdeng Wei *, Yoshinari Konishi, Haoshen Zhou, Hideki Sugihara, Hironori Arakawa National Institute of Advanced ... TiO2 (ST-01, Ishihara Sangyo Kaisha LTD.) in the stoichiometrical ratio 1:3. The powders were mixed together and repeatedly ground in an agate mortar, and calcined at 1000 °C for 2 h in the air. Synthesis...
Uploaded: 19/03/2014, 16:47
Scientific report: "Exploring Entity Relations for Named Entity Disambiguation" pot
... and geographic locations) plays an important role in various natural language processing and information retrieval tasks. The goal of Named Entity Disambiguation (NED) is to label a surface form ... Computational Linguistics Exploring Entity Relations for Named Entity Disambiguation Danuta Ploch DAI-Labor, Technische Universität Berlin Berlin, Germany danuta.ploch@dai-labor.de Abstract Named ... returned for each candidate as a disambiguation feature. 3.4 Candidate classifier and NIL detection We cast NED as a supervised classification task and use two binary SVM classifiers (Vapnik, 1995)....
Uploaded: 23/03/2014, 16:20
Scientific report: "Named Entity Disambiguation in Streaming Data" pdf
Uploaded: 30/03/2014, 17:20
Chemistry report: "A Novel Method to Fabricate Silicon Nanowire p–n Junctions by a Combination of Ion Implantation and in-situ Doping" docx
Uploaded: 22/06/2014, 00:20
Chemistry report: "A Simple Method to Synthesize Cadmium Hydroxide Nanobelts" pptx
Uploaded: 22/06/2014, 01:20
Chemistry report: "Research Article A New Method to Represent Speech Signals Via Predefined Signature and Envelope Sequences" pptx
Uploaded: 22/06/2014, 23:20
Scientific report: "A Method for Effective and Scalable Mining of Named Entity Transliterations from Large Comparable Corpora" doc
Uploaded: 24/03/2014, 03:20
Development of a method to measure consumer emotions associated with foods
... techniques which are appropriate for the academic laboratory research might not be appropriate for commercial settings of consumer laboratories. Academic laboratory research typically uses student ... chocolate, vanilla ice cream, fried chicken and mashed potatoes and gravy. Pizza and chocolate produced the strongest emotions based on Analysis of Variance. The terms active, adventurous, affectionate, ... the laboratory (CLT) and also internet testing. Thomson (2008) has also argued that concepts such as satisfaction are more appropriate than simple acceptance for commercial products, and that both...
Uploaded: 03/04/2013, 21:07
Document: Scientific report: "A Novel Feature-based Approach to Chinese Entity Relation Extraction" ppt
... incorporated the base phrase chunking information and semi-automatically collected country name list and personal relative trigger word list. Jiang and Zhai (2007) then systematically explored a ... to investigate how to find an approach that is particularly appropriate for Chinese. 3 A Chinese Relation Extraction Model Due to the aforementioned reasons, entity relation extraction in ... paper, we study a feature-based approach that basically integrates entity related information with context information. 3.1 Classification Features The classification is based on the following...
Uploaded: 20/02/2014, 09:20
Report: "A method to construct flood damage map with an application to Huong River basin, in Central Vietnam" pdf
...
Uploaded: 05/03/2014, 16:20
Scientific report: "Japanese Named Entity Recognition based on a Simple Rule Generator and Decision Tree Learning" pdf
... Very Large Corpora. Kiyotaka Uchimoto, Qing Ma, Masaki Murata, Hiromi Ozaku, Masao Utiyama, and Hitoshi Isahara. 2000. Named entity extraction based on a maximum entropy model and transformation ... <ORGANIZATION>OO-SAKA-TO-YO-TA</ORGANIZATION> (= Osaka Toyota) because Japanese POS taggers know that TO-YO-TA is an organization name (a kind of proper noun). *:*:location-name, ... characters are used in Japanese: hiragana, katakana, kanji, symbols, numbers, and letters of the Roman alphabet. We use 17 character types for words, e.g., single-kanji, all-kanji, all-katakana,...
Uploaded: 08/03/2014, 05:20
Scientific report: "A Method for Word Sense Disambiguation of Unrestricted Text" potx
... defined as "any physical damage" (hypernym: health problem). This is a typical example of a mismatch caused by the fine granularity of senses in WordNet which translates into a human ... Mihalcea and D.I. Moldovan. 1999. An automatic method for generating sense tagged corpora. In Proceedings of AAAI-99, Orlando, FL, July. (to appear). G. Miller, M. Chodorow, S. Landes, ... Computational Linguistics. J. Stetina, S. Kurohashi, and M. Nagao. 1998. General word sense disambiguation method based on a full sentential context. In Usage of WordNet in Natural Language...
Uploaded: 08/03/2014, 06:20
Scientific report: "Automatic Discovery of Named Entity Variants – Grammar-driven Approaches to Non-alphabetical Transliterations" pptx
... perhaps due to the fact that the transliteration forms in a non-alphabetic language such as Chinese are opaque and not easy to compare. On the other hand, there is often more than one way to transliterate ... script into a phonological representation during the pairs extraction phase and then these representations are compared and similarity scores are given to all pair candidates. A lot of Chinese characters ... Approaches to Non-alphabetical Transliterations Chu-Ren Huang Institute of Linguistics Academia Sinica, Taiwan churenhuang@gmail.com Petr Šimon Institute of Linguistics Academia Sinica, Taiwan sim@klubko.net Shu-Kai...
Uploaded: 17/03/2014, 04:20
Scientific report: "Exploiting Named Entity Taggers in a Second Language" ppt
... natural language analyzers. In Proceedings of LREC’02, Las Palmas de Gran Canaria, Spain. Xavier Carreras, Lluís Màrquez, and Lluís Padró. 2003a. Named entity recognition for Catalan ... Morgan Kaufmann Series in Data Management Systems. Morgan Kaufmann. Tong Zhang and David Johnson. 2003. A robust risk minimization based named entity recognition system. In Walter Daelemans and ... effort. Our goal is to present a method that will facilitate the task of increasing the coverage of named entity extractor systems. In this setting, we assume that we have available an NE extractor system...
Uploaded: 17/03/2014, 06:20