... cross- validation of CRL NE data. The ME system at- tained 82.77% for and 82.67% for . The RG+DT system attained 84.10% for , 84.02% for , and 84.03% for . (Even if we do not use C4.5, RG+DT CRL NE all GENERAL ... answer your question. NE recognition is essential for finding possible answers from documents. Although it is easy to build an NE recognition system with mediocre per...
Ngày tải lên: 08/03/2014, 05:20
... corresponding foreign entity for a given English entity. The first oracle ORACLE1 has access to the gold- standard English entities and gold-standard word alignments among English and foreign words. For each ... amounts of information and one language might use more detail than the other. The other is that the same information might be ex- pressed using a named entity in one langua...
Ngày tải lên: 23/03/2014, 14:20
Báo cáo khoa học: "Arabic Named Entity Recognition: Using Features Extracted from Noisy Data" doc
... Association for Computational Linguistics Arabic Named Entity Recognition: Using Features Extracted from Noisy Data Yassine Benajiba 1 Imed Zitouni 2 Mona Diab 1 Paolo Rosso 3 1 Center for Computational ... Valencia {ybenajiba,mdiab}@ccls.columbia.edu, izitouni@us.ibm.com, prosso@dsic.upv.es Abstract Building an accurate Named Entity Recognition (NER) system for languages wi...
Ngày tải lên: 23/03/2014, 16:20
Báo cáo khoa học: "Bootstrapping Named Entity Recognition with Automatically Generated Gazetteer Lists" doc
... evaluation for the Named Entity detection and classification tasks with and without labeled data are in Sections 4 and 5. We conclude in Section 6. 2 The NER how to A Named Entity Recognition ... tasks. For all of them crucial role plays the feature extraction and selection module, which leads to optimal classifier performance. This section describes the features used for our N...
Ngày tải lên: 24/03/2014, 03:20
Báo cáo khoa học: "Exploiting Named Entity Taggers in a Second Language" ppt
... what we called Named Entity Classification (NEC). We explain the two procedures in the following subsections. 4.1 Named Entity Delimitation We used the BIO scheme for delimiting named enti- ties. ... variety of documents and languages for which it is desirable to have these tools available. In this work we have presented a method for per- forming named entity recognition....
Ngày tải lên: 17/03/2014, 06:20
Tài liệu Báo cáo khoa học: "Using Cross-Entity Inference to Improve Event Extraction" docx
... in- formation, actually contain sufficient clues for event detection. It is only based on the premise that we know the backgrounds of the entities before- hand. For instance, if we knew the entity ... cross -entity inference for event- type identification is not the only use of entity- type consistency. As we shall describe below, we can make use of it at all issues of event ext...
Ngày tải lên: 20/02/2014, 04:20
Tài liệu Báo cáo khoa học: "Improving Automatic Speech Recognition for Lectures through Transformation-based Rules Learned from Minimal Data" ppt
... evaluation: WER values for instructor K using the WSJ-5K language model. hours 4 for a threshold of 2 when training over tran- scripts for one third of a lecture. Therefore, it can be concluded ... train an ASR system for the other half or for when the course is next offered, and still results in signifi- cant WER reductions. And yet even in this sce- nario, the business case for m...
Ngày tải lên: 20/02/2014, 07:20
Báo cáo khoa học: "Example-Based Metonymy Recognition for Proper Nouns" pot
... metonymical cate- gory they belonged to. For the country names, Markert and Nissim distinguished between place -for- people, place -for- event and place -for- product. For the organi- zation names, the most ... metonymies are organization -for- members and organization -for- product. In addition, Markert and Nissim used a label mixed for examples that had two readings, and othermet for...
Ngày tải lên: 24/03/2014, 03:20
Tài liệu Báo cáo khoa học: "Generating Impact-Based Summaries for Scientific Literature" docx
... citation context and then for each extracted sentence find a similar one in the original pa- per. Unfortunately, we did not have time to test this approach before the deadline for the camera-ready ... sen- sitivity of performance to these parameters. In gen- eral, for a wide range of values of these parameters, the performance is relatively stable and near opti- mal. Specifically, the per...
Ngày tải lên: 20/02/2014, 09:20
Tài liệu Báo cáo khoa học: "Mining Wikipedia Revision Histories for Improving Sentence Compression" docx
... of them for such compres- sions/expansions. We make the simplifying assump- tion that all such edits also retain the core mean- ing of the sentence, and are therefore valid training data for our ... for glare protection] is effective and will help if your office has the fluorescent-light overkill [that ’s typical in offices]. (4) Prices range from $5,000 [for a microvax 2000] to $179,000...
Ngày tải lên: 20/02/2014, 09:20