Báo cáo khoa học: "Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia" potx

Báo cáo khoa học: "Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia" potx

Báo cáo khoa học: "Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia" potx

... 2012. c 2012 Association for Computational Linguistics Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia Sungchul Kim ∗ POSTECH Pohang, South Korea subright@postech.ac.kr Kristina ... multi-lingual data with named entity tags. We build on prior work utiliz- ing Wikipedia metadata and show how to ef- fectively combine the weak an...

Ngày tải lên: 23/03/2014, 14:20

9 333 0
Báo cáo khoa học: "Arabic Named Entity Recognition: Using Features Extracted from Noisy Data" doc

Báo cáo khoa học: "Arabic Named Entity Recognition: Using Features Extracted from Noisy Data" doc

... standard sets of ACE 2003, ACE 2004 and ACE 2005. 4 The ACE data is annotated for many tasks: Entity Detection and Tracking (EDT), Relation Detection and Recognition (RDR), Event Detection and ... train, de- velopment, and test used in (Benajiba et al., 2008). 3.2 Parallel Data Most of the hand-aligned Arabic-English parallel data used in our experiments is from the...

Ngày tải lên: 23/03/2014, 16:20

5 249 0
Báo cáo khoa học: "Japanese Named Entity Recognition based on a Simple Rule Generator and Decision Tree Learning" pdf

Báo cáo khoa học: "Japanese Named Entity Recognition based on a Simple Rule Generator and Decision Tree Learning" pdf

... of training data and that it improves readability. 1 Introduction Named entity (NE) recognition is a task in which proper nouns and numerical informa- tion in a document are detected and classi- fied ... candidates start at the same point, their ending points are compared and the longest candidate is selected. Therefore, the candidates overlapping the selected candidate are rem...

Ngày tải lên: 08/03/2014, 05:20

8 530 0
Báo cáo khoa học: "Bootstrapping Named Entity Recognition with Automatically Generated Gazetteer Lists" doc

Báo cáo khoa học: "Bootstrapping Named Entity Recognition with Automatically Generated Gazetteer Lists" doc

... of gazetteer lists from unlabeled data; and the building of a Named Entity Recognition system with labeled and unlabeled data. 1 Introduction Automatic information extraction and information retrieval ... evaluation for the Named Entity detection and classification tasks with and without labeled data are in Sections 4 and 5. We conclude in Section 6. 2 The NER how...

Ngày tải lên: 24/03/2014, 03:20

7 217 0
Báo cáo khoa học: "Exploiting Named Entity Taggers in a Second Language" ppt

Báo cáo khoa học: "Exploiting Named Entity Taggers in a Second Language" ppt

... Evaluation contest on named entity recognition for Portuguese”. This corpus contains newspaper articles and consists of 8,551 words with 648 NEs. 4 Two-step Named Entity Recognition Our approach ... Mexico Abstract In this work we present a method for Named Entity Recognition (NER). Our method does not rely on complex linguis- tic resources, and apart from a hand coded...

Ngày tải lên: 17/03/2014, 06:20

6 397 0
Tài liệu Báo cáo khoa học: Evidence for interactions between domains of TatA and TatB from mutagenesis of the TatABC subunits of the twin-arginine translocase docx

Tài liệu Báo cáo khoa học: Evidence for interactions between domains of TatA and TatB from mutagenesis of the TatABC subunits of the twin-arginine translocase docx

... activity, and the data confirm this result with no export detected using either assay system. The K73A and Y154S mutants are active, as expected from previ- ous studies [26] and so too are the D211A and ... and immunoblotted using antibodies to TatA, TatB, the Strep-tag II on TatC (monoclonal antibody from IBA, Stuttgart, Germany) or green fluorescent protein (GFP) using a...

Ngày tải lên: 19/02/2014, 17:20

15 532 0
Tài liệu Báo cáo khoa học: "Modeling Morphologically Rich Languages Using Split Words and Unstructured Dependencies" docx

Tài liệu Báo cáo khoa học: "Modeling Morphologically Rich Languages Using Split Words and Unstructured Dependencies" docx

... 2 gives the total log-probability (using log 2 ) for the split and unsplit datasets using n-gram models of different order. We compute the perplexity of the two datasets using a common denomina- tor: ... into their stem and suffix forms is beneficial when the split is performed using a morphologi- cal analyzer and (ii) allowing the model to choose stem and suffix dependencies sepa...

Ngày tải lên: 20/02/2014, 09:20

4 325 0
Báo cáo khoa học: Modeling hydration mechanisms of enzymes in nonpolar and polar organic solvents potx

Báo cáo khoa học: Modeling hydration mechanisms of enzymes in nonpolar and polar organic solvents potx

... are immiscible with water and that have low polar characteris- tics (hexane, diisopropyl ether, and 3-pentanone), and those that have polar properties and are water miscible (ethanol and acetonitrile). System ... organic molecules in the region away from the protein (beyond 0.25 nm from the enzyme surface). Table S2. Parameters and SEs of the water residence time fitted data...

Ngày tải lên: 16/03/2014, 10:20

13 433 0
Báo cáo khoa học: "Multilingual Document Clustering: an Heuristic Approach Based on Cognate Named Entities" docx

Báo cáo khoa học: "Multilingual Document Clustering: an Heuristic Approach Based on Cognate Named Entities" docx

... experiments and the results. Finally, Section 5 summarizes the conclusions and the future work. 2 Related Work MDC is normally applied with parallel (Silva et. al., 2004) or comparable corpus (Chen and ... types of clustering and documents; however, for other types of documents or clustering it could not be so relevant and even it could be a source of noise. In this work we dealt...

Ngày tải lên: 31/03/2014, 01:20

8 421 0
Tài liệu Báo cáo khoa học: Structural bases for recognition of Anp32⁄LANP proteins doc

Tài liệu Báo cáo khoa học: Structural bases for recognition of Anp32⁄LANP proteins doc

... 2549 residues Leu128 to Asp146, and includes h 6 , which belongs to the fifth LRR, and the short strand b 6 , which runs parallel to b 5 and is antiparallel to b 7 . The solution and the crystal structures ... restraints from searching a database for chemical shift and sequence homology. J Biomol NMR 13, 289–302. 49 Ottinger M, Delaglio F & Bax A (1998) Measurement of J and...

Ngày tải lên: 18/02/2014, 17:20

13 667 0
w