word clustering and disambiguation based on co occurrence data

Báo cáo khoa học: "Word Clustering and Disambiguation Based on Co-occurrence Data" pdf

Báo cáo khoa học: "Word Clustering and Disambiguation Based on Co-occurrence Data" pdf

... o.g CovefarJe Figure 4: Compound noun disambiguation results We next conducted structural disambiguation on the test data, using the probabilities estimated based on 2D -Clustering and Brown. ... the corpus, and extracted compound noun doubles con- taining those nouns as training data and compound noun triples containing those nouns as test data. There were 8,604 training data and ... state-of-the-art disambiguation method of (Brill and Resnik, 1994). 1 Introduction We address the problem of clustering words, or that of constructing a thesaurus, based on co- occurrence data. We...

Ngày tải lên: 17/03/2014, 07:20

7 328 0
Báo cáo khoa học: "Word Clustering and Word Selection based Feature Reduction for MaxEnt based Hindi NER" ppt

Báo cáo khoa học: "Word Clustering and Word Selection based Feature Reduction for MaxEnt based Hindi NER" ppt

... w i+2 position if w i is a NE. Note that only a subset of the lexicon are context words. For all the context words, its N weight is calculated as the ratio between the occurrence of the word as a context ... Message Understanding Conference. Li W and McCallum A. 2003. Rapid Development of Hindi Named Entity Recognition using Conditional Random Fields and Feature Induction. ACM Trans- actions on Asian Language ... word and its total number of occurrence in the corpus. The context words having the higher N weight are considered as important words for NER. For our experiments we have considered top 500 words...

Ngày tải lên: 08/03/2014, 01:20

8 444 0
Báo cáo khoa học: "Dependency Parsing and Projection Based on Word-Pair Classification" ppt

Báo cáo khoa học: "Dependency Parsing and Projection Based on Word-Pair Classification" ppt

... more than 2 commas be- tween word i and word j? • Is there a comma immediately following the first of word i and word j? • Is there a comma immediately preceding the second of word i and word j? Besides ... for convenience. A Collins distance comprises the answers of 6 ques- tions: • Does word i precede or follow word j? • Are word i and word j adjacent? • Is there a verb between word i and word ... our model and the transition -based category is that they all need a classifier to perform classification conditioned on a certain configuration. However, they differ from each other in the classification...

Ngày tải lên: 16/03/2014, 23:20

9 328 0
Tài liệu Báo cáo khoa học: "Cross-lingual Parse Disambiguation based on Semantic Correspondence" pptx

Tài liệu Báo cáo khoa học: "Cross-lingual Parse Disambiguation based on Semantic Correspondence" pptx

... themselves composed of re- lations. Each line in Figure 1 corresponds to one EP. Relations are the elementary building blocks of the EDG, and loosely correspond to words of the surface string. EPs consist ... rela- tions (corresponding to quantifiers), or a predicate- argument structure which is composed of several re- lations. During alignment, we only consider non- atomic EPs, as quantifiers should be considered ... limited to align two languages only. The algorithm is very flexible, and allows for straightforward exploration of different numbers and combinations of languages. 6 Conclusion and Future Work Translating...

Ngày tải lên: 19/02/2014, 19:20

5 458 0
Tài liệu Báo cáo khoa học: "Going Beyond AER: An Extensive Analysis of Word Alignments and Their Impact on MT" pdf

Tài liệu Báo cáo khoa học: "Going Beyond AER: An Extensive Analysis of Word Alignments and Their Impact on MT" pdf

... S U and S G . Note that S I , S A and S B use only a small portion of the phrases with more than 3 words although the majority of the phrase table contains phrases with more than 3 words on one ... Treebank Table 1: Test and Training Data Used for Experiments as translation probabilities and lexical weights), and the decoder’s job is to choose the correct phrases based on those scores using a log-linear model. 3 ... long-standing questions about the value of alignment in the context of MT. We first evaluate 5 different word alignments intrinsically, using: (1) community-standard metrics—precision, recall and...

Ngày tải lên: 20/02/2014, 11:21

8 508 0
The Breast Cancer Epidemic: Modeling and Forecasts Based on Abortion and Other Risk Factors potx

The Breast Cancer Epidemic: Modeling and Forecasts Based on Abortion and Other Risk Factors potx

... Team, Information Services Division (ISD), NHS National Services, Scotland. Computing was done by Andrew Chan and Lee Young. none disclosed. Trends in Cancer Survival in Scotland 1971-1995 REFERENCES 1 Goldacre ... Population Research Institute (PAPRI), 35 Canonbury Road, London N1 2DG, UK. Contact: papriresearch@btconnect.com. Particular thanks are due to the charities LIFE and The Medical Education Trust, ... and abortion: collaborative reanalysis of data from 53 epidemiological studies. 2004;363:1007-1016. Erlandsson G, Montgomery SM, Cnattingius S, Ekborn A. Abortions and breast cancer: record based...

Ngày tải lên: 06/03/2014, 02:21

7 406 0
Báo cáo khoa học: "Structural Disambiguation Based on Reliable Estimation of Strength of Association" potx

Báo cáo khoa học: "Structural Disambiguation Based on Reliable Estimation of Strength of Association" potx

... tively contain Wl and w2, f(C1) and f(C2) the numbers of occurrences of all the words included in the word classes C1 and C2, and f(C1, C2) is the number of co- occurrences of the word classes ... modificand relation. Experiment Disambiguation of dependency re- lations was done using 75 anlbiguous con- structions from Fukumoto (1992). Solving the ambiguity in the constructions involves ... nouns in the training data. The 'confidence' values correspond to a binomial dis- tribution and are given only as a reference s. confidence t coverage precision success 100% 68.0%...

Ngày tải lên: 17/03/2014, 07:20

7 284 0
Báo cáo " All-optical NAND and AND gates based on 3x3 general interference multimode interference couplers" pdf

Báo cáo " All-optical NAND and AND gates based on 3x3 general interference multimode interference couplers" pdf

... All-optical NAND and AND Gates based on 3x3 GI-MMI couplers Theory: The conventional MMI coupler has a structure consisting of a homogeneous planar multimode waveguide region connected to a ... NAND and AND gates based on 3x3 general interference multimode interference couplers Le Trung Thanh* Department of Telecommunication Engineering, University of Transport and Communications ... order to obtain a nonlinear interaction. In addition, since the nonlinear coefficient is often small, long interaction lengths are generally required. Moreover, devices based on nonlinear effects...

Ngày tải lên: 22/03/2014, 11:20

7 307 1
Báo cáo khoa học: "Information Classification and Navigation Based on 5W1H of the Target Information" doc

Báo cáo khoa học: "Information Classification and Navigation Based on 5W1H of the Target Information" doc

... classification I Figure 1: 5WIH classification and navigation 3 5WIH Classification and Navigation Conventional keyword -based retrieval does not con- sider logical relationships between keywords. ... events and arranging 96.10 NEC adjusts semiconductor production downward. 96.12 97.1 97.4 97.5 NEC postpones semiconductor production plant construction. NEC shifts semiconductor production ... article containing "NEC formed a technical alliance with B company, and B company produced semiconductor X." Based on 5WlH information, we propose a 5WlH classification and navigation...

Ngày tải lên: 31/03/2014, 04:20

7 638 0
Báo cáo hóa học: " A broadly applicable method to characterize large DNA viruses and adenoviruses based on the DNA polymerase gene" doc

Báo cáo hóa học: " A broadly applicable method to characterize large DNA viruses and adenoviruses based on the DNA polymerase gene" doc

... proceeding to the virus concentration step. The DNase variation on the protocol involved adding 20 µl of 10 × buffer and 100 µl of RQ1 DNAse (Promega) to the 80 µl of concentrated virus and incubating ... Innotech Corporation, San Leandro, CA). Bands of interest were excised from the gel and the DNA was recovered using GenElute Agarose Spin Columns (Supelco, Bellefonte, PA). The product was cloned ... DNA sequence comparisons, designed the primers and was the contributing author. MRR performed all fish virus cell cul- ture, DNA extraction, PCR protocol development, and most of the PCR, cloning and...

Ngày tải lên: 20/06/2014, 01:20

10 463 0
Báo cáo khoa học: "A Cognitive Cost Model of Annotations Based on Eye-Tracking Data" pdf

Báo cáo khoa học: "A Cognitive Cost Model of Annotations Based on Eye-Tracking Data" pdf

... any irritation of the participants caused by constantly changing contexts. Accordingly, the participants were assigned to one of the experimental groups and corresponding context condition already ... fixations on the anno- tation phrase and context for the document condi- tion and 20 annotation examples of each complex- ity class. These results suggest that the need for context mainly depends on ... syntactic complexity. 6 t(9) = 0.27, p = 0.79 for the annotation time in the document context condition, and t(9) = 1.97, p = 0.08 for the annotation errors in the sentence context condition. 1162 4 Cognitively...

Ngày tải lên: 23/03/2014, 16:20

10 467 1
w