Báo cáo khoa học: "Combining Association Measures for Collocation Extraction" potx
... context of collocation extraction, lexical as- sociation measures are formulas determining the degree of association between collocation com- ponents. They compute an association score for each collocation ... 651–658, Sydney, July 2006. c 2006 Association for Computational Linguistics Combining Association Measures for Collocation Extraction Pavel Pecina and Pavel...
Ngày tải lên: 23/03/2014, 18:20
... 92.35% (λ=10, δ=18) Table 4. Performance of Link Detection of Named Entities 1013 (a) (b) Figure 2. (a) Performance of association matrix strategy. (b) Performance of scalar association matrix strategy ... community. This paper measures the association of terms using snippets returned by web search. A web search with double checking model is proposed to get the statistics...
Ngày tải lên: 17/03/2014, 04:20
... of the Association for Computational Linguistics, pages 760–769, Uppsala, Sweden, 11-16 July 2010. c 2010 Association for Computational Linguistics Metadata-Aware Measures for Answer Summarization in ... Bhaowal. 2006. Investigation into trust for collaborative information repositories: A wikipedia case study. In In Proceedings of the Workshop on Models of Trust for the Web,...
Ngày tải lên: 30/03/2014, 21:20
Tài liệu Báo cáo khoa học: "Incorporating Context Information for the Extraction of Terms" pdf
... 1996), incorporating information gained from the textual context of the candidate term. 2 Context information for terms The idea of incorporating context information for term extraction came ... product. Since context carries information about terms it should be involved in the procedure for their ex- traction. We incorporate context information in the form of weights construc...
Ngày tải lên: 22/02/2014, 03:20
Báo cáo khoa học: "A Sentiment Analyzer for Micro-blogs" potx
... words used for lookup. The prediction for a tweet uses majority vote-based approach as for version 1. The optimal POS bi-tags have been derived experimen- tally by using top 10% features on information ... is considered for in- version. The two versions of C-Feel-It vary in their Lexicon-based Sentiment Predictor. Figure 3 shows the Lexicon-based Sentiment Predictor for version 1....
Ngày tải lên: 17/03/2014, 00:20
Báo cáo khoa học: "Confidence Measure for Word Alignment" potx
... C(A) is the confidence score of the align- ment A as defined in formula 1. This formula computes the sum of the alignment confidence scores for the alignments containing a ij , which is 934 Figure 3: ... by 1.5 points (from 69.3 to 70.8), 1.8 point improvement for content words and 1.0 point for function words. It also significantly outperforms the traditionally used heuristics, ”intersecti...
Ngày tải lên: 17/03/2014, 01:20
Báo cáo khoa học: "Dependency Tree Kernels for Relation Extraction" pot
... data source (e.g. a col- lection of news articles). For example, current web search engines would not perform well on the query, “list all California-based CEOs who have social ties with a United ... effectively provide such a list. The goal of Information Extraction (IE) is to dis- cover relevant segments of information in a data stream that will be useful for structuring the data. In th...
Ngày tải lên: 17/03/2014, 06:20
Báo cáo khoa học: "A MORPHOLOGICAL PROCESSOR FOR MODERN GREEK" potx
... separate dictionary entries for affixes because each affix is a model on its own. Therefore, information associated with an affix model must cover all unpredictable information listed within ... correspon- dences between the form of entries listed in the dictionary and the form they develop when they are combined in sequences of morphemes. These files are used both for analysis an...
Ngày tải lên: 18/03/2014, 02:20
Báo cáo khoa học: "Unsupervised Multilingual Learning for Morphological Segmentation" potx
... HLT, pages 737–745, Columbus, Ohio, USA, June 2008. c 2008 Association for Computational Linguistics Unsupervised Multilingual Learning for Morphological Segmentation Benjamin Snyder and Regina ... (the basic units of meaning). For example, the English word misunderstanding would be segmented into mis - understand - ing. This task is an informative testbed for our exploration, as s...
Ngày tải lên: 31/03/2014, 00:20
Báo cáo khoa học: "Adaptive Language Modeling for Word Prediction" potx
... interpolation, we must also choose whether smoothing (a prereq- uisite for backoff) is performed before or after the interpolation. If we smooth before the interpolation, then the frequencies will be overly ... metric for other applications, so we evaluated two other topical similarity scores: Jacquard’s coef- ficient, which performed better than most other sim- ilarity measures in a dif...
Ngày tải lên: 31/03/2014, 00:20