automatic extraction of subsegmental primes

Document: Scientific report: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

... Statistics of the BioScope corpus. The 2nd and 3rd columns show the total number of cues within the datasets; the 4th and 5th columns show the percentage of negated and speculative sentences. 70% of ... neighboring identical siblings of type *scope* or * are replaced by a single node of the corresponding type. Figure 3 shows an example of this transformation. (a) The children of nodes JJ/NN/NN are pruned ... children (starting from the root of the subtree) to the rule pattern subtree. Nodes of type *scope* and * match any number of nodes, similar to the semantics of Regex Kleene star (*). 5 Results As...

Uploaded: 20/02/2014, 04:20

5 544 1
Document: Automatic Management of Network Security Policy pptx

... actual enforcement of these policies. Many of our design decisions were influenced by the lack of verifiable enforcement mechanisms for certain security phenomena. An example of this is Denial of Service ... provide required by policies. Often it suffices to reason about connectivity to analyze availability of services to groups of users. For instance, instead of modeling all the details of a file server, we ... environment. This generally leads either to over- or under-management of resources. One of the specific goals of this work is management of security configurations in networks that span multiple administrative domains...

Uploaded: 14/02/2014, 16:20

15 467 0
Document: Scientific report: "Cross-Domain Co-Extraction of Sentiment and Topic Lexicons" pdf

... values of r in the “product vs. movie” task. Observe that for sentiment word extraction, the results of the proposed methods are not sensitive to the values of r. While for the topic word extraction, ... and Ryan McDonald. 2008. A joint model of text and aspect ratings for sentiment summarization. In Proceedings of the 46th Annual Meeting of the Association of Computational Linguistics: Human ... study the effect of different parameter settings. There are several parameters in the framework: the number of generated seeds r, the number of new candidates k 2 and the number of selections k...

Uploaded: 19/02/2014, 19:20

10 447 0
Document: Scientific report: "Robust Extraction of Named Entity Including Unfamiliar Word" doc

... Extraction of Japanese Named Entity 2.1 Task of the IREX Workshop The task of NE extraction of the IREX workshop (Sekine and Eriguchi, 2000) is to recognize eight NE types in Table 1. The organizer of ... features of original morphemes and features of similar morphemes. The experiments of extracting Japanese NEs from IREX corpus and NHK corpus show the effectiveness of the proposed method. 2 Extraction ... 2003; Nakano and Hirai, 2004) formalized the task of extracting NEs as a chunking problem of a sequence of characters instead of a sequence of morphemes. In this paper, we keep the naive formalization,...

Uploaded: 20/02/2014, 09:20

4 384 1
Document: Scientific report: "Automatic learning of textual entailments with cross-pair similarities" ppt

... examples of the previous section. From the point of view of bag-of-word methods, the pairs (T1, H1) and (T1, H2) have both the same intra-pair similarity since the sentences of T1 and ... head of constituents. The example of Fig. 1 shows that the placeholder 0 climbs up to the node governing all the NPs. 5.3 Pruning irrelevant information in large text trees Often only a portion of ... t, the set of its nodes N(t), and a set of anchors, we build a tree t' with all the nodes N' that are anchors or ancestors of any anchor. Moreover, we add to t' the leaf nodes of the original...

Uploaded: 20/02/2014, 12:20

8 413 0
Document: Scientific report: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

... the polarity of words There are some works that discuss learning the polarity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity of adjectives ... are not in the resources. (1) kono software-no riten-ha hayaku ugoku koto this software-POST advantage-POST quickly run to The advantage of this software is to run quickly. (2) ketten-ha jikan-ga ... polarity of each sentence. This is similar to the extraction from the itemization. 4.3 Extraction based on linguistic pattern The third method uses linguistic pattern. The characteristic of this...

Uploaded: 20/02/2014, 12:20

8 409 0
Document: Scientific report: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

... examples of sentences that our system identified as reasons of complaints. (1) Unfortunately, I find that I am no longer comfortable in your establishment because of the unprofessional, ... Sources of Opinions with Conditional Random Fields and Extraction Patterns. Proceedings of HLT/EMNLP-05. Esuli, Andrea and Fabrizio Sebastiani. 2005. Determining the semantic orientation of ... Orientation of Adjectives. Proceedings of 35th Annual Meeting of the Assoc. for Computational Linguistics (ACL-97): 174-181 Hatzivassiloglou, Vasileios and Janyce Wiebe. 2000. Effects of Adjective...

Uploaded: 20/02/2014, 12:20

8 461 1
Document: Scientific report: "Automatic Evaluation of Sentence-Level Fluency" (Andrew Mutton) pdf

... grammar, rhythm and flow, appropriateness of tone, and several other specific characteristics of good text. In terms of automatic evaluation, we are not aware of any technique that measures only fluency ... Methods PoStag In the first of these, we constructed a rough approximation of typical sentence grammar structure by taking bigrams over part-of-speech tags. Then, given a string of PoS tags of length n, t 1 . ... can be fooled by the method of sentence generation; GLEU, however, gives a consistent estimate of fluency regardless of generation type; and, across all types of generated sentences examined...

Uploaded: 20/02/2014, 12:20

8 508 0
Document: Scientific report: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

... construction of N-best translation lexicons from parallel text. Melamed (1995) used the ratio (LCSR) between the length of the LCS of two words and the length of the longer word of the two ... present the evaluations of ROUGE-L, ROUGE-S, and compare their performance with other automatic evaluation measures. 5 Evaluations One of the goals of developing automatic evaluation ... Proceedings of COLING-92, Nantes, France. Thompson, H. S. 1991. Automatic Evaluation of Translation Quality: Outline of Methodology and Report on Pilot Experiment. In Proceedings of the Evaluator’s...

Uploaded: 20/02/2014, 16:20

8 443 0
Document: Scientific report: "Automatic clustering of collocation for detecting practical sense boundary" ppt

... 1 shows the average number of clusters with each clustering method shown chapter 3 by the part of speech. WC and WF are the average number of senses by the part of speech. In Table 1 and ... the word senses numbered i of the word x. I x is the word sense indexing function of x that gives an index to each sense of the word x. All contextual words x i ±j of a central word x have ... V N C Æ 2P C/V . In this formula, V means a set of vocabulary, N is the size of the contextual window that is an integer, and C means a set of corpus. In this paper, vocabulary refers to all...

Uploaded: 20/02/2014, 16:20

4 425 0
Document: Scientific report: "Automatic Collection of Related Terms from the Web" pptx

... (20%) out of 210 terms were collected by the system. This low recall primarily comes from the failure of automatic term recognition (case A in the above classification). Improvement of this ... collected 610 terms in total; the average number of output terms per input is 12.2 terms. We checked whether each of the 610 terms is a correct related term of the original seed term by hand. The result ... issue: Japanese term extraction. Terminology, 6(2). Kyo Kageura and Bin Umino. 1996. Methods of automatic term recognition: A review. Terminology, 3(2):259–289. Hiroshi Nakagawa. 2000. Automatic term...

Uploaded: 20/02/2014, 16:20

4 437 0
Document: Scientific report: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

... The completeness of the output list increases monotonically with the total number of occurrences of each verb in the corpus. False positive rates are one to three percent of observations. ... architecture of the system, and that of this paper, directly reflects the three challenges described above. The system consists of three modules: 1. Verb detection: Finds some occurrences of verbs ... is evaluated in terms of efficiency and accuracy. The most useful estimate of efficiency is simply the density of observations in the corpus, shown in the first column of Table 3. The SF...

Uploaded: 20/02/2014, 21:20

6 416 0
Document: Scientific report: "ON THE AUTOMATIC TRANSFORMATION OF CLASS MEMBERSHIP CRITERIA" docx

... by means of a process of instantiation of the definition: the translation of the definition from a set of criteria for satisfying the definition into an exemplary instance of the concept ... components of the definition of the class are also present in the description of the instance. This also permits easy representation of modifications to the definition, whenever the capability of ... application discussed here (the assignment of an instance of a knowledge structure to one of a set of classes), inexact matching and close relatives thereof are also found in several other domains...

Uploaded: 21/02/2014, 20:20

6 366 0
Document: Scientific report: "Automatic Detection of Nonreferential It in Spoken Multi-Party Dialog" doc

... recall of 55.1%, a precision of 71.9% and a resulting F-measure of 62.4% for the detection of the class nonreferential. The overall classification accuracy was 75.1%. The advantage of using ... a minority of all instances of it. Evans (2001) reports that his corpus of approx. 370.000 words from the SUSANNE corpus and the BNC contains 3.171 examples of it, approx. 29% of which are ... variety of genres. They count 2.337 instances of it, 646 of which (28%) are nonreferential. Finally, Clemente et al. (2004) report that in the GENIA corpus of medical abstracts the percentage of...

Uploaded: 22/02/2014, 02:20

8 436 0
Document: Scientific report: "Incorporating Context Information for the Extraction of Terms" pdf

... sum of the context weights for a, N the size of the corpus in terms of number of words. 3 Future work Our future work involves 1. The investigation of the context used for the evaluation of ... University of Manchester Institute of Science and Technology. Didier Bourigault. 1992. Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases. In Proceedings of the ... of the European Chapter of the Association for Computational Linguistics, EACL-94, pages 34-40. Béatrice Daille, Éric Gaussier and Jean-Marc Langé. 1994. Towards Automatic Extraction of...

Uploaded: 22/02/2014, 03:20

3 370 0
