korean text documents using comparative lexical patterns

Tài liệu Báo cáo khoa học: "Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques" doc

... Singapore, 4 August 2009. c 2009 ACL and AFNLP Extracting Comparative Sentences from Korean Text Documents Us- ing Comparative Lexical Patterns and Machine Learning Techniques Seon Yang Department ... 3 Extracting Comparative- sentence Candidates In this section, we define comparative keywords and extract comparative- sentence candidates by using those keywords. 3.1 Comparative keyword ... This paper proposes how to automatically identify Korean comparative sentences from text documents. This paper first investigates many comparative sentences referring to pre- vious studies...

Ngày tải lên: 20/02/2014, 09:20

4 536 0

Automatic text extraction using DWT and Neural Network

... video database. However, text extraction presents a number of problems because the properties of text may vary, as well as the text sizes and the text fonts. Furthermore, texts may appear in a ... horizontal edges to obtain candidate text regions. Real Text regions are then identified using the support vector machine. Text regions usually have special texture features because they consist ... or compressed images. Text extraction from uncompressed image can be classified as either component-based or texture-based. For component-based text extraction methods, text regions are detected...

Ngày tải lên: 05/11/2012, 14:51

5 508 1

Tài liệu Báo cáo khoa học: "Identifying Text Polarity Using Random Walks" pptx

... (2005; 2006) use a textual representation of words by collating all the glosses of the word as found in some dictionary. Then, a binary text clas- siﬁer is trained using the textual representation ... Subjectivity analysis is the task of identifying text that present opinions as op- posed to objective text that present factual information (Wiebe, 2000). Text could be either words, phrases, sentences, ... identiﬁed without consider- ing their context (Wiebe, 2000; Hatzivassiloglou and Wiebe, 2000; Banea et al., 2008). In the sec- ond category, the context of subjective text is used (Riloff and Wiebe, 2003;...

Ngày tải lên: 20/02/2014, 04:20

9 450 0

Tài liệu Báo cáo khoa học: "Extracting Comparative Entities and Predicates from Texts Using Comparative Type Classification" pptx

... EMNLP’03. Seon Yang and Youngjoong Ko. 2009. Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques. In Proceedings ... comparatives and non-comparatives by extracting only comparatives from text documents. Then we classify the comparatives into seven types. 3.1 Extracting comparative sentences from text documents Our ... in text documents, and then SVM eliminates the non -comparative sentences from the candidates. Thus, all of the sentences are divided into two classes: a comparative class and a non-comparative...

Ngày tải lên: 20/02/2014, 04:20

9 405 0

Tài liệu Báo cáo khoa học: "An ERP-based Brain-Computer Interface for text entry using Rapid Serial Visual Presentation and Language Modeling" ppt

... interfaces (BCI). This paradigm is widely used to build letter-by- letter text input systems using BCI. Neverthe- less using a BCI-typewriter depending only on EEG responses will not be sufﬁciently ... 2011. c 2011 Association for Computational Linguistics An ERP-based Brain-Computer Interface for text entry using Rapid Serial Visual Presentation and Language Modeling K.E. Hild ◦ , U. Orhan † , D. ... next letters to be typed be- come highly predictable in certain contexts, particularly word-internally. In applications where text generation/typing speed is very slow, the impact of language...

Ngày tải lên: 20/02/2014, 05:20

6 551 0

Tài liệu Báo cáo khoa học: "Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection Techniques" doc

... words tend to appear in similar contexts, we can compute the similarity by using contextual information. Words and contexts play complementary roles. Contexts are similar to the extent that ... Bayes classifier using machine-labeled data as purpose and remaining contexts are assigned to each context-cluster by that revised technique. 1) Measurement of word and context similarities ... the Naive Bayes Classifier Using Context-Clusters In above section, we obtained labeled training data: context-clusters. Since training data are labeled as the context unit, we employ a Naive...

Ngày tải lên: 20/02/2014, 16:20

8 444 0

Tài liệu Báo cáo khoa học: "Choosing the Word Most Typical in Context Using a Lexical Co-occurrence Network" ppt

... most typical synonym in context, and gave a solution that relies on a generalization oflexical co-occurrence. The results show that a narrow window of training context (-t-4 words) works best ... according to Pearson's X 2 test, unless indicated. more than the surrounding context to build adequate contextual representations. Also, the narrow window gives consistently higher ac- curacy ... Machine Translation, pages 114 121, Stanford, CA, March. Elhadad, Michael. 1992. Using Argumentation to Control Lexical Choice: A Functional Unification Implementation. Ph.D. thesis, Columbia...

Ngày tải lên: 22/02/2014, 03:20

3 345 0

Báo cáo khoa học: "Evaluating Centering-based metrics of coherence for text structuring using a reliably annotated corpus" doc

... Evaluating the coherence of a text and text structuring The statistics about transitions computed as just discussed can be used to determine the de- gree to which a text conforms with, or violates, Centering’s ... discussed in Section 3, using the original ordering of texts in the gnome corpus to c ompute the average classi- ﬁcation rate of each metric. The gnome corpus contains texts from diﬀer- ent genres, ... be useful to drive a text planner. We then outline a corpus-based methodology to choose among these metrics, estimating how well they are ex- pected to do when used by a text planner. We conclude...

Ngày tải lên: 17/03/2014, 06:20

8 608 0

Báo cáo khoa học: "Identifying Syntactic Role of Antecedent in Korean Relative Clause Using Corpus and Thesaurus Information" pdf

... semantic role determination of antecedents using verbal patterns and statistic information from a corpus. These word co-occurrence patterns are all at lexical- level, so we have to con- struct ... tistical information at a lexical level for every pair of words, which may require a lot of space to store resulting patterns, we represent those co-occurrence patterns with concept types ... Japanese). Park, S. B. and Y. T. Kim. 1997. Semantic Role Determination in Korean Relative Clauses Using Idiomatic Patterns. In Proceedings of 17th International Conference on Computer Processing...

Ngày tải lên: 17/03/2014, 07:20

7 427 0

Báo cáo khoa học: "Using Non-lexical Features to Identify Effective Indexing Terms for Biomedical Illustrations" docx

... and alternative tools for mapping text to the UMLS. 5 Related Work Non -lexical features have been successful in many contexts, particularly in the areas of genre classiﬁ- cation and text and speech summarization. Genre ... their non -lexical features, gleaned from the article text and MetaMap output. Experimental results, presented in Section 4, in- dicate that ineffective indexing terms can be re- duced using this ... image re- trieval (ABIR). ABIR, compared to the image- 1 Non -lexical features describe attributes of image-related text but not the text itself, e.g., unlike a bag-of-words model. only approach...

Ngày tải lên: 17/03/2014, 22:20

8 364 0

Báo cáo khoa học: "Automatically Evaluating Text Coherence Using Discourse Relations" docx

... more coherent. The pair of texts consists of a source text and one of its permutations (i.e., the text s sentence order is randomized). Assuming that the original text is al- ways more discourse-coherent ... transitions in short texts are few in number, we have very little data to base the coherence judgment on. However, when faced with even short text excerpts, humans can distinguish coherent texts from ... present in the above text: 1. Implicit Comparison between S 1 as Arg1, and S 2 as Arg2 2. Explicit Comparison using “but” between S 2 as Arg1, and S 3 as Arg2 3. Explicit Temporal using “as” within...

Ngày tải lên: 23/03/2014, 16:20

10 292 0

Báo cáo khoa học: " Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques" doc

... Delaware. Stefan Riezler, Detlef Prescher, Jonas Kuhn, and Mark Johnson. 2000. Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Mea- sures and EM Training. In Proceedings of the ... Association for Computational Linguistics (ACL’00), Hong Kong. Parsing the Wall Street Journal using a Lexical- Functional Grammar and Discriminative Estimation Techniques Stefan Riezler Tracy H. ... this paper was measured using both the LFG and DR metrics, thanks to an f-structure-to-DR annotation mapping. Performance on the DR-annotated Brown test set was only measured using the DR metric. The...

Ngày tải lên: 23/03/2014, 20:20

8 477 0

Báo cáo khoa học: "Generating image descriptions using dependency relational patterns" pptx

... top 30 related web -documents for each image using the Yahoo! search engine and the toponym associated with the image as a query. The text from these documents was extracted using an HTML parser ... generate captions based on the immediate textual context of the image with or without consideration of image related features such as colour, shape or texture (Deschacht and Moens, 2007; Mori ... assigned to dependency patterns was lower than that assigned to other features. The small contribution of the dependency patterns may have been due to the small number of documents they used to...

Ngày tải lên: 30/03/2014, 21:20

9 362 0