text categorization using bootstrapping

Tài liệu Báo cáo khoa học: "Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection Techniques" doc

Tài liệu Báo cáo khoa học: "Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection Techniques" doc

... preprocessing is a set of context vectors that are represented as content words of each context. Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection ... our method is used in a text categorization task, building text categorization systems will become significantly faster and less expensive. 1 Introduction Text categorization is the task ... words tend to appear in similar contexts, we can compute the similarity by using contextual information. Words and contexts play complementary roles. Contexts are similar to the extent that...

Ngày tải lên: 20/02/2014, 16:20

8 444 0
Automatic text extraction using DWT and Neural Network

Automatic text extraction using DWT and Neural Network

... video database. However, text extraction presents a number of problems because the properties of text may vary, as well as the text sizes and the text fonts. Furthermore, texts may appear in a ... horizontal edges to obtain candidate text regions. Real Text regions are then identified using the support vector machine. Text regions usually have special texture features because they consist ... or compressed images. Text extraction from uncompressed image can be classified as either component-based or texture-based. For component-based text extraction methods, text regions are detected...

Ngày tải lên: 05/11/2012, 14:51

5 508 1
Tài liệu Word Segmentation for Vietnamese Text Categorization: An online corpus approach pptx

Tài liệu Word Segmentation for Vietnamese Text Categorization: An online corpus approach pptx

... make a preliminary text categorization experiment to examine further our approach. We only use MI3 formula in word segmentation step for the next experiment. B. Text Categorization Experiment ... approaches performing text categorization task. Nevertheless, the best performance approach for English may not be the best one for Vietnamese. To find the most appropriate text categorization approach ... Approaches to Text Categorization. Journal of Information Retrieval, Vol 1, No. 1/2, pp 67—88. [17] Yiming Yang, C.G. Chute. 1994. An example-based mapping method for text categorization...

Ngày tải lên: 12/12/2013, 11:15

6 742 1
Tài liệu Báo cáo khoa học: "Identifying Text Polarity Using Random Walks" pptx

Tài liệu Báo cáo khoa học: "Identifying Text Polarity Using Random Walks" pptx

... (2005; 2006) use a textual representation of words by collating all the glosses of the word as found in some dictionary. Then, a binary text clas- sifier is trained using the textual representation ... Subjectivity analysis is the task of identifying text that present opinions as op- posed to objective text that present factual in- formation (Wiebe, 2000). Text could be either words, phrases, sentences, ... identified without consider- ing their context (Wiebe, 2000; Hatzivassiloglou and Wiebe, 2000; Banea et al., 2008). In the sec- ond category, the context of subjective text is used (Riloff and Wiebe, 2003;...

Ngày tải lên: 20/02/2014, 04:20

9 450 0
Tài liệu Báo cáo khoa học: "An ERP-based Brain-Computer Interface for text entry using Rapid Serial Visual Presentation and Language Modeling" ppt

Tài liệu Báo cáo khoa học: "An ERP-based Brain-Computer Interface for text entry using Rapid Serial Visual Presentation and Language Modeling" ppt

... interfaces (BCI). This paradigm is widely used to build letter-by- letter text input systems using BCI. Neverthe- less using a BCI-typewriter depending only on EEG responses will not be sufficiently ... 2011. c 2011 Association for Computational Linguistics An ERP-based Brain-Computer Interface for text entry using Rapid Serial Visual Presentation and Language Modeling K.E. Hild ◦ , U. Orhan † , D. ... next letters to be typed be- come highly predictable in certain contexts, partic- ularly word-internally. In applications where text generation/typing speed is very slow, the impact of language...

Ngày tải lên: 20/02/2014, 05:20

6 551 0
Tài liệu Báo cáo khoa học: "Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques" doc

Tài liệu Báo cáo khoa học: "Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques" doc

... In this section, we define comparative keywords and extract comparative-sentence candidates by using those keywords. 3.1 Comparative keyword First of all, we classify comparative sentences ... 153–156, Suntec, Singapore, 4 August 2009. c 2009 ACL and AFNLP Extracting Comparative Sentences from Korean Text Documents Us- ing Comparative Lexical Patterns and Machine Learning Techniques Seon Yang ... Abstract This paper proposes how to automatically identify Korean comparative sentences from text documents. This paper first investigates many comparative sentences referring to pre- vious...

Ngày tải lên: 20/02/2014, 09:20

4 536 0
Tài liệu Báo cáo khoa học: "Fragments and Text Categorization" pptx

Tài liệu Báo cáo khoa học: "Fragments and Text Categorization" pptx

... of text categoriza- tion. For the Na¨ıve Bayes classifier this increase is significant. 1 Motivation In the process of automatic classifying documents into several predefined classes – text categorization (Sebastiani, ... which can cause con- fusion. However, these statements are yet to be ver- ified. Fragments and Text Categorization Jan Bla ˇ t ´ ak and Eva Mr ´ akov ´ a and Lubo ˇ s Popel ´ ınsk ´ y Knowledge ... – text documents are usually seen as sets or bags of all the words that have appeared in a document, maybe after removing words in a stop-list. In this paper we describe a novel approach to text...

Ngày tải lên: 20/02/2014, 16:20

4 360 0
Báo cáo khoa học: "A Study on Automatically Extracted Keywords in Text Categorization" doc

Báo cáo khoa học: "A Study on Automatically Extracted Keywords in Text Categorization" doc

... be state-of-the-art. 3 Text Categorization Experiments This section describes in detail the four experi- mental settings for the text categorization exper- iments. 3.1 Corpus For the text categorization ... improve automatic text categorization. We investigate what impact keywords have on the task by predicting text categories on the basis of keywords only, and by combining full -text repre- sentations ... improve text categorization. In summary we show that a higher perfor- mance — as measured by micro-averaged F-measure on a standard text categoriza- tion collection — is achieved when the full-text...

Ngày tải lên: 08/03/2014, 02:21

8 496 0
Báo cáo khoa học: "A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization" potx

Báo cáo khoa học: "A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization" potx

... 2000). Few similar comparative studies have been re- ported for Text Categorization (Li et al., 2003) so far in literature. Text categorization and Information Retrieval are tasks that sometimes ... Features to Improve Text Categorization Effectiveness, Journal of Intelligent Systems, Spe- cial Issue. Dejun Xue, Maosong Sun. 2003b. A Study on Feature Weighting in Chinese Text Categorization, ... acts as a pre-requisite step in most text information proc- essing tasks such as Information Retrieval (Baeza-Yates and Ribeiro-Neto, 1999) and Text Categorization (Sebastiani, 2002). It is...

Ngày tải lên: 08/03/2014, 02:21

8 493 0
Báo cáo khoa học: "Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization" potx

Báo cáo khoa học: "Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization" potx

... Strapparava. 2005. Cross language text categorization by acquiring multilingual domain models from comparable corpora. In Proc. of the ACL Workshop on Building and Using Parallel Texts (in conjunction of ... solu- tion for the Cross-Language Text Categorization task. In particular, when bilingual dictionar- ies/repositories are available, the performance of the categorization gets close to that of ... for the other terms in the lexicons. We evaluate the performance of the cross-lingual text categorization, using both the BoW Kernel and the Multilingual Domain Kernel, observing that also in...

Ngày tải lên: 17/03/2014, 04:20

8 361 0

Bạn có muốn tìm thêm với từ khóa:

w