0

korean text documents using comparative lexical patterns

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques" doc

Báo cáo khoa học

... Singapore, 4 August 2009.c2009 ACL and AFNLPExtracting Comparative Sentences from Korean Text Documents Us-ing Comparative Lexical Patterns and Machine Learning Techniques Seon Yang Department ... 3 Extracting Comparative- sentence Candidates In this section, we define comparative keywords and extract comparative- sentence candidates by using those keywords. 3.1 Comparative keyword ... This paper proposes how to automatically identify Korean comparative sentences from text documents. This paper first investigates many comparative sentences referring to pre-vious studies...
  • 4
  • 536
  • 0
Automatic text extraction using DWT and Neural Network

Automatic text extraction using DWT and Neural Network

Kỹ thuật lập trình

... video database. However, text extraction presents a number of problems because the properties of text may vary, as well as the text sizes and the text fonts. Furthermore, texts may appear in a ... horizontal edges to obtain candidate text regions. Real Text regions are then identified using the support vector machine. Text regions usually have special texture features because they consist ... or compressed images. Text extraction from uncompressed image can be classified as either component-based or texture-based. For component-based text extraction methods, text regions are detected...
  • 5
  • 507
  • 1
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Identifying Text Polarity Using Random Walks" pptx

Báo cáo khoa học

... (2005; 2006) use a textual representation ofwords by collating all the glosses of the word asfound in some dictionary. Then, a binary text clas-sifier is trained using the textual representation ... Subjectivity analysis is the taskof identifying text that present opinions as op-posed to objective text that present factual in-formation (Wiebe, 2000). Text could be eitherwords, phrases, sentences, ... identified without consider-ing their context (Wiebe, 2000; Hatzivassiloglouand Wiebe, 2000; Banea et al., 2008). In the sec-ond category, the context of subjective text is used(Riloff and Wiebe, 2003;...
  • 9
  • 450
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Extracting Comparative Entities and Predicates from Texts Using Comparative Type Classification" pptx

Báo cáo khoa học

... EMNLP’03. Seon Yang and Youngjoong Ko. 2009. Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques. In Proceedings ... comparatives and non-comparatives by extracting only comparatives from text documents. Then we classify the comparatives into seven types. 3.1 Extracting comparative sentences from text documents Our ... in text documents, and then SVM eliminates the non -comparative sentences from the candidates. Thus, all of the sentences are divided into two classes: a comparative class and a non-comparative...
  • 9
  • 405
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "An ERP-based Brain-Computer Interface for text entry using Rapid Serial Visual Presentation and Language Modeling" ppt

Báo cáo khoa học

... interfaces (BCI). Thisparadigm is widely used to build letter-by-letter text input systems using BCI. Neverthe-less using a BCI-typewriter depending only onEEG responses will not be sufficiently ... 2011.c2011 Association for Computational LinguisticsAn ERP-based Brain-Computer Interface for text entry using Rapid Serial Visual Presentation and Language ModelingK.E. Hild◦,U. Orhan†,D. ... next letters to be typed be-come highly predictable in certain contexts, partic-ularly word-internally. In applications where text generation/typing speed is very slow, the impactof language...
  • 6
  • 551
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Learning with Unlabeled Data for Text Categorization Using Bootstrapping and Feature Projection Techniques" doc

Báo cáo khoa học

... words tend to appear in similar contexts, we can compute the similarity by using contextual information. Words and contexts play complementary roles. Contexts are similar to the extent that ... Bayes classifier using machine-labeled data as purpose and remaining contexts are assigned to each context-cluster by that revised technique. 1) Measurement of word and context similarities ... the Naive Bayes Classifier Using Context-Clusters In above section, we obtained labeled training data: context-clusters. Since training data are labeled as the context unit, we employ a Naive...
  • 8
  • 443
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Choosing the Word Most Typical in Context Using a Lexical Co-occurrence Network" ppt

Báo cáo khoa học

... most typical synonym in context, and gave a solution that relies on a generalization oflexical co-occurrence. The results show that a narrow window of training context (-t-4 words) works best ... according to Pearson's X 2 test, unless indicated. more than the surrounding context to build adequate con- textual representations. Also, the narrow window gives consistently higher ac- curacy ... Machine Translation, pages 114 121, Stanford, CA, March. Elhadad, Michael. 1992. Using Argumentation to Control Lexical Choice: A Functional Unification Implementation. Ph.D. thesis, Columbia...
  • 3
  • 345
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Evaluating Centering-based metrics of coherence for text structuring using a reliably annotated corpus" doc

Báo cáo khoa học

... Evaluating the coherence of a text and text structuringThe statistics about transitions computed asjust discussed can be used to determine the de-gree to which a text conforms with, or violates,Centering’s ... discussed in Section3, using the original ordering of texts in thegnome corpus to c ompute the average classi-fication rate of each metric.The gnome corpus contains texts from differ-ent genres, ... beuseful to drive a text planner. We then outlinea corpus-based methodology to choose amongthese metrics, estimating how well they are ex-pected to do when used by a text planner. Weconclude...
  • 8
  • 608
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Identifying Syntactic Role of Antecedent in Korean Relative Clause Using Corpus and Thesaurus Information" pdf

Báo cáo khoa học

... semantic role determination of antecedents using verbal patterns and statistic information from a corpus. These word co-occurrence pat- terns are all at lexical- level, so we have to con- struct ... tistical information at a lexical level for every pair of words, which may require a lot of space to store resulting patterns, we represent those co-occurrence patterns with concept types ... Japanese). Park, S. B. and Y. T. Kim. 1997. Semantic Role Determination in Korean Relative Clauses Using Idiomatic Patterns. In Proceedings of 17th International Conference on Computer Processing...
  • 7
  • 427
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Using Non-lexical Features to Identify Effective Indexing Terms for Biomedical Illustrations" docx

Báo cáo khoa học

... andalternative tools for mapping text to the UMLS.5 Related WorkNon -lexical features have been successful in manycontexts, particularly in the areas of genre classifi-cation and text and speech summarization.Genre ... their non -lexical features,gleaned from the article text and MetaMap output.Experimental results, presented in Section 4, in-dicate that ineffective indexing terms can be re-duced using this ... image re-trieval (ABIR). ABIR, compared to the image-1Non -lexical features describe attributes of image-related text but not the text itself, e.g., unlike a bag-of-words model.only approach...
  • 8
  • 364
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatically Evaluating Text Coherence Using Discourse Relations" docx

Báo cáo khoa học

... more coherent.The pair of texts consists of a source text and oneof its permutations (i.e., the text s sentence order israndomized). Assuming that the original text is al-ways more discourse-coherent ... transitions in short texts are few in num-ber, we have very little data to base the coherencejudgment on. However, when faced with even short text excerpts, humans can distinguish coherent textsfrom ... present in the above text: 1. Implicit Comparison between S1as Arg1, and S2as Arg22. Explicit Comparison using “but” between S2asArg1, and S3as Arg23. Explicit Temporal using “as” within...
  • 10
  • 292
  • 0
Báo cáo khoa học:

Báo cáo khoa học: " Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques" doc

Báo cáo khoa học

... Delaware.Stefan Riezler, Detlef Prescher, Jonas Kuhn, and MarkJohnson. 2000. Lexicalized Stochastic Modeling ofConstraint-Based Grammars using Log-Linear Mea-sures and EM Training. In Proceedings of the ... Association for ComputationalLinguistics (ACL’00), Hong Kong.Parsing the Wall Street Journal using a Lexical- Functional Grammar andDiscriminative Estimation TechniquesStefan Riezler Tracy H. ... this paperwas measured using both the LFG and DR metrics,thanks to an f-structure-to-DR annotation mapping.Performance on the DR-annotated Brown test setwas only measured using the DR metric.The...
  • 8
  • 477
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Generating image descriptions using dependency relational patterns" pptx

Báo cáo khoa học

... top 30 relatedweb -documents for each image using the Yahoo!search engine and the toponym associated with theimage as a query. The text from these documents was extracted using an HTML parser ... generatecaptions based on the immediate textual context ofthe image with or without consideration of imagerelated features such as colour, shape or texture(Deschacht and Moens, 2007; Mori ... assigned todependency patterns was lower than that assignedto other features. The small contribution of the de-pendency patterns may have been due to the smallnumber of documents they used to...
  • 9
  • 362
  • 0

Xem thêm

Tìm thêm: hệ việt nam nhật bản và sức hấp dẫn của tiếng nhật tại việt nam xác định các mục tiêu của chương trình xác định các nguyên tắc biên soạn khảo sát các chuẩn giảng dạy tiếng nhật từ góc độ lí thuyết và thực tiễn khảo sát chương trình đào tạo của các đơn vị đào tạo tại nhật bản khảo sát chương trình đào tạo gắn với các giáo trình cụ thể tiến hành xây dựng chương trình đào tạo dành cho đối tượng không chuyên ngữ tại việt nam điều tra đối với đối tượng giảng viên và đối tượng quản lí khảo sát thực tế giảng dạy tiếng nhật không chuyên ngữ tại việt nam khảo sát các chương trình đào tạo theo những bộ giáo trình tiêu biểu nội dung cụ thể cho từng kĩ năng ở từng cấp độ xác định mức độ đáp ứng về văn hoá và chuyên môn trong ct mở máy động cơ lồng sóc đặc tuyến hiệu suất h fi p2 đặc tuyến mômen quay m fi p2 đặc tuyến tốc độ rôto n fi p2 đặc tuyến dòng điện stato i1 fi p2 thông tin liên lạc và các dịch vụ từ bảng 3 1 ta thấy ngoài hai thành phần chủ yếu và chiếm tỷ lệ cao nhất là tinh bột và cacbonhydrat trong hạt gạo tẻ còn chứa đường cellulose hemicellulose chỉ tiêu chất lượng theo chất lượng phẩm chất sản phẩm khô từ gạo của bộ y tế năm 2008