teaching a weaker classifier

Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

Ngày tải lên : 17/03/2014, 08:20
... easily applicable. This way of teaching a weaker classifier can also be used in other domains, where the task is to in- fer , and an abundance of unlabeled data is available. If one possesses a ... sets are conditionally independent of each other. Each set of features can be used to build a classifier, resulting in two independent classifiers, A and B. Classifications by A on unlabeled data can ... other tasks such as part-of-speech tag- ging, where case information is helpful. With the abundance of unlabeled text available, such an ap- proach requires no additional annotation effort, and hence...
  • 8
  • 285
  • 0
Báo cáo khoa học: "A Scalable Probabilistic Classifier for Language Modeling" pdf

Báo cáo khoa học: "A Scalable Probabilistic Classifier for Language Modeling" pdf

Ngày tải lên : 07/03/2014, 22:20
... Ducharme, P. Vincent, and C. Jauvin. 2003. A Neural Probabilistic Language Model. Journal of Machine Learning Research, 3:1137–1155. A. Berger, V. Della Pietra, and S. Della Pietra. 1996. A Maximum ... Categorization Research. Journal of Machine Learning Research, 5:361–397. A. Mnih and G. Hinton. 2008. A Scalable Hierarchical Distributed Language Model. In Advances in Neural Information Processing ... Discrimi- native n-gram Language Modeling. Computer, Speech and Language, 21:373–392. R. Rosenfeld. 1994. Adaptive Statistical Language Mod- elling: A Maximum Entropy Approach. Ph.D. thesis, Carnegie...
  • 6
  • 350
  • 0
Báo cáo khoa học: "A Chain-starting Classifier of Definite NPs in Spanish" docx

Báo cáo khoa học: "A Chain-starting Classifier of Definite NPs in Spanish" docx

Ngày tải lên : 08/03/2014, 21:20
... a cate- gory that, although it did originally have a seman- tic meaning of “identifiability”, has increased its range of contexts so that it is often a grammati- cal rather than a semantic category ... AnCora – Annotated Corpora for Spanish and Catalan (Taule et al., 2008), developed at the University of Barcelona and freely available from http: //clic.ub.edu/ancora. AnCora-Es is a half-million-word ... mentions. Given that chain starting is the majority class and following (Ng and Cardie, 2002), we took the “one class” classification as a naive baseline: all instances were classified as chain starting,...
  • 8
  • 322
  • 0
Báo cáo khoa học: "A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation" potx

Báo cáo khoa học: "A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation" potx

Ngày tải lên : 23/03/2014, 16:20
... it by far the largest hand-annotated compound noun dataset in existence that we are aware of. Proper nouns were not included. The next largest available datasets have a vari- ety of drawbacks for ... interpreta- tion in general text. Kim and Baldwin’s (2005) dataset is the second largest available dataset, but inter-annotator agreement was only 52.3%, and the annotations had an usually lopsided ... Linguistics. Berger, A. , S. A. Della Pietra, and V. J. Della Pietra. 1996. A Maximum Entropy Approach to Natural Language Processing. Computational Linguistics 22:39-71. Brants, T. and A. Franz. 2006....
  • 10
  • 475
  • 0
UNIVERSITY TEACHER’S CONCEPTUALIZATION OF TASK-BASED TEACHING: A CASE study IN taybac university

UNIVERSITY TEACHER’S CONCEPTUALIZATION OF TASK-BASED TEACHING: A CASE study IN taybac university

Ngày tải lên : 07/11/2012, 15:01
... Communicative Approach and the Natural Approach are based on this view. The interactional view sees language primarily as a means for establishing and maintaining interpersonal relations and for ... are available only in target language, and the necessary materials can only be obtained if they ask in target language, such activities stimulate a natural need to understand and use it. Many ... towards, task-based language teaching? 2. To what extent do their conceptualizations match the composite view of task- based language teaching? 3. How do they implement task-based language teaching...
  • 105
  • 568
  • 1
Tài liệu Báo cáo khoa học: Structural insights into the substrate specificity and activity of ervatamins, the papain-like cysteine proteases from a tropical plant, Ervatamia coronaria ppt

Tài liệu Báo cáo khoa học: Structural insights into the substrate specificity and activity of ervatamins, the papain-like cysteine proteases from a tropical plant, Ervatamia coronaria ppt

Ngày tải lên : 18/02/2014, 16:20
... coronaria Raka Ghosh, Sibani Chakraborty, Chandana Chakrabarti, Jiban Kanti Dattagupta and Sampa Biswas Crystallography and Molecular Biology Division, Saha Institute of Nuclear Physics, Kolkata, ... acid at a particular position for this family of plant cysteine pro- teases. The primers used were 5¢-TTGCCTGAGCA TGTT GATTGGAGAGCGA AAG-3 ¢ (forward) and 5¢-GGGAT AATAAGGTAATCTAGTGATTCCAC-3¢ ... S, Sundd M, Jagan- nadham MV & Dattagupta JK (1999) Crystallization and preliminary X-ray analysis of ervatamin B and C, two thiol proteases from Ervatamia coronaria. Acta Crystallogr D 55,...
  • 14
  • 634
  • 0
Tài liệu Báo cáo khoa học: "Creating Robust Supervised Classifiers via Web-Scale N-gram Data" pdf

Tài liệu Báo cáo khoa học: "Creating Robust Supervised Classifiers via Web-Scale N-gram Data" pdf

Ngày tải lên : 20/02/2014, 04:20
... Workshop on Natural Language Generation. Natalia N. Modjeska, Katja Markert, and Malvina Nis- sim. 2003. Using the Web in machine learning for other-anaphora resolution. In EMNLP. Preslav Nakov and Marti ... bedrag- gled 56-year-old [professor]. Also, in a particu- lar domain, words may have a non-standard usage. Systems trained on labeled data can learn the do- main usage and leverage other regularities, ... Linking Biological Lit- erature, Ontologies and Databases. Mirella Lapata and Frank Keller. 2005. Web-based models for natural language processing. ACM Transactions on Speech and Language Processing, 2(1):1–31. Mark...
  • 10
  • 359
  • 0
Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output" doc

Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output" doc

Ngày tải lên : 07/03/2014, 22:20
... machine translation evaluation. Machine Translation, 17(1):43–75. Masaki Murata, Kiyotaka Uchimoto, Qing Ma, Toshiyuki Kanamaru, and Hitoshi Isahara. 2005. Analysis of machine translation systems’ ... graphical tool for performing human error analysis, from any MT system and for any language pair. BLAST has a graphical user interface, and is designed to be easy 1 The BiLingual Annotation/Annotator/Analysis ... annotations. BLAST can handle two types of annotations: er- ror annotations and support annotations. Error an- notations are based on a hierarchical error typology, and are used to annotate errors...
  • 6
  • 479
  • 0
Báo cáo khoa học: "Towards a Semantic Classification of Spanish Verbs Based on Subcategorisation Information" doc

Báo cáo khoa học: "Towards a Semantic Classification of Spanish Verbs Based on Subcategorisation Information" doc

Ngày tải lên : 08/03/2014, 04:22
... in Data - An Introduction to Cluster Analysis. Probability and Mathematical Statistics. Jonh Wiley and Sons, Inc., New York. Anna Korhonen. 200 2a. Semantically motivated subcategorization acquisition. ... noisy fea- tures. In Proceedings of the Seventh Conference on Natural Language Learning (CoNLL-2003), page , Edmonton/Canada. Gloria V´azquez, Ana Fern´andez, Irene Castell´on, and M. Antonia Mart´ı. ... prepositions are also taken into account as part of the subcategorisation frame types. Adapting a methodology that has been thought for English presents a few problems, because En- glish is a language...
  • 6
  • 418
  • 0
SCOP: A Structural Classification of Proteins Database for the Investigation of Sequences and Structures ppt

SCOP: A Structural Classification of Proteins Database for the Investigation of Sequences and Structures ppt

Ngày tải lên : 23/03/2014, 12:20
... spectroscopy means that there is now a large and rapidly growing corpus of information available. At present (January, 1995) the Brookhaven Protein Databank (PDB, (Abola et al., 1987)) contains 3091 ... treated as a whole. The domains in large proteins are usually classified individually. The classification is on hierarchical levels that embody the evolutionary and structural relation- ships. FAMILY. ... to use local copies of PDB files if they are available. Equivalent WWW browsers, image-display programs and molecular viewers are also available free for Windows-PC and Macintosh platforms. JMB—MS...
  • 5
  • 546
  • 0
Báo cáo khoa học: "Discriminative Classifiers for Deterministic Dependency Parsing" docx

Báo cáo khoa học: "Discriminative Classifiers for Deterministic Dependency Parsing" docx

Ngày tải lên : 23/03/2014, 18:20
... varying com- plexity, with a separate optimization of learning algorithm parameters for each combination of lan- guage and feature model. The central importance of feature selection and parameter ... space of possible parses (Taskar et al., 2004; McDonald et al., 2005). A radically different approach is to perform disambiguation deterministically, using a greedy parsing algorithm that approximates ... Chapter of the As- sociation for Computational Linguistics (NAACL), pages 132–139. Yuchang Cheng, Masayuki Asahara, and Yuji Mat- sumoto. 200 5a. Chinese deterministic dependency analyzer: Examining...
  • 8
  • 238
  • 0
Báo cáo khoa học: "A Practical Classification of Multiword Expressions" pdf

Báo cáo khoa học: "A Practical Classification of Multiword Expressions" pdf

Ngày tải lên : 31/03/2014, 01:20
... simul- taneously with syntactic analysis. 3 Rationale The above classification was formulated during an examination of the available formalisms for encod- ing multiword expressions, which was a part ... use a powerful formalism (cf. the example in (9)). Our analysis revealed that IDAREX, which is a simple formalism based on regular grammars, is not appropriate for handling expressions that have ... a subclass that allows passivization, another one that allows nominalization and subject-verb inversion, etc. The problem with this approach is that it leads to a proliferation of classes. At least in...
  • 6
  • 431
  • 0
Báo cáo khoa học: "Bootstrapped Training of Event Extraction Classifiers" ppt

Báo cáo khoa học: "Bootstrapped Training of Event Extraction Classifiers" ppt

Ngày tải lên : 31/03/2014, 20:20
... story (i.e., an article that primarily discusses the details of a domain-relevant event). Documents that are classified as event narratives warrant additional scrutiny because they most likely contain a ... previous example, tsunami will not be extracted as a weapon because it has an incompatible semantic class (EVENT), but bomb will be extracted because it has a com- patible semantic class (WEAPON). We ... lookup). Each pattern is then matched against the unannotated texts, and if the extracted noun phrase satisfies its semantic con- straints, then the noun phrase is automatically la- beled as a role...
  • 10
  • 283
  • 0
Bí mật của một trí nhớ   siêu phàm   Secrets of a Super Memory Eran Katz

Bí mật của một trí nhớ siêu phàm Secrets of a Super Memory Eran Katz

Ngày tải lên : 07/04/2014, 16:44
... chúng ta đặt ch a kh a xuống trong khi đang nghĩ đến chuyện khác. Chúng ta đang v a tự hỏi xem phải mang theo thứ gì v a triền miên suy nghĩ về điều đang băn khoăn. Chúng ta đã để ch a kh a trên ... thoại reo vang. Michelle đi nghe điện thoại. Cô bắt đầu độc thoại về anh trai c a mình một cách say s a. Anh ấy v a đi công tác về và đi quên mua cho cô ấy chiếc máy fax như đã h a. Không để ... nhóm có cùng đặc điểm như sau: Sản phẩm làm từ s a: s a, s a chua, bơ (ba sản phẩm). Rau quả: cà chua, ớt, cà rốt (ba sản phẩm). Thịt: bánh hamburger, lườn gà (hai sản phẩm). Một đồng nghiệp...
  • 180
  • 2.1K
  • 4