a scalable probabilistic classifier

Báo cáo khoa học: "A Scalable Probabilistic Classifier for Language Modeling" pdf

Báo cáo khoa học: "A Scalable Probabilistic Classifier for Language Modeling" pdf

Ngày tải lên : 07/03/2014, 22:20
... Ducharme, P. Vincent, and C. Jauvin. 2003. A Neural Probabilistic Language Model. Journal of Machine Learning Research, 3:1137–1155. A. Berger, V. Della Pietra, and S. Della Pietra. 1996. A Maximum ... Categorization Research. Journal of Machine Learning Research, 5:361–397. A. Mnih and G. Hinton. 2008. A Scalable Hierarchical Distributed Language Model. In Advances in Neural Information Processing ... assumptions, which translate into an accurate and scalable model. Future work includes further evaluation of the VMM, e.g. as a language model within a speech recognition or machine translation system....
  • 6
  • 350
  • 0
Báo cáo khoa học: "A Chain-starting Classifier of Definite NPs in Spanish" docx

Báo cáo khoa học: "A Chain-starting Classifier of Definite NPs in Spanish" docx

Ngày tải lên : 08/03/2014, 21:20
... a cate- gory that, although it did originally have a seman- tic meaning of “identifiability”, has increased its range of contexts so that it is often a grammati- cal rather than a semantic category ... AnCora – Annotated Corpora for Spanish and Catalan (Taule et al., 2008), developed at the University of Barcelona and freely available from http: //clic.ub.edu/ancora. AnCora-Es is a half-million-word ... mentions. Given that chain starting is the majority class and following (Ng and Cardie, 2002), we took the “one class” classification as a naive baseline: all instances were classified as chain starting,...
  • 8
  • 322
  • 0
Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

Ngày tải lên : 17/03/2014, 08:20
... easily applicable. This way of teaching a weaker classifier can also be used in other domains, where the task is to in- fer , and an abundance of unlabeled data is available. If one possesses a ... sets are conditionally independent of each other. Each set of features can be used to build a classifier, resulting in two independent classifiers, A and B. Classifications by A on unlabeled data can ... other tasks such as part-of-speech tag- ging, where case information is helpful. With the abundance of unlabeled text available, such an ap- proach requires no additional annotation effort, and hence...
  • 8
  • 285
  • 0
Báo cáo khoa học: "A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation" potx

Báo cáo khoa học: "A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation" potx

Ngày tải lên : 23/03/2014, 16:20
... it by far the largest hand-annotated compound noun dataset in existence that we are aware of. Proper nouns were not included. The next largest available datasets have a vari- ety of drawbacks for ... interpreta- tion in general text. Kim and Baldwin’s (2005) dataset is the second largest available dataset, but inter-annotator agreement was only 52.3%, and the annotations had an usually lopsided ... Linguistics. Berger, A. , S. A. Della Pietra, and V. J. Della Pietra. 1996. A Maximum Entropy Approach to Natural Language Processing. Computational Linguistics 22:39-71. Brants, T. and A. Franz. 2006....
  • 10
  • 475
  • 0
Tài liệu Báo cáo khoa học: "Creating Robust Supervised Classifiers via Web-Scale N-gram Data" pdf

Tài liệu Báo cáo khoa học: "Creating Robust Supervised Classifiers via Web-Scale N-gram Data" pdf

Ngày tải lên : 20/02/2014, 04:20
... Workshop on Natural Language Generation. Natalia N. Modjeska, Katja Markert, and Malvina Nis- sim. 2003. Using the Web in machine learning for other-anaphora resolution. In EMNLP. Preslav Nakov and Marti ... bedrag- gled 56-year-old [professor]. Also, in a particu- lar domain, words may have a non-standard usage. Systems trained on labeled data can learn the do- main usage and leverage other regularities, ... Linking Biological Lit- erature, Ontologies and Databases. Mirella Lapata and Frank Keller. 2005. Web-based models for natural language processing. ACM Transactions on Speech and Language Processing, 2(1):1–31. Mark...
  • 10
  • 359
  • 0
Tài liệu Báo cáo khoa học: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence" pptx

Tài liệu Báo cáo khoa học: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence" pptx

Ngày tải lên : 20/02/2014, 18:20
... grammatical and n-gram based statistical language constraints, and uses a robust parsing technique to apply the grammatical constraints described by context-free grammar (Tsukada et aL, 97). ... the Error-Pattem-Database and String-Database can be mechanically prepared, which reduces the effort required to prepare the databases and makes it possible to apply this method to a new recognition ... Error-Pattern examples. Table 2-1 Examples of Error-Patterns Correct-Part Error-Part 2.1.1 Extraction of Error-Patterns The Error-Pattern-Database is mechanically prepared using a pair of parts...
  • 5
  • 588
  • 0
Tài liệu Báo cáo khoa học: "GPSM: A GENERALIZED PROBABILISTIC SEMANTIC MODEL FOR AMBIGUITY RESOLUTION" pptx

Tài liệu Báo cáo khoa học: "GPSM: A GENERALIZED PROBABILISTIC SEMANTIC MODEL FOR AMBIGUITY RESOLUTION" pptx

Ngày tải lên : 20/02/2014, 21:20
... measure shows substantial im- provement in structural disambiguation over a syntax-based approach. 1. Introduction In a large natural language processing system, such as a machine translation ... R&D Road II, Science-Based Industrial Park Hsinchu, TAIWAN 30077, R.O.C. ABSTRACT In natural language processing, ambiguity res- olution is a central issue, and can be regarded as a ... from a semantic representation. In general, a particular interpretation of a sentence can be represented by an annotated syntax tree (AST), which is a syntax tree annotated with fea- ture...
  • 8
  • 412
  • 0
A Scalable and Explicit Event Delivery Mechanism for UNIX doc

A Scalable and Explicit Event Delivery Mechanism for UNIX doc

Ngày tải lên : 07/03/2014, 17:20
... indicating which file descriptors are available for I/O. A member of the readfds set is available if there is any available input data; a member of writefds is con- sidered writable if the available ... NetBIOS provides a command's result via a callback. The NetBIOS “receive any” command returns (calls back) when data arrives on any network “session” (con- nection). This allows an application to wait ... Banga gaurav@netapp.com Network Appliance Inc., 2770 San Tomas Expressway, Santa Clara, CA 95051 Jeffrey C. Mogul mogul@pa.dec.com Compaq Computer Corp. Western Research Lab., 250 University Ave.,...
  • 14
  • 453
  • 0
Báo cáo khoa học: "A Morphographemic Model for Error Correction Nonconcatenative Strings" pot

Báo cáo khoa học: "A Morphographemic Model for Error Correction Nonconcatenative Strings" pot

Ngày tải lên : 08/03/2014, 07:20
... takaatab tukuutib 7 nkatab nkutib 8 ktatab ktutib 9 ktabab 10 staktab stuktib 11 ktaabab 12 ktawtab 13 ktawwab 14 ktanbab 15 ktanbay Q1 dahraj duhrij Q2 tadahraj tuduhrij Q3 dhanraj ... Forms kadi~ kud~, *kidaa~ kaafil kuffal, *kufalaa~, *kuffaal kaffil kufalaaP sahm *Pashaam, suhuum, Pashum Patterns marked with * are morphologically plausi- ble, but do not occur lexically ... data. Subsection 3.2 presents error checking. (4) ARABIC VERBAL STEMS Measure Active Passive 1 katab kutib 2 kattab kuttib 3 kaatab kuutib 4 ~aktab ~uktib 5 takattab tukuttib 6 takaatab...
  • 7
  • 451
  • 0
Chord: A Scalable Peertopeer Lookup Service for Internet Applications pot

Chord: A Scalable Peertopeer Lookup Service for Internet Applications pot

Ngày tải lên : 15/03/2014, 22:20
... ring. Assuming that the data Chord is being used to locate is cryptographically authenticated, this is a threat to availability of data rather than to authenticity. The same approach used above ... virtual nodes as an indirection layer can sig- nificantly improve load balance. The tradeoff is that routing table space usage will increase as each actual node now needs times as much space to ... mechanism also helps higher layer software replicate data. A typical application using Chord might store repli- cas of the data associated with a key at the nodes succeeding the key. The fact that...
  • 12
  • 441
  • 0
LogBase: A Scalable Log-structured Database System in the Cloud pot

LogBase: A Scalable Log-structured Database System in the Cloud pot

Ngày tải lên : 16/03/2014, 16:20
... capability of recovering data from machine failures compared to the WAL+Data approach. Recall that in the WAL+Data approach, data durability is guar- anteed with the “stable storage” assumption, i.e., ... database systems such as System R [14] use shadow pag- ing strategy to avoid the cost of in-place updates. When a transac- tion updates a data page, it makes a copy, i.e., a shadow, of that page and ... transactions. 3.3 Architecture Overview  DFS Client  Data Access Manager Mem index Read cache Transaction Manager …  Data Access Manager Mem index Read cache Transaction Manager  Data...
  • 12
  • 628
  • 0
A study on punctuation errors in writing of first year English majors at HPU

A study on punctuation errors in writing of first year English majors at HPU

Ngày tải lên : 20/03/2014, 01:26
... paragraph may stand by itself or may also be one part of a longer piece of writing such as a chapter of a book or essay. According to Dorothy E. Zemach and Lisa A. Rumiser a paragraph is a ... English majors. 19 2. Paragraph. 2.1. Definition A paragraph is a basic unit of organization in writing in which a group of related sentences develop one main idea. A paragraph can be as short ... a large deck and pool. The pool was set in a private area and had views of the lake and mountains beyond …. It was not apparent to us until much later that our neighbors felt that their peace...
  • 71
  • 910
  • 7
NiagaraCQ: A Scalable Continuous Query System for Internet Databases ppt

NiagaraCQ: A Scalable Continuous Query System for Internet Databases ppt

Ngày tải lên : 23/03/2014, 03:20
... initial writing of the paper. We are particularly grateful to Ashraf Aboulnaga, Navin Kabra and David Maier for their careful review and helpful comments on the paper. We also thank the anonymous ... (1999). [MD89] D. McCarthy and U. Dayal. The architecture of an active database management system. SIGMOD 1989: 215-224. [RC88] A. Rosenthal and U. S. Chakravarthy. Anatomy of a Modular Multiple Query ... Third, NiagaraCQ groups both change-based and timer-based queries in a uniform way. To insure that NiagaraCQ is scalable, we have also employed other techniques including incremental evaluation...
  • 12
  • 425
  • 0
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications ppt

Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications ppt

Ngày tải lên : 23/03/2014, 03:20
... section a Chord-based application that maps keys onto val- ues. A value can be an address, a document, or an arbitrary data item. A Chord-based application would store and find each value at the ... in a particular geographi- cal region, or all the nodes that usea particular access link, orall the nodes that have a certain IP address prefix. As was discussed above, because Chord node IDs are ... is a variant of the Plaxton algorithm. Like Chord, it guarantees that queries make no more than a logarithmic number of hops and that keys are well-balanced. The Plaxton protocol’s main advantage...
  • 14
  • 539
  • 1
Báo cáo khoa học: "Discriminative Classifiers for Deterministic Dependency Parsing" docx

Báo cáo khoa học: "Discriminative Classifiers for Deterministic Dependency Parsing" docx

Ngày tải lên : 23/03/2014, 18:20
... varying com- plexity, with a separate optimization of learning algorithm parameters for each combination of lan- guage and feature model. The central importance of feature selection and parameter ... space of possible parses (Taskar et al., 2004; McDonald et al., 2005). A radically different approach is to perform disambiguation deterministically, using a greedy parsing algorithm that approximates ... Chapter of the As- sociation for Computational Linguistics (NAACL), pages 132–139. Yuchang Cheng, Masayuki Asahara, and Yuji Mat- sumoto. 200 5a. Chinese deterministic dependency analyzer: Examining...
  • 8
  • 238
  • 0