1. Trang chủ
  2. » Thể loại khác

An introduction to language processing with perl and prolog

523 18 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 523
Dung lượng 2,96 MB

Nội dung

Cognitive Technologies Managing Editors: D M Gabbay J Siekmann Editorial Board: A Bundy J G Carbonell M Pinkal H Uszkoreit M Veloso W Wahlster M J Wooldridge Advisory Board: Luigia Carlucci Aiello Franz Baader Wolfgang Bibel Leonard Bolc Craig Boutilier Ron Brachman Bruce G Buchanan Anthony Cohn Artur d’Avila Garcez Luis Fariñas del Cerro Koichi Furukawa Georg Gottlob Patrick J Hayes James A Hendler Anthony Jameson Nick Jennings Aravind K Joshi Hans Kamp Martin Kay Hiroaki Kitano Robert Kowalski Sarit Kraus Maurizio Lenzerini Hector Levesque John Lloyd Alan Mackworth Mark Maybury Tom Mitchell Johanna D Moore Stephen H Muggleton Bernhard Nebel Sharon Oviatt Luis Pereira Lu Ruqian Stuart Russell Erik Sandewall Luc Steels Oliviero Stock Peter Stone Gerhard Strube Katia Sycara Milind Tambe Hidehiko Tanaka Sebastian Thrun Junichi Tsujii Kurt VanLehn Andrei Voronkov Toby Walsh Bonnie Webber Pierre M Nugues An Introduction to Language Processing with Perl and Prolog An Outline of Theories, Implementation, and Application with Special Consideration of English, French, and German With 153 Figures and 192 Tables 123 Author: Pierre M Nugues Institutionen för Datavetenskap Lunds Tekniska Högskola E-huset Ole Römers väg 223 63 Lund, Sweden Pierre.Nugues@cs.lth.se Managing Editors: Prof Dov M Gabbay Augustus De Morgan Professor of Logic Department of Computer Science, King’s College London Strand, London WC2R 2LS, UK Prof Dr Jörg Siekmann Forschungsbereich Deduktions- und Multiagentensysteme, DFKI Stuhlsatzenweg 3, Geb 43, 66123 Saarbrücken, Germany Library of Congress Control Number: 2005938508 ACM Computing Classification (1998): D.1.6, F.3, H.3, H.5.2, I.2.4, I.2.7, I.7, J.5 ISSN 1611-2482 ISBN-10 3-540-25031-X Springer Berlin Heidelberg New York ISBN-13 978-3-540-25031-9 Springer Berlin Heidelberg New York This work is subject to copyright All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilm or in any other way, and storage in data banks Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer Violations are liable for prosecution under the German Copyright Law Springer is a part of Springer Science+Business Media springer.com © Springer-Verlag Berlin Heidelberg 2006 Printed in Germany The use of general descriptive names, registered names, trademarks, etc in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use Cover Design: KünkelLopka, Heidelberg Typesetting: by the Author Production: LE-TEX Jelonek, Schmidt & Vöckler GbR, Leipzig Printed on acid-free paper 45/3100/YL 543210 À mes parents, À Madeleine Preface In the past 15 years, natural language processing and computational linguistics have considerably matured The move has mainly been driven by the massive increase of textual and spoken data and the need to process them automatically This dramatic growth of available data spurred the design of new concepts and methods, or their improvement, so that they could scale up from a few laboratory prototypes to proven applications used by millions of people Concurrently, speed and capacity of machines became an order of magnitude larger enabling us to process gigabytes of data and billions of words in a reasonable time, to train, test, retrain, and retest algorithms like never before Although systems entirely dedicated to language processing remain scarce, there are now scores of applications that, to some extent, embed language processing techniques The industry trend, as well as the user’s wishes, towards information systems able to process textual data has made language processing a new requirement for many computer science students This has shifted the focus of textbooks from readers being mostly researchers or graduate students to a larger public, from readings by specialists to pragmatism and applied programming Natural language processing techniques are not completely stable, however They consist of a mix that ranges from well mastered and routine to rapidly changing This makes the existence of a new book an opportunity as well as a challenge This book tries to take on this challenge and find the right balance It adopts a hands-on approach It is a basic observation that many students have difficulties to go from an algorithm exposed using pseudo-code to a runnable program I did my best to bridge the gap and provide the students with programs and ready-made solutions The book contains real code the reader can study, run, modify, and run again I chose to write examples in two languages to make the algorithms easy to understand and encode: Perl and Prolog One of the major driving forces behind the recent improvements in natural language processing is the increase of text resources and annotated data The huge amount of texts made available by Internet and the never-ending digitization led many of the practitioners to evolve from theory-oriented, armchair linguists to frantic empiricists This books attempts as well as it can to pay attention to this trend and VIII Preface stresses the importance of corpora, annotation, and annotated corpora It also tries to go beyond English-only and expose examples in two other languages, namely French and German The book was designed and written for a quarter or semester course At Lund, I used it when it was still under the form of lecture notes in the EDA171 course It comes with a companion web site where slides, programs, corrections, an additional chapter, and Internet pointers are available: www.cs.lth.se/˜pierre/ilppp/ All the computer programs should run with Perl available from www.perl.com or Prolog Although I only tested the programs with SWI Prolog available from www.swiprolog.org, any Prolog compatible with the ISO reference should apply Many people helped me during the last 10 years when this book took shape, stepby-step I am deeply indebted to my colleagues and to my students in classes at Caen, Nottingham, Stafford, Constance, and now in Lund Without them, it could never have existed I would like most specifically to thank the PhD students I supervised, in chronological order, Pierre-Olivier El Guedj, Christophe Godéreaux, Dominique Dutoit, and Richard Johansson Finally, my acknowledgments would not be complete without the names of the people I most cherish and who give meaning to my life, my wife, Charlotte, and my children, Andreas and Louise Lund, January 2006 Pierre Nugues Contents An Overview of Language Processing 1.1 Linguistics and Language Processing 1.2 Applications of Language Processing 1.3 The Different Domains of Language Processing 1.4 Phonetics 1.5 Lexicon and Morphology 1.6 Syntax 1.6.1 Syntax as Defined by Noam Chomsky 1.6.2 Syntax as Relations and Dependencies 1.7 Semantics 1.8 Discourse and Dialogue 1.9 Why Speech and Language Processing Are Difficult 1.9.1 Ambiguity 1.9.2 Models and Their Implementation 1.10 An Example of Language Technology in Action: the Persona Project 1.10.1 Overview of Persona 1.10.2 The Persona’s Modules 1.11 Further Reading 1 8 10 11 14 14 15 16 17 17 18 19 Corpus Processing Tools 2.1 Corpora 2.1.1 Types of Corpora 2.1.2 Corpora and Lexicon Building 2.1.3 Corpora as Knowledge Sources for the Linguist 2.2 Finite-State Automata 2.2.1 A Description 2.2.2 Mathematical Definition of Finite-State Automata 2.2.3 Finite-State Automata in Prolog 2.2.4 Deterministic and Nondeterministic Automata 2.2.5 Building a Deterministic Automata from a Nondeterministic One 23 23 23 24 26 27 27 28 29 30 31 X Contents 2.2.6 Searching a String with a Finite-State Automaton 2.2.7 Operations on Finite-State Automata Regular Expressions 2.3.1 Repetition Metacharacters 2.3.2 The Longest Match 2.3.3 Character Classes 2.3.4 Nonprintable Symbols or Positions 2.3.5 Union and Boolean Operators 2.3.6 Operator Combination and Precedence Programming with Regular Expressions 2.4.1 Perl 2.4.2 Matching 2.4.3 Substitutions 2.4.4 Translating Characters 2.4.5 String Operators 2.4.6 Back References Finding Concordances 2.5.1 Concordances in Prolog 2.5.2 Concordances in Perl Approximate String Matching 2.6.1 Edit Operations 2.6.2 Minimum Edit Distance 2.6.3 Searching Edits in Prolog Further Reading 31 33 35 36 37 38 39 41 41 42 42 42 43 44 44 45 46 46 48 50 50 51 54 55 Encoding, Entropy, and Annotation Schemes 3.1 Encoding Texts 3.2 Character Sets 3.2.1 Representing Characters 3.2.2 Unicode 3.2.3 The Unicode Encoding Schemes 3.3 Locales and Word Order 3.3.1 Presenting Time, Numerical Information, and Ordered Words 3.3.2 The Unicode Collation Algorithm 3.4 Markup Languages 3.4.1 A Brief Background 3.4.2 An Outline of XML 3.4.3 Writing a DTD 3.4.4 Writing an XML Document 3.4.5 Namespaces 3.5 Codes and Information Theory 3.5.1 Entropy 3.5.2 Huffman Encoding 3.5.3 Cross Entropy 59 59 60 60 61 63 66 2.3 2.4 2.5 2.6 2.7 66 67 69 69 69 71 74 75 76 76 77 80 Contents XI 3.5.4 Perplexity and Cross Perplexity Entropy and Decision Trees 3.6.1 Decision Trees 3.6.2 Inducing Decision Trees Automatically Further Reading 81 82 82 82 84 Counting Words 4.1 Counting Words and Word Sequences 4.2 Words and Tokens 4.2.1 What Is a Word? 4.2.2 Breaking a Text into Words: Tokenization 4.3 Tokenizing Texts 4.3.1 Tokenizing Texts in Prolog 4.3.2 Tokenizing Texts in Perl 4.4 N -grams 4.4.1 Some Definitions 4.4.2 Counting Unigrams in Prolog 4.4.3 Counting Unigrams with Perl 4.4.4 Counting Bigrams with Perl 4.5 Probabilistic Models of a Word Sequence 4.5.1 The Maximum Likelihood Estimation 4.5.2 Using ML Estimates with Nineteen Eighty-Four 4.6 Smoothing N -gram Probabilities 4.6.1 Sparse Data 4.6.2 Laplace’s Rule 4.6.3 Good–Turing Estimation 4.7 Using N -grams of Variable Length 4.7.1 Linear Interpolation 4.7.2 Back-off 4.8 Quality of a Language Model 4.8.1 Intuitive Presentation 4.8.2 Entropy Rate 4.8.3 Cross Entropy 4.8.4 Perplexity 4.9 Collocations 4.9.1 Word Preference Measurements 4.9.2 Extracting Collocations with Perl 4.10 Application: Retrieval and Ranking of Documents on the Web 4.11 Further Reading 87 87 87 87 88 89 89 91 92 92 93 93 95 95 95 97 99 99 100 101 102 103 104 104 104 105 105 106 106 107 108 109 111 Words, Parts of Speech, and Morphology 5.1 Words 5.1.1 Parts of Speech 5.1.2 Features 5.1.3 Two Significant Parts of Speech: The Noun and the Verb 113 113 113 114 115 3.6 3.7 XII Contents 5.2 5.3 5.4 5.5 5.6 5.7 Lexicons 117 5.2.1 Encoding a Dictionary 119 5.2.2 Building a Trie in Prolog 121 5.2.3 Finding a Word in a Trie 123 Morphology 123 5.3.1 Morphemes 123 5.3.2 Morphs 124 5.3.3 Inflection and Derivation 125 5.3.4 Language Differences 129 Morphological Parsing 130 5.4.1 Two-Level Model of Morphology 130 5.4.2 Interpreting the Morphs 131 5.4.3 Finite-State Transducers 131 5.4.4 Conjugating a French Verb 133 5.4.5 Prolog Implementation 134 5.4.6 Ambiguity 136 5.4.7 Operations on Finite-State Transducers 137 Morphological Rules 138 5.5.1 Two-Level Rules 138 5.5.2 Rules and Finite-State Transducers 139 5.5.3 Rule Composition: An Example with French Irregular Verbs 141 Application Examples 142 Further Reading 142 Part-of-Speech Tagging Using Rules 6.1 Resolving Part-of-Speech Ambiguity 6.1.1 A Manual Method 6.1.2 Which Method to Use to Automatically Assign Parts of Speech 6.2 Tagging with Rules 6.2.1 Brill’s Tagger 6.2.2 Implementation in Prolog 6.2.3 Deriving Rules Automatically 6.2.4 Confusion Matrices 6.3 Unknown Words 6.4 Standardized Part-of-Speech Tagsets 6.4.1 Multilingual Part-of-Speech Tags 6.4.2 Parts of Speech for English 6.4.3 An Annotation Scheme for Swedish 6.5 Further Reading 147 147 147 147 149 149 151 153 154 154 156 156 158 160 162 500 References Church, K W and Hanks, P (1990) Word association norms, mutual information, and lexicography Computational Linguistics, 16(1):22–29 Church, K W and Mercer, R L (1993) Introduction to the special issue on computational linguistics using large corpora Computational Linguistics, 19(1):1–24 Clarkson, P R and Rosenfeld, R (1997) Statistical language modeling using the CMU-Cambridge toolkit In Proceedings ESCA Eurospeech Clocksin, W F and Mellish, C S (2003) Programming in Prolog: Using the ISO Standard Springer Verlag, Berlin Heidelberg New York, 5th edition Collins, M J (1996) A new statistical parser based on bigram lexical dependencies In Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, pages 184–191, Santa Cruz, California Collins, M J (1999) Head-Driven Statistical Models for Natural Language Parsing PhD thesis, University of Pennsylvania Collins, M J (2003) Head-driven statistical models for natural language parsing Computational Linguistics, 29(4):589–637 Colmerauer, A (1970) Les systèmes-Q ou un formalisme pour analyser et synthétiser des phrases sur ordinateur Publication interne 43, Département d’informatique, Université de Montréal Colmerauer, A (1978) Metamorphosis grammars In Bolc, L., editor, Natural language communication with computers, volume 63 of Lecture Notes in Computer science, pages 133–189 Springer Verlag, Berlin Heidelberg New York Colmerauer, A (1982) An interesting subset of natural language In Clark, K L and Tärnlund, S., editors, Logic Programming, pages 45–66 Academic Press, London Colmerauer, A., Kanoui, H., Pasero, R., and Roussel, P (1972) Un systốme de communication en franỗais Rapport prộliminaire de n de contrat IRIA, Groupe Intelligence Artificielle, Faculté des Sciences de Luminy, Université d’Aix-Marseille II Colmerauer, A and Roussel, P (1996) The birth of Prolog In Bergin, T J and Gibson, R G., editors, History of Programming Languages II ACM Press/AddisonWesley Constant, P (1991) Analyse syntaxique par couches Thèse de doctorat, École Nationale Supérieure des Télécommunications, Paris Cooper, D (1999) Corpora: kwic concordances with Perl CORPORA Mailing List Archive, Concordancing thread Corbett, E P J and Connors, R J (1999) Classical Rhetoric for the Modern Student Oxford University Press, New York, 4th edition Corston-Oliver, S (1998) Computing Representations of the Structure of Written Discourse PhD thesis, Linguistics Department, the University of California, Santa Barbara Coulthard, M (1985) An Introduction to Discourse Analysis Longman, Harlow, 2nd edition Covington, M (1989) GULP 2.0, an extension of Prolog for unification-based grammar Research report AI-1989-01, Artificial Intelligence Programs, University of Georgia References 501 Covington, M (1990) Parsing discontinuous constituents in dependency grammar Computational Linguistics, 16(4):234–236 Covington, M (1994) Natural Language Processing for Prolog Programmers Prentice Hall, Upper Saddle River Covington, M., Nute, D., and Vellino, A (1997) Prolog Programming in Depth Prentice Hall, Upper Saddle River Cozannet, A (1992) A model for task driven oral dialogue In Proceedings of the Second International Conference on Spoken Language Processing (ICSLP), pages 1451–1454 Crystal, D (1997) The Cambridge Encyclopedia of Language Cambridge University Press, Cambridge, 2nd edition d’Arc, S J., editor (1970) Concordance de la Bible, Nouveau Testament Éditions du Cerf – Desclées De Brouwer, Paris Davidson, D (1966) The logical form of action sentences In Rescher, N., editor, The Logic of Decision and Action University of Pittsburgh Press, Pittsburgh Davis, M and Whistler, K (2002) Unicode collation algorithm Unicode Technical Standard 10, The Unicode Consortium de la Briandais, R (1959) File searching using variable length keys In Proceedings of the Western Joint Computer Conference, pages 295–298, San Francisco AFIPS Delahaye, J.-P (1986) Outils logiques pour l’intelligence artificielle Eyrolles, Paris Deransart, P., Ed-Dbali, A A., and Cervoni, L (1996) Prolog: The Standard, Reference Manual Springer Verlag, Berlin Heidelberg New York Dermatas, E and Kokkinakis, G K (1995) Automatic stochastic tagging of natural language texts Computational Linguistics, 21(2):137–163 Ducrot, O and Schaeffer, J.-M., editors (1995) Nouveau dictionnaire encyclopédique des sciences du langage Éditions du Seuil, Paris Dunning, T (1993) Accurate methods for the statistics of surprise and coincidence Computational Linguistics, 19(1):61–74 Dutoit, D (1992) A set-theoretic approach to lexical semantics In Proceedings of the 15th International Conference on Computational Linguistics, COLING-92, volume III, pages 982–987, Nantes Earley, J C (1970) An efficient context-free parsing algorithm Communications of the ACM, 13(2):94–102 Eckert, W (1996) Gesprochener Mensch-Machine-Dialog Shaker Verlag, Aachen Ejerhed, E (1988) Finding clauses in unrestricted text by finitary and stochastic methods In Second Conference on Applied Natural Language Processing, pages 219–227 ACL Ejerhed, E., Källgren, G., Wennstedt, O., and Åstrưm, M (1992) The linguistic annotation system of the Stockholm-Umể corpus project Technical Report 33, Department of General Linguistics, University of Umeå El Guedj, P.-O (1996) Analyse syntaxique par charts combinant règles de dépendance et règles syntagmatiques PhD thesis, Université de Caen Fellbaum, C., editor (1998) WordNet: An Electronic Lexical Database (Language, Speech and Communication) MIT Press, Cambridge, Massachusetts 502 References Fikes, R and Nilsson, N (1971) STRIPS: A new approach to the application of theorem proving to problem solving Artificial Intelligence, 2(3/4):189–208 Fillmore, C J (1968) The case for case In Bach, E and Harms, R T., editors, Universals in Linguistic Theory, pages 1–88 Holt, Rinehart and Winston, New York Fillmore, C J (1976) Frame semantics and the nature of language Annals of the New York Academy of Sciences: Conference on the Origin and Development of Language and Speech, 280:20–32 Franz, A (1996) Automatic Ambiguity Resolution in Natural Language Processing: An Empirical Approach, volume 1171 of Lecture Notes in Artificial Intelligence Springer Verlag, Berlin Heidelberg New York Frege, G (1892) Über Sinn und Bedeutung Zeitschrift für Philosophie und philosophische Kritik, pages C:25–50 English transl in Philosophical writings of Gottlob Frege, Blackwell, 1966 Friedl, J E F (2002) Mastering Regular Expressions O’Reilly, Sebastopol, California, second edition Fromkin, V., editor (2000) Linguistics: An Introduction to Linguistic Theory Blackwell, Oxford Fromkin, V., Rodman, R., and Hyams, N (2003) An Introduction to Language Thomson/Heinle, Boston, 7th edition Frühwirth, T (1998) Theory and practice of constraint handling rules Journal of Logic Programming, 37(1–3):95–138 Gagnon, M and Lapalme, G (1996) From conceptual time to linguistic time Computational Linguistics, 22(1):91–127 Gaifman, H (1965) Dependency systems and phrase-structure systems Information and Control, 8:304–337 Gal, A., Lapalme, G., and Saint-Dizier, P (1989) Prolog pour l’analyse automatique du langage naturel Eyrolles, Paris Gal, A., Lapalme, G., Saint-Dizier, P., and Somers, H (1991) Prolog for Natural Language Processing Wiley, Chichester Gale, W A and Church, K W (1993) A program for aligning sentences in bilingual corpora Computational Linguistics, 19(1):75–102 Gazdar, G and Mellish, C (1989) Natural Language Processing in Prolog: An Introduction to Computational Linguistics Addison-Wesley, Wokingham Gendner, V., Illouz, G., Jardino, M., Monceaux, L., Paroubek, P., Robba, I., and Vilnat, A (2003) PEAS, the first instantiation of a comparative framework for evaluating parsers of French In Proceedings of the Research Note Sessions of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL’03), volume 2, pages 95–98, Budapest Giannesini, F., Kanoui, H., Pasero, R., and van Caneghem, M (1985) Prolog Interéditions, Paris Gildea, D and Jurafsky, D (2002) Automatic labeling of semantic roles Computational Linguistics, 28(3):245–288 References 503 Godart-Wendling, B., Ildefonse, F., Pariente, J.-C., and Rosier, I (1998) Penser le principe de compositionnalité : éléments de réflexion historiques et épistémologiques Traitement automatique des langues, 39(1):9–34 Godéreaux, C., Diebel, K., El Guedj, P.-O., Revolta, F., and Nugues, P (1996) An interactive spoken dialog interface to virtual worlds In Connolly, J H and Pemberton, L., editors, Linguistic Concepts and Methods in CSCW, Computer supported cooperative work, chapter 13, pages 177–200 Springer Verlag, Berlin Heidelberg New York Godéreaux, C., El Guedj, P.-O., Revolta, F., and Nugues, P (1998) Ulysse: An interactive, spoken dialogue interface to navigate in virtual worlds Lexical, syntactic, and semantic issues In Vince, J and Earnshaw, R., editors, Virtual Worlds on the Internet, chapter 4, pages 53–70, 308–312 IEEE Computer Society Press, Los Alamitos, California Goldfarb, C F (1990) The SGML Handbook Oxford University Press, Oxford Good, I J (1953) The population frequencies of species and the estimation of population parameters Biometrika, 40(16):237–264 Gosselin, L (1996) Sộmantique de la temporalitộ en franỗais : un modèle calculatoire et cognitif du temps et de l’aspect Duculot, Louvain-la-Neuve Graham, S L., Harrison, M A., and Ruzzo, W L (1980) An improved contextfree recognizer ACM Transactions on Programming Languages and Systems, 2(3):415–462 Grefenstette, G and Tapanainen, P (1994) What is a word, what is a sentence? Problems in tokenization MLTT Technical Report 4, Xerox Grosz, B J., Joshi, A K., and Weinstein, S (1995) Centering: A framework for modeling the local coherence of discourse Computational Linguistics, 21(2):203– 225 Grosz, B J and Sidner, C L (1986) Attention, intention, and the structure of discourse Computational Linguistics, 12(3):175–204 Habermas, J (1988) Nachmetaphysisches Denken Suhrkamp, Frankfurt am Main Haegeman, L and Gueron, J (1999) English Grammar: A Generative Perspective Number 14 in Blackwell Textbooks in Linguistics Blackwell, Malden Harper, M H., Hockema, S A., and White, C M (1999) Enhanced constraint dependency grammar parsers In Proceedings of the IASTED International Conference on Artificial Intelligence and Soft Computing, Honolulu Harris, R and Taylor, T J (1997) Landmarks in Linguistic Thought, The Western Tradition from Socrates to Saussure Routledge, London, 2nd edition Harris, Z (1962) String Analysis of Sentence Structure Mouton, The Hague Hausser, R (2000) Grundlagen der Computerlinguistik Mensch-MaschineKommunikation in natürlicher Sprache Springer Verlag, Berlin Heidelberg New York Hausser, R (2001) Foundations of Computational Linguistics Human-Computer Communication in Natural Language Springer Verlag, Berlin Heidelberg New York, 2nd edition Hays, D G (1964) Dependency theory: A formalism and some observations Language, 40(4):511–525 504 References Hellwig, P (1980) PLAIN – a program system for dependency analysis and for simulating natural language inference In Bolc, L., editor, Representation and Processing of Natural Language, pages 271–376 Hanser, München Hellwig, P (1986) Dependency unification grammar (DUG) In Proceedings of the 11th International Conference on Computational Linguistics (COLING 86), pages 195–198 Herbrand, J (1930) Recherches sur la théorie de la démonstration Travaux de la Société des Sciences et des Lettres de Varsovie, Classe III Sciences mathématiques et physiques, 33 Heringer, H.-J (1993) Dependency syntax – basic ideas and the classical model In Jacobs, J., von Stechow, A., Sternefeld, W., and Venneman, T., editors, Syntax – An International Handbook of Contemporary Research, volume 1, chapter 12, pages 298–316 Walter de Gruyter, Berlin – New York Hindle, D and Rooth, M (1993) Structural ambiguity and lexical relations Computational Linguistics, 19(1):103–120 Hintikka, J (1962) Knowledge and Belief, An Introduction to the Logic of the Two Notions Cornell University Press, Ithaca Hirschman, L and Chinchor, N (1997) MUC-7 coreference task definition Technical report http://www.itl.nist.gov/iaui/894.02/related_projects/muc/proceedings/co_task.html Cited November 2005 Hjelmslev, L (1935-37) La catégorie des cas Étude de grammaire générale, volume VII(1), IX(2) of Acta Jutlandica Aarhus Hjelmslev, L (1943) Omkring sprogteoriens grundlæggelse Festskrift udgivet af Københavns Universiteit, Copenhagen English translation Prolegomena to a Theory of Language Hobbs, J R., Appelt, D E., Bear, J., Israel, D., Kameyama, M., Stickel, M., and Tyson, M (1997) FASTUS: a cascaded finite-state transducer for extracting information from natural-language text In Roche, E and Schabes, Y., editors, FiniteState Language Processing, chapter 13, pages 383–406 MIT Press, Cambridge, Massachusetts Hopcroft, J E., Motwani, R., and Ullman, J D (2001) Introduction to Automata Theory, Languages, and Computation Addison-Wesley, Boston, 2nd edition Hornby, A S., editor (1974) Oxford Advanced Learner’s Dictionary of Current English Oxford University Press, Oxford, 3rd edition Hornby, A S., editor (1995) Oxford Advanced Learner’s Dictionary of Current English Oxford University Press, Oxford, 5th edition Huang, X., Acero, A., Alleva, F., Hwang, M.-Y., Jiang, L., and Mahaja, M (1995) Microsoft highly intelligent speech recognizer: Whisper In Proceedings of the International Conference on Acoustic, Speech, and Signal Processing, Detroit Huls, C., Claassen, W., and Bos, E (1995) Automatic referent resolution of deictic and anaphoric expressions Computational Linguistics, 21(1):59–79 Ide, N and Véronis, J (1998) Introduction to the special issue on word sense disambiguation: The state of the art Computational Linguistics, 24(1):1–40 References 505 Imbs, P and Quemada, B., editors (1971-1994) Trộsor de la langue franỗaise Dictionnaire de la langue franỗaise du XIXe et du XXe siècle (1789-1960) Éditions du CNRS puis Gallimard, Paris 16 volumes Ingria, B and Pustejovsky, J (2004) TimeML: A formal specification language for events and temporal expressions http://www.cs.brandeis.edu/˜jamesp/arda/time/timeMLdocs/TimeML12.htm Cited 28 October 2005 Jackendoff, R (1990) Semantic Structures MIT Press, Cambridge, Massachusetts Jacob, A., editor (1989) Encyclopédie philosophique universelle Presses Universitaires de France, Paris Järvinen, T and Tapanainen, P (1997) A dependency parser for English Technical Report TR-1, Department of General Linguistics, University of Helsinki Jekat, S., Klein, A., Maier, E., Maleck, I., Mast, M., and Quantz, J (1995) Dialogue acts in Verbmobil Verbmobil-Report 65, Universität Hamburg, DFKI, Universität Erlangen, TU Berlin Jelinek, F (1990) Self-organized language modeling for speech recognition In Waibel, A and Lee, K.-F., editors, Readings in Speech Recognition Morgan Kaufmann, San Mateo Reprinted from an IBM Report, 1985 Jelinek, F (1997) Statistical Methods for Speech Recognition MIT Press, Cambridge, Massachusetts Jelinek, F and Mercer, R L (1980) Interpolated estimation of Markov source parameters from sparse data In Gelsema, E S and Kanal, L N., editors, Pattern Recognition in Practice, pages 38–397 North-Holland, Amsterdam Jensen, K., Heidorn, G., and Richardson, S., editors (1993) Natural Language Processing: The PLNLP Approach Kluwer Academic Publishers, Boston Johansson, R., Berglund, A., Danielsson, M., and Nugues, P (2005) Automatic text-to-scene conversion in the traffic accident domain In IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, pages 1073–1078, Edinburgh Johnson, C D (1972) Formal Aspects of Phonological Description Mouton, The Hague Joshi, A K and Hopely, P (1999) A parser from antiquity: an early application of finite state transducers to natural language processing In Kornai, A., editor, Extended Finite State Models of Language, Studies in Natural Language Processing, pages 6–15 Cambridge University Press, Cambridge Jurafsky, D and Martin, J H (2000) Speech and Language Processing, An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition Prentice Hall, Upper Saddle River Kameyama, M (1997) Recognizing referential links: An information extraction perspective In Mitkov, R and Boguraev, B., editors, Proceedings of ACL Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts, pages 46–53, Madrid Kamp, H and Reyle, U (1993) From Discourse to Logic: Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory Kluwer Academic Press, Dordrecht 506 References Kaplan, R M and Bresnan, J (1982) Lexical-functional grammar: A formal system for grammatical representation In Bresnan, J., editor, The Mental Representation of Grammatical Relations, pages 173–281 MIT Press, Cambridge, Massachusetts Kaplan, R M and Kay, M (1994) Regular models of phonological rule systems Computational Linguistics, 20(3):331–378 Karttunen, L (1983) KIMMO: A general morphological processor Texas Linguistic Forum, 22:163–186 Karttunen, L (1994) Constructing lexical transducers In Proceedings of the 15th Conference on Computational Linguistics, COLING-94, volume 1, pages 406– 411 Karttunen, L., Kaplan, R M., and Zaenen, A (1992) Two-level morphology with composition In Proceedings of the 15th Conference on Computational Linguistics, COLING-92, volume 1, pages 141–148 Kasami, T (1965) An efficient recognition and syntax analysis algorithm for context-free languages Technical Report AFCRL-65-758, Air Force Cambridge Research Laboratory, Bedford, Massachusetts Cited from Wikipedia Katz, S M (1987) Estimation of probabilities from sparse data for the language model component of a speech recognizer IEEE Transactions on Acoustics, Speech, and Signal Processing, 35(3):400–401 Kernighan, M D., Church, K W., and Gale, W A (1990) A spelling correction program based on a noisy channel model In Papers presented to the 13th International Conference on Computational Linguistics (COLING-90), volume II, pages 205–210, Helsinki Kingsbury, P., Palmer, M., and Marcus, M (2002) Adding semantic annotation to the Penn Treebank In Proceedings of the Human Language Technology Conference, San Diego Kiraz, G A (2001) Computational Nonlinear Morphology: With Emphasis on Semitic Languages Studies in Natural Language Processing Cambridge University Press, Cambridge Kleene, S C (1956) Representation of events in nerve nets and finite automata In Shannon, C E and Carthy, J M., editors, Automata Studies, pages 3–42 Princeton University Press, Princeton Klein, S and Simmons, R (1963) A computational approach to grammatical coding of English words Journal of the ACM, 10(3):334–347 Knuth, D E (1986) The Texbook Addison-Wesley, Reading, Massachusetts Koch, U (1993) The enhancement of a dependency parser for Latin Technical Report AI-1993-03, Artificial Intelligence Programs, University of Georgia Kornai, A., editor (1999) Extended Finite State Models of Language Studies in Natural Language Processing Cambridge University Press, Cambridge Koskenniemi, K (1983) Two-level morphology: a general computation model for word-form recognition and production Technical Report 11, University of Helsinki, Department of General Linguistics Kowalski, R A and Kuehner, D (1971) Linear resolution with selection function Artificial Intelligence, 2:227–260 References 507 Lallot, J., editor (1998) La grammaire de Denys le Thrace CNRS Éditions, Collection Science du langage, Paris, 2e edition Text in Greek, translated in French by Jean Lallot Landes, S., Leacock, C., and Tengi, R (1998) Building semantic concordances In Fellbaum, C., editor, WordNet: An Electronic Lexical Database MIT Press, Cambridge, Massachusetts Laplace, P (1820) Théorie analytique des probabilités Coursier, Paris, 3e edition Lasnik, H., Depiante, M A., and Stepanov, A (2000) Syntactic Structures Revisited: Contemporary Lectures on Classic Transformational Theory, volume 33 of Current Studies in Linguistics MIT Press, Cambridge, Massachusetts Lesk, M (1986) Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone In Proceedings of the 5th annual international conference on systems documentation, pages 24–26, Toronto, Ontario Levinson, S (1983) Pragmatics Cambridge University Press, Cambridge Lin, D (1995) A dependency-based method for evaluating broad-coverage parsers In Proceedings of IJCAI-95, pages 1420–1427 Linke, A., Nussbaumer, M., and Portmann, P R (2004) Studienbuch Linguistik Niemeyer, Tübingen, edition Lloyd, J W (1987) Foundations of Logic Programming Springer Verlag, Berlin Heidelberg New York, 2nd edition Magerman, D (1995) Book reviews: Statistical language learning by Eugene Charniak Computational Linguistics, 21(1):103–111 Maier, D and Warren, D S (1988) Computing with Logic, Logic Programming with Prolog Benjamin/Cummings, Menlo Park Malmberg, B (1983) Analyse du langage au XXe siècle Théorie et méthodes Presses universitaires de France, Paris Malmberg, B (1991) Histoire de la linguistique De Sumer Saussure Presses universitaires de France, Paris Mann, W C and Thompson, S A (1987) Rhetorical structure theory: A theory of text organization Technical Report RS-87-190, Information Sciences Institute of the University of Southern California Mann, W C and Thompson, S A (1988) Rhetorical structure theory: Toward a functional theory of text organization Text, 8:243–281 Manning, C D and Schütze, H (1999) Foundations of Statistical Natural Language Processing MIT Press, Cambridge, Massachusetts Marcu, D (1997) The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts PhD thesis, Department of Computer Science, University of Toronto Marcus, M., Marcinkiewicz, M A., and Santorini, B (1993) Building a large annotated corpus of English: The Penn Treebank Computational Linguistics, 19(2):313–330 Martelli, A and Montanari, U (1982) An efficient unification algorithm ACM Transactions on Programming Languages and Systems, 4(2):258–282 508 References Maruyama, H (1990) Constraint dependency grammar and its weak generative capacity Computer Software Mast, M (1993) Ein Dialogmodul für ein Spracherkennungs- und Dialogsystem, volume 50 of Dissertationen zur Künstlichen Intelligenz Infix, Sankt Augustin Mast, M., Kummert, F., Ehrlich, U., Fink, G A., Kuhn, T., Niemann, H., and Sagerer, G (1994) A speech understanding and dialog system with a homogeneous linguistic knowledge base IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(2):179–194 Mauldin, M L and Leavitt, J R R (1994) Web-agent related research at the Center for Machine Translation In Proceedings of the ACM SIG on Networked Information Discovery and Retrieval, McLean, Virginia Maxwell, D (1995) Unification dependency grammar ftp://ftp.ling.ohiostate.edu/pub/HPSG/Papers/UDG/ Draft Cited 28 October 2005 McMahon, J G and Smith, F J (1996) Improving statistical language models performance with automatically generated word hierarchies Computational Linguistics, 22(2):217–247 Mel’cuk, I A (1988) Dependency Syntax: Theory and Practice State University Press of New York, Albany Mel’cuk, I A., Clas, A., and Polguère, A (1995) Introduction la lexicologie explicative et combinatoire Éditions Duculot, Louvain-la-Neuve Merialdo, B (1994) Tagging English text with a probabilistic model Computational Linguistics, 20(2):155–171 Mikheev, A (2002) Periods, capitalized words, etc Computational Linguistics, 28(3):289–318 Miller, G A (1995) WordNet: A lexical database for English Communications of the ACM, 38(11):39–41 Miller, G A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K J., and Tangi, R (1993) Five papers on WordNet Technical report, Princeton University ftp://ftp.cogsci.princeton.edu/pub/wordnet/5papers.ps Cited 28 October 2005 Moeschler, J (1989) Modélisation du dialogue : Représentation de l’inférence argumentative Hermès, Paris Moeschler, J and Reboul, A (1994) Dictionnaire encyclopédique de pragmatique Éditions du Seuil, Paris Mohri, M., Pereira, F C N., and Riley, M (1998) A rational design for a weighted finite-state transducer library In Wood, D and Yu, S., editors, Automata implementation Second International Workshop on Implementing Automata, WIA ’97, London, Ontario, September 1997 Revised Papers, volume 1436 of Lecture Notes in Computer Science, pages 144–158, Berlin Heidelberg New York Springer Verlag Montague, R (1974) Formal Philosophy: Selected Papers Yale University Press, New Haven Morton, A (2003) A Guide through the Theory of Knowledge Blackwell, Malden, 3rd edition MUC-5, editor (1993) Proceedings of the Fifth Message Understanding Conference (MUC-5), San Francisco Morgan Kaufmann References 509 Müller, S (1999) Deutsche Syntax deklarativ Head-Driven Phrase Structure Grammar für das Deutsche, volume 394 of Linguistische Arbeiten Max Niemeyer Verlag, Tübingen Newmeyer, F J., editor (1988) Linguistics: The Cambridge Survey Cambridge University Press, Cambridge Ng, V and Cardie, C (2002) Improving machine learning approaches to coreference resolution In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 104–111 Nguyen, L., Abdou, S., Afify, M., Makhoul, J., Matsoukas, S., Schwartz, R., Xiang, B., Lamel, L., Gauvain, J.-L., Adda, G., Schwenk, H., and Lefevre, F (2004) The 2004 BBN/LIMSI 10xRT English broadcast news transcription system In Proceedings DARPA RT04, Palisades, New York Nilsson, N (1998) Artificial Intelligence: A New Synthesis Morgan Kaufmann, San Francisco Nivre, J (2003) An efficient algorithm for projective dependency parsing In Proceedings of the 8th International Workshop on Parsing Technologies (IWPT 03), pages 149–160, Nancy Nivre, J and Nilsson, J (2005) Pseudo-projective dependency parsing In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05), pages 99–106, Ann Arbor Nivre, J and Scholz, M (2004) Deterministic dependency parsing of English text In Proceedings of Coling 2004, pages 64–70, Geneva Och, F J and Ney, H (2000) Improved statistical alignment models In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, pages 440–447, Hongkong Pereira, F (1981) Extraposition grammars Computational Linguistics, 7(4):243– 256 Pereira, F (1984) C-Prolog User’s Manual, Version 1.5 University of Edinburgh Pereira, F and Warren, D (1980) Definite clause grammar for language analysis–a survey of the formalism and a comparison with augmented transition networks Artificial Intelligence, 13(3):231–278 Pereira, F C N and Shieber, S M (1987) Prolog and Natural-Language Analysis, volume 10 of CSLI Lecture Notes Center for the Study of Language and Information, Stanford Perelman, C and Olbrechts-Tyteca, L (1976) Traité de l’argumentation : la nouvelle rhétorique Éditions de l’Université de Bruxelles, Brussels Pérennou, G and de Calmès, M (1987) BDLex lexical data and knowledge base of spoken and written French In European conference on Speech Technology, pages 393–396, Edinburgh Pollard, C and Sag, I A (1994) Head-Driven Phrase Structure Grammar University of Chicago Press, Chicago Procter, P., editor (1978) Longman Dictionary of Contemporary English Longman, Harlow Procter, P., editor (1995) Cambridge International Dictionary of English Cambridge University Press, Cambridge 510 References Pustejovsky, J (1995) The Generative Lexicon MIT Press, Cambridge, Massachusetts Quillian, M R (1967) Word concepts: a theory and simulation of some basic semantic capabilities Behavioral Science, 12(5):410–430 Quinlan, J R (1986) Induction of decision trees Machine Learning, 1(1):81–106 Rabiner, L R (1989) A tutorial on hidden Markov models and selected applications in speech recognition Proceedings of the IEEE, 77(2):257–286 Radford, A (1988) Transformational Grammar: A First Course Cambridge Textbooks in Linguistics Cambridge University Press, Cambridge Ramshaw, L and Marcus, M P (1995) Text chunking using transformation-based learning In Yarowsky, D and Church, K., editors, Proceedings of the Third Workshop on Very Large Corpora, pages 82–94, Cambridge, Massachusetts Ray, E T (2003) Learning XML O’Reilly, Sebastopol, California, 2nd edition Rayner, M., Bretan, I., Wirén, M., Rydin, S., and Beshai, E (1996) Composition of transfer rules in a multi-lingual MT system In Proceedings of the workshop on Future Issues for Multilingual Text Processing, Cairns, Australia Rayner, M and Carter, D (1995) The spoken language translator project In Proceedings of the Language Engineering Convention, London Rayner, M., Carter, D., Bouillon, P., Digalakis, V., and Wirén, M., editors (2000) The Spoken Language Translator Cambridge University Press, Cambridge Reboul, O (1994) Introduction la rhétorique : théorie et pratique Presses universitaires de France, Paris, 2e edition Reichenbach, H (1947) Elements of Symbolic Logic Macmillan, New York Rey, A., editor (1988) Le Robert Micro Dictionnaires Le Robert, Paris Rich, E and Knight, K (1991) Artificial Intelligence McGraw-Hill, New York, 2nd edition Ritchie, G D., Russell, G J., Black, A W., and Pulman, S G (1992) Computational Morphology Practical Mechanisms for the English Lexicon MIT Press, Cambridge, Massachusetts Robins, R H (1997) A Short History of Linguistics Longman, London, 4th edition Robinson, J A (1965) A machine-oriented logic based on the resolution principle Journal of the ACM, 12(1):23–41 Roche, E and Schabes, Y (1995) Deterministic part-of-speech tagging with finitestate transducers Computational Linguistics, 21(2):227–253 Roche, E and Schabes, Y., editors (1997) Finite-State Language Processing MIT Press, Cambridge, Massachusetts Ruppenhofer, J., Ellsworth, M., Petruck, M R L., and Johnson, C R (2005) Framenet: Theory and practice http://framenet.icsi.berkeley.edu/book/book.html Cited 28 October 2005 Russell, S J and Norvig, P (2003) Artificial Intelligence, A Modern Approach Prentice Hall, Upper Saddle River, 2nd edition Ruwet, N (1970) Introduction la grammaire générative Plon, Paris, 2e edition Sabah, G (1990) L’intelligence artificielle et le langage Hermès, Paris, 2e edition Sagerer, G (1990) Automatisches Verstehen gesprochener Sprache, volume Band 74 of Reihe Informatik B.I Wissenschaftsverlag, Mannheim References 511 Salton, G (1988) Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer Addison-Wesley, Reading, Massachusetts Saussure, F (1916) Cours de linguistique générale Reprinted Payot, 1995, Paris Schiffrin, D (1994) Approaches to Discourse Number in Blackwell Textbooks in Linguistics Blackwell, Oxford Schlkopf, B and Smola, A J (2002) Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond (Adaptive Computation and Machine Learning) MIT Press, Cambridge, Massachusetts Schmid, H (1994) Probabilistic part-of-speech tagging using decision trees In Proceedings of International Conference on New Methods in Language Processing, Manchester Schmid, H (1995) Improvements in part-of-speech tagging with an application to German In Proceedings of the ACL SIGDAT Workshop Schwartz, R L and Phoenix, T (2001) Learning Perl O’Reilly, Sebastopol, California, 3rd edition Searle, J R (1969) Speech Acts An Essay in the Philosophy of Language Cambridge University Press, Cambridge Searle, J R (1979) Expression and Meaning, Studies in the Theory of Speech Acts Cambridge University Press, Cambridge Searle, J R and Vanderveken, D (1985) Foundations of Illocutionary Logic Cambridge University Press, Cambridge Shannon, C (1948) A mathematical theory of communication Bell System Technical Journal, 27:398–403 and 623–656 Simone, R (1998) Fundamenti di linguistica Laterza, Bari, 9e edition Sinclair, J., editor (1987) Collins COBUILD English Language Dictionary Collins, London Sleator, D and Temperley, D (1993) Parsing English with a link grammar In Proceedings of the Third International Workshop on Parsing Technologies, pages 277–291 Smadja, F (1993) Retrieving collocations from text: Xtract Computational Linguistics, 19(1):143–177 Soon, W M., Ng, H T., and Lim, D C Y (2001) A machine learning approach to coreference resolution of noun phrases Computational Linguistics, 27(4):521– 544 Sorin, C., Jouvet, D., Gagnoulet, C., Dubois, D., Sadek, D., and Toularhoat, M (1995) Operational and experimental French telecommunication services using CNET speech recognition and text-to-speech synthesis Speech Communication, 17(3-4):273–286 Sproat, R (1992) Morphology and computation MIT Press, Cambridge, Massachusetts Sterling, L and Shapiro, E (1994) The Art of Prolog Advanced Programming Techniques MIT Press, Cambridge, Massachusetts, 2nd edition Suri, L Z and McCoy, K F (1994) RAFT/RAPR and centering: A comparison and discussion of problems related to processing complex sentences Computational Linguistics, 20(2):301–317 512 References Tapanainen, P and Järvinen, T (1997) A non-projective dependency parser In Proceedings of the Fifth Conference on Applied Natural Language Processing (ANLP’97), pages 64–71, Washington, D.C ACL TAUM (1971) Taum 71 Rapport annuel du projet de traduction automatique de l’université de Montréal, Université de Montréal Taylor, K (1998) Truth and Meaning An Introduction to the Philosophy of Language Blackwell, Oxford Ter Meulen, A (1995) Representing Time in Natural Language The Dynamic Interpretation of Tense and Aspect MIT Press, Cambridge, Massachusetts Tesnière, L (1966) Éléments de syntaxe structurale Klincksieck, Paris, 2e edition The Unicode Consortium (2003) The Unicode Standard, Version 4.0 AddisonWesley, Boston Thompson, K (1968) Regular expression search algorithm Communications of the ACM, 11(6):419–422 van Benthem, J and Ter Meulen, A., editors (1997) Handbook of Logic and Language North Holland, Amsterdam van Noord, G and Gerdemann, D (2001) An extendible regular expression compiler for finite-state approaches in natural language processing In Boldt, O and Jürgensen, H., editors, Automata Implementation 4th International Workshop on Implementing Automata, WIA’99, Potsdam, Germany, July 1999, Revised Papers, volume 2214 of Lecture Notes in Computer Science, pages 122–139, Berlin Heidelberg New York Springer Verlag Vanderveken, D (1988) Les actes de discours : essai de philosophie du langage et de l’esprit sur la signification des énonciations Mardaga, Bruxelles Vendler, Z (1967) Linguistics in Philosophy Cornell University Press, Ithaca, New York Vergne, J (1998) Entre arbre de dépendance et ordre linéaire, les deux processus de transformation: linéarisation, puis reconstruction de l’arbre Cahiers de grammaire, 23 Vergne, J (1999) Étude et modélisation de la syntaxe des langues l’aide de l’ordinateur Analyse syntaxique automatique non combinatoire Synthèse et résultats Habilitation diriger des recherches, Université de Caen Viterbi, A J (1967) Error bounds for convolutional codes and an asymptotically optimum decoding algorithm IEEE Transactions on Information Theory, 13(2):260–267 Voutilainen, A., Heikkilä, J., and Anttila, A (1992) Constraint grammar of English: A performance-oriented introduction Technical Report 21, Department of General Linguistics, University of Helsinki Voutilainen, A and Järvinen, T (1995) Specifying a shallow grammatical representation for parsing purposes In Proceedings of the Seventh Conference of the European Chapter of the ACL, pages 210–214, Dublin Wahrig, G., editor (1978) dtv-Wörterbuch der deutschen Sprache Deutscher Taschenbuch Verlag, Munich Wall, L., Christiansen, T., and Orwant, J (2000) Programming Perl O’Reilly, Sebastopol, California, 3rd edition References 513 Warren, D (1983) An abstract Prolog instruction set Technical Note 309, SRI International, Menlo Park Weizenbaum, J (1966) ELIZA – a computer program for the study of natural language communication between man and machine Communications of the ACM, 9(1):36–45 Wilks, Y and Stevenson, M (1997) Sense tagging: Semantic tagging with a lexicon In Tagging Text with Lexical Semantics: Why, What, and How? Proceedings of the Workshop, pages 74–78, Washington, DC ACL SIGLEX Wilks, Y A., Slator, B M., and Guthrie, L M (1996) Electric Words Dictionaries, Computers, and Meanings MIT Press, Cambridge, Massachusetts Yarowsky, D (1995) Unsupervised word sense disambiguation rivaling supervised methods In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pages 189–196, Cambridge, Massachusetts Yarowsky, D (1996) Homograph disambiguation in speech synthesis In van Santen, J., Sproat, R., Olive, J P., and Hirschberg, J., editors, Progress in Speech Synthesis, pages 159–175, Berlin Heidelberg New York Springer Verlag Zampolli, A (2003) Past and on-going trends in computational linguistics The ELRA Newsletter, 8(3-4):6–16 Zelle, J M and Mooney, R J (1997) An inductive logic programming method for corpus-based parser construction Technical note, University of Texas at Austin http://www.cs.utexas.edu/users/ml/publication/ilp.html Cited 28 October 2005 Cognitive Technologies T Frühwirth, S Abdennadher: Essentials of Constraint Programming IX, 144 pages 2003 J W Lloyd: Logic for Learning Learning Comprehensive Theories from Structured Data X, 256 pages 2003 S K Pal, L Polkowski, A Skowron (Eds.): Rough-Neural Computing Techniques for Computing with Words XXV, 734 pages 2004 H Prendinger, M Ishizuka (Eds.): Life-Like Characters Tools, Affective Functions, and Applications IX, 477 pages 2004 H Helbig: Knowledge Representation and the Semantics of Natural Language XVIII, 646 pages 2006 P M Nugues: An Introduction to Language Processing with Perl and Prolog An Outline of Theories, Implementation, and Application with Special Consideration of English, French, and German XX, 513 pages 2006 ... M Nugues An Introduction to Language Processing with Perl and Prolog An Outline of Theories, Implementation, and Application with Special Consideration of English, French, and German With 153... Language Processing (Manning and Schütze 1999), Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition (Jurafsky and Martin... computer to understand texts or speech and to interact accordingly with human beings Understanding or translating texts automatically and talking to an artificial conversational assistant are major

Ngày đăng: 05/09/2020, 11:43