Ripple Down Rules for Question Analysis

Nguyễn Quốc Đạt
University of Engineering and Technology (Trường Đại học Công nghệ)
Master's thesis, major: Computer Science; Code: 60 48 01
Supervisor: Dr. Phạm Bảo Sơn
Year of defense: 2011

Keywords: Computer science; Question answering systems; Question analysis; Language processing

Content

Table of Contents

1 Introduction
2 Literature review
  2.1 Question analysis in question answering systems
    2.1.1 Question classification
    2.1.2 Pattern-matching based analysis
    2.1.3 Syntactic-based analysis
    2.1.4 Semantic-based analysis
    2.1.5 Annotation-based question analysis in question answering systems
  2.2 GATE
    2.2.1 Information Extraction in GATE
    2.2.2 JAPE
  2.3 Single Classification Ripple Down Rules
3 Our Question Answering System Architecture
  3.1 Introduction
  3.2 Preprocessing module
  3.3 Syntactic analysis module
    3.3.1 Noun phrases detection
    3.3.2 Question-phrases detection
    3.3.3 Relations detection
  3.4 Semantic analysis module
  3.5 Answer retrieval component
4 Systematic Knowledge Acquisition for Question Analysis
  4.1 Recall Intermediate Representation of an input question
  4.2 Rule language
  4.3 Knowledge Acquisition Process
5 Evaluation
  5.1 Question Analysis for Vietnamese
  5.2 Question Analysis for English
6 Conclusion
A Definitions of question-class types
B Definitions of question-structures
C Intermediate Representation Elements of English questions
D Embedding Java code in JAPE