Current research directions are looking at Data Mining (DM) and Knowledge Management (KM) as complementary and interrelated felds, aimed at supporting, with algorithms and tools, the lifecycle of knowledge, including its discovery, formalization, retrieval, reuse, and update. While DM focuses on
Data Mining and Medical Knowledge Management: Cases and Applications Petr Berka University of Economics, Prague, Czech Republic Jan Rauch University of Economics, Prague, Czech Republic Djamel Abdelkader Zighed University of Lumiere Lyon 2, France Medical Information science reference Hershey • New York Director of Editorial Content: Managing Editor: Assistant Managing Editor: Typesetter: Cover Design: Printed at: Kristin Klinger Jamie Snavely Carole Coulson Sean Woznicki Lisa Tosheff Yurchak Printing Inc Published in the United States of America by Information Science Reference (an imprint of IGI Global) 701 E Chocolate Avenue, Suite 200 Hershey PA 17033 Tel: 717-533-8845 Fax: 717-533-8661 E-mail: cust@igi-global.com Web site: http://www.igi-global.com/reference and in the United Kingdom by Information Science Reference (an imprint of IGI Global) Henrietta Street Covent Garden London WC2E 8LU Tel: 44 20 7240 0856 Fax: 44 20 7379 0609 Web site: http://www.eurospanbookstore.com Copyright © 2009 by IGI Global All rights reserved No part of this publication may be reproduced, stored or distributed in any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher Product or company names used in this set are for identification purposes only Inclusion of the names of the products or companies does not indicate a claim of ownership by IGI Global of the trademark or registered trademark Library of Congress Cataloging-in-Publication Data Data mining and medical knowledge management : cases and applications / Petr Berka, Jan Rauch, and Djamel Abdelkader Zighed, editors p ; cm Includes bibliographical references and index Summary: "This book presents 20 case studies on applications of various modern data mining methods in several important areas of medicine, covering classical data mining methods, elaborated approaches related to mining in EEG and ECG data, and methods related to mining in genetic data" Provided by publisher ISBN 978-1-60566-218-3 (hardcover) Medicine Data processing Case studies Data mining Case studies I Berka, Petr II Rauch, Jan III Zighed, Djamel A., 1955[DNLM: Medical Informatics methods Case Reports Computational Biology methods Case Reports Information Storage and Retrieval methods Case Reports Risk Assessment Case Reports W 26.5 D2314 2009] R858.D33 2009 610.0285 dc22 2008028366 British Cataloguing in Publication Data A Cataloguing in Publication record for this book is available from the British Library All work contributed to this book is new, previously-unpublished material The views expressed in this book are those of the authors, but not necessarily of the publisher If a library purchased a print copy of this publication, please go to http://www.igi-global.com/agreement for information on activating the library's complimentary electronic access to this publication Editorial Advisory Board Riccardo Bellazzi, University of Pavia, Italy Radim Jiroušek, Academy of Sciences, Prague, Czech Republic Katharina Morik, University of Dortmund, Germany Ján Paralič, Technical University, Košice, Slovak Republic Luis Torgo, LIAAD-INESC Porto LA, Portugal Blaž Župan, University of Ljubljana, Slovenia List of Reviewers Ricardo Bellazzi, University of Pavia, Italy Petr Berka, University of Economics, Prague, Czech Republic Bruno Crémilleux, University Caen, France Peter Eklund, Umeå University, Umeå, Sveden Radim Jiroušek, Academy of Sciences, Prague, Czech Republic Jiří Kléma, Czech Technical University, Prague, Czech Republic Mila Kwiatkovska, Thompson Rivers University, Kamloops, Canada Martin Labský, University of Economics, Prague, Czech Republic Lenka Lhotská, Czech Technical University, Prague, Czech Republic Ján Paralić, Technical University, Kosice, Slovak Republic Vincent Pisetta, University Lyon 2, France Simon Marcellin, University Lyon 2, France Jan Rauch, University of Economics, Prague, Czech Republic Marisa Sánchez, National University, Bahía Blanca, Argentina Ahmed-El Sayed, University Lyon 2, France Olga Štěpánková, Czech Technical University, Prague, Czech Republic Vojtěch Svátek, University of Economics, Prague, Czech Republic Arnošt Veselý, Czech University of Life Sciences, Prague, Czech Republic Djamel Zighed, University Lyon 2, France Table of Contents Foreword xiv Preface xix Acknowledgment xxiii Section I Theoretical Aspects Chapter I Data, Information and Knowledge Jana Zvárová, Institute of Computer Science of the Academy of Sciences of the Czech Republic v.v.i., Czech Republic; Center of Biomedical Informatics, Czech Republic Arnošt Veselý, Institute of Computer Science of the Academy of Sciences of the Czech Republic v.v.i., Czech Republic; Czech University of Life Sciences, Czech Republic Igor Vajda, Institutes of Computer Science and Information Theory and Automation of the Academy of Sciences of the Czech Republic v.v.i., Czech Republic Chapter II Ontologies in the Health Field 37 Michel Simonet, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France Radja Messai, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France Gayo Diallo, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France Ana Simonet, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France Chapter III Cost-Sensitive Learning in Medicine 57 Alberto Freitas, University of Porto, Portugal; CINTESIS, Portugal Pavel Brazdil, LIAAD - INESC Porto L.A., Portugal; University of Porto, Portugal Altamiro Costa-Pereira, University of Porto, Portugal; CINTESIS, Portugal Chapter IV Classification and Prediction with Neural Networks 76 Arnošt Veselý, Czech University of Life Sciences, Czech Republic Chapter V Preprocessing Perceptrons and Multivariate Decision Limits 108 Patrik Eklund, Umeå University, Sweden Lena Kallin Westin, Umeå University, Sweden Section II General Applications Chapter VI Image Registration for Biomedical Information Integration 122 Xiu Ying Wang, BMIT Research Group, The University of Sydney, Australia Dagan Feng, BMIT Research Group, The University of Sydney, Australia; Hong Kong Polytechnic University, Hong Kong Chapter VII ECG Processing 137 Lenka Lhotská, Czech Technical University in Prague, Czech Republic Václav Chudáček, Czech Technical University in Prague, Czech Republic Michal Huptych, Czech Technical University in Prague, Czech Republic Chapter VIII EEG Data Mining Using PCA 161 Lenka Lhotská, Czech Technical University in Prague, Czech Republic Vladimír Krajča, Faculty Hospital Na Bulovce, Czech Republic Jitka Mohylová, Technical University Ostrava, Czech Republic Svojmil Petránek, Faculty Hospital Na Bulovce, Czech Republic Václav Gerla, Czech Technical University in Prague, Czech Republic Chapter IX Generating and Verifying Risk Prediction Models Using Data Mining 181 Darryl N Davis, University of Hull, UK Thuy T.T Nguyen, University of Hull, UK Chapter X Management of Medical Website Quality Labels via Web Mining 206 Vangelis Karkaletsis, National Center of Scienti.c Research “Demokritos”, Greece Konstantinos Stamatakis, National Center of Scientific Research “Demokritos”, Greece Pythagoras Karampiperis, National Center of Scientific Research “Demokritos”, Greece Martin Labský, University of Economics, Prague, Czech Republic Marek Růžička, University of Economics, Prague, Czech Republic Vojtěch Svátek, University of Economics, Prague, Czech Republic Enrique Amigó Cabrera, ETSI Informática, UNED, Spain Matti Pưllä, Helsinki University of Technology, Finland Miquel Angel Mayer, Medical Association of Barcelona (COMB), Spain Dagmar Villarroel Gonzales, Agency for Quality in Medicine (AquMed), Germany Chapter XI Two Case-Based Systems for Explaining Exceptions in Medicine 227 Rainer Schmidt, University of Rostock, Germany Section III Speci.c Cases Chapter XII Discovering Knowledge from Local Patterns in SAGE Data 251 Bruno Crémilleux, Université de Caen, France Arnaud Soulet, Université Franỗois Rabelais de Tours, France Jiớ Klộma, Czech Technical University, in Prague, Czech Republic Céline Hébert, Université de Caen, France Olivier Gandrillon, Université de Lyon, France Chapter XIII Gene Expression Mining Guided by Background Knowledge 268 Jiří Kléma, Czech Technical University in Prague, Czech Republic Filip Železný, Czech Technical University in Prague, Czech Republic Igor Trajkovski, Jožef Stefan Institute, Slovenia Filip Karel, Czech Technical University in Prague, Czech Republic Bruno Crémilleux, Université de Caen, France Jakub Tolar, University of Minnesota, USA Chapter XIV Mining Tinnitus Database for Knowledge 293 Pamela L Thompson, University of North Carolina at Charlotte, USA Xin Zhang, University of North Carolina at Pembroke, USA Wenxin Jiang, University of North Carolina at Charlotte, USA Zbigniew W Ras, University of North Carolina at Charlotte, USA Pawel Jastreboff, Emory University School of Medicine, USA Chapter XV Gaussian-Stacking Multiclassifiers for Human Embryo Selection 307 Dinora A Morales, University of the Basque Country, Spain Endika Bengoetxea, University of the Basque Country, Spain Pedro Larrañaga, Universidad Politécnica de Madrid, Spain Chapter XVI Mining Tuberculosis Data 332 Marisa A Sánchez, Universidad Nacional del Sur, Argentina Sonia Uremovich, Universidad Nacional del Sur, Argentina Pablo Acrogliano, Hospital Interzonal Dr José Penna, Argentina Chapter XVII Knowledge-Based Induction of Clinical Prediction Rules 350 Mila Kwiatkowska, Thompson Rivers University, Canada M Stella Atkins, Simon Fraser University, Canada Les Matthews, Thompson Rivers University, Canada Najib T Ayas, University of British Columbia, Canada C Frank Ryan, University of British Columbia, Canada Chapter XVIII Data Mining in Atherosclerosis Risk Factor Data 376 Petr Berka, University of Economics, Prague, Czech Republic; Academy of Sciences of the Czech Republic, Prague, Czech Republic Jan Rauch, University of Economics, Praague, Czech Republic; Academy of Sciences of the Czech Republic, Prague, Czech Republic Marie Tomečková, Academy of Sciences of the Czech Republic, Prague, Czech Republic Compilation of References 398 About the Contributors 426 Index 437 Detailed Table of Contents Foreword xiv Preface xix Acknowledgment xxiii Section I Theoretical Aspects This section provides a theoretical and methodological background for the remaining parts of the book It defines and explains basic notions of data mining and knowledge management, and discusses some general methods Chapter I Data, Information and Knowledge Jana Zvárová, Institute of Computer Science of the Academy of Sciences of the Czech Republic v.v.i., Czech Republic; Center of Biomedical Informatics, Czech Republic Arnošt Veselý, Institute of Computer Science of the Academy of Sciences of the Czech Republic v.v.i., Czech Republic; Czech University of Life Sciences, Czech Republic Igor Vajda, Institutes of Computer Science and Information Theory and Automation of the Academy of Sciences of the Czech Republic v.v.i., Czech Republic This chapter introduces the basic concepts of medical informatics: data, information, and knowledge It shows how these concepts are interrelated and can be used for decision support in medicine All discussed approaches are illustrated on one simple medical example Chapter II Ontologies in the Health Field 37 Michel Simonet, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France Radja Messai, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France Gayo Diallo, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France Ana Simonet, Laboratoire TIMC-IMAG, Institut de l’Ingénierie et de l’Information de Santé, France This chapter introduces the basic notions of ontologies, presents a survey of their use in medicine, and explores some related issues: knowledge bases, terminology, information retrieval It also addresses the issues of ontology design, ontology representation, and the possible interaction between data mining and ontologies Chapter III Cost-Sensitive Learning in Medicine 57 Alberto Freitas, University of Porto, Portugal; CINTESIS, Portugal Pavel Brazdil, LIAAD - INESC Porto L.A., Portugal; University of Porto, Portugal Altamiro Costa-Pereira, University of Porto, Portugal; CINTESIS, Portugal Health managers and clinicians often need models that try to minimize several types of costs associated with healthcare, including attribute costs (e.g the cost of a specific diagnostic test) and misclassification costs (e.g the cost of a false negative test) This chapter presents some concepts related to cost-sensitive learning and cost-sensitive classification in medicine and reviews research in this area Chapter IV Classification and Prediction with Neural Networks 76 Arnošt Veselý, Czech University of Life Sciences, Czech Republic This chapter describes the theoretical background of artificial neural networks (architectures, methods of learning) and shows how these networks can be used in medical domain to solve various classification and regression problems Chapter V Preprocessing Perceptrons and Multivariate Decision Limits 108 Patrik Eklund, Umeå University, Sweden Lena Kallin Westin, Umeå University, Sweden This chapter introduces classification networks composed of preprocessing layers and classification networks, and compares them with “classical” multilayer percpetrons on three medical case studies Section II General Applications This section presents work that is general in the sense of a variety of methods or variety of problems described in each of the chapters Chapter VI Image Registration for Biomedical Information Integration 122 Xiu Ying Wang, BMIT Research Group, The University of Sydney, Australia Dagan Feng, BMIT Research Group, The University of Sydney, Australia; Hong Kong Polytechnic University, Hong Kong 426 About the Contributors Petr Berka is a full professor at the Dept of Information and Knowledge Engineering, University of Economics and also works in the Centre of Biomedical Informatics, Institute of Computer Science, Academy of Sciences of the Czech Republic His main research interests are machine learning, data mining and knowledge-based systems Jan Rauch is an associate professor at the Dept of Information and Knowledge Engineering, University of Economics and also works in the Centre of Biomedical Informatics, Institute of Computer Science, Academy of Sciences of the Czech Republic His main research interest is data mining Djamel Abdelkader Zighed received his Master’s in computer and automatic science in 1982 and his PhD in computer science in 1985 – both from university Lyon He was a assistant professor at University Lyon in 1984-1987 In 1987, he joined the University Lyon where he worked as lecturer (1987-1991), professor (1991-2000) and 1st class professor (2000 - ) He is interested in data mining (including mining complex data), machine learning and knowledge engineering He is the founder and was director (1995-2002) of the ERIC laboratory (laboratory for development of methods and software for knowledge engineering, and more specifically automatic knowledge discovery in databases) *** Pablo Luis Acrogliano received the degree of doctor in medicine from Universidad de Buenos Aires He is a staff at the Neumology Service Dr Raul H Catá at Hospital Dr José Penna in Bahía Blanca, Argentina He is working as a doctor specialized in Neumology, as regional supervisor of the Regional Tuberculosis Control Program, and as teaching instructor of epidemiology in Dirección de Capacitación de Profesionales de la Salud in Buenos Aires province M Stella Atkins received the BSc degree in chemistry from Nottingham University in 1966 and the PhD degree in computer science from the University of British Columbia in 1985 She is a professor in the School of Computing Science at Simon Fraser University (SFU), and director of the Medical Computing Lab at SFU Her research interests include medical image display and analysis, telehealth, and computers in medicine This includes medical image display, denoising and enhancement, segmentation, image registration, and radiology workstation design Additionally, she is interested in the use of eyegaze trackers, which provide a new imaging modality for seeing inside the brain Copyright © 2009, IGI Global, distributing in print or electronic forms without written permission of IGI Global is prohibited About the Contributors Najib T Ayas is an associate professor of medicine, University of British Columbia His research is focused on the health, economic, and safety effects of sleep deprivation and sleep apnea He is author or co-author of over 40 peer-reviewed publications, and his work has been published in a number of high impact journals including the New England Journal of Medicine, JAMA, and Annals of Internal Medicine Endika Bengoetxea received his B.Sc in Computer Studies from the University of the Basque in 1994, his M.Sc in Medical Imaging (Medical Physics) from the University of Aberdeen in 1999, and his Ph.D in Signal and Image Treatment from the French Ecole Nationale Supérieure des Télécommunications in 2002 He joined the University of the Basque Country in 1996 where he is currently professor of the Computer Engineering Faculty, and member of the research group Intelligent Systems Group His research interests are on the application of Bayesian networks for optimization or classification problems applied to biomedical problems Pavel Brazdil got his PhD degree from the University of Edinburgh in 1981 Since 1998 he is full professor at Faculty of Economics Currently he is the coordinator of R&D Unit LIAAD (earlier known as group NIAAD of LIAAC) founded in 1988 Pavel Brazdil is known for his activities in machine learning, data mining, metalearning, economic modeling, text mining and applications in medicine and bioinformatics He has participated in two international projects, was a technical coordinator of one of them (METAL) and participated in various international research networks He has organized more than 10 international conferences or workshops and participated in many program committees (including 14 different editions of ECML) He has (co-)edited books published by major publisher and published more than 120 articles Enrique Amigó Cabrera, PhD in computer science (UNED,/Universidad Nacional de Educación a Distancia/, 2006), BsD in computer science (UPM, Universidad Politécnica de Madrid, 2000) /Teacher assistant in the department LSI of UNED His research is focused on information synthesis tasks and the evaluation of summarization, translation and clustering systems His work has been also focused on interactive information access systems He has participated in several national and international research projects (HERMES, MEDIEQ, QUEAVIS) Václav Chudáček received his Master’s degree in biomedical engineering at the Faculty of Electrical Engineering of the Czech Technical University in Prague, Czech Republic in 2003 He is currently working towards the PhD degree in Biomedical Signal and Data Processing group (BioDat) at the Department of Cybernetics, Faculty of Electrical Engineering of the Czech Technical University in Prague His main research interests are in applications of artificial intelligence methods for analysis of ECG and CTG signals Altamiro Costa-Pereira got the PhD degree from the University of Dundee in 1993 Currently he is full professor at FMUP and director of the Department of Biostatistics and Medical Informatics He is also coordinator of the research unit CINTESIS and director of two master programs (Master’s in medical informatics, Master’s in evidence and decision making in healthcare) and one PhD program (doctoral program in clinical and health services research) Since 1999, he has been an independent expert, acting as evaluator, at the European Commission He is author of more than two hundred publications, including fifty indexed by the ISI, which attracted more the 300 citations 427 About the Contributors Bruno Crémilleux is professor in computer science at GREYC laboratory (CNRS UMR 6072), University of Caen, France He is currently heading the data mining research group He received his PhD in 1991 from the University of Grenoble, France and a habilitation degree in 2004 from the University of Caen His current research interests include knowledge discovery in databases, machine learning, text mining and their applications notably in bioinformatics and medicine Darryl N Davis, director of research candidate in the Department of Computer Science, the University of Hull, UK His PhD came from the Faculty of Medicine, University of Manchester He has worked in data mining and intelligent systems in medicine since 1988 Medical domains include urology, orthodontics, cervical cancer, neuroanatomy, colorectal cancer and cardiovascular medicine Gayo Diallo is member of the City eHealth Research Centre at City University of London, where he is working on the EU funded project SeaLife (Semantic Web Browser for the Life Science) Prior to joining City University, Gayo Diallo completed in 2006 his PhD thesis in computer science at University of Joseph Fourier Grenoble within the TIMC-IMAG Laboratory (Osiris Group) He was also a part time lecturer at University Pierre Mendes-France Grenoble from 2001 to 2006 His research interests include DB&IR integration, Semantic Web technologies and ontologies, knowledge representation and reasoning, information contextualisation and adaptation Patrik Eklund, professor in computer science, is involved in research both on the theoretical as well the applied side Research is carried in algebra and logic for computer science application, intelligent computing and medical informatics Medical and health informatics research involves information infrastructures and platforms, including various patient record systems and repositories The overall scope and motivation is to enhance methodological issues to support effective and efficient development of guidelines and workflows Dagan Feng received his ME in electrical engineering & computing science (EECS) from Shanghai Jiao Tong University in 1982, MSc and PhD in computer science / biocybernetics from the University of California, Los Angeles (UCLA) in 1985 and 1988 respectively He joined the University of Sydney at the end of 1988, as lecturer, senior lecturer, reader, professor and head of Department of Computer Science / School of Information Technologies He is currently associate dean of Faculty of Science at the University of Sydney, chair-professor of information technology, Hong Kong Polytechnic University and advisory or guest professor at several universities in China His research area is biomedical & multimedia information technology (BMIT) He is the Founder and Director the BMIT Research Group He has published over 500 scholarly research papers, pioneered several new research directions, and received the Crump Prize for Excellence in Medical Engineering from UCLA He is a fellow of Australian Academy of Technological Sciences and Engineering, ACS, HKIE, IET, and IEEE, special area editor of IEEE Transactions on Information Technology in Biomedicine, and is the current chairman of International Federation of Automatic Control (IFAC) Technical Committee on Biological and Medical Systems Alberto Freitas received the MSc and PhD degrees from the University of Porto (Portugal) in 1998 and 2007, respectively He is member of the Faculty of Medicine of University of Porto (FMUP), in the Department of Biostatistics and Medical Informatics He is also member of CINTESIS (Centre for 428 About the Contributors Research in Health Technologies and Information Systems), a research unit recognized and supported by the Portuguese Foundation for Science and Technology (FCT) [POCTI/0753/2004] and hosted by FMUP His research interests include health information systems and data mining Olivier Gandrillon is a researcher at the Centre National de La Recherche Scientifique (CNRS) He is heading a research group in Lyon that is interested in defining the molecular basis for self-renewal, as well as pathological modifications of this process For this, the group used the SAGE technique extensively He has been involved for years in transdisciplinary efforts together with computer scientists to make sense of the huge amount of data generated by this transcriptomic approach Václav Gerla received his Master’s degree in biomedical engineering at the Faculty of Electrical Engineering of the Czech Technical University in Prague, Czech Republic in 2005 He is currently working toward his PhD degree in artificial intelligence and biocybernetics at the Czech Technical University His research interests are in biomedical signal processing, pattern recognition and biomedical informatics He is working on automated classification of sleep and the EEG signals of newborns Dagmar Villarroel Gonzales is a medical doctor from the Universidad del Rosario in Bogota, Colombia; in October 2005 received her Master’s in public health from the Hannover Medical School She joined the German Agency for Quality in Medicine (Joint Institution of The German Medical Association and the National Association of the Statutory Health Insurance Physicians) in 2004, where she works on appraisal of medical information for patients in internet and development of medical guidelines Céline Hébert received her PhD in data mining in 2007 from the University of Caen, France Her fields of interest include data mining (especially pattern discovery and interestingness measures) and hypergraph theory Michal Huptych received his Master’s degree in biomedical engineering at the Faculty of Electrical Engineering of the Czech Technical University in Prague, Czech Republic in 2005 He is currently working towards the PhD degree in Biomedical Signal and Data Processing group (BioDat) at the Department of Cybernetics, Faculty of Electrical Engineering of the Czech Technical University in Prague His main research interests are in applications of signal processing and artificial intelligence methods for biomedical signals analysis Pawel Jastreboff is a professor and director of Emory Tinnitus & Hyperacusis Center at the Emory University School of Medicine since January 1999 His previous affiliations include University of Maryland and Yale University He received his PhD and DSc degrees in neuroscience from the Polish Academy of Science He did his postdoctoral training at the University of Tokyo, Japan Dr Jastreboff holds visiting professor appointment at Yale University School of Medicine, the University College of London, and Middlesex Hospital in London, England In 1988, on the basis of his research on the physiological mechanisms of tinnitus he proposed a neuro-physiological model of tinnitus and Tinnitus Retraining Therapy (TRT) He also created an animal model of tinnitus, and his present research is aimed at delineating the mechanisms of tinnitus and designing new methods of tinnitus and hyperacusis alleviation 429 About the Contributors Wenxin Jiang is a PhD student in the Computer Science Department at the University of North Carolina at Charlotte His research interest includes knowledge discovery and data mining, music information retrieval, and flexible query answering systems He is an author of research papers, including one journal publication His research is supported by NSF Lena Kallin-Westin, senior lecturer in computer science, carried out her PhD studies within the field of intelligent computing and medical informatics The special area of interest has been the preprocessing perceptron and its applications In recent years, her focus of research interest has moved from intelligent computing to another field concerning learning and knowledge representation; computer science didactics Within that area, she is currently working with assessment and quality measurements of examination Pythagoras Karampiperis holds a Diploma (2000) and MSc on electronics and computer engineering (2002) and an MSc on operational research (2002), all from the Technical University of Crete, Greece His main scientific interests are in the areas of technology-enhanced learning, next generation service-based learning systems and Semantic Web technologies He is the co-author of more than 45 publications in scientific books, journals and conferences with at least 50 citations Filip Karel is a PhD student at the Department of Cybernetics, CTU Currently he is working on his doctoral thesis focused on quantitative association rule mining His main research interests are association rules, medical data mining and usage of e-learning Vangelis Karkaletsis is a research director at NCSR “Demokritos” He holds a PhD in artificial intelligence from the University of Athens He has substantial experience in the field of language and knowledge engineering, especially Web content analysis, ontology evolution, adaptive information filtering and extraction He is currently coordinating the EC-funded project MedIEQ on quality labeling of health related web content He is also technical manager of the EC-funded project QUATRO Plus on web content labeling and social networking, and leads the text information extraction and ontology evolution tasks in the BOEMIE project He is the local chair of the European Conference on Computational Linguistics (EACL-09) and vice chair of the Hellenic Artificial Intelligence Society (EETN) Jiří Kléma is an assistant professor in artificial intelligence (AI) at the Department of Cybernetics, Czech Technical University in Prague (CTU) He received his PhD in AI and biocybernetics from CTU and carried out post-doctoral training at the University of Caen, France His main research interest is data mining and its applications in industry, medicine and bioinformatics Vladimír Krajča received his Master’s degree in control engineering at the Faculty of Electrical Engineering of the Czech Technical University in Prague, Czech Republic in 1979, and his PhD degree in biomedical engineering at the Czech Technical University in Prague in 1985 He is the head of the Electrophysiological Laboratories at the Department of Neurology, Faculty Hospital Na Bulovce in Prague His research interests are: digital signal processing, adaptive segmentation of stochastic signals, cluster analysis, pattern recognition and artificial intelligence, fuzzy sets and neural networks for signal processing, principal component analysis, independent component analysis 430 About the Contributors Mila Kwiatkowska received the MA degree in interdisciplinary studies of Polish philology and informatics from the University of Wroclaw in 1979, the MSc degree in computing science from the University of Alberta in 1991, and the PhD degree in computing science from Simon Fraser University in 2006 She is currently an assistant professor in the Computing Science Department at Thompson Rivers University (TRU) Her research interests include medical data mining, medical decision systems, clinical prediction rules, knowledge representation, fuzzy logic, semiotics, and case-based reasoning Martin Labský received his MSc in information technologies from the University of Economics in Prague (2002) Currently he is a PhD student at the Department of Knowledge Engineering at the UEP, with statistical methods for information extraction as thesis topic His research interests include machine learning methods, knowledge representation and natural language processing (Co-)author of more than 10 reviewed papers Pedro Larrañaga received his MSc degree in mathematics from the University of Valladolid, in Spain, in 1981, and his PhD in computer science from the University of the Basque Country, in Spain, in 1995 He is professor at the Department of Artificial Intelligence of the Technical University of Madrid His research topics include Bayesian networks and evolutionary computation with applications in medicine and bioinformatics Lenka Lhotská received her Master’s degree in cybernetics at the Faculty of Electrical Engineering of the Czech Technical University in Prague, Czech Republic in 1984, and her PhD degree in cybernetics at the Czech Technical University in Prague in 1989 Currently she is head of the biomedical signal and data processing group (BioDat) at the Department of Cybernetics, Faculty of Electrical Engineering of the Czech Technical University in Prague Her research interests are in artificial intelligence methods and their applications in medicine Les Matthews is a respiratory therapist with a Diploma in Adult Education from University of British Columbia and a Master’s degree in education from Gonzaga University With 30 years of experience in education and clinical practice he is an assistant professor of respiratory therapy and the supervisor of the Obstructive Sleep Apnea clinic at Thompson Rivers University His research interests include ambulatory monitoring, auto titration CPAP, patient mentoring and clinical simulation Miquel Angel Mayer MD and PhD in health and life sciences is a Specialist of Family and Community Medicine He is the Director of Web Médica Acreditada (WMA) of the Medical Association of Barcelona, the first Spanish trust mark and e-health initiative He participated in several international semantic web projects as MedCIRCLE and QUATRO and currently he is involved in the EU project MedIEQ and QUATRO Plus for the implementation of semantic web vocabularies and technologies helping users to get the best health information on the Internet Radja Messai was trained in computer science (bachelor of science from the University of Batna - Algeria in 2003) and is currently a PhD student at the UJF University in Grenoble She is working on health-related ontology and terminology services for patients and citizens 431 About the Contributors Jitka Mohylová received her Master’s degree in radioelectronics, specialization medical electronics at the Faculty of Electrical Engineering of the Brno University of Technology, Czech Republic in 1983, and her PhD degree in cybernetics at the Czech Technical University in Prague in 1999 Since 1990 she has been with the Department of General Electrical Engineering, Faculty of Electrical Engineering and Computer Science of the VŠB – Technical University of Ostrava Currently she is also lecturing at the University of Zilina, Slovakia Her research interests are in signal processing and electronics and their applications in medicine Dinora A Morales received her BA and MSc degree in computer science from Technological Institute of Querétaro, México in 1999, and MSc in neurobiology from Institute of Neurobiology, National University of México in 2003 Since October, 2005 she is a member of Intelligent Systems Group of the University of the Basque Country where she has been working for her PhD She is interested in learning from data within machine learning, artificial intelligence, and statistics In particular, she is interested in learning probabilistic graphical models such as Bayesian networks Actually her work concentrates in developing algorithms for supervised classification from human embryo selection in assisted reproduction techniques data Thuy T.T Nguyen received the BSc degree in mathematics from Hanoi University of Pedagogy in 1993 She received the MSc in computer science from Hanoi Nation University in Vietnam in 1999 She is now a PhD candidate in the Department of Computer Science, the University of Hull, UK Her project is on the use of pattern recognition and data mining techniques to predict cardiovascular risk She has published four papers from this research with specific focus on the use of neural nets and unsupervised clustering techniques Svojmil Petránek received his Master’s degree in radioelectronics, specialization Medical Electronics at the Faculty of Electrical Engineering of the Czech Technical University in Prague, Czech Republic in 1971, MD at the School of Medicine, Charles University, Prague, Czech Republic in 1977 and his PhD degree in medicine at Charles University in Prague in 1983 He is the head of the Department of Neurology, Faculty Hospital Na Bulovce in Prague His research interests are: neurophysiology, EEG processing, automated classification of EEG Matti Pöllä received his MSc(Tech) degree in electrical and communications engineering in 2005 from Helsinki University of Technology (TKK), Finland Presently he works as a researcher at the Adaptive Informatics Research Centre at TKK His current research interests include text data mining using bio-inspired algorithms Zbigniew W Ras is a professor of computer science in the College of Computing and Informatics at the University of North Carolina, Charlotte Also, he is a professor and member of the senate at the Polish-Japanese Institute of Information Technology in Warsaw, Poland His previous affiliations include: University of Florida (Gainesville), Columbia University (New York), University of Tennessee (Knoxville), and Lockheed Research Lab (Palo Alto) He received his PhD degree from the University of Warsaw (Poland) and DSc degree (habilitation) from the Polish Academy of Sciences He is the editor-in-chief of the Journal of Intelligent Information Systems (Springer) and the deputy editor-in-chief 432 About the Contributors of Fundamenta Informaticae Journal He is the editor/co-editor of 32 books and the author of more than 200 papers all in the area of intelligent information systems Marek Růžička is part-time software engineer at the Department of Information and Knowledge Engineering, University of Economics, Prague He has long-term experience in the development of software tools for semantic annotation, knowledge management and information search, in the framework of projects undertaken at University of Economics, Prague (MGT, M-CAST, MedIEQ) as well as at INRIA - Sophia-Antipolis, France (LifeLine, Corese) C Frank Ryan graduated from the National University of Ireland (Galway) in 1979 He completed his post-graduate clinical training in Internal Medicine and Respiratory Medicine in Dublin at the Royal College of Surgeons in Ireland After pursuing a post-doctoral research fellowship in respiratory sleep disorders at the University of British Columbia in Vancouver, he joined the Faculty of Medicine at UBC and the active staff at Vancouver Coastal Health in 1990 He is a professor of medicine at UBC and a consultant in internal medicine, respiratory medicine and sleep disorders medicine at Vancouver Coastal Health Dr Ryan’s clinical and research interests include respiratory sleep and neuromuscular diseases, and pleural diseases Marisa A Sánchez received a PhD in computer science from Universidad Nacional del Sur in Bahía Blanca, Argentina She is currently a professor of graduate and post-graduate courses at Department of Management Sciences at Universidad Nacional del Sur From 1994 to 2004 she had been on the faculty at Departments of Computer Science and Electrical Engineering She has been a fellow at the International Institute for Software Technologies of the United Nations University, Macao, and at Politécnico de Milano, Italy Her research interests include data warehousing, knowledge discovery, and data mining She is also interested in business process modeling Rainer Schmidt studied social science and computer science (both with MSc) and received a PhD in medical informatics After working on explanation-based learning (EBL) at the University of Dortmund, in 1993 he moved to Munich, where he started working on case-based reasoning for medical applications Since 1996 he is working at the University of Rostock still in the same research area – only interrupted by working at the Robert Gordon University in Aberdeen, Scotland, in 1998 Arnaud Soulet is an associate professor in computer science at LI laboratory, University of Tours Franỗois Rabelais, France He received his PhD in 2006 from the University of Caen, France His main research interests include data mining and machine learning (especially applied in bioinformatics and medicine) Ana Simonet is an assistant professor in computer science at the University Pierre Mendès-France in Grenoble Both her teaching and her research work are oriented towards conceptual modeling She has developed a novel approach to database design, based on an initial ontological schema that is automatically transformed into a database schema optimized for a given set of queries She is an active member of the TIMC group working on data and knowledge modeling and representation Besides her interest in the design of information systems she is also involved in the integration of heterogeneous databases, following an ontology-based approach 433 About the Contributors Michel Simonet is the head of the knowledge base and database team of the TIMC laboratory at the Joseph Fourier University of Grenoble His group works on the design and the implementation of knowledge bases and databases, using an ontology-based approach, and currently on the integration of Information Systems by using the tools and methodologies they have developed They work on two main projects: a database and knowledge base management system, named OSIRIS, and a system for database conception and reverse engineering based on original concepts and a new methodology In the recent years Michel Simonet has managed the European ASIA-ITC GENNERE with China and has been responsible of ontology enrichment in the European IP project Noesis, a platform for wide-scale integration and visual representation of medical intelligence Konstantinos Stamatakis has experience in the area of information discovery and extraction from online resources Being a research associate in NCSR “Demokritos” during the last years he has worked in several national and European RTD projects and has developed methods and tools for Web crawling and spidering and an information discovery platform incorporating several applications involved in information identification and extraction from the Web (ontologies, formation of corpora, training, crawling, spidering, extraction of information, data storage and representation) He is currently involved in the MedIEQ project being responsible for the web content collection tools to be developed in the project Vojtěch Svátek is associate professor at the Department of Information and Knowledge Engineering, University of Economics, Prague His main areas of research are knowledge modeling and knowledge discovery from databases and texts Local contact person of EU projects K-Space and MedIEQ, organizer of numerous events (e.g Co-Chair of the EKAW conference in 2006), author of about 80 papers Pamela L Thompson is a doctoral student in the department of computer science at the University of North Carolina at Charlotte She received her undergraduate degree in management information systems and her MBA from James Madison University in Harrisonburg, Virginia She is an associate professor at Catawba College in Salisbury, NC Her research is focusing on the development of a decision support system for diagnosis and treatment of tinnitus Jakub Tolar received his MD from Charles University in Prague, Czech Republic, and his PhD in molecular, cellular, developmental biology and genetics from the University of Minnesota He has been interested in the use of hematopoietic transplantation for bone marrow failure (e.g., aplastic anemia and dyskeratosis congenita) and metabolic disorders (e.g., mucopolysaccharidosis type I and adrenoleukodystrophy) His research focuses on the use of bone marrow derived stem cells and Sleeping Beauty transposon gene therapy for correction of genetic diseases and improving outcome of blood and marrow transplantation Marie Tomečková is a physician working at the Dept of Medical Informatics and in the Centre of Biomedical Informatics, Institute of Computer Science, Academy of Sciences of the Czech Republic Her specialization is non-invasive cardiology, namely epidemiology of atherosclerosis Igor Trajkovski is a researcher at the Department of Knowledge Technologies of the Jožef Stefan Institute in Ljubljana, Slovenia He received a MSc in computer science from the Saarland University and Max Plank Institute for Informatics in Germany and is now a PhD student at the Jožef Stefan Inter- 434 About the Contributors national Postgraduate School in Ljubljana His research interests are in machine learning, microarray data analysis, common sense knowledge representation and reasoning Sonia Uremovich received a Bachelor’s in business administration from Universidad Nacional del Sur in Bahía Blanca, Argentina She is teaching assistant at Department of Management Sciences at Universidad Nacional del Sur, and at a Technical High School Her research interests include decision support systems in management Igor Vajda was born in Martin, Czechoslovakia, in 1942 He graduated in mathematics from the Czech Technical University, Prague, in 1965 He received the PhD degree in probability and statistics from the Charles University, Prague, in 1968, and the DrSc degree in mathematical informatics from the same university in 1990 From 1965 he served as a research assistant, from 1966 as a researcher, and from 1990 as a principal researcher at the Institute of Information Theory and Automation, Czech Academy of Sciences, Prague During 1966 -2007 he was visiting scientist or visiting professor at various universities and research institutes in Europe, Russia and USA His research interests include information theory, asymptotic statistics and statistical decisions Dr Vajda is fellow of IEEE, associate editor of several international scientific journals and chairs or vice-chairs various committees at the Czech Technical University He was awarded several prizes for outstanding results of research Arnošt Veselý graduated at Faculty of Technical and Nuclear Physics of Czech Technical University in Prague Scientific degree in logic he received in 1990 at Philosophical Faculty of Charles University From 1992 he is a member of Department of Information Engineering of Czech University of Life Sciences in Prague and from 2001 he is associate professor in information management He reads lectures in operating systems and artificial intelligence From 2001 he is a member of Department of Medical Informatics at Institute of Computer Science of the Academy of Sciences of the Czech Republic His main interests are neural networks and formalization of knowledge mainly with regard to their applications in medicine Xiu Ying Wang received her PhD in computer science from The University, of Sydney in 2005 Currently she is working in the School of Information Technologies, The University of Sydney She is a member of IFAC and executive secretary of IFAC TC on BioMed Her research interests include image registration for applications in biomedicine and multimedia, and computer graphics Xin Zhang received her PhD in information technology degree from the University of North Carolina at Charlotte in December, 2007 Beginning Fall of 2008, she will join the University of North Carolina at Pembroke as an assistant professor in the Department of Mathematics and Computer Science Her research interest includes knowledge discovery and data mining, music information retrieval, and flexible query answering systems She is an author of 12 research papers, including two journal publications Her PhD research was supported by NSF Jana Zvárová graduated in 1965 at Charles University in Prague, Faculty of Mathematics and Physics She received PhD scientific degree in 1978 at Charles University in Prague She passed the habilitation for Doc (associated professor) in 1991 and became Prof (full professor) at Charles University in Prague in 1999 Since 1994 she is the principal researcher at the Institute of Computer Science, Academy of 435 About the Contributors Sciences of the Czech Republic, head of the Department of Medical Informatics and the director of the European Center of Medical Informatics, Statistics and Epidemiology (EuroMISE Center) She has been the representative of the Czech Republic in the International Medical Informatics Association (IMIA) and European Federation for Medical Informatics (EFMI) She is chairing the board of biomedical informatics of the PhD studies of Charles University and Academy of Sciences of the Czech Republic She has been a member of the editorial boards of national and international journals, she has served as the expert in the field for the EC and Czech governmental institutions She has published more then 200 of papers in national and international literature, received several medals for her work Filip Železný is the head of the Intelligent Data Analysis research group at GL He received his PhD in AI and biocybernetics from CTU and carried out post-doctoral training at the University of Wisconsin in Madison He was a visiting professor at the State University of New York in Binghamton Currently he is a grantee of the Czech Academy of Sciences and the European Commission His main research interest is relational machine learning and its applications in bioinformatics 436 437 Index Symbols 4ft-Miner 380, 383, 384, 386, 387, 388 A acute lymphoblastic leukemia (ALL) 270, 272, 276, 278, 279, 280, 291 acute myeloid leukemia (AML) 270, 272, 276, 278, 279, 280, 291 adaptation 78, 83, 177, 211, 229, 230, 232, 234, 236, 245, 249, 275, 315, 316, 317 adaptation rules 230, 232, 249 analytical knowledge 26, 35 APACHE 182, 204, 418 apnea 350, 351, 352, 360, 361, 362, 370, 372, 373, 374, 403, 405, 409, 413, 414, 418, 419, 423, 424 AQUA 207, 208, 211, 213, 214, 215, 216, 217, 218, 219, 220, 222, 223, 225, 420 artificial case 240, 241, 244 association rule 265, 268, 269, 283, 284, 285, 384, 387, 388 association rule mining 265, 268, 269, 283, 284, 285 atherosclerosis 101, 376, 377, 378, 380, 383, 393, 394 autonomic nervous system 294, 305 B background knowledge 73, 172, 173, 251, 252, 253, 254, 256, 268, 269, 270, 271, 273, 275, 277, 280, 281, 283, 286, 288, 355, 395, 415 backpropagation of error 76, 78, 85, 95 Bayesian network 312, 315, 329, 405 biomedical image registration 122, 123, 127, 128, 132 body surface potential mapping (BSPM) 137, 138, 142, 143, 149, 160 C C4.5 65, 66, 73, 303, 305, 311, 330, 366, 417 case-base 229, 231, 232, 233, 236, 238, 23 9, 240, 242, 243, 244, 245, 249 case-based reasoning (CBR) 228, 230, 236, 244, 246, 247, 248, 249, 398, 399, 407, 413, 414, 416, 418 centre vectors 194, 205 chi-squared automatic interaction detection (CHAID) 334, 335, 338, 339, 340, 344, 345, 349 classification learning 301, 306 classification problem 63, 64, 65, 67, 77, 90, 92, 93, 99, 219, 317, 319, 325, 382 clinical prediction rules 350–375 clustering 51, 53, 54, 149, 151, 152, 154, 157, 174, 178, 182, 191, 192, 194, 195, 196, 197, 199, 200, 201, 202, 203, 204, 237, 251, 252, 253, 254, 264, 266, 272, 281, 283, 284, 289, 301, 333, 355, 399, 405, 406, 408, 409, 411, 415, 416, 419 communication information 14, 15, 19, 20, 21, 35 compositional adaptation 229, 249 confusion matrix 67, 68, 70, 71, 186, 187, 188, 189, 200, 205, 310, 342 Copyright © 2009, IGI Global, distributing in print or electronic forms without written permission of IGI Global is prohibited Index constraint-based paradigm 251, 252, 253, 254, 255, 262 cost-sensitive approaches 58 cost-sensitive evaluation 57, 66, 67, 70, 73, 413 cost-sensitive learning 57, 59, 60, 62, 63, 64, 66, 72, 74, 404, 425 cost curves 70, 71 CRISP-DM 333, 334, 348, 419 cross-entropy error function 76, 78, 91, 92, 95, 99, 100 cross-validation 97, 98, 99, 173, 174, 276, 317, 319, 342, 394 D data constitution 25 data integration 38, 45, 55, 420 decision information 1, 21, 22, 23, 35 decision limits 108, 109, 113, 117, 118 decision tree 59, 62, 63, 64, 66, 72, 73, 74, 152, 154, 200, 303, 305, 311, 334, 338, 343, 349, 366, 374, 404, 415, 419, 422, 424 deformable registration 125, 128, 129 diagnostic criteria 350, 351, 360, 364, 370 digital filtering 151 Direct Observation of Therapy (DOT) strategy 332, 336, 337, 347 Discovery Challenge workshops 265, 394, 396, 399, 407 discretization 284, 285, 286, 299, 306, 327 , 395 discrimination information 1, 15, 16, 17, 18, 19, 20, 21, 22, 30, 35 E electrocardiogram (ECG) 62, 77, 106, 137, 138, 139, 140, 141, 142, 143, 145, 146, 147, 148, 149, 138, 137, 149, 143, 149, 150, 151, 152, 154, 155, 156, 157, 158, 159, 375, 379, 398, 400, 402, 403, 406, 408, 409, 410, 411, 412, 415, 419, 421, 423 electroencephalogram (EEG) 5, 77, 105, 142, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 172, 173, 174, 438 175, 164, 163, 170, 175, 176, 177, 178, 179, 180, 375, 398, 404, 406, 408, 411, 412, 415, 417, 418, 420, 423 error rate 58, 61, 64, 67, 68 F false negative 57, 58, 67, 115, 155, 272, 310 false positive 58, 59, 67, 155, 186, 187, 205, 272, 310 feature extraction 96, 137, 138, 143, 148, 149, 151, 156, 161, 162, 164, 166, 168, 169, 177, 179, 280, 411 feature selection 177, 197, 202, 268, 277, 278, 279, 280, 289, 310, 313, 322, 407 feature subset selection 309, 310, 329, 331, 410 Fourier transform 143, 144, 151, 165 functional genomics 253, 267, 291 fuzzy logic 119, 159, 350, 351, 355, 358, 371, 374, 404, 411 G Gaussian-stacking 318, 319, 320, 321, 326, 327 Gaussian network 307, 309, 313, 318, 327 gene expression 251, 252, 253, 254, 260, 264, 265, 267, 268, 269, 270, 277, 278, 280, 283, 284, 287, 288, 289, 290, 400, 401, 403, 406, 407, 421, 423 gene ontology 273, 274, 275, 277, 281, 28 4, 288 gradient descent search 83, 84, 86, 96, 107 GUHA procedures 378, 382, 383, 395 H habituation 294, 295, 304, 306 heuristics 181, 182, 185 hierarchical clustering 192, 252, 264 Holter ECG 141, 142, 151, 154, 156, 160 hyperacussis 296, 306 hypopnea 352, 360 Index I implantation rate 308, 309, 324 in-vitro fertilization (IVF) 307, 308, 321, 322, 323, 324, 326, 329, 330, 409, 413, 416, 422 information extraction 51, 207, 220, 221, 223 information retrieval 37, 39, 48, 55, 207, 305, 425 inter-subject registration 135 intra-subject registration 135 intra cytoplasmic sperm injection (ICSI) 308, 322, 324 irrelevant variables 277, 314, 331 K K-means 51, 191, 192, 196, 204, 252, 286 , 415 KL-Miner 389, 391, 396, 417 knowledge base 2, 34, 37, 39, 43, 53, 201, 230, 231, 232, 233, 234, 235, 236, 238, 240, 245, 246, 334, 350, 351, 356, 359, 360, 392, 407 knowledge discovery 51, 52, 55, 58, 122, 251, 253, 254, 264, 266, 267, 268, 269, 281, 288, 291, 333, 343, 355, 410, 413, 420 L labelling criteria 208, 211, 216, 219 LISp-Miner 378, 383, 395 M machine learning 58, 59, 60, 68, 73, 74, 75, 101, 113, 150, 159, 190, 194, 201, 208, 275, 305, 307, 309, 311, 319, 331, 355, 376, 377, 378, 411, 424 medical informatics 1, 2, 3, 34, 35, 402, 406, 407, 414, 425 meta-learning 63, 308, 309, 322 metadata 6, 50, 207, 214, 215, 216, 225, 226, 296, 318, 355 microarray 253, 266, 270, 274, 284, 289, 290, 401, 404, 405, 407, 412, 417 misclassification costs 57, 58, 59, 60, 61, 62, 64, 65, 66, 70, 73, 74, 408 misophonia 294, 295, 299, 306 monomodal images 128, 136 multilayer feedforward network 78, 81, 83, 106, 119, 408 multilayer perceptron 109, 116, 118, 175 multimodal images 126, 131, 136 mutual information (MI) 197, 198, 205 N neural nets 181, 182, 188 neural network training 86, 107 O obstructive sleep apnea (OSA) 351–375 ontology web language (OWL) 38, 42, 47, 48, 50, 56, 219 optimal efficiency 115, 116, 117 overfitting 78, 96, 97, 98, 99, 102, 104, 115, 276, 277, 318, 319, 326, 346 oximetry 352, 369, 375 P partitioning 191, 205, 335, 353 phonophobia 294, 299, 306 polysomnogram (PSG) 352, 364, 369, 375 POSSUM 182, 183, 184, 185, 186, 188, 189, 190, 200, 203, 204, 205, 402, 416, 424 predictor variables 309, 312, 313, 314, 315, 316, 318, 328, 331, 334, 353 preprocessing perceptron 109–121 R receiver operating characteristic (ROC) graphs 68, 69, 70, 72, 187, 405 recursive mining 251, 258, 259, 264, 267 redundant variables 309, 331 RefSeq 271, 291 regression problem 77, 78, 89, 101, 382 relational descriptions 274, 276 relational subgroup discovery (RSD) 274, 275, 276, 288, 290, 291, 412, 424 439 Index resource description framework (RDF) 50, 20 7, 211, 213, 214, 215, 216, 219, 225, 226, 423 rigid-body transformations 125 risk prediction 181, 182, 183, 185, 186, 189, 197, 199, 200, 201, 202, 205 S SAGE 251–267 SD4ft-Miner 387, 388, 389 Semantic Web 37, 46, 50, 53, 207, 223, 224, 226, 396, 399, 401, 405, 417 semio-fuzzy framework 350, 352, 355, 359, 360, 363, 370, 371 semiotics 350, 355, 357, 371 sequential minimal optimization (SMO) algorithm 220, 221, 222, 312, 319, 320, 321, 326, 327 sigmoid 80, 81, 82, 84, 92, 98, 99, 100, 110, 112 simple perceptron 80 SRC 273, 292 STULONG 377, 378, 380, 381, 382, 383, 387, 393, 394, 395 success rate 64, 174, 175, 308, 309, 322 sum-of-square error function 76, 78, 83, 91, 92, 99 supervised classification 308, 309, 310, 313, 317, 318, 325, 331 440 supervised classifier 190, 205 synthetic knowledge 26, 36 T taxonomy 38, 46, 47, 49, 60 temporal feature 299, 306 text mining 50, 51, 52, 293, 297, 300 tinnitus 293–306 Tinnitus Handicap Inventory 296, 298, 299, 302, 304, 306 tinnitus retraining therapy (TRT) 293, 294, 295, 306 training data 62, 63, 64, 65, 87, 88, 90, 92, 96, 97, 221, 276, 309, 310, 317, 318, 334, 343, 353, 368 true negative 67, 115, 155, 205, 310 true positive 67, 115, 155, 186, 187, 205, 310 tuberculosis 332–349 U Unified Medical Language System (UMLS) 38, 45, 48, 51, 212, 217, 218, 219 W wavelet 105, 137, 138, 143, 144, 145, 147, 149, 150, 151, 156, 160, 401 Web crawling 208, 216, 221, 223, 225 Web mining 207, 226 Web spidering 208, 219, 221, 223, 226 ... Cataloging-in-Publication Data Data mining and medical knowledge management : cases and applications / Petr Berka, Jan Rauch, and Djamel Abdelkader Zighed, editors p ; cm Includes bibliographical references and index... The book ? ?Data Mining and Medical Knowledge Management: Cases and Applications? ?? is a collection of case studies in which advanced DM and KM solutions are applied to concrete cases in biomedical... peer-reviewed journals and international conferences xix Preface The basic notion of the book ? ?Data Mining and Medical Knowledge Management: Cases and Applications? ?? is knowledge A number of definitions