Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống
1
/ 427 trang
THÔNG TIN TÀI LIỆU
Thông tin cơ bản
Định dạng
Số trang
427
Dung lượng
17,44 MB
Nội dung
ProbabilisticMethodsforFinancialandMarketingInformatics Richard E Neapolitan Xia Jiang Publisher Publishing Services Manager Project Manager Assistant Editor Interior printer Cover printer Diane D Cerra George Morrison Kathryn Liston Asma Palmeiro The Maple-Vail Book Manufacturing Group Phoenix Color Morgan Kaufmann Publishers is an imprint of Elsevier 500 Sansome Street, Suite 400, San Francisco, CA 94111 This book is printed on acid-free paper @ 2007 by Elsevier Inc All rights reserved Designations used by companies to distinguish their products are often claimed as trademarks or registered trademarks In all instances in which Morgan Kaufmann Publishers is aware of a claim, the product names appear in initial capital or all capital letters Readers, however, should contact the appropriate companies for more complete information regarding trademarks and registration No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means electronic, mechanical, photocopying, scanning, or otherwise-without prior written permission of the publisher Permissions may be sought directly from Elsevier's Science & Technology Rights Department in Oxford, UK: phone: (+44) 1865 843830, fax: (+44) 1865 853333, E-mail: permissions@elsevier.com You may also complete your request online via the Elsevier homepage (http://elsevier.com), by selecting "Support Contact" then "Copyright and Permission" and then "Obtaining Permissions." Library of Congress Cataloging-in-Publication Data Application submitted ISBN 13:978-0-12-370477-1 ISBN10:0-12-370477-4 For information on all Morgan Kaufmann publications, visit our Web site at www.mkp.com or www.books.elsevier.com Printed in the United States of America 07 08 09 10 11 10987654321 Working together to grow libraries in developing countries www.elsevier.com I www.bookaid.org I www.sabre.org Preface This book is based on a course I recently developed for computer science majors at Northeastern Illinois University (NEIU) The motivation for developing this course came from guidance I obtained from the NEIU Computer Science Department Advisory Board One objective of this Board is to advise the Department concerning the maintenance of curricula that is relevant to the needs of companies in Chicagoland The Board consists of individuals in IT departments from major companies such as Walgreen's, AON Company, United Airlines, Harris Bank, and Microsoft After the dot.com bust and the introduction of outsourcing, it became evident that students, trained only in the fundamentals of computer science, programming, web design, etc., often did not have the skills to compete in the current U.S job market So I asked the Advisory Board what else the students should know The board unanimously felt the students needed business skills such as knowledge of IT project management, marketing, and finance As a result, our revised curriculum, for students who hoped to obtain employment immediately following graduation, contained a number of business courses However, several members of the board said they'd like to see students equipped with knowledge of cutting edge applications of computer science to areas such as decision analysis, risk management, data mining, and market basket analysis I realized that some of the best work in these areas was being done in my own field, namely Bayesian networks After consulting with colleagues worldwide and checking on topics taught in similar programs at other universities, I decided it was time for a course on applying probabilistic reasoning to business problems So my new course called "Informatics for MIS Students" and this book called ProbabilisticMethodsforFinancialandMarketingInformatics were conceived Part I covers the basics of Bayesian networks and decision analysis Much of this material is based on my 2004 book Learning Bayesian Networks However, I've tried to make the material more accessible Rather than dwelling on rigor, algorithms, and proofs of theorems, I concentrate on showing examples and using the software package Netica to represent and solve problems The specific content of Part I is as follows: Chapter provides a definition of informaticsandprobabilisticinformatics Chapter reviews the probability and statistics needed to understand the remainder of the book Chapter presents Bayesian networks and inference in Bayesian networks Chapter concerns learning Bayesian networks from data Chapter introduces decision analysis iii iv and influence diagrams, and Chapter covers further topics in decision analysis There is overlap between the material in Part I and that which would be found in a book on decision analysis However, I discuss Bayesian networks and learning Bayesian networks in more detail, whereas a decision analysis book would show more examples of solving problems using decision analysis Sections and subsections in Part I that are marked with a star ( ~ ) contain material that either requires a background in continuous mathematics or that seems to be inherently more difficult than the material in the rest of the book For the most part, these sections can be skipped without impacting one's mastery of the rest of the book The only exception is that if Section 3.6 (which covers d-separation) is omitted, it will be necessary to briefly review the faithfulness condition in order to understand Sections 4.4.1 and 4.5.1, which concern the constraint-based method for learning faithful DAGs from data I believe one can gain an intuition for this type of learning from a few simple examples, and one does not need a formal knowledge of d-separation to understand these examples I've presented constraint-based learning in this fashion at several talks and workshops worldwide and found that the audience could always understand the material Furthermore, this is how I present the material to my students Part II presents financial applications Specifically, Chapter presents the basics of investment science and develops a Bayesian network for portfolio risk analysis Sections 7.2 and 7.3 are marked with a star ('k) because the material in these sections seems inherently more difficult than most of the other material in the book However, they not require as background the material from Part I that is marked with a star ( ~ ) Chapter discusses modeling real options, which concerns decisions a company must make as to what projects it should pursue Chapter covers venture capital decision making, which is the process of deciding whether to invest money in a start-up company Chapter 10 discusses a model for bankruptcy prediction Part III contains chapters on two important areas of marketing First, Chapter 11 shows methodsfor doing collaborative filtering and market basket analysis These disciplines concern determining what products an individual might prefer based on how the individual feels about other products Finally, Chapter 12 presents a technique for doing targeted advertising, which is the process of identifying those customers to whom advertisements should be sent There is too much material for me to cover the entire book in a one semester course at NEIU Since the course requires discrete mathematics and business statistics as prerequisites, I only review most of the material in Chapter However, I discuss conditional independence in depth because ordinarily the students have not been exposed to this concept I then cover the following sections from the remainder of the book: Chapter 3:3.1-3.5.1 Chapter 4: 4.1, 4.2, 4.4.1, 4.5.1, 4.6 Chapter 5: 5.1-5.3.2, 5.3.4 Chapters - 12: All sections The course is titled "Informatics for MIS Students," and is a required course in the MIS (Management Information Science) concentration of NEIU's Computer Science M.S Degree Program This book should be appropriate for any similar course in an MIS, computer science, business, or MBA program It is intended for upper level undergraduate and graduate students Besides having taken one or two courses covering basic probability and statistics, it would be useful but not necessary for the student to have studied data structures Part I of the book could also be used for the first part of any course involving probabilistic reasoning using Bayesian networks That is, although many of the examples in Part I concern the stock market and applications to business problems, I've presented the material in a general way Therefore, an instructor could use Part I to cover basic concepts and then provide papers relative to a particular domain of interest For example, if the course is "Probabilistic Methodsfor Medical Informatics," the instructor could cover Part I of this book, and then provide papers concerning applications in the medical domain For the most part, the applications discussed in Part II were the results of research done at the School of Business of the University of Kansas, while the applications in Part III were the results of research done by the Machine Learning and Applied Statistics Group of Microsoft Research The reason is not that I have any particular affiliations with either of this institutions Rather, I did an extensive search forfinancialandmarketing applications, and the ones I found that seemed to be most carefully designed and evaluated came from these institutions I thank Catherine Shenoy for reviewing the chapter on investment science and Dawn Homes, Francisco Javier Dfez, and Padmini Jyotishmati for reviewing the entire book They all offered many useful comments and criticisms I thank Prakash Shenoy and Edwin Burmeister for correspondence concerning some of the content of the book I thank my co-author, Xia Jiang, for giving me the idea to write this book in the first place, andfor her efforts on the book itself Finally, I thank Prentice Hall for granting me permission to reprint material from my 2004 book Learning Bayesian Networks Rich Neapolitan RE-Neapolit an@neiu, ed u This Page Intentionally Left Blank Contents Preface I iii Bayesian Networks and Decision Analysis ProbabilisticInformatics W h a t Is I n f o r m a t i c s ? ProbabilisticInformatics 1.3 O u t l i n e of T h i s B o o k Probability and Statistics 2.1 2.2 2.3 2.4 2.5 3 1.1 1.2 P r o b a b i l i t y Basics 2.1.1 Probability Spaces 2.1.2 Conditional Probability and Independence 10 12 2.1.3 Bayes' Theorem Random Variables 2.2.1 P r o b a b i l i t y D i s t r i b u t i o n s of R a n d o m V a r i a b l e s 2.2.2 I n d e p e n d e n c e of R a n d o m V a r i a b l e s T h e M e a n i n g of P r o b a b i l i t y 2.3.1 Relative Frequency Approach to Probability 2.3.2 Subjective Approach to Probability R a n d o m V a r i a b l e s in A p p l i c a t i o n s Statistical Concepts 2.5.1 Expected Value 2.5.2 Variance and Covariance 2.5.3 Linear Regression 15 16 16 21 24 25 28 30 34 34 35 41 Bayesian Networks 53 3.1 W h a t Is a B a y e s i a n N e t w o r k ? 54 3.2 P r o p e r t i e s of B a y e s i a n N e t w o r k s 3.2.1 D e f i n i t i o n of a B a y e s i a n N e t w o r k 3.2.2 R e p r e s e n t a t i o n of a B a y e s i a n N e t w o r k 56 56 59 3.3 C a u s a l N e t w o r k s as B a y e s i a n N e t w o r k s 63 3.3.1 3.3.2 63 68 Causality Causality and the Markov Condition CONTENTS viii 3.4 3.5 3.6 3.3.3 T h e Markov Condition w i t h o u t Causality Inference in Bayesian Networks 3.4.1 Examples of Inference 3.4.2 Inference Algorithms and Packages 3.4.3 Inference Using Netica How Do We O b t a i n t h e Probabilities? 3.5.1 T h e Noisy O R - G a t e Model 3.5.2 M e t h o d s for Discretizing Continuous Variables * Entailed Conditional Independencies * 3.6.1 Examples of Entailed Conditional Independencies 3.6.2 d-Separation 3.6.3 Faithful and Unfaithful P r o b a b i l i t y Distributions 3.6.4 Markov Blankets and Boundaries 71 72 73 75 77 78 79 86 92 92 95 99 102 Learning Bayesian Networks 4.1 4.2 4.3 4.4 4.5 4.6 4.7 111 P a r a m e t e r Learning 112 4.1.1 Learning a Single P a r a m e t e r 112 4.1.2 Learning All P a r a m e t e r s in a Bayesian Network 119 Learning S t r u c t u r e (Model Selection) 126 Score-Based S t r u c t u r e Learning * 127 4.3.1 Learning S t r u c t u r e Using the Bayesian Score 127 4.3.2 Model Averaging 137 C o n s t r a i n t - B a s e d S t r u c t u r e Learning 138 4.4.1 Learning a DAG Faithful to P 138 4.4.2 Learning a DAG in Which P Is E m b e d d e d Faithfully ~ 144 Causal Learning 145 4.5.1 Causal Faithfulness A s s u m p t i o n 145 4.5.2 Causal E m b e d d e d Faithfulness A s s u m p t i o n ~ 148 Software Packages for Learning 151 E x a m p l e s of Learning 153 4.7.1 Learning Bayesian Networks 153 4.7.2 Causal Learning 162 Decision Analysis Fundamentals 5.1 5.2 5.3 Decision Trees 5.1.1 Simple E x a m p l e s 5.1.2 Solving More C o m p l e x Decision Trees Influence D i a g r a m s 5.2.1 Representing with Influence D i a g r a m s 5.2.2 Solving Influence Diagrams 5.2.3 Techniques for Solving Influence D i a g r a m s * 5.2.4 Solving Influence Diagrams Using Netica D y n a m i c Networks * 5.3.1 D y n a m i c Bayesian Networks 5.3.2 D y n a m i c Influence D i a g r a m s 177 178 178 182 195 195 202 202 207 212 212 219 Further Techniques in Decision Analysis 6.1 6.2 6.3 6.4 6.5 6.6 II 229 M o d e l i n g Risk Preferences 230 6.1.1 The Exponential Utility Function 231 6.1.2 A D e c r e a s i n g Risk-Averse U t i l i t y F u n c t i o n A n a l y z i n g Risk D i r e c t l y 6.2.1 Using t h e Variance to M e a s u r e Risk 6.2.2 Risk Profiles Dominance 235 236 236 238 240 6.3.1 Deterministic Dominance 6.3.2 Stochastic Dominance 6.3.3 G o o d Decision versus G o o d O u t c o m e 240 241 S e n s i t i v i t y Analysis 6.4.1 Simple M o d e l s 6.4.2 A More Detailed Model 243 244 244 250 Value of I n f o r m a t i o n 254 6.5.1 E x p e c t e d Value of Perfect I n f o r m a t i o n 255 6.5.2 E x p e c t e d Value of I m p e r f e c t I n f o r m a t i o n N o r m a t i v e Decision Analysis Financial Applications 257 259 265 Investment Science 267 7.1 267 7.2 7.3 Basics of I n v e s t m e n t Science 7.1.1 Interest 7.1.2 Net P r e s e n t Value 267 270 7.1.3 Stocks 271 7.1.4 Portfolios 276 7.1.5 T h e M a r k e t Portfolio 276 7.1.6 M a r k e t Indices 277 A d v a n c e d Topics in I n v e s t m e n t Science ~ 278 7.2.1 M e a n - V a r i a n c e Portfolio T h e o r y 278 7.2.2 M a r k e t Efficiency a n d C A P M 7.2.3 Factor Models and A P T 7.2.4 Equity Valuation Models 285 296 303 A B a y e s i a n N e t w o r k Portfolio Risk A n a l y z e r * 314 315 7.3.1 Network Structure 7.3.2 Network Parameters 7.3.3 T h e Portfolio Value a n d A d d i n g E v i d e n c e 317 319 Modeling Real Options 329 8.1 Solving R e a l O p t i o n s Decision P r o b l e m s 330 8.2 Making a Plan 339 8.3 S e n s i t i v i t y Analysis 340 400 BIBLIOGRAPHY [Diez and Druzdzel, 2006] Diez, F.J., and M.J Druzdzel, "Canonical Probabilistic Models for Knowledge Engineering," submitted for publication, 2006 [Druzdzel and Glymour, 1999] Druzdzel, M.J., and C Glymour, "Causal Inferences from Databases: Why Universities Lose Students," in Glymour, C., and G.F Cooper (Eds.): Computation, Causation, and Discovery, AAAI Press, Menlo Park, California, 1999 [Eells, 1991] Eells, E., Probabilistic Causality, Cambridge University Press, London, 1991 [Fama and MacBeth, 1973] Fama, E., and J MacBeth, "Risk, Return, and Equilibrium: Empirical Tests," Jourhal of Political Economy, Vol 81, No 3, 1973 [Feller, 1968] Feller, W., An Introduction to Probability Theory and Its Applications, Wiley, New York, 1968 [Flanders et al., 1996] Flanders, A.E., C.M Spettell, L.M Tartaglino, D.P Friedman, and G.J Herbison, "Forecasting Motor Recovery after Cervical Spinal Cord Injury: Value of MRI," Radiology, Vol 201, 1996 [Fung and Chang, 1990] Fung, R., and K Chang, "Weighing and Integrating Evidence for Stochastic Simulation in Bayesian Networks," in Henrion, M., R.D Shachter, L.N Kanal, and J.F Lemmer (Eds.): Uncertainty in Artificial Intelligence 5, North Holland, Amsterdam, 1990 [Graham and Zweig, 2003] Graham, B., and J Zweig, The Intelligent Investor: The Definitive Book on Value Investing, Revised Edition, HarperCollins, New York, 2003 [Heckerman, 1996] Heckerman, D., "A Tutorial on Learning with Bayesian Networks," Technical Report # MSR-TR-95-06, Microsoft Research, Redmond, Washington, 1996 [Heckerman et al., 1999] Heckerman, D., C Meek, and G Cooper, "A Bayesian Approach to Causal Discovery," in Glymour, C., and G.F Cooper BIBLIOGRAPHY 401 (Eds.): Computation, Causation, and Discovery, AAAI Press, Menlo Park, California, 1999 [Heckerman and Meek, 1997] Heckerman, D., and C Meek, "Embedded Bayesian Network Classifiers," Technical Report MSR-TR-97-06, Microsoft Research, Redmond, Washington, 1997 [Herskovits and Dagher, 1997] Herskovits, E.H., and A.P Dagher, "Applications of Bayesian Networks to Health Care," Technical Report NSI-TR-1997-02, Noetic Systems Incorporated, Baltimore, Maryland, 1997 [Hogg and Craig, 1972] Hogg, R.V., and A.T Craig, Introduction to Mathematical Statistics, Macmillan, New York, 1972 [Huang et al., 1994] Huang, T., D Koller, J Malik, G Ogasawara, B Rao, S Russell, and J Weber, "Automatic Symbolic Traffic Scene Analysis Using Belief Networks," Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAIg4), AAAI Press, Seattle, Washington, 1994 [Hume, 1748] Hume, D., An Inquiry Concerning Human Understanding, Prometheus, Amhurst, New York, 1988 (originally published in 1748) [Ingersoll, 1987] Ingersoll, J., Theory of Financial Decision Making, Rowman & Littlefield, Lanham, Maryland, 1987 [Iversen et al., 1971] Iversen, G.R., W.H Longcor, F Mosteller, J.P Gilbert, and C Youtz, "Bias and Runs in Dice Throwing and Recording: A Few Million Throws," Psychometrika, Vol 36, 1971 [Jensen et al., 1990] Jensen, F.V., S L Lauritzen, and K.G Olesen, "Bayesian Updating in Causal Probabilistic Networks by Local Computation," Computational Statistical Quarterly, Vol 4, 1990 [Jensen, 2001] Jensen, F.V., Bayesian Networks and Decision Graphs, Springer-Verlag, New York, 2001 402 BIBLIOGRAPHY [Kahneman and Tversky, 1979] Kahneman, D., and A Tversky, "Prospect Theory: An Analysis of Decision Under Risk," Econometrica, Vol 47, 1979 [Keefer, 1983] Keefer, D.L., "3-Point Approximations for Continuous Random Variables," Management Science, Vol 29, 1983 [Kemmerer et al., 2006] Kemmerer, B., Mishra, S., and P Shenoy, "Bayesian Causal Maps as Decision Aids in Venture Capital Decision Making," submitted to the Entrepreneurship Division (ENT), 2006 [Kennett et al., 2001] Kennett, R., K Korb, and A Nicholson, "Seabreeze Prediction Using Bayesian Networks: A Case Study," Proceedings of the 5th Pacific-Asia Conference on Advances in Knowledge Discovery and Data MiningPAKDD, Springer-Verlag, New York, 2001 [Kerrich, 1946] Kerrich, J.E., An Experimental Introduction to the Theory of Probability, Einer Munksgaard, Copenhagen, 1946 [Lander and Shenoy, 1999] Lander, D.M., and P Shenoy, "Modeling and Valuing Real Options Using Influence Diagrams," School of Business Working Paper No 283, University of Kansas, Lawrence, Kansas, 1999 [Lauritzen and Spiegelhalter, 1988] Lauritzen, S.L., and D.J Spiegelhalter, "Local Computation with Probabilities in Graphical Structures and Their Applications to Expert Systems," Journal of the Royal Statistical Society B, Vol 50, No 2, 1988 [Li and D'Ambrosio, 1994] Li, Z., and B D'Ambrosio, "Efficient Inference in Bayes' Networks as a Combinatorial Optimization Problem," International Journal of Approximate Inference, Vol 11, 1994 [Lindley, 1985] Lindley, D.V., Introduction to Probability and Statistics from a Bayesian Viewpoint, Cambridge University Press, London, 1985 BIBLIOGRAPHY 403 [Luenberger, 1998] Luenberger, D., Investment Science, Oxford, New York, 1998 [Lugg et al., 1995] Lugg, J.A., J Raifer, and C.N.F Gonz~lez, "Dehydrotestosterone is the Active Androgen in the Maintenance of Nitric OxideMediated Penile Erection in the Rat," Endocrinology, Vol i[36, No 4, 1995 [Lynch and Rothchild, 2000] Lynch, P., and J Rothchild, One Up On Wall Street: How to Use What You Already Know to Make Money in the Market, Fireside, New York, 2000 [Mani et al., 1997] Mani, S., S McDermott, and M Valtorta, "MENTOR: A Bayesian Model for Prediction of Mental Retardation in Newhorns," Research in Developmental Disabilities, Vol 8, No 5, 1997 [McClennan and Markham, 1999] McClennan, K.J., and A Markham, "Finasteride: A Review of Its Use in Male Pattern Baldness," Dr~zgs, Vol 57, No 1, 1999 [McElroy and Burmeister, 1988] McElroy, M., avd E Burmeister, "Arbitrage Pricing Theory as a Restricted Nonlinear Multivariate Regression Model: Iterated Nonlinear Seemingly Unrelated Regression Estimates," Journal of Business and Economic Statistics, Vol 6, No 1, 1988 [McKee, 2003] Mckee, T.E., "Rough Sets Bankruptcy Prediction Models Versus Auditor Signaling Rates," Journal of Forecasting, Vol 22, 2003 [McKee and Greenstein, 2000] McKee, T.E., and M Greenstein, "Predicting Bankruptcy Using Recursive Partitioning and a Realistically Proportioned Data Set," Journal of Forecasting, Vol 19, 2000 [McKee and Lensberg, 2002] McKee, T.E., and T Lensberg, "Genetic Programming and Rough Sets: A Hybrid Approach to Bankruptcy Classification," Journal of Operational Research, Vol 138, 2002 404 BIBLIOGRAPHY [McLachlan and Krishnan, 1997] McLachlan, G.J., and T Krishnan, The EM Algorithm and Its Extensions, Wiley, New York, 1997 [Meek, 1995] Meek, C., "Strong Completeness and Faithfulness in Bayesian Networks," in Besnard, P., and S Hanks (Eds.): Uncertainty in Artificial Intelligence; Proceedings of the Eleventh Conference, Morgan Kaufmann, San Mateo, California, 1995 [Neal, 1992] Neal, R., "Connectionist Learning of Belief Networks," Artificial Intelligence, Vol 56, 1992 [Neapolitan, 1990] Neapolitan, R.E., Probabilistic Reasoning in Expert Systems, Wiley, New York, 1990 [Neapolitan, 1992] Neapolitan, R.E., "A Survey of Uncertain and Approximate Inference," in Zadeh, L., and J Kacprzyk (Eds.): Fuzzy Logic for the Management of Uncertainty, Wiley, New York, 1992 [Neapolitan, 1996] Neapolitan, R.E., "Is Higher-Order Uncertainty Needed?" in IEEE Transactions on Systems, Man, and Cybernetics Part A: Systems and Humans, Vol 26, No 3, 1996 [Neapolitan, 2004] Neapolitan, R.E., Learning Bayesian Networks, Prentice Hall, Upper Saddle River, New Jersey, 2004 [Neapolitan and Morris, 2002] Neapolitan, R.E., and S Morris, "Probabilistic Modeling Using Bayesian Networks," in D Kaplan (Ed.): Handbook of Quantitative Methodology in the Social Sciences, Sage, Thousand Oaks, California, 2002 INcase and Owens, 1997] Nease, R.F., and D.K Owens, "Use of Influence Diagrams to Structure Medical Decisions," Medical Decision Making, Vol 17, 1997 [Nefian et al., 2002] Nefian, A.F., L H Liang, X.X Liu, X Pi and K Murphy, "Dynamic Bayesian Networks for Audio-Visual Speech Recognition," Journal of Applied Signal Processing, Special Issue on Joint Audio Visual Speech Processing, Vol 11, 2002 BIBLIOGRAPHY 405 [Nicholson, 1996] Nicholson, A.E., "Fall Diagnosis Using Dynamic Belief Networks," Proceedings of the ~th Pacific Rim International Conference on Artificial Intelligence (PRICAI96), Cairns, Australia, 1996 [Ohlson, 1980] Ohlson, J.A., "Financial Ratios and the Probabilistic Prediction of Bankruptcy," Journal of Accounting Research, Vol 19, 1980 [Olesen et al., 1992] Olesen, K.G., S.L Lauritzen, and F.V Jensen, "aHUGIN: A System Creating Adaptive Causal Probabilistic Networks," in Dubois, D., M.P Wellman, B D'Ambrosio, and P Smets (Eds.): Uncertainty in Artificial Intelligence; Proceedings of the Eighth Conference, Morgan Kaufmann, San Mateo, California, 1992 [Olmsted, 1983] Olmsted, S.M., "On Representing and Solving Influence Diagrams," Ph.D Thesis, Dept of Engineering-Economic Systems, Stanford University, California, 1983 [Pearl, 1986] Pearl, J., "Fusion, Propagation, and Structuring in Belief Networks," Artificial Intelligence, Vol 29, 1986 [Pearl, 1988] Pearl, J., Probabilistic Reasoning in Intelligent Systems, Morgan Kaufmann, San Mateo, California, 1988 [Pearl, 2000] Pearl, J., Causality: Models, Reasoning, and Inference, Cambridge University Press, Cambridge, U.K., 2000 [Piaget, 1966] Piaget, J., The Child's Conception of Physical Causality, Routledge and Kegan Paul, London, 1966 [Pradham and Dagum, 1996] Pradham, M., and P Dagum, "Optimal Monte Carlo Estimation of Belief Network Inference," in Horvitz, E., and F Jensen (Eds.): Uncertainty in Artificial Intelligence; Proceedings of the Twelfth Conference, Morgan Kaufmann, San Mateo, California, 1996 406 BIBLIOGRAPHY [Resnick et al., 1994] Resnick, P., N Iacovou, M Suchak, P Bergstrom, and J Riedl, "Grouplens: An Open Architecture for Collaborative Filtering of Netnews," Proceedings of the A CM 199~ Conference on Computer Supported Cooperative Work, New York, 1994 [Robinson, 1977] Robinson, R.W., "Counting Unlabeled Acyclic Digraphs," in Little, C.H.C (Ed.): Lecture Notes in Mathematics, 622: Combinatorial Mathematics V, Springer-Verlag, New York, 1977 [Rucker and Polanco, 1997] Rucker, J., and M.J Polanco, "Siteseer: Personalized Navigation of the Web," Communications of the A CM, Vol 40, No 3, 1997 [Ruhnka et al., 1992] Ruhnka, J.C., H.D Feldman, and T.J Dean, "The 'Living Dead' Phenomena in Venture Capital Investments," Journal of Business Venturing, Vol 7, No 2, 1992 [Russell and Norvig, 1995] Russell, S., and P Norvig, Artificial Intelligence: A Modern Approach, Prentice Hall, Upper Saddle River, New Jersey, 1995 [Salmon, 1997] Salmon, W., Causality and Explanation, Oxford University Press, New York, 1997 [Sarkar and Sriram, 2001] Sarkar, S., and R.S Sriram, "Bayesian Models for Early Warning of Bank Failure," Management Science, Vol 47, 2001 [Savage, 1954] Savage, L.J., Foundations of Statistics, Wiley, New York, 1954 [Scarville et al., 1996] Scarville J., S.B Button, J.E Edwards, A.R Lancaster, and T.W Elig, "Armed Forces 1996 Equal Opportunity Survey," DMDC Report No 97-0279, Defense Manpower Data Center, Arlington, VA., 1996 [Scheines et al., 1994] Scheines, R., P Spirtes, C Glymour, and C Meek, Tetrad II: User Manual, Lawrence Erlbaum, HilIsdale, New Jersey, 1994 [Sercu and Uppal, 1995] Sercu, P., and R Uppal, International Financial Markets and the Fi77n, Southwestern College, Cincinnati, Ohio, 1995 BIBLIOGRAPHY 407 [Shachter, 1986] Shachter, R.D., "Evaluating Influence Diagrams," Operations Research, Vol 34, 1986 [Shachter and Peot, 1990] Shachter, R.D., and M Peot, "Simulation Approaches to General Probabilistic Inference in Bayesian Networks," in Henrion, M., R.D Shachter, L.N Kanal, and J.F Lemmer (Eds.): Uncertainty in Artificial Intelligence 5, North-Holland, Amsterdam, 1990 [Shepherd and Zacharakis, 2002] Shepherd, D.A., and A Zacharakis, "Venture Capitalists' Expertise: A Call for Research into Decision Aids and Cognitive Feedback," Journal of Business Venturing, Vol 17, 2002 [Singh and Valtorta, 1995] Singh, M., and M Valtorta, "Construction of Bayesian Network Structures from Data: A Brief Survey and an Efficient AIgorithm," International Journal of Approximate Reasoning, Vol 12, 1995 [Spirtes et al., 1993, 2000] Spirtes, P., C Glymour, and R Scheines, Causation, Prediction, and Search, Springer-Verlag, New York, 1993; 2nd ed.: MIT Press, Cambridge, Massachusetts, 2000 [Srinivas, 1993] Srinivas, S., "A Generalization of the Noisy OR Model," in Heckerman, D., and A Mamdani (Eds.): Uncertainty in Artificial Intelligence; Proceedings of the Ninth Conference, Morgan Kaufmann, San Mateo, California, 1993 [Stangor et al., 2002] Stangor, C., J.K Swim, K.L Van Allen, and G.B Sechrist, "Reporting Discrimination in Public and Private Contexts," Jourhal of Personality and Social Psychology, Vol 82, 2002 [Sun and Shenoy, 2006] Sun, L., and P Shenoy, "Using Bayesian Networks for Bankruptcy Prediction: Some Methodological Issues," School of Business Working Paper No 302, University of Kansas, Lawrence, Kansas, 2006 408 BIBLIOGRAPHY [Tam and Kiang, 1992] Tam, K.Y., and M.Y Kiang, "Managerial Applications of Neural Networks: The Case of Bank Failure Predictions," Management Science, Vol 38, 1992 [Tatman and Shachter, 1990] Tatman, J.A., and R.D Shachter, "Dynamic Programming and Influence Diagrams," IEEE Transactions on Systems, Man, and Cybernetics, Vol 20, 1990 [Terveen et al., 1997] Terveen, L., W Hill, B Amento, D McDonald, and J Creter, "PHOAKS: A Systern for Sharing Recommendations," Communications of the A CM, Vol 40, No 3, 1997 [Tversky and Kahneman, 1981] Tversky, A., and D Kahneman, "The Framing of Decisions and the Psychology of Choice," Science, Vol 211, 1981 [van Lambalgen, 1987] van Lambalgen, M., "Random Sequences," Ph.D Thesis, University of Amsterdam, 1987 [von Mises, 1919] yon Mises, R., "Grundlagen der Wahrscheinlichkeitsrechnung," Mathematische Zeitschrift, Vol 5, 1919 [Wallace and Korb, 1999] Wallace, C.S., and K Korb, "Learning Linear Causal Models by MML Sampling," in Gammerman, A (Ed.): Causal Models and Intelligent Data Mining, Springer-Verlag, New York, 1999 [Zadeh, 1995] Zadeh, L., "Probability Theory and Fuzzy Logic Are Complementary Rather Than Competitive," Technometrics, Vol 37, 1995 Index Bracket Medians Method, 87 Abstract model, Accountability, 82 Active user, 374 Algorithm heuristic, model-based, Alternative, 179 Ancestor, 57 Arbitrage, 297 Arbitrage Pricing Theory (APT), 297 Arc reversal/node reduction, 207 Asset, 272 Average absolute deviation scoring, 381 Capital Asset Pricing Model (CAPM), 289 Capital market line, 286 Cascaded naive Bayesian network, 368 Case amplification, 377 Causal DAG, 63, 68, 105 Causal embedded faithfulness condition, 148 Causal faithfulness condition, 146 Causal graph, 68 Causal inhibition, 82 Causal Markov assumption, 69, 145 Causal minimum message length (CaMML), 155 Causal network, 69, 145 Causal strength, 80 Causality, 63 and the Markov condition, 68 Bad decision/good outcome, 243 Bayes' Theorem, 15, 53 Bayesian, 33 Bayesian information criterion (BIC), 152 Bayesian network, 58 dynamic, 213 embedded, 86 inference in, 72 learning parameters of, 112 learning structure of constraint-based, 138 score-based, 127 model averaging and, 137 naive, 358 cascaded, 368 noisy OR-gate model in, 79 parameters, 111 structure, 111 Bayesian score, 127 Beliefs, 28 Beta, 287 Blocked, 96 Cause direct, 68 Chain, 57 Chain rule, 20, 47 Chance node, 179, 195 Class probability tree, 388 complete, 388 growing, 390 Cluster learning problem, 378 Coefficient of determination, 44, 293 Collaborative filtering, 6, 373 memory-based, 374 model-based, 374 Compound interest, 269 Constant risk-averse utility function, 235 409 410 Constant-Growth Dividend Discount Model, 307 Convenience sample, 66 Correlation coefficient, 38, 375 Covariance, 37 Cumulative risk profile, 239 Cycle, 57 D-separation, 97 Data, Data mining, Datum, Decision, 179 versus good/bad outcome, 243 Decision analysis, 180 normative, 259 Decision node, 179, 195 Decision tree, 179 algorithm for solving, 182 solving, 180 Decreasing risk-averse utility function, 235 Default voting, 377 Descendent, 57 Deterministic dominance, 240 Direct cause, 68 Directed acyclic graph (DAG), 57 d-separation in, 97 head-to-head meeting in, 96 head-to-tail meeting in, 96 tail-to-tail meeting in, 96 Directed edge, 57 Directed graph, 56 Discounted cash flow (DCF), 329 Discounting, 56, 66 Discretizing continuous variables, 86 Bracket Medians Method, 87 Pearson-Tukey Method, 90, 363 Dividend, 272 Dividend Discount Model (DDM), 307 Dominance deterministic, 240 stochastic, 241 Dynamic Bayesian network, 213 Dynamic influence diagram, 219 INDEX Earnings, 308 retained, 308 Earnings per share (EPS), 308 Efficient frontier, 284 Efficient market, 285 Embedded Bayesian network, 86 Embedded faithfulness condition, 143 causal, 148 Emergent behavior, 222 Entailed conditional independency, 92 Equity, 308 Equivalent sample size, 123 Event, 10 elementary, 10 Exception independence, 82 Exchangeability, 113 Expected dividend growth rate, 307 Expected lift in profit (ELP), 392 Expected utility, 179 Expected value, 34 Expected value maximizer, 230 Expected value of imperfect informarion (EVII), 258 Expected value of perfect information (EVPI), 255 Experiment, 10 Explicit voting, 374 Exponential utility function, 231 Factor model, 296 Faithfulness condition, 101 and Markov boundary, 103 causal, 146 Firm-specific risk, 288 Forward P/E ratio, 311 Frequentist, 25 Fundamental analysis, 303 Good decision/bad outcome, 243 Greedy equivalent search (GES), 152 Growth rate expected dividend, 307 Growth stocks, 301 Head-to-head meeting, 96 Head-to-tail meeting, 96 INDEX Heuristic algorithm, Holding period, 268 Holding period return (HPR) return rate, 272 Implicit voting, 373 Includes, 127 Income stocks, 301 Independence, 13 conditional, 13 of random variables, 22, 23 of random variables, 21, 23 of random vectors, 212 Influence diagram, 195 dynamic, 219 solving, 203 Informatics, bio, financial, marketing, 5, medical, Information, Initial public offering (IPO), 271 Instantiate, 66 Interest compound, 269 simple, 268 Intrinsic value, 305 Inverse-user frequency, 377 Knowledge, Leaky noisy OR-gate model, 83 general formula for, 84 Linear regression multiple, 45 simple, 42 Logistic regression, 369 Logit function, 86 Macroeconomic risk factors, 296 Maintenance margin, 275 Managerial option, 330 Manipulation, 64 bad, 105 Margin, 274 call, 276 411 maintenance, 275 Market basket analysis, 374 Market portfolio, 276 Market risk, 288 Market-value-weighted index, 277 Markov blanket, 102 Markov boundary, 103 Markov condition, 58 and causal Markov assumption, 145 and Markov blanket, 102 Markov equivalent, 122 Markov property, 214 Maximum a posterior probability (MAP) 113 Maximum likelihood estimate (MLE), 27, 112 Mean, 35 Mean-standard deviation diagram, 280 Minimum message length (MML), 155 Minimum variance portfolio, 282 Mobile target localization, 216 Model averaging, 137 Model selection, 125 Model-based algorithm, Multiple linear regression, 45 Mutually exclusive and exhaustive, 14 Naive Bayesian network, 358 Net present expected value (NPEV), 329 Net present value (NPV), 271 No arbitrage principle, 297 Node(s), 56 chance, 179, 195 decision, 179, 195 utility, 195 Noisy OR-gate model, 79 assumptions in, 82 general formula for, 82 leaky, 83 Nondescendent, 57 Nonsystematic risk, 288 Nonsystematic shock, 296 412 Normative decision analysis, 259 Odds, 29 One-Fund Theorem, 285 Outcomes, 10 P / E ratio, 311 forward, 311 trailing, 311 Parameters, 111 Parent, 57 Path, 57 Pearson-Tukey Method, 90, 363 Plowback ratio, 309 Population, 10, 26, 30 finite, 26 Price-weighted index, 277 Principal, 268 Principle of Indifference, 11 Probability conditional, 12 correlation coefficient and, 38 covariance and, 37 distribution, 17 joint, 18 marginal, 19 expected value and, 34 law of total, 14 maximum a posterior, 113 maximum likelihood estimate of, 27 odds and, 29 posterior, 32 Principle of Indifference and, 11 prior, 32 relative frequency approach to, 25 space, 11 subjective approach to, 28 variance and, 35 Prospect theory, 259 Prospectus, 271 Quality adjusted life expectancy (QALE), 191 R-squared, 44, 293 INDEX Random matrix, 212 Random process, 27 Random sample, 26 Random sequence, 27 Random variable(s), 16 chain rule for, 20, 47 conditional independence of, 22, 23 in applications, 30 independence of, 21, 23 joint probability distribution of, 18 probability distribution of, 17 marginal, 19 space of, 16 Random vector, 212 Randomized controlled experiment (RCE), 64 Ranked scoring, 381 Rate of return, 268 Real option, 330 Regression logistic, 369 multiple linear, 45 simple linear, 42 Regret theory, 259 Relative frequency, 25 Required rate of return, 304 Retained earnings, 308 Return on equity (ROE), 308 Risk exposure, 290, 296 Risk premium, 290, 298 Risk profile, 238 cumulative, 239 Risk tolerance, 231 Risky discount rate, 329 Sample convenience, 66 random, 26 Sample space, 10 Sampling, 26 with replacement, 27 Score-based structure learning, 127 Security market line, 290 Selection bias, 66 Sensitivity analysis, 244 413 INDEX two-way, 246 Share capital, 308 Shareholder's equity, 308 Short sale, 274 Sigmoid function, 86 Simple interest, 268 Simple linear regression, 42 Standard error of coefficient, 43 Stationary, 214 Stochastic dominance, 241 Stock exchange, 271 Stock market index, 277 market-value-weighted, 277 price-weighted, 277 Stocks, 271 growt h, 301 income, 301 total capitalization of, 276 Structure, 111 Subjective probability, 28 Subjectivist, 28 Systematic risk, 288 Tail-to-tail meeting, 96 Targeted advertising, 387 Time trade-off quality adjustment, 191 Time-separable, 219 Total capitalization, 276 Trailing P/E ratio, 311 Treasury shares, 308 Two-way sensitivity analysis, 246 Utility, 179 expected, 179 Utility function, 230 constant risk-averse, 235 decreasing risk-averse, 235 exponential, 231 Utility node, 195 Valuation Model, 303 Value-at-risk (VaR), 319 Variance, 35 Vector similarity, 377 Venture capital (VC), 343 Voting default, 377 explicit, 374 implicit, 373 Weight in a portfolio, 276 This Page Intentionally Left Blank ... areas of informatics, namely financial informatics and marketing informatics F i n a n c i a l i n f o r m a t i c s involves applying the methods of informatics to the management of money and other... time for a course on applying probabilistic reasoning to business problems So my new course called "Informatics for MIS Students" and this book called Probabilistic Methods for Financial and Marketing. .. States These programs go by various names, including bioinformatics, medical informatics, chemical informatics, music informatics, marketing informatics, etc What these programs have in common? To