Theory of Financial Decision Making pptx

The first part, Chapters 1-3, provides an duction to utility theory, arbitrage, portfolio formation, and efficient markets.. The absence of arbitrage is one of themost convincing and, th

Trang 1

Theory of Financial Decision Making Jonathan E Ingersoll, Jr.

Yale University

Trang 2

This book is structured in four parts The first part, Chapters 1-3, provides an duction to utility theory, arbitrage, portfolio formation, and efficient markets Chapter 1provides some necessary background in microeconomics Consumer choice is reviewed,and expected utility maximization is introduced Risk aversion and its measurement arealso covered.

intro-Chapter 2 introduces the concept of arbitrage The absence of arbitrage is one of themost convincing and, therefore, farthest-reaching arguments made in financial economics.Arbitrage reasoning is the basis for the arbitrage pricing theory, one of the leading modelspurporting to explain the cross-sectional difference in asset returns, Perhaps more impor-tant, the absence of arbitrage is the key in the development of the Black-Scholes optionpricing model and its various derivatives, which have been used to value a wide variety ofclaims both in theory and in practice

Chapter 3 begins the study of single-period portfolio problems It also introduces thestudent to the theory of efficient markets: the premise that asset prices fully reflect allinformation available to the market The theory of efficient (or rational) markets is one ofthe cornerstones of modern finance; it permeates almost all current financial research andhas found wide acceptance among practitioners, as well

In the second main section, Chapters 4-9 cover single-period equilibrium models ter 4 covers mean-variance analysis and the capital asset pricing model - a model whichhas found many supporters and widespread applications Chapters 5 through 7 expand onChapter 4 The first two cover generalized measures of risk and additional mutual fundtheorems The latter treats linear factor models and the arbitrage pricing theory, probablythe key competitor of the CAPM

Chap-Chapter 8 offers an alternative equilibrium view based on complete markets theory Thistheory was originally noted for its elegant treatment of general equilibrium as in the models

of Arrow and Debreu and was considered to be primarily of theoretical interest Morerecently it and the related concept of spanning have found many practical applications incontingent-claims pricing

Trang 3

Chapter 9 reviews single-period finance with an overview of how the various modelscomplement one another It also provides a second view of the efficient markets hypothesis

in light of the developed equilibrium models

Chapter 10, which begins the third main section on multiperiod models, introduces els set in more than one period It reviews briefly the concept of discounting, with which

mod-it is assumed the reader is already acquainted, and reintroduces efficient markets theory inthis context

Chapters 11 and 13 examine the multiperiod portfolio problem Chapter 11 introducesdynamic programming and the induced or derived singleperiod portfolio problem inherent

in the intertemporal problem After some necessary mathematical background provided inChapter 12, Chapter 13 tackles the same problem in a continuous-time setting using themeanvariance tools of Chapter 4 Merton’s intertemporal capital asset pricing model isderived, and the desire of investors to hedge is examined

Chapter 14 covers option pricing Using arbitrage reasoning it develops free and preference-free restrictions on the valuation of options and other derivative assets

distribution-It culminates in the development of the Black-Scholes option pricing model Chapter 15summarizes multiperiod models and provides a view of how they complement one anotherand the single-period models It also discusses the role of complete markets and spanning

in a multiperiod context and develops the consumption- based asset pricing model

In the final main section, Chapter 16 is a second mathematical interruption- this time

to introduce the Ito calculus Chapter 17 explores advanced topics in option pricing usingIto calculus Chapter 18 examines the term structure of interest rates using both optiontechniques and multiperiod portfolio analysis Chapter 19 considers questions of corporatecapital structure Chapter 19 demonstrates many of the applications of the Black-Scholesmodel to the pricing of various corporate contracts

The mathematical prerequisites of this book have been kept as simple as practicable Aknowledge of calculus, probability and statistics, and basic linear algebra is assumed TheMathematical Introduction collects some required concepts from these areas Advancedtopics in stochastic processes and Ito calcu1us are developed heuristically, where needed,because they have become so important in finance Chapter 12 provides an introduction

to the stochastic processes used in continuous-time finance Chapter 16 is an introduction

to Ito calculus Other advanced mathematical topics, such as measure theory, are avoided.This choice of course, requires that rigor or generality sometimes be sacrificed to intuitionand understanding Major points are always presented verbally as well as mathematically.These presentations are usually accompanied by graphical illustrations and numerical ex-amples

To emphasize the theoretical framework of finance, many topics have been left ered There is virtually no description of the actual operation of financial markets or of thevarious institutions that play vital roles Also missing is a discussion of empirical tests ofthe various theories Empirical research in finance is perhaps more extensive than theoret-ical, and any adequate review would require a complete book itself The effects of marketimperfections are also not treated In the first place, theoretical results in this area have notyet been fully developed In addition the predictions of the perfect market models seem to

uncov-be surprisingly robust despite the necessary simplifying assumptions In any case an derstanding of the workings of perfect markets is obviously a precursor to studying marketimperfections

un-The material in this book (together with journal supplements) is designed for a full year’s

Trang 4

study Shorter courses can also be designed to suit individual tastes and prerequisites Forexample, the study of multiperiod models could commence immediately after Chapter 4.Much of the material on option pricing and contingent claims (except for parts of Chapter

18 on the term structure of interest rates) does not depend on the equilibrium models andcould be studied immediately after Chapter 3

This book is a text and not a treatise To avoid constant interruptions and footnotes,outside references and other citations have been kept to a minimum An extended chapter-by-chapter bibliography is provided, and my debt to the authors represented there should beobvious to anyone familiar with the development of finance It is my hope that any student

in the area also will come to learn of this indebtedness

I am also indebted to many colleagues and students who have read, or in some casestaught from, earlier drafts of this book Their advice, suggestions, and examples have allhelped to improve this product, and their continuing requests for the latest revision haveencouraged me to make it available in book form

Jonathan Ingersoll, Jr.New Haven

November 1986

Trang 5

Glossary of Commonly Used Symbols

a Often the parameter of the exponential utility functionu(Z) = − exp(−aZ).

B The factor loading matrix in the linear model

b Often the parameter of the quadratic utility functionu(Z) = Z − bZ2/2.

b k

i = Cov(˜z i , ˜ Z k

e )/(Cov(˜ z k

e , ˜ Z k

e )) A measure of systematic risk for the ith asset

with respect to thekth efficient portfolio Also the loading of the ith asset

on thekth factor, the measure of systematic risk in the factor model.

E The expectation operator Expectations are also often denoted with an overbar¯

e The base for natural logarithms and the exponential function e ≈ 2.71828.

¯ A factor in the linear factor model.

i As a subscript it usually denotes theith asset.

J A derived utility of wealth function in intertemporal portfolio models

j As a subscript it usually denotes theJth asset.

K The call price on a callable contingent claim

k As a subscript or superscript it usually denotes thekth investor.

L Usually a Lagrangian expression

m As a subscript or superscript it usually denotes the market portfolio

N(·) The cumulative normal distribution function

n(·) The standard normal density function

O(·) Asymptotic order symbol Function is of the same as or smaller order than its

argument

o(·) Asymptotic order symbol Function is of smaller order than its argument

p The supporting state price vector

q Usually denotes a probability

R The riskless return (the interest rate plus one)

r The interest rate r ≡ R − 1.

S In single-period models, the number of states In intertemporal models, the

price of a share of stock

s As a subscript or superscript it usually denotes states.

T Some fixed time, often the maturity date of an asset

t The tangency portfolio in the mean-variance portfolio problem

U A utility of consumption function

u A utility of return function

V A derived utility function

v The values of the assets

W (S, τ ) The Black-Scholes call option pricing function on a stock with priceS and

time to maturity ofτ

w A vector of portfolio weights w i is the fraction of wealth in theith asset.

X The exercise price for an option

Y The state space tableau of payoffs Y si is the payoff in states on asset i.

Z The state space tableau of returns Z siis the return in states on asset i.

Trang 6

ˆ

Zw The return on portfolio w

z As a subscript it denotes the zero beta portfolio

˜z The random returns on the assets

¯z The expected returns on the assets

0 A vector or matrix whose elements are 0

1 A vector whose elements are 1

> As a vector inequality each element of the left-hand vector is greater than the

corresponding element of the right-hand vector.< is similarly defined.

> As a vector inequality each element of the left-hand vector is greater than

or equal to the corresponding element of the right-hand vector, and at least

one element is strictly greater.6 is similarly defined

= As a vector inequality each element of the left-hand vector is greater than

or equal to the corresponding element of the righthand vector.5 is similarlydefined

α The expected, instantaneous rate of return on an asset

β ≡ Cov(˜ z, ˜ Z m) The beta of an asset

γ Often the parameter of the power utility functionu(Z) = Z γ /γ.

∆ A first difference

˜

ε The residual portion of an asset’s return

η A portfolio commitment of funds not nomalized

Θ A martingale pricing measure

ι j TheJth column of the identity matrix.

˜

Λ The state price per unit probability; a martingale pricing measure

λ Usually a Lagrange multiplier

λ The factor risk premiums in the APT

υ A portfolio of Arrow-Debreu securities.υ sis the number of states securities held.

π The vector of state probabilities

ρ A correlation coefficient

Σ The variance-covariance matrix of returns

σ A standard deviation, usually of the return on an asset

τ The time left until maturity of a contract

Φ Public information

φ k Private information of investork.

ω An arbitrage portfolio commitment of funds (10 ω = 0).

ω A Gauss-Wiener process dω is the increment to a Gauss-Wiener process.

Trang 7

0.1 Definitions and Notation 1

0.2 Matrices and Linear Algebra 4

0.3 Constrained Optimization 6

0.4 Probability 9

1 Utility Theory 1 1.1 Utility Functions and Preference Orderings 1

1.2 Properties of Ordinal Utility Functions 2

1.3 Properties of Some Commonly Used Ordinal Utility Functions 4

1.4 The Consumer’s Allocation Problem 5

1.5 Analyzing Consumer Demand 6

1.6 Solving a Specific Problem 9

1.7 Expected Utility Maximization 9

1.8 Cardinal and Ordinal Utility 11

1.9 The Independence Axiom 11

1.10 Utility Independence 13

1.11 Utility of Wealth 13

1.12 Risk Aversion 14

1.13 Some Useful Utility Functions 16

1.14 Comparing Risk Aversion 17

1.15 Higher-Order Derivatives of the Utility Function 18

1.16 The Boundedness Debate: Some History of Economic Thought 19

1.17 Multiperiod Utility Functions 19

2 Arbitrage and Pricing: The Basics 22 2.1 Notation 22

2.2 Redundant Assets 25

2.3 Contingent Claims and Derivative Assets 26

2.4 Insurable States 27

2.5 Dominance And Arbitrage 27

2.6 Pricing in the Absence of Arbitrage 29

2.7 More on the Riskless Return 32

2.8 Riskless Arbitrage and the “Single Price Law Of Markets” 33

2.9 Possibilities and Probabilities 34

2.10 “Risk-Neutral” Pricing 35

2.11 Economies with a Continuum of States 36

Trang 8

CONTENTS vii

3.1 The Canonical Portfolio Problem 38

3.2 Optimal Portfolios and Pricing 40

3.3 Properties of Some Simple Portfolios 41

3.4 Stochastic Dominance 43

3.5 The Theory of Efficient Markets 44

3.6 Efficient Markets in a “Riskless” Economy 45

3.7 Information Aggregation and Revelation in Efficient Markets: The General Case 46

3.8 Simple Examples of Information Revelation in an Efficient Market 48

4 Mean-Variance Portfolio Analysis 52 4.1 The Standard Mean-Variance Portfolio Problem 52

4.2 Covariance Properties of the Minimum-Variance Portfolios 56

4.3 The Mean-Variance Problem with a Riskless Asset 56

4.4 Expected Returns Relations 58

4.5 Equilibrium: The Capital Asset Pricing Model 59

4.6 Consistency of Mean-Variance Analysis and Expected Utility Maximization 62 4.7 Solving A Specific Problem 64

4.8 The State Prices Under Mean-Variance Analysis 65

4.9 Portfolio Analysis Using Higher Moments 65

A The Budget Constraint 68 B The Elliptical Distributions 70 B.1 Some Examples of Elliptical Variables 72

B.2 Solving a Specific Problem 75

B.3 Preference Over Mean Return 76

5 Generalized Risk, Portfolio Selection, and Asset Pricing 78 5.1 The Background 78

5.2 Risk: A Definition 78

5.3 Mean Preserving Spreads 79

5.4 Rothschild And Stiglitz Theorems On Risk 82

5.5 The Relative Riskiness of Opportunities with Different Expectations 83

5.6 Second-Order Stochastic Dominance 84

5.7 The Portfolio Problem 85

5.8 Solving A Specific Problem 86

5.9 Optimal and Efficient Portfolios 87

5.10 Verifying The Efficiency of a Given Portfolio 89

5.11 A Risk Measure for Individual Securities 92

5.12 Some Examples 93

A Stochastic Dominance 96 A.1 Nth-Order Stochastic Dominance 97

Trang 9

viii CONTENTS

6.1 Inefficiency of The Market Portfolio: An Example 99

6.2 Mutual Fund Theorems 102

6.3 One-Fund Separation Under Restrictions on Utility 103

6.4 Two-Fund Separation Under Restrictions on Utility 103

6.5 Market Equilibrium Under Two-Fund, Money Separation 105

6.7 Distributional Assumptions Permitting One-Fund Separation 107

6.8 Distributional Assumption Permitting Two-Fund, Money Separation 108

6.9 Equilibrium Under Two-Fund, Money Separation 110

6.10 Characterization of Some Separating Distributions 110

6.11 Two-Fund Separation with No Riskless Asset 111

6.12 K-Fund Separation 113

6.13 Pricing UnderK-Fund Separation 115

6.14 The Distinction between Factor Pricing and Separation 115

6.15 Separation Under Restrictions on Both Tastes and Distributions 117

7 The Linear Factor Model: Arbitrage Pricing Theory 120 7.1 Linear Factor Models 120

7.2 Single-Factor, Residual-Risk-Free Models 120

7.3 Multifactor Models 121

7.4 Interpretation of the Factor Risk Premiums 122

7.5 Factor Models with “Unavoidable” Risk 122

7.6 Asymptotic Arbitrage 123

7.7 Arbitrage Pricing of Assets with Idiosyncratic Risk 125

7.8 Risk and Risk Premiums 127

7.9 Fully Diversified Portfolios 128

7.10 Interpretation of the Factor Premiums 130

7.11 Pricing Bounds in A Finite Economy 133

7.12 Exact Pricing in the Linear Model 134

8 Equilibrium Models with Complete Markets 136 8.1 Notation 136

8.2 Valuation in Complete Markets 137

8.3 Portfolio Separation in Complete Markets 137

8.4 The Investor’s Portfolio Problem 138

8.5 Pareto Optimality of Complete Markets 139

8.6 Complete and Incomplete Markets: A Comparison 140

8.7 Pareto Optimality in Incomplete Markets: Effectively Complete Markets 140 8.8 Portfolio Separation and Effective Completeness 141

8.9 Efficient Set Convexity with Complete Markets 143

8.10 Creating and Pricing State Securities with Options 144

9 General Equilibrium Considerations in Asset Pricing 147 9.1 Returns Distributions and Financial Contracts 147

9.2 Systematic and Nonsystematic Risk 153

9.3 Market Efficiency with Nonspeculative Assets 154

9.4 Price Effects of Divergent Opinions 158

Trang 10

CONTENTS ix

9.5 Utility Aggregation and the “Representative” Investor 161

10 Intertemporal Models in Finance 163 10.1 Present Values 163

10.2 State Description of a Multiperiod Economy 163

10.3 The Intertemporal Consumption Investment Problem 166

10.4 Completion of the Market Through Dynamic Trading 168

10.5 Intertemporally Efficient Markets 170

10.6 Infinite Horizon Models 172

11 Discrete-time Intertemporal Portfolio Selection 175 11.1 Some Technical Considerations 187

A Consumption Portfolio Problem when Utility Is Not Additively Separable 188 B Myopic and Turnpike Portfolio Policies 193 B.1 Growth Optimal Portfolios 193

B.2 A Caveat 194

B.3 Myopic Portfolio Policies 195

B.4 Turnpike Portfolio Policies 195

12 An Introduction to the Distributions of Continuous-Time Finance 196 12.1 Compact Distributions 196

12.2 Combinations of Compact Random Variables 198

12.3 Implications for Portfolio Selection 198

12.4 “Infinitely Divisible” Distributions 200

12.5 Wiener and Poisson Processes 202

12.6 Discrete-Time Approximations for Wiener Processes 204

13 Continuous-Time Portfolio Selection 206 13.1 Solving a Specific Problem 208

13.2 Testing The Model 211

13.3 Efficiency Tests Using the Continuous-Time CAPM 213

13.4 Extending The Model to Stochastic Opportunity Sets 213

13.5 Interpreting The Portfolio Holdings 215

13.6 Equilibrium in the Extended Model 218

13.7 Continuous-Time Models with No Riskless Asset 219

13.8 State-Dependent Utility of Consumption 220

13.10A Nominal Equilibrium 225

14 The Pricing of Options 227 14.1 Distribution and Preference-Free Restrictions on Option Prices 227

14.2 Option Pricing: The Riskless Hedge 235

14.3 Option Pricing By The Black-Scholes Methodology 237

14.4 A Brief Digression 238

14.5 The Continuous-Time Riskless Hedge 239

14.6 The Option’s Price Dynamics 241

Trang 11

x CONTENTS

14.7 The Hedging Portfolio 242

14.8 Comparative Statics 243

14.9 The Black-Scholes Put Pricing Formula 244

14.10The Black-Scholes Model as the Limit of the Binomial Model 246

14.11Preference-Free Pricing: The Cox-Ross-Merton Technique 247

14.12More on General Distribution-Free Properties of Options 248

15 Review of Multiperiod Models 252 15.1 The Martingale Pricing Process for a Complete Market 252

15.2 The Martingale Process for the Continuous-Time CAPM 253

15.3 A Consumption-Based Asset-Pricing Model 254

15.4 The Martingale Measure When The Opportunity Set Is Stochastic 256

15.5 A Comparison of the Continuous-Time and Complete Market Models 257

15.6 Further Comparisons Between the Continuous-Time and Complete Market Models 258

15.7 More on the Consumption-Based Asset-Pricing Model 261

15.8 Models With State-Dependent Utility of Consumption 263

15.9 Discrete-Time Utility-Based Option Models 263

15.10Returns Distributions in the Intertemporal Asset Model 265

16 An Introduction to Stochastic Calculus 267 16.1 Diffusion Processes 267

16.2 Ito’s Lemma 267

16.3 Properties of Wiener Processes 268

16.4 Derivation of Ito’s Lemma 268

16.5 Multidimensional Ito’s Lemma 269

16.6 Forward and Backward Equations of Motion 269

16.7 Examples 270

16.8 First Passage Time 272

16.9 Maximum and Minimum of Diffusion Processes 273

16.10Diffusion Processes as Subordinated Wiener Processes 273

16.11Extreme Variation of Diffusion Processes 274

16.12Statistical Estimation of Diffusion Processes 275

17 Advanced Topics in Option Pricing 279 17.1 An Alternative Derivation 279

17.2 A Reexamination of The Hedging Derivation 280

17.3 The Option Equation: A Probabilistic Interpretation 281

17.4 Options With Arbitrary Payoffs 282

17.5 Option Pricing With Dividends 282

17.6 Options with Payoffs at Random Times 285

17.7 Option Pricing Summary 287

17.8 Perpetual Options 287

17.9 Options with Optimal Early Exercise 289

17.10Options with Path-Dependent Values 291

17.11Option Claims on More Than One Asset 294

17.12Option Claims on Nonprice Variables 295

17.13Permitted Stochastic Processes 297

Trang 12

CONTENTS xi

17.14Arbitrage “Doubling” Strategies in Continuous Time 298

18 The Term Structure of Interest Rates 300 18.1 Terminology 300

18.2 The Term Structure in a Certain Economy 301

18.3 The Expectations Hypothesis 302

18.4 A Simple Model of the Yield Curve 304

18.5 Term Structure Notation in Continuous Time 305

18.6 Term Structure Modeling in Continuous Time 306

18.7 Some Simple Continuous-Time Models 307

18.8 Permissible Equilibrium Specifications 309

18.9 Liquidity Preference and Preferred Habitats 311

18.10Determinants of the Interest Rate 314

18.11Bond Pricing with Multiple State Variables 315

19 Pricing the Capital Structure of the Firm 318 19.1 The Modigliani-Miller Irrelevancy Theorem 318

19.2 Failure of the M-M Theorem 320

19.3 Pricing the Capital Structure: An Introduction 321

19.4 Warrants and Rights 322

19.5 Risky Discount Bonds 324

19.6 The Risk Structure of Interest Rates 326

19.7 The Weighted Average Cost of Capital 329

19.8 Subordinated Debt 330

19.9 Subordination and Absolute Priority 331

19.10Secured Debt 333

19.11Convertible Securities 333

19.12Callable Bonds 335

19.13Optimal Sequential Exercise: Externalities and Monopoly Power 337

19.14Optimal Sequential Exercise: Competitive and Block Strategies 340

19.15Sequential and Block Exercise: An Example 343

19.16Pricing Corporate Securities with Interest Rate Risk 345

19.17Contingent Contracting 346

Trang 13

Mathematical Introduction

0.1 Definitions and Notation

Unless otherwise noted, all quantities represent real values In this book derivatives areoften denoted in the usual fashion by0,00and so forth Higher-order derivatives are denoted

by f (n) for the nth derivative Partial derivatives are often denoted by subscripts For

Closed intervals are denoted by brackets, open intervals by parentheses For example,

x ∈ [a, b] means all x such that a 6 x 6 b,

x ∈ [a, b) means all x such that a 6 x < b, (2)

The greatest and least values of a set are denoted by max(·) and min(·), respectively.

For example, ifx > y, then

The relative importance of terms is denoted by the asymptotic order symbols:

f (x) = o(x n) means lim

Dirac delta function

The Dirac delta functionδ(x) is defined by its properties:

Trang 14

Unit step function

The unit step function is the formal integral of the Dirac delta function and is given by

Iff and all its derivatives up to order n exist in the region [x, x + h], then it can be

repre-sented by a Taylor series with Lagrange remainder

Mean Value Theorem

The mean value theorem is simply the two-term form of the exact Taylor series with grange remainder:

La-f (x + h) = La-f (x) + La-f 0 (x + αh)h (12)for someα in [0, 1] The mean value theorem is also often stated in integral form If f (x)

is a continuous function in(a, b), then

Trang 15

0.1 Definitions and Notation 3

Implicit Function Theorem

Consider all points(x, y) on the curve with F (x, y) = a Along this curve the derivative

ofy with respect to x is

dy dx

SettingdF = 0 and solving for dy/dx gives the desired result.

Differentiation of Integrals: Leibniz’s Rule

Let F (x) ≡ RA(x) B(x) f (x, t)dt and assume that f and ∂f /∂x are continuous in t in [A, B]

andx in [a, b] Then

F 0 (x) =

Z B

A

f1(x, t)dt + f (x, B)B 0 (x) − f (x, A)A 0 (x) (15)

for allx in [a, b] If F (x) is defined by an improper integral (A = −∞ and/ or B = ∞),

then Leibniz’s rule can be employed if |f2(x, t)| 6 M(t) for all x in [a, b] and all t in [A, B], and the integralR M(t)dt converges in [A, B].

Homotheticity and Homogeneity

A functionF (x) of a vector x is said to be homogeneous of degree k to the point x0 if forallλ 6= 0

F (λ(x − x0)) = λ k F (x − x0). (16)

If no reference is made to the point of homogeneity, it is generally assumed to be 0 For

k = 1 the function is said to be linearly homogeneous This does not, of course, imply that

Similarly, allnth-order partial derivatives of F (·) are homogeneous of degree k − n.

Euler’s theorem states that the following condition is satisfied by homogeneous tions:

Trang 16

4 CONTENTS

A functionF (x) is said to be homothetic if it can be written as

whereg is homogeneous and h is continuous, nondecreasing, and positive.

0.2 Matrices and Linear Algebra

It is assumed that the reader is familiar with the basic notions of matrix manipulations Wewrite vectors as boldface lowercase letters and matrices as boldface uppercase letters Idenotes the identity matrix 0 denotes the null vector or null matrix as appropriate 1 is

a vector whose elements are unity ι n is a vector with zeros for all elements except the

nth, which is unity Transposes are denoted by 0 Vectors are, unless otherwise specified,column vectors, Transposed vectors are row vectors The inverse of a square matrix A isdenoted by A−1

Some of the more advanced matrix operations that will be useful are outlined next

Vector Equalities and Inequalities

Two vectors are equal, x = z, if every pair of components is equal: x i = z i Two vectorscannot be equal unless they have the same dimension We adopt the following inequalityconventions:

Each vector is normalized to have unit length and is orthogonal to all others

Generalized (Conditional) Inverses

Only nonsingular (square) matrices possess inverses; however, all matrices have alized or conditional inverses The conditional inverse Ac of a matrix A is any matrixsatisfying

If A ism × n, then A c isn × m The conditional inverse of a matrix always exists, but it

need not be unique If the matrix is nonsingular, then A−1 is a conditional inverse

Trang 17

0.2 Matrices and Linear Algebra 5

The Moore-Penrose generalized inverse A−is a conditional inverse satisfying the tional conditions

Also AA−1, A−A, I− AA −, and I− A −A are all symmetric and idempotent

If A is anm × n matrix of rank m, then A − = A0(AA0)−1 is the right inverse of A (i.e.,

AA− = I) Similarly, if the rank of A is n, then A − = (A0A)−1A0is the left inverse

Vector and Matrix Norms

A norm is a single nonnegative number assigned to a matrix or vector as a measure ofits magnitude It is similar to the absolute value of a real number or the modulus of acomplex number A convenient, and the most common, vector norm is the Euclidean norm(or length) of the vectorkxk:

kAk E ≡³X X

a2ij

´1/2

The Euclidean matrix norm should not be confused with the spectral norm, which is induced

by the Euclidean vector norm

kAk ≡ sup

x6=0

kAxk

The spectral norm is the largest eigenvalue of A0A

Other types of norms are possible For example, the H¨older norm is defined as

h n (x) ≡ ³X

|x i | n

´1/n

and similarly for matrices, with the additional requirement thatn 6 2 ρ(x) ≡ max |x i |

andM(A) ≡ max |a ij | for an n × n matrix are also norms.

All norms have the following properties (A denotes a vector or matrix):

Trang 18

Note that if A is symmetric, then the above look like the standard results from calculus:

∂ax2/∂x = 2ax, ∂2ax2/∂x2 = 2a.

0.3 Constrained Optimization

The conditions for an unconstrained strong local maximum of a function of several variablesare that the gradient vector and Hessian matrix with respect to the decision variables be zeroand negative definite:

∇f = 0, z0 (Hf )z < 0 all nonzeroz. (37)(For an unconstrained strong local minimum, the Hessian matrix must be positive definite.)

Trang 19

0.3 Constrained Optimization 7

The Method of Lagrange

For maximization (or minimization) of a function subject to an equality constraint, we useLagrangian methods For example, to solve the problemmax f (x) subject to g(x) = a, we

define the Lagrangian

L(x, λ) ≡ f (x) − λ(g(x) − a) (38)and maximize with respect to x andλ:

∇f − λ∇g = 0, g(x) − a = 0. (39)The solution to (39) gives for x∗ the maximizing arguments and forλ ∗ the marginal cost ofthe constraint That is,

The Method of Kuhn and Tucker

For maximization (or minimization) subject to an inequality constraint on the variables,

we use the method of Kuhn-Tucker For example, the solution to the problem max f (x)

subject tox = x0is

∇f 5 0, (x − x0)0 ∇f = 0 (43)For a functional inequality constraint we combine the methods, For example, to solvethe problem max f (x) subject to g(x) ≥ a, we pose the equivalent problem max f (x)

subject tog(x) = b, b ≥ a Form the Lagrangian

L(x, λ, b) ≡ f (x) − λ(g(x) − b), (44)and use the Kuhn-Tucker method to get

Trang 20

Here x1and x2 are vectors ofn and m control variables The first set must be nonnegative

from (46d) Equation (46a) is the objective function A11, A12, and b1 all haver rows, and

(46b) represents inequality constraints A21, A22, and b2 haveq rows, and (46c) represents

In this case z1 and z2 are vectors ofr and q elements, respectively For each of the n

non-negative controls in the primal (x1) in (46d), there is one inequality constraint (47b) in thedual For each of the m unconstrained controls (x2) there is one equality constraint (47c).The r inequality constraints in the primal (46b) correspond to the r nonnegative controls

in the dual z1, and theq equality constraints (46c) correspond to the q unconstrained dual

controls z2

We state without proof the following theorems of duality

Theorem 1 For the primal and dual problems in (46) and (47), one of the following four

Trang 21

0.4 Probability 9

Theorem 2 For any feasible x and z, p(x) − d(z) ≥ 0

Theorem 3 (of complementary slackness) For the optimal controls, either x ∗

1i = 0 or the

ith row of (47b) is exactly equal Also, either z ∗

1i = 0 or the ith row of (46b) is exactly

equal.

Each dual variable can be interpreted as the shadow price of the associated constraint inthe primal That is, the optimal value of a dual variable gives the change in the objectivefunction for a unit increase in the right-hand side constraint (provided that the optimal basisdoes not change)

0.4 Probability

Central and Noncentral Moments

The moments of a random variablex about some value a are defined as˜

provided the integral exists Ifa is taken as the mean of ˜ x, then the moments are called

central moments and are conventionally written without the prime For any other a the

moments are noncentral moments The most common alternative isa = 0.

Central and noncentral moments are related by

¶

µ 0 n−i (−µ 01)i (49)

Characteristic Function and Related Functions

The characteristic function for the density functionf (x) is

Trang 22

Probability Limit Theorems

Let x˜i, represent a sequence of independent random variables If the x˜i are identicallydistributed and have finite expectationµ, then for any constant ε > 0,

n

Ã nX

Trang 23

Bivariate Normal Variables

Letx˜1 and x˜2 be bivariate normal random variables with meansµ i, variancesσ2

i, and varianceσ12 The weighted sumw1x˜1+ w2x˜2is normal with mean and variance

co-µ = w1µ1 + w2µ2, σ2 = w12+ σ12+ 2w1w2σ12+ w22+ σ22. (61)For suchx˜1andx˜2and for a differentiable functionh(x),

Cov[˜x1, h(˜ x2)] = E[h 0(˜x2)]σ12. (62)This property can be proved as follows If x˜1 and x˜2 are bivariate normals, then fromour understanding of regression relationships we may write

f (x2)

dx2 = −

x2 − µ2

σ2 2

upon integrating by parts Then if h(x2) = o(exp(x2

2)), the first term vanishes at bothlimits, and the remaining term is justE[h 0(˜x2)]σ12

Trang 24

ncan be derived by settingit = n in the characteristic function for a normal

random variable (55) A quantity that is very useful to know in some financial models isthe truncated meanE(z; z > a):

“Fair Games”

If the conditional mean of one random variable does not depend on the realization of

an-other, then the first random variable is said to be conditionally independent of the second.

That is, x is conditionally independent of ˜˜ y if E[˜ x|y] = E[˜ x] for all realizations y If, in

addition, the first random variable has a zero mean, then it is said to be noise or a fair game

with respect to the second,E[˜x|y] = 0.

The name “conditional independence” is applied because this statistical property is termediate between independence and zero correlation, as we now show x and ˜˜ y are un-

in-correlated ifCov(˜x, ˜ y) = 0; they are independent if Cov(f (˜ x), g(˜ y)) = 0 for all pairs of

functionsf and g Under mild regularity conditions ˜ x is conditionally independent of ˜ y if

and only ifCov[˜x, g(˜ y)] = 0 for all functions g(˜ y).

To prove necessity, assumeE[˜x|y] = ¯ x Then

Cov[˜x, g(˜ y)] = E[˜ xg(˜ y)] − E[g(˜ y)]E[˜ x]

= E[E[˜xg(˜ y)|y]] − E[g(˜ y)]¯ x

= E[g(˜ y)E[˜ x|y]] − E[g(˜ y)]¯ x = 0.

(71)The second line follows from the law of iterated conditional expectations In the third line

g(y) can be removed from the expectation conditional on y, and the last equality follows

from the assumption that the conditional expectation ofx is independent of y.

In proving sufficiency we assume thaty is a discrete random variable taking on n out-˜comesy i, with probabilitiesπ i Define the conditional excess meanm(y) ≡ E[˜ x|y] − E[˜ x].

Trang 25

0.4 Probability 13

Then for any functiong(y),

Cov[˜x, g(˜ y)] = E[(˜ x − ¯ x)g(˜ y)]

= E[E(˜x − ¯ x|y)g(˜ y)]

by assumption Now consider the set ofn functions defined as g(y; k) = i k wheny = y i

fork = 1, , n For these n functions the last line in (72) may be written as

whereΠ is a diagonal matrix with Πii = π i, G is a matrix withG ki = i k, and m is a vectorwithm i = m(y i ) Since Π is diagonal (and π i > 0), it is nonsingular G is nonsingular by

construction Thus the only solution to (73) ism = 0 or E(˜ x|y) = E(˜ x).

Ify is continuous, then under mild regularity conditions the same result is true Note that

the functions used were all monotone Thus a stronger sufficient condition for conditionalindependence is thatCov[x, g(y)] = 0 for all increasing functions.

A special case of this theorem is x is a fair game with respect to ˜˜ y if and only if

E(˜x) = 0 and Cov[˜ x, g(˜ y)] = 0 for all functions g(y) Also equivalent is the statement

thatE[˜xg(˜ y)] = 0 for all functions g(y).

Stochastic Processes

A stochastic process is a time series of random variables ˜ X0, ˜ X1, , ˜ X N with tionsx0, x1, , X N Usually the random variables in a stochastic process are related in

Trang 26

where theε˜nare independent and identically distributed µ, which is the expected change

per period, is called the drift Another stochastic process is the autoregressive process

˜

where again the error terms are independent and identically distributed

Markov Processes

A Markov process is a stochastic process for which everything that we know about its future

is summarized by its current value; that is, the distribution ofX n, is

are special cases of martingales

More generally, the stochastic process { ˜ X n } N

0 is a martingale with respect to the theinformation in the stochastic process{ ˜ Y n } N

0 if, in addition to (80a),E[ ˜X n+1 |y0, y1, , y n ] = x n (81)

It is clear from (81) thatx nis a function ofy0, , y n (In particular, it is the expectedvalue of ˜X n+1.) Thus, (81) is a generalization of (80b) The martingale property can also

be stated as ˜X n+1 − ˜ X nis a fair game with respect to{ ˜ Y i } n

E[ ˜X n+1 |y0, y1, , y n] = E

hE[ ˜X n+1 | ˜ Y0, ˜ Y1, , ˜ Y n+1 ]|y0, y1, , y n

i

. (83)

Trang 27

0.4 Probability 15

But ˜X n+1is a function of y0, , y n+1 (that is, it is not random), so the inner expectation

is just the realizationx n+1, which is also, by definition,E[ ˜X|y0, , y n+1] Therefore, theright-hand side of (83) is

E

hE[ ˜X|y0, y1, , y n+1 ]|y0, y1, , y n

i

= E[ ˜X|y0, y1, , y n ] ≡ x n

(84)

Now equating (83) and (84) verifies the martingale property (81)

One important property of martingales is that there is no expected change in the level

over any interval This is obviously true for a single period from the definition in (81) Over

longer intervals this can be proven as follows Take the unconditional expectation of bothsides of (81) Then

E[ ˜X n+1] = E

hE[ ˜X n+1 | ˜ Y0, ˜ Y1, , ˜ Y n]

i

Then by inductionE[ ˜X n+m] = E[ ˜X n ] for all m > 0.

Trang 28

Chapter 1

Utility Theory

It is not the purpose here to develop the concept of utility completely or with the mostgenerality Nor are the results derived from the most primitive set of assumptions All ofthis can be found in standard textbooks on microeconomics or game theory Here rigor istempered with an eye for simplicity of presentation

1.1 Utility Functions and Preference Orderings

A utility function is not presumed as a primitive in economic theory What is assumed isthat each consumer can “value” various possible bundles of consumption goods in terms ofhis own subjective preferences

For concreteness it is assumed that there are n goods Exactly what the “goods” are

or what distinguishes separate goods is irrelevant for the development The reader shouldhave some general idea that will suffice to convey the intuition of the development Typ-ically, goods are different consumption items such as wheat or corn Formally, we oftenlabel as distinct goods the consumption of the same physical good at different times or indifferent states of nature Usually, in finance we lump all physical goods together into asingle consumption commodity distinguished only by the time and state of nature when it

is consumed Quite often this latter distinction is also ignored This assumption does littledamage to the effects of concern in finance, namely, the tradeoffs over time and risk Then

vector x denotes a particular bundle or complex of x iunits of goodi Each consumer selects

his consumption x from a particular setX We shall always take this set to be convex and

and read as “x is (strictly) preferred to z” and “x is equivalent to z.”

The following properties of the preordering are assumed

Axiom 1 (completeness) For every pair of vectors x ∈ X and z ∈ X either x % z or

z % x.

Trang 29

2 Utility Theory

Axiom 2 (relexivity) For every vector x ∈ X , x % x.

Axiom 3 (transitivity) If x % y and y % z, then x % z.

Axioms are supposed to be self-evident truths The reflexivity axiom certainly is Thecompleteness axiom would also appear to be; however, when choices are made under un-certainty, many commonly used preference functions do not provide complete orderingsover all possible choices (See, for example, the discussion of the St Petersburg paradoxlater in this chapter.) The transitivity axiom also seems intuitive, although among certainchoices, each with many distinct attributes, we could imagine comparisons that were nottransitive, as illustrated by Arrow’s famous voter paradox This issue does not loom large

in finance, where comparisons are most often one dimensional

Unfortunately, these three axioms are insufficient to guarantee the existence of an

ordi-nal utility function, which describes the preferences in the preordering relation An ordiordi-nal

utility function is a functionΥ from X into the real numbers with the properties

Examples of a preording satisfying these three axioms but which do not admit to

rep-resentation by ordinal utility functions are the lexicographic preorderings Under these

preferences the relative importance of certain goods is immeasurable For example, for

n = 2 a lexicographic preordering is x Â z if either x1 > z1 orx1 = z1 andx2 > z2 Thecomplexes x and z are equivalent only if x = z In this case the first good is immeasurablymore important than the second, since no amount of the latter can make up for a shortfall

of x∗

With just these four axioms the existence of a continuous ordinal utility function over

e consistent with the preordering can be demonstrated Here we simply state the result.The interested reader is referred to texts such as Introduction to Equilibrium Analysis byHildenbrand and Kirman and Games and Decisions by Luce and Raiffa

Theorem 1 For any preordering satisfying Axioms 1-4 defined over a closed, convex set of

complexes X , there exists a continuous utility function Υ mapping X into the real line with the property in (3).

1.2 Properties of Ordinal Utility Functions

The derived utility function is an ordinal one and, apart from continuity guaranteed by theclosure axiom, contains no more information than the preordering relation as indicated in

Trang 30

1.2 Properties of Ordinal Utility Functions 3

(3) No meaning can be attached to the utility level other than that inherent in the “greaterthan” relation in arithmetic It is not correct to say x is twice as good as z ifΥ(x) = 2Υ(z).Likewise, the conclusion that x is more of an improvement over y than the latter is over zbecauseΥ(x) − Υ(y) > Υ(y) − Υ(z) is also faulty.

In this respect if a particular utility function Υ(x) is a valid representation of somepreordering, then so isΦ(x) ≡ θ[Υ(x)], where θ(·) is any strictly increasing function We

shall later introduce cardinal utility functions for which this is not true

To proceed further to the development of consumer demand theory, we now assume that

Assumption 1 The function Υ(x) is twice differentiable, increasing, and strictly concave.

This assumption guarantees that all of the first partial derivatives are positive where, except possibly at the upper boundaries of the feasible set Therefore, a marginalincrease in income can always be profitably spent on any good to increase utility Theassumption of strict concavity guarantees that the indifference surfaces, defined later, arestrictly concave upwards That is, the set of all complexes preferred to a given complexmust be strictly convex This property is used is showing that a consumer’s optimal choice

every-is unique

The differentiability assumption is again one of technical convenience It does forbid,for example, the strict complementarity utility functionΥ(x1, x2) = min(x1, x2), which isnot differentiable wheneverx1 = x2 (See Figure 1.2.) On the other hand, it allows us toemploy the very useful concept of marginal utility

A utility function can be characterized by its indifference surfaces (see Figures 1.1 and1,2) These capture all that is relevant in a given pre ordering but are invariant to anystrictly increasing transformation, An indifference surface is the set of all complexes ofequal utility; that is,{x ∈ X |x ∼ x ◦ } or {x ∈ X |Υ(x = Υ ◦ )} The directional slopes of

the indifference surface are given by the marginal rates of (commodity) substitution Usingthe implicit function theorem gives

Trang 31

1.3 Properties of Some Commonly Used Ordinal Utility Functions

An important simplifying property of utility is preferential independence Two subsets of

goods are preferentially independent of their complements if the conditional preferenceordering when varying the amounts of the goods in each subset does not depend on theallocation in the complement In other words, for x partitioned as (y, z),

[(y1, z0) % (y2, z0)] ⇒ [(y1, z) % (y2, z)] for all y1, y2, z, (1.6a)(y0, z1) % (y0, z2) ⇒ (y, z1) % (y, z2) for all y, z1, z2. (1.6b)Preferential independence can also be stated in terms of marginal rates of substitution,Two subsets with more than one good each are preferentially independent if the marginalrates of substitution within each subset (and the indifference curves) do not depend uponthe allocation in the complement

A preferentially independent utility function can be written as a monotone transform of

Mutual preferential independence is obviously a necessary condition for additivity as well.One commonly used form of additive utility function is a sum of power functions, Theseutility functions can all be written as

Υ(x) =

P

a i x γ i

Trang 32

1.4 The Consumer’s Allocation Problem 5

These include linearγ = 1 and log-linear γ = 0, −γ(x) = Pa i log(x i), as limiting cases

An equivalent representation for the log-linear utility function is

Φ(x) = exp[Υ(x)] =Yx a i

This is the commonly employed Cobb-Douglas utility function

Each of these utility functions is homothetic as well as additive That is, the marginal

rates of substitution depend only on the relative allocation among the goods For linearutility the marginal rates of substitution are constant

which depends only on the ratio of consumption of the two goods

Forγ = 1 the goods are perfect substitutes, and the indifference curves are straight lines.

Another special case isγ → −∞, in which the marginal rate of substitution −dx i /dx j isinfinite ifx i > x j and zero forx i < x j This is the strict complementarity utility functionmentioned earlier

In the intermediate case γ < 0, the indifference surfaces are bounded away from the

axes, For a level of utilityΥ0 the indifference surface is asymptotic to x i = (γΓ0/a i)1/γ.Thus, a limit on the availability of any single good places an upper bound on utility For

0 < γ < 1 the indifference surfaces cut each axis at the point x i = Υ0a −1/γ i , and forγ = 0

the indifference surfaces are asymptotic to the axes so no single good can limit utility

1.4 The Consumer’s Allocation Problem

The standard problem of consumer choice is for consumers to choose the most preferredcomplex in the feasible set In terms of utility functions they are to maximize utility subject

to their budget constraints, Formally, for a fixed vector p of prices and a given feasible set,the consumer with wealthW and utility function Υ solves the problem

Because X contains the null vector a feasible solution to (15) exists for p > 0 and

W > 0 Since it is unbounded above, the optimum will not be constrained by the scarcity

of any good, and since it is bounded below the consumer cannot sell unlimited quantities

of certain “goods” (e.g., labor services) to finance unlimited purchases of other goods

Trang 33

6 Utility Theory

Theorem 2 Under Assumptions 1 and 2 the consumer’s allocation problem possesses a

unique, slack-free solution x ∗ for any positive price vector, p > 0, and positive wealth.

Proof The existence of a solution is guaranteed because the physically available set

X is closed and bounded below by assumption x i > x l

i The setE of economically

fea-sible complexes is bounded above (Each x i 6 (W −Pj6=i x l

j p j )/p i ) Thus, the set of

feasible complexes F , the intersection of X and E , is closed and bounded; that is, it is

compact Furthermore, it is not empty since x= 0 is an element But since Υ(x) is uous, it must attain a maximum on the setF If the solution is not slack free, p 0x∗ < W ,

contin-then the slack s ≡ W − p 0x∗ can be allocated in a new complex x ≡ x ∗ + δ , where

δ i = s/np i > 0 This new complex is feasible since p 0x = W , and it must be preferred

to x∗ (since x > x ∗), so x∗ cannot be the optimum Now suppose that the optimum is notunique If there is a second optimum x0 6= x ∗, thenΥ(x0) = Υ(x∗ ) But F is convex since

X is, so x = (x0 + x∗ )/2 is in F , and, by strict concavity, Υ(x) > Υ(x ∗), which is aviolation Q.E.D

Note that existence did not require Assumption 1 apart from continuity of Υ, whichcan be derived from the axioms The absence of slack required the utility function to beincreasing Uniqueness used, in addition, the strict concavity

To determine the solution to the consumer’s problem we form the Lagrangian L ≡

Υ(x) + λ(W − p 0x) Since we know the optimal solution has no slack we can impose anequality constraint The first-order conditions are

1.5 Analyzing Consumer Demand

The solution to the consumer’s allocation problem can be expressed as a series of demandfunctions, which together make up the demand correspondence

x ∗

Trang 34

1.5 Analyzing Consumer Demand 7

Under the assumption that the utility function is twice differentiable, the demand functionscan be analyzed as follows

Budget Line

Optimum

Figure 1.3 Consumer’s Maximization Problem

Quantity of Good One x

1

x2

Quantity of Good Two

Take the total differential of the system in (16) with respect to x, p,λ, and W :

Trang 35

8 Utility Theory

Equation (24) is the Slutsky equation The first term measures the substitution effect, andthe second term is the income effect The former can be given the following interpretation.Consider a simultaneous change inp j andW to leave utility unchanged Then

0 = dΥ = ∂Υ

∂x 0 dx = λp 0 dx, (1.25)where the last equality follows from (16a) Substituting (25) into (19b) givesdW = dp 0x,which, when substituted into (22), gives

The direct substitution effect ∂x ∗

i /∂p i |Υ must be negative because the bordered Hessianand, hence, its inverse are negative definite The income effect can be of either sign If

∂x ∗

i /∂W < 0, then good i is an inferior good and the income effect is positive If the

good is sufficiently inferior so that the income effect dominates the substitution effect and

1 will rotate the budget line clockwise The substitution effect,

a movement from pointsA to B, involves increased consumption of good 2 and decreased

consumption of good 1 at the same utility The income effect is the movement from points

Trang 36

1.6 Solving a Specific Problem 9

1.6 Solving a Specific Problem

Consider an investor with the log-linear utility functionΥ(x, z) = α log(x)+(1−α) log(z).

The first order conditions from (16a) are

= αz ∗

p x > 0. (1.32)

Định dạng
Số trang	390
Dung lượng	2,54 MB