Lecture Notes 8, Black-Scholes-Merton Option Valuation – 3 separate explanations!

K Foster, CCNY, Spring 2010

 

 

Learning Outcomes (from CFA exam)

Students will be able to:

§  explain the assumptions underlying the Black–Scholes–Merton model and their limitations;

§  use the Black-Scholes-Merton model to calculate options prices;

 

From Hull Chapter 13

 

The Black-Scholes-Merton formula is one of the highlights of finance theory.  Mark Rubinstein, past president of the American Finance Association, described it as, "one of the most successful in the social sciences and has perhaps … the most widely used formula, with embedded probabilities, in human history" (Journal of Finance, 49(3), p. 772).  Nonetheless, it can be frustrating to learn it, since the key formula seems pulled out of thin air to solve a differential equation.  You can read the original articles (from JStor in the library):

Black, Fischer, and Myron Scholes, (1973).  "The Pricing of Options and Corporate Liabilities," The Journal of Political Economy, 81(3), 637-54.

Merton, Robert C., (1973).  "Theory of Rational Option Pricing," The Bell Journal of Economics and Management Science, 4(1), 141-83.

 

 

Assume that VS is Wiener process, VS = µSdt + sSdz

 

If this is a model of a stock price then we want to look at how the returns vary, where the percentage return is (S1 - S0)/S0 = ∆S/S and from calculus, the derivative of ln(S) is dS/S or ∆S/S.

 

So first find the distribution of ln(S).  By Itô's Lemma:

we have to find the first and second derivatives of ln(S):

, , and of course , so that:

 

.

 

This probably doesn't seem intuitive at all: why do we need to subtract off a term for the volatility?

 

Nonetheless, this implies that ln(S) has a normal distribution, which can also be stated as S has a log-normal distribution.  If a normal distribution looks like this:

a log-normal distribution looks like this:

since exp(0) = 1, the median is still at 1 however the mean is skewed to the right.  This might seem like a better model for stock changes: a stock price can never be negative (this defines the limited liability corporation) but the upside gains can be huge (although with a very tiny probability).

 

So Itô's Lemma tells us that the distribution of ln(ST) is going to be normal with mean  and standard deviation .  Alternately we could express this by noting that ln(ST) – ln(S0) = ln(ST/S0) has mean  and the same standard deviation.

 

Now we want to figure the distribution of x, where x is the return on a stock over a period of time,

.  This is just dividing through by a constant (T) so the new mean is going to be  and the standard deviation will be .

 

Assumptions of Black-Scholes-Merton model:

  • Stock has log-normal distribution
  • Short-selling is permitted without restrictions
  • No transactions costs or taxes
  • No riskless arbitrage opportunities
  • Continuous trading of securities
  • Riskfree interest rate, r, is constant for all maturities

 

Some of these assumptions will be later relaxed but we'll keep them for now.

 

Black-Scholes formulas:

Modify the intrinsic value formulas (and recall the lower bound formulas) to get something that looks like the expected present value analogs:

 

call payoff:      

Remember we found that it has a lower bound of S0Ke-rT, which in some sense is the present-value analog of its intrinsic value.

From what we learned about risk-neutral valuation, we might think of taking some sort of risk-neutral expectation of the call value to find its price; this might mean finding some probabilities, π1 and π2, such that

                                π1S0 - π2Ke-rt

gives the price of the call.  This is an excellent guess!  Figuring out the exact functions for these probabilities is a bit tricky, but that's what BSM got a Nobel prize for.  The formula just fills in Normal calls for those probabilities.

call price:          

Similarly for puts,

put payoff:        

has lower bound of Ke-rT - S0 so the BSM formula fills in probabilities, π3 and π4, in the formula,

                                π3Ke-rt  - π4S0  to get

put price:           ,

 

where  and  or .

 

The d1 and d2 parameters can be seen as measuring the location of the strike price relative to the expected value of the stock.  They are the standardized measure of how many standard deviations from the mean is the value, .  These probabilities discount the intrinsic value terms for the call and put.

 

We are given a present stock price and a strike price, and we want to figure out how far away the strike price is, from the likely future value of the stock.  How far is far?  A few pennies might not be much if the stock trades at $75 per share but for "penny stocks" it could be a very long way.  It is the volatility that gives the answer.  So we want to take the distance between S0 and K (actually the natural log of the distance, since we're assuming a distribution of returns), {lnS0lnK}, subtract the mean (this is the  term), and divide by the standard deviation.

 

Consider the problem graphically from the standpoint of risk-neutral valuation.  We have S0 and K, where we assume that ST is distributed log-normal.  So if we were valuing an out-of-the-money call, we might have something like this:

Where the red shaded area is where the call would pay off.  The first step is to translate this log-normal distribution into a standard normal, which is done by transforming the mean and variance, so that we get something like:

Then we want to find the expected value of a payoff that starts at zero at K and rises above that level.

 

This call option has value max(ST – K, 0) so its present discounted expected value is e-rTE[max(ST – K, 0)], which is .  The reason for the separate terms d1 and d2 is that they relate to different expected values: a call option entitles the bearer to get ST if ST>K but costs K (but no more) if ST>K.  If we rewrite the Black-Scholes formula, , then the second term is the probability that the stock will get as high as the strike price, K.  The first term is the probability that the call will return the stock price (i.e. that it will be above the strike) – and will be zero if not.

 

The Black-Scholes formula is a solution to the Black-Scholes differential equation (which is a form of the heat equation).  Return to the "delta hedging" argument.  We could buy V shares of stock to form a perfect hedge against the movements of an option.  (Actually it is only a perfect hedge for infinitesimal changes, but for now we'll elide that problem.)  What is delta?  If we label the option price as p (some price) we of course note that p is a function of the stock value and time, so we can write p(S,t).  So if the stock price changes by VS then the option price will change by , so if  then we have a perfect hedge.  So the position is (VS – p) , or, dividing through by V, (S – p/V).  Since this is a riskless portfolio, it should return rVt (or else there are arbitrage opportunities).  As there are tiny changes, the value is (dSdp/V), but we need some stochastic calculus (Itô's Lemma) to get a value of dp.  Recall that

 so, substituting back,

.  Set this equal to the riskfree rate and get

.  There is one boundary condition, which is the definition of the option, that p(x,T) = payoff (at date T, the expiration date).  The solution to this partial differential equation is the Black-Scholes equation.

 

A bit more intuition:

We can check if the formula results change as we would intuitively expect, given the different input arguments.

call price:          

put price:           ,

 

where  and  or .

 

The stock return does not explicitly appear in the formula, except insofar as it affects the current price.  The time-to-maturity for the option appears only multiplied by the risk-free interest rate and the volatility, so these aspects are intuitive enough.  As time-to-maturity, volatility, or the risk-free rate increase (T, s, or r), the value of the option rises towards its intrinsic value.

 

As volatility falls toward zero, then if the option is already in the money then the probability that it will stay there goes towards 1 and the option is worth the intrinsic value.  If the option were out of the money, then a low volatility means it is less and less likely to ever be worth more than zero.

 

As the stock price rises (S0), it becomes almost certain that the option will be exercised and the call price rises toward S0 – e-rTK, as d1 and d2 rise so that cdfN( ) of each of those gets closer and closer to 1.  A put price would go in the other direction as the stock price rose; again -d1 and -d2 would drive the cdfN( ) terms toward zero.

 

See the Excel sheet showing how changes in the parameters affect call and put prices.  (These are "the Greeks" – but we'll get to that later.)

 

Implied Volatilities

In practice, some of the crucial Black-Scholes assumptions are not true so the formulas are not exact.  However they are nonetheless often used in reverse to generate a volatility from a given option price.  If an at-the-money call price is $6.46, is that a good or a bad price?  It depends on the current price of the stock, in a nonlinear way.  A different way to price an option is to quote its implied volatility – i.e. that volatility which, put into the Black-Scholes equation, would give the price that is "really" being quoted.  This may allow a market participant to better judge whether the price is good or bad by instead asking, is the volatility reasonable?  If the implied volatility is too high then the price is high; if the implied volatility is low then the price is low.  Note that this is done, even though everyone knows that the Black-Scholes model is not the "true" description of option prices!  It still provides a handy way of translating prices.

 

Of course, figuring out these volatilities is no small task, since there is no simple formula.  It takes a computer search program to do it – to guess numbers and then work to improve the guess until it is within some criterion range.

 

Relation to Binomial Trees

Finally note that we would get the same answer by taking a binomial tree model over a fixed time period but taking more and more ever-smaller steps.

 

 

Another take on Black-Scholes-Merton

 

The basic Black-Scholes formula is pretty simple, once we get used to it.  The expected payoff to a call (for example) at time T is max{ST – K,0}.  If the call is not exercised its value is zero so it only has value if exercised, so the future payoff is STcdfN(d1) - KcdfN (d2), where d1 and d2 indicate the probabilities that the option will be exercised.  We know that a risk-neutral no-arbitrage valuation implies that ST = S0ert so we replace that to get S0ertcdfN (d1) - KcdfN (d2), which is the future value.  The present value multiplies by e-rt to get S0cdfN (d1) - Ke-rt cdfN (d2).

 

Remember our basic argument: given some stock return that follows a Wiener process, dS = µSdt + sSdz, we want to find the distribution of the function, g(S) = ln(S).  Itô's Lemma tells us that

                ,

so we figure that , , , a = µS, and b = sS, that .  Since d[lnS] is a Wiener process, we can assert that lnST has a Normal distribution with mean, under a risk-neutral measure, of

                . 

Where did that  come from?  That's the mystery of stochastic calculus. But there is another way.

 

Jay Jorgenson and Adriana Espinosa (here at CCNY) have figured out another way to derive the Black-Scholes-Merton formula, without Itô's Lemma, just some algebra.  This summarizes their paper, Espinosa and Jorgenson (2003).

 

We assume that a stock's return is given by:

               

 

Next define and thus

,

 ,

               

So that , which allows us to interpret the  as the percent return; also note that .

 

We assume that the ΔS are independent and identically distributed.  Independent means that the ΔS at time t does not depend on the previous day's (or week's or month's) ΔS; the assumption of identical distribution allows us to assert that its variance is constant over time.  The independence is a consequence of weak market efficiency, since if the change in stock price could have been predicted then why wasn't it arbitraged away?

 

Remember, from stats, the basic relation that if A and B are independent then the var(A+B) = var(A) + var(B).  Now from the definition of Xt as a sum of independent but identically distributed terms, we can see that, for any number of little steps up to time t, the variance of Xt is

                ,

where, since theΔS are identical, the variance of ΔS1 is the same as ΔST or any other ΔS – and we'll define this variance as some constant, s2.

 

Why all of this work?  Because we might remember the Central Limit Theorem (CLT) from statistics.  This states that a sum of independent and identically distributed terms can converge to a distribution, as the number of steps goes higher and higher.  (I use the term "can" not "must" to remind us that there are a few technical details to be taken care of, before this is an actual proof, but for now we'll ignore those.)  The Central Limit Theorem allows us to assert that the distribution goes towards a Normal distribution.  The Central Limit Theorem implies:

               

converges to a normal distribution with some mean and standard deviation.  We have seen that the variance of lnSt must be s2t so that the standard deviation is , but what is the mean?

 

To figure out the mean of this variable, we take a detour through risk-neutral no-arbitrage pricing.  Remember that this argument contends that

                S0 = e-rt E[St],

where the E[ ] expected value function uses the probability density function f( ) and is defined as

                . 

This doesn't seem to help since we don't know the distribution function of S so we don't know the probability density function, f( ).  But the Central Limit Theorem tells us that XT = lnST has a normal distribution, so we would like to use X instead of S and then integrate with respect to the normal probability density function rather than some unknown pdf.  So we switch the variables X and S by inserting eX for S (since, by definition, ), and put in a normal probability distribution,  instead of  to write

                .

 

But the normal probability density function depends on the mean, µ, and standard deviation, s, so we can write the normal distribution function as .  We figured out that the standard deviation is  but what about µ?  For now we'll leave it as a variable, and write:

                , where the function, , is defined as

                 so that

                .

 

We have an awful lot of terms with e to some power – can we fix that?  Let's concentrate on the integrand; we see that we are integrating

.  We can transform the exponent as follows:

                 =

=

                                =            

Now what if we did the ancient mathematician's trick of adding and subtracting a number – which doesn't change the equality but can transform the variables?  So add-and-subtract these terms,  

and another , to get

                                =

                                =

                                =

=

                                =

 

So that we can re-write  as

                .

 

Why the heck is that any better?  Recall that the formula for a normal distribution involves , where M is the mean and S is the standard deviation.  The last part of the exp( ) function above looks like a normal, if we had  and .

 

So going back up to the equation, which had the risk-neutral no-arbitrage value of , we replace the integrand with the value found after all of the algebra to get:

               

 

This allows us to take the ex term that we were integrating out from under the integral, where it was a pain to deal with.  Now that it's e to the power of some constant (), we can take it to the outside of the integral.  Once that's outside the integral, then all that's left inside is the pdf of a probability function, and we know that  so that we get

                 -- wow!

 

That's a lot easier, isn't it?!  Then we can take logs and solve to find that

                . 

This is what Ito's Lemma gave (way back in the beginning), but without the stochastic calculus.  Granted, there's a lot of algebra instead, but that's more tedious than mystery.

 

So to price an option, we just use the expected value formula from above, with the proper distribution function.  A risk-neutral no-arbitrage argument tells us that a call price, c, must be:

                .

Note that we've eliminated the "max( )" function by just changing the bounds of integration – this is the conditional expectation now, the expectation of (ST – K), conditional on if the stock is greater than K.

 

If we make a change of variables so that X = ln(S) (thus, S=ex), then we can switch the probability function from involving ST to involving ex, which has a distribution that we understand – that's got mean µ (remember that and standard deviation .  As part of this change of variables, we also must change the boundary of integration, now to ln(K).  So we get:

                .

 

So now we break up the pieces and get an integral of ex (which is a problem that we solved before – we've just got to fix the bound of integration) and an integral of K, which is a constant and so easy.  So

                , which is

                ,

and where the equation with all of the algebra allows us to rewrite the first term to get

               

(note that now the first and second pdf's are different – one has a transformed mean from the ex simplification and the other does not).  But both pdf's are pretty simple: they're just integrating over a part of the curve, which is finding the cumulative distribution, cdfN( ), for some values.  So by definition

                .

For a variable, y, that doesn't have a standard normal distribution (mean zero and standard error of one), we define , where M is the mean and  is the standard error, to change variables so that

                .

 

So going back to the equation for the call price, we evaluate the piece,

                ,

where the term d1 (chosen for no special significance, just by coincidence it might seem like it could be like the Black-Scholes formula term d1!) is

(the last substitution for µ is from the earlier equation ).  This simplifies to

                 -- which should look familiar.

 

Again going back to the equation for the call price, we next evaluate the piece,

                ,

where now d2 just happens to be

                ,

which again looks familiar.

 

So the call price, c is given as

                ,

which ought to look real familiar – it's the Black-Scholes-Merton formula!

 

 

Yet another take on Black-Scholes-Merton

 

from Björk

 

This basic method of finding the BSM model remains fundamentally similar to how we solved the tree models: show that there is some combination of stock and derivative that makes a riskless portfolio, so then this portfolio must return the risk-free rate, so we can value the initial derivative position.

 

As usual in solving differential equations, this means pushing around our equation until we get it into a form that has already been solved by smart people; in this case this is the Feynman-Kac equation.

 

Again assume that there is free and costless trading of any position in the derivative, stock, and riskless bond.

 

The derivative price process changes over time; we label this  since it depends directly on time and the stock value.  As usual St is the stock price, Bt is the bond price that returns a riskless rate of return, r, the derivative matures at T, and we model the stock process as

                ,

where dW is the driving Wiener process.

 

Taking the derivative price as , we use Itô's Lemma to find the total differential,

                .

Then substitute that , , and , so

                .

Rewrite this into

                ,

where  and .

 

So now we have

                the bond price process,                           

                the stock price process,                           ,

                and the derivative price process,        .

 

Next we want to find some combination of stock and derivative positions that will make a riskless self-financing portfolio, V.  This means dividing the fraction of the portfolio invested in the stock, u, and of the fraction invested in the derivative, (1 – u), such that .

 

Recall that, from our most basic classes, we understand how to find the growth rate for some composite based on the growth rates of the parts: if my portfolio is invested ¼ in small-cap stocks and ¾ in large-cap stocks, then the growth of my portfolio is ¼(growth of small-cap) + ¾(growth of large-cap).

 

So we can write  and substitute from above to write:

               

Now this portfolio will be riskless if we choose the share invested into stock, u, correctly so as to zero out the dW term; so choose u such that , or .  This is defined as long as , which would only happen if , if the price of the derivative were unrelated to the stock price – which would be very odd indeed!  So this makes

                .

 

 

So if the value of the portfolio has no random element, then it must grow at the riskless rate, so  and .

 

Now substitute in terms, , ,  and , and note especially that the  terms drop out – the derivative price has no dependence on the stock's rate of growth.  So we get the equation,

                 with the boundary constraint that, at the expiration date T, .

 

Now for the casual person reading along, this doesn't look like it's much help, but it actually is helpful.  This is because, as I mentioned above, somebody else has already a solved tough equation just like it – it's called Feynman-Kac.

 

Sidebar:

The Feynman-Kac solution sets up the problem from a differential equation like this:

                 and at the boundary, T, .

The equation might look like a hopeless mess unless we were to remember our Itô calculus and decide that the second part looks like a part of the derivative when we have to include second-order terms, so we could write the differential equation as  if we accept some sloppy notation.

 

Then the Feynman-Kac result shows that the solution for the function, F, that solves this problem is:

                ,

where X satisfies , and the  and  functions are lifted right out of the initial differential equation.

 

It is a straightforward extension to show that if we had some , then since the product rule gives , then this would be equivalent to solving  and at the boundary, T, .  Then the modified solution is .

 

These actually allow a much more generalized Itô process than in the original Black-Scholes-Merton, since these allow the mean process  and variance process  to be functions of t and x, not just constants.

 

 

To return to the original Black-Scholes-Merton problem, we have the equation

                 with the boundary constraint that, at the expiration date T, .  Hooray!  This is exactly the formula that the Feynmac-Kac result solves for us, so we don't have to do extra hard work, they've already done it for us!  (Positive externalities of knowledge capital.)

 

In the simple case where the mean and variation processes are constants, then we can write the solution from  to get the Black-Scholes-Merton formula, that for a call,

                ,

where  and  and  is the standard normal cumulative distribution function.