π¨βπ« Notes
Markowitz Portfolio Optimization
The bulk of this lecture is based on Markowitz Portfolio optimization, which is an process for finding optimal allocations in a portfolio. It was derived from classical tools of probability.
It has three steps:
- Find the optimal risky portfolio.
- Build the CAL. The CAL is like a menu of different complete portfolios that an investor may choose, based on their risk aversion.
- Choose the complete portfolio off of the CAL that matches the investorβs risk preferences.

This entire process is illustrated in the following Excel spreadsheet:

Find the optimal risky portfolio
The following slide show a pair of assets. As you can see from the diagram, Asset 1 has a lower Ο and Er. Specifically, it looks like and , whereas and . Therefore, we can think of asset 1 as bonds and asset 2 as stocks.

The purple line shows all combinations of assets 1 and 2 that the investor can own. For example, the highest black dot represents a portfolio invested 100% in stock and the lowest black dot represents a portfolio invested 100% in bonds. The other points on the purple line represent all of the other possible weightings of the portfolio.
The middle black point represents a portfolio that is 50% invested in stocks and 50% invested in bonds. This investor puts half of their money in each asset. Note the following label for this portfolio:

The second half of the label indicates that the expected return of the β50/50 portfolioβ is a 50/50 weighted average of the expected returns of stocks and bonds:
Likewise, the label also indicates that the standard deviation of the 50/50 portfolio is lower than the 50/50 weighted average of the standard deviation:
This is due to the benefits of diversification when the correlation between the assets is less than 1 (which it almost always is).
The followings slide indicates that you can get the βbestβ (=highest return) CAL from the point on the curved line:
- where the CAL is tangent to the curved line
- the Sharpe ratio is highest

To find this risky portfolio, we plot out all points on the curved line and choose the one with the highest Sharpe ratio. To plot each point, weβll need formulas for and . Weβll show these in the following section.
The full process of finding the optimal risky portfolio is illustrated below in Excel:

π₯
πHow do you determine which specific various weighted combinations to include when calculating the optimal risky portfolio?(βStep 1β) Would these be the points that make up the minimum variance frontier?
See answer
You want to include enough points that you can get very close to the truly optimal portfolio. However, there is no scientifically best approach. At the bottom of the spreadsheet, I use a computer optimization program for this to find an optimal risky portfolio that is 29.1429% bonds and 70.8571% stocks. Here is a far better tool to use than Excel:https://qontigo.com/products/axioma-portfolio-optimizer/
The key equations for and
In the previous lecture, we talked about how investing is a tradeoff between risk and return . We investigated this tradeoff as an investor allocated their assets between a risky and a risk-free asset. In this lecture, we extend the analysis. We consider how an investor can find the optimal risky portfolio to mix with the risk-free asset.
This involves another risk-return tradeoff, as we allocate between two (or more) risky assets. As before, we look at and . Our new formulas for them are:
where
Note, Ο1,2 refers to the correlation between asset 1 and asset 2. Also, if youβreg not given Ο1,2 then you can use Ο1,2 = Ο_1Ο_2Cov(r_1, r_2).
βοΈImage you are making a portfolio out of three mutual funds:
E(r) | sd | ||
---|---|---|---|
T-Bill fund = rf | 2.4% | 0 | |
Bond fund | 4% | 9% | |
Stock Fund | 10% | 15% | Ο1,2=-0.221 |
Imagine a risky portfolio consisting of 60% stocks and 40% bonds. What are the expected return, variance, and standard deviation of this portfolio?
βLetβs just decide that bonds are asset 1 and stocks are asset 2.
π§ Key insight:
It turns out that these new formulas are derived from the same classical formulas from probability that last weekβs key formulas are derived from:
2 Pairs of Statistics & 4 Helper Formulas
Bruce derived the previous four formulas from the following two classic formulas.
These are worth knowing well!
I prefer the risk equations that use standard deviation rather than variance and that use correlation rather than covariance:
- Standard deviation is measured in percentage points like returns are, so itβs easy to interpret.
- Correlation is always between -1 and 1, so itβs also easy to interpret.
βοΈ Redo the last question, but imagine that instead of being told the standard deviation of the risky assets, you were told the variances. In particular the variance of the bond fund is .0081 and the variance of the stock fund is .0225.
βHere, we just convert the variance to standard deviation and then just redo the problem as before. To convert between Ο and Var, we use the following formulas:
Therefore,
Because these two standard deviations are identical to the standard deviations in the previous problem, we just do the exact same calculations and get the exact same answers.
Clearly, these formulas are helpful and βnot incredibly hard to useβ once you get used to them. But you should be asking where they come from.
βοΈ Consider the following data. What is the covariance between stocks and bonds?
E(r) | sd | ||
---|---|---|---|
T-Bill fund = rf | 2.4% | 0 | |
Bond fund | 4% | 9% | |
Stock Fund | 10% | 15% | Ο1,2=-0.221 |
βThe helper formulas we want are on this slide:

Cov(r_1, r_2) = Ο1,2Ο_1Ο_2 = Ο1,2 * Ο1 * Ο_2 = -0.221 * 9% * 15% = -0.002984
βοΈ In the above example, if the Covariance was .01, what would the correlation be?
β Ο1,2 = (Cov r_1,r_2)/(Ο_1*Ο_2) = .01/(9% * 15% ) = .74
What if there are more than two assets?
The two formulas above can be used, as we saw, to build the optimal portfolio of risky assets when there are only two assets. But what if there are more than two assets?
In this case, there are more complicated formulas based on linear algebra (vectors and matrices). YOU ARE NOT RESPONSIBLE FOR THESE MORE COMPLICATED FORMULAS. But itβs nice to know that the material weβve learned can be extended to optimize larger portfolios.
When you have six or more assets, you can use the extended formulas to calculate a βminimum variance frontier:β

If you look at all of the portfolios you can construct with six assets and then plot their E(rP) and ΟP, then you will get a cloud-like shape as above. If you isolate all portfolios that have the lowest possible variance for a specific expected return, you get the βminimum variance frontier.β

The portfolio with the lowest variance/sd of all portfolios made from those six assets is the βGlobal Minimum Varianceβ portfolio. If we throw away the portfolios that are beneath a better portfolio, we get the βEfficient Frontier,β which are the relatively βeliteβ portfolios that are left over.
The following graphic shows many example risky portfolios constructed as mixtures of the available securities. Note that the red dot portfolios cluster around the minimum variance frontier:

By choosing the portfolio in the efficient frontier with the highest Sharpe ratio, we get the optimal risky portfolio:

Using this, we can construct our CAL.
Build the CAL
Our end-goal is to find an optimal risky portfolio that we can use to build a Capital Allocation line. Then we can choose a complete portfolio from the CAL that matches our risk preferences.
To plot, just make a table with various values of y. For each value of y, you just apply our two main formulas for the CAL.
Formula | |
---|---|
E(rC) = rF + y[E(rP) - rF] | ΟC = y ΟP |
A far more complex example is illustrated here:

Risk preferences
βοΈ An investor has the following utility function. Their coefficient of risk aversion, A, is 2.5 and their utility function is
Consider the following three portfolios. Which portfolio is best for this investor?
1 | 2 | 3 | |
---|---|---|---|
y | 10% | 50% | 110% |
Erc | 3.45% | 5.27% | 8% |
Οc | 1.11% | 5.55% | 12.22% |
β
Suppose that there are three portfolios, L, M, and H:

Also, imagine there are three investors. Each investor has a different A.
In the following table each investor is represented by a row and each portfolio is represented by a column. In each cell, heβs put the utility of the portfolio for that specific investor. For example in the upper right, we have the utility of portfolio L for an investor with A=2.

We can see from the table above that the first investor (A=2.0) will choose portfolio H. The second investor will choose portfolio M, and the third will choose portfolio M as well.
In the above example, the choice of optimal complete portfolio depends on the level of risk aversion. However, there are some portfolios that should never be chosen (regardless of risk aversion) because they are dominated by another portfolio.
Portfolio A dominates portfolio B if:
and
If a portfolio dominates the other, it is βbetter on both dimensions.β A dominated portfolio will never be chosen if the investor could, instead, choose the dominating portfolio.
βοΈ Of the following four portfolios, which portfolio would never be chosen by any investor? (Hint: look for two portfolios where one portfolio dominates the other. Ie look for a pair of portfolios where one portfolio has a lower Er and a higher Ο.)
βD is dominated by A because
- D has a lower expected return: βΉοΈE(rD)= 8% < E(rA)=9%π
- D has a higher risk: βΉοΈΟD= 11% > ΟA= 10%π
βοΈ The following table shows Er, Ο and U for 11 portfolios on the same CAL. The bonus exercise below shows how we can determine that rF = .07, E(rP)=.150, and ΟP=.220, just by looking at the table. With some algebra, we can also find A=4.
Fill in the blank in the table. In other words, what is the utility when y=50%.

β When y=50%,
π§ This number is of a similar magnitude to the other utility numbers, so we can be confident in it!
Bonus exercise:
βοΈ From what you know about the CAL, in the example above, what are rF, E(rP) and ΟP?
Hint: how does E(rC) compare to E(rP) when y=1?

β When y=100%, E(rC)=E(rP). The tables shows us that E(rC) is .150 when y=1.0. Therefore, E(rP)=.150.
Similarly, when y=100%, ΟC=ΟP . The tables shows us that ΟC is .220 when y=1.0. Therefore, ΟP=.220.
Finally, when y=0%, E(rC) = rF. The table shows us that E(rC)= .070 when y=0, so rF = .07.
Bonus exercise 2:
βοΈ What is A?
Plug and chug: (help)
- Equation:
- Plug:π
Letβs look at when y = 1 and plug in: U = E(r)-.5AΟ^2
- Solve: π
- Reflect: π§
The investor on that table has A=4.
In the following slide, he plots utility for each of the values in the table above:


The conclusion is that there is a specific point where utility is maximized. Looking at both the table and the chart, you can see that the utility is maximized around y=.4.
Therefore, for an investor with a risk aversion of A=4, the optimal capital allocation decision is for them to invest roughly 40% of their assets in the risky portfolio.
βοΈ In the following diagram, what is the utility of portfolios A, B, C, D, and E?

Portfolio A is on the dashed black line, so it receives a utility of .07.
Portfolio B is on the solid blue line, so it receives a utility of .078
βοΈ If the values of y for each of the βletteredβ portfolios is given on the diagram, what percentage of their assets should the investor allocate to risk assets?
β The highest utility is reached at portfolio C, so the investor should invest 42% of their investable wealth in risk assets.
Using algebra/Plug and chug, (help) we could solve for the portfolio weights that will give us a specific Er in our risky portfolio.
β¦or we could just use the formula Bruce gave us:

βοΈ Suppose you want to make a risky portfolio with an expected return of 7% out of the following two assets. What should you invest in each asset?
E(r) | sd | |
---|---|---|
Bond fund | 4% | 8% |
Stock Fund | 9% | 15% |
β
Plug and chug: (help)
- Equation:
Our main formula for Er in this lecture is:
- Plug:π
All of our money will either be invested in asset 1 or asset 2. Therefore, w_2 = 100%-w_1. If we substitute 100%-w_1 in for w_2, then we get:
- Solve: π
If we solve that equation for w_1 we get the formula Bruce gave us, above.