School of Stocks
Home : Risk and Money Management

Covariance and Correlation

In this chapter, we will talk about Covariance and Correlation, two tools that are widely used in statistical analysis and portfolio management to measure the relationship between two securities and diversify the portfolio to reduce risks.

Abhishek Chinchalkar
12 minutes read
comment
Comments
Users learning chapter

Up until now, we have discussed the mean, variance, standard deviation, normal distribution, skewness, and kurtosis. In this chapter, we will continue our discussion on statistical measures of risk and talk about two important concepts, namely:

  • Covariance
  • Correlation

Covariance

Over the past two chapters, we talked about variance and standard deviation. These are statistical measures that are widely used to quantify volatility of returns for a single security. But what if you are interested in finding out the variation of returns between two securities? That is, what if you want to know what type of relationship exists between the returns of two securities? Well, one of the most popular tools to do this is covariance and correlation. In this section, we shall talk about covariance.

In finance, covariance is a statistical metric t hat is used to measure the directional relationship that exists between the returns of two securities. That is, covariance tells us whether the two securities share a positive relation, a negative relation, or no relation. Mathematically, covariance is calculated using the following formula:

 

covarience-formula

cov-for-exp

The above formula will give us sample covariance. Covariance can be positive, negative, or zero. A positive value means the returns of securities move in the same direction, while a negative value means the returns of securities move in the opposite direction. Meanwhile, a covariance of zero means there is no relationship between the returns of two securities. This can happen when the two securities are independent of each other.

return-data

In the above table, we have calculated the covariance between two hypothetical stocks, A and B, based on the daily price of the past ten sessions. Following are the steps that were involved in the calculation:

  • Step 1: The log return of each day for stock A has been calculated in column 3
  • Step 2: The mean return has then been calculated for stock A at the bottom of Column 3 (A')
  • Step 3: The difference between the corresponding return of each day and the mean return (A-A') has then been calculated for stock A in column 4
  • Step 4: Similarly, the log return of each day for stock B has been calculated in column 6
  • Step 5: The mean return has then been calculated for stock B at the bottom of Column 6 (B')
  • Step 6: The difference between the corresponding return of each day and the mean return (B-B') has then been calculated for stock B in column 7
  • Step 7: The product of the deviation from the mean return [i.e., (A-A')*(B-B')] has then been calculated for each day in column 8
  • Step 8: Each product arrived in step 7 is summed (Σ)
  • Step 9: Finally, the summation figure arrived in step 8 has been divided by (n-1), where n is 10 in our case. This figure that is arrived is the covariance of returns between stock A and B

The above steps show how to calculate covariance manually. However, as you may have noticed, calculating covariance this way is time consuming and tedious. Instead, there is an easier way to calculate covariance in Microsoft Excel. All that you need is historical returns of two securities and then use the =COVARIANCE.S() function is Excel.

historical_price_stocks.png

In the above image, see that once we have the historical price of two stocks, all that we need to do is calculate their log returns for each period (as has been done in columns C and E above). Once these returns are calculated, we can then calculate covariance between the returns. To understand the Excel function that we have used to calculate covariance in cell G26, look at the formula bar above. Note that because we are calculating sample covariance, we need to use =COVARIANCE.S function.

As can be seen in the image above, the covariance of returns between stock A and B is 0.00025. As the covariance is positive, the returns typically move with each other. That is, when one return is positive, the other is also positive; and when one return is negative, the other is also negative. On the other hand, what if the above covariance were negative? In that case, it would imply that the returns typically move against each other. That is, when one return is positive, the other is negative; and when one return is negative, the other is positive.

Let us now talk about some real-world examples. Let us measure the covariance between the returns of the following securities based on the daily closing price for the first three months of 2021:

  • NSE Pharma index and Nifty 50
  • Bank Nifty and Nifty 50
  • Hindustan Petroleum and L&T

covarience_daily_return.png

In the above image, look at the formula to see how the covariance between the daily returns of pharma index and nifty has been calculated in cell G11. As can be seen, the 3-month covariance between the daily returns of pharma index and nifty stood at 0.000109. Similarly, the other two covariances can be calculated. For the same period, the covariance between the daily returns of Bank Nifty and Nifty stood at 0.000224, while that between HPCL and L&T stood at 0.000123. As can be seen, each of the three covariances were positive, meaning there is a positive relation between the returns of pharma index and nifty, bank nifty and nifty, and HPCL and L&T. Meanwhile, about the Excel formula of covariance, whether you input the formula as =COVARIANCE.S(A,B) or =COVARIANCE.S(B,A), the output will be the same. That is, irrespective of whether the returns of A are put first followed by those of B or the returns of B are put first followed by those of A, the output will be the same. So, you need not worry about the order in which the returns are inputted.

Importance of Covariance in finance:

Covariance is a widely used statistical metric in the world of finance to identify the directional relationship between the returns of two securities. The two securities that are chosen could be anything - a stock and another stock, a stock and a sectoral index, a stock and a stock market index, a commodity and a stock market index, a commodity and another commodity etc. Covariance is useful when building a portfolio that comprises of stocks and/or other assets. In an earlier chapter, recollect that we said unsystematic risks of a portfolio can be mitigated by diversification. By selecting securities that do not share a high positive covariance with each other as well as by mixing securities that exhibit positive covariance with some and negative covariance with the rest, one can lower the unsystematic risks of a portfolio.

Besides, covariance is used in the calculation of other statistical metrics. Two of the most important and widely used statistical metrics where covariance is used are in the computation of correlation coefficient and Beta. We will talk in detail about correlation in this chapter and Beta in the next.

Shortcomings of Covariance:

While covariance helps in identifying the directional relationship between the returns of two securities, there are a few shortcomings that it faces. One of the major shortcomings is that it says nothing about the strength of the relationship between the returns of two securities. Also, covariance is unbounded on either side of zero. Hence, other than knowing the direction of the relationship (positive or negative), it is difficult to make out another more from covariance, such as identifying the strength of the relationship and comparing covariances. Well, these two issues are solved by another statistical metric called correlation coefficient, which is the topic of discussion in the next section.

Correlation:

Correlation is like covariance in that the former also measures the directional relationship between the returns of two securities. However, unlike covariance which just states the directional relationship, correlation goes deeper and talks about the strength of the relationship also. In other words, correlation informs us how strongly the returns of two securities are related. Also, unlike covariance, correlation is standardized because of the way in which it is calculated. This means correlation is not measured in units but is rather expressed as a pure number that fluctuates within a set band. This makes it possible to compare one correlation value with another. Statistically, correlation is calculated as follows:

co-relation-formula

co-relation-exp

Because covariance is a product of the deviation from mean return for two securities, it is expressed in terms of squared units. Dividing it by the product of the individual standard deviation standardizes the expression and makes it much easier to not only interpret but also compare various correlation values. As said earlier, correlation fluctuates within a band of -1 and +1. It can neither be less than -1 nor greater than +1. A positive value means the returns of two securities are positively correlated, that is they typically move in the same direction (rise together and fall together). A negative value means the returns of two securities are negatively correlated, that is they typically move in the opposite direction (if one rises, the other falls, and vice versa). As said earlier, besides measuring the directional relationship, correlation also measures the strength of the relationship. Mentioned below are general guidelines for gauging the strength of the relationship between the returns of two securities:

corelation_values.png

From the above table, how does one interpret correlation? Well, when the correlation is perfectly positive, it means the returns of two securities move in the same direction and by the same magnitude. On the other hand, when the correlation is perfectly negative, it means the returns of two securities move by the same magnitude but in the opposite direction. Meanwhile, if the correlation is ≥0.5 or ≤-0.5, the returns of the two securities are strongly correlated, meaning change in the value of one typically coincides with a strong change in the value of the other (do not confuse this with causation though, as correlation tells nothing about the causal relationship). Meanwhile, if the correlation falls between 0 and 0.5 or between 0 and -0.5, the returns of the two securities share a weak correlation. Finally, when the correlation values are 0 or very close to 0, such as between -0.1 and 0.1, it indicates an absence of correlation between the returns of the two securities.

Because correlation is unitless and has bounds on either side of zero, see how it plays a dual role of informing about the directional relationship as well as the strength of this relationship. The sign of the correlation lets us know whether the returns of two securities move in the same direction or in the opposite direction. Meanwhile, the positioning of the correlation value within the 0 and +1 band or within the 0 and -1 band lets us know the strength of the relationship.

By now, as we already know the steps that are involved in the calculation of covariance and standard deviation, let us jump directly into MS Excel and understand how to compute correlation of security returns. Again, let us use the same hypothetical and real-world examples that we demonstrated earlier in this chapter.

hypothetical_ex_up.png

In the above hypothetical example, see the formula bar to understand how correlation of stock returns has been calculated in cell O24. As can be seen, the excel function that is used to calculate correlation is =CORREL(). Inside the parenthesis, you would specify the range of returns for stock A and B. This range would be separated by a comma. So, as can be see, the correlation between stock A and B based on the historical data of the past 10 days is 0.83. This tells that the returns of stock A and B have shared a strong positive correlation over the lookback period. Let us now proceed to a real-world example of Nifty and NSE Pharma index, which we had presented earlier. Let us measure the covariance between their returns based on the daily closing price for the first three months of 2021.

nse_pharma_index.png

Above, in the formula bar, notice how the 3-month correlation between the daily returns of Nifty and NSE Pharma index has been calculated in cell G12. The value 0.61 indicates that the two have shared a strong positive correlation during the first three months of 2021. Similarly, as we did earlier in this chapter, the correlation between Nifty and Bank Nifty as well as that between HPCL and L&T can be calculated. The table below shows these statistics:

nifty_bank_nifty.png

From the above table, see that during the first three months of 2021, Nifty and Bank Nifty index shared a very strong positive correlation, suggesting that the two have pretty much moved in sync during this period. On the other hand, while the correlation between HPCL and L&T is positive, see that it is a relatively weak correlation, as compared to that between Nifty and Bank Nifty as well as that between Nifty and NSE Pharma index. Also notice that it is much easier to compare correlations than to compare covariances.

Importance of Correlation in finance:

When it comes to portfolio management, correlation is an important statistical metric that can help in diversifying holdings across securities, which in turn would help in efficiently managing portfolio risks. Keep in mind that it is quite tempting to deploy all your capital in highly volatile and highly correlated stocks, given the possibility of the portfolio generating strong returns during times when markets, in general, are trending higher. However, during volatile times, there is also a possibility of severe drawdowns if the portfolio comprises of only such stocks. Always remember, risk and reward go hand in hand – the higher the reward, the higher would be the risk, and vice versa. Hence, it is prudent to have a combination of stocks in your portfolio with varying levels of correlation – some strong positive, some weak positive to zero, and some even negative. This way, the portfolio risk gets diversified, so that a large drawdown in any one or two stocks is unlikely to have much of a negative impact on the portfolio.

So far, we have covered statistical tools that primarily cover the unsystematic side of risk. In the next chapter, we will discuss about a tool that covers the systematic side of risk.

Next Chapter

Beta

3 Lessons

Over the past three chapters, we discussed some key statistical tools that help in analyzing individual risks of a security as well as in understanding the relationship between two securities. In this chapter, we will talk about another important statistical metric that helps in measuring the systematic risk of a security, Beta.

Portfolio Return, Variance, and Beta

4 Lessons

Now that we understand the individual stock-specific risk measurement metrics, it is time to focus on Portfolio. In this chapter, we will talk about portfolio risks and returns. We shall discuss how to calculate the expected return on a portfolio as well as how to measure portfolio variance, standard deviation, and Beta.

Comments & Discussions in

FYERS Community Learn More...

Responses

risk-and-money-management