The Misuse of Correlation Part 3

“It ain’t what you don’t know that gets you into trouble. It’s what you know for sure that just ain’t so.” Mark Twain


Definition

Dictionaries have a range of broadly related definitions, mostly suggesting the mathematical meaning and everyday sense are identical. As we saw with the word “significance” https://appliedmacro.com/2017/06/12/the-misuse-of-significance/, when there is no clear distinction in statistical and everyday usage, it can lead directly to confusion and often dangerous errors.

A few meanings:

  • Historical origin of the word is likely from medieval Latin:
  • Modern common usage, it is “a mutual relationship or connection between two things”
  • In statistics, it broadly means a “quantity measuring the interdependence of variable quantities”.
  • Here a difficulty arises, as although statistics has this broad meaning for it, often (especially in economics or finance) when people say correlation, they are talking about the Pearson Correlation Coefficient, also referred to as Pearson’s r.
  • Person’s r is the formula I gave you in the previous posts (https://appliedmacro.com/2017/06/28/the-misuse-of-correlation-part-1-quick-refresher-and-quiz/) and is specifically a measure of linear correlation between two variables.
    It has a value between +1 and −1, where 1 is perfect positive linear correlation, 0 is no linear correlation, and −1 is perfect negative linear correlation.


Issues

I would like to deal with well-known issues that still lead to common mistakes.

  1. Uncorrelated does not mean unrelated
  2. Correlation does not imply causation
  3. Correlation is not transitive
  4. Data issues

I’ll deal with the first two in the remainder of this post, and the second two in the following, to keep the posts to a reasonable length.

  1. Uncorrelated does not mean unrelated

Saying things are uncorrelated, people generally mean there is no relationship between them in everyday usage. Let’s look at a couple of examples where this common logic error leads to dangerously incorrect conclusions:

A). Relationships can be non-linear

When a cannonball is fired, its path will form a parabola (OK strictly only in a vacuum!). If we look for a relationship between height and time, draw a chart then the relationship is very clear.

If we ran a Pearson r correlation analysis instead of drawing the picture, we would find there is ZERO correlation between height and time. Other statistical correlation measures, such Spearman’s rho which looks for monocity may pick up non-linear monotonic relationships, but also will not help you here for a parabolic relationship.

If you are using a statistical analysis early on in your investigations, the absence of a measurable correlation, such as a linear one, can lead you to assume there is not any relationship and thus discard and ignore a very valid but non-linear relationship. This is a serious problem.

As with “significance”, using the same word for a mathematical term that we use in everyday language can lead to serious mistakes. Once explained in this way, it may seem obvious but starting an analysis by filtering early, only looking for relationships with high correlations, remains shockingly common even with people that are aware of the problem.

B). There can be a logical relationship which is important

Take an example from financial markets, there is an intuitive connection between the short end and the long end of the bond market. If we look at US government bond yields 2 year (x-axis) versus 10 year (y-axis) over the last 5 years (chart below). The correlation between their levels is virtually zero and, if we were to only look at correlation, we may falsely conclude there is no relation between them.

A more dramatic example with very dangerous consequences was the lack of a correlation between US house prices and the price of AAA tranches of mortgage backed securities before the crisis.

Before the crisis Correlation coefficient virtually zero


During the crisis
Correlation coefficient = 0.96 (virtually 1!)

This was a very bad way to think about the relationship but a huge amount of money was invested on this poor assumption. It also relates to a serious logic flaw – just because something hasn’t happened in the past does not mean it can’t happen in the future.

  1. Correlation does not imply causation

Everyone is taught this early in the study of statistics. Often with an example, as below, of a spuriously high correlation, where the intuitive relationship suggests something rather different.

https://www.mathsisfun.com/data/correlation.html

Everyone knows that ice cream and sunglasses have a common driver i.e. the weather. Perhaps it is less well known, how frequently this error is repeatedly made in economics and finance, even in the upper echelons of academia and policy making,

The policy impact and subsequent furore over the paper “Growth in a Time of Debt” by Carmen Reinhart and Ken Rogoff is a notable example. They found a correlation between national debt and growth rates, stating that “for levels of external debt in excess of 90%” GDP, growth was “roughly cut in half”.

On both sides of the political spectrum, the calculated correlation had become all that mattered:

  • For those who wanted to reduce budget deficits in the US and UK, this was referred to as “conclusive empirical evidence” (Paul Ryan) and “convincing” (George Osborne). A strong correlation proved the case for austerity
  • For their opponents, their attention was focused on the details of a data error which reduced the strength of the calculated relationship. The weak correlation proved there should be no restriction on debt levels.

Both sides of that argument were so simplistic, it was bizarre. This is not a fault of the original work, doing statistical analysis is a good idea, it is a fault of over-simplistic interpretations of its meaning.

The relationship between macro data and financial crises are similarly an area of extreme concern.

It may be true that there is a correlation between budget deficits and currency crises.

If you then conclude that budget deficits cause currency crises, then it is a quick jump to proposing that the way to prevent a currency crisis is to focus on the deficit and cut spending.

This of course fails to explore some crucial, causal links. If budget problems and currency weakness are both manifestations of a common underlying problem then treating one of the symptoms will not cure anything. Once again, an overly simplistic analysis based on correlations can lead to disastrous policy recommendations.

The Misuse of Correlation Part 2 – the results

In this post, I want to talk about an insidious error that creeps in with the usage of correlation in finance.

FT lexicon supports the idea that:

a correlation is said to be positive if movements between the two variables are in the same direction and negative if it moves in the opposite direction.”

This definition is not unusual, commonly seen in finance textbooks.

Occasionally the formula may be presented:

But caveats in using the formula will likely be absent or at the least hidden from view.
By that, I mean the terms and are critically important but this importance is rarely appreciated. [1]

In the previous post, I asked you about the correlation of the changes of the two assets in the chart below:

a. Positive correlation

b. Negative correlation

c. They are uncorrelated

d. Not sure (be honest!)


The most obvious answer is of course b)
One line goes up and the other goes down so this means they have a negative correlation. This is unfortunately strictly incorrect if you paid attention to the instruction to consider the “changes in the two assets”.

A good answer is d)
Given the amount of information I had supplied, it’s a perfectly reasonable one.

Because another answer is a)

The correlation of the changes in the variables is +1, PERFECT POSITIVE correlation
and the lines are going in the OPPOSITE direction!!

(If you doubt this result please look at the data and calculations in the sheet attached (download) and use the CORREL function in excel.)

b) is an intuitive answer but a) is the answer that a financial analyst would calculate. If you imagine of situations where you are being given financial advice, it is clear there could be an immediate conflict!


First insidious confusion – the importance of the mean

If you have never seen this before, you may think I am lying or this is a convoluted trick. But it rests upon one key part of the calculation of correlation that is missing from virtually every definition I see, and is certainly missing from the vast bulk of work done by analysts in the finance industry.

The key is that correlation is calculated by looking at the relationship in deviations from the means (the terms and in the complicated mathematical equation).

In our example, the changes in the two variables in the chart have equal and opposite means, and so trend in different directions. However, the day to day volatility (deviation from the mean of the changes) is identical for both variables, and it is this term that drives the correlation whilst having no impact on the trend.

Here is a scatterplot of the % changes for each variable. Observe all the dots are distributed along the line – a perfect POSITIVE correlation.

This has a clear relationship to the way we think about the change in market prices of any asset:

In financial markets, the daily noise is usually much greater than the daily trend, and so forms the focus of most market commentary.

The key result is that if the noise term correlates for two assets, then they will correlate irrespective of their underlying trend, given the way correlation is calculated.
i.e. they could end up in very different places even if they are positively correlated!

Second insidious confusion – levels vs changes

The second insidious confusion can arise from a reference to correlation of the CHANGES or a correlation in the LEVELS of the two variables.

In financial markets, the method invariably used is to look at the changes in variables. In our example, we get the answer of positive 1 i.e. perfect positive correlation.

If we calculate the correlation using the levels or prices, we get an answer of -0.97
i.e. strong negative correlation

The intuitive result is the opposite of the result most likely to be calcuated by financial analysts.

Why does finance prefer the use the correlation of changes?

It is done for good reason. When you are looking at data with strong trends, as a lot of asset prices do, the correlation of levels can yield very strange results. Let’s take an example.

Let’s look at the US equity market (S&P 500 price – white line) and its PE ratio (orange line) over the last 30 years.

If we first look at the correlation of levels, we get a correlation of virtually zero.
This suggests a rather unintuitive result that there is no meaningful correlation between PE ratio and equity prices!

If we instead look at the correlation of changes, we get that there is a meaningful positive correlation of 0.78 which makes a lot more sense.

Conclusion

If these differences in the correlation results is were just some statistical fluke, from a couple of silly examples, then it would not matter.
But it is not an unusual result and it occurs when looking at the biggest and most commonly traded financial markets. It is therefore critical to avoid confusions such as these when thinking about what type of correlation to use or, more often, what someone else has used in the analysis you are reading.


[1] I very much enjoyed this paper by Francois-Serge Lhabitant which explains this issue very well. http://www.edhec-risk.com/edhec_publications/all_publications/RISKReview.2011-09-07.3757/attachments/EDHEC_Working_Paper_Correlation_vs_Trends_F.pdf

The misuse of Correlation Part 1 – Quick Refresher and Quiz

First, let’s refresh our memories of what correlation means.
This may seem very basic right now, but I would like to make sure the meaning is clear before we move on to its use.

I have included a question at the end, once you have read and thought about the definition:

  • A definition from the FT Lexicon:
    “a correlation is said to be positive if movements between the two variables are in the same direction and negative if it moves in the opposite direction.”
  • You can read examples in a number of sources such as

https://www.mathsisfun.com/data/correlation.html

and
http://www.bbc.co.uk/schools/gcsebitesize/maths/statistics/scatterdiagramsrev2.shtml

Here is a range of correlations, shown via a scatterplot:

Some important concepts

  • A positive correlation is “when the values increase together”
    An example would be temperature and ice cream sales as “warmer weather and higher sales go together”.
  • A negative correlation is “when one value increases and the other decreases
    Note this is sometimes called an “inverse correlation”.
    An example would be weight of a car and its fuel efficiency as “cars that are heavier tend to get less miles per gallon.”
  • No correlation is when “there is no connection”. An example would be IQ and house number.”
  • For those of you with a more formal approach the mathematical formula for correlation is:
  • In practice, most of us find it much easier to use the function CORREL() in Excel!

Question time

Here is an example with two asset prices A and B. When we represent the data in a chart it can often be done in one of two ways.

This chart has two lines, showing how both the prices of asset A and B moved over time.

The other way to chart this is to put the prices of A and B on the two axes instead. It looks like this.

To make sure you have understood the basic concept of correlation, I would appreciate it if you could vote on an answer to the following question. (all anonymous of course!)

Career Tips

I was asked recently to speak at an undergraduate event. Part of it was to give some career advice in the form of 3 tips. Here is what I came up with:

Many people after leaving university find adjusting to the world of work difficult and become very unhappy. Focusing on a lack of “meaning” in their job while searching for a “mentor” to guide them, they can quickly come to resent their firm and co-workers.

It does not have to be this way.

The most important thing to realise is that the workplace is not going to feel like an extension of education – it is completely and fundamentally different. For at least the first two decades of your life, focusing on your knowledge and your skills is the key and the whole environment around you is geared to helping you develop. However, the ability of a student to successfully transition into a happy and productive career has remarkably little to do with the knowledge and skills they start with.

What really matters is how well they can change their mindset.

Here are 3 things to focus on:

  1. It’s not about you any more

This is the piece of advice students generally find the most upsetting. A big change in mindset is required to succeed in a work environment compared to the one needed for education.

In education, the student is the product. The ultimate aim for a student, with the help of teachers, is to gain the skills and knowledge required to pass exams. This does not mean that students have complete free rein to do what they want. There will be various restrictions on behaviour, such as a requirement to go to lectures, prepare for tutorials, do reading, problem sets and essays – however these are all designed with the success of the student in mind. The best attitude for the student is to be focused on themselves and their own needs.

In the workplace, the business is the product. The ultimate aim for a new employee is to become useful. Many graduates find this transition to the workplace a shock. Senior members of staff may not think that a key part of their role is to educate you and make you more productive or happy. In a few years’ time, you will also be more senior too and it will be obvious to you that this is not a priority either. You will want to be productive at work, impress your boss, get promoted, get a bonus etc.

Adjusting to this new reality, the best attitude is to be focused, not on yourself, but on the needs of the people around you and of the firm – Be useful! You will then find good things will start to happen to you. Given reciprocity (see “Influence: The Psychology of Persuasion” by Robert Cialdini), people you help will also help you. Senior people will start to spend time helping you learn and improve. You will have signalled to the firm that you have the right mentality to succeed and so will be promoted more quickly, paid more and given more training.

Having a real job is extremely helpful in preparing you for work and choosing a career path. I spent my Gap year working full time as an economist, but working at McDonalds may have perhaps been even better. You need to understand what it is like to be the other side of the counter.

  1. Be flexible.

In education, a targeted focus and narrow determination are extremely helpful for excelling with high results. The world of academia is fragmented and siloed, with status derived from expertise in ever more specialised areas.

The world of work is very different. A modern and successful career will come with many parallel and some orthogonal leaps into new areas, combined with an ability to master a broad range of cross-disciplinary problems.

I could easily have become a consultant or economist and I think I would have really enjoyed it and been successful. In banking and hedge funds, my career could have gone in lots of different directions. The only way to take opportunities is by being open minded.

  1. Work with people you would like to become.

This piece of advice was given to me as an undergraduate, and it has repeatedly proven itself true as my career developed.

Don’t think that you can join an Investment bank for the money and not become like them. Either you will change to fit in, or you will not and you will hate it and leave.

You must judge it from meeting real employees, not from impressions from TV shows. Being a lawyer is not the way it is on Suits just as being a Hedge Fund manager is not like Billions (well mostly anyway). That is why internships are so useful.

Conclusion

The world of work is can be a stimulating and fulfilling experience. For that to happen you need to be able to have the right mindset to take advantage of the opportunities on offer.