Causation indicates that one event causes another. Correlation only identifies that there is a relationship between two events or outcomes.
If you were to collect data on the sale of ice cream cones and swimming pools throughout the year, you would likely find a strong positive correlation between the two as sales of both increase during the summer months. If you make the mistake of assuming correlation implies causation, you would incorrectly claim that an increase in ice cream cone sales causes people to buy swimming pools. However, this isn’t the case since you can attribute the increase in both to another variable—likely the warmer weather people experience during the summer. So although a correlation is present, you can’t support causation.
In another correlation versus causation example, it may not be as easy to identify whether causation is present with two variables. For example, you could find a correlation between the amount someone exercises and their reported levels of happiness. While it’s possible an increase in exercise is causing an increase in happiness, you can’t say for sure that it’s the cause since there could be another unknown variable that has a more significant influence on a person’s mood.
The homeless chart per state is of the number of people in shelters.
Correlation could indicate the poor states have less homeless.
Causation could indicate the reason they have less homeless in shelters is because the have no shelters.
If you look at many of these poor states, you may find less shelter and services exist for the homeless, homeless is more punished by law, or other factors making it less likely for someone to stay there or to be counted as homeless there.
This is why many say you can make a chart show anything you want it to, and you need to be critical when looking at people’s data.
I didn’t pull the HUD data to dive too much into it. The link to the source I gave had this though for your second question:
[One source of data was places] That provide temporary shelter during extremely cold weather (like churches). This category does not include shelters that operate only in the event of a natural disaster.
They may also be unemployed seasonal labor, so they have work sometimes (agriculture, tourism, ranching, etc) but not enough year round income. Just guessing on that, I’m not much familiar with Montana and the Dakotas.
Check out the full info at the links though. I’m a but sleep deprived to do much in depth analysis on this today. 😔
This is a case where correlation and causation are important.
Grabbed this quick example from Coursera here:
The homeless chart per state is of the number of people in shelters.
Correlation could indicate the poor states have less homeless.
Causation could indicate the reason they have less homeless in shelters is because the have no shelters.
If you look at many of these poor states, you may find less shelter and services exist for the homeless, homeless is more punished by law, or other factors making it less likely for someone to stay there or to be counted as homeless there.
This is why many say you can make a chart show anything you want it to, and you need to be critical when looking at people’s data.
I hope this was helpful!
Ah! Fewer shelters in poorer states makes sense. But I gathered that shelter info was used to extrapolate the total number of homeless.
Also, the map would make one wonder why there are so many homeless in the colder states. That wouldn’t make sense.
I didn’t pull the HUD data to dive too much into it. The link to the source I gave had this though for your second question:
They may also be unemployed seasonal labor, so they have work sometimes (agriculture, tourism, ranching, etc) but not enough year round income. Just guessing on that, I’m not much familiar with Montana and the Dakotas.
Check out the full info at the links though. I’m a but sleep deprived to do much in depth analysis on this today. 😔