What is Time Series Data Analysis?
Time series data analysis is the analysis of datasets that change over a period of time. Time series datasets record observations of the same variable over various points of time. Financial analysts use time series data such as stock price movements, or a company’s sales over time, to analyze a company’s performance.
Examples of time series datasets include:
- The Gross Domestic Product (GDP) of the United States of America between 2010 and 2015 – The economic unit of analysis is the U.S. The economic unit of analysis is for the time period 2010-2015. A typical entry from this dataset would be (2012, $16.16 trillion).
- The GDP per capita of Germany between 2008 and 2018 – The economic unit of analysis is Germany. The economic unit of analysis is for the time period 2008-2018. A typical entry from this dataset would be (2010, $41,700).
- Total steel exported by India between 2000 and 2018 – The economic unit of analysis is India. The economic unit of analysis is for the time period 2000-2018. A typical entry from this dataset would be (2015, $3.17 billion).
- Total oranges eaten by a specific household in Ghana between 2008 and 2018 – The economic unit of analysis is a specific household in Ghana (e.g., Household 302). The economic unit of analysis is for the time period 2008-2018. A typical entry from this dataset would be (2018, 200).
Unlike cross-sectional data analysis, time series data analysis cannot make use of the random sampling framework. This makes time series data analysis much more complex and computationally demanding than cross-sectional data analysis. Random sampling cannot be used because the past values of a variable are almost always highly correlated with the present value of that variable.
For example, the GDP of the US in the fourth quarter of 2017 is highly correlated with the GDP in the third quarter of 2017. The degree of correlation is much higher than the correlation across economic entities at the same point in time.
The correlation coefficient between the US GDP in the current quarter and the US GDP in the previous quarter for the period 2008 to 2018 is 0.998. The correlation coefficient between the US GDP in the current year and the US GDP in the previous year for the period 2008 to 2018 is 0.992.
Causal Questions and Time Series Analysis
The majority of economic analysis involves the study of intertemporal causal claims. Examples include:
- How much does a 1% increase in current period GDP affect future period GDP?
- How does the past unemployment rate affect the current unemployment rate?
- What is the effect of a 1% increase in Grade 7 test scores on Grade 8 test scores?
Consider the test scores example: Suppose there is some policy instrument (e.g., increasing the teacher-student ratio) that can be used to increase Grade 7 test scores by 1%. Such a policy change is likely to be very expensive, and a policymaker who only looks at Grade 7 test scores might not implement the policy.
However, suppose a 1% increase in Grade 7 test scores is associated with a 0.5% increase in Grade 8 test scores. This additional benefit may make implementing the policy worthwhile.
Thank you for reading CFI’s guide to Time Series Data Analysis. To keep advancing your career, the additional CFI resources below will be useful: