Financial Econometrics Econ 40357 Topic 2: Exploratory data analysis - PowerPoint PPT Presentation

Financial Econometrics Econ 40357 Topic 2: Exploratory data analysis . . . N.C. Mark University of Notre Dame and NBER August 24, 2020 1 / 24

Concepts to cover Time series and stochastic processes. 1 Our ‘models’ of the data generating process 2 Stationarity, and why it’s important 3 Exploratory data analysis. What do we learn? 4 2 / 24

Stochastic process, time series Stochastic ⇔ random A time-series is a sequence of observations over time. We think of them as stochastic processes. iid ∼ N ( µ , σ 2 ) The simplist stochastic process. x t 3 / 24

The driftless random walk iid ∼ N ( µ , σ 2 ) and p t = ln ( P t ) be the price of non-dividend Let x t paying stock (Netflix, Tesla, and for a long time, Apple). p t = p t − 1 + x t ∆ p t = x t Return is x t . Sometimes referred to as white noise . A white noise process is i.i.d. (independent, identically distributed). This is a powerful statement. Independence of returns over time. 4 / 24

20 replications of a white noise process 5 / 24

Stationarity Strict stationarity says the distribution of X t is the same for all t . So the distribution of X t is the same as for X t + 1 , etc. Covariance stationarity A less restrictive form says the covariance between X t and X t − s is the same, for all t . A time series can violate strict stationarity but still be covariance stationary. E.g., volatility clustering. Comparison of Cov ( x t , x t − 1 ) and Var ( x t ) . 6 / 24

What do we mean? What do we mean by things like E ( x t ) and Var ( x t ) ? In observational time-series there is only one observation of x t at t . One observation of x 96 , where t = 96 . What does the distribution of x t have to do with (time-series) sample moments? (Sample mean, sample variance, etc.) 7 / 24

Ergodic Theorem T 1 ∑ x t = E ( x t ) lim T → ∞ T t = 1 T 1 � � x 2 x 2 ∑ t = E lim T → ∞ t T t = 1 and so on If the conditions for the ergodic theorem hold, then we can use time-series sample moments to estimate the theoretical moments. 8 / 24

First thing you do Plot your data. Stare at it. Are there mistakes? What else are we looking for? Properties of the data Are there trends ? If so, trends need to be eliminated before doing econometric analysis. Why? Things like T 1 ∑ lim T → ∞ x t T t = 1 doesn’t exist if there is a trend. Are there structrual breaks ? Is there volatility clustering ? 9 / 24

Let’s look at plots of DJIA price and retunrs There is a trend . We say these data are unit-root nonstationary . Will explain the term unit-root later. Contrast with non identicalness of distribution, under volatility clustering, also not strictly stationary, could be covariance stationary not unit-root nonstationary. To analyze these data, transform the observations to induce stationarity. i.e., Dont look at price P or log price ln ( P ) , but look at returns ∆ ln ( P ) (ignoring dividends here). 10 / 24

Quarterly returns 11 / 24

Monthy returns 12 / 24

Daily returns 13 / 24

Questions Do monthly DJIA returns look strictly stationary? Covariance 1 stationary? Do daily DJIA returns look strictly stationary? Covariance 2 stationary? 14 / 24

The Normal (Gaussian) benchmark We usually take the normal distribution as a benchmark. Why? B/C properties are well understood, the normal is a good model for many natural phenomena, it has good mathematical properties, especially on the asymptotics , through the central limit theorem . We typically focus on symmetry and tail thickness. We know what these are for the normal. Is the data well described thusly, or are there significant deviations? First, we must be familiar with the following concepts 15 / 24

Moments of a Distribution The k -th theoretical moment of a distribution or of the random variable x ) � x k � E The k -th central moment E ( x − µ ) k where the first moment is µ = E ( x ) . Sample moments are the sample counterparts. Let { x t } T t = 1 be a sequence of time-series observations (e.g., returns). Mean and variance. First two moments T x T = 1 ∑ µ = E ( x t ) ; ¯ x t T t = 1 T 1 E ( x t − µ ) 2 ; x T ) 2 σ 2 ∑ T = ( x t − ¯ ˆ T − 1 t = 1 16 / 24

Moments of a distribution Third moment for symmetry/asymmetry. Skewness measure T − 1 ∑ T 1 x T ) 3 E ( x t − µ ) 3 t = 1 ( x t − ¯ sk T = ; σ 3 σ 3 ˆ T For normal distribution, skewness measure is 0 . Skewness is 0 for all symmetric distributions. If x t is a return, might want to know if it has a heavy left tail (propensity to crash) or heavy right tail (propensity to boom). 17 / 24

Skewed Left Skewed Right 18 / 24

Moments of a distribution Fourth moment measures tail thickness. The theoretical measure is kurtosis � 4 T − 1 ∑ T E ( X t − µ ) 4 1 x t − ¯ � X T t = 1 kurt T = ; σ 4 σ 4 ˆ T For normal distribution, kurtosis is 3 . Distribution has excess kurtosis if the measure exceeds 3 . These are fat-tailed distributions and peaked . There is a higher probability of extreme events than predicted by the normal. Called leptokurtotic. Pay attention to whether the software computes kurtosis or excess kurtosis. 19 / 24

Properties of the normal distribution Standard normal. For − ∞ ≤ x ≤ ∞ < 1 e − x 2 f ( x ) = √ (1) 2 π (General) Normal 1 2 e − ( x − µ σ ) f ( x ) = √ (2) 2 π σ Distribution is symmetric around µ (mean, location) 1 Dispersion regulated by σ (scale). How is σ a measure of scale? If X is 2 household income in dollars, then 100 X is household income in cents. � � Var ( 100 X ) = 10 Var ( X ) (3) In finance, standard deviation ⇔ volatility 3 20 / 24

Properties of the normal distribution Tail probabilities converge to 0 at a well defined rate. Loosely speaking normal tail probabilities converge to 0 quickly (even though it’s possible to have realizations that are arbitraily large or small). Conclusion: Assessments of normality involve checking for distributional symmetry and appropriate tail thickness . How do we do that? Through examination of sample moments . 21 / 24

Varying kurtosis Figure: Distributions with differing kurtosis 22 / 24

Jarque-Bera test for normality The Jarque-Bera statistic measures the difference between 1 skewness and kurtosis in the data and the normal distribution. Let sk T be sample skewness, and kurt T be sample kurtosis. 2 Jarque and Bera showed that their statistic JB � � T + ( kurt T − 3 ) 2 JB = T sk 2 ∼ χ 2 2 6 4 under the null hypothesis of normality. Eviews produces JB test and p-values when asking for descriptive 3 statistics. 23 / 24

Returns S&P Annual Returns Daily Returns 24 / 24

Financial Econometrics Econ 40357 Topic 2: Exploratory data analysis - PowerPoint PPT Presentation

Financial Econometrics Econ 40357 Topic 2: Exploratory data analysis . . . N.C. Mark University of Notre Dame and NBER August 24, 2020 1 / 24 Concepts to cover Time series and stochastic processes. 1 Our models of the data generating

Virtual Student Orientation Information for Families SLIDESMANIA.COM TOPIC TOPIC TOPIC TOPIC

ConnectHome ConnectHome Topic 2 Topic 2 Nation Webinar Nation Webinar Topic 3 Topic 3 Topic

Financial Econometrics, Econ 40357 Macro and Financial Time Series N.C. Mark University of Notre

Financial Econometrics Econ 40357 Volatility, ARCH, GARCH N.C. Mark University of Notre Dame and

Financial Econometrics Econ 40357 Local Projections N.C. Mark University of Notre Dame and NBER

Financial Econometrics Econ 40357 Constant expected return model and efficient market hypothesis

Financial Econometrics Econ 40357 Regression review, Time-series regression Some Necessary Matrix

Financial Econometrics Econ 40357 The Event Study Method Brooks pp 571-583 N.C. Mark University

Financial Econometrics Econ 40357 ARIMA (Auto Regressive Integrated Moving Average) Models Part

Financial Econometrics Econ 40357 Principal Components N.C. Mark University of Notre Dame and

Financial Econometrics Econ 40357 The Fama and French 3-Factor Framework N.C. Mark University of

Financial Econometrics Econ 40357 Factors Single-Factor ModelsThe market model and the CAPM

Financial Econometrics Econ 40357 ARIMA Part 2: Autoregressive Models N.C. Mark University of

BS2247 Introduction to Econometrics Lecture 1: Basic Mathematical Review Dr. Kai Sun Aston

Exploratory Data Analysis Paul Cohen ISTA 370 Spring, 2012 Paul Cohen ISTA 370 () Exploratory

Introduction to Data Science: x (1) x 1 x 2 x ( n ) x i n 1 1 Size: size

Chapter 3. Early Macro Divergence from Micro UMSL Max Gillman Max Gillman () 1 / 52 Facts: The

Evolutionary Stochastic Portfolio Optimization and Probabilistic Constraints Ronald Hochreiter

Defending Against Malicious Socialbots Position Paper Yazan Boshmaf, Ildar Muslukhov,

Screening: The First Step for Finding Winning Stocks John M. Bajkowski johnb@aaii.com 1

Advanced Features of xts Manipulating Time Series Data in R: Case Studies Finding Endpoints

Neural Networks for Time Series Prediction 15-496/782: Artificial Neural Networks Kornel

The COIN-OR Optimization Suite: Open Source Tools for Optimization Part 5: Advanced Modeling with

DMA implementations for FPGA- based data acquisition systems Presenter: Wojciech M. Zabootny

Sambuz

Useful Links

Newsletter

Mail Us