David's (BT) data blog

Aug. 12, 2023

David Harper

Simulating the equity risk premium

The implied ERP is very sensitive to assumptions, in particular G2

Sept. 15, 2022

David Harper

Phillips curve to illustrate bias-variance tradeoff

Underfitting implies low variance but high bias; overfitting implies low bias but high variance

Sept. 5, 2022

David Harper

Using purrr to map over a range of k-means clusters

Mapping a k-means factory

Aug. 31, 2022

David Harper

Intermediate Functional Programming with purrr

My progress in learning how to purrr

Aug. 27, 2022

David Harper

Advanced Data Visualization with R at JH

The sequel in the JH dataviz specialization

July 30, 2022

David Harper

Foundations of purrr

Map over list elements with elegance and power

July 16, 2022

David Harper

How to Process Missing Data

How do we visualize what's missing? And the art of imputation

June 6, 2022

David Harper

My JH dataviz submission

Product of JH's DataViz in R with ggplot2.

May 2, 2022

David Harper

Example of embedded Excel snippet

An Excel workbook (or a range) can be embedded in a classic iframe

Jan. 25, 2022

David Harper

BT PQ P1.T2.21.1 (SET) Non-stationary time series (2021)

Seasonal dummy model, roots of characteristic equation, and transformation (difference versus detrend) of non-stationary process

Jan. 23, 2022

David Harper

BT PQ P1.T2.20.25 (SET) Long-horizon AR(p) MA(q) forecasts

The long-run mean of an MA(q) process is the intercept; of the AR(p) process is delta/(1 - sum of params)
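As a minimal sketch of that teaser (not the post's own code), the Python below computes both long-run means and checks that an iterated AR(p) forecast converges to delta/(1 - sum of params). The function names and the example parameters (delta = 1.0, AR coefficients 0.5 and 0.3) are illustrative assumptions, and the AR process is assumed to be covariance-stationary.

```python
def ma_long_run_mean(intercept, thetas):
    """Long-run mean of an MA(q): every shock has mean zero, so only the intercept remains."""
    return intercept


def ar_long_run_mean(delta, phis):
    """Long-run mean of a covariance-stationary AR(p): delta / (1 - sum of AR params)."""
    return delta / (1.0 - sum(phis))


def ar_forecast(delta, phis, history, horizon):
    """Iterate the AR(p) recursion forward, setting future shocks to their mean of zero.

    `history` must hold at least len(phis) observations, oldest first.
    """
    values = list(history)
    for _ in range(horizon):
        # Most recent observation first, so it pairs with the first AR coefficient.
        lags = values[-1:-len(phis) - 1:-1]
        values.append(delta + sum(p * y for p, y in zip(phis, lags)))
    return values[-1]


# Hypothetical AR(2): y_t = 1.0 + 0.5*y_{t-1} + 0.3*y_{t-2} + e_t
mean = ar_long_run_mean(1.0, [0.5, 0.3])
far_forecast = ar_forecast(1.0, [0.5, 0.3], history=[0.0, 0.0], horizon=500)
print(round(mean, 6), round(far_forecast, 6))
```

Here the long-horizon forecast settles at 1.0 / (1 - 0.8), regardless of the starting values, which is the sense in which the forecast reverts to the unconditional mean.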

Jan. 21, 2022

David Harper

BT PQ P1.T2.20.24.3 AIC and BIC

Penalized MSE measures are called information criteria (IC); two popular such measures are the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC).

Jan. 19, 2022

David Harper

BT PQ P1.T2.20.24.2 Box-Pierce and the Ljung-Box tests

The Box-Pierce statistic is a simplified version of the Ljung-Box statistic; both are joint tests of autocorrelation
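To make the relationship concrete, here is a minimal pure-Python sketch (not the post's own code) using the standard textbook definitions: Box-Pierce is T times the sum of squared sample autocorrelations up to lag h, while Ljung-Box reweights each term by (T + 2)/(T - k). The function names and the short example series are illustrative assumptions.

```python
def autocorrelations(x, max_lag):
    """Sample autocorrelations at lags 1..max_lag."""
    n = len(x)
    mean = sum(x) / n
    dev = [v - mean for v in x]
    denom = sum(d * d for d in dev)
    return [
        sum(dev[t] * dev[t - k] for t in range(k, n)) / denom
        for k in range(1, max_lag + 1)
    ]


def box_pierce(x, h):
    """Box-Pierce Q = T * sum_{k=1..h} rho_k^2."""
    n = len(x)
    return n * sum(r * r for r in autocorrelations(x, h))


def ljung_box(x, h):
    """Ljung-Box Q = T * (T + 2) * sum_{k=1..h} rho_k^2 / (T - k)."""
    n = len(x)
    rho = autocorrelations(x, h)
    return n * (n + 2) * sum(r * r / (n - k) for k, r in enumerate(rho, start=1))


# Made-up series, for illustration only.
series = [1.0, 3.0, 2.0, 5.0, 4.0, 6.0, 5.0, 8.0, 7.0, 9.0]
print(box_pierce(series, 3), ljung_box(series, 3))
```

Both statistics test the joint null that the first h autocorrelations are zero and are compared against a chi-squared distribution with h degrees of freedom. Because (T + 2)/(T - k) exceeds one, the Ljung-Box value is never smaller than the Box-Pierce value, and the gap shrinks as the sample grows.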

Jan. 16, 2022

David Harper

BT PQ P1.T2.20.23 (SET) autoregressive moving average (ARMA) models

ARMA(p,q) combines an AR(p) and MA(q)

Jan. 15, 2022

David Harper

BT PQ P1.T2.20.22.2 autoregressive (AR) versus moving average (MA) process

What's the difference between an AR and an MA process when they appear to be similar?

Jan. 14, 2022

David Harper

BT PQ P1.T2.20.21.3 White Noise (WN) Process

White noise (WN) is the basic time series building block

Jan. 13, 2022

David Harper

BT PQ P1.T2.20.21.2 Autocorrelation function (ACF)

The autocorrelation function (ACF; aka, correlogram) plots autocorrelation coefficients

Jan. 12, 2022

David Harper

BT PQ P1.T2.20.20.3 Regression residual plots

Standard lm() diagnostic plots: residuals vs fitted, normal Q-Q, scale-location, residuals vs leverage

Jan. 11, 2022

David Harper

BT PQ P1.T2.20.20.2 Regression diagnostics: m-fold cross-validation (CV)

m-fold cross validation is for model checking, not model building

Jan. 10, 2022

David Harper

BT PQ P1.T2.20.20.1 Regression diagnostics: Cook's distance

Cook's distance measures how much a single observation, such as an outlier, influences the fitted regression

Jan. 9, 2022

David Harper

BT PQ P1-T2-20-19: Regression diagnostics (SET)

Diagnostics: omitted variable bias, heteroskedasticity, and multicollinearity

Jan. 8, 2022

David Harper

BT PQ P1-T2-20-18 (SET) Multivariate regressions

Fama-French three-factor model; House prices; and Medical costs

Jan. 7, 2022

David Harper

BT PQ P1-T2-20-17. Univariate regressions cont (2nd set v2)

Coefficient confidence interval (CI); hypothesis test; interpretation of SE, t-stat and p-value

Jan. 6, 2022

David Harper

BT PQ P1-T2-20-16-3: Univariate regression: Monthly rental versus footage

Monthly rent against square footage (feet^2), per a Kaggle dataset

Jan. 5, 2022

David Harper

BT PQ P1-T2-20-16-2: Univariate regression: Portfolio versus benchmark returns

Simulated portfolio & benchmark for purposes of testing basic features of univariate regression

Jan. 4, 2022

David Harper

BT PQ P1-T2-20-16-1: Univariate regression: Inflation versus unemployment

With FRED data and applying gt_table

Jan. 3, 2022

David Harper

velocity of money

MV = PY illustrates the problem but is tautological

Jan. 2, 2022

David Harper

New distill site in 15 minutes

Distill is so much easier than blogdown
