π GitHub repository
π Dataset Source
Description
The goal of this analysis is to forecast the number of positive COVID-19 tests in Boston based on various factors such as emergency department visits, hospitalizations, airport passenger arrivals, and COVID-19 testing data. The dataset contains information about these variables over time.
Technique Used
The technique used in the code is Vector Autoregression (VAR) modeling. VAR is a multivariate time series analysis method that models the relationship between multiple variables as a system of equations. It extends the autoregressive (AR) model to multiple variables.
- Causal Analysis: Granger causality tests are performed to determine the causal relationship between the variables.
- Johansen’s Cointegration Test is conducted to check for long-term equilibrium relationships among the variables.
- Stationarity Check: Augmented Dickey-Fuller (ADF) test is performed to test the stationarity of the variables. If the variables are non-stationary, differencing is applied to make them stationary.
- Model Order Selection:Different lag orders (p) are tested using the VAR model, and the Akaike Information Criterion (AIC) is used to select the optimal lag order.
- Model Training and Forecasting: The VAR model is fitted to the training data.Serial correlation of residuals is checked using the Durbin-Watson statistic.The model is used to forecast future values based on the differenced data.