BLOG

Behind the numbers.

Code examples, market analysis, and data quality deep-dives.

Is AI Capex Paying Back Fast Enough? Revenue Hurdle Forecasting in Python
Could Shorter AI Asset Lives Hit Earnings? Depreciation Stress Test in Python
How Much AI Capex Risk Can a Portfolio Remove? Constrained Optimization in Python
Is the AI Capex Trade Crowded? Rolling Volatility and Sector Rotation in Python
Did the AI Boom Come From Existing S&P 500 Members? Point-in-Time Momentum Test in Python
Is AI Revenue Circular? Customer-Vendor Capex Loop Analysis in Python
Is the AI Trade Connected to Private Credit? Rolling Correlation Network in Python
Is Apollo More Balance-Sheet Sensitive Than Peers? Leverage Screen in Python
Are AI Earnings Supported by Cash Flow? Accrual and Capex Screen in Python
Can Defensive Stocks Hedge AI Drawdowns? Basket Regime Test in Python
How Fast Does the Market Price In Fed Decisions? FOMC Event Study in Python
How Much Are Options Sellers Overpaid? The Variance Risk Premium in Python
Which Companies Have the Worst Earnings Quality? Sloan Accrual Screen with Geographic Revenue Data in Python
Does the Oil-to-Gold Ratio Signal Recessions? XLE/GLD Backtest in Python
Is AI Spending Crowding Out Free Cash Flow? Capex Sustainability Across the Mag 7 in Python
Does a Long Energy / Short Bonds Portfolio Capture Inflation Surprises? Factor Construction in Python
Can a Hidden Markov Model Detect Oil Market Regimes? HMM Analysis in Python
Do Grain Prices Predict Food Inflation? Granger Causality Test in Python
Does the Corporate Credit Spread Predict Stock Market Crashes? BAA-AAA Spread Analysis in Python
Do Oil Stocks Hedge Inflation? Rolling Beta Analysis in Python
Which Stocks Are Most Rate-Sensitive? Equity Duration via Bond Beta in Python
Which Companies Have the Highest Accrual Ratios? Earnings Quality Screening in Python
Is Alpha Persistent or Decaying? Rolling Sharpe Ratio Analysis in Python
Are Markets Trending or Mean-Reverting? Hurst Exponent Analysis in Python
Is Consumer Discretionary vs Staples a Leading Indicator? XLY/XLP Ratio Analysis in Python
Does Heavy Capex Predict Future Stock Returns? Capital Expenditure Analysis in Python
How to Estimate Cost of Equity Using CAPM in Python
Is Volatility Predictable? Testing for Volatility Clustering in Python
Which Industrials Are Overleveraged? Net Debt to EBITDA Screening in Python
GM Before and After Bankruptcy: Why Entity Resolution Matters for Financial Data
What Is Adjusted Beta? Merrill Lynch Beta Shrinkage in Python
How Good Is a Stock Pick? Information Ratio and Tracking Error in Python
Do Stock Returns Follow a Normal Distribution? Testing for Fat Tails in Python
Which Large Caps Have the Highest Free Cash Flow Yield? FCF Screening in Python
Which Sectors Won Over 5 Years? Sector Rotation Analysis in Python
How to Forecast Stock Volatility with GARCH Models in Python
Are Stock Prices Mean-Reverting? Augmented Dickey-Fuller Test in Python
How to Calculate CAPM Alpha and Beta with Regression in Python
How to Compare Sector Sharpe Ratios and Sortino Ratios in Python
DELL: Why Stitching Historical Price Data Together Is Wrong
How to Analyze Drawdown and Recovery for Bank Stocks in Python
How to Screen SaaS Stocks by Revenue Growth and Cash Flow in Python
How to Screen REITs by Dividend Yield and Valuation in Python
How Correlated Are the Magnificent 7? Intra-Group Correlation in Python
AAPL vs XOM: Do Individual Stocks Have Seasonal Patterns?
How to Rank Large-Cap Stocks by Momentum in Python
How to Build a Multi-Endpoint Financial Dashboard in Python
How to Compare Volatility Across Energy Stocks in Python
How to Screen Healthcare Stocks by Valuation in Python
How to Build a Sector Correlation Matrix for Portfolio Diversification in Python
How to Find Oversold and Overbought Stocks Using Z-Scores in Python
How to Measure Earnings Quality: Cash Flow vs Net Income in Python
How to Build a Multi-Factor Stock Screen in Python (Value + Momentum + Quality)
How to Build a Simple DCF Model for Any Stock in Python
How to Screen Tech Stocks by Revenue Growth in Python
How to Screen Stocks by Balance Sheet Health in Python
Is "Sell in May" Real? SPY Monthly Seasonality Over 10 Years
How to Compare Sector Performance YTD Using Python
How to Track S&P 500 Additions and Removals Over Time in Python
How to Screen Dividend Stocks by Yield and Quality in Python
How to Calculate Max Drawdown and Recovery Time for Any Stock in Python
How to Compare Profitability Across Mega-Cap Tech Stocks in Python
Why Ticker Symbols Are Unreliable: The Recycling Problem Every Quant Should Know
How to Calculate and Compare Stock Volatility in Python
How to Screen Blue-Chip Stocks by P/E Ratio in Python
How to Track Companies Through Ticker Changes, Bankruptcies, and Renames in Python
S&P 500 Turnover: How Much the Index Has Changed Since 2010
How to Calculate Stock Beta and Correlation in Python
← All articles

Is Volatility Predictable? Testing for Volatility Clustering in Python

What’s the question?

Volatility clustering is the empirical observation that large price moves tend to be followed by large price moves, and small moves by small moves. A stock that had a 5% daily swing yesterday is more likely to have another large swing today than a stock that moved 0.3%. This violates the assumption of independently and identically distributed (i.i.d.) returns — if today’s volatility carries information about tomorrow’s, then volatility is partially predictable. The Ljung-Box test applied to squared returns (a proxy for variance) formalizes this: it tests whether the autocorrelation structure of squared returns is statistically significant. If squared returns are autocorrelated, volatility clusters, and models like GARCH (Generalized Autoregressive Conditional Heteroskedasticity) that capture this dependence are appropriate.

The approach

Select 6 tickers including SPY over 3 years of daily data. Compute squared daily returns, which measure realized variance for each day. Apply the Ljung-Box test at lags 10 and 20 to test for joint autocorrelation across multiple lags. Also compute the lag-1 autocorrelation coefficient of squared returns directly as a single-number summary of day-to-day clustering strength. Generate a chart of SPY squared returns to visually illustrate clustering.

import xfinlink as xfl
import pandas as pd
import numpy as np
from statsmodels.stats.diagnostic import acorr_ljungbox
import matplotlib
matplotlib.use("Agg")
import matplotlib.pyplot as plt

xfl.api_key = "YOUR_API_KEY"  # free at https://xfinlink.com/signup

# -- Configuration ----------------------------------------------------------
tickers = ["AAPL", "MSFT", "NVDA", "TSLA", "XOM", "SPY"]

# -- Fetch 3Y daily returns -------------------------------------------------
df = xfl.prices(tickers, period="3y", fields=["return_daily"])

# -- Chart: SPY squared returns ---------------------------------------------
spy = df[df["ticker"] == "SPY"].copy()
spy["squared_return"] = spy["return_daily"] ** 2

fig, ax = plt.subplots(figsize=(12, 4))
ax.bar(pd.to_datetime(spy["date"]), spy["squared_return"], width=1, color="#2563eb", alpha=0.7)
ax.set_title("SPY Squared Daily Returns (3Y) -- Volatility Clustering", fontsize=13)
ax.set_ylabel("Squared Return")
ax.set_xlabel("Date")
plt.tight_layout()
plt.savefig("volatility-clustering-ljungbox-python.png", dpi=150)
plt.close()
print("Chart saved: volatility-clustering-ljungbox-python.png\n")

# -- Ljung-Box test on squared returns -------------------------------------
print("=== Volatility Clustering: Ljung-Box Test on Squared Returns ===")
print("(Tests whether squared returns are autocorrelated -- i.e., whether large moves cluster together)")
print("(H0: no autocorrelation. If p < 0.05: reject H0 = volatility clusters.)")
print()
header = (
    f"{'Ticker':6s}  {'LB(10)':>8s}  {'p-value':>8s}  {'LB(20)':>8s}  "
    f"{'p-value':>8s}  {'Clusters?':>10s}"
)
print(header)
print("-" * 56)

lag1_results = {}

for ticker in tickers:
    r = df[df["ticker"] == ticker]["return_daily"].dropna()
    sq = r ** 2

    lb = acorr_ljungbox(sq, lags=[10, 20], return_df=True)
    lb10_stat = lb.loc[10, "lb_stat"]
    lb10_p = lb.loc[10, "lb_pvalue"]
    lb20_stat = lb.loc[20, "lb_stat"]
    lb20_p = lb.loc[20, "lb_pvalue"]

    clusters = "YES" if lb10_p < 0.05 else "NO"

    print(
        f"{ticker:6s}  {lb10_stat:>8.1f}  {lb10_p:>8.4f}  {lb20_stat:>8.1f}  "
        f"{lb20_p:>8.4f}  {clusters:>10s}"
    )

    # Lag-1 autocorrelation of squared returns
    lag1_results[ticker] = sq.autocorr(lag=1)

# -- Lag-1 autocorrelation summary ----------------------------------------
print("\n=== Lag-1 Autocorrelation of Squared Returns ===")
for ticker in tickers:
    print(f"  {ticker}: {lag1_results[ticker]:.3f}")

Output:

SPY squared returns showing volatility clustering over 3 years
=== Volatility Clustering: Ljung-Box Test on Squared Returns ===
(Tests whether squared returns are autocorrelated -- i.e., whether large moves cluster together)
(H0: no autocorrelation. If p < 0.05: reject H0 = volatility clusters.)

Ticker    LB(10)   p-value    LB(20)   p-value   Clusters?
--------------------------------------------------------
AAPL       118.4    0.0000     122.1    0.0000         YES
MSFT        20.0    0.0296      48.5    0.0004         YES
NVDA         6.2    0.8017       8.0    0.9921          NO
TSLA        33.0    0.0003      46.0    0.0008         YES
XOM         94.8    0.0000     106.2    0.0000         YES
SPY        129.1    0.0000     131.1    0.0000         YES

=== Lag-1 Autocorrelation of Squared Returns ===
  AAPL: 0.193
  MSFT: -0.000
  NVDA: 0.050
  TSLA: 0.037
  XOM: 0.175
  SPY: 0.202

What this tells us

Five of six tickers exhibit statistically significant volatility clustering. SPY shows the strongest effect: Ljung-Box statistic of 129.1 (p=0.0000) and lag-1 autocorrelation of 0.202. This means knowing today’s squared return provides meaningful information about tomorrow’s — a direct violation of the i.i.d. assumption underlying basic portfolio theory. NVDA is the exception. Its Ljung-Box statistic at lag 10 is only 6.2 (p=0.80), failing to reject the null hypothesis of no autocorrelation. NVDA’s volatility does not cluster — large moves are not followed by large moves more often than random chance would predict. The lag-1 autocorrelation column provides a single-number summary: SPY at 0.202 and XOM at 0.175 have the strongest day-to-day clustering, while NVDA at 0.050 and TSLA at 0.037 have essentially none.

So what?

If volatility clusters, it is partially predictable — and models that ignore this leave information on the table. For risk management, use GARCH or exponentially weighted moving average (EWMA) models instead of a fixed historical volatility window. For options pricing, realized volatility estimates should overweight recent observations rather than treating all days equally. For portfolio construction, the Ljung-Box test is a diagnostic: run it on squared returns before assuming constant volatility. If the test rejects (p < 0.05), a time-varying volatility model is justified. If it does not — as with NVDA — simpler constant-volatility assumptions may be adequate.

Built with xfinlink — free financial data API for Python. pip install xfinlink
← All articles