
How Good Is a Stock Pick? Information Ratio and Tracking Error in Python

May 15, 2026

What’s the question?

Beating the benchmark is only meaningful if the outperformance is consistent. A stock that beats SPY by 20% one year but trails it by 25% the next shows large active returns in both directions, and therefore high tracking error: the volatility of the difference between the stock’s return and the benchmark’s return. The information ratio (IR) divides active return by tracking error to measure the risk-adjusted quality of the outperformance. As a rule of thumb, an IR above 0.5 suggests persistent outperformance and an IR below -0.5 suggests consistent underperformance; values in between are hard to distinguish from noise.
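To make the arithmetic concrete, here is a minimal sketch that applies the definition to the two-year example above. It assumes the two annual active returns are the only observations, averages them for the numerator, and uses their sample standard deviation as the tracking error; the variable names are illustrative only.

```python
from statistics import mean, stdev

# Hypothetical annual active returns (stock minus SPY) from the example:
# +20% in year one, -25% in year two.
active_returns = [0.20, -0.25]

avg_active = mean(active_returns)        # average active return per year
tracking_error = stdev(active_returns)   # sample std dev of active returns
information_ratio = avg_active / tracking_error

print(f"Average active return: {avg_active:+.1%}")
print(f"Tracking error:        {tracking_error:.1%}")
print(f"Information ratio:     {information_ratio:+.2f}")
```

The average active return is slightly negative while the tracking error is large, so the IR lands close to zero: the strong first year is not evidence of reliable outperformance.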

The approach

Treat 8 individual stocks as single-stock “portfolios” measured against SPY over 1 year. For each stock, compute the total return, the active return (total return minus SPY’s return), the tracking error (annualized standard deviation of daily active returns), and the information ratio (active return divided by tracking error). Also compute the hit rate — the percentage of trading days on which the stock outperformed SPY.

import xfinlink as xfl
import pandas as pd
import numpy as np

xfl.api_key = "YOUR_API_KEY"  # free at https://xfinlink.com/signup

# -- Configuration ----------------------------------------------------------
tickers = ["AAPL", "MSFT", "NVDA", "AMZN", "META", "XOM", "JNJ", "JPM"]
benchmark = "SPY"

# -- Fetch 1Y daily returns -------------------------------------------------
df = xfl.prices(tickers + [benchmark], period="1y", fields=["return_daily"])
pivot = df.pivot_table(index="date", columns="ticker", values="return_daily").dropna()

# -- Benchmark total return (daily returns compounded over the 1Y window) --
spy_total = (1 + pivot[benchmark]).prod() - 1
print("=== Information Ratio vs SPY (1Y) ===")
print(f"Benchmark (SPY) annualized return: {spy_total:.1%}")
print()

# -- Compute IR for each stock ----------------------------------------------
results = []

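for ticker in tickers:
    # Compounded return over the window, and its gap to the benchmark
    stock_total = (1 + pivot[ticker]).prod() - 1
    active_daily = pivot[ticker] - pivot[benchmark]
    active_return = stock_total - spy_total
    # Annualize the daily standard deviation with sqrt(252 trading days)
    tracking_error = active_daily.std() * np.sqrt(252)
    # IR is undefined when tracking error is zero; NaN avoids ranking it as 0
    ir = active_return / tracking_error if tracking_error > 0 else np.nan
    # Share of trading days on which the stock beat the benchmark
    hit_rate = (active_daily > 0).mean()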

    results.append({
        "ticker": ticker,
        "total_return": stock_total,
        "active_return": active_return,
        "tracking_error": tracking_error,
        "ir": ir,
        "hit_rate": hit_rate,
    })

rdf = pd.DataFrame(results).sort_values("ir", ascending=False)

header = f"{'Ticker':6s}  {'Return':>8s}  {'Active':>7s}  {'TE':>7s}  {'IR':>6s}  {'Hit%':>5s}"
print(header)
print("-" * 44)

for _, r in rdf.iterrows():
    print(
        f"{r['ticker']:6s}  {r['total_return']:>+7.1%}  {r['active_return']:>+6.1%}  "
        f"{r['tracking_error']:>6.1%}  {r['ir']:>+5.2f}  "
        f"  {r['hit_rate']:>3.0%}"
    )

print()
print("IR interpretation:")
print("  > +0.5: strong outperformance | -0.5 to +0.5: noise | < -0.5: consistent underperformance")

Output:

=== Information Ratio vs SPY (1Y) ===
Benchmark (SPY) annualized return: 28.4%

Ticker   Return   Active      TE      IR   Hit%
--------------------------------------------
NVDA    +84.6%  +36.4%  27.5%  +1.32    54%
JNJ     +60.1%  +22.1%  20.0%  +1.11    48%
AAPL    +44.2%  +11.6%  19.2%  +0.61    48%
XOM     +45.0%  +12.2%  27.8%  +0.44    53%
AMZN    +32.9%   +3.5%  24.7%  +0.14    51%
JPM     +15.5%  -10.6%  17.5%  -0.61    53%
META     -0.4%  -25.4%  30.5%  -0.83    46%
MSFT     -7.1%  -32.3%  21.2%  -1.53    46%

IR interpretation:
  > +0.5: strong outperformance | -0.5 to +0.5: noise | < -0.5: consistent underperformance

What this tells us

NVDA and JNJ have the strongest IRs (+1.32 and +1.11): both delivered substantial active returns relative to their tracking error. JNJ is the more remarkable case. It reached an IR of 1.11 with only 20% tracking error, so its outperformance came with comparatively moderate deviation from SPY. NVDA’s higher active return (+36.4%) came with higher tracking error (27.5%). The hit rate column adds nuance: JNJ beat SPY on only 48% of trading days despite having the second-highest IR. The information ratio reflects the size of wins relative to losses, not how often they occur.
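The gap between hit rate and IR is easy to reproduce with made-up numbers. The sketch below uses synthetic daily active returns, not market data, and takes the annualized arithmetic mean as the numerator, which is a simplification of the compounded active return used in the script above.

```python
import numpy as np

# Synthetic daily active returns (stock minus benchmark), not real data:
# 60 days of small losses followed by 40 days of larger gains.
active_daily = np.array([-0.001] * 60 + [0.003] * 40)

hit_rate = (active_daily > 0).mean()        # fraction of winning days
mean_annual = active_daily.mean() * 252     # annualized mean active return
# ddof=1 matches the sample standard deviation that pandas uses by default
tracking_error = active_daily.std(ddof=1) * np.sqrt(252)
information_ratio = mean_annual / tracking_error

print(f"Hit rate: {hit_rate:.0%}")
print(f"Information ratio: {information_ratio:+.2f}")
```

The series loses on most days yet has a positive IR, because the winning days are three times the size of the losing ones. The magnitude of the IR here is an artifact of the stylized inputs and should not be read as realistic.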

So what?

When evaluating whether the outperformance of a stock pick or fund reflects skill rather than luck, the information ratio is more informative than raw return. An IR above 0.5 sustained over 2–3 years is often cited as the level institutional allocators treat as evidence of skill; a single year, as measured here, is too short to support that conclusion on its own. For portfolio construction, prefer holdings with high IR and moderate tracking error over those with extreme returns but proportionally extreme deviation from the benchmark.
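One way to see why the measurement horizon matters is a standard approximation: if active returns are roughly independent from period to period, the t-statistic of the mean active return is about the annualized IR times the square root of the number of years. The helper below is a hypothetical illustration of that relationship, not part of xfinlink.

```python
import math

def ir_t_stat(information_ratio: float, years: float) -> float:
    """Approximate t-statistic of the mean active return.

    Assumes an annualized IR and independent, identically distributed
    active returns, which real return series only approximate.
    """
    return information_ratio * math.sqrt(years)

# The same IR of 0.5 observed over progressively longer windows
for years in (1, 3, 10):
    print(f"IR 0.5 over {years:>2} years -> t = {ir_t_stat(0.5, years):.2f}")
```

Under these assumptions the same IR becomes more convincing the longer it persists, which is why a one-year ranking like the one above is better treated as a starting point than a verdict.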

Built with xfinlink — free financial data API for Python. pip install xfinlink
