What is the difference between purging and embargoing?

Purging removes from the training set any observation whose label is formed over a time range that overlaps the time range in which a test-set label is formed. This ensures that the algorithm cannot learn, during training, information that will later be used to assess its performance. Embargoing is an additional step that removes from the training set a small percentage of observations immediately following the test set. This guards against a more subtle form of leakage caused by serial correlation, such as market reaction lag or downstream dependencies.
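As a minimal sketch of both steps, the function below works on integer bar positions instead of timestamps. The function name, the `label_spans` layout, and the `embargo_pct` parameter are assumptions made for illustration, not part of vectorbt or any other library.

```python
def purged_embargoed_train_indices(label_spans, test_start, test_end, embargo_pct=0.01):
    """Return training indices after purging and embargoing (illustrative helper).

    label_spans: list of (t0, t1) bar positions; observation i is made at t0
                 and its label is fully known at t1 (t1 >= t0).
    test_start, test_end: inclusive indices of one contiguous test block.
    embargo_pct: fraction of all observations dropped right after the test block.
    """
    n = len(label_spans)
    embargo = int(n * embargo_pct)

    # Time range over which the test labels are formed.
    test_t0 = min(label_spans[i][0] for i in range(test_start, test_end + 1))
    test_t1 = max(label_spans[i][1] for i in range(test_start, test_end + 1))

    train = []
    for i, (t0, t1) in enumerate(label_spans):
        if test_start <= i <= test_end:
            continue  # this observation belongs to the test set
        if t0 <= test_t1 and t1 >= test_t0:
            continue  # purge: label window overlaps the test label range
        if test_t1 < t0 <= test_t1 + embargo:
            continue  # embargo: starts too soon after the test range ends
        train.append(i)
    return train


# 20 observations, each label formed over the following 2 bars.
spans = [(i, i + 2) for i in range(20)]
train = purged_embargoed_train_indices(spans, test_start=8, test_end=11, embargo_pct=0.1)
print(train)  # [0, 1, 2, 3, 4, 5, 16, 17, 18, 19]
```

Here observations 6, 7, 12 and 13 are purged because their label windows overlap the test range, and 14 and 15 fall inside the two-bar embargo.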

Why is standard k-fold cross-validation bad for trading?

Standard k-fold cross-validation assumes that observations are independently and identically distributed (IID). Financial time series violate this assumption: today's price is highly correlated with yesterday's price, and labels are often computed over windows that overlap in time. If you shuffle the data or split it randomly, the training folds end up containing observations that come after, or overlap with, the test observations. This is look-ahead bias: the model learns from future data. The result is overly optimistic performance estimates that do not hold up in live trading.
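The effect can be illustrated with a small counting experiment. The sketch below assumes each label is formed over the next 5 bars (a made-up horizon) and counts how many training observations have a label window overlapping a test label window, first for a random split and then for a contiguous test block. All names and sizes are hypothetical.

```python
import random

HORIZON = 5   # assumed: each label is formed over the next 5 bars
N_OBS = 100   # assumed number of observations
N_TEST = 20   # assumed test-set size


def leaking_train_count(test_idx, n_obs=N_OBS, horizon=HORIZON):
    """Count training observations whose label window [i, i + horizon]
    overlaps the label window of at least one test observation."""
    test = set(test_idx)
    count = 0
    for i in range(n_obs):
        if i in test:
            continue
        # Windows [i, i+h] and [j, j+h] overlap exactly when |i - j| <= h.
        if any(abs(i - j) <= horizon for j in test):
            count += 1
    return count


rng = random.Random(0)
shuffled_test = rng.sample(range(N_OBS), N_TEST)   # random split
contiguous_test = list(range(40, 40 + N_TEST))     # one block, as in time-ordered k-fold

shuffled_leaks = leaking_train_count(shuffled_test)
contiguous_leaks = leaking_train_count(contiguous_test)
print("random split:", shuffled_leaks, "contiguous block:", contiguous_leaks)
```

Under these assumptions the contiguous block leaks only through the 10 observations at its two boundaries, which are the ones purging removes, while a random split scatters test points so that most training observations overlap one.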

How do I interpret the distribution of results from purged k-fold?

Instead of looking at a single Sharpe ratio or total return, you should look at the distribution of these metrics across all splits. A narrow distribution with a high mean suggests a robust strategy that works across different market regimes. A wide distribution (one with a high standard deviation) indicates that the strategy is sensitive to specific market conditions and may be overfit. You can use the distribution to estimate confidence intervals and the probability of the strategy being profitable out-of-sample.
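One rough way to summarize such a distribution is sketched below using only the standard library. The Sharpe values are made-up placeholders, not results from any strategy, and the confidence interval uses a normal approximation that treats the splits as independent, which is an optimistic assumption when splits share data.

```python
import statistics
from math import sqrt


def summarize_splits(values, z=1.96):
    """Summarize a per-split metric (hypothetical helper).

    Returns the mean, sample standard deviation, an approximate 95%
    confidence interval for the mean, and the share of positive splits.
    """
    n = len(values)
    mean = statistics.mean(values)
    std = statistics.stdev(values)
    half_width = z * std / sqrt(n)
    return {
        "mean": mean,
        "std": std,
        "ci_low": mean - half_width,
        "ci_high": mean + half_width,
        "frac_positive": sum(v > 0 for v in values) / n,
    }


# Placeholder per-split Sharpe ratios, for illustration only.
split_sharpes = [0.5, 1.0, 1.5, -0.5, 1.0]
summary = summarize_splits(split_sharpes)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

The share of positive splits is a crude empirical stand-in for the probability of out-of-sample profitability. With only a handful of splits, both it and the interval should be read as indicative at best.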

Can I use purged k-fold with NinjaTrader 8?

NinjaTrader 8 does not natively support purged k-fold cross-validation. Its backtesting engine is designed for single-path historical simulation. To use purged k-fold, you must export your data and indicators from NinjaTrader 8 using tools like the Exporter and then run the validation in a Python environment using VectorBT. This hybrid approach allows you to leverage the execution capabilities of NinjaTrader 8 and the statistical rigor of VectorBT.

What is the computational cost of purged k-fold?

The computational cost is significantly higher than standard backtesting because you are running the strategy multiple times with different data splits. Plain purged k-fold needs one run per fold, so its cost grows linearly with the number of folds. The combinatorial variant, which tests on every combination of fold groups, grows combinatorially. However, VectorBT's vectorized architecture and Numba acceleration can make it feasible to run thousands of splits in seconds. For example, the VectorBT project reports filling 1,000,000 orders in 70-100ms on an Apple M1, which puts it among the faster backtesting engines available for this type of analysis.
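To make the growth concrete, the sketch below counts the number of train/test splits under two schemes. It assumes that "combinatorially" refers to combinatorial purged cross-validation, where the data is cut into N groups and every choice of k test groups is evaluated. The function names are hypothetical.

```python
from math import comb


def n_splits_purged_kfold(n_folds):
    """Plain purged k-fold: each fold serves as the test set exactly once."""
    return n_folds


def n_splits_combinatorial(n_groups, n_test_groups):
    """Combinatorial variant (assumed): every way of choosing the test groups."""
    return comb(n_groups, n_test_groups)


for n in (6, 10, 20):
    print(
        f"groups={n:2d}  k-fold splits={n_splits_purged_kfold(n):2d}  "
        f"combinatorial (2 test groups)={n_splits_combinatorial(n, 2):3d}"
    )
```

With two test groups the count is 15, 45 and 190 for 6, 10 and 20 groups. Choosing more test groups grows it faster still: 10 groups with 5 test groups already gives 252 splits.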

Vectorbt Walk-Forward: Implementing Purged

By Trader Algorítmico (Equipo Editorial) | Published: January 24, 2026

What if your backtest results are lying to you because your training data peeked at the future? This is the silent killer of algorithmic strategies, but vectorbt offers a specific solution to stop this leakage before it ruins your edge.

Most traders run standard backtests and assume the numbers are real. In reality, financial time series data violates the independence assumption required by traditional statistical methods. When you test a strategy on data that overlaps with its training period, you create look-ahead bias. This leads to inflated performance metrics that vanish the moment you go live.

Key fact: Standard k-fold cross-validation fails in finance because it assumes observations are independently and identically distributed (IID), which is rarely true for time series data where labels depend on future events.

This article explores how to implement purged k-fold cross-validation to generate realistic performance distributions. We will move beyond single-point estimates and build robust validation pipelines that account for market structure and temporal dependencies.
