Data leakage in threshold_strategy

Question

Data leakage in threshold_strategy

mvinoba opened this issue 2 years ago · 1 comments

In threshold_strategy the spread is calculated as follows:

# calculate normalized spread
spread = y - beta * x
norm_spread = (spread - spread.mean()) / np.std(spread)
norm_spread = np.asarray(norm_spread.values)

In this case, the longs and shorts entry and exit positions are calculated using unseen data, or am I missing something?

Answer 1 · 2023-09-28T12:19:06.000Z

I'm pretty sure you are right, unfortunately this minor oversight invalidates all the results. Finance is very unforgiving :(