Hi Beppe,
Again, this has nothing to do with the version of Matlab you are using. You need to download and install the free package from spatial-econometrics.com and use the johansen function there in order to get the same results as I did.
Ernie
Hi,

I'm following your code IndexArb.m (pg. 97-98 of the book "Algorithmic Trading: winning strategies...").

I'm using Matlab 2016, so I had to change ONLY the line of code that runs the Johansen test.
I've followed the same model you suggested (include intercept but no trend, and lags=1).
So I replaced your code "results=johansen(y2, 0, 1);" with this:
"[h,pValue,stat,cValue,mles]=jcitest(y2, 'model', 'H1*','lags',1);"

Unfortunately, unlike your results (98 stocks cointegrated with the ETF), I never find a single stock cointegrated with it. It was so strange that I tested every Johansen test model.
Nothing changed, except that the quadratic trend model gave me 290 stocks cointegrated!
But it's clear that something is wrong. I'm sure I'm using the very same time series "inputDataOHLCDaily_stocks_20120424" and "inputData_ETF", as well as the same training date you selected. Going from 98 to 0 is something that worries me.

Beppe
It could still be profitable if the basket does not exactly replicate the ETF.

Ernie

Hi Ernie,

Love your blog. I have a question. If one has restrictions on their trading such that they must hold positions for at least a week, would trading a basket of underlying symbols against their ETF still be a profitable strategy, in general?
NT,
The process is the same for any basket, even if you impose the exclusion constraint. You don't need factor or PCA models.
Ernie

How would this process differ if you wanted to create a basket of stocks to pairs trade against a single security, with the constraint that the basket cannot contain the security you want to trade individually? Would you have to come up with a factor model and run a PCA to discover the underlying factors that drive correlation?

-NT
Jeet,
Ultimately, you should look at the Sharpe ratio of your strategy, irrespective of the R-squared you get from the linear regression.

Often, a poor fit can still result in a profitable strategy.

Ernie

1. In a cointegration relationship for pairs trading, we regress S1 ~ S2 without an intercept. But we know that regression without an intercept increases R-squared dramatically.

How should we handle this? If R-squared changes so dramatically, how can we say that the fit is not spurious?


2. We know that for non-stationary time series, R-squared is not a good measure of goodness of fit. What measure should I use for the regression, given that we do not use a constant in the regression equation?

Jeet
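
A minimal sketch of the R-squared issue raised in question 1, in Python with only the standard library. It is an illustration under stated assumptions, not anything taken from the book: the two series are synthetic, unrelated noise around price-like levels (50 and 100), and all function names are hypothetical. Without an intercept, R-squared is conventionally measured against zero rather than against the mean of y, so it comes out close to 1 whenever prices sit far from zero, even when the series have nothing to do with each other.

```python
import random

def fit_through_origin(x, y):
    """Least-squares slope b for y = b * x (no intercept)."""
    return sum(a * b for a, b in zip(x, y)) / sum(a * a for a in x)

def fit_with_intercept(x, y):
    """Least-squares (intercept, slope) for y = a + b * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def r_squared(y, fitted, centered):
    """1 - SS_res / SS_tot; SS_tot is taken about the mean if centered, else about zero."""
    base = sum(y) / len(y) if centered else 0.0
    ss_res = sum((a - f) ** 2 for a, f in zip(y, fitted))
    ss_tot = sum((a - base) ** 2 for a in y)
    return 1.0 - ss_res / ss_tot

# Synthetic, unrelated series (assumption): independent noise around two price levels.
random.seed(0)
x = [50.0 + random.gauss(0, 1) for _ in range(500)]
y = [100.0 + random.gauss(0, 1) for _ in range(500)]

b0 = fit_through_origin(x, y)
r2_no_intercept = r_squared(y, [b0 * a for a in x], centered=False)

a1, b1 = fit_with_intercept(x, y)
r2_with_intercept = r_squared(y, [a1 + b1 * a for a in x], centered=True)

print(f"no intercept:   slope={b0:.3f}  R^2={r2_no_intercept:.4f}")
print(f"with intercept: slope={b1:.3f}  R^2={r2_with_intercept:.4f}")
```

The two R-squared values are measured against different baselines, so they are not comparable. That is consistent with the advice above to judge the strategy by its Sharpe ratio rather than by the regression fit.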
Joe,
It may. No one can tell in advance, and you should just run the experiment yourself. I haven't seen any papers on this technique for finding the best basket either.
Ernie

So if I keep the volatility fixed and minimize the convergence time, or vice versa, do you think this objective function will outperform linear regression?
Are there any papers on this topic?

Thank you
bottleneck56 at gmail.com
Joe,
I am not sure how you can have an objective function that does two things at once (maximize volatility AND minimize convergence time). At most, you can use one criterion as a constraint (fixed to a value), and use the other as objective.
Ernie

Using linear/ridge regression to generate the ratios for a basket of stocks means that the mean squared error between the basket and the index is minimized.

However, the objective function should maximize the spread volatility while minimizing the convergence time.

Are there any studies or papers on this topic?

Thank you(Joe bottleneck56 at gmail.com)
Hi Anon,
There are 2 different uses of the word "lag".

If you mean the lag used as a parameter in the cadf test, then I believe the result is not very sensitive to whether you use lag=0, 1, or 2.

However, I think you actually mean "lookback", which is the length of the data set used for the cadf test. In this case, it should be at least 1 year, but 3 years is ideal.
Ernie

Hi Ernie - great job with the blog.

I had a simple question for you.

When you run the cointegration tests, what lag should you use? I've been doing a lot of reading about this and it's quite debatable. Mean reversion occurs at various time frequencies, i.e. minutes, hours, days, weeks, months. So in that sense one could test using various different lags, no? I notice in some of your comments you use lag=0. I would greatly appreciate some guidance on this issue.

Kind Regards,
Sam
Hi Anon,
IB's commission is about 0.5 cent per share, and can be lower for large volume.

You can scale your components and the ETF by any amount you like so as to trade round lots.

$1 stddev seems pretty reasonable.
Ernie

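
The scaling to round lots can be sketched as follows, in Python. The hedge ratios and tickers are made up for illustration, the 100-share lot size is an assumption, and the function name is hypothetical; the 0.5 cent per share commission is the approximate figure quoted above.

```python
def scale_to_round_lots(shares_per_etf_share, etf_shares, lot_size=100):
    """Scale per-ETF-share hedge ratios up to etf_shares and round each
    component to the nearest multiple of lot_size."""
    return {
        symbol: round(ratio * etf_shares / lot_size) * lot_size
        for symbol, ratio in shares_per_etf_share.items()
    }

# Hypothetical hedge ratios: shares of each component per share of the ETF.
ratios = {"AAA": 0.37, "BBB": 1.12}
etf_shares = 1000  # already a multiple of the assumed lot size

positions = scale_to_round_lots(ratios, etf_shares)
total_shares = etf_shares + sum(positions.values())
commission = 0.005 * total_shares  # about 0.5 cent per share, one way

print(positions)
print(f"total shares: {total_shares}, estimated commission: ${commission:.2f}")
```

Rounding distorts the hedge ratios slightly; a larger scale factor makes that distortion proportionally smaller.
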
Hi Ernie,

I had a question regarding your statement that you can boost this return by using options to implement the XLE position. Is this always true for one side of the cointegrating pair? Could we also substitute options for stocks on both sides of a pair?

I didn't think that a cointegration test for equities would also translate into a strategy for options. I thought cointegration tells us about a linear combination of two stock prices while options are bets on individual stock prices.

Thanks
Anonymous,
Thanks for your kind words on my book.

a) Certainly the more stocks you include in your basket, the shorter the half-life. The number of stocks to include depends on your desired holding period. You can backtest the strategy to find that out.

b) Yes, hedge ratio is the coefficient from a linear regression.

Ernie

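
For point (b), and for the half-life mentioned in point (a), here is a small Python sketch using only the standard library. It assumes the common approach of estimating the half-life by regressing the change in the spread on its lagged level and taking -ln(2)/lambda; the thread itself does not spell this out. The data are synthetic and the function names are hypothetical.

```python
import math
import random

def ols(x, y):
    """Least-squares (intercept, slope) for y = a + b * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def half_life(spread):
    """Half-life of mean reversion from the regression
    spread[t] - spread[t-1] = c + lam * spread[t-1] + noise."""
    lagged = spread[:-1]
    delta = [b - a for a, b in zip(spread[:-1], spread[1:])]
    _, lam = ols(lagged, delta)
    if lam >= 0:
        raise ValueError("spread does not look mean-reverting")
    return -math.log(2) / lam

# Synthetic data (assumption): the ETF follows a random walk, and the basket
# equals 2 * ETF plus an AR(1) spread with coefficient 0.9.
random.seed(1)
etf, basket = [], []
price, s = 100.0, 0.0
for _ in range(2000):
    price += random.gauss(0, 1)
    s = 0.9 * s + random.gauss(0, 1)
    etf.append(price)
    basket.append(2.0 * price + s)

# Hedge ratio = slope of the regression of the basket on the ETF.
intercept, hedge_ratio = ols(etf, basket)
spread = [b - hedge_ratio * e - intercept for e, b in zip(etf, basket)]
print(f"hedge ratio ~ {hedge_ratio:.3f}, half-life ~ {half_life(spread):.1f} bars")
```

Running the same two functions on baskets of different sizes is one way to check how the half-life changes with the number of stocks before backtesting.
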
Ernest,

I thoroughly enjoyed your book, but I would like to bring up my issues with this strategy.

If I can again bring up the example of trading a basket of N-1 stocks against an index with N components. The deviations of your basket from the index will be due to the component that you have NOT included in your basket, say component X. I accept that the spread between the index and your basket will be stationary because the component X is in the same sector.

The problem that I can't put aside is the chance that the one component that we didn't include in our basket goes through an M&A. I can imagine myself waking up one day to the news that component X is merging with component Y (the leader) and its share price is about to open at a price that will wipe out all of my profits for the last year. This is something that your backtests and cointegration tests will not prepare you for. In your book you discuss the illogicality of using stop losses with mean reversion strategies (which I totally agree with), but at least stop losses put a limit on the amount of money you can lose on any one trade.

To sum up, I just think that this strategy lacks some serious risk management. The example is also quite conservative given that you trade a basket of 10 stocks in 33, leaving 23 component X's to stress about.

Jeff
Anonymous,
Thanks for your kind words.
Yes, the more stocks you include in a basket, the shorter the half-life of mean-reversion to the corresponding ETF. However, it also implies smaller returns since cointegration is likely to be very tight. It may be a good candidate for high frequency trading.
Ernie

Hi ezbentley,
I stand by my argument that relative weights of the components are important to whether a basket cointegrates with an index, contrary to what you argued.

The N-1 stock basket can cointegrate with an index, but the 1 stock basket will not, because in the 1 stock basket, that stock suddenly has weight of 1.
Ernie

Hi Anonymous,
If you believe that AUD cointegrates with a basket of currencies, then yes, your trade makes sense.
Ernie