Econometrics Beat: Dave Giles' Blog: ARDL Models - Part II

Wednesday, June 19, 2013

ARDL Models - Part II - Bounds Tests

[Note: For an important update of this post, relating to EViews 9, see my 2015 post, here.]

Well, I finally got it done! Some of these posts take more time to prepare than you might think.

The first part of this discussion was covered in a (sort of!) recent post, in which I gave a brief description of Autoregressive Distributed Lag (ARDL) models, together with some historical perspective. Now it's time for us to get down to business and see how these models have come to play a very important role recently in the modelling of non-stationary time-series data.

In particular, we'll see how they're used to implement the so-called "Bounds Tests", to see if long-run relationships are present when we have a group of time-series, some of which may be stationary, while others are not. A detailed worked example, using EViews, is included.

First, recall that the basic form of an ARDL regression model is:

y_t = β₀ + β₁y_t-1 + .......+ β_ky_t-p + α₀x_t + α₁x_t-1 + α₂x_t-2 + ......... + α_qx_t-q + ε_t, (1)

where ε_t is a random "disturbance" term, which we'll assume is "well-behaved" in the usual sense. In particular, it will be serially independent.

We're going to modify this model somewhat for our purposes here. Specifically, we'll work with a mixture of differences and levels of the data. The reasons for this will become apparent as we go along.

Let's suppose that we have a set of time-series variables, and we want to model the relationship between them, taking into account any unit roots and/or cointegration associated with the data. First, note that there are three straightforward situations that we're going to put to one side, because they can be dealt with in standard ways:

We know that all of the series are I(0), and hence stationary. In this case, we can simply model the data in their levels, using OLS estimation, for example.
We know that all of the series are integrated of the same order (e.g., I(1)), but they are not cointegrated. In this case, we can just (appropriately) difference each series, and estimate a standard regression model using OLS.
We know that all of the series are integrated of the same order, and they are cointegrated. In this case, we can estimate two types of models: (i) An OLS regression model using the levels of the data. This will provide the long-run equilibrating relationship between the variables. (ii) An error-correction model (ECM), estimated by OLS. This model will represent the short-run dynamics of the relationship between the variables.

Now, let's return to the more complicated situation mentioned above. Some of the variables in question may be stationary, some may be I(1) or even fractionally integrated, and there is also the possibility of cointegration among some of the I(1) variables. In other words, things just aren't as "clear cut" as in the three situations noted above.

What do we do in such cases if we want to model the data appropriately and extract both long-run and short-run relationships? This is where the ARDL model enters the picture.

The ARDL / Bounds Testing methodology of Pesaran and Shin (1999) and Pesaran et al. (2001) has a number of features that many researchers feel give it some advantages over conventional cointegration testing. For instance:

It can be used with a mixture of I(0) and I(1) data.
It involves just a single-equation set-up, making it simple to implement and interpret.
Different variables can be assigned different lag-lengths as they enter the model.

We need a road map to help us. Here are the basic steps that we're going to follow (with details to be added below):

Make sure than none of the variables are I(2), as such data will invalidate the methodology.
Formulate an "unrestricted" error-correction model (ECM). This will be a particular type of ARDL model.
Determine the appropriate lag structure for the model in step 2.
Make sure that the errors of this model are serially independent.
Make sure that the model is "dynamically stable".
Perform a "Bounds Test" to see if there is evidence of a long-run relationship between the variables.
If the outcome at step 6 is positive, estimate a long-run "levels model", as well as a separate "restricted" ECM.
Use the results of the models estimated in step 7 to measure short-run dynamic effects, and the long-run equilibrating relationship between the variables.

We can see from the form of the generic ARDL model given in equation (1) above, that such models are characterised by having lags of the dependent variable, as well as lags (and perhaps the current value) of other variables, as the regressors. Let's suppose that there are three variables that we're interested in modelling: a dependent variable, y, and two other explanatory variables, x₁ and x₂. More generally, there will be (k + 1) variables - a dependent variable, and k other variables.

Before we start, let's recall what a conventional ECM for cointegrated data looks like. It would be of the form:

Δy_t = β₀ + Σ β_iΔy_t-i + Σγ_jΔx_1t-j + Σδ_kΔx_2t-k + φz_t-1 + e_t; (2)

Here, z, the "error-correction term", is the OLS residuals series from the long-run "cointegrating regression",

y_t = α₀ + α₁x_1t + α₂x_2t + v_t ; (3)

The ranges of summation in (2) are from 1 to p, 0 to q₁, and 0 to q₂ respectively.

Now, back to our own analysis-

Step 1:
We can use the ADF and KPSS tests to check that none of the series we're working with are I(2).

Step 2:

Formulate the following model:

Δy_t = β₀ + Σ β_iΔy_t-i + Σγ_jΔx_1t-j + Σδ_kΔx_2t-k + θ₀y_t-1 + θ₁x_1t-1 + θ₂ x_2t-1 + e_t; (4)

Notice that this is almost like a traditional ECM. The difference is that we've now replaced the error-correction term, z_t-1 with the terms y_t-1, x_1t-1, and x_2t-1. From (3), we can see that the lagged residuals series would be z_t-1 = (y_t-1 - a₀ - a₁x_1t-1 - a₂x_2t-1), where the a's are the OLS estimates of the α's. So, what we're doing in equation (4) is including the same lagged levels as we do in a regular ECM, but we're not restricting their coefficients.

This is why we might call equation (4) an "unrestricted ECM", or an "unconstrained ECM". Pesaran et al. (2001) call this a "conditional ECM".

Step 3:

The ranges of summation in the various terms in (4) are from 1 to p, 0 to q₁, and 0 to q₂ respectively.We need to select the appropriate values for the maximum lags, p, q₁, and q₂. Also, note that the "zero lags" on Δx₁ and Δx₂ may not necessarily be needed. Usually, these maximum lags are determined by using one or more of the "information criteria" - AIC, SC (BIC), HQ, etc. These criteria are based on a high log-likelihood value, with a "penalty" for including more lags to achieve this. The form of the penalty varies from one criterion to another. Each criterion starts with -2log(L), and then penalizes, so the smaller the value of an information criterion the better the result.

I generally use the Schwarz (Bayes) criterion (SC), as it's a consistent model-selector. Some care has to be taken not to "over-select" the maximum lags, and I usually also pay some attention to the (apparent) significance of the coefficients in the model.

Step 4:
A key assumption in the ARDL / Bounds Testing methodology of Pesaran et al. (2001) is that the errors of equation (4) must be serially independent. As those authors note (p.308), this requirement may also be influential in our final choice of the maximum lags for the variables in the model.

Once an apparently suitable version of (4) has been estimated, we should use the LM test to test the null hypothesis that the errors are serially independent, against the alternative hypothesis that the errors are (either) AR(m) or MA(m), for m = 1, 2, 3,...., etc.

Step 5:
We have a model with an autoregressive structure, so we have to be sure that the model is "dynamically stable". For full details of what this means, see my recent post, When is an Autoregressive Model Dynamically Stable? What we need to do is to check that all of the inverse roots of the characteristic equation associated with our model lie strictly inside the unit circle. That recent post of mine showed how to trick EViews into giving us the information we want in order to check that this condition is satisfied. I won't repeat that here.

Step 6:
Now we're ready to perform the "Bounds Testing"!

Here's equation (4), again:

Δy_t = β₀ + Σ β_iΔy_t-i + Σγ_jΔx_1t-j + Σδ_kΔx_2t-k + θ₀y_t-1 + θ₁x_1t-1 + θ₂ x_2t-1 + e_t; (4)

All that we're going to do is preform an "F-test" of the hypothesis, H₀: θ₀= θ₁ = θ₂ = 0 ; against the alternative that H₀is not true. Simple enough - but why are we doing this?

As in conventional cointegration testing, we're testing for the absence of a long-run equilibrium relationship between the variables. This absence coincides with zero coefficients for y_t-1, x_1t-1 and x_2t-1 in equation (4). A rejection of H₀ implies that we have a long-run relationship.

There is a practical difficulty that has to be addressed when we conduct the F-test. The distribution of the test statistic is totally non-standard (and also depends on a "nuisance parameter", the cointegrating rank of the system) even in the asymptotic case where we have an infinitely large sample size. (This is somewhat akin to the situation with the Wald test when we test for Granger non-causality in the presence of non-stationary data. In that case, the problem is resolved by using the Toda-Yamamoto (1995) procedure, to ensure that the Wald test statistic is asymptotically chi-square, as discussed here.)

Exact critical values for the F-test aren't available for an arbitrary mix of I(0) and I(1) variables. However, Pesaran et al. (2001) supply bounds on the critical values for the asymptotic distribution of the F-statistic. For various situations (e.g., different numbers of variables, (k + 1)), they give lower and upper bounds on the critical values. In each case, the lower bound is based on the assumption that all of the variables are I(0), and the upper bound is based on the assumption that all of the variables are I(1). In fact, the truth may be somewhere in between these two polar extremes.

If the computed F-statistic falls below the lower bound we would conclude that the variables are I(0), so no cointegration is possible, by definition. If the F-statistic exceeds the upper bound, we conclude that we have cointegration. Finally, if the F-statistic falls between the bounds, the test is inconclusive.

Does this remind you of the old Durbin-Watson test for serial independence? It should!

As a cross-check, we should also perform a "Bounds t-test" of H₀: θ₀= 0, against H₁: θ₀< 0. If the t-statistic for y_t-1in equation (4) is greater than the "I(1) bound" tabulated by Pesaran et al. (2001; pp.303-304), this would support the conclusion that there is a long-run relationship between the variables. If the t-statistic is less than the "I(0) bound", we'd conclude that the data are all stationary.

Step 7:
Assuming that the bounds test leads to the conclusion of cointegration, we can meaningfully estimate the long-run equilibrium relationship between the variables:

y_t = α₀ + α₁x_1t + α₂x_2t + v_t ; (5)

as well as the usual ECM:

Δy_t = β₀ + Σ β_iΔy_t-i + Σγ_jΔx_1t-j + Σδ_kΔx_2t-k + φz_t-1 + e_t; (6)

where z_t-1 = (y_t-1 -a₀ - a₁x_1t-1 - a₂x_2t-1), and the a's are the OLS estimates of the α's in (5).

Step 8:
We can "extract" long-run effects from the unrestricted ECM. Looking back at equation (4), and noting that at a long-run equilibrium, Δy_t = 0, Δx_1t= Δx_2t= 0, we see that the long-run coefficients for x₁ and x₂ are -(θ₁/ θ₀) and -(θ₂/ θ₀) respectively.

An Example:
Now we're ready to look at a very simple empirical example. I'm going to use the data for U.S. and European natural gas prices that I made available as a second example in my post, Testing for Granger Causality. I didn't go through the details of testing for Granger causality with that set of data, but I mentioned near the end of the post, and the EViews file (which included a "read_me" object with comments about the results) is there on the code page for this blog (dated 29 April, 2011).

If you look back at that earlier file, you'll find that I used the Toda-Yamamoto (1995) testing procedure to determine that there is Granger causality running from the U.S. series to the European series, but not vice versa.

A new EViews file that uses the same data for our ARDL modelling is available on the code page, under the date for the current post. The data for the two time-series we'll be using are also available on the data page for this blog. The data are monthly, from 1995(01) to 2011(03). In terms of the notation that was introduced earlier, we have (k + 1) = 2 variables, so k = 1 when it comes to the bounds testing.

Here's a plot of the data we'll be using (remember that you can enlarge most of these inserts by clicking on them):

To complete Step 1, we need to check that neither of our time-series are I(2). Applying the ADF test to the levels of EUR and US, the p-values are 0.53 and 0.10 respectively. Applying the test to the first-differences of the series, the p-values are both 0.00. (The lag-lengths for the ADF regressions were chosen using the Schwarz criterion, SC.) Clearly, neither series is I(2).

Applying the KPSS test we reject the null of stationarity, even at the 1% significance level, for both EUR and US, but cannot reject the null of I(1) against I(2). The p-value of 10% for the ADF test of I(1) vs. I(0) for the EUR series may leave us wondering if that series is stationary, or not. You'll know that apparent "conflicts" between the outcomes of tests such as these are very common in practice.

This is a great illustration of how the ARDL / Bounds Testing methodology can help us. In order for standard cointegration testing (such as that of Engle and Granger, or Johansen) to make any sense, we must be really sure that all of the series are integrated of the same order. In this instance, you might not be feeling totally sure that this is the case.

Step 2 is straightforward. Given that the Granger causality testing associated with my earlier post suggested that there is causality from US to EUR (but not vice versa), ΔEUR is going to be the dependent variable in my unrestricted ECM:

ΔEUR_t = β₀ + Σ β_iΔEUR_t-i + Σγ_jΔUS_t-j + θ₀EUR_t-1 + θ₁US_t-1 + e_t; (5)

That's Step 2 out of the way!

To implement the information criteria for selecting the lag-lengths in an time-efficient way, I "tricked" EViews into providing lots of them at once by doing the following. I estimated a 1-equation VAR model for ΔEUR_t and I supplied the intercept, EUR_t-1, US_t-1, and a fixed number of lags of ΔUS_tas exogenous regressors. For example, when the fixed number of lags on ΔUS_twas zero, here's how I specified the VAR:

After estimating this model, I then chose VIEW, LAG STRUCTURE, LAG LENGTH CRITERIA:

I then repeated this by adding ΔUS_t-1to the list of exogenous variables, and got the following results:

I proceeded in this manner with additional lags of ΔUS_tin the "exogenous" list. I also considered cases such as:

which resulted in the following information criteria values:

Looking at the SC values in these three tables of results, we see that a maximum lag of 4 is suggested for ΔEUR_t. (The AIC values suggest that 8 lags of ΔEUR_tmay be appropriate, but some experimentation with this was not fruitful.)

There is virtually no difference between the SC values for the case where the model includes just US_tas a regressor (0.8714), and the case where just ΔUS_t-1is included (0.8718). To get some dynamics into the model, I'm going to go with the latter case.

With Step 3 completed, and with this lag specification in mind, let's now look at the estimated unrestricted ECM:

Step 4 involves checking that the errors of this model are serially independent. Selecting VIEW, RESIDUAL DIAGNOSTICS, SERIAL CORRELATION LM TEST, I get the following results:

m LM p-value

1 0.079 0.779

2 2.878 0.237

3 5.380 0.146

4 11.753 0.019

O.K., we have a problem with serial correlation! To deal with it, I experimented with one or two additional lags of the dependent variable as regressors, and ended up with the following specification for the unrestricted ECM:

The serial independence results now look much more satisfactory:

m LM p-value

1 0.013 0.911

2 3.337 0.189

3 5.183 0.159

4 7.989 0.092

5 8.473 0.132

6 11.023 0.088

7 12.270 0.092

8 12.334 0.137

Next, Step 5 involves checking the dynamic stability of this ARDL model. Here are the inverse roots of the associated characteristic equation:

All seems to be well - these roots are all inside the unit circle.

Before proceeding to the Bounds Testing, let's take a look at the "fit" of our unrestricted ECM. The "Actual / Fitted / Residuals" plot looks like this:

When we "unscramble" these results, and look at the fit of the model in terms of explaining the level of EUR itself, rather than ΔEUR, things look pretty good:

We're now ready for Step 6 - the Bounds Test itself. We want to test if the coefficients of both EUR(-1) and US(-1) are zero in our estimated model (repeated below):

The associated F-test is obtained as follows:

With the result:

The value of our F-statistic is 5.827, and we have (k + 1) = 2 variables (EUR and US) in our model. So, when we go to the Bounds Test tables of critical values, we have k = 1.

Table CI (iii) on p.300 of Pesaran et al. (2001) is the relevant table for us to use here. We haven't constrained the intercept of our model, and there is no linear trend term included in the ECM. The lower and upper bounds for the F-test statistic at the 10%, 5%, and 1% significance levels are [4.04 , 4.78], [4.94 , 5.73], and [6.84 , 7.84] respectively.

As the value of our F-statistic exceeds the upper bound at the 5% significance level, we can conclude that there is evidence of a long-run relationship between the two time-series (at this level of significance or greater).

In addition, the t-statistic on EUR(-1) is -2.926. When we look at Table CII (iii) on p.303 of Pesaran et al. (2001), we find that the I(0) and I(1) bounds for the t-statistic at the 10%, 5%, and 1% significance levels are [-2.57 , -2.91], [-2.86 , -3.22], and [-3.43 , -3.82] respectively. At least at the 10% significance level, this result reinforces our conclusion that there is a long-run relationship between EUR and US.

So, here we are at Step 7 and Step 8.

Recalling our preferred unrestricted ECM:

we see that the long-run multiplier between US and EUR is -(0.047134 / (-0.030804)) = 1.53. In the long run, an increase of 1 unit in US will lead to an increase of 1.53 units in EUR.

If we estimate the levels model,

EUR_t = α₀ + α₁US_t + v_t,

by OLS, and construct the residuals series, {z_t}, we can fit a regular (restricted) ECM:

Notice that the coefficient of the error-correction term, z_t-1, is negative and very significant. This is what we'd expect if there is cointegration between EUR and US. The magnitude of this coefficient implies that nearly 3% of any disequilibrium between EUR and US is corrected within one period (one month).

This final ECM is dynamically stable:

As none of the roots lie on the X (real) axis, it's clear that we have three complex conjugate pairs of roots. Accordingly, the short-run dynamics associated with the model are quite complicated. This can be seen if we consider the impulse response function associated with a "shock" of one (sample) standard deviation:

Finally, the within-sample fit (in terms of the levels of EUR) is exceptionally good:

In fact, the simple correlations between EUR and the "fitted" EUR series from the unrestricted and regular ECM's are each 0.994, and the correlation between the two fitted series is 0.9999.

So, there we have it - bounds testing with an ARDL model.

[Note: For an important update of this post, relating to EViews 9, see my 2015 post, here.]

References

Pesaran, M. H. and Y. Shin, 1999. An autoregressive distributed lag modelling approach to cointegration analysis. Chapter 11 in S. Strom (ed.), Econometrics and Economic Theory in the 20th Century: The Ragnar Frisch Centennial Symposium. Cambridge University Press, Cambridge. (Discussion Paper version.)

Pesaran, M. H., Shin, Y. and Smith, R. J., 2001. Bounds testing approaches to the analysis of level relationships. Journal of Applied Econometrics, 16, 289–326.

Pesaran, M. H. and R. P. Smith, 1998. Structural analysis of cointegrating VARs. Journal of Economic Surveys, 12, 471-505.

Toda, H. Y and T. Yamamoto (1995). Statistical inferences in vector autoregressions with possibly integrated processes. Journal of Econometrics, 66, 225-250.

363 comments:

AnonymousJune 20, 2013 at 7:52 AM
Great post! What is the preferred method for computing z? You show using a levels-on-levels OLS, but you also mention that the long run parameters can be estimated from the unrestricted ECM (i.e. the ARDL model used for the bounds test). Is it appropriate to use these implied parameters to construct z as well?
ReplyDelete
Replies
AnonymousJune 20, 2013 at 4:11 PM
Excellent post! Thank you for your time and effort in putting
them all together with an example.
ReplyDelete
Replies
AnonymousJune 21, 2013 at 5:14 AM
I was looking for a way to perform this in Eviews and with no help on google, I have to use trial version of limdep. Anyway, thank you very much for that.
ReplyDelete
Replies
AnonymousJune 21, 2013 at 9:39 AM
Thank you Professor Dave for the Eviews solution. You have used here data of monthly frequency with the span of less than 20 years. I was wondering whether we can use data of less than 20 years for cointegration (is there a rule of thumb on this?)and that whether using the data of monthly frequency produces different results when compared to the data using annual or quarterly frequency (I have read in some old papers that frequency doesn't matter for cointegration so just wanted to make sure whether this still holds).
ReplyDelete
Replies
nottrampisJune 21, 2013 at 10:28 PM
David,

What I like about most, but not all of your posts, is that for some of us who studied
some econometrics as part of studying economics can read these posts and be glad we can still understand this.

Keep it up.

However I have to say I am a tad disappointed there was only one article I could publicise this week!!
ReplyDelete
Replies
AnonymousJune 22, 2013 at 9:06 AM
Dave is this a monster post! Thank you so much.
ReplyDelete
Replies
AnonymousJune 23, 2013 at 1:39 PM
Professor, thank you for the post.

While you gave a recipe, you were very light on the theory side. For example, you did not explain why the joint test does what Pesaran et al claimed. For example, if the joint test rejects, it could be the case that (1) only one of the equalities fails, or (2) all of them fail. It is not clear if one could conclude the same thing from different reasons of rejection. Also, you did not explain why the additional bound test on y(t-1) is even needed and what happens if this test does not reject.
ReplyDelete
Replies
AnonymousJune 25, 2013 at 9:08 AM
hello Professor, thank you for the post.

I have a doubt, I need to estimate determinants of inflation in Cape Verde, and I have 7 variables how can select the appropriate values for the maximum lags for the 7 variables in eviews
ReplyDelete
Replies
T.E.H. NottenJune 25, 2013 at 10:26 PM
Dear professor,

great post! Please allow me a bit of nitpicking. Not that it matters for the outcome of the F-test, but instead of citing the critical values for the 1% level, you cited the ones for the 2.5%. In the Pesaran et al. (2001), p. 300 table, the reported critical values are [6.84-7.84]. Maybe it would be interesting next post to include a bootstrapping exercise for generating critical values for the bounds test in small samples?
ReplyDelete
Replies
AnonymousJune 30, 2013 at 3:43 AM
Dear Professor, thank you for the nice post. I was sincerely wondering whether there is any restriction in ARDL model that stationarity of the dependent variable.
ReplyDelete
Replies
AnonymousJune 30, 2013 at 11:32 PM
Maybe a silly question, but are all these variables in log form? Thanks for this post David, I can't wait for the next instalment.
Tim Watson
ReplyDelete
Replies
Mini ThomasJuly 2, 2013 at 11:52 PM
Thank you so much Professor Dave for this enlightening discussion and illustration using E-views. I had been struggling for the past one month to understand the bound-testing approach with ARDL model, and your post has been really helpful for my ongoing PhD work.
One small query: How do we interpret the short-run relationship between two time-series, using this approach to ARDL model?
ReplyDelete
Replies
AnonymousJuly 5, 2013 at 2:23 PM
Dear Professor Giles,

Thank you for your post! I have been trying to figure out whether I could use Eviews to ARDL model. Is it possible to post your program file for reference. This would be most appreciated. Thank you in advance for help.

Master student.
ReplyDelete
Replies
UnknownAugust 2, 2013 at 7:07 AM
Dear Prof, if I have multiple independent variables, how should I go about conducting step 3, ie. find the optimal lag length for each variable. Thanks.
ReplyDelete
Replies
EconisticsAugust 4, 2013 at 12:12 AM
HI
see the step 7, zt-1 = (a0 - a1x1t-1 - a2x2t-1) , can you please confirm if the yt-1 is missed by typo or it is not there by purpose.
and can you please suggest some literature to study too see the implications if there is a trend in the long run equation.
thanks
ReplyDelete
Replies
AnonymousAugust 6, 2013 at 2:29 PM
Hello Professor, thank you so much for the post. It was helpful for me
Please in the ARDL (Bounds tests) model, I should test the normality of errors, homoscedasticity, and autocorrelation or not.
Engineering student
ReplyDelete
Replies
AnonymousAugust 8, 2013 at 8:39 AM
Thank you very much for the post. I have been struggling with how to perform bound tests for some days but your detailed explanation has granted me understanding. I have one observation though: The statement - "The upper and lower bounds for the F-test statistic at the 10%, 5%, and 1% significance levels are [4.04 , 4.78], [4.94 , 5.73], and [6.84 , 7.84] respectively". I think should read "Lower and Upper bounds". Thank you.
ReplyDelete
Replies
AnonymousAugust 9, 2013 at 9:09 AM
Thank you for the post. Please do you know how to run the bound test on STATA because I do not use EVIEWS in my university. Thank you
ReplyDelete
Replies
UnknownAugust 11, 2013 at 6:43 AM
See Narayan and Smyth (2005). they provide exact critical values for up to 80 observations
ReplyDelete
Replies
Prof. DhillonAugust 18, 2013 at 7:25 AM
hello sir, thanks for this post. in my model lag become zero and if i add trend then value of sbc inceases. so in short run equation dependent variable only in left hand side?
thanks in advance
ReplyDelete
Replies
JJAugust 20, 2013 at 1:58 PM
Prof. Giles, you ended this post with promising "a second and more comprehensive illustrative example", which "will involve more variables in the model." I have looked through all subsequent posts of yours, but haven't found anything more on ARDL Bounds test. Did I accidentally overlook it, or have you not made it available yet? Thanks!
ReplyDelete
Replies
AnonymousAugust 28, 2013 at 9:58 PM
Prof. Giles, thanks for this post. I have some questions, are there no dynamic variables in the long run model? Ex. lagged dependent varible? And when you estimate the error correstion terms from the residuals are they from a model whitout dynamic terms or from the ARDL? And if you got that you should use Xt, Xt-1 and only Xt-1 is significant can I remove Xt? Thanks!
ReplyDelete
Replies
AnonymousSeptember 7, 2013 at 3:57 PM
Dear prof, I remarked that the R squared of the model above in the example is(0.38) less than 60%. Is it good for a ardl model ?
Thank you.
ReplyDelete
Replies
UnknownSeptember 10, 2013 at 7:25 AM
Now I like. I wish you long life dear professor
ReplyDelete
Replies
JJSeptember 12, 2013 at 10:45 AM
Prof. Giles, before we test for Granger Causality (Step2 ), must it be true that the VAR model is appropriate? What if the 2 VAR equations are not significant? Can we still go ahead and test for Granger noncausality?
ReplyDelete
Replies
AnonymousOctober 1, 2013 at 4:34 AM
Dear Professor Giles,

I am wondering if the long-run coefficients for x1 and x2 should be -(θ1 / θ0) and -(θ2 / θ0) respectively in step 8. I have benefited a lot from your blog. Thank you so much.

ReplyDelete
Replies
AnonymousOctober 5, 2013 at 12:27 PM
Please Mr, Is it true that ardl model help to avoid the problem of collinearity?
ReplyDelete
Replies
AnonymousOctober 7, 2013 at 3:09 AM
Hello Professor Dave,
I have some questions. Can we can use ARDL approach for panel data ? If yes, I don't use step 4 because I have tried ARDL for panel, but I don't succeful.
All of p-value of each variables of ARDL (inclus delay variables ) are signficative ?
Thank you
ReplyDelete
Replies
JJOctober 10, 2013 at 6:51 AM
Dear Prof. Giles,

Narayan (2005) has provided critical values for small sample sizes (30 - 80). These are F-statistics, for testing the existence of a long-run relationship. However, he hasn't provided t-statistics for the bounds test. Given that the t-statistic also has non-standard distribution, could you recommend either a source in the literature that provides these statistics (bounds) for small samples, or suggest a way to generate them?

Your posts are eagerly awaited

Thanks,
ReplyDelete
Replies
UnknownOctober 14, 2013 at 11:30 AM
thank you so much Pr Dave Giles.
Can you extend your post through panel cointegrated model using ARDL method
again thanks a lot.
ReplyDelete
Replies
AnonymousOctober 14, 2013 at 4:30 PM
Hi professor Giles,
thank you very much for your very helpful posts. There's a little, confusion in my mind..I don't know if I missed it from your previous posts. If all the variables under consideration are found to be I(0), but presence of cointegration is detected, is it ok to use an ECM? thank you for your help.
ReplyDelete
Replies
UnknownOctober 18, 2013 at 10:12 PM
Dear Professor Giles,

Thank you very much for your post; it helped me a lot with my thesis research. I am testing the relationship between economic growth and different types of energy consumption for 21 OECD countries by using TY technique.
My question is if a bidirectional causality was found, would it mean that two equations would have to be examined separately - in one economic growth would be an endogenous variable, and energy consumption exogenous, and vice versa?
Thank you for your help!
ReplyDelete
Replies
UnknownOctober 22, 2013 at 11:52 AM
Dear Professor Giles,
Thank you for your response. I have sorted out the causality. Since I was using the TY technique, the cointegration didn’t have much impact on determination of the direction of causality because the variables did not have to be integrated of the same order. For the countries where energy consumption and economic growth are integrated of different orders, I am modeling the structural relationship by using ARDL Bounds tests. Although all the series for which I am doing ARDL Bounds Tests have a unidirectional causality, I just wanted to know how the test would work in the case of bidirectional causality.
Thank you again for answering my question.

Best regards,
Daria
ReplyDelete
Replies
AnonymousOctober 27, 2013 at 5:09 AM
Hello Professor,
My question is not related directly to ARDL model, but I am confused , and I need your help. Please in my ARDL model I estimated the impact of exchange rate in trade balance deficit, and I use foreign revenue as one of variables, is not a problem if trade balance and foreign revenue are not in the same curency.
Thank you so much.

Best Regards,
ReplyDelete
Replies
AnonymousNovember 4, 2013 at 10:04 AM
hi professor,
can we use ARDL if endogenity does exist b/w variables?
ReplyDelete
Replies
UnknownNovember 9, 2013 at 9:36 PM
Worthy sir
Really great effort. Thanks for your time that you spend for us.
ReplyDelete
Replies
AnonymousDecember 2, 2013 at 12:30 PM
Dear Professor Dave Giles,

I have just a question. Why in the ARDL model, in the technical estimation in Eview, we must use the difference each series for estimate?
Thank you very much.
ReplyDelete
Replies
UnknownDecember 5, 2013 at 2:19 AM
Thank You very much for your efforts! this is really clear and helpfull theoretically and practically.
One more question please, are the johansen cointegration test compatible with this modeling? Can they give us more information about long-term relationships?
Thank you again Professor.
ReplyDelete
Replies
UnknownDecember 11, 2013 at 12:33 PM
Dear Prof.,

Please forward me the link for ARDL-models Part-I.
ReplyDelete
Replies
AnonymousDecember 16, 2013 at 6:19 AM
Dear prof.,
Thank you so much for the post. However i would like to know if we need to test the significance of the long-run multiplier before jumping to the conclusions i.e step number 7. In other words, how will i know that the long run elasticity estimates are significant? And especially when the long-run regression coefficients extrated from ARDL model are not significant?
ReplyDelete
Replies
AnonymousDecember 19, 2013 at 6:00 AM
Dear prof,
tank for this useful explanations.
I use stata and i haven't ever use eviews, is it possible to perform ARDL test with Stata?
thanks you and I look forward.
ReplyDelete
Replies
AnonymousDecember 27, 2013 at 4:16 AM
Professor Giles,
Thanks for the post, it's very useful. just a brief question. I have a 3-variable case. Bound test suggests that these variables are cointegrated. However one of the two independent variables appears to be insignificant (in the long-run part). Should I drop this variable from the long-run equation and re-do the exercise? and only include this variable in the short-run part?
ReplyDelete
Replies
UnknownDecember 30, 2013 at 9:04 AM
Dear prof,
Thank you so much for your post. This is really helpfull theoretically and practically.
One more question please, I have a model with 4 independent variables: X1, X2, X3, X4. So, how I can estimate lag length with each other. Example, with X1, first using Var process, next estimate d(X1) with C d(Y) Y(-1) X1(-1) X2(-1) X3(-1) X4(-1) or d(x1) with C d(Y(-1)) Y(-1) X1(-1) X2(-1) X3(-1) X4(-1), and last using AIC, BIC to select the lag length for X1. In this post, you only estimate the lag length for Y. So, with independent variables, how can we do?
Thank you and I look forward.
ReplyDelete
Replies
AnonymousJanuary 2, 2014 at 6:28 AM
Dear Prof. D.Giles

I have benefited so much from your blog. I have a following question

In Step 7 - equation (5) yt = α0 + α1x1t + α2x2t + zt - the long-run equilibrium relationship

Before constructing ECM model, we must estimate equation (5) to get z(t-1), but some papers mentioned that to test the long-run equilibrium relationship, equation (5*) was used

Δyt = β0 + Σ βiΔy(t-i) + ΣγjΔx1(t-j) + ΣδkΔx2(t-k) + zt (5*). They will estimate equation (5*) to get z(t-1) to construct ECM model

My question is that comparing between equation (5) and equation (5*), which one is true?

Thank you so much
ReplyDelete
Replies
ArnoldasJanuary 6, 2014 at 4:39 PM
First of all, thank you for this extremely helpful exercise you have done!
I would like to ask, if I may, what if the ARDL bound test seems to confirm the cointegration between two variables (test value is > critical even at 1% lvl), but the when I fit a restricted ECM, the coeficients of lagged residuals series is nor negative, nor siginificant - does this mean that there is no cointegration after all? Or does this mean that there is an error in the model? Is it OK to receive controversial results?

Also, what would you suggest: If a wanted to investigate the effects of one variable to a few other variables, is it best to create multiple ARDL models for 2 variables or should I create multiple ARDL models for all variables included in each of them and only changing the endogenous variable?

Thanks a lot in advance!
ReplyDelete
Replies
Ronald Ravinesh KumarJanuary 7, 2014 at 8:50 PM
Dear Professor Giles,

thank you so much for explaining the ARDL method and all your responses.
Best wishes.
ReplyDelete
Replies
UnknownJanuary 8, 2014 at 1:08 AM
Dear Professor Giles,
thank you so mush for this helpful exercise!
i would like to ask , in step 5, if the var model does not satisfy the stability condition , what should i do ?
thanks in advance.
ReplyDelete
Replies
AnonymousJanuary 22, 2014 at 4:45 AM
Hello everyone,
i like this very helpful exercise.
It's possible, to do the same in R??? i couldn't use EViews....
Step 1 it's possible
Step 2 it's also possible to generate an ARDL-model in R
Step 3: the lag-structure won't be a problem
Step 4: it's ok
Step 5: okay, now it becomes tricky. how to proof the stability of the model in R?? Could i use Breusch-Gofrey-Test??
Step 6: i don't know, if there is something i can use in R......
Does anyone can help??
thanks!
ReplyDelete
Replies
Hanan IshaqueJanuary 22, 2014 at 7:51 AM
Dear Professor Giles
Please be kind enough to explain how could we resolve serial correlation and heterskedasticity problems in ARDL estimation using Microfit 4.01?
Thanks in advance
ReplyDelete
Replies
AnonymousJanuary 25, 2014 at 5:36 AM
Dear Professor Giles,
sorry for that question: how to get the last graphic of your analysis (the graphic with the fitted an actual Euro-serie......) in Eviews???
ReplyDelete
Replies
AnonymousJanuary 30, 2014 at 4:56 AM
Dear professor Giles,

I am trying to estimate an ARDL model, where the dependent variable is I(1) y and some of the explanatory variables are I(1) and some are I(0). Assuming the dependent and the I(1) explanatory variables are cointegrated, is it correct to include the I(0) variables even in the long-run relationship? Or should I(0) variables go only in the ECM?

Best regards
ReplyDelete
Replies
JJFebruary 4, 2014 at 12:05 PM
In step 6, you say "If the t-statistic for yt-1 in equation (4) is less than the "I(1) bound" tabulated by Pesaran et al. (2001; pp.303-304), this would support the conclusion that there is a long-run relationship between the variables. If the t-statistic is greater than the "I(0) bound", we'd conclude that the data are all stationary."

Should it not be the other way around, i.e., when the t-stat is above I(1) bound, the null of no-cointegration is rejected in favour of the alternate hypothesis that the variables are cointegrated, and when the t-stat is below the I(0) bound, there is not enough evidence to reject the null of no-cointegration, and we conclude that the data are all stationary.

Am I mistaken?
ReplyDelete
Replies
AnonymousFebruary 7, 2014 at 7:09 AM
Dear Dave Thanks for this very helpful post in showing how to implement the ARDL bounds method in Eviews. I am wondering whether you can help me with a question. Suppose you wanted to see whether there is bi-directional causality between y and x. So then you run delta(y) as the dependent variable in the ECM (with a particular lag structure, dummies and time trend) and you find that the ECM coefficient is negative and significant and that the F stat is also significant. This suggests that x "pulls" at y. Now suppose you run another ECM with delta(x) as the dependent variable and just suppose that the lag structure is different and perhaps you have different dummies with no time trend (in short a different ECM). But you get a significant ECM coefficient which is negative and a significant F. Does that allow you to conclude that there is bi-directional causality? Thanks so much!
ReplyDelete
Replies
AnonymousFebruary 10, 2014 at 6:12 AM
Dear Prof. Giles,

Thank you for this post. it helped me a lot. but now i'm confused with bounds testing procedure. As long as i understood after selecting optimal lag lenght for the model. For example i have a model with 3 independent variables. Before reading your post i used to found optimal ardl model with mfit then i do wald test with e-views. so if we suppose that i found optimal lag as 3 but ardl model as (2,1,1,1). should i do wald test with 3 lags for all variables, or should i use specified lags as mfit provided?

Thanks in advance,!
ReplyDelete
Replies
UnknownFebruary 12, 2014 at 1:16 PM
Hi Professor.
Thanks for this post. I am working on the relationship between energy consumption, employment and economic growth for Saudi Arabia using the production function framework. I would like to use ARDL Bounds testing to cointegration but the data on labor are only available from 1990 to 2012. It seems that the period is too short, is it possible to use ARDL approach?
ReplyDelete
Replies
AnonymousFebruary 15, 2014 at 10:19 PM
Thanks Professor Giles
i am waiting for Panel data ARDL procedure. any idea when you will post this topic.
best wishes
ReplyDelete
Replies
ArslanMarch 3, 2014 at 10:42 AM
Sir Dave Giles.. will u please guide how to calculate standard errors and t-ratios of long run coefficients???
ReplyDelete
Replies
adeelMarch 7, 2014 at 9:44 AM
sir
can the 2nd order data can also be used for ardl?i mean if you have data sets in first order nd 2nd order both, can still be ardl used?
ReplyDelete
Replies
ทองใหญ่ อัยยะวรากูลMarch 31, 2014 at 5:43 PM
Dear Professor Giles,

Thank you so much for such a wonderful tutorial!

When we perform VAR order selection and the calculation of VAR in EViews, why are all the observations included in the process (it shows included observations = 195 which is equal to the number of observations in the data set)? I tried to replicate your results using Stata and got results that are very close to those from Eviews, but not exactly the same. I notice that the number of observations included in Stata decreases, depending on the number of lags included in the model, which I think is the source of discrepancy in the results.

Thanks so much in advance for your reply and thank you once again for this wonderful blog.

Tongyai
ReplyDelete
Replies
UnknownApril 1, 2014 at 12:41 PM
Dear Professor,

The ARDL approach to co-integration is based on a single equation approach. According to Pesaran, to deal with cases where there may be more than one level relationship involving yt will require the computation of further tables of critical values. Do you know what these critical values be?
Thank you
ReplyDelete
Replies
AnonymousApril 3, 2014 at 9:44 AM
Dear Professor Giles,

Thanks a lot for this amazing post! It helped me a lot.

I am currently trying to estimate an ARDL model with multiple explanatory variables. Thus, I have problems with step 3 because EViews always states "Near singular matrix" whenever I include too many variables/lags. Do you have any advice on how to deal with this?

Thanks a lot for your amazing blog and all the help!

Best

Maximilian
ReplyDelete
Replies
AnonymousApril 6, 2014 at 7:02 AM
it Dependent variable is I(0) then what abot ARDL. it is still valid or it breaks down?
ReplyDelete
Replies
Jacob TapperApril 7, 2014 at 10:22 AM
Dear Prof. Giles,

Firstly, thank you for uploading this post, it has been extremely useful as a guide when conducting my own ARDL bounds test.

I'm curious as to why you have chosen to test for serial correlation in the errors of the unrestricted ECM in step 4 to just 4 lags? Having looked around I haven't found any concrete rule regarding the number of lags to test, but did read elsewhere that 12 lags would be appropriate (without any justification) in this case as monthly data is being used. Is there a specific reason that you have tested up to 4 lags?

Thank you,

Jacob
ReplyDelete
Replies
UnknownApril 7, 2014 at 12:40 PM
Dear Dave,
I have two questions. The first one is that why empirical papers use the ARDL Bounds testing approach in more than one equation?
The second question is: I am applying the ARDL model for studying the role of energy in economic growth by incorporating employment and capital as conventional inputs. How can I cite your post as a reference?
ReplyDelete
Replies
AnonymousApril 21, 2014 at 10:13 AM
Dear Dave,

Are the results of the Bounds Test valid if in the VEC model the coefficient on the lagged dependent variable is statistically insignificant?
ReplyDelete
Replies
AnonymousApril 28, 2014 at 2:50 AM
This is a great post - thank you for writing this it is very useful and interesting.
ReplyDelete
Replies
AnonymousApril 28, 2014 at 10:24 AM
Thanks the post. For the example you give, shouldn't the LR multiplier be the US lag coefficient/ Eur lag coefficient (under steps 7 and 8)?
ReplyDelete
Replies
PhilipApril 30, 2014 at 4:20 AM
Dear Professor Giles,

Thank you very much for your post.
As I am currently working on a study including a ARDL bound testing model with multiple variables I was wondering whether you have already posted your article on this topic on your blog?

Thanks again!
ReplyDelete
Replies
AnonymousMay 5, 2014 at 2:02 PM
Hey professor, thank you for the detailed analysis. I have a question, that can we use the estimates of level OLS as the long run parameter of independent variable ? Or should we have to apply the formula of division for estimating the LR parameter? or can both be used ?
Will really appreciate your answer.
ReplyDelete
Replies
RirinMay 8, 2014 at 12:25 AM
Another question Professor Dave,.

By SBC criterion, the lag of my data should be 4, but can I use a different lag ? such as 3 ?
because when I use 4, my model is not cointegrated.

Thank you prof!
ReplyDelete
Replies
UnknownMay 12, 2014 at 11:09 PM
Sir, there is a small typological mistake. please update it.
See in Step 2:......"From (3), we can see that the lagged residuals series would be zt-1 = (a0 - a1x1t-1 - a2x2t-1), where the a's are the OLS estimates of the α's."
It would be zt-1 = (yt-1 -a0 - a1x1t-1 - a2x2t-1).
ReplyDelete
Replies
UnknownMay 15, 2014 at 4:43 AM
Thanks for your helpful post. In the restricted ECM estimation, is it necessary to worry about the value of R square? Yours is 0.37 and mine is also around that.

Kewani
ReplyDelete
Replies
AnonymousMay 16, 2014 at 3:54 AM
dear professor thanks for your post.... now i am working with ardl model and found out that there is long run relationship among my variable... then i am fitted an error correction term, the sign is negative and significant, however my difference lags variable became not significant as in the bound test... is that problems?
ReplyDelete
Replies
AnonymousMay 21, 2014 at 4:49 AM
Hi Professor Giles,
I have an ARDL where I believe one variable (X) impacts Y over the first historical period and then another variable (Z) impacts Y over the second half of the time-series. I have a baseline ECM and then a second model where I add a dummy to separate the historical period and then appropriate multiplicative terms (short and long-run of X and Z multiplied by year dummy). Is this appropriate in the ECM? The model fits better when including the interaction terms and makes sense if my hypothesis is true.

Thank you in advance for your insights.
ReplyDelete
Replies
AnonymousMay 27, 2014 at 3:36 AM
Thank you so much for this awesome post!
But I have 2 questions:
1. Why is equation yt-1 - a0 - a1*x1t-1 - a2*x2t-1 called restricted and equation θ0*yt-1 + θ1*x1t-1 + θ2*x2t-1 called unrestricted? Where is a restriction in the first equation and where is an "unrestriction" in the second one?Why are they so called? And why is it applied in ARDL?
2. Why at first did you estimate unrestricted ECM and after that restricted ECM. Why do we need to perform the latter if we cannot use restricted ECM for ARDL model?
ReplyDelete
Replies
RichardMay 27, 2014 at 5:56 AM
Prof. Giles,

Great post! Thank you!

I have a question about estimating the long run coefficients in an ARDL model since I find two different approaches in the empirical literature.

In some articles the authors estimate a long run model, after evidence of cointegration, like this:
yt = β0 + Σ βiyt-i + Σγjx1t-j + ΣδkΔx2t-k + et
For example here: http://www.lahoreschoolofeconomics.edu.pk/JOURNAL/LJE%20Vol15-%20No.12010/1%20Waliullah%20EDITED%20AC.pdf

After selecting the optimal number of lags they use the coefficients of this equation as long run estimates.
It is not clear to me, which coefficient of which lag I have to take as long run coefficient? Or do I take the coefficient of the level variable there and are the lags only used as control variables? Could you help me out?
And next, they construct et as being the ECM term in the restricted ARDL ECM to estimate the short run dynamics.
This is, in my eyes, different to your approach of calculating the long run coefficients from the unrestricted error correction model by dividing the coefficients there and taking et just from your yt = α0 + α1x1t + α2x2t + vt model.

In some articles authors use both approaches...

Could you please help me out and explain which approach to use for long run coefficients? And the coefficient of which lag or level in the long run model to use as long run coefficient?
Additionally could you explain to me why we can estimate the long run coefficients in your example from dividing -(θ1/ θ0) and -(θ2/ θ0)?

Thank you very much!
ReplyDelete
Replies
UnknownMay 27, 2014 at 9:54 AM
Hello Sir,

Thanks for this very detailed explanation of ARDL Bounds testing. I have a question to ask, i.e., isn't rejecting the presence of a unit root in the second difference of a given variable the only necessary and sufficient condition for the justification of application of ARDL approach. If this is the case why do we need to check for presence/absence of unit root in level data or after first difference. I understand that if all the variables under consideration are I(0) then they can't be co-integrated but that would itself be reflected in the acceptance of null of no co-integration as per the Wald test for level data variables. So why not to preform unit root test only for the second difference of the variables?
ReplyDelete
Replies
resMay 31, 2014 at 8:28 AM
Sorry i am a bit lost on step 3 ...how did the result of estimated unrestricted ECM come ?
ReplyDelete
Replies
AnonymousMay 31, 2014 at 10:33 AM
Sorry i got it
ReplyDelete
Replies
AnonymousMay 31, 2014 at 2:15 PM
Dear Prof, Thank you for the briefing and i was not sure how you come with the value of Z which fitted in the model in step 7 ...thank you
ReplyDelete
Replies

Add comment

Note: Only a member of this blog may post a comment.

Pages

Wednesday, June 19, 2013

ARDL Models - Part II - Bounds Tests

363 comments: