Economics留学生讲解、辅导R语言、讲解zbwLeibniz、辅导R程序设计 辅导R语言编程|辅导R语言编程
- 首页 >> OS编程 Make Your Publications Visible.
A Service of
zbwLeibniz-Informationszentrum
Wirtschaft
Leibniz Information Centre
for Economics
Clegg, Matthew; Krauss, Christopher; Rende, Jonas
Working Paper
partialCI: An R package for the analysis of partially
cointegrated time series
FAU Discussion Papers in Economics, No. 05/2017
Provided in Cooperation with:
Friedrich-Alexander University Erlangen-Nuremberg, Institute for
Economics
Suggested Citation: Clegg, Matthew; Krauss, Christopher; Rende, Jonas (2017) : partialCI:
An R package for the analysis of partially cointegrated time series, FAU Discussion Papers
in Economics, No. 05/2017, Friedrich-Alexander-Universit?t Erlangen-Nürnberg, Institute for
Economics, Erlangen
This Version is available at:
http://hdl.handle.net/10419/150014
Standard-Nutzungsbedingungen:
Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen
Zwecken und zum Privatgebrauch gespeichert und kopiert werden.
Sie dürfen die Dokumente nicht für ffentliche oder kommerzielle
Zwecke vervielf ltigen, ffentlich ausstellen, ffentlich zug nglich
machen, vertreiben oder anderweitig nutzen.
Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen
(insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten,
gelten abweichend von diesen Nutzungsbedingungen die in der dort
genannten Lizenz gew?hrten Nutzungsrechte.
Terms of use:
Documents in EconStor may be saved and copied for your
personal and scholarly purposes.
You are not to copy documents for public or commercial
purposes, to exhibit the documents publicly, to make them
publicly available on the internet, or to distribute or otherwise
use the documents in public.
If the documents have been made available under an Open
Content Licence (especially Creative Commons Licences), you
may exercise further usage rights as specified in the indicated
licence.
www.econstor.eu_____________________________________________________________________
Friedrich-Alexander-Universitit Erlangen-Nürnberg
Institute for Economics
https://www.iwf.rw.fau.de/research/iwf-discussion-paper-series/
No. 05/2017
partialCI: An R package for the analysis of partially
cointegrated time series
Matthew Clegg
Independent
Christopher Krauss
University of Erlangen-Nürnberg
Jonas Rende
University of Erlangen-Nürnberg
ISSN 1867-6707
Discussion Papers
in EconomicspartialCI: An R package for the analysis of partially cointegrated
time series
Matthew Clegga,1,, Christopher Kraussb,1,, Jonas Rendec,1,
a
Independent
bUniversity of Erlangen-N¨urnberg, Department of Statistics and Econometrics, Lange Gasse 20, 90403
N¨urnberg, Germany
cUniversity of Erlangen-N¨urnberg, Department of Statistics and Econometrics, Lange Gasse 20, 90403
N¨urnberg, Germany
Friday 10th February, 2017
Abstract
Partial cointegration is a weakening of cointegration, allowing for the residual series to contain
a mean-reverting and a random walk component. Analytically, the residual series is
described by a partially autoregressive process. The partialCI package provides estimation,
testing, and simulation routines for PCI models in state space. We illustrate the functionality
with two examples: A financial application in the context of pairs trading and a macroeconomic
application, i.e., the relationship between GDP and consumption. For both examples,
we show that the variables are not cointegated in the classic sense, but can be modeled with
partial cointegration.
Keywords: R software, cointegration, partial cointegration, pairs trading, permanent
components, transient components.
Email addresses: matthewcleggphd@gmail.com (Matthew Clegg), christopher.krauss@fau.de
(Christopher Krauss), jonas.rende@fau.de (Jonas Rende)
1The authors have benefited from many helpful discussions with Ingo Klein.1. Introduction
The partialCI package (Clegg, 2016) fits a partial cointegration model2
to describe a
time series. Partial cointegration (PCI) is a weakening of cointegration, allowing for the
residual series to contain a mean-reverting and a random walk component. Analytically,
this residual series is described by a partially autoregressive process (PAR – see Summers
(1986), Poterba and Summers (1988), and Clegg (2015a))3
, consisting of a stationary ARprocess
and a random walk. Related is the short-term / long-term model introduced by
Schwartz and Smith (2000), which models a security price as the sum of a Brownian motion
and an Ornstein-Uhlenbeck process. Whereas classic cointegration in the sense of Engle and
Granger (1987) requires all shocks to be transient, PCI is more flexible and allows for permanent
shocks as well – a realistic assumption across many (macro)economic applications.
Even though neither the residual series, nor its mean-reverting and permanent component
are directly observable, estimation is still possible in state space – see Brockwell and Davis
(2010) and Durbin and Koopman (2012). The partialCI package encloses suitable estimation,
testing, and simulation routines for such PCI models.
Partial cointegration enhances several existing cointegration concepts in the literature –
namely classic cointegration, fractional cointegration and threshold cointegration.
In their seminal paper, Engle and Granger (1987) introduce the concept of classic cointegration.
Loosely speaking, if a collection of time series is cointegrated, they share a long-run
equilibrium. Shocks to the cointegration process are not persistent, i.e., the process adjusts
exponentially towards the long-run equilibrium value after exhibiting a shock (Pfaff, 2008).
Thus, if the cointegration process is subject to permanent shocks, the partial cointegration
model may be more appropriate. Test procedures for classic cointegration are implemented
in the R packages urca (Pfaff, 2008) and egcm (Clegg, 2015c).
2Please note that we use the term partial cointegration according to Clegg and Krauss (2016).
3Partially autoregressive processes are already implemented in the corresponding R package partialAR
(Clegg, 2015b).
2In a fractional cointegration model the residual series is assumed to follow a fractionally
integrated process. Such a process incorporates weighted higher-order lags to model longterm
effects (Baillie, 1996). In terms of shock persistence, fractionally integrated processes
are between classic cointegrating processes (short-run persistence) and random walks (infinite
persistence). The ability to account for long-term persistence makes fractionally integrated
processes especially useful to analyze long-memory time series data (Baillie, 1996). The
benefit of PCI compared to fractional cointegration is twofold: First, with PCI it is possible
to disentangle the transient and permanent component, allowing to separately investigate
the dynamics associated with the transient component (Clegg and Krauss, 2016). Second,
within a PCI framework the proportion of variance attributable to mean-reversion (PVMR)
can be computed (Clegg and Krauss, 2016). The PVMR allows to assess the degree of noise
in the time series.
In their seminal paper, Balke and Fomby (1997) introduce the concept of threshold cointegration.
In the cointegration models introduced so far, every shock, independent of its
magnitude, induces an instant adjustment process towards the long-run equilibrium value.
Balke and Fomby (1997) flexiblize this assumption of linear adjustment. The process is assumed
to solely consist of a permanent component, if it does not exceed a certain threshold
level. By contrast, if the time series exceeds the threshold level the process is modeled as a
classic cointegration process and adjustment towards the corresponding long-run equilibrium
occurs as long as the process exceeds the threshold value in absolute terms. The advantage
of the partial cointegration model is the ability to model the impact of permanent shocks
globally and not just locally as in a thresold cointegration model. Threshold cointegration
models are implemented in the R package tsDyn (Stigler, 2010).
Potential fields of application for the PCI model in a financial context are: term structures,
stock indices and tracking portfolios, stock pairs, spot and future prices, commodities,
spread options, international stock indices, as well as foreign exchange (Alexander, 2011). In
addition, the PCI framework could be used to revisit macroeconomic theories, e.g., monetary
policy, fiscal policy or business cycle models. An initial show case for PCI can be found in
3Clegg and Krauss (2016). They apply the partialCI package to detect partially cointegrated
pairs of stocks on the S&P 500 from January 1990 to October 2015. The authors extract
the mean-reverting component of the price spread time series of the partially cointegrated
pairs of stocks as baseline for a relative-value arbitrage strategy.
The remainder of this paper is organized as follows. In section 2, we outline the methodological
details of the PCI model. In section 3, we explain how to use the key functions of the
partialCI package. In section 4, we provide a finance as well as a macroeconomic example.
Finally, section 5 provides concluding thoughts.
2. The partial cointegration framework
2.1. Model definition
Based on Engle and Granger (1987), Clegg and Krauss (2016, p. 4) define the concept
of partial cointegration as follows:
Definition: ”The components of the vector Xt are said to be partially cointegrated of
order d, b, denoted Xt ~ P CI (d, b), if (i) all components of Xt are I (d)
4
; (ii) there exists
a vector α so that Zt = α0Xt and Zt can be decomposed as a sum Zt = Rt + Mt
, where
Rt ~ I (d) and Mt ~ I (d ? b).”
While Clegg and Krauss (2016) focus on the special case of two partially cointegrated time
series, we extend the model to the case of (k + 1) partially cointegrated time series. Let Yt denote
the target time series and Xj,t the j
th factor time series at time t, where j = {1, 2, . . . , k}.
The target time series and the k factor time series are partially cointegrated, if a parameter
vector ι = {β1, β2, . . . , βk, ρ, σM, σR, M0, MR} exists such that the subsequent model
4
If a time series exhibits d unit roots, it is said to be integrated of order d (I (d)) (L¨utkepohl, 2007, p.
238-242).
4equations are satisfied (Clegg and Krauss, 2016)5:
Yt = β1X1,t + β2X2,t + ... + βkXk,t + Wt
Wt = Mt + Rt
Mt = ρMt1 + εM,t
Rt = Rt1 + εR,tεM,t ~ N0, σ2MεR,t ~ N0, σ2Rβj ∈ R; ρ ∈ (?1, 1) ; σ2
M, σ2R ∈ R+0.
(1)
Thereby, Wt denotes the partially autoregressive process, Rt the permanent component,
Mt the transient component and β = {β1, β2, . . . , βk} is the partially cointegrating vector.6
The permanent component is modeled as a random walk and the transient component as
an AR(1)-process with AR(1)-coefficient ρ. The corresponding error terms εM,t and εR,t
are assumed to follow mutually independent, normally distributed white noise processes
with mean zero and variances σ2M and σ2
R. For the sake of simplicity, we set M0 = 0 and
R0 = Y0 β1X1,0 β2X2,0 ... βkXk,0. A key advantage of modeling the cointegrating
process as a partially autoregressive process is that we are able to calculate the PVMR,
defined as (Clegg and Krauss, 2016),
R2MR = AR [(1 B) Mt]V AR [(1 B) Wt]=2σ2M2σ2M + (1 + ρ) σ2R, R2
MR ∈ [0, 1] , (2)
where B denotes the backshift operator. The statistic R2
MR is useful to assess how close the
cointegration process is to either a pure random walk (R2
MR = 0) or a pure AR(1)-process
(R2MR = 1).
2.2. State space representation
The applied state space transformation is in line with Clegg and Krauss (2016). Given
that the PAR process Wt
is not observable, we convert the PCI model into the following
5
It is possible to include an intercept within the partialCI package.
6Note that in the implemented estimation routine the estimated partially cointegrating vector is a linear
combination of all existing partially cointegrating vectors in the sense of Verbeek (2010, p. 324).
5state space model, consisting of an observation (3) and a state equation (4):
Xt = HZt (3)Zt = F Zt1 + Wt. (4)
Thereby, Zt (4) denotes the state which is assumed to be influenced linearly by the state in
the last period and a noise term Wt
. The matrix F is assumed to be time invariant. The
observable part is denoted by Xt (3). By assumption, there is a linear dependence between
Xt and Zt
, captured in the time invariant matrix H.
The PCI framework presented in equation (1) consists of the observable target as well as
factor time series and the two hidden state variables Mt and Rt
. Following the approach
of Clegg and Krauss (2016), the k factor variables are declared as additional hidden state
variables. As a consequence X1,t, X2,t, ..., Xk,t are part of both, the observation and the state
equation. Applying the state space transformation yields the following observation equation:, (6)
with εXj,t denoting the innovation of process Xj,t. By assumption, εXj,t is normally distributed
with zero mean and variance σ
and is independent of εM,t and εR,t.
62.3. Estimation of a partial cointegration model
Parameters are estimated via the maximum likelihood (ML) method. Using a quasiNewton
algorithm, the ML method searches for the parameters ρ, σand the parameter
vector β which maximizes the likelihood function of the associated Kalman filter.7 The
following likelihood score is maximized (Clegg and Krauss, 2016):, (7)
where φ (·) denotes the probability density function of the normal distribution. Clegg and
Krauss (2016) provide (i) a derivation of the likelihood function (7), (ii) a proof that the
partial cointegration model is identifiable, and (iii) a comprehensive discussion about the
consistency of the ML estimation routine.8
2.4. A likelihood ratio test routine for partial cointegration
The likelihood ratio test (LRT) implemented in the partialCI package adopts the LRT
routine for PAR models proposed by Clegg (2015a). In a PCI scenario the null hypothesis
consists of two conditions – namely the hypothesis that the residual series is a pure random
walk (HR) or a pure AR(1)-process (HM0). The two conditions are separately tested. Only
if both, HR0 and HM 0
are individually rejected, the null hypothesis of no partial cointegration
is rejected. On the first stage the LRT for partial cointegration tests the null hypothesis
of a pure random walk versus the alternative hypothesis of a pure AR(1)-process or PCI. To construct the first stage of the LRT for partial cointegration it is necessary
to estimate the likelihood scores of an unrestricted and a restricted model. The likelihood
score of the unrestricted model, i.e., the largest likelihood score found by the Kalman filter
optimization routine, is denoted by(8)
7The complete algorithm as well as the determination of the starting values are available in the R package
partialCI.
8The partialCI package also provides a two-step estimation method, which often produces results that
are inferior to the joint-penalty method, and so the joint-penalty method is to be preferred.
7The restricted model is obtained by setting ρ and σM to zero which is in line with the null
hypothesis of a pure random walk. The restricted model is given by. (9)
The test statistic for the pure radom walk hypothesis is given as
ΛR = log. (10)
Let CR (α) (CM (α)) denote the critical value associated with ΛR (ΛM) dependent on the
significance level α. If HR0
cannot be rejected, i.e., ΛR < CR (α), the tested time series is
classified as a pure random walk. On the other hand, if the test rejects HR0, the routine
continues, testing the conditional null hypothesis HM0|ΛR < CR (α) against HPCI1. Settingσ2R = 0 yields the likelihood score of the restriced model:L
M = maxβ,ρ,σ2MLMRβ, ρ, σ2M, σ2R = 0. (11)
The test statistic for the second stage is given as. (12)
If the conditional null hypothesis HM
0
|ΛR < CR (α) cannot be rejected, i.e., ΛM < CM (α),
the tested time series follows a pure AR(1)-process. Vice versa, if ΛM > CM (α) holds, the
time series is classified as partially cointegrated. Note that the critical values for both test
statistics ΛR as well as ΛM need to be simulated because the test statistics do not follow a
standard distribution. They are embedded in the package partialCI.
3. Using the PCI package
In this section, we outline the four key functions of the partialCI package in detail –
namely fit.pci(), test.pci(), statehistory.pci(), and hedge.pci().
3.1. fit.pci()
The function fit.pci() fits a partial cointegration model to a given collection of time
series.
8fit.pci(Y, X, pci opt method = c("jp", "twostep"), par model = c("par",
"ar1", "rw"), lambda = 0, robust = FALSE, nu = 5, include alpha=FALSE)
Y : Denotes the target time series and X is a matrix containing the k factors used to
model Y .9 pci opt method: Specifies, whether the joint-penalty method ("jp") or the twostep
("twostep") method is applied to obtain the model with the best fit. If pci opt method
is specified as "twostep", a two-step procedure similar to the method introduced by
Engle and Granger (1987) is performed. The residuals of the first stage regression are
extracted and a prespecified model is fitted to the residual series. Which model is fitted
to the residual series, depends on the specification for the argument par model. In case
of "par", a partial autoregressive model is used, in case of "ar1", an AR(1)-process
and in case of "rw" a random walk (default: par model = "par"). On the other
hand, if the pci opt method is specified as "jp", the joint-penalty method is applied,
to estimate β, ρ, σ2M and σ2
R jointly via ML. The likelihood score of the associated
Kalman filter is extended by a penalty value λσ2
R, where λ ∈ R+0. Larger values for λ
favor solutions with a larger transient component and vice versa (default: lambda =0). To reach a higher chance of finding the global minimum, the procedure uses several
different starting points. One of these starting points are the parameter estimates of
an ex-ante two-step procedure, ensuring that the likelihood score obtained under "jp"is at least as good as under "twostep" (default: pci opt method = "jp").
robust: Determines whether the residuals are assumed to be normally (FALSE) or tdistributed
(TRUE) (default: robust = TRUE). If robust is set to TRUE the degrees of
freedom can be specified, using the argument nu (default: nu = 5). If pci opt method
matches "twostep", a robust linear model (rlm()) included in the R package MASS
(Ripley and Venables, 2002) is applied, i.e., a Huber (1981) M-estimator is calculated.10
include alpha: If TRUE, an intercept α is added to the PCI relationship (default:
9Both, X and Y are plain or zoo (Grothendieck and Zeileis, 2005) objects. If k = 1, X is a vector.
10For a discussion about robust parameter estimation in a PAR context, see Clegg (2015a).
9include alpha = FALSE).
key return values: The proportion of variance attributable to mean-reversion ($pvmr),
the partially cointegrating vector ($beta), the AR(1)-coefficient ($rho) and the negative
log likelihood ($negloglik).
3.2. test.pci()
The test.pci() function tests the goodness of fit of a PCI model.
test.pci(Y, X, alpha = 0.05, null hyp = c("rw", "ar1"), robust = FALSE,
pci opt method = c("jp", "twostep"))
alpha: Determines at which significance level the null hypothesis is rejected (default:
alpha = 0.05).
null hyp: Specifies whether the null hypothesis is a random walk ("rw"), an AR(1)-process ("ar1") or a union of both hypotheses (c("rw", "ar1")) (default: null hyp= c("rw", "ar1")).
key return values: The test statistic ($statistic) and p-values ($p.value) for the
selected null hypothesis.
3.3. statehistory.pci()
To estimate the sequence of hidden states the statehistory.pci() function can be applied.
statehistory.pci(A, data = A$data, basis = A$basis)
A: Denotes a fit.pci() object.
data: Is a matrix consisting of the target time series and the k factor time series
(default: data = A$data).
basis: Captures the coefficients of the factor time series (default: basis = A$basis). key return values: The two estimated hidden states Mt ($M) and Rt ($R).
103.4. hedge.pci()
The function hedge.pci() finds those k factors from a predefined set of factors which yield
the best fit to the target time series.
hedge.pci(Y, X, maxfact = 10, lambda = 0, use.multicore = TRUE,
minimum.stepsize = 0, verbose = TRUE, exclude.cols = c(), search type =
c("lasso", "full", "limited"), pci opt method=c("jp", "twostep"))
maxfact: Denotes the maximum number of considered factors (default: maxfact =
10). use.multicore: If TRUE, parallel processing is activated (default: use.multicore =TRUE).
verbose: Controls whether detailed information are printed (default: verbose =
TRUE).
exclude.cols: Defines a set of factors which should be excluded from the search
routine (default: exclude.cols = c()).
search type: Determines the search algorithm applied to find the model that fits best
to the target time series. The likelihood ratio score (LRT score) is used to compare
the model fits, whereby lower scores are associated with better fits. If the option
"lasso" is specified the lasso algorithm as implemented in the R package glmnet
(Friedman et al., 2010) is deployed to search for the portfolio of factors that yields the
best linear fit to the target time series. If the option "full" is specified, then at each
step, all possible additions to the portfolio are considered and the one which yields the
highest likelihood score improvement is chosen. If the option "limited" is specified,
then at each step, the correlation of the residuals of the current portfolio is computed
with respect to each of the candidate series in the input set X, and the top B series
are chosen for further consideration. Among these top B candidates, the one which
improves the likelihood score by the greatest amount is chosen. The parameter B can
be controled via maxfact (default: search type = "lasso").
11 key return values: The best fit ($pci), the column indices ($indexes), and the names
of the factors included in the best fit ($index names).
4. Examples
4.1. Finance
As an introductory example, we explore the relationship between Royal Dutch Shell plc
A (RDS-A) and Royal Dutch Shell plc B (RDS-B), using daily (closing) price data from 1
January 2006 to 1 December 2016.11 To download the price data we use the getYahooData()
function, implemented in the R package TTR (Ulrich, 2016). The subsequent R code is
used to obtain the data.
library(partialCI)
library(TTR)
RDSA<-getYahooData("RDS-A", 20060101, 20161201)$Close
RDSB<-getYahooData("RDS-B", 20060101, 20161201)$Close
A classic cointegration analysis yields that the two time series are not cointegrated. In particular,
we apply the two-step approach of Engle and Granger (1987) implemented in the R
package egcm. By default, the egcm package uses the unit root test of Phillips and Perron
(1988)
12 (specification: with constant, no linear time trend) to investigate the residuals obtained
from an Ordinary Least Squares (OLS) regression. The R code,
library(egcm)
egcm_finance <- egcm(RDSA,RDSB,include.const = FALSE),
results in the following output:
Y[i] = 0.9732 X[i] + 0.0000 + R[i], R[i] = 0.9941 R[i-1] + eps[i],
(0.0005) (0.0000) (0.0025)
11RDS-A (Royal Dutch Shell plc - A, 2016) and RDS-B (Royal Dutch Shell plc - B, 2016) data are
downloaded from Yahoo Finance.
12The test of Phillips and Perron (1988) corrects for heteroscedasticity, a well-known stylized fact of
financial price time series (Krauss and Herrmann, 2017).
12eps ~ N(0, 0.1679^2)
R[2016-12-01] = -1.8991 (t = -1.477)
WARNING: X and Y do not appear to be cointegrated.
The residual plot in figure 1 (code: plot(egcm finance$residuals,type = "l") suggests
that the residual series is not purely mean-reverting, but rather shows a stochastical trend
as well as a mean-reverting behavior. Hence, it is not suprising that RDS-A and RDS-B are
Figure 1: Residual plot classic cointegration: RDS-A and RDS-B (1.01.2006 - 1.12.2016, daily)
not cointegrated. Using the PCI framework, we are able to fit a PCI model to RDS-A and
RDS-B with the following R code:
PCI RDSA RDSB<-fit.pci(RDSA, RDSB, pci opt method = c("jp"), par model
=c("par"), lambda = 0, robust = FALSE, nu = 5, include alpha = FALSE)).
The R output is given as,
Fitted values for PCI model
Y[t] = X[t] %*% beta + M[t] + R[t]
M[t] = rho * M[t-1] + eps_M [t], eps_M[t] ~ N(0, sigma_M^2)
R[t] = R[t-1] + eps_R [t], eps_R[t] ~ N(0, sigma_R^2)
13Estimate Std. Err
beta_Close 0.9274 0.0038
rho 0.3959 0.0965
sigma_M 0.1081 0.0083
sigma_R 0.1195 0.0076
-LL = -1117.29, R^2[MR] = 0.540,
where beta Close denotes the partially cointegrating coefficient. Thereby, the coefficient of
0.9274 indicates a positive relationship between RDS-A and RDS-B, and the PVMR of 0.54
suggests that the spread time series also exhibits a clear mean-reverting behavior.
In the subsequent step, we utilize the test.pci() function to check whether RDS-A and RDS-B
are partially cointegrated. The R code
test.pci(RDSA, RDSB, alpha = 0.05, null hyp = c("rw", "ar1"), robust =
FALSE, pci opt method = c("jp")),
leads to the following output:
Likelihood ratio test of [Random Walk or CI(1)] vs Almost PCI(1)
(joint penalty method)
data: StockA
Hypothesis Statistic p-value
Random Walk -55.09 0.010
AR(1) -52.88 0.010
Combined 0.010.
Recall that a time series is classified as partially cointegrated, if and only if the random walk
as well as the AR(1)-hypotheses are rejected. The p-value of 0.010 for the combined null
hypothesis indicates that RDS-A and RDS-B are partially cointegrated in the considered
period of time.
Next, we demonstrate the use of the statehistory.pci() function which allows to estimate and
extract the hidden states. The R code,
statehistory.pci(PCI RDSA RDSB), results in the R output:
14Y Yhat Z M R eps_M eps_R
2006-01-03 35.87002 35.26781 0.6022031 0.00000000 0.6022031 0.00000000 0.00000000
2006-01-04 36.23993 35.57175 0.6681755 0.02030490 0.6478706 0.02030490 0.04566752
2006-01-05 35.80276 35.24161 0.5611509 -0.02112621 0.5822771 -0.02916450 -0.06559352
2006-01-06 36.48653 35.83377 0.6527591 0.01590352 0.6368556 0.02426695 0.05457850
...
2016-11-25 50.18000 49.52231 0.6576906 -0.08762384 0.7453144 -0.07643882 -0.17191764
2016-11-28 49.20000 48.22397 0.9760311 0.04699758 0.9290335 0.08168603 0.18371909
2016-11-29 49.06000 48.02922 1.0307808 0.04419468 0.9865862 0.02558931 0.05755262
2016-11-30 51.10000 50.23639 0.8636066 -0.02573955 0.8893462 -0.04323530 -0.09724000
2016-12-01 51.78000 51.15450 0.6254956 -0.08826115 0.7137567 -0.07807140 -0.17558945.
The latter table covers the estimates of the hidden states M and R as well as the corresponding
error terms eps M and eps R. Z is equal to the sum of M and R. The estimate
of the target time series is denoted by Yhat. Figure 2 illustrates a plot of the extracted
mean-reverting component of the spread associated with the RDS-A and RDS-B price time
series (plot(statehistory.pci(PCI RDSA RDSB)[,4]
,type = "l",ylab = "", xlab = "")). The horizontal blue lines are equal to two times
Figure 2: Mean-reverting component RDS-A and RDS-B (1.01.2006 - 1.12.2016, daily)
the historical standard deviation in absolute terms of the mean-reverting component. A pairs
trading strategy could exploit the mean-reverting behavior of Mt
. Note that this example is
in-sample; for a true out-of-sample application see Clegg and Krauss (2016).
15We continue with using hedge.pci() to find the set of sector ETFs forming the best hedging
portfolio for the SPY index (S&P500 index). Thereby, the R code,
sectorETFS <- c("XLB", "XLE", "XLF", "XLI", "XLK", "XLP", "XLU", "XLV", "XLY")
prices <- multigetYahooPrices(c("SPY", sectorETFS), start=20060101)
hedge.pci(prices[,"SPY"], prices),
results in the subsequent output:
-LL LR[rw] p[rw] p[mr] rho R^2[MR] Factor | Factor coefficients
2320.00 -23.3743 0.0100 0.0100 0.5759 0.4526 XLI | 3.1106
1765.50 -46.5925 0.0100 0.0100 0.3170 0.4713 XLY | 1.8951 1.1989
1494.95 -53.7256 0.0100 0.0100 0.3244 0.5038 XLV | 1.6999 0.9106 0.6619
972.58 -65.9058 0.0100 0.0100 0.4060 0.5904 XLK | 1.3089 0.4933 0.5320 1.5182.
The table summarizes information about the best hedging portfolio, where each row corresponds
to an increasing number of factors. Row 1: The best single-factor hedging portfolio
comprises XLI (industrials) as only factor. Row 2: The best two-factor hedging portfolio
consists of XLI and XLY (consumer discretionary). As such, XLY leads to the best improvement
of the LRT score among all remaining factors. Row 3 includes XLV (health care) for
the three-factor portfolio and row 4 XLK (technology) for the best four-factor portfolio. The
last row corresponds to the overall best fit out of the nine potential sector ETFs, based on
the LRT score. Note that for all rows, the union of random walk and AR(1)-null hypothesis
is rejected at the 5 percent significant level, so we find a PCI model at each step.
4.2. Macroeconomics
As a second example, we revisit the relationship between GDP and personal consumption
expenditures for the United States (among others see Cochrane (1994), Gonzalo et al. (2008)
and Guisan (2008)), using quarterly seasonally adjusted annual rates in billion US-Dollar
from January 1976 to July 2016.13 The following R code triggers the data download:
13We utilize the R package Quandl (Daroczi et al., 2016) to download the GDP (US. Bureau of Economic
Analysis, 2016a) as well as personal consumption expenditures data (US. Bureau of Economic Analysis,
2016b). Thereby, the time series data are directly converted into xts (Ryan and Ulrich, 2014) objects.
16library(xts)
library(Quandl)
library(partialCI)
GDP = Quandl("FRED/GDP", start_date = "1976-01-01",
end_date = "2016-04-01", type = "xts")
Consumption = Quandl("FRED/PCEC", start_date = "1976-01-01",
end_date = "2016-04-01",type = "xts").
Applying the unit root test of Phillips and Perron (1988) as implemented in the R package
egcm yields that GDP and personal consumption are not cointegrated in the classic sense,
within the considered time frame.14 The residual plot in figure 3 (code: plot(egcm macro$
residuals,type = "l") obtained from standard cointegration analysis shows that the residuals
exhibit both, mean-reverting and stochastic trending behavior.15 To account for the
Figure 3: Residual plot classic cointegration: GDP and consumption (1976-2016, quarters)
stochastic trending behavior we apply the following PCI model:
14The R code is given as egcm macro <- egcm(Consumption,GDP,include.const = FALSE). For the sake
of brevity, we do not show the R output.
15We are aware of the structural break in the residual series around the second quarter of the year 2000.
The function breakpoints() implemented in the R package strucchange (Hornik et al., 2003) is used to
obtain the estimate of the structural break.
17PCI GDP Consumption<-fit.pci(GDP, Consumption, pci opt method = c("jp"),
par model =c("par"), lambda = 0, robust = FALSE, nu = 5, include alpha =
FALSE)).
The latter function yields the following R output:
Fitted values for PCI model
Y[t] = X[t] %*% beta + M[t] + R[t]
M[t] = rho * M[t-1] + eps_M [t], eps_M[t] ~ N(0, sigma_M^2)
R[t] = R[t-1] + eps_R [t], eps_R[t] ~ N(0, sigma_R^2)
Estimate Std. Err
beta_ 1.3963 0.0358
rho 0.2812 0.3357
sigma
A Service of
zbwLeibniz-Informationszentrum
Wirtschaft
Leibniz Information Centre
for Economics
Clegg, Matthew; Krauss, Christopher; Rende, Jonas
Working Paper
partialCI: An R package for the analysis of partially
cointegrated time series
FAU Discussion Papers in Economics, No. 05/2017
Provided in Cooperation with:
Friedrich-Alexander University Erlangen-Nuremberg, Institute for
Economics
Suggested Citation: Clegg, Matthew; Krauss, Christopher; Rende, Jonas (2017) : partialCI:
An R package for the analysis of partially cointegrated time series, FAU Discussion Papers
in Economics, No. 05/2017, Friedrich-Alexander-Universit?t Erlangen-Nürnberg, Institute for
Economics, Erlangen
This Version is available at:
http://hdl.handle.net/10419/150014
Standard-Nutzungsbedingungen:
Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen
Zwecken und zum Privatgebrauch gespeichert und kopiert werden.
Sie dürfen die Dokumente nicht für ffentliche oder kommerzielle
Zwecke vervielf ltigen, ffentlich ausstellen, ffentlich zug nglich
machen, vertreiben oder anderweitig nutzen.
Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen
(insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten,
gelten abweichend von diesen Nutzungsbedingungen die in der dort
genannten Lizenz gew?hrten Nutzungsrechte.
Terms of use:
Documents in EconStor may be saved and copied for your
personal and scholarly purposes.
You are not to copy documents for public or commercial
purposes, to exhibit the documents publicly, to make them
publicly available on the internet, or to distribute or otherwise
use the documents in public.
If the documents have been made available under an Open
Content Licence (especially Creative Commons Licences), you
may exercise further usage rights as specified in the indicated
licence.
www.econstor.eu_____________________________________________________________________
Friedrich-Alexander-Universitit Erlangen-Nürnberg
Institute for Economics
https://www.iwf.rw.fau.de/research/iwf-discussion-paper-series/
No. 05/2017
partialCI: An R package for the analysis of partially
cointegrated time series
Matthew Clegg
Independent
Christopher Krauss
University of Erlangen-Nürnberg
Jonas Rende
University of Erlangen-Nürnberg
ISSN 1867-6707
Discussion Papers
in EconomicspartialCI: An R package for the analysis of partially cointegrated
time series
Matthew Clegga,1,, Christopher Kraussb,1,, Jonas Rendec,1,
a
Independent
bUniversity of Erlangen-N¨urnberg, Department of Statistics and Econometrics, Lange Gasse 20, 90403
N¨urnberg, Germany
cUniversity of Erlangen-N¨urnberg, Department of Statistics and Econometrics, Lange Gasse 20, 90403
N¨urnberg, Germany
Friday 10th February, 2017
Abstract
Partial cointegration is a weakening of cointegration, allowing for the residual series to contain
a mean-reverting and a random walk component. Analytically, the residual series is
described by a partially autoregressive process. The partialCI package provides estimation,
testing, and simulation routines for PCI models in state space. We illustrate the functionality
with two examples: A financial application in the context of pairs trading and a macroeconomic
application, i.e., the relationship between GDP and consumption. For both examples,
we show that the variables are not cointegated in the classic sense, but can be modeled with
partial cointegration.
Keywords: R software, cointegration, partial cointegration, pairs trading, permanent
components, transient components.
Email addresses: matthewcleggphd@gmail.com (Matthew Clegg), christopher.krauss@fau.de
(Christopher Krauss), jonas.rende@fau.de (Jonas Rende)
1The authors have benefited from many helpful discussions with Ingo Klein.1. Introduction
The partialCI package (Clegg, 2016) fits a partial cointegration model2
to describe a
time series. Partial cointegration (PCI) is a weakening of cointegration, allowing for the
residual series to contain a mean-reverting and a random walk component. Analytically,
this residual series is described by a partially autoregressive process (PAR – see Summers
(1986), Poterba and Summers (1988), and Clegg (2015a))3
, consisting of a stationary ARprocess
and a random walk. Related is the short-term / long-term model introduced by
Schwartz and Smith (2000), which models a security price as the sum of a Brownian motion
and an Ornstein-Uhlenbeck process. Whereas classic cointegration in the sense of Engle and
Granger (1987) requires all shocks to be transient, PCI is more flexible and allows for permanent
shocks as well – a realistic assumption across many (macro)economic applications.
Even though neither the residual series, nor its mean-reverting and permanent component
are directly observable, estimation is still possible in state space – see Brockwell and Davis
(2010) and Durbin and Koopman (2012). The partialCI package encloses suitable estimation,
testing, and simulation routines for such PCI models.
Partial cointegration enhances several existing cointegration concepts in the literature –
namely classic cointegration, fractional cointegration and threshold cointegration.
In their seminal paper, Engle and Granger (1987) introduce the concept of classic cointegration.
Loosely speaking, if a collection of time series is cointegrated, they share a long-run
equilibrium. Shocks to the cointegration process are not persistent, i.e., the process adjusts
exponentially towards the long-run equilibrium value after exhibiting a shock (Pfaff, 2008).
Thus, if the cointegration process is subject to permanent shocks, the partial cointegration
model may be more appropriate. Test procedures for classic cointegration are implemented
in the R packages urca (Pfaff, 2008) and egcm (Clegg, 2015c).
2Please note that we use the term partial cointegration according to Clegg and Krauss (2016).
3Partially autoregressive processes are already implemented in the corresponding R package partialAR
(Clegg, 2015b).
2In a fractional cointegration model the residual series is assumed to follow a fractionally
integrated process. Such a process incorporates weighted higher-order lags to model longterm
effects (Baillie, 1996). In terms of shock persistence, fractionally integrated processes
are between classic cointegrating processes (short-run persistence) and random walks (infinite
persistence). The ability to account for long-term persistence makes fractionally integrated
processes especially useful to analyze long-memory time series data (Baillie, 1996). The
benefit of PCI compared to fractional cointegration is twofold: First, with PCI it is possible
to disentangle the transient and permanent component, allowing to separately investigate
the dynamics associated with the transient component (Clegg and Krauss, 2016). Second,
within a PCI framework the proportion of variance attributable to mean-reversion (PVMR)
can be computed (Clegg and Krauss, 2016). The PVMR allows to assess the degree of noise
in the time series.
In their seminal paper, Balke and Fomby (1997) introduce the concept of threshold cointegration.
In the cointegration models introduced so far, every shock, independent of its
magnitude, induces an instant adjustment process towards the long-run equilibrium value.
Balke and Fomby (1997) flexiblize this assumption of linear adjustment. The process is assumed
to solely consist of a permanent component, if it does not exceed a certain threshold
level. By contrast, if the time series exceeds the threshold level the process is modeled as a
classic cointegration process and adjustment towards the corresponding long-run equilibrium
occurs as long as the process exceeds the threshold value in absolute terms. The advantage
of the partial cointegration model is the ability to model the impact of permanent shocks
globally and not just locally as in a thresold cointegration model. Threshold cointegration
models are implemented in the R package tsDyn (Stigler, 2010).
Potential fields of application for the PCI model in a financial context are: term structures,
stock indices and tracking portfolios, stock pairs, spot and future prices, commodities,
spread options, international stock indices, as well as foreign exchange (Alexander, 2011). In
addition, the PCI framework could be used to revisit macroeconomic theories, e.g., monetary
policy, fiscal policy or business cycle models. An initial show case for PCI can be found in
3Clegg and Krauss (2016). They apply the partialCI package to detect partially cointegrated
pairs of stocks on the S&P 500 from January 1990 to October 2015. The authors extract
the mean-reverting component of the price spread time series of the partially cointegrated
pairs of stocks as baseline for a relative-value arbitrage strategy.
The remainder of this paper is organized as follows. In section 2, we outline the methodological
details of the PCI model. In section 3, we explain how to use the key functions of the
partialCI package. In section 4, we provide a finance as well as a macroeconomic example.
Finally, section 5 provides concluding thoughts.
2. The partial cointegration framework
2.1. Model definition
Based on Engle and Granger (1987), Clegg and Krauss (2016, p. 4) define the concept
of partial cointegration as follows:
Definition: ”The components of the vector Xt are said to be partially cointegrated of
order d, b, denoted Xt ~ P CI (d, b), if (i) all components of Xt are I (d)
4
; (ii) there exists
a vector α so that Zt = α0Xt and Zt can be decomposed as a sum Zt = Rt + Mt
, where
Rt ~ I (d) and Mt ~ I (d ? b).”
While Clegg and Krauss (2016) focus on the special case of two partially cointegrated time
series, we extend the model to the case of (k + 1) partially cointegrated time series. Let Yt denote
the target time series and Xj,t the j
th factor time series at time t, where j = {1, 2, . . . , k}.
The target time series and the k factor time series are partially cointegrated, if a parameter
vector ι = {β1, β2, . . . , βk, ρ, σM, σR, M0, MR} exists such that the subsequent model
4
If a time series exhibits d unit roots, it is said to be integrated of order d (I (d)) (L¨utkepohl, 2007, p.
238-242).
4equations are satisfied (Clegg and Krauss, 2016)5:
Yt = β1X1,t + β2X2,t + ... + βkXk,t + Wt
Wt = Mt + Rt
Mt = ρMt1 + εM,t
Rt = Rt1 + εR,tεM,t ~ N0, σ2MεR,t ~ N0, σ2Rβj ∈ R; ρ ∈ (?1, 1) ; σ2
M, σ2R ∈ R+0.
(1)
Thereby, Wt denotes the partially autoregressive process, Rt the permanent component,
Mt the transient component and β = {β1, β2, . . . , βk} is the partially cointegrating vector.6
The permanent component is modeled as a random walk and the transient component as
an AR(1)-process with AR(1)-coefficient ρ. The corresponding error terms εM,t and εR,t
are assumed to follow mutually independent, normally distributed white noise processes
with mean zero and variances σ2M and σ2
R. For the sake of simplicity, we set M0 = 0 and
R0 = Y0 β1X1,0 β2X2,0 ... βkXk,0. A key advantage of modeling the cointegrating
process as a partially autoregressive process is that we are able to calculate the PVMR,
defined as (Clegg and Krauss, 2016),
R2MR = AR [(1 B) Mt]V AR [(1 B) Wt]=2σ2M2σ2M + (1 + ρ) σ2R, R2
MR ∈ [0, 1] , (2)
where B denotes the backshift operator. The statistic R2
MR is useful to assess how close the
cointegration process is to either a pure random walk (R2
MR = 0) or a pure AR(1)-process
(R2MR = 1).
2.2. State space representation
The applied state space transformation is in line with Clegg and Krauss (2016). Given
that the PAR process Wt
is not observable, we convert the PCI model into the following
5
It is possible to include an intercept within the partialCI package.
6Note that in the implemented estimation routine the estimated partially cointegrating vector is a linear
combination of all existing partially cointegrating vectors in the sense of Verbeek (2010, p. 324).
5state space model, consisting of an observation (3) and a state equation (4):
Xt = HZt (3)Zt = F Zt1 + Wt. (4)
Thereby, Zt (4) denotes the state which is assumed to be influenced linearly by the state in
the last period and a noise term Wt
. The matrix F is assumed to be time invariant. The
observable part is denoted by Xt (3). By assumption, there is a linear dependence between
Xt and Zt
, captured in the time invariant matrix H.
The PCI framework presented in equation (1) consists of the observable target as well as
factor time series and the two hidden state variables Mt and Rt
. Following the approach
of Clegg and Krauss (2016), the k factor variables are declared as additional hidden state
variables. As a consequence X1,t, X2,t, ..., Xk,t are part of both, the observation and the state
equation. Applying the state space transformation yields the following observation equation:, (6)
with εXj,t denoting the innovation of process Xj,t. By assumption, εXj,t is normally distributed
with zero mean and variance σ
and is independent of εM,t and εR,t.
62.3. Estimation of a partial cointegration model
Parameters are estimated via the maximum likelihood (ML) method. Using a quasiNewton
algorithm, the ML method searches for the parameters ρ, σand the parameter
vector β which maximizes the likelihood function of the associated Kalman filter.7 The
following likelihood score is maximized (Clegg and Krauss, 2016):, (7)
where φ (·) denotes the probability density function of the normal distribution. Clegg and
Krauss (2016) provide (i) a derivation of the likelihood function (7), (ii) a proof that the
partial cointegration model is identifiable, and (iii) a comprehensive discussion about the
consistency of the ML estimation routine.8
2.4. A likelihood ratio test routine for partial cointegration
The likelihood ratio test (LRT) implemented in the partialCI package adopts the LRT
routine for PAR models proposed by Clegg (2015a). In a PCI scenario the null hypothesis
consists of two conditions – namely the hypothesis that the residual series is a pure random
walk (HR) or a pure AR(1)-process (HM0). The two conditions are separately tested. Only
if both, HR0 and HM 0
are individually rejected, the null hypothesis of no partial cointegration
is rejected. On the first stage the LRT for partial cointegration tests the null hypothesis
of a pure random walk versus the alternative hypothesis of a pure AR(1)-process or PCI. To construct the first stage of the LRT for partial cointegration it is necessary
to estimate the likelihood scores of an unrestricted and a restricted model. The likelihood
score of the unrestricted model, i.e., the largest likelihood score found by the Kalman filter
optimization routine, is denoted by(8)
7The complete algorithm as well as the determination of the starting values are available in the R package
partialCI.
8The partialCI package also provides a two-step estimation method, which often produces results that
are inferior to the joint-penalty method, and so the joint-penalty method is to be preferred.
7The restricted model is obtained by setting ρ and σM to zero which is in line with the null
hypothesis of a pure random walk. The restricted model is given by. (9)
The test statistic for the pure radom walk hypothesis is given as
ΛR = log. (10)
Let CR (α) (CM (α)) denote the critical value associated with ΛR (ΛM) dependent on the
significance level α. If HR0
cannot be rejected, i.e., ΛR < CR (α), the tested time series is
classified as a pure random walk. On the other hand, if the test rejects HR0, the routine
continues, testing the conditional null hypothesis HM0|ΛR < CR (α) against HPCI1. Settingσ2R = 0 yields the likelihood score of the restriced model:L
M = maxβ,ρ,σ2MLMRβ, ρ, σ2M, σ2R = 0. (11)
The test statistic for the second stage is given as. (12)
If the conditional null hypothesis HM
0
|ΛR < CR (α) cannot be rejected, i.e., ΛM < CM (α),
the tested time series follows a pure AR(1)-process. Vice versa, if ΛM > CM (α) holds, the
time series is classified as partially cointegrated. Note that the critical values for both test
statistics ΛR as well as ΛM need to be simulated because the test statistics do not follow a
standard distribution. They are embedded in the package partialCI.
3. Using the PCI package
In this section, we outline the four key functions of the partialCI package in detail –
namely fit.pci(), test.pci(), statehistory.pci(), and hedge.pci().
3.1. fit.pci()
The function fit.pci() fits a partial cointegration model to a given collection of time
series.
8fit.pci(Y, X, pci opt method = c("jp", "twostep"), par model = c("par",
"ar1", "rw"), lambda = 0, robust = FALSE, nu = 5, include alpha=FALSE)
Y : Denotes the target time series and X is a matrix containing the k factors used to
model Y .9 pci opt method: Specifies, whether the joint-penalty method ("jp") or the twostep
("twostep") method is applied to obtain the model with the best fit. If pci opt method
is specified as "twostep", a two-step procedure similar to the method introduced by
Engle and Granger (1987) is performed. The residuals of the first stage regression are
extracted and a prespecified model is fitted to the residual series. Which model is fitted
to the residual series, depends on the specification for the argument par model. In case
of "par", a partial autoregressive model is used, in case of "ar1", an AR(1)-process
and in case of "rw" a random walk (default: par model = "par"). On the other
hand, if the pci opt method is specified as "jp", the joint-penalty method is applied,
to estimate β, ρ, σ2M and σ2
R jointly via ML. The likelihood score of the associated
Kalman filter is extended by a penalty value λσ2
R, where λ ∈ R+0. Larger values for λ
favor solutions with a larger transient component and vice versa (default: lambda =0). To reach a higher chance of finding the global minimum, the procedure uses several
different starting points. One of these starting points are the parameter estimates of
an ex-ante two-step procedure, ensuring that the likelihood score obtained under "jp"is at least as good as under "twostep" (default: pci opt method = "jp").
robust: Determines whether the residuals are assumed to be normally (FALSE) or tdistributed
(TRUE) (default: robust = TRUE). If robust is set to TRUE the degrees of
freedom can be specified, using the argument nu (default: nu = 5). If pci opt method
matches "twostep", a robust linear model (rlm()) included in the R package MASS
(Ripley and Venables, 2002) is applied, i.e., a Huber (1981) M-estimator is calculated.10
include alpha: If TRUE, an intercept α is added to the PCI relationship (default:
9Both, X and Y are plain or zoo (Grothendieck and Zeileis, 2005) objects. If k = 1, X is a vector.
10For a discussion about robust parameter estimation in a PAR context, see Clegg (2015a).
9include alpha = FALSE).
key return values: The proportion of variance attributable to mean-reversion ($pvmr),
the partially cointegrating vector ($beta), the AR(1)-coefficient ($rho) and the negative
log likelihood ($negloglik).
3.2. test.pci()
The test.pci() function tests the goodness of fit of a PCI model.
test.pci(Y, X, alpha = 0.05, null hyp = c("rw", "ar1"), robust = FALSE,
pci opt method = c("jp", "twostep"))
alpha: Determines at which significance level the null hypothesis is rejected (default:
alpha = 0.05).
null hyp: Specifies whether the null hypothesis is a random walk ("rw"), an AR(1)-process ("ar1") or a union of both hypotheses (c("rw", "ar1")) (default: null hyp= c("rw", "ar1")).
key return values: The test statistic ($statistic) and p-values ($p.value) for the
selected null hypothesis.
3.3. statehistory.pci()
To estimate the sequence of hidden states the statehistory.pci() function can be applied.
statehistory.pci(A, data = A$data, basis = A$basis)
A: Denotes a fit.pci() object.
data: Is a matrix consisting of the target time series and the k factor time series
(default: data = A$data).
basis: Captures the coefficients of the factor time series (default: basis = A$basis). key return values: The two estimated hidden states Mt ($M) and Rt ($R).
103.4. hedge.pci()
The function hedge.pci() finds those k factors from a predefined set of factors which yield
the best fit to the target time series.
hedge.pci(Y, X, maxfact = 10, lambda = 0, use.multicore = TRUE,
minimum.stepsize = 0, verbose = TRUE, exclude.cols = c(), search type =
c("lasso", "full", "limited"), pci opt method=c("jp", "twostep"))
maxfact: Denotes the maximum number of considered factors (default: maxfact =
10). use.multicore: If TRUE, parallel processing is activated (default: use.multicore =TRUE).
verbose: Controls whether detailed information are printed (default: verbose =
TRUE).
exclude.cols: Defines a set of factors which should be excluded from the search
routine (default: exclude.cols = c()).
search type: Determines the search algorithm applied to find the model that fits best
to the target time series. The likelihood ratio score (LRT score) is used to compare
the model fits, whereby lower scores are associated with better fits. If the option
"lasso" is specified the lasso algorithm as implemented in the R package glmnet
(Friedman et al., 2010) is deployed to search for the portfolio of factors that yields the
best linear fit to the target time series. If the option "full" is specified, then at each
step, all possible additions to the portfolio are considered and the one which yields the
highest likelihood score improvement is chosen. If the option "limited" is specified,
then at each step, the correlation of the residuals of the current portfolio is computed
with respect to each of the candidate series in the input set X, and the top B series
are chosen for further consideration. Among these top B candidates, the one which
improves the likelihood score by the greatest amount is chosen. The parameter B can
be controled via maxfact (default: search type = "lasso").
11 key return values: The best fit ($pci), the column indices ($indexes), and the names
of the factors included in the best fit ($index names).
4. Examples
4.1. Finance
As an introductory example, we explore the relationship between Royal Dutch Shell plc
A (RDS-A) and Royal Dutch Shell plc B (RDS-B), using daily (closing) price data from 1
January 2006 to 1 December 2016.11 To download the price data we use the getYahooData()
function, implemented in the R package TTR (Ulrich, 2016). The subsequent R code is
used to obtain the data.
library(partialCI)
library(TTR)
RDSA<-getYahooData("RDS-A", 20060101, 20161201)$Close
RDSB<-getYahooData("RDS-B", 20060101, 20161201)$Close
A classic cointegration analysis yields that the two time series are not cointegrated. In particular,
we apply the two-step approach of Engle and Granger (1987) implemented in the R
package egcm. By default, the egcm package uses the unit root test of Phillips and Perron
(1988)
12 (specification: with constant, no linear time trend) to investigate the residuals obtained
from an Ordinary Least Squares (OLS) regression. The R code,
library(egcm)
egcm_finance <- egcm(RDSA,RDSB,include.const = FALSE),
results in the following output:
Y[i] = 0.9732 X[i] + 0.0000 + R[i], R[i] = 0.9941 R[i-1] + eps[i],
(0.0005) (0.0000) (0.0025)
11RDS-A (Royal Dutch Shell plc - A, 2016) and RDS-B (Royal Dutch Shell plc - B, 2016) data are
downloaded from Yahoo Finance.
12The test of Phillips and Perron (1988) corrects for heteroscedasticity, a well-known stylized fact of
financial price time series (Krauss and Herrmann, 2017).
12eps ~ N(0, 0.1679^2)
R[2016-12-01] = -1.8991 (t = -1.477)
WARNING: X and Y do not appear to be cointegrated.
The residual plot in figure 1 (code: plot(egcm finance$residuals,type = "l") suggests
that the residual series is not purely mean-reverting, but rather shows a stochastical trend
as well as a mean-reverting behavior. Hence, it is not suprising that RDS-A and RDS-B are
Figure 1: Residual plot classic cointegration: RDS-A and RDS-B (1.01.2006 - 1.12.2016, daily)
not cointegrated. Using the PCI framework, we are able to fit a PCI model to RDS-A and
RDS-B with the following R code:
PCI RDSA RDSB<-fit.pci(RDSA, RDSB, pci opt method = c("jp"), par model
=c("par"), lambda = 0, robust = FALSE, nu = 5, include alpha = FALSE)).
The R output is given as,
Fitted values for PCI model
Y[t] = X[t] %*% beta + M[t] + R[t]
M[t] = rho * M[t-1] + eps_M [t], eps_M[t] ~ N(0, sigma_M^2)
R[t] = R[t-1] + eps_R [t], eps_R[t] ~ N(0, sigma_R^2)
13Estimate Std. Err
beta_Close 0.9274 0.0038
rho 0.3959 0.0965
sigma_M 0.1081 0.0083
sigma_R 0.1195 0.0076
-LL = -1117.29, R^2[MR] = 0.540,
where beta Close denotes the partially cointegrating coefficient. Thereby, the coefficient of
0.9274 indicates a positive relationship between RDS-A and RDS-B, and the PVMR of 0.54
suggests that the spread time series also exhibits a clear mean-reverting behavior.
In the subsequent step, we utilize the test.pci() function to check whether RDS-A and RDS-B
are partially cointegrated. The R code
test.pci(RDSA, RDSB, alpha = 0.05, null hyp = c("rw", "ar1"), robust =
FALSE, pci opt method = c("jp")),
leads to the following output:
Likelihood ratio test of [Random Walk or CI(1)] vs Almost PCI(1)
(joint penalty method)
data: StockA
Hypothesis Statistic p-value
Random Walk -55.09 0.010
AR(1) -52.88 0.010
Combined 0.010.
Recall that a time series is classified as partially cointegrated, if and only if the random walk
as well as the AR(1)-hypotheses are rejected. The p-value of 0.010 for the combined null
hypothesis indicates that RDS-A and RDS-B are partially cointegrated in the considered
period of time.
Next, we demonstrate the use of the statehistory.pci() function which allows to estimate and
extract the hidden states. The R code,
statehistory.pci(PCI RDSA RDSB), results in the R output:
14Y Yhat Z M R eps_M eps_R
2006-01-03 35.87002 35.26781 0.6022031 0.00000000 0.6022031 0.00000000 0.00000000
2006-01-04 36.23993 35.57175 0.6681755 0.02030490 0.6478706 0.02030490 0.04566752
2006-01-05 35.80276 35.24161 0.5611509 -0.02112621 0.5822771 -0.02916450 -0.06559352
2006-01-06 36.48653 35.83377 0.6527591 0.01590352 0.6368556 0.02426695 0.05457850
...
2016-11-25 50.18000 49.52231 0.6576906 -0.08762384 0.7453144 -0.07643882 -0.17191764
2016-11-28 49.20000 48.22397 0.9760311 0.04699758 0.9290335 0.08168603 0.18371909
2016-11-29 49.06000 48.02922 1.0307808 0.04419468 0.9865862 0.02558931 0.05755262
2016-11-30 51.10000 50.23639 0.8636066 -0.02573955 0.8893462 -0.04323530 -0.09724000
2016-12-01 51.78000 51.15450 0.6254956 -0.08826115 0.7137567 -0.07807140 -0.17558945.
The latter table covers the estimates of the hidden states M and R as well as the corresponding
error terms eps M and eps R. Z is equal to the sum of M and R. The estimate
of the target time series is denoted by Yhat. Figure 2 illustrates a plot of the extracted
mean-reverting component of the spread associated with the RDS-A and RDS-B price time
series (plot(statehistory.pci(PCI RDSA RDSB)[,4]
,type = "l",ylab = "", xlab = "")). The horizontal blue lines are equal to two times
Figure 2: Mean-reverting component RDS-A and RDS-B (1.01.2006 - 1.12.2016, daily)
the historical standard deviation in absolute terms of the mean-reverting component. A pairs
trading strategy could exploit the mean-reverting behavior of Mt
. Note that this example is
in-sample; for a true out-of-sample application see Clegg and Krauss (2016).
15We continue with using hedge.pci() to find the set of sector ETFs forming the best hedging
portfolio for the SPY index (S&P500 index). Thereby, the R code,
sectorETFS <- c("XLB", "XLE", "XLF", "XLI", "XLK", "XLP", "XLU", "XLV", "XLY")
prices <- multigetYahooPrices(c("SPY", sectorETFS), start=20060101)
hedge.pci(prices[,"SPY"], prices),
results in the subsequent output:
-LL LR[rw] p[rw] p[mr] rho R^2[MR] Factor | Factor coefficients
2320.00 -23.3743 0.0100 0.0100 0.5759 0.4526 XLI | 3.1106
1765.50 -46.5925 0.0100 0.0100 0.3170 0.4713 XLY | 1.8951 1.1989
1494.95 -53.7256 0.0100 0.0100 0.3244 0.5038 XLV | 1.6999 0.9106 0.6619
972.58 -65.9058 0.0100 0.0100 0.4060 0.5904 XLK | 1.3089 0.4933 0.5320 1.5182.
The table summarizes information about the best hedging portfolio, where each row corresponds
to an increasing number of factors. Row 1: The best single-factor hedging portfolio
comprises XLI (industrials) as only factor. Row 2: The best two-factor hedging portfolio
consists of XLI and XLY (consumer discretionary). As such, XLY leads to the best improvement
of the LRT score among all remaining factors. Row 3 includes XLV (health care) for
the three-factor portfolio and row 4 XLK (technology) for the best four-factor portfolio. The
last row corresponds to the overall best fit out of the nine potential sector ETFs, based on
the LRT score. Note that for all rows, the union of random walk and AR(1)-null hypothesis
is rejected at the 5 percent significant level, so we find a PCI model at each step.
4.2. Macroeconomics
As a second example, we revisit the relationship between GDP and personal consumption
expenditures for the United States (among others see Cochrane (1994), Gonzalo et al. (2008)
and Guisan (2008)), using quarterly seasonally adjusted annual rates in billion US-Dollar
from January 1976 to July 2016.13 The following R code triggers the data download:
13We utilize the R package Quandl (Daroczi et al., 2016) to download the GDP (US. Bureau of Economic
Analysis, 2016a) as well as personal consumption expenditures data (US. Bureau of Economic Analysis,
2016b). Thereby, the time series data are directly converted into xts (Ryan and Ulrich, 2014) objects.
16library(xts)
library(Quandl)
library(partialCI)
GDP = Quandl("FRED/GDP", start_date = "1976-01-01",
end_date = "2016-04-01", type = "xts")
Consumption = Quandl("FRED/PCEC", start_date = "1976-01-01",
end_date = "2016-04-01",type = "xts").
Applying the unit root test of Phillips and Perron (1988) as implemented in the R package
egcm yields that GDP and personal consumption are not cointegrated in the classic sense,
within the considered time frame.14 The residual plot in figure 3 (code: plot(egcm macro$
residuals,type = "l") obtained from standard cointegration analysis shows that the residuals
exhibit both, mean-reverting and stochastic trending behavior.15 To account for the
Figure 3: Residual plot classic cointegration: GDP and consumption (1976-2016, quarters)
stochastic trending behavior we apply the following PCI model:
14The R code is given as egcm macro <- egcm(Consumption,GDP,include.const = FALSE). For the sake
of brevity, we do not show the R output.
15We are aware of the structural break in the residual series around the second quarter of the year 2000.
The function breakpoints() implemented in the R package strucchange (Hornik et al., 2003) is used to
obtain the estimate of the structural break.
17PCI GDP Consumption<-fit.pci(GDP, Consumption, pci opt method = c("jp"),
par model =c("par"), lambda = 0, robust = FALSE, nu = 5, include alpha =
FALSE)).
The latter function yields the following R output:
Fitted values for PCI model
Y[t] = X[t] %*% beta + M[t] + R[t]
M[t] = rho * M[t-1] + eps_M [t], eps_M[t] ~ N(0, sigma_M^2)
R[t] = R[t-1] + eps_R [t], eps_R[t] ~ N(0, sigma_R^2)
Estimate Std. Err
beta_ 1.3963 0.0358
rho 0.2812 0.3357
sigma