Changepoint tests under local alternatives for longrange dependent processes
Abstract
We consider the changepoint problem for the marginal distribution of subordinated Gaussian processes that exhibit longrange dependence. The asymptotic distributions of KolmogorovSmirnov and Cramérvon Mises type statistics are investigated under local alternatives. By doing so we are able to compute the asymptotic relative efficiency of the mentioned tests and the CUSUM test. In the special case of a meanshift in Gaussian data it is always . Moreover our theory covers the scenario where the Hermite rank of the underlying process changes.
In a small simulation study we show that the theoretical findings carry over to the finite sample performance of the tests..
Keywords: asymptotic relative efficiency, changepoint test, empirical process, local alternatives, longrange dependence.
1 Introduction
Over the last two decades various authors have studied the changepoint problem under longrange dependence and classical methods are often found to yield different results than under shortrange dependence. The CUSUM test is studied in Csörgő and Horvath (1997) and compared to the Wilcoxon changepoint test in Dehling et al. (2012). Ling (2007) investigates a DarlingErdőstype result for a parametric changepoint test, and estimators for the time of change are considered in Horvath and Kokoszka (1997) and Hariz et al. (2009). Moreover, the special features of long memory motivated new procedures. Beran and Terrin (1996) and Horvath and Shao (1999) are testing for a change in the linear dependence structure of the time series and Berkes et al. (2006) and Baek and Pipiras (2011) construct tests in order to discriminate between stationary long memory observations and short memory sequences with a structural change. For a general overview of the changepoint problem under longrange dependence see Kokoszka and Leipus (2001) and the associated chapter in Beran et al. (2013).
One of the classical changepoint problems is the change of the marginal distributions of a time series . When testing for at most one changepoint (AMOC) in the marginal distribution one often considers the empirical distribution function of the first observations and that of the remaining observations. Taking a distance between the empirical distributions and the maximum over all yields a natural statistic. Common distances are the supremum norm, which gives the KolmogorovSmirnov statistic
(1.1) 
or an distance, which gives the Cramérvon Mises statistic
(1.2) 
Both are widely used for goodnessoffit tests and twosample problems. In the changepoint literature they are considered by Szyszkowicz (1994) for independent data, by Inoue (2001) for strongly mixing sequences and by Giraitis et al. (1996b) for linear longmemory processes. However, note that in the LRD setting only the KolmogorovSmirnov test has been investigated.
(1.1) and (1.2) are functionals of the sequential empirical process, that is for and . Thus the asymptotic distributions of and rely on that of the sequential empirical process. For weakly dependent sequences this would be a Gaussian process, in the special case of independent random variables it is called KieferMüller process. For stationary sequences that exhibit longrange dependence, Dehling and Taqqu (1989a) proved that the limit process is of the form , where is a deterministic function and the process is therefore called semidegenerate. They considered subordinated Gaussian processes, in detail for any measurable function and a Gaussian sequence with nonsummable autocovariance function. A similar limit structure was later obtained independently by Ho and Hsing (1996) and Giraitis et al. (1996a) for longrange dependent movingaverage sequences.
It is the main goal of this paper to derive the limit distribution of changepoint statistics of the type (1.1) and (1.2) under local alternatives. We then apply these results to derive the asymptotic relative efficiency (ARE) of several changepoint tests. To this end we investigate the sequence
(1.3) 
Here is a sequence of functions such that the distribution of converges to the distribution of in some suitable way.
Therefore, we are able to analyze various types of changepoints, among them a meanshift. Thus we may compute the ARE of Kolmogorov Smirnov, Cramérvon Mises, CUSUM and Wilcoxon test and get the surprising result that in case of Gaussian data it is always .
The mathematically most challenging case is the situation when the Hermite rank changes. The Hermite rank of the class is defined as the smallest positive integer, such that for some , with being the th Hermite polynomial. The structure of the limiting process , e.g. the marginal distribution and the covariance structure, mainly depends on . However, a special feature of distributional changes in subordinated Gaussian processes is the fact that the Hermite rank may change, too. Hence the question arises which Hermite process will determine the limit distribution. Under a meanshift the Hermite rank remains unchanged, which can be seen easily by its definition.
Our results differ in various ways from those obtained in Giraitis et al. (1996b), where changes in the coefficients of an LRD linear process were investigated. While the empirical process of LDR moving average sequences converges to fractional Brownian motion, we may encounter higher order Hermite processes. The possible change in the Hermite rank is therefore a novel feature in our investigation.
The rest of the paper is organized as follows. In section 2 we will state a limit theorem for the sequential empirical process under changepoint alternatives. Moreover we will give the asymptotic distribution of the test statistics under the hypothesis of no change as well as under local alternatives. Thus we are able to derive the asymptotic relative efficiency of several changepoint tests. In section 2.5 we consider the empirical process for longrange dependent arrays that are stationary within rows. The outcome mainly serves as a device for proving the main results, but is also of interest on its own. Section 3 contains the simulation study. To the best of our knowledge there are no results on the finite sample performance of the Cramérvon Mises changepoint test under long memory. It is compared to other changepoint tests and the effect of an estimated Hurstcoefficient is discussed. We obtain that the theoretical results (e.g. asymptotic relativ efficiency between Cramérvon Mises and CUSUM test) carry over to the finite sample performance of the tests. Finally proofs are provided in section 4.
2 Main results
Let be a stationary Gaussian process, with
for and a slowly varying function . The nonsummability of the covariance function is one possibility to define longrange dependence. We investigate our results for so called subordinated Gaussian processes , where and is a measurable function. The key tool in our analysis of possible changes in the marginal distribution of such a process is the sequential empirical process. To obtain weak convergence of this process the right normalization is given by , defined by
(2.1) 
where the constant of proportionality is , see Theorem 3.1 in Taqqu (1975). is called Hurst coefficient and
is the Hermite rank of . The mentioned result of Dehling and Taqqu (1989a) then reads as follows.
Theorem A (Dehling, Taqqu).
Let the class of functions have Hermite rank m and let . Then
(2.2) 
where the convergence takes place in , equipped with the uniform topology. is defined by
and is an th order Hermite process, see Taqqu (1979) for a definition.
Remark 2.1.
In the case , the Hermite process becomes the well known fractional Brownian Motion, which we denote by .
2.1 The empirical process under changepoint alternatives
Let us consider the following changepoint model. Define the triangular array
(2.3) 
for measurable functions and and unknown . For one gets a rowwise stationary triangular array, as considered in section 2.5, and for a stationary sequence, as in Dehling and Taqqu (1989a). In what follows we will denote the distribution functions of and by and , respectively.
To obtain weak convergence of the empirical process of (2.3) we have to make some assumptions on the structure of the change and the Hermite rank.
Assumption A:

The class of functions has Hermite rank with .

Let be the Hermite rank of and . Then we assume
for some .
Theorem 1.
If Assumption A holds, then
where is the Hermite coefficient of . The convergence takes place in , equipped with the uniform topology.
Remark 2.2.
(i) For given functions and , Assumption A2 might easily being checked, see the examples below. It serves to ensure convergence of the Hermite coefficients . In detail,
implies, see the proof of Lemma 4.5,
(2.4) 
By Assumption A1, for all , yet for some . Together with (2.4) this implies .
(ii) Moreover, A2 implies convergence of the marginal distribution function. To see this, note
and . However, the converse is not always true. Consider for instance the functions and or the situation in Example 2.8. Then again, there are lots of natural choices of and for whom convergence of the marginal distribution functions (with a certain rate) implies Assumption A2. Among them (meanshift), (change in variance) and
(iii) Our assumptions explicitly allow for the Hermite rank to change together with the marginal distribution. Then again, the limit behaivior seems to be untouched by this change. Intuitively this corresponds to the idea that the change in distribution and the change in the Hermite coefficient, both caused by the difference of and , are of the same order. For this enforces the function to converge rather fast to . Technically this can be explained through A2. If this assumption is dropped, we might actually encounter limits with multiple Hermite processes. Such cases will be considered in Example 2.8 and Corollary 2.13.
(iv) If A1 is violated, the sequence is actually shortrange dependent. For stationary observations Csörgő and Mielniczuk (1996) showed convergence of the sequential empirical process to a twoparameter Gaussian process. Changepoint alternatives have not been considered for such random variables, yet, but would require fundamentally different proofs compared to our results.
2.2 Asymptotic behavior of the changepoint statistics
We now apply the results concerning empirical processes to determine the asymptotic distribution of the KolmogorovSmirnov statistics
(2.5) 
and that of the Cramérvon Mises changepoint statistic
(2.6) 
To get a non degenerate limit under a sequence of local alternatives it is important to choose the right amount of change. For a meanshift this is naturally the difference of the expectations before and after the change. For a general change we formulate the test problem as follows: We wish to test the hypothesis
against the sequence of local alternatives
(2.7)  
Remark 2.3.
Theorem 2.
(i) Under the hypothesis of no change we have, as ,
and 
where .
(ii) Under the sequence of local alternatives we have, as ,
and 
where
Motivated by this Theorem we consider changepoint tests based on the statistics and . Critical values might be chosen as
for the KolmogorovSmirnov test and the Cramérvon Mises test, respectively. Here is the quantile of . Thereby the tests have asymptotically level and nontrivial power against local alternatives.
The tests can be performed, if the right normalization for the empirical process, the supremum of and the distribution of are known. In practical applications this might be not the case. Solutions are selfnormalization (Shao (2011)), estimating the the Hurstcoefficient (see for example Künsch (1987)) and bootstrap estimators for (Tewes (2016)).
2.3 Examples
Example 2.4 (Meanshift).
Let with , then we get the typical change in the mean problem. In the case of longrange dependent subordinated Gaussian processes this was considered in Dehling et al. (2012, 2013), Csörgő and Horvath (1997), Shao (2011) and Betken (2016). Let be the probability density of , and assume that it is continuous and of bounded variation. Then we obtain
where, due to continuity of , the convergence holds uniformly.
Example 2.5 (Change in the variance).
To describe the changeinvarianceproblem define , with . For ease of notation let . Then we get
(2.8)  
(2.9) 
The derivative of is , hence (2.8) converges to . The convergence is uniform, if and are continuous. (2.9) converges to , because of continuity, monotonicity and boundedness of . Thus (2.7) holds with function Assume without loss of generality , then
The minimum can be treated analogously, hence Assumption A2 follows from convergence of the marginals.
Additionally one might consider a combined change in mean and variance, given through . In this case (2.7) holds with .
Example 2.6 (Generalized inverse of a mixture distribution).
By using the generalized inverse of a distribution function one could generate subordinated Gaussian processes with any given marginals, see for example Dehling et al. (2013). We use this for the changepoint problem by setting
For a continuous distribution function define the mixture
with . Then (2.7) holds with and moreover
Analogously one has . Hence
thus Assumption A2 is also satisfied. For strongly mixing data similar local alternatives were considered by Inoue (2001).
Example 2.7 (distribution).
Consider a distribution given through and note that the indicator functions have Hermite rank , see also Dehling and Taqqu (1989a). Further let
with Hermite ranks for all . If , then one can show (similar to the case of a variance change in Example 2.5) that
uniformly in . As Assumption A2 is satisfied, too, we may apply Corollary 2 (ii) with function and .
Example 2.8 (Multiple Hermite processes in the limit).
In the previous example, together with the marginal distribution, also the Hermite rank has changed. However, the limiting process seems to be untouched by this fact and one might ask whether this is intuitive or not.
It is caused by the fact that the change in the distribution and the change in the Hermite coefficients, both originating in the difference of the functions and , are of the same order.
To get an additional Hermite process in the limit, one would need , see Corollary 2.13 and its proof. But then
and the test would have asymptotic power .
To achieve nontrivial asymptotic power one has to consider structural breaks that consists of two aspects and where only one is captured by the marginal distribution. To this end define the transformations
and
where and
for some sequence with and . On the one hand, has Hermite rank and . On the other hand, has Hermite rank for all and . Now let , then Example 2.4 applies and we obtain
for any sequence . In contrast, the convergence of the Hermite coefficients is highly influenced by . If the sequence is chosen such that (therefore, it converges slower than ), then the sequential empirical process will converge towards
Actually this can be proved similar to Corollary 2.13. Moreover, the KolmogorovSmirnov statistic converges weakly to
We find this example rather pathological, therefore such situations are excluded from the main results via Assumption A2.
2.4 Asymptotic relative efficiency
By studying the asymptotic distributions under local alternatives one might compare different tests in terms of the asymptotic relative efficiency (ARE). Here we give a precise definition of the ARE in the very special context of our changepoint setting. The general idea is due to Pitman (1948) (for a published article see for example Noether (1950)) and was formalized in Noether (1955). Of course it can be extended to all kinds of testing procedures.
Definition 2.9.
Let and represent two changepoint test procedures. Consider the local alternatives
such that for all and .
Let be the asymptotic power of the test against the local alternatives given by and be the asymptotic power of the test against the local alternatives given by . If equals , then the asymptotic relative efficiency (ARE) of the tests and is defined as
Example 2.10 (Meanshift in Gaussian data).
Consider and , in other words a meanshift in Gaussian data. As for the Hermite coefficient function, we get , where is the standard normal probability density. Thus, according to Corollary 2, the test statistic converges towards
whereas under the Null, that is we have a stationary standard Gaussian sequence, the limit distribution would be
For the Cramérvon Mises statistic we obtain analogously the limit distributions
under local alternative and hypothesis, respectively. Hence in this special case the CUSUM test, the Wilcoxon test (see Dehling et al. (2013) for each), the KolmogorovSmirnov test and the Cramérvon Mises test all have the same asymptotic power, namely
(2.10) 
where is the quantile of the maximum of a fractional Brownian bridge .
As a direct consequence, one gets that the ARE of the four tests is . This result is quite surprising, keeping in mind that CUSUM and Wilcoxon tests are designed to detect levelshifts, while our tests have power against all kinds of distributional changes.
For nonGaussian data and changepoints beyond a simple meanshift, the investigation of the ARE is not that straightforward. In fact, little is known about the distribution of
and even less if higher order Hermite processes are considered. This seems to prevent a precise computation of the ARE in many cases. However, one might derive lower bounds for the efficiency as we do in next example for a combined change in mean and variance. Unlike in the previous example, we will make use of the subtle definition of the ARE.
Example 2.11 (Combined change in mean and variance).
Let and , that is a combined change of mean and variance in Gaussian data. If further and , then by example 2.5 the empirical bridgetype process converges to (for the sample size )
We now consider slightly modified Cramérvon Mises and CUSUM tests, in detail, instead of the supremum is taken over for some and .
The asymptotic distribution of the CUSUM test has been derived in Dehling et al. (2013), but only in the case of a meanshift with constant variance. However, for the CUSUM statistic is a continuous functional of the sequential empirical process. Thus, we might apply our Theorem 1 and conclude that the CUSUM statistic converges under this type of local alternatives to
Note that this is the same limit as under a meanshift with constant variance and thus, too, the asymptotic power is the same as in example 2.10.
The limiting distribution of the Cramérvon Mises statistic is given by
and for its asymptotic power we obtain
(2.11)  
First assume and consider , given by
Now is constructed in a way, such that
and for all with
If, on the other hand, , then (because ) automatically . Combining these two findings with (2.11) we can bound the asymptotic power from below by
(2.12) 
for .
Now we are ready to compute the ARE. To this end we chose different sample sizes for both test. In detail, for the Cramérvon Mises test and for the CUSUM test. Moreover, the local alternatives are such that for all , consequently
For the CUSUM test, in order to achieve at least asymptotic power , its limit distribution has to satisfy
In other words, , where