Construction of interlaced polynomial lattice rules for infinitely differentiable functions
Abstract
We study multivariate integration over the dimensional unit cube in a weighted space of infinitely differentiable functions. It is known from a recent result by Suzuki that there exists a good quasiMonte Carlo (QMC) rule which achieves a superpolynomial convergence of the worstcase error in this function space, and moreover, that this convergence behavior is independent of the dimension under a certain condition on the weights.
In this paper we provide a constructive approach to finding a good QMC rule achieving such a dimensionindependent superpolynomial convergence of the worstcase error. Specifically, we prove that interlaced polynomial lattice rules, with an interlacing factor chosen properly depending on the number of points and the weights, can be constructed using a fast componentbycomponent algorithm in at most arithmetic operations to achieve a dimensionindependent superpolynomial convergence. The key idea for the proof of the worstcase error bound is to use a variant of Jensen’s inequality with a purposelydesigned concave function.
Keywords: QuasiMonte Carlo integration, superpolynomial convergence, interlaced polynomial lattice rules, infinitely differentiable functions
MSC classifications: 65C05, 65D30, 65D32
1 Introduction
We study the approximation of multivariate integrals of realvalued functions defined over the dimensional unit cube ,
QuasiMonte Carlo (QMC) integration approximates by using a deterministically chosen finite point set as
where denotes the cardinality of . Note that we interpret here as a set in which the multiplicity of elements matters. In order to make the integration error small for a class of functions , needs to be carefully designed depending on the class to which the function belongs. Digital nets and sequences are a wellknown choice for constructing good quadrature points for several classes of functions [10, 20].
A classical criterion for measuring the distribution properties of point sets is the socalled stardiscrepancy. The Koksma–Hlawka inequality bounds the integration error using a point set by the stardiscrepancy of this point set times the total variation in the sense of Hardy and Krause, see for instance [17, Chapter 2, Section 5]. Thus a lowdiscrepancy point set of points yields a small integration error bound, typically of order with arbitrarily small , assuming that the function has bounded total variation in the sense of Hardy and Krause. Regarding explicit constructions of lowdiscrepancy digital nets and sequences, we refer to [10, Chapter 8] and [20, Chapter 4]. Polynomial lattice point sets, first introduced in [21], are a special construction method for digital nets and have been extensively studied in the literature, see for instance [10, Chapter 10] and [24]. Polynomial lattice rules are QMC rules using a polynomial lattice point set as quadrature points. While we usually resort to some computer search algorithm to find good polynomial lattice rules for , the major advantage of polynomial lattice rules lies in their flexibility, that is, we can design a suitable QMC rule for the problem at hand.
In order to achieve a faster convergence of the integration error, explicit constructions of point sets, referred to as higher order digital nets, have been established by Dick [2, 3] which can fully exploit the smoothness of an integrand. Specifically QMC rules using higher order digital nets achieve the optimal convergence rate of the integration error of order with arbitrarily small , when the function has square integrable partial mixed derivatives up to order in each variable. We remark that recent applications in the area of uncertainty quantification, in particular partial differential equations with random coefficients, are in need of using these types of quadrature rules, see for instance [7]. The above result by Dick is based chiefly on analyzing the decay of the Walsh coefficients of smooth functions [3, 4].
Numerical integration of infinitely many times differentiable functions in certain function spaces has recently been considered in [8, 11, 15, 16]. However, the results on higher order digital nets in [2, 3] do not improve if one assumes that the integrand is infinitely many times differentiable. More precisely, if one sets in [2, 3] one obtains constants which are infinite and the error bounds become trivial. To improve the error bounds in these papers for function spaces consisting of infinitely many times differentiable functions using higher order digital nets requires new bounds on the Walsh coefficients. Such an analysis of the Walsh coefficients was recently done in [28, 30], where they obtained a space of infinitely differentiable functions whose Walsh coefficients decay with a certain order. The worstcase error in by a digital net is closely related to the Walsh figure of merit (WAFOM) introduced in [18, 26], which is one of the computable quality criteria of digital nets, although WAFOM was originally derived in a different way from [28, 30]. Moreover, Suzuki [27] considered a weighted space of infinitely differentiable functions and studied tractability of multivariate integration in , where the positive real numbers are the weights. His result can be summarized as follows: There exists a good QMC rule using a digital net which achieves an superpolynomial convergence of the worstcase error in as , and moreover, the convergence can be independent of the dimension as for some under a certain condition on the weights .
In this paper, beyond the existence result of [27], we provide a constructive approach to finding good QMC rules achieving a dimensionindependent superpolynomial convergence of the worstcase error. Specifically we prove that interlaced polynomial lattice rules can be constructed using a fast componentbycomponent (CBC) algorithm, in at most arithmetic operations, to achieve a dimensionindependent superpolynomial convergence. As first studied in [12, 13, 14], interlaced polynomial lattice rules belong to the family of higher order digital nets and therefore achieve a higher order polynomial convergence of the integration error. We use them as QMC rules achieving a superpolynomial convergence in this paper. For this purpose, we are required to choose an interlacing factor depending on the number of points and the weights, instead of keeping it fixed (as for instance in [2, 3]). Furthermore, in order to show the worstcase error bound with a superpolynomial convergence, we purposely design a concave function to modify Jensen’s inequality which has been often used in the literature to obtain error bounds with an improved rate of convergence.
Our approach requires to set the weights for constructing a tailored QMC rule, as often encountered in this type of construction algorithms. In practical applications, however, it is not always the case where one can know in advance to which function class the functions of interest belong. To work around this drawback, it must be interesting to study whether a good convergence property which such a tailored QMC rule holds for a specific function class can be also established for other function classes, as discussed for instance in [14, Remark 1]. We observe in Section 5 that our constructed rules empirically work even for some functions not belonging to the target space. In another direction for constructing a robust QMC rule working for many different function classes, one can implement a more elaborate construction algorithm as given in [5]. However, theoretical analysis of these issues is beyond the scope of this paper and we leave them open for further research.
The remainder of this paper is organized as follows. In the next section, we introduce the necessary background and notation, namely Walsh functions, a weighted space of infinitely differentiable functions, our considering superpolynomial convergence and interlaced polynomial lattice rules. We also describe the main results of this paper. Namely, we introduce a componentbycomponent algorithm, state a result on the convergence behavior of interlaced polynomial lattice rules and discuss the dependence of the worstcase error bound on the dimension. In Section 3, we study the worstcase error in for QMC rules using a digital net and derive a computable upper bound. We prove in Section 4 that the CBC algorithm can be used to obtain good interlaced polynomial lattice rules which achieve a dimensionindependent superpolynomial convergence of the worstcase error. Thereafter we describe the fast CBC algorithm using the fast Fourier transform as in [22, 23], and show that interlaced polynomial lattice rules achieving a dimensionindependent superpolynomial convergence can be constructed in at most arithmetic operations using memory. Finally, we conclude this paper with numerical experiments in Section 5.
2 Background, notation and results
Throughout this paper, we use the following notation. Let be the set of positive integers and let . For a positive integer , let be a finite ring with elements, which we identify with the set equipped with addition and multiplication modulo . For , we denote its adic expansion by with for all , which is unique in the sense that infinitely many are different from . The operators and denote digitwise addition and subtraction modulo , respectively. That is, for whose unique adic expansions are and , and are defined as
where and , respectively. Similarly, we define digitwise addition and subtraction for nonnegative integers based on their adic expansions. In case of vectors in or , the operators and are applied componentwise.
2.1 Walsh functions
Walsh functions were first introduced in [29] for the case and were later generalized to arbitrary base , see for instance [1]. We refer to [10, Appendix A] for more information on Walsh functions in the context of numerical integration. We first give the definition for the onedimensional case.
Definition 1.
Let be a positive integer and let be a th root of unity. We denote the adic expansion of by with . The th adic Walsh function is defined as
for with its unique adic expansion .
This definition can be generalized to the higherdimensional case.
Definition 2.
Let be a positive integer. For a dimension , let and . The th adic Walsh function is defined as
Since we always use Walsh functions in a fixed base , we omit the subscript and simply write or in the remainder of this paper. From the fact that the system is a complete orthonormal system in for any [10, Theorem A.11], we have a Walsh series expansion for any
where denotes the th Walsh coefficient of , which is defined as
For continuous functions for which , the Walsh series of converges to pointwise absolutely. In fact, for any function in a weighted space which we consider in this paper, its Walsh series converges to pointwise absolutely.
2.2 Weighted function space
We first define the function for a real number .
Definition 3.
Let be a real number. For , we denote its adic expansion by such that and . The function is defined as
(1) 
and .
Remark 1.
Let us consider the case . If the sum on the righthand side of (1) which runs over is replaced by the sum which runs over for a fixed , we recover the definitions by Niederreiter, Rosenbloom and Tsfasman in [19, 25] for and by Dick in [3] for . Our function with has been used in [18, 26]. The parameter was included in the definition originally by Yoshiki [30] for and later by Suzuki [27] for an arbitrary real number .
For the higherdimensional case, we consider a vector of real numbers and define the function as follows.
Definition 4.
Let be a vector of real numbers, and let . The function is defined as
We are now ready to introduce a weighted space of infinitely differentiable functions. Let be a sequence of positive real numbers which we call weights, and we assume that throughout this paper.
Definition 5.
Let be a sequence of weights. We define a weighted space as
where denotes the th mixed partial derivative of , i.e., .
In the function space , small means that higher order partial mixed derivatives associated with the th coordinate must be relatively small. Thus, the weights play a role in moderating the importance of different variables. Owing to the refined analyses of the Walsh coefficients in [28, 30], it was shown that the Walsh coefficients of any function in decay with a certain order, as we describe in the following. Let
and
Moreover, let
Then we have the following.
2.3 Superpolynomial convergence
From [27], it is known that there exists a good QMC rule which achieves a dimensionindependent superpolynomial convergence of the worstcase error in under a certain condition on the weights . Here we briefly recall the result of [27].
The initial error in is given by the error of the zero algorithm, i.e.,
which indeed equals 1 for any and . Hence the integration problem in is well normalized. The worstcase error in for a QMC rule using a point set is defined as
We are interested in a dimensionindependent superpolynomial convergence of the worstcase error of the form
(2) 
where and are positive constants independent of and . The following existence result is from [27].
2.4 Interlaced polynomial lattice rules
Here we give the definition of interlaced polynomial lattice rules, which are based on polynomial lattice rules, introduced by Niederreiter [21], and a digit interlacing composition, introduced by Dick [2, 3].
We first introduce polynomial lattice rules. In this subsection, let be a prime number, and let be the finite field with elements. We denote by the set of all polynomials over , and denote by the field of formal Laurent series over . Every element of can be represented as
for some integer and for all . For a given integer , we define the mapping from to the interval by
A nonnegative integer whose adic expansion is given by will be identified with the polynomial . For and , we define the inner product as
(3) 
and we write if divides in . Using this notation, polynomial lattice rules are constructed as follows.
Definition 6.
Let . Let such that and let . A polynomial lattice point set is a set consisting of points that are defined as
for . A QMC rule using this point set is called a polynomial lattice rule with generating vector and modulus .
We add one more notation and introduce the concept of the socalled dual polynomial lattice of a polynomial lattice point set. For with its adic expansion , let be the polynomial of degree at most obtained by truncating the associated polynomial as
where we set if . For a vector , we define . With this notation, we introduce the following definition of the dual polynomial lattice .
Definition 7.
The dual polynomial lattice of a polynomial lattice point set with modulus , , and generating vector is given by
where the inner product is in the sense of (3).
The following important lemma relates the dual polynomial lattice to numerical integration of Walsh functions, see [10, Lemmas 4.75 and 10.6] for the proof.
Lemma 1.
Let be a polynomial lattice point set with modulus , , and generating vector , and let be its dual polynomial lattice. Then we have
We introduce the digit interlacing composition next. Let be a positive integer called interlacing factor, and let be a generic point in whose unique adic expansions are given by . Then the digit interlacing function is defined as
We also define such a function for dimensional vectors by applying to every consecutive components, that is,
Now we are ready to introduce the definition of interlaced polynomial lattice rules [12, 13, 14].
Definition 8.
Let . Let such that and let . An interlaced polynomial lattice point set of order is a set consisting of points defined as
A QMC rule using this point set is called an interlaced polynomial lattice rule of order with generating vector and modulus .
2.5 The results
We now describe the main results of this paper. In the following, let be a prime number and let . For with and , we denote the polynomial lattice point set by with , and denote the adic expansion of by for and . Moreover, we denote the interlaced polynomial lattice point set by , where for .
Let be a sequence of weights, and as in Proposition 1, let be the sequence given by
In Section 3, we show that the worstcase error in by a QMC rule using as quadrature points is bounded by
where and are given by
and
respectively, where is defined as
Since is independent of the modulus and generating vector , can be used as a quality criterion for searching for good and . In the following we introduce the CBC algorithm.
We restrict , , to nonzero polynomials over with its degree less than , where . Provided that is irreducible, we can set without loss of generality. We denote by the set of all nonzero polynomials over with degree less than , i.e.,
We note that . Further, we write for . The idea is now to search for the polynomials componentbycomponent. To do so, we need to define for arbitrary . This is done in the following way. Let and . Then
(4) 
The CBC construction proceeds as follows.
Algorithm 1.
Let be as above.

Choose an irreducible polynomial with .

Set .

For , find by minimizing as a function of .
In Subsection 4.3, we show that one can also use the fast CBC algorithm of [22, 23] to find good generating vectors.
Next we show that the generating vector found by Algorithm 1 satisfies the following bound.
Theorem 2.
Let be a prime and be irreducible with . Let be a concave and unbounded monotonic increasing function. Suppose that is constructed using Algorithm 1. Then we have
The proof of this result is presented in Subsection 4.1.
The function , , has been often used to obtain these types of error bounds in the literature. In this case, one may apply socalled Jensen’s inequality
(5) 
for any sequence of nonnegative real numbers . The inequality (5), however, also holds for any concave function [6, Section 2.3]. In our case, the function is not a good choice because it does not give us the worstcase error bound with a superpolynomial convergence. Instead we use which maps to for . Such a map can be designed as follows. For , let for . Then
(6) 
For ,
(7) 
Note that we set for any and , and that the function is concave and unbounded monotonic increasing on . As above we need a slight modification for the case since the function is concave over the interval but not over the interval . Using this function and under the same condition on the weights with Theorem 1, we have the following corollary of Theorem 2.
Corollary 1.
Assume that satisfies for . Let be a prime and be irreducible with . Suppose that is constructed using Algorithm 1. Then there exist constants both independent of such that we have
for any . Moreover, by setting , the worstcase error satisfies the bound
where is a constant independent of .
The proof of this result is presented in Subsection 4.2.
This result means that we can construct a QMC rule which achieves a dimensionindependent superpolynomial convergence of the worstcase error in as (2) with . This is a bit weaker than Theorem 1 (shown by Suzuki in [27]), since we do not have an error bound for the endpoint . Under an additional assumption, however, it is even possible to include the case in Corollary 1, see Remark 2. The most important advantage of our approach is that a good QMC rule can be explicitly constructed by using a CBC algorithm.
3 The worstcase error in
To analyze the worstcase error of interlaced polynomial lattice rules, we introduce a digit interlacing composition for nonnegative integers. Let be an interlacing factor, and let whose adic expansions are given by , which is actually a finite expansion. Then the digit interlacing function is defined as
It is obvious to show that is bijective. We also define such a function for dimensional vectors by applying to every consecutive components, that is,
The following lemma relates an interlaced polynomial lattice point set to numerical integration of Walsh functions, see [12, Lemma 1] for the proof.
Lemma 2.
Let be an interlaced polynomial lattice point set of order with modulus , , and generating vector . For , we have
We introduce another function for a real number and an integer . For , we denote its adic expansion by such that and . The function is defined as
and . For vectors of real numbers and , we define
With a slight abuse of notation, for and , we write , where the vector denotes the dimensional vector whose th component is for and otherwise. From Definition 4 and the definition of , we have
(8) 
Now the worstcase error for numerical integration in using an interlaced polynomial lattice rule is given as follows.
Proposition 2.
Let be an interlaced polynomial lattice point set of order with modulus , , and generating vector . For a sequence of the weights , we have
where is the dual polynomial lattice of , and is a sequence of real numbers given as in Proposition 1.
Proof.
We write . Let us consider a function . Given the Walsh series expansion of and the fact that is bijective, the signed integration error becomes