The cavity method for counting spanning subgraphs subject to local constraints

The cavity method for counting spanning subgraphs subject to local constraints

Justin Salez111INRIA-École Normale Supérieure - France. Email: justin.salez@ens.fr
Abstract

Using the theory of negative association for measures and the notion of random weak limits of sparse graphs, we establish the validity of the cavity method for counting spanning subgraphs subject to local constraints in asymptotically tree-like graphs. Specifically, the corresponding free entropy density is shown to converge along any sequence of graphs whose random weak limit is a tree, and the limit is directly expressed in terms of the unique solution to a limiting cavity equation. On a Galton-Watson tree, the latter simplifies into a recursive distributional equation which can be solved explicitely. As an illustration, we provide an explicit-limit formula for the matching number of an Erdős-Rényi random graph with fixed average degree and diverging size, for any .

1 Introduction

The general framework we consider is that of a finite graph , in which spanning subgraphs are weighted according to their local aspect around each vertex as follows :

(1)

Here, a spanning subgraph is identified with its egde-set , and each is a given non-negative function over the subsets of . We call the global measure induced by the local measures . Of particular interest in combinatorial optimization is the number

(2)

which is the maximum possible size of a spanning subgraph satisfying the local contraint at every node . More generally, counting the weighted number of spanning subgraphs of each given size in , i.e. determining the generating polynomial

(3)

is a fundamental task, of which many combinatorial problems are special instances. Intimately related to this is the study of a random spanning subgraph sampled from the Gibbs-Boltzmann law :

(4)

where is a variable parameter called the activity. In particular, the expected size of is called the energy and is connected to via the elementary identity

(5)

Our concern is the behavior of these quantities in the infinite volume limit : , .

Originating from spin glass theory [26], the cavity method is a powerful nonrigorous technique for evaluating such asymptotics on graphs that are locally tree-like. Essentially, the heuristic consists in neglecting cycles in order to obtain an approximate local fixed point equation for the marginals of the Gibbs-Boltzmann law. Despite its remarkable practical efficiency and the mathematical confirmation of its analytical predictions for various important models [31, 1, 20, 28, 7, 16, 13], this ansatz is still far from being completely understood, and the exact conditions for its validity remain unknown. More precisely, two crucial questions arise in presence of cycles :

  1. convergence : is there a unique, globally attractive fixed point to the cavity equation ?

  2. correctness : if yes, does it have any relation to the Gibbs-Boltzmann marginals ?

In this paper, we exhibit a general condition under which the cavity method is valid for counting spanning subgraphs subject to local constaints. Specifically, we positively answer question 1 for arbitrary finite graphs (Theorem 2), under the only assumption that each local measure enjoys a certain form of negative association which we call the cavity-monotone property, and which simply boils down to ultra-log-concavity in the exchangeable case. Regarding question 2, we use the framework of local weak convergence [8, 4] and the notion of unimodularity [3] to prove asymptotical correctness for any sequence of graphs whose random weak limit is concentrated on trees (Theorem 4). This includes many classical sequences of diluted graphs, such as random regular graphs, Erdős-Rényi random graphs with fixed average degree, or more generally random graphs with a prescribed degree distribution. In all these examples, the limit is a unimodular Galton-Watson (UGW) tree. Thanks to the distributional self-similarity of such a tree, the cavity equation simplifies into a recursive distributional equation which may be solved explicitely. As a motivation, let us first describe the implications of our work in the special case of b-matchings.

A famous combinatorial structure that fits in the above framework is obtained by fixing and taking for all : the induced global measure is then nothing but the counting measure for matchings in , i.e. spanning subgraphs with maximum degree at most . The reader is refered to the monograph [30] for a complete survey on matchings, and to [24] for the important case of matchings (). The associated quantities and are important graph invariants respectively known as the matching number and matching polynomial. Determining is a classical example of a computationally hard problem [32], although efficient approximation algorithms have been designed [6, 5]. The mathematical properties of have been investigated in detail, notably in the case for the purpose of understanding monomer-dimer systems [21, 10]. Interestingly, the geometry of the complex zeros of has been proven to be quite remarkable (see [21] for , [29] for , and [34] for the general case). Regarding , the first results in the infinite volume limit were obtained by Karp and Sipser [23] for the Erdős-Rényi random graph with average degree on vertices :

(6)

where is the smallest root of . The analysis has then been extended to random graphs with a log-concave degree profile [12], and finally to any graph sequence that converges in the local weak sense [13]. Contrastingly, only little is known for : to the best of our knowledge, the limit of is only known to exist in the Erdős-Rényi case [19], and could not be explicitely determined. As a special case of our main result, it will follow that

Theorem 1 (matchings in locally tree-like graphs)

For any sequence of finite graphs satisfying and whose random weak limit is concentrated on trees, the limits

and

exist and depend only on the random weak limit . When is a UGW tree, we have the explicit formula

where are defined in terms of the degree generating function as follows :

Moreover, any where the above minimum is achieved must be a root of .

For example, in the case of Erdős-Rényi random graphs with average degree on vertices, the random weak limit is a.s. the law of a UGW tree with Poisson(c) degree distribution, and hence,

where we have set

Since any where the minimum is achieved must satisfy , we recover exactly (6) in the special case of matchings ().

The paper is organized as follows : in section 2, we recall the basic notions and properties pertaining to measures over subsets, which will be of constant use throughout the paper. In section 3, we define and study the cavity equation associated to a finite network. In section 4, we extend the results to infinite networks that arise as local weak limits of finite networks. Finally, section 5 is devoted to the study of the cavity equation in the limit of infinite activity, and to its explicit resolution in the case of matchings.

2 Preliminaries

In this section, we define the important notions pertaining to (non-negative) measures over the subsets of an arbitrary finite ground set . Later on, these will be specialized to the local measures attached to the vertices of a graph . First, is caracterized by its multivariate generating polynomial

(7)

where and . Since is affine in each , it can be decomposed as

(8)

where and , are the multi-affine polynomials with ground set respectively obtained from by setting the variable to (deletion) and differentiating with respect to (contraction). By definition, the cavity ratio of the pair is then simply the multi-affine rational function

(9)

When positive values are assigned to the variables (a so-called external field), we may consider the probability distribution A quantity of interest is the expected size of when viewed as a function of the external field. We call this the energy :

(10)

From the decomposition (8), it follows immediately that

hence (11)

Note that the supremum of the energy is exactly the rank of : The following properties will be of crucial importance throughout the paper.

Definition 1 (Cavity-monotone measures)

The measure is called

  • Rayleigh if every two distinct ground elements are negatively correlated in :

  • Size-increasing if every ground element positively influences the total size :

  • Cavity-monotone if its satisfies and the two above properties.

Rayleigh measures were introduced in the context of matroid theory [33], but soon found their place in the modern theory of negative dependence for probability measures [27, 22]. Cavity-monotone measures will play a major role in our study, for the following elementary reason.

Lemma 1 (Monotony of energy and cavity ratios)

Proof. Differentiating the corresponding quantities and playing with the definition of easily yields

Remark 1 (Matroids)

Interestingly, the support of a cavity-monotone measure admits a remarkable structure : it follows from [33, Theorem 4.6] that for Rayleigh with , is a matroid:

  • is not empty ;

  • If and , then ;

  • If and , then such that .

The cavity-monotone property admits a particularly simple caracterization in the important case where is exchangeable, i.e. for some non-negative coefficients :

Lemma 2 (The exchangeable case)

An exchangeable measure is cavity-monotone if and only if

  1. is log-concave, i.e. for all , and

  2. the support is an interval containing and .

In particular, so is the measure describing the local constraints of a matching.

Proof of Lemma 2. The result essentially follows from the work of Pemantle [27]. Indeed, Theorem 2.7 therein guarantees that is Rayleigh if and only if the sequence is log-concave and its support is an interval. That the latter must contain is nothing but the last property in the definition of a cavity-monotone measure. That it is not reduced to is imposed by the strict inequality in the size-increasing property. Conversely, let us show that any exchangeable measure with and is indeed size-increasing. Fix an external field . By Lemma 2.9 in [27], the law obtained from by conditionning on the event is stochastically increasing in . By Proposition 1.2 in [27], this implies in particular that for every , the following weak inequality holds :

Note that the condition guarantees that this conditional expectation is well-defined. Since we have not yet used the fact that , the above inequality remains true if one changes the coefficient to . Setting it then back to its initial (positive) value does not affect the left-hand side, but strictly decreases the right-hand side, hence the desired strict inequality.

3 The cavity equation on finite networks

Let be a finite graph at the vertices of which some local measures are specified. We call the resulting object a network. A configuration is an assignment of numbers to every oriented edge . Starting from a configuration , we define a new configuration by

(12)

where denotes the set of all neighbors of . Each may be thought of as a message sent by to along the edge , and as a local rule for propagating messages. For , the fixed point equation

(13)

is called the cavity equation at activity on the network . Its relation to the global measure induced by the is revealed by the following well-known result.

Lemma 3 (Validity on trees)

Assume that is finite and acyclic. Then, for every activity ,

  1. convergence : the cavity equation admits a unique solution , which can be reached from any initial configuration by iterating a number of times equal to the diameter of ;

  2. correctness : for every , the exact marginal law of under the Gibbs-Boltzmann law is given by directly imposing the external field onto the local measure .

The important consequence is that on trees, the energy can be determined using only local operations :

(14)

where the second equality is obtained by applying (11) to each .

Proof of Lemma 3 . When is a leaf, the message defined by equation (12) does not depend at all on the initial configuration . Iterating this argument immediately proves the convergence part, and we now focus on correctness. Let be a finite tree, a vertex, and a neighbour of . We let denote the subtree induced by and all vertices that the edge separates from . Now assume that is equipped with local measures, and let inherit from these local measures, except for which we replace by the trivial local measure with constant value . With these notations, any spanning subgraph can be uniquely decomposed as the disjoint union of a subset and a spanning subgraph on each , with . Thus, writing for the global measure on the network , we have for any ,

Fixing and summing over all possible values for , we obtain

where are normalizing constants that do not depend on . This already proves that the law of can be obtained from the local measure by imposing on each edge the external field

(15)

In turn, this ratio can now be computed by applying the result to the vertex in the network :

which shows that the configuration defined on by (15) satisfies the cavity equation (13).

There are two distinct parts in Lemma 3 : convergence and correctness. As we will now show, the former extends to arbitrary graphs under the only assumption that each local measure is cavity-monotone. Henceforth, such a network will be called a cavity-monotone network.

Theorem 2 (Convergence on finite cavity-monotone networks)

On a finite cavity-monotone network, the cavity equation admits a unique, globally attractive fixed point at any activity .

Proof. Fixing and starting with the minimal configuration , we set inductively

for all . By Lemma 1, the Rayleigh property of the local measures ensures that is coordinate-wise non-increasing on the space of configurations. Therefore, the limiting configuration

(16)

exist, and any fixed point must satisfy Moreover, is clearly continuous with respect to the product topology on configurations, so that and . Thus, the existence of unique globally attractive solution to (13) boils down to the equality

(17)

Now applying (11) to the local measure at a fixed vertex yields

Summing over all vertices , we therefore obtain

This implies (17), since by Lemma 1 each is strictly increasing in every coordinate.

4 The limit of infinite volume

In the previous section, we have established existence and uniqueness of a cavity solution on any finite cavity-monotone network. Our concern now is its asymptotical meaning as the size of the underlying graph tends to infinity. Following the principles of the objective method [4], we will replace the asymptotical analysis of our finite networks by the direct study of their infinite limits.

4.1 Random weak limits

We first briefly recall the framework of local convergence, introduced by Benjamini and Schramm [8] and developped further by Aldous and Steele [4]. Examples of successful uses include [3, 9, 18, 25, 15, 14, 13, 16]. Here, a network will be simply a denumerable graph whose vertices are equipped with local measures . A rooted network is a network together with the specification of a particular vertex , called the root. For , we write if there exists a bijection that preserves

  • the root : ;

  • the adjacency : ;

  • the support of the local measures : , with .

  • the values of the local measures, up to : .

We let denote the set of all locally finite connected rooted networks considered up to the isomorphism relation . In the space , a sequence converges locally to if for every radius and every , there is such that

where denotes the finite rooted network obtained by keeping only the vertices lying at graph-distance at most from . It is not hard to construct a distance which metrizes this notion of convergence and turns into a complete separable metric space. We can thus import the usual machinery of weak convergence of probability measures on Polish spaces (see e.g. [11]).

Uniform rooting is a natural procedure for turning a finite deterministic network into a random element of : one simply chooses uniformly at random a vertex to be the root, and restrains to the connected component of . If is a sequence of finite networks and if the sequence of their laws under uniform rooting admits a weak limit , we call the random weak limit of the sequence . In [3], it was shown that any such limit enjoys a remarkable invariance property known as unimodularity : let denote the space of locally finite connected networks with an ordered pair of distinguished adjacent vertices , taken up to the natural isomorphism relation and endowed with the natural topology. A measure is called unimodular if it satisfies the Mass-Transport Principle : for any Borel function ,

(18)

where we have written for the expectation with respect to . This is a deep and powerful notion, which we will now use to extend the results of section 3 to the infinite setting.

4.2 Main result : validity of the cavity method on unimodular trees

The definition of remains valid for any locally finite network . When the latter is cavity-monotone, the configurations introduced in the proof of Theorem 2 remain perfectly well-defined, and the convergence of the cavity method again boils down to the identity . However, the proof of the latter involves a summation over all edges, which is no longer valid in the infinite setting. Instead, the desired will be derived from unimodularity, and will thus hold for any random weak limit of finite networks. Indeed, applying the Mass-Transport Principle to the function

yields ( is Borel as the pointwise limit of continuous functions). Under the assumption , this expectation is finite, and the size-increasing property of then implies that almost surely, for all . This automatically extends to every oriented edge since under unimodularity, everything shows up at the root (another fruitful application of the Mass-Transport-Principle, see [3, Lemma 2.3]). We state this as a Theorem.

Theorem 3 (Convergence of the cavity method on unimodular networks)

Let be a unimodular probability measure supported by cavity-monotone networks. If , then the cavity equation admits a.-s. a unique, globally attractive solution at any activity .

By analogy with formula (14) in the finite case, the (now well-defined) quantity

(19)

appears as a natural candidate for the limiting energy of any sequence of finite networks whose random weak limit is . Our second result is the validity of this cavity ansatz when is concentrated on trees.

Theorem 4 (Asymptotical correctness of the cavity method)

Let be a sequence of finite cavity-monotone networks admitting a random weak limit which is concentrated on cavity-monotone trees. Assume that the local rank at a uniformly chosen vertex is uniformly integrable as . Then,

(20)

If and all the local measures take values in for a fixed compact , then

(21)
(22)
Remark 2 (Large deviation principle)

Integrating (20) immediately implies that

It will later be checked that is continuous on (Remark 3). Therefore, denoting by a random spanning subgraph with the Gibbs-Boltzmann law , Gärtner-Ellis Theorem [17] guarantees that obeys a large deviation principle with rate and good rate function .

4.3 Proof of the main result

Lemma 4 (Tree approximation)

Let be a finite rooted cavity-monotone network, and let . If is a tree, then for every activity ,

Proof. The proof makes use of a classical ingredient known as the spatial Markov property, which we first briefly recall. Let be a finite network and let be an induced subgraph. We let denote the boundary of , i.e. the set of edges having one end-point in and one in . Any boundary condition can be used to assign local measures to the vertices of , namely . Note that these local measures differ from the original ones only for vertices that are adjacent to the boundary. The resulting network is denoted by . Now, a spanning subgraph is clearly the disjoint union of a spanning subgraph of , a boundary condition and a spanning subgraph in . The product form of immediately yields :

(23)

In other words, conditionally on the boundary , the restrictions of to and are independent with law and , respectively. Applying this to the tree ,

where we have applied Lemma 3 to the tree , writing for the unique solution to the cavity equation at activity thereon. But by monotony of the cavity operator, each must satisfy . Using the size-increasing property of , we see that

and re-injecting this into the above equation finally yields the desired inequalities.

Let us now see how Lemma 4 implies the convergence (20). Let be a sequence of finite cavity-monotone networks admitting a random weak limit which is concentrated on cavity-monotone trees. Denote by the law under uniform rooting of , so that . We will use the short-hand , and for the indicator function that is a tree. Lemma 4 guarantees that for any finite cavity-monotone network and any vertex ,

(24)

As functions of , the left-hand side and right-hand side are continuous on , since they depend only on . Moreover, both are dominated by which is assumed to be uniformly integrable with respect to the sequence . Thus, their expectation under tends to their expectation under as . But is zero on the support of , so we are simply left with

Since the random weak limit is unimodular, Theorem 3 finally implies that both the lower and upper bounds tend to as . Note that the requirement in Theorem 3 is here automatically fullfilled, by the uniform integrability assumption.

It now remains to show (21) and (22). The identity (5) implies that for any activity and any finite network satisfying ,

Now take and let : the compactness assumption guarantees that is bounded uniformly in , so the first term converges to . As per the second one, it tends to because of (20), provided the uniform domination holds in Lebesgue’s dominated convergence Theorem. The latter fact is ensured by the first inequality in Lemma 5 below, combined with the compactness assumption and the fact that . The second inequality in Lemma 5 easily guarantees (22).

Lemma 5 (Uniform controls for the energy)

Let be a finite cavity-monotone network. As a function of the activity , the energy increases from to . Furthermore, the rate of convergence to these two extrema can be precisely controlled :

(25)
(26)

where , with and .

Proof of Lemma 5. That the energy increases with the activity is equivalent by (5) to the convexity of , a direct consequence of Hölder’s inequality. This also implies that for any

Clearly and and , so (26) follows. Regarding (25), we have for any