On Stronger Calculi for QBFs^{†}^{†}thanks: The work was supported by the Austrian Science Foundation (FWF) under grant S11409N23. Partial results have been announced at the QBF Workshop 2014 (http://www.easychair.org/smartprogram/VSL2014/QBFprogram.html).
Abstract
Quantified Boolean formulas (QBFs) generalize propositional formulas by admitting quantifications over propositional variables. QBFs can be viewed as (restricted) formulas of firstorder predicate logic and easy translations of QBFs into firstorder formulas exist. We analyze different translations and show that firstorder resolution combined with such translations can polynomially simulate wellknown deduction concepts for QBFs. Furthermore, we extend QBF calculi by the possibility to instantiate a universal variable by an existential variable of smaller level. Combining such an enhanced calculus with the propositional extension rule results in a calculus with a universal quantifier rule which essentially introduces propositional formulas for universal variables. In this way, one can mimic a very general quantifier rule known from sequent systems.
1 Introduction
Quantified Boolean formulas (QBFs) generalize propositional formulas by admitting quantifications over propositional variables. QBFs can be viewed in two different ways, namely (i) as a generalization of propositional logic and (ii) as a restriction of firstorder predicate logic (where we interpret over a two element domain). A number of calculi are available for QBFs: the ones based on variants of resolution for QBFs [13, 11, 2, 3], the ones based on instantiating universal variables with truth constants combined with propositional resolution and an additional instantiation rule [4], and different sequent systems [7, 14, 10, 9].
In all these calculi (except the latter ones from [7, 14, 9]), the possibility to instantiate a given formula is limited. In purely resolutionbased calculi, formulas (or more precisely universal variables) are never instantiated. In instantiationbased calculi, instantiation is restricted to truth constants. In contrast, sequent systems possess flexible quantifier rules, and (existential) variables as well as (propositional) formulas can be used for instantiation with tremendous speedups in proof complexity. This motivates why we are interested in strengthening instantiation techniques for instantiationbased calculi.
We allow to replace (some) universal variables not only by truth constants but by existential variables left of in the quantifier prefix. This approach mimics the effect of quantifier rules introducing atoms in sequent calculi from [9]. We add a propositional extension principle (known from extended resolution [19]), which enables the introduction of propositional formulas for universal variables via extension variables (or names for the formula). Contrary to [9], where we proposed propositional extensions of the form which can be eliminated if the cut rule is available in the sequent calculus, such an elimination is not possible here for which reason we have to use (classical) extensions.
Contributions.

We show that such combinations can polynomially simulate Qresolution with resolution over existential and universal variables (QUres [11], Theorem 5), Qresolution (Qres [13], Corollary 1) and the instantiationbased calculus IRcalc [4] (Theorem 5, Corollary 2). The latter simulation provides a soundness proof for IRcalc independent from strategy extraction.
Structure. In Sect. 2 we introduce necessary definitions and notations. In Sect. 3 different translations from QBFs to (restrictions of) firstorder logic [17] are reconsidered. In Sect. 4 different calculi based on (variants of) the resolution calculus are described. Here, we introduce our calculi generalized from IRcalc. In Sect. 5 we present our results on polynomial simulations between considered calculi and in Sect. 6 we provide exponential separations. In the last section we conclude and discuss future research possibilities.
2 Preliminaries
We assume familiarity with the syntax and semantics of propositional logic, QBFs and firstorder logic (see, e.g., [15] for an introduction). We recapitulate some notions and notations which are important for the rest of the paper.
We consider a propositional language based on a set of Boolean variables and truth constants (true) and (false), both of which are not in . A variable or a truth constant is called atomic and connectives are from . A literal is a variable or its negation. A clause is a disjunction of literals, but sometimes we consider it as a set of literals. Tautological clauses contain a variable and its negation and the empty clause is denoted by . Propositional formulas are denoted by capital Latin letters like possibly annotated with subscripts, superscripts or primes.
We extend the propositional language by Boolean quantifiers. Universal () and existential () quantification is allowed within a QBF. The superscript is used to distinguish Boolean quantifiers from firstorder quantifiers introduced later. QBFs are denoted by Greek letters. Observe that we allow nonprenex formulas, i.e., quantifiers may occur deeply in a QBF. An example for a nonprenex QBF is , where , , and are variables. Moreover, free variables (like ) are allowed, i.e., there might be occurrences of variables in the formula for which we have no quantification. Formulas without free variables are called closed; otherwise they are called open. The universal (existential) closure of is (), for which we often write () if is the set of all free variables in . A formula in prenex conjunctive normal form (PCNF) has the form , where is the quantifier prefix, and is the (propositional) matrix which is in CNF. Often we write a QBF as ( for all and the elements of are pairwise disjoint). We define the level of a literal , , as the index such that the variable of occurs in . The logical complexity of a formula , , is the number of occurrences of connectives and quantifiers.
We use a firstorder language consisting of (objects) variables, function symbols (FSs), predicate symbols (PSs), together with the truth constants and connectives mentioned above. Quantifiers and bind object variables. Terms and formulas are defined according to the usual formation rules. We identify ary PSs with propositional atoms, and ary FSs with constants. Clauses, tautological clauses and the empty clause are defined as in the propositional case.
Let be the set of firstorder variables and be the set of terms. A substitution is a mapping of type such that only for finitely many variables . We represent by a finite set of the form . The domain of , , is the set . The range of , , is the set . We call a variable substitution if . The empty substitution is denoted by . We often write substitutions postfix, e.g., we use instead of . Algebraically, substitutions define a monoid with being the neutral element under the usual composition of substitutions.
Substitutions are extended to terms and formulas in the usual way, e.g., , , and , where is an place FS, is an place PS, are terms, and are (quantifierfree) formulas and is a binary connective. For substitutions and , is more general than if there is a substitution such that . A substitution is called a permutation if is oneone and a variable substitution. A permutation is called a renaming (substitution) of an expression (i.e., is a term or a quantifierfree formula) if , where is the set of all variables occurring in . For an expression , is a variant of provided is a renaming substitution.
Let be a nonempty set of expressions. A substitution is called a unifier of if . Unifier is called most general unifier (mgu), if for every unifier of , is more general than .
Let and be two proof systems. polynomially simulates (psimulates) if there is a polynomial such that, for every natural number and every formula , the following holds. If there is a proof of in of size , then there is a proof of (or a suitable translation of it) in whose size is less than .
3 Different translations of QBFs to firstorder logic
We introduce different translations of (closed) QBFs to (closed) formulas in (restrictions of) firstorder logic. We start with the basic translation from [17] in Fig. 1. Obviously, the QBF and the firstorder formula enjoy a very similar structure. Especially the variable dependencies expressed by the quantifier prefix are exactly the same.
propQBFIsomorphic Let be a (closed) QBF and let be its (closed) firstorder translation. Then , i.e., and are isomorphic. The proof in the appendix is by induction on the logical complexity of .
The basic translations from Fig. 1 can be extended to generating a skolemized version of . We restrict our attention here to QBFs in PCNF.
Definition 1
Let be a closed QBF in PCNF with matrix and let be its closed firstorder translation. For any existential variable in the quantifier prefix of , let be the sequence of universal variables left of (in exactly the same order in which they occur in the prefix). Let be the Skolem function symbol associated to . We call the skolemized form of and denote it by , where the substitution is as follows.
Traditionally, is denoted as a quantifierfree formula with the assumption that all free variables are (implicitly) universally quantified.
The number of universal variables a Skolem function depends on can be optimized, e.g., by using miniscoping or dependency schemes [17]. As we will see later on, most of our results do not depend on such optimizations.
Proposition 1
Let be a closed QBF in PCNF with matrix and let be its closed firstorder translation. Let be the skolemized form of . Then .
Due to propositions 3 and 1, we can relate each literal of each clause from to its isomorphic counterpart in .
Since we interpret over a twoelement domain, proper Skolem function symbols (i.e., the arity is greater than ) can be eliminated by introducing new predicate symbols. The resulting formula belongs to EPR (Effectively PRopositional logic or more traditionally it belongs to the BernaysSchoenfinkel class).
Definition 2
Let be a closed QBF in PCNF with matrix and let be its closed firstorder translation. Let the skolemized form of . Replace any occurrence of a predicate of the form by where is a proper function symbol and is a nonempty list of universal variables. The formula resulting after all possible replacements is the EPR formula .
We will see later that the firstorder and the EPR translation have different prooftheoretical properties because some resolutions are blocked by different predicate symbols. Proposition 3 is Lemma 1 in [17] (stated without a proof).
propSatEquiv Let be a closed QBF. Then
is satisfiable  iff  is satisfiable. 
A proof can be found in the appendix.
\prfbyaxiomAxiomC  \prftree[r]Res x ∨C_1¬x∨C_2C_1∨C_2  \prftree[r]Fac C∨ℓ∨ℓC∨ℓ  \prftree[r]R D∨mD 
is a nontautological clause from the matrix. If then . Variable is existential (Qres) and existential or universal (QUres), is a literal and is a universal literal. If is existential, then holds.
is a nontautological clause from the matrix , where is a shorthand for if and if .
\prftree[r]Res x^τ ∨C_1¬x^τ∨C_2C_1∨C_2  \prftree[r]Fac C∨ℓ^τ ∨ℓ^τC∨ℓ^τ  \prftree[r]InstC 
is an assignment to universal variables and .
4 Different calculi based on resolution
We introduce different calculi used in this paper. We start with two resolution calculi, Qres and QUres, for QBFs in Fig. 2. Observe that the consequence of each rule is nontautological. We continue with the calculus in Fig. 3, where we use the same presentation as in [4]. is the quantifier prefix and is the quantifierfree matrix in CNF. In the following instantiationbased calculi, inference rules do not work on usual clauses but on annotated clauses based on extended assignments. An extended assignment is a partial mapping from the Boolean variables to . An annotated clause consists of annotated literals of the form , where is an extended assignment to universal variables and with . Composition of extended assignments is defined using completion. The expression is called the completion of by . Then , the completion of by , is defined as follows.
(1) 
The function allows instantiations of clauses; it computes for an extended assignment and an annotated clause . Later on, we will clarify the relation between annotations and substitutions in firstorder logic.
We extend by the possibility to instantiate universal variables by existential ones. Technically the instantiation is performed by a global substitution . If a universal variable is replaced by some existential variable , i.e., , then must hold. We name the calculus equipped with the substitution and depict the rules in Fig. 4.

is a nontautological clause from the matrix .

.

. 
, Res, Fac and Inst are the same as in .
It is immediately apparent that this calculus is sound and complete. We get completeness, when we use the empty substitution as because then, reduces to which is sound and complete [4]. Soundness follows from the validity of QBFs of the form
If the right formula has an refutation, then it is false and therefore the left formula has to be false.

is a nontautological clause from the matrix or from .

, , , Res, Fac and Inst are the same as in .

If then and by construction.
We further enhance by the possibility to use propositional extensions [19, 6]. This extension operation is a generalization of the wellknown structurepreserving translation to (conjunctive) normal form in propositional logic. For presentational reasons, we require to have all extensions at the very beginning of the deduction in order to allow extension variables as replacements for universal variables. Figure 5 shows the inference rules of this calculus , where is a sequence of (clausal representations of) extensions of the form with being of the form or of the form () and is a variable neither occurring in nor in nor in . The variables are existential. The quantification extends the quantifier prefix such that for all variables occurring in and is minimal. Due to the requirements on the extension variables and the placement of , the resulting calculus is sound. Completeness is not an issue here, because we can use an empty .
Remark 1
The usual formalization of clauses and resolvents as sets of literals can be simulated in our formalizations by the factoring rule Fac. We assume in the following that Fac is applied as soon as possible.
We finally introduce firstorder resolution. Let be a clause and let and be two distinct literals in both of which are either negated or unnegated. If there is an mgu of and , then the clause is called a factor of . The clause is called the premise of the factoring operation.
Let and be two clauses and let be a variant of which has no variable in common with . A clause is a resolvent of the parent clauses and if the following conditions hold:

and are literals of opposite sign whose atoms are unifiable by an mgu .

.
Let be a set of clauses. A sequence is called deduction (firstorder resolution deduction) of a clause from if and for all , one of the following conditions hold.

is an input clause from .

is a factor of a for .

is a resolvent of and for .
An refutation of is an deduction of the empty clause from . The size of a deduction is given by , where is the number of character occurrences in . An deduction has tree form if every occurrence of a clause is used at most once as a premise in a factoring operation or as a parent clause in a resolution operation.
Next we introduce the subsumption rule taken from Definition 2.3.4 in [8]. Contrary to the usual use of subsumption in automated deduction as a deletion rule, here we add clauses which are (factors of) instantiations of clauses.
Definition 3
If and are clauses, then subsumes or is subsumed by , if there is a substitution such that . A set of clauses is obtained from a set by subsumption if where is subsumed by a clause of .
Resolution can be extended by the subsumption rule (Definition 3.2.3 in [8]).
Definition 4
By a derivation of a set of clauses from a set of clauses by plus subsumption, we mean a sequence of clause such that the following conditions are fulfilled.

.

For all there is a clause subsuming the clause or there exist clauses such that is subsumed by a resolvent of and .
Factors are not needed in item 2, because the factor of can be generated by subsumption. We need a simplified version of Proposition 3.2.1 from [8].
Proposition 2
polynomially simulates plus subsumption.
The subsumption rule is not necessary but makes proofs of polynomial simulation results much more convenient. It allows instantiated deductions for which eventually the lifting theorem provides a deduction “on the most general level”.
5 Polynomial simulations of calculi
In this section we show that together with a suitable translation (denoted by ) polynomially simulates QUres, Qres and .
thmRfoPsimQU polynomially simulates QUres. The proof is by induction on the number of clauses in the QUres deduction. It can be found in the appendix. It shows that firstorder literals obtained from universal literals in the QBF and eliminated by R are eliminated by resolutions with and without instantiating the firstorder resolvent.
Corollary 1
The following results are immediate consequences of Theorem 5.

polynomially simulates QUres.

as well as polynomially simulates Qres.
We present a soundness proof of independent from strategy extraction by a polynomial simulation of by .
Definition 5
Let and be two substitutions. The composition of and , , is obtained from
by deleting all for which holds.
Lemma 1
Let and be two substitutions as defined in Definition 5, where are universal variables and . Then is the composition .
Proof
Let be the completion of by defined in (1). Since as well as is a subset of the set of universal variables and as well as is a subset of , and therefore for all . Hence, the completion of the two substitutions and is exactly their composition .
In the following, we deal with annotated clauses of the form where any is an existential literal and any is the restriction of assignment to exactly those universal variables for which holds. We denote the sequence of all universal variables with by where we assume the same order as in the quantifier prefix. A firstorder clause corresponding to is constructed as follows
where is the isomorphic counterpart of (cf. the remark after Proposition 1). Using together with mimics the effect of ; the difference is the explicit notation of all universal variables left of and not only the variables in .
thmRfoPsimIRcalc polynomially simulates .
In the proof, we construct by induction on the number of derived clauses in the IRcalc deduction stepwisely a deduction in plus subsumption. We consider the sequence of firstorder clauses obtained from the original clauses as a skeleton for the final proof. Since the clauses in the skeleton do not follow by a single application of an inference rule, we have to provide a short deduction of the clauses.
Proof
We utilize Proposition 2 and allow subsumption in the simulation. The proof is by strong mathematical induction on the number of derived clauses in the IRcalc deduction. Let denote the statement “Given a IRcalc deduction from a QBF and a sequence of firstorder clauses , the clause has a short deduction in plus subsumption from ”.
Base: . is a consequence of the axiom rule using clause from the matrix . Let be the assignment induced by . Then we have a clause from which we can derive by resolution steps using and . The number of these steps is equal to the number of universal variables in .
IH: Suppose hold for some .
Step: We have to show . Consider and .
Case 1: is derived by the axiom rule. Then proceed like in the base case.
Case 2: is a consequence of the rule Inst with premise (for some with ) and assignment . By IH and Remark 1, we have a short plus subsumption deduction of . is of the form . By Lemma 1, for any universal variable with . Therefore is of the form . Now and can be derived by subsumption.
Case 3: is a consequence of the rule Fac with premise (for some with ). By IH, we have a short plus subsumption deduction of , where is of the form . We generate a factor of simply by omitting one of the duplicates.
Case 4: is a consequence of the resolution rule with parent clauses (for some with ). By IH, we have two clauses
We use of the form as a renaming of the variables in such that does not share any variable with . The resolvent is where is the mgu of the form . We add by subsumption, where maps all remaining variables to their counterpart.
Corollary 2
polynomially simulates .
When we inspect the translation of (axiom) clauses, we observe that a universal variable is translated to an atom of the form . With the subsumption rule we can instantiate the clause by a substitution of the form for a term . This observation was the trigger to introduce the stronger calculus , where universal variables cannot be replaced only by or but also by any existential variable with .
6 Exponential separation of resolution calculi
We constructed in [9] a family of short closed QBFs in PCNF for which any Qres refutation of is superpolynomial. We recapitulate the construction here. The formula is
(2) 
is the pigeon hole formula for holes and pigeons in conjunctive normal form and denoted over the variables . Variable is intended to denote that pigeon is sitting in hole . is
The number of clauses in is and is . The formula is obtained from the pigeon hole formula in disjunctive normal form, , by a structurepreserving polaritysensitive translation to clause form [16]. The formula is simply the negation of where negation has been pushed in front of atoms and doublenegation elimination has been applied.
We use new variables of the form for disjuncts in . For the first disjuncts of the form with , we use variables . For the second part, for any and the disjuncts, we use
(3) 
The set of these variables for is denoted by . Due to this construction, we can speak about the conjunction corresponding to the variable .
We construct the conjunctive normal form of as follows. First, we take the clause over all variables in . The formula for the first disjuncts of is of the form
For the remaining disjuncts of , we have the formula
Then is and is . It is easy to check that is valid.
Let us modify the quantifier prefix of . By quantifier shifting rules we get, in an “antiprenexing” step, the equivalent formula . Prenexing yields the equivalent QBF
(4) 
which has only one quantifier alternation instead of two. In [9] we showed that and have short cutfree tree proofs in a sequent system , where weak quantifiers introduce atoms. The following extends Proposition 3 in [9].
The proof is based on the fact that (i) the two conjuncts belong to languages with different alphabets and (ii) that the alphabets cannot be made identical by instantiation of quantifiers in Qres. Therefore we have to refute either or under the given quantifier prefix. Since is true, there is no Qres refutation and we have to turn to . But then, we essentially have to refute with propositional resolution and consequently, by Haken’s famous result [12], any Qres refutation of is superpolynomial in .
Since QUres, LDQres, LDQUres, LDQUres, and Q(D)resolution (Q(D)res) [18] are based on the same quantifierhandling mechanism as Qres, the following corollary is obvious.
Corollary 3
For the situation is not better. Since universal literals are only replaced by , no unification of the two alphabets can happen.
The quantifier prefix is unfortunate if one expects being false. Actually, the initial universal quantifier block prevents any nonempty and consequently, any refutation of reduces to an refutation of .
Proposition 5
Any refutation of from (4) in has size superpolynomial in .
In the following we show that has a short refutation in . We use to denote the Skolem function symbol corresponding to and to denote the Skolem function symbols corresponding to . All the Skolem function symbols have arity . Let denote the formula under the firstorder translation. We have
The refutation of is constructed as follows.

We use together with the first clauses from to derive (for all ). The deduction consists of clauses and applies resolution and factoring. The substitution is , where is a variable renaming from the variant generation in resolution.

We use together with the binary clauses from to derive (for all and with ). The deduction consists of clauses and applies resolution and factoring. Then is . Again is a variable renaming like above.

We use together with the derived instance of to derive by resolution. Since any variable is assigned to a variant of for all and all , all resolution steps are possible. The deduction consists of clauses.
The formula can be refuted in a similar fashion in by replacing variants of the form by Skolem constants .
Proposition 6
Let and be the families of closed QBFs defined above. Then and have short tree refutations in consisting of clauses. Moreover the size of the refutation is .
thmNoPsimRfo The calculi QUres, LDQres, LDQUres, LDQUres, Q(D)res, , and IRMcalc cannot polynomially simulate tree or