Skip to main content Accessibility help
×
Home
Hostname: page-component-56f9d74cfd-dpvgk Total loading time: 2.352 Render date: 2022-06-27T03:06:25.932Z Has data issue: true Feature Flags: { "shouldUseShareProductTool": true, "shouldUseHypothesis": true, "isUnsiloEnabled": true, "useRatesEcommerce": false, "useNewApi": true }

EPSILON THEOREMS IN INTERMEDIATE LOGICS

Published online by Cambridge University Press:  10 January 2022

MATTHIAS BAAZ
Affiliation:
INSTITUTE OF DISCRETE MATHEMATICS AND GEOMETRY VIENNA UNIVERSITY OF TECHNOLOGY WIEDNER HAUPTSTRASSE 8–10 1040VIENNA, AUSTRIAE-mail:baaz@logic.at
RICHARD ZACH
Affiliation:
DEPARTMENT OF PHILOSOPHY UNIVERSITY OF CALGARY 2500 UNIVERSITY DRIVE NORTHWEST CALGARY, ABT2N 1N4, CANADAE-mail:rzach@ucalgary.caURL: https://richardzach.org/
Rights & Permissions[Opens in a new window]

Abstract

Any intermediate propositional logic (i.e., a logic including intuitionistic logic and contained in classical logic) can be extended to a calculus with epsilon- and tau-operators and critical formulas. For classical logic, this results in Hilbert’s $\varepsilon $ -calculus. The first and second $\varepsilon $ -theorems for classical logic establish conservativity of the $\varepsilon $ -calculus over its classical base logic. It is well known that the second $\varepsilon $ -theorem fails for the intuitionistic $\varepsilon $ -calculus, as prenexation is impossible. The paper investigates the effect of adding critical $\varepsilon $ - and $\tau $ -formulas and using the translation of quantifiers into $\varepsilon $ - and $\tau $ -terms to intermediate logics. It is shown that conservativity over the propositional base logic also holds for such intermediate ${\varepsilon \tau }$ -calculi. The “extended” first $\varepsilon $ -theorem holds if the base logic is finite-valued Gödel–Dummett logic, and fails otherwise, but holds for certain provable formulas in infinite-valued Gödel logic. The second $\varepsilon $ -theorem also holds for finite-valued first-order Gödel logics. The methods used to prove the extended first $\varepsilon $ -theorem for infinite-valued Gödel logic suggest applications to theories of arithmetic.

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press on behalf of The Association for Symbolic Logic

1 Introduction

The $\varepsilon $ -calculus was originally introduced by Hilbert as a formalization of classical first-order logic. It is a way to reduce proofs in first-order logic to proofs in propositional logic from so-called critical formulas, where the role of quantifiers is taken over by certain terms. The $\varepsilon $ -calculus was the basis for Hilbert’s approach to proof theory (in particular, consistency proofs). It still is a useful logical formalism with interesting properties and theoretical and practical applications.

The $\varepsilon $ -calculus is formulated by allowing for terms of the form $\varepsilon _{x}\,A(x)$ for any formula $A(x)$ with x free. A formula of the form $A(t) \mathbin {\rightarrow } A(\varepsilon _{x}\,A(x))$ is called a critical formula belonging to $\varepsilon _{x}\,A(x)$ . A proof in the $\varepsilon $ -calculus is a proof in the quantifier-free fragment of classical logic from critical formulas. It is now possible to define the existential quantifier by $\exists x\, A(x) \equiv A(\varepsilon _{x}\,A(x))$ , and—in classical logic—the universal quantifier by $\forall x\, A(x) \equiv A(\varepsilon _{x}\,\lnot A(x))$ . A formula A with quantifiers can thus be translated into a formula $A^{\varepsilon }$ with $\varepsilon $ -terms but without quantifiers. A formula is provable in classical first-order logic $\mathbf {Q}\mathbf {C}$ iff its translation is provable in the $\varepsilon $ -calculus.

Hilbert proved two fundamental results about the $\varepsilon $ -calculus:

  1. 1. The extended first $\varepsilon $ -theorem: If $A(\vec e)$ is derivable in the $\varepsilon $ -calculus ( $\vec e$ a tuple of $\varepsilon $ -terms), then there are tuples of terms $\vec t_{1}, \dots , \vec t_{n}$ such that $A(\vec t_{1}) \lor \cdots \lor A(\vec t_{n})$ is provable in propositional logic.

  2. 2. The second $\varepsilon $ -theorem: If $A^{\varepsilon }$ is the standard $\varepsilon $ -translation of a first-order formula derivable in the $\varepsilon $ -calculus, A is derivable in first-order logic.

The extended first $\varepsilon $ -theorem has two important consequences. The first consequence is what Hilbert simply called the first $\varepsilon $ -theorem: If A is $\varepsilon $ -free and derivable in the $\varepsilon $ -calculus, it is derivable in the quantifier-free fragment of first-order logic, the so-called elementary calculus of free variables (i.e., without critical formulas, indeed, without any use of $\varepsilon $ -terms). This implies that the $\varepsilon $ -calculus is conservative over propositional logic for quantifier-free formulas without identity. The second consequence is Herbrand’s theorem for existential formulas: If $\exists \vec x\, A(\vec x)$ is provable in the $\varepsilon $ -calculus, then some disjunction $A(\vec t_{1}) \lor \cdots \lor A(\vec t_{n})$ is provable in propositional logic alone, i.e., is a tautology.Footnote 1

In contrast to other proof-theoretic methods which also yield the existence of Herbrand disjunctions (such as cut-elimination), the proof based on the extended first $\varepsilon $ -theorem, and consequently the length of the Herbrand disjunction the proof yields, is insensitive to the propositional complexity of the original proof. This is an important advantage of methods based on the $\varepsilon $ -calculus.

Hilbert’s results and proofs make essential use of classical principles, especially the law of excluded middle. The question naturally arises whether the results can also be obtained for weaker logics, and whether the same proof methods can be used, i.e., whether the use of excluded middle can be avoided. In this paper, we investigate $\varepsilon $ -calculi for intermediate logics, i.e., logics between intuitionistic and classical logic, and specifically the question of when the extended first $\varepsilon $ -theorem holds in such logics. Well-known examples of intermediate logics are Jankov’s logic of weak excluded middle and finite- and infinite-valued Gödel–Dummett logics.

We consider only intermediate logics for two reasons. One is that Hilbert’s methods rely essentially on the deduction theorem, and this holds in intermediate logics but not in many other logics. The other is that $\varepsilon $ -calculi for intuitionistic and intermediate logics are of independent interest. What is the effect of adding $\varepsilon $ -operators to intermediate logics? When does the extended first $\varepsilon $ -theorem hold? When is the $\varepsilon $ -calculus for a logic conservative over the propositional base logic? We also, for the most part, discuss only pure logics, i.e., logics without identity.

In intermediate logics it is necessary to introduce a separate $\tau $ -operator which defines the universal quantifier. Whereas $A(\varepsilon _{x}\,A(x))$ translates $\exists x\,A(x)$ , $A(\tau _{x}\,A(x))$ translates $\forall x\,A(x)$ . The corresponding critical formulas are those of the form $A(\tau _{x}\,A(x)) \mathbin {\rightarrow } A(t)$ . Weakening the logic makes the addition of $\tau $ necessary, since the equivalence of $\forall x\, A(x)$ and $A(\varepsilon _{x}\,\lnot A(x))$ relies on the schema of contraposition; the equivalence of $\forall x\, A(x)$ and $A(\tau _{x}\,A(x))$ does not. The system resulting from a propositional intermediate logic $\mathbf {L}$ by adding $\varepsilon $ - and $\tau $ -terms and critical formulas is called the ${\varepsilon \tau }$ -calculus for $\mathbf {L}$ .

It is well known that adding the $\varepsilon $ -operator to intuitionistic logic in a straightforward way results in translations $A^{\varepsilon }$ of intuitionistically invalid formulas A becoming provable. Mints [Reference Mints20, Reference Mints and Petkov21] has investigated different systems based on intuitionistic logic with $\varepsilon $ -operators which are conservative and in which the $\varepsilon $ -theorem holds. He allows the use of $\varepsilon $ -terms only when $\exists x\,A(x)$ has been established; other approaches (e.g., [Reference Shirai25]) use existence predicates to accomplish the same. We investigate the basic ${\varepsilon \tau }$ -calculus without this assumption. In our ${\varepsilon \tau }$ -calculi, ${\varepsilon \tau }$ -terms are treated syntactically as something like Skolem functions rather than semantically as choice operators.

We begin (Section 2) by introducing intermediate logics. In Section 3 we introduce the ${\varepsilon \tau }$ -calculus and consider the differences in formulas provable in a logic vs. those provable in the corresponding ${\varepsilon \tau }$ -calculus. This is essentially the question of whether the addition of $\varepsilon $ - and $\tau $ -terms and critical formulas allows the derivation (or requires the validity) of formulas not provable in the base logic.

We investigate in detail which intuitionistically invalid formulas are provable in ${\varepsilon \tau }$ -calculi for intermediate logics in Section 4. We show that quantifier shift schemas play a special role here. All but three of these are valid in intuitionistic logic. The addition of ${\varepsilon \tau }$ -terms and critical formulas results in the provability of the remaining three. Consequently, no intermediate first-order logic in which one of these three quantifier shifts is unprovable can have the second $\varepsilon $ -theorem (Proposition 4.1). This includes intuitionistic logic itself, logics complete for non-constant domain Kripke frames, and infinite-valued Gödel–Dummett logic.

In Section 5, we show that conservativity for the propositional fragment nevertheless holds for all intermediate logics (Theorem 5.3). This in itself is a surprising result, even though the proof is very easy. It holds for theorems of the logics with or without identity, however not in general for provability from theories.

In Sections 69 we give a complete characterization of the intermediate logics where the extended first $\varepsilon $ -theorem holds. We show (Theorem 6.2) that whenever it holds, the underlying logic must prove (or validate) a sentence ${B_{m}}$ of the form

$$ \begin{align*} (A_{1} \mathbin{\rightarrow} A_{2}) \lor (A_{2} \mathbin{\rightarrow} A_{3}) \lor \cdots \lor (A_{m} \mathbin{\rightarrow} A_{m+1}). \end{align*} $$

No $B_{m}$ is provable in intuitionistic logic, any logic complete for Kripke frames with branching worlds, or in infinite-valued Gödel–Dummett logic. Consequently, the extended first $\varepsilon $ -theorem does not hold for these logics.

Provability of ${B_{m}}$ is also a sufficient condition: We show that the extended first $\varepsilon $ -theorem holds whenever the underlying logic proves at least one ${B_{m}}$ . The argument follows the idea of Hilbert’s proof, but does not make use of excluded middle. In order to establish the result, we provide a more fine-grained analysis of the proof of the extended first $\varepsilon $ -theorem. We first introduce the notion of an elimination set in Section 7: a set of terms which can replace an $\varepsilon $ -term in a proof and render the corresponding critical formulas redundant, using only the resources of the underlying propositional logic. This is the part of Hilbert’s proof that uses excluded middle. We isolate here how excluded middle is used. In Section 8, we show that logics that prove some ${B_{m}}$ also have elimination sets.

If such elimination sets exist, the procedure given by Hilbert and Bernays can be used for a proof of the extended first $\varepsilon $ -theorem (Section 9). This establishes the second half of the characterization, that the extended first ${\varepsilon \tau }$ -theorem holds in logics that prove ${B_{m}}$ , i.e., the finite-valued Gödel logics (Theorem 9.9). By putting emphasis on elimination sets, we can also show that in logics in which the extended first $\varepsilon $ -theorem does not hold in general, it may still hold for formulas of a special form. This allows us to show that the extended first ${\varepsilon \tau }$ -theorem holds for negated formulas in Jankov’s logic of weak excluded middle (Theorem 9.8). We also show that the first ${\varepsilon \tau }$ -theorem (for theorems not containing ${\varepsilon \tau }$ -terms) holds for infinite-valued Gödel–Dummett logic (Theorem 9.10).

The extended first ${\varepsilon \tau }$ -theorem is closely related to Herbrand’s theorem. We discuss this connection, as well as the second ${\varepsilon \tau }$ -theorem, in Section 10. We show that the second ${\varepsilon \tau }$ -theorem holds for any intermediate predicate logic that proves all quantifier shifts and has the extended first ${\varepsilon \tau }$ -theorem (Proposition 10.2), e.g., $\mathbf {Q}\mathbf {LC}_{m} + \mathit {CD}$ , first-order finite-valued Gödel logic.

In the case of some proofs, it is possible to eliminate ${\varepsilon \tau }$ -terms in a simplified way where the cases that require the presence of ${B_{m}}$ do not arise, and the linearity schema $\mathit {Lin}$ ( $(A \mathbin {\rightarrow } B) \lor (B \mathbin {\rightarrow } A)$ ) is enough. Although we cannot give an independent characterization of the proofs or theorems for which this is the case, the simplified procedure will sometimes terminate and produce a Herbrand disjunction. Conversely, if a Herbrand disjunction exists, there is always a proof of the original formula for which the procedure terminates and produces the Herbrand disjunction (Section 11).

This result sheds light on the conditions under which (a version of) Hilbert’s method which uses principles weaker than excluded middle produces a Herbrand disjunction. In fact, a similar method can be used to give a partial $\varepsilon $ -elimination procedure for number theory, where linearity of the natural order of $\mathbb {N}$ plays a similar role as the schema of linearity does in the case of logic (Section 12). We summarize the logics considered and results established in Tables 1 and 2.

Table 1 Intermediate logics considered.

Table 2 Epsilon theorems for intermediate logics.

2 Preliminaries

Intermediate propositional logics have been investigated extensively since the 1950s. Intermediate predicate logics are comparatively less well understood; however, they constitute an active area of research (see [Reference Gabbay, Shehtman and Skvortsov14]). We begin by collecting some preliminary definitions.

All the logics we consider are formulated in the standard language of intuitionistic and classical logic with propositional connectives $\land $ , $\lor $ , $\mathbin {\rightarrow }$ , the constant $\bot $ for absurdity, and the quantifiers $\forall $ and $\exists $ . $\lnot A$ is defined as $A \mathbin {\rightarrow } \bot $ and $\top $ as $\lnot \bot $ . Terms and atomic formulas are defined as usual. We allow $0$ -place predicate symbols, i.e., propositional variables. A formula containing no quantifiers is called quantifier-free and a quantifier-free formula with only $0$ -place predicate symbols is called propositional. For the most part we will consider pure logics, i.e., logics not involving the identity predicate $=$ . We assume that function symbols are available, however.

We use A, B, …, as metavariables for formulas, and t, s, …, as metavariables for terms. We write $A(x)$ to indicate that x occurs free in A. The result of substituting s for all (free) occurrences of x in a term t or a formula A is indicated by $t[s/x]$ or $A[s/x]$ . When it is clear which variable x is intended, we write $A(s)$ for $A[s/x]$ . The result of replacing every occurrence of a term t by a term s in a formula A is indicated by $A[s/t]$ . A substitution instance of a formula A is any formula resulting from A by uniformly replacing any number of atomic formulas $P(t_{1}, \dots , t_{n})$ by formulas $B[t_{1}/x_{1}, \dots , t_{n}/x_{n}]$ (in such a way that free variables of B are not captured by quantifiers in A of course). In particular, a substitution instance of a propositional formula A is any formula resulting from A by uniformly replacing any number of $0$ -place predicate symbols P by formulas B.

Definition 2.1. A propositional logic $\mathbf {L}$ is a set of propositional formulas closed under substitution and modus ponens, i.e., if $\mathbf {L}$ contains $A \mathbin {\rightarrow } B$ and B it also contains B. A predicate logic is a set of formulas closed under substitution, modus ponens, and the quantifier rules

which are subject to the eigenvariable condition: x must not be free in the conclusion.

Logics can be characterized by the formulas derivable from a set of axiom schemas by modus ponens. This is of course equivalent to closing the axioms under substitution and modus ponens. For instance, intuitionistic propositional logic $\mathbf {H}$ is obtained from the propositional axioms given in [Reference Troelstra and van Dalen28, 4.1]. Classical propositional logic $\mathbf {C}$ and predicate logic $\mathbf {Q}\mathbf {C}$ are obtained by adding $\lnot A \lor A$ (or alternatively $\lnot \lnot A \mathbin {\rightarrow } A$ ) as an axiom.

Definition 2.2. An intermediate propositional logic $\mathbf {L}$ is a propositional logic that contains $\mathbf {H}$ and is contained in $\mathbf {C}$ .

The following intermediate propositional logics will play important roles:

  1. 1. $\mathbf {LC}$ , characterized alternatively as the formulas valid on linearly ordered Kripke frames or as infinite-valued Gödel logic,Footnote 2 axiomatized over $\mathbf {H}{}$ using the schema

    (Lin) $$\begin{align} (A \mathbin{\rightarrow} B) \lor (B \mathbin{\rightarrow} A). \end{align}$$
  2. 2. $\mathbf {LC}_{m} = \mathbf {LC} + {B_{m}}$ , characterized as formulas valid on linearly ordered Kripke frames of height $< m$ , or as the Gödel logic on m truth values, also known as $\textbf {S}_{m-1}$ [Reference Hosoi16]. Here, ${B_{m}}$ is:

    (Bm) $$\begin{align} (A_{1} \mathbin{\rightarrow} A_{2}) \lor (A_{2} \mathbin{\rightarrow} A_{3}) \lor \dots \lor (A_{m} \mathbin{\rightarrow} A_{m+1}). \end{align}$$
  3. 3. $\mathbf {KC}$ , the logic of weak excluded middle [Reference Jankov19], axiomatized over $\mathbf {H}{}$ using the schema

    (J) $$\begin{align} \lnot A \lor \lnot\lnot A. \end{align}$$

If $\mathbf {L}$ is an intermediate propositional logic, we can consider the corresponding “elementary calculus,” i.e., the system obtained by replacing propositional variables with atomic formulas of a first-order language, with or without identity. We will be interested in formulas provable in such an elementary calculus from a set of assumptions $\Gamma $ .

Definition 2.3. Suppose $\mathbf {L}$ is an intermediate logic. A proof $\pi $ of A from $\Gamma $ in the corresponding elementary calculus is a sequence $A_{1}$ , …, $A_{n} =A$ of quantifier-free formulas such that each $A_{i}$ is either a substitution instance of a formula in $\mathbf {L}$ , is in $\Gamma $ , or follows from formulas $A_{k}$ and $A_{l}$ ( $k, l < i$ ) by modus ponens, and $A_{n} \equiv A$ . We then write $\Gamma \vdash _{\mathbf {L}}^{\pi } A$ . If such a proof $\pi $ exists we write $\Gamma \vdash _{\mathbf {L}}$ and if $\Gamma $ is empty, simply $\mathbf {L} \vdash A$ . If $\mathbf {L}$ is not mentioned, we mean $\mathbf {H}$ .

Given a propositional logic $\mathbf {L}$ , and possibly a set of additional axiom schemas $\mathit {Ax}$ involving quantifiers, we can generate a predicate logic:

Definition 2.4. If $\mathbf {L}$ is a propositional logic, and $\mathit {Ax}$ is a set of formulas, then $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ is the smallest predicate logic containing $\mathbf {L}$ , the standard quantifier axioms

$$ \begin{align*} \forall x\, A(x) & \mathbin{\rightarrow} A(t) \quad \text{and}\\ A(t) & \mathbin{\rightarrow} \exists x\, A(x), \end{align*} $$

and the formulas in $\mathit {Ax}$ .

If $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ contains intuitionistic predicate logic $\mathbf {Q}\mathbf {H}$ and is contained in classical predicate logic $\mathbf {Q}\mathbf {C}$ , it is called an intermediate predicate logic.

Definition 2.5. Suppose $\mathbf {L}$ is a propositional logic. A proof $\pi $ of A in $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ is a sequence $A_{1}$ , …, $A_{n} = A$ of formulas of predicate logic such that each $A_{i}$ is either a substitution instance of a formula in $\mathbf {L}$ , of a standard quantifier axiom, of a formula in $\mathit {Ax}$ , or follows from previous formulas by modus ponens or a quantifier rule. We then write $ \mathbf {Q}\mathbf {L} + \mathit {Ax} \vdash ^{\pi } A$ . If such a proof $\pi $ exists we write $\mathbf {Q}\mathbf {L} + \mathit {Ax} \vdash A$ .

A is a formula in $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ iff $\mathbf {Q}\mathbf {L} + \mathit {Ax} \vdash A$ . If $\mathbf {L}{}$ itself is characterized by a set of propositional axioms, it is enough to require substitution instances of axioms of $\mathbf {L}{}$ in the above definition. For instance, taking $\mathbf {H}{}$ as above, the proofs in intuitionistic predicate logic $\mathbf {Q}\mathbf {H}$ are just the proofs in the system $H_{2}$ -IQC of [Reference Troelstra and van Dalen28, 4.3].

If $\mathit {Ax}$ is empty, $\mathbf {Q}\mathbf {L} = \mathbf {Q}\mathbf {L} + \mathit {Ax}$ is the weakest pure intermediate predicate logic extending $\mathbf {L}$ . For instance, $\mathbf {Q}\mathbf {LC}$ is the weakest predicate logic obtained from $\mathbf {LC}$ , and is axiomatized by $\mathbf {Q}\mathbf {H} + \mathit {Lin}$ . It is complete for linearly ordered Kripke frames (see [Reference Corsi11, Reference Skvortsov26]).

It is possible to extend the weakest predicate logic of an intermediate propositional logic $\mathbf {L}$ by adding intuitionistically invalid schemas $\mathit {Ax}$ involving quantifiers. Some important examples are the constant domain principle,

(CD) $$\begin{align} \forall x(A(x) \lor B) & \mathbin{\rightarrow} (\forall x\,A(x) \lor B), \end{align}$$

the double negation shift (or Kuroda’s principle),

(K) $$\begin{align} \forall x\,\lnot\lnot A(x) & \mathbin{\rightarrow} \lnot\lnot \forall x\,A(x), \end{align}$$

and the quantifier shifts

(Q∃) $$\begin{align} (B \mathbin{\rightarrow} \exists x\,A(x)) & \mathbin{\rightarrow} \exists x(B \mathbin{\rightarrow} A(x)), \end{align}$$
(Q∀) $$\begin{align} (\forall x\,A(x) \mathbin{\rightarrow} B) & \mathbin{\rightarrow} \exists x(B \mathbin{\rightarrow} A(x)). \end{align}$$

$\mathbf {Q}\mathbf {LC} + \mathit {CD}$ axiomatizes the formulas valid on linearly ordered Kripke frames with constant domains, and also the first-order Gödel logic $\mathbf {G}_{\mathbb {R}}$ of formulas valid on the interval $[0, 1]$ . $\mathbf {Q}\mathbf {H} + K$ characterizes the formulas valid on Kripke frames with the McKinsey property and $\mathbf {Q}\mathbf {LC} + \mathit {CD} + K$ is the logic of linear Kripke frames with maximal element. It is also the first-order Gödel logic $\mathbf {G}_{0}$ of formulas valid on $\{0\} \cup [1/2,1]$ . $\mathbf {Q}\mathbf {LC}_{m}$ is not complete for linear Kripke frames of height $< m$ (contrary to what one might expect). However, $\mathbf {G}_{m} = \mathbf {Q}\mathbf {LC}_{m} + \mathit {CD}$ is complete for linearly ordered Kripke frames of height $<m$ with constant domains. It is also the m-valued first-order Gödel logic $\mathbf {G}_{m}$ .Footnote 3

3 ${\varepsilon \tau }$ -calculi for intermediate logics

Formulas and terms of the ${\varepsilon \tau }$ -calculus are defined by simultaneous induction, allowing that if $A(x)$ is a formula already defined, then $\varepsilon _{x}\,A(x)$ and $\tau _{x}\,A(x)$ are terms. In $\varepsilon _{x}\,A(x)$ and $\tau _{x}\,A(x)$ the variable x is bound. We call terms of the form $\varepsilon _{x}\,A(x)\ \varepsilon $ -terms and those of the form $\tau _{x}\,A(x)\ \tau $ -terms (collectively: ${\varepsilon \tau }$ -terms).

As usual, we consider $\varepsilon $ - and $\tau $ -terms to be identical up to renaming of bound variables, and define substitution of ${\varepsilon \tau }$ -terms into formulas, as in $A(\varepsilon _{x}\,A(x))$ , so that bound variables are tacitly renamed to avoid clashes.

Definition 3.1. A critical formula belonging to $\varepsilon _{x}\,A(x)$ is any formula of the form $A(t) \mathbin {\rightarrow } A(\varepsilon _{x}\,A(x))$ .

A critical formula belonging to $\tau _{x}\,A(x)$ is any formula of the form $A(\tau _{x}\,A(x)) \mathbin {\rightarrow } A(t)$ .

Definition 3.2. Suppose $\mathbf {L}$ is an intermediate propositional logic. An ${\varepsilon \tau }$ -proof $\pi $ of B is a proof in the elementary calculus of $\mathbf {L}$ from critical formulas $\Gamma $ of the form

$$ \begin{align*} A(t) & \mathbin{\rightarrow} A(\varepsilon_{x}\,A(x)),\\ A(\tau_{x}\,A(x)) & \mathbin{\rightarrow} A(t). \end{align*} $$

We write $\mathbf {L}{\varepsilon \tau } \vdash B$ if such a $\pi $ exists, or $\Gamma \vdash _{\mathbf {L}{\varepsilon \tau }}^{\pi } B$ when we want to identify the critical formulas and the proof $\pi $ .

The pure ${\varepsilon \tau }$ -calculus $\mathbf {L}{\varepsilon \tau }$ of $\mathbf {L}$ is the set of quantifier-free formulas that have ${\varepsilon \tau }$ -proofs.

We are interested in the relationships between intermediate predicate logics $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ and the ${\varepsilon \tau }$ -calculus $\mathbf {L}{\varepsilon \tau }$ of their propositional fragment $\mathbf {L}$ . Since the language of $\mathbf {L}{\varepsilon \tau }$ does not contain quantifiers, we must define a translation of predicate formulas that do contain them into the language of the ${\varepsilon \tau }$ -calculus.

Definition 3.3. The ${\varepsilon \tau }$ -translation $A^{\varepsilon \tau }$ of a formula A is defined as follows:

$$ \begin{align*} A^{\varepsilon\tau} & = A \text{ if }A\text{ is atomic}.\\ (A \land B)^{\varepsilon\tau} & = A^{\varepsilon\tau} \land B^{\varepsilon\tau}, & (A \lor B)^{\varepsilon\tau} & = A^{\varepsilon\tau} \lor B^{\varepsilon\tau}, \\ (A \mathbin{\rightarrow} B)^{\varepsilon\tau} & = A^{\varepsilon\tau} \mathbin{\rightarrow} B^{\varepsilon\tau}, & (\lnot A)^{\varepsilon\tau} & = \lnot A^{\varepsilon\tau}, \\ (\exists x\, A(x))^{\varepsilon\tau} & = A^{\varepsilon\tau}(\varepsilon_{x}\,A(x)^{\varepsilon\tau}), & (\forall x\, A(x))^{\varepsilon\tau} & = A^{\varepsilon\tau}(\tau_{x}\,A(x)^{\varepsilon\tau}). \end{align*} $$

Again, substitution of ${\varepsilon \tau }$ -terms for variables must be understood modulo renaming of bound variables so as to avoid clashes. Clearly, if A contains no quantifiers, then $A^{\varepsilon \tau } = A$ .

The point of the classical $\varepsilon $ -calculus is that it can replace quantifiers and quantifier inferences. And indeed, in classical first-order logic, a first-order formula A is provable iff its translation $A^{\varepsilon }$ is provable in the pure $\varepsilon $ -calculus. The “if” direction is the content of the second $\varepsilon $ -theorem, while the “only if” direction follows more simply by translating derivations.

We defined the ${\varepsilon \tau }$ -calculus on the basis of a propositional logic $\mathbf {L}$ . It is also possible to define an “extended” ${\varepsilon \tau }$ -calculus by adding ${\varepsilon \tau }$ -terms and critical formulas to the full first-order language including quantifiers, and then considering proofs in $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ from critical formulas.

Definition 3.4. A proof $\pi $ in $(\mathbf {Q}\mathbf {L}+\mathit {Ax}){\varepsilon \tau }$ of A is a sequence of formulas of the ${\varepsilon \tau }$ -calculus (possibly containing quantifiers) ending in A in which each formula is either an instance of a formula in $\mathbf {L}$ , an instance of a schema in $\mathit {Ax}$ , a standard quantifier axiom, a critical formula, or follows from preceding formulas by modus ponens or a quantifier rule. We write $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau } \vdash A$ if such a $\pi $ exists.

The extended  ${\varepsilon \tau }$ -calculus $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau }$ of $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ is the set of formulas that have proofs in $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau }$ .

If A is quantifier-free and $\mathbf {L}{\varepsilon \tau } \vdash A$ then $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau } \vdash A$ . One may wonder, however, if $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau }$ is stronger than $\mathbf {L}{\varepsilon \tau }$ in the sense that for some formulas A, $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau } \vdash A$ but not $\mathbf {L}{\varepsilon \tau } \vdash A^{\varepsilon \tau }$ . This is not so as long as the ${\varepsilon \tau }$ -translations of the axioms $\mathit {Ax}$ of $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ are provable in $\mathbf {L}{\varepsilon \tau }$ ; then the extended $\varepsilon $ -calculus is conservative over the pure $\varepsilon $ -calculus.

Lemma 3.5. Suppose $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ is an intermediate predicate logic, and for each quantifier axiom $B \in \mathit {Ax}$ , $\mathbf {L}{\varepsilon \tau } \vdash B^{\varepsilon \tau }$ . If $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau } \vdash A$ , then $\mathbf {L}{\varepsilon \tau } \vdash A^{\varepsilon \tau }$ .

Proof. By standard proof transformations we may assume that the proof $\pi $ in $(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau }$ is such that every formula is used as a premise of at most one modus ponens or quantifier inference, and that the eigenvariables of all quantifier inferences are distinct (the proof is regular). The proof then proceeds by induction on the length of $\pi $ .

Any formula B in $\pi $ that is not the conclusion of an inference is either in $\mathbf {L}$ , a critical formula, in $\mathit {Ax}$ , or a standard quantifier axiom. Then $B^{\varepsilon \tau }$ is also either an axiom in $\mathbf {L}$ , a critical formula (as can easily be seen from the definition of the ${\varepsilon \tau }$ -translation), or, if $B \in \mathit {Ax}$ then $\mathbf {L}{\varepsilon \tau } \vdash B^{\varepsilon \tau }$ (by hypothesis). If B is a standard quantifier axiom, its ${\varepsilon \tau }$ -translation is a critical formula:

$$ \begin{align*} [A(t) \mathbin{\rightarrow} \exists x\,A(x)]^{\varepsilon\tau} &= A^{\varepsilon\tau}(t) \mathbin{\rightarrow} A^{\varepsilon\tau}(\varepsilon_{x}\,A^{\varepsilon\tau}(x)),\\ [\forall x\,A(x) \mathbin{\rightarrow} A(t)]^{\varepsilon\tau} &= A^{\varepsilon\tau}(\tau_{x}\,A^{\varepsilon\tau}(x)) \mathbin{\rightarrow} A^{\varepsilon\tau}(t). \end{align*} $$

If $B \mathbin {\rightarrow } A(x)$ is derivable (where the eigenvariable x is not free in B), then so is $B \mathbin {\rightarrow } A(\tau _{x}\,A(x))$ , by substituting $\tau _{x}\,A(x)$ everywhere x appears free in the part of $\pi $ leading to $B \mathbin {\rightarrow } A(x)$ , and renaming bound variables to avoid clashes. Thus, if B is the conclusion of a quantifier inference, $\mathbf {L}{\varepsilon \tau } \vdash B^{\varepsilon \tau }$ . Similarly, if $B(x) \mathbin {\rightarrow } A$ is derivable, so is $B(\varepsilon _{x}\,B(x)) \mathbin {\rightarrow } A$ . (Cf. [Reference Moser and Zach23, Lemma 7].)⊣

As we’ll see in Section 4, the quantifier axioms of the intermediate predicate logics considered in the preceding section satisfy the condition that the ${\varepsilon \tau }$ -translations of their additional quantifier axioms are derivable from critical formulas alone (i.e., already in $\mathbf {L}{\varepsilon \tau }$ ).

Corollary 3.6. If $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ satisfies the conditions of Lemma 3.5, and A contains no quantifiers, then $(\mathbf {Q}\mathbf {L}+\mathit {Ax}){\varepsilon \tau } \vdash A$ only if $\mathbf {L}{\varepsilon \tau } \vdash A$ .

Proof. If A contains no quantifiers, then $A^{\varepsilon \tau } \equiv A$ .⊣

Proposition 3.7. For any intermediate predicate logic $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ ,

$$ \begin{align*} & (\mathbf{Q}\mathbf{L} + \mathit{Ax}){\varepsilon\tau} \vdash \forall x\, A(x) \mathbin{\leftrightarrow} A(\tau_{x}\,A(x)) \text{ and}\\ &(\mathbf{Q}\mathbf{L}+\mathit{Ax}){\varepsilon\tau} \vdash \exists x\, A(x) \mathbin{\leftrightarrow} A(\varepsilon_{x}\,A(x)). \end{align*} $$

Proof. In each case, one direction is an instance of the corresponding quantifier axiom, and the other direction follows from a critical formula by the corresponding quantifier rule. For instance, $A(\varepsilon _{x}\,A(x)) \mathbin {\rightarrow } \exists x\, A(x)$ is a standard quantifier axiom, and from the critical formula $A(x) \mathbin {\rightarrow } A(\varepsilon _{x}\,A(x))$ we get $\exists x\, A(x) \mathbin {\rightarrow } A(\varepsilon _{x}\,A(x))$ by the $\exists $ -rule, since x is not free in $A(\varepsilon _{x}\,A(x))$ .⊣

Proposition 3.8. If $\mathbf {Q}\mathbf {L}+\mathit {Ax}$ satisfies the conditions of Lemma 3.5, then $(\mathbf {Q}\mathbf {L}+\mathit {Ax}){\varepsilon \tau } \vdash A \mathbin {\leftrightarrow } A^{\varepsilon \tau }$ .

Proof. Since $\mathbf {Q}\mathbf {L}+\mathit {Ax}$ includes $\mathbf {Q}\mathbf {H}$ , the substitution rule $B \mathbin {\leftrightarrow } C \vdash D(B) \mathbin {\leftrightarrow } D(C)$ is admissible. The result follows by induction on complexity of A and the previous proposition.⊣

4 Critical formulas and quantifier shifts

We will show later (Theorem 5.3) that any ${\varepsilon \tau }$ -calculus for an intermediate logic $\mathbf {L}$ is conservative over $\mathbf {L}$ . It is well-known that the ${\varepsilon \tau }$ -calculus over intuitionistic logic is not conservative over intuitionistic predicate logic. We’ll show now specifically that for any intermediate logic $\mathbf {L}$ , the ${\varepsilon \tau }$ -translations of all classically valid quantifier shift principles are provable from critical formulas.

Quantifier shift formulas divide into two kinds. On the one hand, we have conditionals the ${\varepsilon \tau }$ -translations of which are critical formulas, and are therefore provable in $\mathbf {L}{\varepsilon \tau }$ (see Table 3). On the other hand, we have formulas provable from critical formulas together with some propositional principles, all of which are intuitionistically valid and hence provable in all intermediate logics. For instance, to obtain the ${\varepsilon \tau }$ -translation of

$$ \begin{align*}(\forall x\, A(x) \lor B) \mathbin{\rightarrow} \forall x(A(x) \lor B), \end{align*} $$

take $A_{1} = A(\tau _{x}\,A(x))$ and $A_{2} = A(\tau _{x}\,(A(x) \lor B))$ . Then $A_{1} \mathbin {\rightarrow } A_{2}$ is a critical formula, viz.,

$$ \begin{align*} A(\tau_{x}\,A(x)) \mathbin{\rightarrow} A(\tau_{x}\,(A(x) \lor B)). \end{align*} $$

Apply modus ponens to it and the principle

$$ \begin{align*} (A_{1} \mathbin{\rightarrow} A_{2}) \mathbin{\rightarrow} ((A_{1} \lor B) \mathbin{\rightarrow} (A_{2} \lor B)). \end{align*} $$

This same pattern works in all cases, and the required critical formulas $A_{1} \mathbin {\rightarrow } A_{2}$ and propositional principles are given in Table 4.

Table 3 Quantifier shift formulas whose ${\varepsilon \tau }$ -translations are critical formulas. In each case, x is not free in B, and the ${\varepsilon \tau }$ -translation of the quantifier shift formula on the left is $C(t_{1}) \mathbin {\rightarrow } C(t_{2})$ .

Table 4 Proofs of ${\varepsilon \tau }$ -translations of quantifier shift formulas. In each case, x is not free in B, $A_{1} \mathbin {\rightarrow } A_{2}$ is a critical formula, and the ${\varepsilon \tau }$ -translation of the formula is given on the left. The propositional principle on the right is provable in intuitionistic logic, and the ${\varepsilon \tau }$ -translation of the quantifier shift formula follows by one application of modus ponens.

The most interesting quantifier shift formulas here are $\mathit {CD}$ , $Q_{\forall }$ , and $Q_{\exists }$ , since they are not intuitionistically valid. By contrast, we have:

Proposition 4.1. If  $\mathbf {L}$ is an intermediate propositional logic, then $:$

  1. 1. $K^{\varepsilon \tau }$ , $\mathit {CD}^{\varepsilon \tau }$ , $(Q_{\exists })^{\varepsilon \tau }$ , and $(Q_{\forall })^{\varepsilon \tau }$ are provable in $\mathbf {L}{\varepsilon \tau }$ .

  2. 2. K, $\mathit {CD}$ , $Q_{\exists }$ , and $Q_{\forall }$ are provable in $(\mathbf {Q}\mathbf {L}+\mathit {Ax}){\varepsilon \tau }$ .

Proof.

  1. (1) They are critical formulas; see Table 3.

  2. (2) Follows from Proposition 3.8.⊣

The second $\varepsilon $ -theorem states that if $A^{\varepsilon }$ is provable in the pure $\varepsilon $ -calculus, then A is provable in classical predicate logic. The second $\varepsilon $ -theorem fails for any intermediate predicate logic $\mathbf {Q}\mathbf {L} +\mathit {Ax}$ , in which $A^{\varepsilon \tau }$ is provable in $\mathbf {L}{\varepsilon \tau }$ but A is not provable in $\mathbf {Q}\mathbf {L} +\mathit {Ax}$ , e.g., when $\mathbf {Q}\mathbf {L} +\mathit {Ax}$ does not prove one of K, $\mathit {CD}$ , $Q_{\exists }$ , or $Q_{\forall }$ .

Note that the only intuitionistically invalid De Morgan rule for quantifiers,

(Q) $$\begin{align} \lnot \forall x\,A(x) \mathbin{\rightarrow} \exists x\,\lnot A(x), \end{align}$$

is a special case of $Q_{\forall }$ , taking $\bot $ for B; $Q^{\varepsilon \tau }$ is a critical formula. The ${\varepsilon \tau }$ -translation of double negation shift $K^{\varepsilon \tau }$ is

$$ \begin{align*}\lnot\lnot A(\tau_{x}\,\lnot\lnot A(x)) \mathbin{\rightarrow} \lnot\lnot A(\tau_{x}\,A(x))\end{align*} $$

and is also a critical formula.

In classical first-order logic, both the addition of $\varepsilon $ -operators and critical formulas and the replacement of quantifiers by $\varepsilon $ -operators are conservative. The previous results show that for extensions of first-order intuitionistic logic, this is not the case: intuitionistically invalid quantified formulas (or their ${\varepsilon \tau }$ -translations) become provable. However, these quantifier shifts are provable in some intermediate logics, e.g., in some Gödel logics.

We might think of ${\varepsilon \tau }$ -terms semantically as terms for objects which serve the role of generics taking on the role of quantifiers, and indeed in classical logic this connection is very close. Because of the validity of

(Wel1) $$\begin{align} & \exists x(\exists y\,A(y) \mathbin{\rightarrow} A(x)) \text{ and} \end{align}$$
(Wel2) $$\begin{align} & \exists x(A(x) \mathbin{\rightarrow} \forall y\,A(y)) \end{align}$$

in classical logic, there always is an object x which behaves as an $\varepsilon $ -term ( $A(x)$ holds iff $\exists x\,A(x)$ holds), and an object x which behaves as a $\tau $ -term (i.e., $A(x)$ holds iff $\forall y\,A(y)$ holds). One might expect then that $\mathit {Wel}_{1}$ and $\mathit {Wel}_{2}$ , when added to $\mathbf {Q}\mathbf {H}$ , have the same effect as adding critical formulas, i.e., that all quantifier shifts become provable. Note that $\mathit {Wel}_{1}$ and $\mathit {Wel}_{2}$ are intuitionistically equivalent to

(Wel'1) $$\begin{align} & \exists x\forall y(A(y) \mathbin{\rightarrow} A(x)), \end{align}$$
(Wel'2) $$\begin{align} & \exists x\forall y(A(x) \mathbin{\rightarrow} A(y)). \end{align}$$

As is easily checked, $\mathbf {Q}\mathbf {H} \vdash \mathit {Wel}_{1} \mathbin {\leftrightarrow } Q_{\exists }$ and $\mathbf {Q}\mathbf {H} \vdash \mathit {Wel}_{2} \mathbin {\leftrightarrow } Q_{\forall }$ , and even $\mathbf {Q}\mathbf {H} \vdash \mathit {Wel}_{2} \mathbin {\rightarrow } \mathit {CD}$ . However, $\mathbf {Q}\mathbf {H} + \mathit {Wel}_{1} \nvdash \mathit {CD}$ .Footnote 4

5 $\mathbf {L}{\varepsilon \tau }$ is conservative over $\mathbf {L}$

The classical $\varepsilon $ -calculus is conservative over propositional logic. Work by Bell [Reference Bell9] and DeVidi [Reference DeVidi12] shows that, however, the addition of critical formulas to intuitionistic logic results in intuitionistically invalid propositional formulas becoming provable in certain simple theories. These results require the presence of identity axioms. One may wonder if these results can be strengthened to the pure logic and the ${\varepsilon \tau }$ -calculus alone. The following proposition shows that this is not the case. The addition of critical formulas to intermediate logics alone does not have any effects on the propositional level.

Definition 5.1. The shadow $A^{s}$ of a formula is defined as follows:

$$ \begin{align*} P(t_{1}, \ldots, t_{n})^{s} & = X_{P},\\ (t_{1} = t_{2})^{s} & = \top,\\ (A \land B)^{s} & = A^{s} \land B^{s}, & (A \lor B)^{s} & = A^{s} \lor B^{s}, \\ (A \mathbin{\rightarrow} B)^{s} & = A^{s} \mathbin{\rightarrow} B^{s}, & (\lnot A)^{s} & = \lnot A^{s}, \\ (\exists x\, A(x))^{s} & = A(x)^{s}, & (\forall x\, A(x))^{s} & = A(x)^{s}, \end{align*} $$

where $X_{P}$ is a propositional variable and $\top $ is any theorem of $\mathbf {L}$ .

The shadow of a proof $\pi = A_{1}$ , …, $A_{n}$ is $A_{1}^{s}$ , …, $A_{n}^{s}$ .

A first-order intermediate logic $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ (over a propositional base logic $\mathbf {L}$ ) is preserved under shadow if $\mathbf {L} \vdash B^{s}$ for all quantifier axioms $B \in \mathit {Ax}$ .

The shadow of a formula is a propositional formula obtained by disregarding all first-order structure. If an intermediate predicate logic $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ is preserved under shadow, the shadows of its theorems are already valid in $\mathbf {L}$ .

Proposition 5.2. Suppose $\mathbf {Q}\mathbf {L}+\mathit {Ax}$ is preserved under shadow. If $A_{1}, \ldots , A_{n} \vdash _{(\mathbf {Q}\mathbf {L} + \mathit {Ax}){\varepsilon \tau }} B$ , then $A_{1}^{s}, \ldots , A_{n}^{s} \vdash _{\mathbf {L}} B^{s}$ . This also holds if identity axioms are present.

Proof. Consider a derivation $\pi $ in $(\mathbf {Q}\mathbf {L}+\mathit {Ax}){\varepsilon \tau }$ of B from $A_{1}$ , …, $A_{n}$ , and a formula C in $\pi $ not a conclusion of an inference, and not among $A_{1}$ , …, $A_{n}$ . If $C \in \mathbf {L}$ , then also $C^{s} \in \mathbf {L}$ . If C is a critical formula, then $C^{s}$ is of the form $A \mathbin {\rightarrow } A$ . If C is a standard quantifier axiom, we have $(\forall x\, A \mathbin {\rightarrow } A(t))^{s} \equiv (A(t) \mathbin {\rightarrow } \exists x\, A(x))^{s} \equiv A(x)^{s} \mathbin {\rightarrow } A(x)^{s}$ , which again is in $\mathbf {L}$ . (Clearly, $A(t)^{s} \equiv A(x)^{s}$ .)

If C is the conclusion of modus ponens from premises A and $A \mathbin {\rightarrow } C$ , then $C^{s}$ follows from $A^{s}$ and $(A \mathbin {\rightarrow } C)^{s}$ by modus ponens. If C is the conclusion of a quantifier rule, the shadows of premise and conclusion are identical, e.g.,

$$ \begin{align*} (B \mathbin{\rightarrow} A(x))^{s} \equiv B^{s} \mathbin{\rightarrow} A(x)^{s} \equiv (B \mathbin{\rightarrow} \forall x\, A(x))^{s}. \end{align*} $$

Thus we have shown that $\pi ^{s}$ is a derivation of $B^{s}$ from $A_{1}^{s}$ , …, $A_{n}^{s}$ in $\mathbf {L}$ .

This still holds if identity is present, as the shadows of identity axioms are: $(t = t)^{s} \equiv \top $ and

$$ \begin{align*} (t_{1} = t_{2} \mathbin{\rightarrow} (A(t_{1}) \mathbin{\rightarrow} A(t_{2})))^{s} \equiv \top \mathbin{\rightarrow} (A(t_{1})^{s} \mathbin{\rightarrow} A(t_{1})^{s}), \end{align*} $$

since $A(t_{1})^{s} \equiv A(t_{2})^{s}$ . Both are provable in $\mathbf {H}{}$ and thus in $\mathbf {L}$ .⊣

All intermediate predicate logics mentioned above are preserved under shadow. They are axiomatized by various quantifier shift principles. As we have seen in the preceding section, the ${\varepsilon \tau }$ -translations of all such quantifier shift principles become provable in the corresponding ${\varepsilon \tau }$ -calculus. However, the shadow of such a quantifier shift principle is a formula of the form $B \mathbin {\rightarrow } B$ . As a consequence, we have the following conservativity result for all intermediate ${\varepsilon \tau }$ -calculi:

Theorem 5.3. If $\mathbf {Q}\mathbf {L} + \mathit {Ax}$ is preserved under shadow, then $(\mathbf {Q}\mathbf {L}+\mathit {Ax}){\varepsilon \tau }$ is conservative over $\mathbf {L}$ for propositional formulas. In particular, no new propositional formulas become provable in $\mathbf {L}{\varepsilon \tau }$ by the addition of critical formulas to any intermediate logic $\mathbf {L}$ , including intuitionistic logic itself.

Bell [Reference Bell9] claimed that in the extended intuitionistic $\varepsilon $ -calculus for $\mathbf {Q}\mathbf {H}$ with identity, we have $D \vdash _{\mathbf {Q}\mathbf {H}{\varepsilon \tau }} M$ , where M is

(M) $$\begin{align} \lnot(B \land C) \mathbin{\rightarrow} (\lnot B \lor \lnot C) \end{align}$$

and D is $\forall x(x = a \lor \lnot x =a)$ . M is an intuitionistically invalid direction of De Morgan’s laws. Since the shadow $D^{s}$ of D is the intuitionistically valid formula $\top \lor \lnot \top $ , this seems to contradict Lemma 5.2. The proof starts by asserting that

$$ \begin{align*} \forall x\,[(x = a \land B) \lor (x \neq a \land C)] \mathbin{\rightarrow} (B \land C) \end{align*} $$

is provable in $\mathbf {Q}\mathbf {H}$ with identity. This is false, however, as the formula is not true in any one-element model when B is true and C is false. Theorem 7 of [Reference DeVidi12] fails for the same reason. The results are correct with the additional assumption $a \neq b$ .Footnote 5

6 The extended first ${\varepsilon \tau }$ -theorem fails unless $\mathbf {L} \vdash B_{m}$

In classical first-order logic, the main result about the $\varepsilon $ -calculus is the extended first $\varepsilon $ -theorem. It states that if $A(e_{1}, \dots , e_{n})$ , where the $e_{i}$ are $\varepsilon $ -terms, is provable in the pure $\varepsilon $ -calculus, then there are $\varepsilon $ -free terms $t_{i}^{j}$ such that

$$ \begin{align*} A(t_{i}^{1}, \dots, t_{n}^{1}) \lor \dots \lor A(t_{i}^{k}, \dots, t_{n}^{k}) \end{align*} $$

is provable in classical propositional logic alone. Such $\varepsilon $ -terms $e_{i}$ appear as the result of translating $\exists x_{1}\dots \exists x_{n}\,A(x_{1}, \dots , x_{n})$ into the $\varepsilon $ -calculus.

In the context of intermediate logics, we may formulate the statement as follows:

Definition 6.1. An intermediate logic $\mathbf {L}$ has the extended first  ${\varepsilon \tau }$ -theorem, if, whenever $\mathbf {L}{\varepsilon \tau } \vdash A(e_{1}, \dots , e_{n})$ for some $\varepsilon $ - or $\tau $ -terms $e_{1}$ , …, $e_{n}$ , then there are ${\varepsilon \tau }$ -free terms $t_{i}^{j}$ such that

$$ \begin{align*} \mathbf{L} \vdash A(t_{i}^{1}, \dots, t_{n}^{1}) \lor \dots \lor A(t_{i}^{k}, \dots, t_{n}^{k}). \end{align*} $$

We obtain a first negative result: if $\mathbf {L}{\varepsilon \tau }$ has the extended first epsilon theorem, then an instance of ${B_{m}}$ , i.e.,

$$ \begin{align*} (A_{1} \mathbin{\rightarrow} A_{2}) \lor \cdots \lor (A_{m} \mathbin{\rightarrow} A_{m+1}) \end{align*} $$

for some $m \ge 2$ is provable already in the propositional fragment $\mathbf {L}$ .Footnote 6 This rules out an extended first ${\varepsilon \tau }$ -theorem for, e.g., ${\varepsilon \tau }$ -calculi for intuitionistic logic and infinite-valued Gödel–Dummett logic.

Theorem 6.2. Suppose $\mathbf {L}{\varepsilon \tau }$ has the extended first ${\varepsilon \tau }$ -theorem. Then $\mathbf {L} \vdash {B_{m}}$ for some $m \ge 2$ .

Proof. Consider $A(z) \equiv (P(f(z)) \mathbin {\rightarrow } P(z))$ and $\exists z\, A(z)$ , i.e., $\exists z(P(f(z)) \mathbin {\rightarrow } P(z))$ . Let $e \equiv \varepsilon _{z}\,(P(f(z)) \mathbin {\rightarrow } P(z))$ . The ${\varepsilon \tau }$ -translation of $\exists z\, A(z)$ is $P(f(e)) \mathbin {\rightarrow } P(e)$ , i.e.,

$$ \begin{align*} V \equiv P(f(\varepsilon_{z}\,(P(f(z)) \mathbin{\rightarrow} P(z)))) \mathbin{\rightarrow} P(\varepsilon_{z}\,(P(f(z)) \mathbin{\rightarrow} P(z))). \end{align*} $$

Let $U \equiv A(\varepsilon _{x}\,P(x)) \equiv P(f(\varepsilon _{x}\,P(x))) \mathbin {\rightarrow } P(\varepsilon _{x}\,P(x))$ . Note that U is of the form $P(t) \mathbin {\rightarrow } P(\varepsilon _{x}\,P(x))$ , so it is a critical formula. Also note that $U \mathbin {\rightarrow } V$ is of the form $A(t) \mathbin {\rightarrow } A(e)$ , and so $U \mathbin {\rightarrow } V$ is also a critical formula.

Since $\mathbf {L} \vdash (U \mathbin {\rightarrow } V) \mathbin {\rightarrow } (U \mathbin {\rightarrow } V)$ , and $U \mathbin {\rightarrow } V$ and U are critical formulas, $\mathbf {L}{\varepsilon \tau } \vdash V$ . By assumption, $\mathbf {L}{\varepsilon \tau }$ has the extended first ${\varepsilon \tau }$ -theorem, so $\mathbf {L}$ proves a disjunction of the form

$$ \begin{align*} (P(f(t_{1})) \mathbin{\rightarrow} P(t_{1})) \lor \dots \lor (P(f(t_{k})) \mathbin{\rightarrow} P(t_{k})) \end{align*} $$

for some terms $t_{1}$ , …, $t_{k}$ . (This is a Herbrand disjunction of $\exists z\, A(z)$ .) Each term $t_{i}$ is of the form $f^{i}(s)$ for some $i \ge 0$ and a term s which does not start with f. By rearranging the disjuncts to group disjuncts with the same innermost term s together (using commutativity of $\lor $ ) and by adding additional disjuncts as needed (using weakening), from this we obtain a formula

$$ \begin{align*} (P(f^{j_{1}+1}(s_{1})) \mathbin{\rightarrow} P(f^{j_{1}}(s_{1}))) \lor {} & \dots \lor (P(f(s_{1})) \mathbin{\rightarrow} P(s_{1})) \lor \\ & \vdots \\ (P(f^{j_{l}+1}(s_{l})) \mathbin{\rightarrow} P(f^{j_{l}}(s_{l}))) \lor {} & \dots \lor (P(f(s_{l})) \mathbin{\rightarrow} P(s_{l})). \end{align*} $$

Let j be the largest among $j_{1}$ , …, $j_{l}$ . By uniformly replacing $P(f^{i}(s_{j}))$ by $A_{j+2-i}$ in the proof of the last formula and contracting identical disjuncts, we obtain a proof in $\mathbf {L}$ of $(A_{1} \mathbin {\rightarrow } A_{2}) \lor \dots \lor (A_{j+1} \mathbin {\rightarrow } A_{j+2})$ . This is ${B_{m}}$ for $m=j+1$ , and since $j\ge 1$ , $m\ge 2$ .⊣

A formula of the form ${B_{m}}$ is provable in $\mathbf {L}$ iff $\mathbf {L}$ is a finite-valued Gödel logic $\mathbf {LC}_{n}$ (Proposition 6.5). By contrast, no ${B_{m}}$ is provable in intuitionistic logic $\mathbf {H}$ , Jankov logic $\mathbf {KC}$ , or infinite-valued Gödel logic $\mathbf {LC}$ (Proposition 6.3).

Proposition 6.3.

  1. 1. $\mathbf {LC}_{m} \vdash {B_{m}}$ .

  2. 2. $\mathbf {L} \not \vdash {B_{n}}$ for $\mathbf {L}$ any of $\mathbf {LC}_{m}$ with $m>n$ , $\mathbf {LC}$ , $\mathbf {KC}$ , or $\mathbf {H}$ .

Proof. (1) Follows by definition, since $\mathbf {LC}_{m} = \mathbf {LC} + {B_{m}}$ .

(2) Let $v(A_{i}) = 1/i$ if $i < n$ and $v(A_{n+1}) = 0$ . This is a valuation in a truth value set with m elements if $m> n$ (i.e., a valuation in the Gödel semantics for $\mathbf {LC}_{m}$ ). It is also a valuation in the infinite truth value set $[0,1]$ of $\mathbf {LC}$ . For all $i \le n$ , $v(A_{i})> v(A_{i+1})$ and hence $v({B_{n}}) < 1$ . So ${B_{n}}$ is not a tautology of $\mathbf {LC}_{m}$ or $\mathbf {LC}$ . Since $\mathbf {H} \subsetneq \mathbf {KC} \subsetneq \mathbf {LC}$ , the result also follows for $\mathbf {KC}{}$ and $\mathbf {H}$ .⊣

Proposition 6.4. $\mathbf {H} + {B_{m}} \vdash \mathit {Lin}$ .

Proof. Simultaneously substitute A for $A_{i}$ if i is odd, and B for $A_{i}$ if i is even in ${B_{m}}$ . The result is one of

$$ \begin{align*} & (A \mathbin{\rightarrow} B) \lor (B \mathbin{\rightarrow} A) \lor \dots \lor (A \mathbin{\rightarrow} B) \text{ and}\\ & (A \mathbin{\rightarrow} B) \lor (B \mathbin{\rightarrow} A) \lor \dots \lor (B \mathbin{\rightarrow} A). \end{align*} $$

Both are equivalent in $\mathbf {H}{}$ to $(A \mathbin {\rightarrow } B) \lor (B \mathbin {\rightarrow } A)$ .⊣

Proposition 6.5. If $\mathbf {L} \vdash {B_{n}}$ , then $\mathbf {L} = \mathbf {LC}_{m}$ for some m.

Proof. Hosoi [Reference Hosoi16] showed that the n-valued Gödel logic is axiomatized by $\mathbf {H} + R_{n-1}$ , where $R_{n}$ is

$$ \begin{align*} A_{1} \lor (A_{1} \mathbin{\rightarrow} A_{2}) \lor \dots \lor (A_{n-1} \mathbin{\rightarrow} A_{n}) \lor \lnot A_{n}. \end{align*} $$

By simultaneously substituting $\top $ for $A_{1}$ , $\bot $ for $A_{n+1}$ , and $A_{i-1}$ for $A_{i}$ ( $i = 2$ , …, n) in ${B_{n}}$ , we obtain

$$ \begin{align*} (\top \mathbin{\rightarrow} A_{1}) \lor (A_{1} \mathbin{\rightarrow} A_{2}) \lor \dots \lor (A_{n-2} \mathbin{\rightarrow} A_{n-1}) \lor (A_{n-1} \mathbin{\rightarrow} \bot), \end{align*} $$

which is equivalent to $R_{n-1}$ in $\mathbf {H}$ . Hence, since $\mathbf {L} \vdash {B_{n}}$ , $\mathbf {LC}_{n} \subseteq \mathbf {L}$ .

Furthermore, Hosoi [Reference Hosoi17, Lemma 4.1] showed that if $\mathbf {L} \vdash \mathit {Lin}$ then $\mathbf {L} = \mathbf {LC}_{m}$ for some m or $L = \mathbf {LC}$ . Since $\mathbf {L} \vdash {B_{n}}$ , $\mathbf {L} \vdash \mathit {Lin}$ by Proposition 6.4. The result follows as $\mathbf {LC} \nvdash {B_{n}}$ and so $\mathbf {L} \neq \mathbf {LC}$ .Footnote 7

Corollary 6.6. No intermediate logic except $\mathbf {LC}_{m}$ has the extended first ${\varepsilon \tau }$ -theorem. In particular, intuitionistic logic $\mathbf {H}$ , Jankov logic $\mathbf {KC}$ , and infinite-valued Gödel logic $\mathbf {LC}{}$ do not have the extended first ${\varepsilon \tau }$ -theorem.

We have restricted $\mathbf {L}$ here to be an intermediate propositional logic. However, it bears remarking that Theorem 6.2 does not require that $\mathbf {L}$ contains $\mathbf {H}$ . An inspection of the proof shows that all that is required is that $\mathbf {L} \vdash A \mathbin {\rightarrow } A$ , and in $\mathbf {L}$ , $\lor $ is provably commutative, associative, and idempotent, and has weakening ( $\mathbf {L} \vdash A \mathbin {\rightarrow } (A \lor B)$ ). Thus, Corollary 6.6 applies to any ${\varepsilon \tau }$ -calculus based on a logic which has these properties (such as, say, Łukasiewicz logic).

The extended first $\varepsilon $ -theorem in classical logic shows that if an existential formula $\exists x\,A(x)$ is provable, so is a disjunction of instances $\bigvee _{i} A(t_{i})$ . Clearly this is equivalent to: if $\forall x\,A(x) \mathbin {\rightarrow } B$ is provable so is $\bigwedge A(t_{i}) \mathbin {\rightarrow } B$ . Without the interdefinability of $\forall $ and $\exists $ , the question arises whether the alternative form of the $\varepsilon $ -theorem might hold in an intermediate ${\varepsilon \tau }$ -calculus even if the standard form does not. We’ll show that the versions are, in fact, equivalent even in intermediate logics.

Proposition 6.7. The following are equivalent $:$

  1. 1. If $\mathbf {L}{\varepsilon \tau } \vdash A(e)$ then $\mathbf {L} \vdash \bigvee _{i} A(t)$ .

  2. 2. If $B(e^{\prime }) \vdash _{\mathbf {L}{\varepsilon \tau }} C$ then $\bigwedge _{j} B(s_{j}) \vdash _{\mathbf {L}} C$ for $C\ {\varepsilon \tau }$ -free.

  3. 3. If $B(e^{\prime }) \vdash _{\mathbf {L}{\varepsilon \tau }} C(e)$ then $\bigwedge _{j} B(s_{j}) \vdash _{\mathbf {L}} \bigvee _{i} C(t_{i})$ .

Proof. (1) implies (3): Suppose

$$ \begin{align*} B(e^{\prime}) \vdash_{\mathbf{L}{\varepsilon\tau}} C(e). \end{align*} $$

By the deduction theorem,

$$ \begin{align*} \vdash_{\mathbf{L}{\varepsilon\tau}} B(e^{\prime}) \mathbin{\rightarrow} C(e). \end{align*} $$

By (1) we have terms ${s_{i}, t_{i}}$ so that

$$ \begin{align*} \vdash \bigvee_{i} (B(s_{i}) \mathbin{\rightarrow} C(t_{i})). \end{align*} $$

By intuitionistic logic,

$$ \begin{align*} & \vdash \bigwedge_{i} B(s_{i}) \mathbin{\rightarrow} \bigvee_{i} C(t_{i}) \text{ and so}\\ \bigwedge_{i} B(s_{i}) & \vdash \bigvee_{i} C(t_{i}), \end{align*} $$

by the deduction theorem.

(3) clearly implies (1) and (2).

(2) implies (1): Let X be a propositional variable. $A \vdash _{\mathbf {H}} (A \mathbin {\rightarrow } X) \mathbin {\rightarrow } X$ . So if $\mathbf {L}{\varepsilon \tau } \vdash A(e)$ then by the deduction theorem,

$$ \begin{align*} A(e) \mathbin{\rightarrow} X & \vdash_{\mathbf{L}{\varepsilon\tau}} X \text{ and by (2),}\\ \bigwedge_{i} (A(s_{i}) \mathbin{\rightarrow} X) & \vdash_{\mathbf{L}} X. \end{align*} $$

Now substitute $\bigvee _{i} A(s_{i})$ for X:

$$ \begin{align*} \bigwedge_{i} (A(s_{i}) \mathbin{\rightarrow} \bigvee_{i} A(s_{i})) \vdash_{\mathbf{L}} \bigvee_{i} A(s_{i}). \end{align*} $$

The formula on the left is provable intuitionistically.⊣

7 Elimination sets and excluded middle

The basic idea of Hilbert’s proof of the extended first $\varepsilon $ -theorem is this: Suppose we have a proof of $E \equiv D(e)$ from critical formulas $\Gamma , \Lambda (e)$ , where e is a critical $\varepsilon $ -term and $\Lambda (e)$ is a set of critical formulas belonging to e. Now we find terms $t_{1}$ , …, $t_{k}$ such that replacing e by $t_{i}$ allows us to remove the critical formulas $\Lambda (e)$ , while at the same time replacing the end-formula $D(e)$ by $\bigvee _{i=1}^{k} D(t_{i})$ and the remaining critical formulas by $\Gamma [t_{1}/e]$ , …, $\Gamma [t_{k}/e]$ . We repeat this procedure in such a way that eventually all critical formulas are removed and we are left with a disjunction of instances of E, as required by the first $\varepsilon $ -theorem. The difficulty of making this work lies in three challenges. The first is to find a suitable way of selecting $\varepsilon $ -terms e and corresponding critical formulas $\Lambda (e)$ so that the $\Lambda (e)$ can be removed. The second is to ensure that in passing from $\Gamma $ to $\Gamma [t_{i}/e]$ we again obtain critical formulas.Footnote 8 The third challenge is to guarantee that the process eventually terminates with no critical formulas remaining.

In this section we address the first challenge by considering the condition that suffices to overcome it: the existence of complete e-elimination sets (defined below) for every ${\varepsilon \tau }$ -term e. We then show why this condition is satisfied in classical logic, so we can clarify the role of excluded middle in the proof for the classical case, as well as how the proof for classical logic and those for intermediate logics given later correspond to one another. We will discuss the condition for intermediate logics in Section 8 and the remaining challenges in Section 9.

Definition 7.1. Suppose $\Gamma \vdash _{\mathbf {L}{\varepsilon \tau }}^{\pi } D$ with critical formulas $\Gamma $ , and e is an $\varepsilon $ -term $\varepsilon _{x}\,A(x)$ ( $\tau $ -term $\tau _{x}\,A(x)$ ). If $C\equiv A(t) \mathbin {\rightarrow } A(\varepsilon _{x}\,A(x)) \in \Gamma $ ( $C \equiv A(\tau _{x}\,A(x)) \mathbin {\rightarrow } A(t) \in \Gamma $ ) we say e is the critical  ${\varepsilon \tau }$ -term of C, that e belongs to C, and that e is a critical  ${\varepsilon \tau }$ -term of $\pi $ .

Definition 7.2. Suppose $\Gamma , \Lambda (e), \Lambda ^{\prime }(e) \vdash _{\mathbf {L}{\varepsilon \tau }}^{\pi } D(e)$ where $\Lambda (e) \cup \Lambda (e)^{\prime }$ are all critical formulas belonging to e. A set of terms $s_{1}$ , …, $s_{k}$ is an e-elimination set for $\pi $ and $\Lambda (e)$ if

$$ \begin{align*} \Gamma[s_{1}/e], \dots, \Gamma[s_{k}/e], \Lambda^{\prime}(e) & \vdash_{\mathbf{L}} D(s_{1}) \lor \dots \lor D(s_{k}). \end{align*} $$

If $\Lambda (e)$ is the set of all critical formulas belonging to e (i.e., $\Lambda ^{\prime }(e) = \emptyset $ ) then an e-elimination set for $\Lambda (e)$ is called a complete e-elimination set.

Here, $\Gamma [s_{i}/e]$ means the result of replacing, in each formula in $\Gamma $ , every occurrence of e by $s_{i}$ . If $T = \{s_{1}, \dots , s_{k}\}$ we write $\Gamma [T]$ for $\Gamma [s_{1}]$ , …, $\Gamma [s_{k}]$ . Note that we do not require in the definition of e-elimination sets that the formulas in $\Gamma [s_{i}/e]$ are actually critical formulas.

Lemma 7.3. If $C \equiv A(t) \mathbin {\rightarrow } A(e)$ or $C \equiv A(e) \mathbin {\rightarrow } A(t)$ is a critical formula with critical ${\varepsilon \tau }$ -term e, then $C[s/e]$ is $A(t[s/e]) \mathbin {\rightarrow } A(s)$ or $A(s) \mathbin {\rightarrow } A(t[s/e])$ , respectively.

Proof. Since e is the critical $\varepsilon $ -term of C, $e \equiv \varepsilon _{x}\,A(x)$ or $e \equiv \tau _{x}\,A(x)$ . Hence, e cannot occur in $A(x)$ , since otherwise it would be a proper subterm of itself.⊣

Lemma 7.4. If $\Gamma \vdash _{\mathbf {L}} D$ then $\Gamma [t/e] \vdash _{\mathbf {L}} D[t/e]$ .

Proof. Any proof of D from $\Gamma $ using modus ponens and axioms of $\mathbf {L}$ remains correct if terms in it are uniformly replaced by other terms.⊣

Lemma 7.5. In any intermediate logic $\mathbf {L}:$

  1. 1. If $\Gamma , A \vdash C$ and $\Gamma ^{\prime }, B \vdash _{\mathbf {L}} D$ then $\Gamma , \Gamma ^{\prime }, A \lor B \vdash C \lor D$ .

  2. 2. If $\Gamma , A \vdash _{\mathbf {L}} C$ and $B \vdash A$ , then $\Gamma , B \vdash C$ .

We are now in a position to apply the preceding lemmas and the concept of elimination sets to the case of classical logic. This elucidates how the first challenge is solved in the proof of the extended first ${\varepsilon \tau }$ -theorem for classical logic where $\tau $ -terms and corresponding critical formulas may also be present. (For Hilbert’s original proof for the $\varepsilon $ -calculus without $\tau $ -terms, see [Reference Hilbert and Bernays15] or [Reference Moser and Zach23].)

Proposition 7.6. In $\mathbf {C}{\varepsilon \tau }$ , every critical formula $C(e)$ has an e-elimination set.

Proof. Suppose first that e is an $\varepsilon $ -term; then $C(e)$ is $A(s) \mathbin {\rightarrow } A(e)$ . Let $\Lambda ^{\prime }(e)$ be the critical formulas belonging to e other than $C(e)$ , and $\Gamma $ the remaining critical formulas for which e is not critical. So we have:

$$ \begin{align*} \Gamma, \Lambda^{\prime}(e), A(s) \mathbin{\rightarrow} A(e) & \vdash_{\mathbf{C}} D(e). \hspace{2pc} \end{align*} $$

On the one hand, by replacing e everywhere by s we get

$$ \begin{align*} \Gamma[s/e], \Lambda^{\prime}(s), A(s[s/e]) \mathbin{\rightarrow} A(s) & \vdash_{\mathbf{C}} D(s), \hspace{5.4pc} \end{align*} $$

and by Lemma 7.5(2), since A(s) $\vdash A(t[s/e]) \mathbin {\rightarrow } A(s) \in \Lambda ^{\prime }(s)$ and $A(s) \vdash C(s)$ ,

$$ \begin{align*} \hspace{1.6pc} \Gamma[s/e], A(s) & \vdash_{\mathbf{C}} D(s). \end{align*} $$

On the other hand, since $\lnot A(s) \vdash A(s) \mathbin {\rightarrow } A(e)$ ,

$$ \begin{align*} \Gamma, \lnot A(s) & \vdash_{\mathbf{C}} D(e) \text{ and so,}\\ \Gamma, \Gamma[s/e], A(s) \lor \lnot A(s)& \vdash_{\mathbf{C}} D(e) \lor D(s), \end{align*} $$

by Lemma 7.5(1). Since $\mathbf {C} \vdash A(s) \lor \lnot A(s)$ we have

$$ \begin{align*} \hspace{5.2pc} \Gamma, \Gamma[s/e] & \vdash_{\mathbf{C}} D(e) \lor D(s). \end{align*} $$

Thus, $\{e, s\}$ is an e-elimination set for the critical formula $C(e)$ .

Similarly, if e is a ${\tau }$ -term and $C(e)$ is $A(e) \mathbin {\rightarrow } A(s)$ we get

$$ \begin{align*} \Gamma[s/e], \Lambda^{\prime}(s), C(s) & \vdash_{\mathbf{C}} D(s), \hspace{1pc} \end{align*} $$

and by Lemma 7.5(2), since $\lnot A(s) \vdash C^{\prime } \in \Lambda ^{\prime }(e)$ and $\lnot A(s) \vdash C(s)$ ,

$$ \begin{align*} \hspace{1pc} \Gamma[s/e], \lnot A(s) & \vdash_{\mathbf{C}} D(s). \end{align*} $$

On the other hand, $A(s) \vdash C(e)$ , so

$$ \begin{align*} \Gamma, A(s) & \vdash_{\mathbf{C}} D(e) \text{ and}\\ \Gamma, \Gamma[s/e], \lnot A(s) \lor A(s)& \vdash_{\mathbf{C}} D(e) \lor D(s), \end{align*} $$

by Lemma 7.5(1).⊣

More generally, the set of all critical formulas belonging to e has a (complete) e-elimination set in $\mathbf {C}$ :

Proposition 7.7. In $\mathbf {C}{\varepsilon \tau }$ , every critical ${\varepsilon \tau }$ -term has a complete e-elimination set.

Proof. Let $C_{1} \equiv A(s_{1}) \mathbin {\rightarrow } A(e)$ , …, $C_{k} \equiv A(s_{k}) \mathbin {\rightarrow } A(e)$ be the critical formulas belonging to e if e is an $\varepsilon $ -term. Since

$$ \begin{align*} \Gamma, C_{1}(e), \dots, C_{k}(e) & \vdash_{\mathbf{C}} D(e) \text{, also}\\ \Gamma[s_{i}/e], C_{1}(s_{i}), \dots, C_{k}(s_{i}) & \vdash_{\mathbf{C}} D(s_{i}) \end{align*} $$

(writing ${C_{j}(s_{i})}$ for ${C_{j}[s_{i}/e]}$ ). Since ${A(s_{i}) \vdash A(s_{j}(s_{i})) \mathbin {\rightarrow } A(s_{i}) \equiv C_{j}(s_{i})}$ ,

$$ \begin{align*} \hspace{2.8pc} \Gamma[s_{i}/e], A(s_{i}) & \vdash_{\mathbf{C}} D(s_{i}), \end{align*} $$

by Lemma 7.5(2). By applying Lemma 7.5(1),

$$ \begin{align*} \Gamma[s_{1}/e], \dots, \Gamma[s_{k}/e], A(s_{1}) \lor \dots \lor A(s_{k}) & \vdash_{\mathbf{C}} D(s_{1}) \lor \dots \lor D(s_{k}). \hspace{1.4pc} \end{align*} $$

On the other hand, since ${\lnot A(s_{i}) \vdash A(s_{i}) \mathbin {\rightarrow } A(e)}$ , we get $\lnot A(s_{1}) \land \cdots \land \lnot A(s_{k}) \vdash C_{j}(e)$ for each ${j = 1, \dots , \textit {k}}$ , so we also have, from the first line by Lemma 7.5(2),

$$ \begin{align*} \Gamma, \lnot A(s_{1}) \land \cdots \land \lnot A(s_{k}) & \vdash_{\mathbf{C}} D(e). \hspace{1.2pc} \end{align*} $$

Since ${\bigvee _{i} A(s_{i}) \lor (\bigwedge _{i} \lnot A(s_{i}))}$ is an instance of excluded middle, we have

$$ \begin{align*} \hspace{7.4pc} \Gamma, \Gamma[s_{1}/e], \dots, \Gamma[s_{k}/e] & \vdash_{\mathbf{C}} D(e) \lor D(s_{1}) \lor \dots \lor D(s_{k}). \end{align*} $$

If e is a $\tau $ -term, then the critical formulas are of the form $C_{j}(e) \equiv A(e) \mathbin {\rightarrow } A(s_{j}(e))$ and consequently $C_{j}(s_{i})$ is $A(s_{i}) \mathbin {\rightarrow } A(s_{j}(s_{i}))$ . Each is implied by $\lnot A(s_{i})$ , so we have

$$ \begin{align*} \Gamma[s_{1}/e], \dots, \Gamma[s_{k}, e], \lnot A(s_{1}) \lor \dots \lor \lnot A(s_{k}) & \vdash_{\mathbf{C}} D(s_{1}) \lor \dots \lor D(s_{k}). \hspace{2.4pc} \end{align*} $$

On the other hand, $A(s_{1}) \land \dots \land A(s_{k}) \vdash A(e) \mathbin {\rightarrow } A(s_{j})$ , so

$$ \begin{align*} \Gamma, A(s_{1}) \land \dots \land A(s_{k}) & \vdash_{\mathbf{C}} D(e), \end{align*} $$

and consequently

$$ \begin{align*} \hspace{7.2pc} \Gamma, \Gamma[s_{1}/e], \dots, \Gamma[s_{k}/e] & \vdash_{\mathbf{C}} D(e) \lor D(s_{1}) \lor \dots \lor D(s_{k}), \end{align*} $$

since $\bigwedge _{i} A(s_{i}) \lor \bigvee _{i} \lnot A(s_{i})$ is a tautology.

In each case, $\{e, s_{1}, \dots , s_{k}\}$ is an e-elimination set.⊣

Remark 7.8. Of course, the fact that in $\mathbf {C}{}$ we have complete e-elimination sets can also be obtained by applying Proposition 7.6 k-many times. Applying it to $C_{1}(e)$ results in $T_{1} = \{e, s_{1}(e)\}$ , applying it to $C_{2}(e)$ in $T_{2} = \{e, s_{1}(e), s_{2}(e), s_{1}(s_{2}(e))\}$ , to $C_{3}$ in $T_{3} = \{e, s_{1}(e), s_{2}(e), s_{1}(s_{2}(e)), s_{3}(e), s_{1}(s_{3}(e)), s_{2}(s_{3}(e)), s_{1}(s_{2}(s_{3}(e)))\}$ , etc., i.e., the resulting disjunction has $2^{k+1}$ disjuncts, whereas the disjunction resulting from Proposition 7.7 only has $k+1$ disjuncts. However, see Remark 9.7.

We know that intermediate logics other than $\mathbf {LC}_{m}$ do not have the extended first ${\varepsilon \tau }$ -theorem and so not every ${\varepsilon \tau }$ -term will have complete e-elimination sets. However, if the starting formula E is of a special form, they sometimes do. In the proof for the classical case above, this required excluded middle. But it need not. For instance, if E is negated, then weak excluded middle ( $\lnot A \lor \lnot \lnot A$ ) is enough.

Proposition 7.9. If $\mathbf {L} \vdash J$ , then every ${\varepsilon \tau }$ -term in an $\mathbf {L}{\varepsilon \tau }$ -proof of $\bigvee _{j} \lnot D_{j}$ has a complete e-elimination set.

Proof. Let $C_{1} \equiv A(s_{1}) \mathbin {\rightarrow } A(e)$ , …, $C_{k} \equiv A(s_{k}) \mathbin {\rightarrow } A(e)$ be the critical formulas belonging to e if e is an $\varepsilon $ -term. As before, we have

$$ \begin{align*} \Gamma[s_{i}/e], A(s_{i}) & \vdash_{\mathbf{L}} \bigvee_{j} \lnot D_{j}(s_{i}). \end{align*} $$

In ${\mathbf {KC}, B \mathbin {\rightarrow } (\lnot C_{1} \lor \lnot C_{2}) \vdash \lnot \lnot B \mathbin {\rightarrow } (\lnot C_{1} \lor \lnot C_{2})}$ , so

$$ \begin{align*} \Gamma[s_{i}/e], \lnot\lnot A(s_{i}) & \vdash_{\mathbf{L}} \bigvee_{j} \lnot D_{j}(s_{i}). \end{align*} $$

We obtain

$$ \begin{align*} \Gamma[s_{1}/e], \dots, \Gamma[s_{k}/e], &\\ \lnot \lnot A(s_{1}) \lor \dots \lor \lnot\lnot A(s_{k}) & \vdash_{\mathbf{L}} \bigvee_{j} \lnot D_{j}(s_{1}) \lor \dots \lor \bigvee_{j} \lnot D_{j}(s_{k}). \end{align*} $$

Again as before, we have

$$ \begin{align*} \Gamma, \lnot A(s_{1}) \land \cdots \land \lnot A(s_{k}) & \vdash_{\mathbf{L}} \bigvee_{j} \lnot D_{j}(e), \end{align*} $$

and together

$$ \begin{align*} \Gamma[s_{1}/e], \dots, \Gamma[s_{k}/e], \Gamma, &\\ (\lnot A(s_{1}) \land \cdots \land \lnot A(s_{k})) \lor {} &\\ \lnot \lnot A(s_{1}) \lor \dots \lor \lnot\lnot A(s_{k}) & \vdash_{\mathbf{L}} \bigvee_{j} \lnot D_{j}(e) \lor \bigvee_{j} \lnot D_{j}(s_{1}) \lor \dots \lor \bigvee_{j} \lnot D_{j}(s_{k}). \end{align*} $$

Since

$$ \begin{align*} (\lnot A(s_{1}) \land \cdots \land \lnot A(s_{k})) \lor {} \lnot \lnot A(s_{1}) \lor \dots \lor \lnot\lnot A(s_{k}) \end{align*} $$

is provable from weak excluded middle, the claim is proved.⊣

8 Elimination sets using ${B_{m}}$ and $\mathit {Lin}$

We’ve showed in Theorem 6.2 that the provability of ${B_{m}}$ for some $m \ge 2$ is a necessary condition for an intermediate logic to have the extended first ${\varepsilon \tau }$ -theorem. In this section, we show that it is also sufficient: if $\mathbf {L} \vdash B_{m}$ for some m, then every critical ${\varepsilon \tau }$ -term has a complete e-elimination set.

We use the notion of e-elimination sets developed in Section 7: the existence of an e-elimination set for a set of critical formulas $\Lambda $ guarantees that these critical formulas can be removed from a proof, while the end-formula is replaced by a disjunction of instances of the original end-formula. The proofs of the existence of e-elimination sets proceed by replacing an ${\varepsilon \tau }$ -term e in such a way that the disjunction of formulas $\Lambda ^{\prime }$ resulting from such replacements become (provable from) tautologies in the underlying propositional logic. In Proposition 7.6, we showed how to do this for a single critical formula $A(s) \mathbin {\rightarrow } A(e)$ in classical logic: We replace e first by s resulting in $A(s[s/e]) \mathbin {\rightarrow } A(s) \vdash D(s)$ , and then by itself (i.e., no replacement), resulting in $A(s) \mathbin {\rightarrow } A(e) \vdash D(e)$ . This gives

$$ \begin{align*} (A(s[s/e]) \mathbin{\rightarrow} A(s)) \lor (A(s) \mathbin{\rightarrow} A(e)) \vdash D(s) \lor D(e), \end{align*} $$

but the formula on the left is a classical tautology. When eliminating multiple critical formulas at the same time (e.g., all critical formulas belonging to a single ${\varepsilon \tau }$ -term), the resulting tautologies are more complicated. In the original proof, they are all equivalent to excluded middle, and so the proofs do not apply to intermediate propositional logics. Below, we show how this can nevertheless be done as long as the underlying propositional logic contains ${B_{m}}$ . We have to distinguish two kinds of critical formulas:

Definition 8.1. A critical formula $A(s) \mathbin {\rightarrow } A(e)$ (or $A(e) \mathbin {\rightarrow } A(s)$ if e is a $\tau $ -term) is called predicative if e does not occur in s, and impredicative otherwise.Footnote 9

If all critical formulas are predicative, $\mathbf {L} \vdash \mathit {Lin}$ suffices (Lemma 8.5). If $\mathbf {L} \vdash {B_{m}}$ for some m, we can eliminate the impredicative critical formulas for some