# Theoretical Equivalence in Physics

###### Abstract

I review the philosophical literature on the question of when two physical theories are equivalent. This includes a discussion of empirical equivalence, which is often taken to be necessary, and sometimes taken to be sufficient, for theoretical equivalence; and “interpretational” equivalence, which is the idea that two theories are equivalent just in case they have the same interpretation. It also includes a discussion of several formal notions of equivalence that have been considered in the recent philosophical literature, including (generalized) definitional equivalence and categorical equivalence. The article concludes with a brief discussion of the relationship between equivalence and duality.

Department of Logic and Philosophy of Science

University of California, Irvine

Keywords: Theoretical equivalence, empirical equivalence, categorical equivalence, definitional equivalence, Morita equivalence, duality, interpretational equivalence, physics

## 1 Introduction

Many of the things we wish to say about the world can be expressed in multiple ways. We might use different choices of words, for instance, or speak in different languages. A physical theory may also be expressed in many ways. For instance, Newton initially published his magnum opus, the Philosophiae Naturalis Principia Mathematica, in Latin in 1687; it was later translated into English in 1729 by Andrew Motte and into French in 1749 by Émilie du Châtelet. It is hardly tenable to say that each of these translations presents a new physical theory, as opposed to a re-expression of a single theory.

Other cases are more difficult. Often physicists re-express theoretical claims by changing more than the (natural) language in which they are expressed: they also change the mathematical structures used in formulating the theory. Such changes in mathematical structure are often accompanied by the introduction of new physical principles, new patterns of inference, and new ways of representing physical systems—even in cases where the new and old formulations of the theory do not differ in their empirical predictions.

For instance, in 1788, the French mathematician Joseph-Louis Lagrange introduced a new formulation of mechanics. There is a certain sense in which Newtonian and Lagrangian mechanics are empirically equivalent, at least for the systems of interest to Newton and Lagrange. But whereas on Newton’s theory one describes the motions of bodies by specifying the forces acting on those bodies and then using Newton’s second law, $F = ma$, to derive the resulting accelerations, on Lagrange’s theory one specifies a quantity now known as the “Lagrangian”, which is a function on possible configurations of a physical system related to the energy associated with that system. From the Lagrangian one can then derive equations governing the motion of bodies, which turn out to agree, in standard cases, with those you would find from the Newtonian approach.
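As a minimal illustration of this agreement, consider a single particle of mass $m$ moving in one dimension under a potential $V(x)$, with force $F = -V'(x)$. The symbols $m$, $x$, $V$, and $L$ are chosen here for illustration, and the case is deliberately the simplest one available. Applying the Euler-Lagrange equation to the Lagrangian recovers Newton's second law:

```latex
\begin{align*}
L(x,\dot{x}) &= \tfrac{1}{2}\, m\, \dot{x}^{2} - V(x) \\
\frac{d}{dt}\frac{\partial L}{\partial \dot{x}} - \frac{\partial L}{\partial x} &= 0 \\
m\ddot{x} + V'(x) &= 0
\quad\Longleftrightarrow\quad
m\ddot{x} = -V'(x) = F
\end{align*}
```

In this simple case the two formulations yield the same equation of motion, even though one begins from forces and the other from a scalar function built from kinetic and potential energy.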

Did Lagrange introduce a new theory? Or did he merely re-express, perhaps with some elaboration, the theoretical content of Newtonian physics? In other words, are Newtonian and Lagrangian mechanics theoretically equivalent? Lagrange’s methods certainly allow one to solve more problems, or at least solve some problems more easily, than did Newton’s. In this sense, Lagrange certainly made a novel contribution to classical mechanics. But whether one should say he proposed a new physical theory depends on what one takes the content of a physical theory to be (in general) and what one takes each of these particular theories to assert about the world. Many philosophers of physics have argued that the answers to these questions depend, at least in part, on formal or mathematical relationships between the mathematical structures used to represent physical situations in these theories. Furthermore, these considerations are deeply related, since the formal relationships one takes to capture a suitable notion of theoretical equivalence often reflect a position on theoretical content and interpretation (Barrett, 2017b, a).

The question of whether two given theories are equivalent has often arisen in practice. Heinrich Hertz, for instance, in The Principles of Mechanics Presented in a New Form, presents two “images” of mechanics that appear to agree in all of their empirical consequences, but which Hertz takes to disagree in the conceptions of the physical world that they suggest (Hertz, 1899 [1894]); faced with the choice between these images, Hertz suggests a number of extra-theoretical considerations that one might use to distinguish between them. Similarly, John von Neumann, in Mathematical Foundations of Quantum Mechanics, provides an argument that the wave mechanics developed by Schrödinger should be taken to be equivalent to the matrix mechanics developed by Heisenberg, Born, Jordan, and others (Von Neumann, 1955 [1932], though see also Dirac 1930 and Muller 1997a; 1997b). More recently, physicists working in high energy theory and string theory have introduced and studied a large number of dualities, which are relationships between pairs of theories that appear to differ wildly, but which, by dint of their formal properties, are often taken to provide equivalent descriptions of the same states of affairs (Rickles, 2011; Polchinski, 2017; De Haro and Butterfield, 2017).

This article will describe different approaches one might take to the question of when two physical theories are equivalent. This will include a discussion of empirical equivalence, which is often taken to be necessary, and sometimes taken to be sufficient, for theoretical equivalence; and what I will call “interpretational” equivalence, which is the idea that two theories are equivalent just in case they have the same interpretation. It will also include a discussion of several formal notions of equivalence that have been considered in the recent philosophical literature, including (generalized) definitional equivalence and categorical equivalence. The article will conclude with a brief discussion of the relationship between equivalence and duality.

## 2 Empirical Equivalence

Suppose that two physical theories both have the resources to describe a given state of affairs, but are such that they differ in the predictions they make for various measurements one might perform. For instance, both Newtonian gravitational theory and general relativity may be used to describe planetary orbits in our solar system. They agree, to a high degree of accuracy, concerning the orbits of planets far from the sun, including the Earth. But the orbits they predict do not agree exactly, and in the case of Mercury, the disagreement is sufficiently significant to be measured using terrestrial telescopes. (Indeed, the actual orbit of Mercury substantially agrees with that predicted by general relativity, but disagrees with that predicted by Newtonian gravitation.)

In such a case, the two theories in question disagree concerning their empirical predictions: they can be discriminated from one another via observation or experiment. There is surely no sense in which theories that differ in this way could be said to provide equivalent descriptions of the world. After all, they appear to provide detectably different descriptions of planetary orbits (among many other situations). (Of course, there may be situations in which the differences between the theories’ predictions in realizable experiments are sufficiently small as to be undetectable, but that is not the point.) In other words, the theories are empirically inequivalent. Empirical equivalence is often taken as a minimal necessary condition for theoretical equivalence.

Some philosophers of science, mostly working in the first two thirds of the twentieth century, have argued that empirical equivalence is also a sufficient condition for theoretical equivalence (e.g., Reichenbach, 1938; Grünbaum, 1962; Salmon, 1966). In other words, these philosophers held that what it means for two theories to be equivalent is precisely that they may be applied in the same cases, and when so applied, they always yield the same predictions (or other claims about empirically verifiable matters).

There are several reasons why one might endorse this view. One is a commitment to some variety of operationalism (Bridgman, 1927), which, roughly speaking, is the view that a theory is nothing but a tool for making predictions of a certain sort; philosophers subscribing to operationalism might take it that if two theories agree on their predictions, then they agree on everything that the theory asserts about the world. Another reason to accept this view would be commitment to positivism, which, again roughly, is the view that the only meaningful assertions that one can make are those that are verifiable, either through empirical measurement or mathematical proof. On this view, one might think that the only meaningful assertions that a theory makes are those that are amenable to empirical testing; and so if two theories agree on all of their testable claims, then they agree on all of their meaningful claims.

One does not have to accept a strict verificationist criterion of meaning to endorse a view similar to this last one. There is a tradition, for instance, originating with Ramsey (1931) and Carnap (1958, 1966) and famously articulated and defended by Lewis (1970, 1972) (see also Hempel (1958, 1973)), according to which the meaning of “theoretical terms”, i.e., terms that are introduced in the course of theorizing, such as “gravitational field” or even “electron”, is to be identified via what is known as a “Ramsey sentence” (Psillos, 2000). Without going into formal detail, a Ramsey sentence is a sentence that asserts that there exists something that has certain properties, which in turn are expressed using terms that we already “understand”; the theoretical-term-to-be-defined, then, is identified with whatever has those properties already. Thus, to define “gravitational field,” one might say that there exists something that disposes massive bodies to accelerate, depending on their location; and the way in which it so disposes massive bodies is itself determined by the distribution of massive bodies at a time according to a particular equation. Of course, in this description, we have used other theoretical terms—“massive body”, “accelerate”, even “time”—but we presume that those terms are already understood; or else that they, too, may be defined using similar expressions.

Holding something like the Ramsey-Carnap-Lewis view of theoretical terms does not necessarily commit one to the view that empirical equivalence is sufficient for theoretical equivalence, since Ramsey sentences could arguably capture further “structural” content of a theory (cf. Maxwell, 1962, 1970; Ketland, 2004; Cruse, 2005; Melia and Saatsi, 2006; Worrall, 2007; Dewar, 2018). But it does raise the question of where Ramsey sentences bottom out, i.e., what sorts of terms or expressions may be taken as sufficiently basic to be used in all theoretical definitions—or even whether such “basic” terms exist. One view, endorsed by some logical positivists and logical empiricists (but not, for instance, by Lewis), is that it is propositions that may be expressed in an “observation language”, with no theoretical terms at all, that ground theoretical terms. On this view, even if one were to acknowledge that some expressions that are not empirically verifiable are meaningful, one might nonetheless think that the meanings of theoretical claims ultimately reduce to empirical claims, and thus that if two theories agree on all of their empirical claims, differences between their theoretical claims are matters (only) of definition or convention.

Philosophers of science often cavalierly assert that empirically equivalent alternatives to various theories are possible. But it is not clear that there is a recipe for finding such theories, or at least for finding examples that are not trivial notational variants of one another. Conversely, establishing that two apparently different theories are in fact empirically equivalent or inequivalent is often a subtle matter. Indeed, even establishing precisely what is meant by “empirical equivalence” can be tricky. For instance, as we indicated above, one would like to say that two theories are empirically equivalent if they can be applied in precisely the same circumstances (and always yield the same predictions). But in order to assess if this is true, one needs some suitably theory-neutral way of describing possible applications of a theory, i.e., some language for describing physical situations that does not invoke the resources of either theory. To see the difficulty, consider the question of what predictions Newtonian gravitation makes in the vicinity of a black hole, or what predictions classical electrodynamics and Newtonian mechanics make concerning measurements of an electron’s spin. In such cases, it is hard to see how the theories at issue have the expressive resources necessary to even represent the relevant situations, never mind the further question of making accurate predictions about them.

It is fair to say in both of these examples that the theories in question are simply empirically inequivalent. But one needs to be careful, because the mere fact that two theories would give very different theoretical descriptions of a situation does not by itself imply that they are empirically inequivalent, since some systematic translation may be available between different descriptions. To take an example, the de Broglie-Bohm pilot wave theory (a.k.a. Bohmian mechanics) is widely taken to be empirically equivalent to the standard von Neumann-Dirac formulation of quantum theory (Cushing, 1994; Dürr et al., 2012), even though in Bohmian mechanics, particles always have precise and definite positions, and every measurement one makes in the theory is ultimately understood as a measurement of position; whereas in the von Neumann-Dirac formulation, particles never have definite positions, and generically one can measure any number of different quantities. To establish the empirical equivalence of these theories, one needs to show that measurements of generic observables can always be reconceived as measurements of position. Such arguments have been given, though they rely on background assumptions about what sorts of measurement apparatuses are, in principle, available.

Reflecting on what the translations needed to establish empirical equivalence involve—in general, the ability to take any situation as described by one theory and identify it with a suitable corresponding description in the other theory, and vice versa—has led some philosophers to argue that empirical equivalence is a much stronger relationship than is usually supposed. Norton (2008), for instance, has argued that underdetermination of theories by evidence is not a real problem in science, precisely because any two theories that truly could not be distinguished by empirical considerations—that is, theories that were fully empirically equivalent—would necessarily have such a close relationship that one should probably want to say that they were either fully equivalent, or else differed in a way that makes one clearly preferable to the other (say because one posits unnecessary structure or entities). For replies to Norton, see Magnus and Frost-Arnold (2010) and Bradley (2018).

For further discussion of the interpretational options available when presented with empirically equivalent theories, see Le Bihan and Read (2019), though be forewarned that they use the term “duality” to refer to theories that are empirically equivalent, which is a bit different from the usage here.

## 3 Definitional Equivalence and Intertranslatability

Empirical equivalence is a weak form of equivalence between theories: one might think that two theories could be empirically equivalent, but nonetheless inequivalent in some stronger sense. For instance, two theories might make the same predictions, but nonetheless differ with regard to what structure they attribute to the world, what sorts of entities exist in the world, or what the laws of nature are. A well-known example is Bohmian mechanics, which is a quantum theory that arguably makes precisely the same predictions as the standard von Neumann-Dirac formulation of quantum mechanics (Cushing, 1994). And yet, these theories are rarely taken to be equivalent, since they differ dramatically in their laws, in the sorts of properties objects have, and various other features, such as whether the world is deterministic. Indeed, many philosophers of physics have argued that the “standard” formulation of quantum theory is inconsistent or incoherent, whereas Bohmian mechanics is not (Barrett, 2003). This suggests that we need a finer grained notion of equivalence.

During the 1970s, Glymour (1970, 1980) and Quine (1975) offered proposals for a stronger notion of equivalence between theories. In both cases, they argued that theories should be taken to be equivalent if they are inter-translatable, in the following sense: one should be able to take assertions in one theory and systematically translate them into assertions in the other theory, and vice versa, in a truth-preserving way. (One might think of this proposal as a generalization of the idea with which we began, that Newton’s Principia may be translated into English or French without changing the theory.) Glymour made this proposal precise using the notion of definitional equivalence, which is a criterion of equivalence used in first order logic (Montague, 1957; de Bouvere, 1965b, a; Kanger, 1968; Artigue et al., 1978). Quine, too, offered a technical proposal. But Barrett and Halvorson (2016a) have recently shown it to have some serious deficiencies and have argued that if one attempts to adjust it so as to capture Quine’s original motivation, Quine’s proposal collapses into Glymour’s. And so, in what follows we will focus on definitional equivalence and set aside the technical details of Quine’s proposal.

Before introducing definitional equivalence in more detail, it is worth emphasizing that both Glymour and Quine, in introducing stronger notions of equivalence between theories, wished to emphasize that theories could be empirically equivalent to one another, but nonetheless theoretically inequivalent, in the sense of making substantially different claims about the world. For Glymour in particular, this point was made in defense of a variety of scientific realism, according to which physical theories make assertions about the world that extend beyond their direct empirical consequences. If two theories could be empirically equivalent, but nonetheless inequivalent in some stronger sense, Glymour concluded, then at most one of them could be a true description of the world. Conversely, Quine was motivated by the question of whether all empirically equivalent theories were necessarily rivals, or if at least some such theories could be seen as equivalent to one another. (On this latter point, Glymour (1980) also argued that two theories could be empirically equivalent, and yet we could have better confirmation of one than the other.)

We will now turn to defining definitional equivalence. (For a more precise characterization, see, e.g., Barrett and Halvorson (2016b).) In the first instance, it is a relationship that can hold between two theories in first order logic. A first-order theory $T$ consists of two ingredients: (1) a signature $\Sigma$, which is a set of symbols, including constant symbols, functions, and predicates (we suppose we have fixed a collection of logical symbols, such as connectives and quantifiers); and (2) a set of axioms in that signature, which are formulae (with no free variables) constructed using only the symbols in $\Sigma$, logical connectives, quantifiers, and variables. The signature can be thought of as a sort of vocabulary: it contains the terms that the theory uses. And the axioms may be thought of as the basic assertions that the theory makes, expressed using just the vocabulary of the theory.

Now suppose we are given a new symbol, $s$, that is not already in the signature $\Sigma$. Assume, for the sake of simplicity, that $s$ is a unary predicate, i.e., a symbol such that for any constant $c$, $s(c)$ is a sentence. An explicit definition of $s$ in terms of $\Sigma$ is a sentence (in the signature $\Sigma \cup \{s\}$)

$$\delta_s = \forall x\,\bigl(s(x) \leftrightarrow \phi(x)\bigr),$$

where $\phi$ is a formula (with one free variable, $x$) in the signature $\Sigma$. This sentence asserts that for all $x$, $s(x)$ is true if and only if $\phi(x)$ holds. Thus $\delta_s$ defines $s$ using only the vocabulary available in the theory $T$. One can define similar sentences for functions and for constant symbols. (The details will not matter, but we remark for completeness that explicit definitions of constants and functions imply certain further sentences, known as admissibility conditions, in $\Sigma$. These conditions reflect constraints on new symbols of a given type, and so one only considers explicit definitions of new functions and constants for which the admissibility conditions can be proved in $T$.)

By adding new symbols to the signature $\Sigma$, and by adding explicit definitions of those symbols to the axioms of $T$, one can construct a new theory, $T^+$, in a new signature $\Sigma^+$. Such a new theory is called a definitional extension of $T$. More precisely, a definitional extension of a theory $T$ in signature $\Sigma$ is a theory $T^+$ in signature $\Sigma^+ \supseteq \Sigma$ whose axioms consist of (1) the axioms of $T$ and (2) for each symbol $s \in \Sigma^+$ that does not appear in $\Sigma$, an explicit definition $\delta_s$ of $s$ in terms of $\Sigma$ (with the property that all of the admissibility conditions for these explicit definitions are satisfied by $T$). A definitional extension $T^+$ of a theory $T$ is naturally understood to have precisely the same expressive resources as $T$: it says the same things, makes the same sentences (in the signature $\Sigma$) true and false, and so on. It merely has a larger vocabulary available with which to express the assertions of $T$.
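To make the construction concrete, here is a small illustrative example. The particular theory and the predicate name $\mathrm{Min}$ are assumptions chosen for illustration and are not drawn from the works cited here. Write $T$ for the set of axioms of the theory of strict linear orders, in the signature $\Sigma = \{<\}$, and introduce a new unary predicate $\mathrm{Min}$, read as “is a least element”:

```latex
\begin{align*}
\Sigma &= \{<\}, \qquad \Sigma^{+} = \{<, \mathrm{Min}\} \\
\delta_{\mathrm{Min}} &= \forall x\,\bigl(\mathrm{Min}(x) \leftrightarrow \neg\exists y\,(y < x)\bigr) \\
T^{+} &= T \cup \{\delta_{\mathrm{Min}}\}
\end{align*}
```

The defining formula $\neg\exists y\,(y < x)$ uses only the symbol $<$, so any sentence of $T^{+}$ that mentions $\mathrm{Min}$ can be rewritten as a sentence in the original signature by substituting the defining formula. Because $\mathrm{Min}$ is a predicate rather than a function or constant, no admissibility condition arises in this case.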

Consider two first order theories, $T_1$ and $T_2$. Assume, for simplicity, that the signatures of the two theories, $\Sigma_1$ and $\Sigma_2$, are disjoint: they do not both use the term “electron”, say. (This assumption is generally benign, though see Lefever and Székely (2018).) We will say that $T_1$ and $T_2$ are definitionally equivalent if there exist definitional extensions $T_1^+$ and $T_2^+$ of $T_1$ and $T_2$, respectively, both in signature $\Sigma_1 \cup \Sigma_2$, with the following property: $T_1^+$ and $T_2^+$ are logically equivalent. Here logical equivalence means that, given any sentence $\phi$ in $\Sigma_1 \cup \Sigma_2$, $\phi$ is provable (so, by the completeness of first order logic, true) in $T_1^+$ if and only if it is provable in $T_2^+$: or, in logical notation, $T_1^+ \vdash \phi$ if and only if $T_2^+ \vdash \phi$. Observe that it makes sense to ask whether $T_1^+$ and $T_2^+$ are logically equivalent precisely because they have the same signature, and thus they both rule on the truth or falsity of precisely the same set of sentences. The theories $T_1$ and $T_2$, meanwhile, have different languages, and so one cannot even ask whether they make the same sentences true.

Intuitively, definitional equivalence says that given any sentence $\phi$ of theory $T_1$, I can, by using explicit definitions of each of the non-logical symbols in $\Sigma_1$ in terms of $\Sigma_2$, translate that sentence into a sentence in the language of $T_2$ that is provable in $T_2$ iff $\phi$ was provable in $T_1$; and vice versa. Anything true I can say in one theory can be said, equally well and with the same truth conditions, in the other theory. In this way, it captures Glymour’s (and, to an extent, Quine’s) suggestion that two theories should be equivalent if (and only if) they are inter-translatable.
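A simple illustration, offered as a sketch rather than as an example taken from the works cited above, is the pair of theories of strict and non-strict partial orders. The labels $T_1$, $T_2$, $\Sigma_1$, $\Sigma_2$, $\delta_{\le}$, and $\delta_{<}$ below are notational assumptions made for this example.

```latex
\begin{align*}
T_1 \text{ in } \Sigma_1 = \{<\}:\quad
  & \forall x\,\neg(x < x), \\
  & \forall x\,\forall y\,\forall z\,\bigl((x < y \wedge y < z) \rightarrow x < z\bigr) \\[4pt]
T_2 \text{ in } \Sigma_2 = \{\le\}:\quad
  & \forall x\,(x \le x), \\
  & \forall x\,\forall y\,\bigl((x \le y \wedge y \le x) \rightarrow x = y\bigr), \\
  & \forall x\,\forall y\,\forall z\,\bigl((x \le y \wedge y \le z) \rightarrow x \le z\bigr) \\[4pt]
\delta_{\le} &= \forall x\,\forall y\,\bigl(x \le y \leftrightarrow (x < y \vee x = y)\bigr) \\
\delta_{<} &= \forall x\,\forall y\,\bigl(x < y \leftrightarrow (x \le y \wedge x \neq y)\bigr)
\end{align*}
```

With these definitions, $T_1 \cup \{\delta_{\le}\}$ and $T_2 \cup \{\delta_{<}\}$ are both theories in the signature $\{<, \le\}$ and prove the same sentences, so $T_1$ and $T_2$ are definitionally equivalent. For instance, the $T_1$ axiom $\forall x\,\neg(x < x)$ translates to $\forall x\,\neg(x \le x \wedge x \neq x)$, which is provable in $T_2$.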

Definitional equivalence certainly captures an interesting and important sense in which two theories may be equivalent, at least in first order logic. But it has some problems. The most immediate problem is that definitional equivalence applies to first order theories, while we are interested in equivalence between physical theories, which are rarely expressed in first order logic. Worse, many of the mathematical tools that we regularly use in physics, such as topology, apparently do not have first order axiomatizations. And even if they did, it is not clear that we could capture the full range of theoretical practice in physics in the framework of first order logic. If we do not have, once and for all, a “language” or “axioms” for each of our physical theories, how could we hope to apply definitional equivalence in practice?

Glymour (1980) recognized this concern. But, he argued, definitional equivalence could still be useful in physics. He illustrated this claim with an example from Newtonian gravitation. First, Glymour pointed out that while definitional equivalence is a syntactic notion, it has a semantic counterpart, using the theory of models of first order logic (Hodges, 1993, 1997). Roughly speaking, a model of a first order theory is an ordered collection of sets, consisting of a domain of quantification (i.e., a set of objects about which the theory is interpreted to make assertions) and subsets of that domain (or subsets of products of that domain with itself), corresponding to the extensions of each of the various predicates, functions, and relations in the theory’s signature, $\Sigma$. A model may be thought of as a structure, along with an interpretation of the sentences of $T$ as assertions about that structure, such that those assertions are all true.

How can we express definitional equivalence using models? To answer this, we need two facts. First, observe that another, equivalent characterization of logical equivalence of two theories in the same signature is that they have (precisely) the same models. Second, observe that, given any theory $T$, any model $M$ of $T$, and any definitional extension $T^+$ of $T$, there exists a unique definitional expansion of $M$, $M^+$, which is a model of $T^+$ (Hodges, 1997, p.53). Thus, I can take the models of $T$ (the structures of which $T$ is true) and uniquely expand them to yield new structures of which the extended theory is true. Putting these pieces together, we can conclude that theories $T_1$ and $T_2$ are definitionally equivalent only if for every model $M_1$ of $T_1$, there is a unique definitional expansion of $M_1$ to a $\Sigma_1 \cup \Sigma_2$ structure, whose “reduct”, i.e., restriction to $\Sigma_2$, is a model of $T_2$, and vice versa.
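The semantic condition can be checked by brute force on small finite structures. The sketch below is only a sanity check on a three-element domain, not a proof, and everything in it is an assumption made for illustration: the choice of theories (strict partial orders in signature `{<}` and non-strict partial orders in signature `{<=}`), the explicit definitions, and all function and variable names. It expands each model of the first theory by the defined symbol, takes the reduct to the second signature, and confirms that the result is a model of the second theory, and vice versa.

```python
from itertools import product

DOMAIN = (0, 1, 2)
PAIRS = [(a, b) for a in DOMAIN for b in DOMAIN]

def is_transitive(rel):
    return all((a, c) in rel
               for (a, b) in rel for (b2, c) in rel if b == b2)

def is_strict_order(lt):
    """Model of T1: < is irreflexive and transitive."""
    return all((a, a) not in lt for a in DOMAIN) and is_transitive(lt)

def is_partial_order(le):
    """Model of T2: <= is reflexive, antisymmetric and transitive."""
    reflexive = all((a, a) in le for a in DOMAIN)
    antisymmetric = all(a == b or (b, a) not in le for (a, b) in le)
    return reflexive and antisymmetric and is_transitive(le)

def all_relations():
    """Every binary relation on DOMAIN, as a frozenset of pairs."""
    for bits in product((False, True), repeat=len(PAIRS)):
        yield frozenset(p for p, keep in zip(PAIRS, bits) if keep)

def define_le(lt):
    """Explicit definition: x <= y iff (x < y or x = y)."""
    return frozenset((a, b) for (a, b) in PAIRS if (a, b) in lt or a == b)

def define_lt(le):
    """Explicit definition: x < y iff (x <= y and x != y)."""
    return frozenset((a, b) for (a, b) in PAIRS if (a, b) in le and a != b)

models_T1 = {r for r in all_relations() if is_strict_order(r)}
models_T2 = {r for r in all_relations() if is_partial_order(r)}

# Expand each model by the defined symbol, then restrict to the other signature.
reducts = {define_le(lt) for lt in models_T1}
reducts_back = {define_lt(le) for le in models_T2}

# Translating there and back should return the model we started with.
round_trip_ok = all(define_lt(define_le(lt)) == lt for lt in models_T1)

print(reducts == models_T2, reducts_back == models_T1, round_trip_ok)
```

Each expansion is unique here because the defined relation is computed from the original one, which mirrors the uniqueness of definitional expansions noted above. A many-to-one translation would show up as `define_le` sending two distinct models to the same reduct, in which case the round trip would fail.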

This semantic version of definitional equivalence does not immediately solve the first problem above, because it still concerns first order logic. But it is suggestive, because one does, in physics, often have “models” of a physical theory, and these models are generally mathematical structures that “realize” the assertions of the theory. The models of general relativity, for instance, are smooth, four dimensional manifolds (satisfying some further conditions) endowed with smooth, Lorentz-signature metrics. The models of non-relativistic quantum mechanics are Hilbert spaces, along with a suitable subalgebra of the bounded operators on that Hilbert space. And so on. (One has to be careful here, because a “model” in physics is not generally a $\Sigma$-structure for any first order theory; indeed, scientists and philosophers of science use the term “model” in an enormous variety of ways (Downes, 1992; Weisberg, 2012; O’Connor and Weatherall, 2016).)

In light of these considerations, Glymour proposed a “liberal” notion of definitional equivalence—one inspired by the ideas of first order logic, but applicable more broadly. He did not precisely define this notion of equivalence, but he did argue that it should imply the following: two theories are equivalent only if, given any model of one of the theories, one can uniquely transform it into a model of the other theory; and vice versa. Thus we find, if not a sufficient condition, at least a necessary condition for theoretical equivalence that can be applied in real cases.

Glymour (1980) illustrated the point with an influential example. Newtonian gravitation is standardly formulated as a theory in which bodies accelerate in the presence of a gravitational field, which in turn depends on the distribution of matter. But there is another formulation of the theory, of interest to philosophers because it bears some qualitative similarities to general relativity, on which gravitation is “geometrized”. In this theory, there is no gravitational field and bodies do not accelerate due to gravitation; instead, they follow geodesics in curved spacetime, with curvature proportional to the distribution of mass (cf. e.g., Malament, 2012, Ch. 4). One might then ask: are these two theories equivalent to one another? There is a classic result due to Trautman (1965) that bears on this. It says: given any model of standard Newtonian gravitation, there exists a unique model of the geometrized theory with the same mass density in which bodies follow the same trajectories; and conversely, given any model of the geometrized theory (satisfying certain conditions), there exists a model of the standard theory that, likewise, agrees on mass density and trajectories. Thus, we have a precise sense in which the theories are empirically equivalent, and we can even translate between them in a certain sense. But, Glymour argues, they are not equivalent according to the criterion of definitional equivalence. The reason is that while the translation from the standard theory to the geometrized theory is unique, it is many-to-one, and so the translation in the opposite direction is not unique. Applying definitional equivalence in this sort of way may not be dispositive, but it certainly seems probative.

In this example, Glymour uses definitional equivalence as a necessary condition. But does definitional equivalence also provide us with sufficient conditions for equivalence? There is reason to think the answer is “no”, at least so far. As described, definitional equivalence is a purely formal relation between theories. Nothing in the notion of “explicit definition” or the ensuing characterization of definitional equivalence requires that translations between theories preserve any prior “meaning” of the terms or sentences of either theory. But, as Sklar (1982) pointed out in an influential response to Glymour, physical theories have (physical) interpretations; presumably two theories are equivalent only if their interpreted claims about the physical world are in some good sense the same. Consider, for instance, the mathematical theory of Brownian motion as applied to a particle of pollen suspended in a fluid, and the same mathematical theory as applied to stock market prices (cf. Van Fraassen, 2014, p. 279). There is surely some sense in which these theories are “intertranslatable”: every time the word “position” appears in the first theory, one can substitute “price” in the second, and so on. And yet they are surely not equivalent theories since they are talking about different subject matter.

We will return to the issue of “interpretational equivalence” in section 5, but at least in the first instance, it seems that if definitional equivalence is to be a satisfactory criterion of equivalence between physical theories, it must at the very least be supplemented with a requirement that equivalent theories be empirically equivalent, and moreover, that the translations between the theories that realize the definitional equivalence be suitably compatible with this empirical equivalence. It is not clear how to make this requirement precise, and in any case, as we saw above, empirical equivalence is itself a difficult notion. But it is also true that, as in the case of Newtonian gravitation just mentioned, in some cases theories are empirically equivalent, and moreover, that one can require definitional equivalence as a further, strictly stronger condition.

We conclude this section by remarking on another problem with definitional equivalence, which is that it is arguably too strong, even in the case of first order theories. For instance, as Barrett and Halvorson (2016b) note, definitional equivalence cannot capture cases in which two theories that use (multiple) different sorts, i.e., different classes of entity to which different predicates apply, might be equivalent. (See also Barrett and Halvorson, 2017, for a concrete example in which the difference matters.) Instead, they propose a weaker notion of equivalence that they call Morita equivalence, but which others have called generalized definitional equivalence (Andréka et al., 2002; Madarász, 2002; Andréka and Németi, 2014). Similarly, Andréka and Németi (2014) and Lefever and Székely (2018) argue that definitional equivalence needs to be generalized to accommodate theories with non-disjoint signatures.

These considerations lead to a number of related notions of equivalence, all motivated by similar considerations as definitional equivalence, that can help decide cases of theories in first order logic (as, for instance, in Lefever and Székely (2017)). In general, however, the difficulties described above concerning how to apply these notions to cases in which a first order formulation is not available remain open.

## 4 Categorical Equivalence

As noted at the end of the previous section, there are reasons to think that definitional equivalence is either inadequate or of limited use in practice (or both). Motivated by these and related concerns, Halvorson (2012) and Weatherall (2016a) have proposed a different criterion of equivalence between physical theories, using methods from category theory; this proposal has subsequently been pursued and developed by numerous authors (Halvorson, 2015b; Halvorson and Tsementzis, 2017; Rosenstock et al., 2015; Nguyen et al., 2018; Dewar and Eva, 2017; Barrett, 2014, 2015, 2017a; Weatherall, 2017, 2016b).[^1]

[^1]: The development of the subject is difficult to establish based on the published literature, since a large number of papers either appeared online or were published at approximately the same time, even though they were produced sequentially over the course of about five years. Briefly, Halvorson first publicly discussed the ideas that led to (Halvorson, 2012) in a February 2011 lecture in Irvine; (Weatherall, 2016a) was first drafted the following summer (in close conversation with Halvorson while on a visit to Princeton) and distributed to the Southern California Philosophy of Physics Group and others in Fall 2011. It went on the Pittsburgh Philosophy of Science Archive in 2014 and was accepted for publication in early 2015, but was not published until 2016. In the meantime, a number of papers that drew on, extended, and criticized the ideas of Halvorson (2012) and Weatherall (2016a), such as Rosenstock et al. (2015), Rosenstock and Weatherall (2016), Halvorson (2015b, a), and Barrett and Halvorson (2016a, b), were drafted, circulated, and published, with many appearing online in 2016.

Categorical equivalence has a number of virtues, including some that definitional equivalence lacks. For instance, it can be readily applied to real cases, and it appears to render intuitively plausible verdicts in a number of cases of real interest. Moreover, it captures senses of equivalence that have been implicitly invoked in earlier philosophical literature (Rynasiewicz, 1992; Rosenstock et al., 2015; Weatherall, 2016b), and related notions of equivalence are often used in mathematical physics (e.g., Schreiber and Waldorf, 2007; Schreiber, 2013) and, recently, in the foundations of mathematics (e.g., Univalent Foundations Program, 2013; Visser, 2017).

Before introducing categorical equivalence, we remark on the motivation for the proposal. Halvorson (2012) begins with a critique of the so-called “semantic view of theories”, according to which a theory should be identified with a collection of models, on the grounds that it delivers the wrong verdicts on equivalence, as seen in a series of examples. (Recall the discussion above concerning the ambiguity in the meaning of the term “model” here.) At the end of the article, he sketches an alternative proposal: to adequately capture the structure of a theory, he suggests, one needs to consider structured sets of models, i.e., collections of models with further information about the relationship between those models. Invoking recent work in categorical logic (Makkai, 1993; Awodey and Forssell, 2013), he suggests that categories of models may be natural candidates to represent theories. (In reply, Glymour (2013) argued that in fact definitional equivalence provides a suitably semantic characterization of equivalence (recall section 3); to which Halvorson (2013) replied that definitional equivalence, even formulated using tools from model theory, invokes the signature of the theory in a way that is in tension with the rhetoric of the semantic view’s defenders. For our purposes, this is a side issue; but see Lutz (2014, 2017); Van Fraassen (2014); Hudetz (2017); Button and Walsh (2018) for further discussion.)

Weatherall (2016a) comes to categorical equivalence by reflecting on examples. Recall that Glymour applied definitional equivalence to study the relationship between “standard” and geometrized Newtonian gravitation, concluding that these theories cannot be equivalent. Weatherall first shows that if one applies Glymour’s argument to another pair of theories—classical electromagnetism in a “gauge dependent” formulation, in terms of 4-potentials; and classical electromagnetism in a “gauge free” formulation, in terms of an electromagnetic field strength (Faraday) tensor—one likewise concludes that the theories are inequivalent. And the reason is the same: there is a many-one relationship between 4-potentials and electromagnetic fields.

The trouble with this conclusion is that physicists are well-aware of the asymmetry between potentials and electromagnetic fields, and yet the two formulations of electromagnetism are used interchangeably. As Weatherall argues, the reason is simple: physicists recognize additional relationships between the models of the theories. In particular, 4-potentials associated with the same electromagnetic field are understood to be equivalent, in the sense that they can be used interchangeably to represent the same physical situations. Such 4-potentials are said to be related by gauge transformations. (Note that there are two senses of equivalence under discussion, now: one is a relationship between theories, or formulations of theories, while the other is between models of a single theory.)
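The many-one relationship and the gauge transformations just described can be written compactly in differential-form notation. This notation is not used in the text above and is introduced here only as an illustrative sketch: $A$ stands for a 4-potential (a one-form), $F$ for the field strength it determines, $d$ for the exterior derivative, and $\chi$ for an arbitrary smooth scalar field. Sign and normalization conventions vary between authors.

```latex
F = dA, \qquad
A' = A + d\chi
\;\Longrightarrow\;
F' = dA' = dA + d(d\chi) = dA = F .
```

Because $d(d\chi)=0$ for every $\chi$, all potentials of the form $A + d\chi$ determine the same field strength. On Minkowski spacetime the converse also holds: any two potentials with the same field strength differ by some $d\chi$.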

Weatherall then proposes adjusting Glymour’s criterion so that two theories are equivalent if the models of one can be uniquely transformed into models of the other, and vice versa, in a way that takes you back to the model with which you began, up to model equivalence. Applying this moral to Newtonian gravitation, then, Weatherall argues that whether standard and geometrized Newtonian gravitation are equivalent depends on a prior choice of whether models of the standard theory associated to a single model of the geometrized theory should be taken as equivalent. If one concludes that they are—which one might have independent motivation for doing (Norton, 1992; Malament, 1995; Norton, 1995; Wallace, 2017)—then the two theories should be taken to be equivalent; if not, then Glymour’s original argument stands.

This line of thought has just been rehearsed without invoking any category theory. So where does categorical equivalence come in? It turns out that categorical equivalence is a way to make the idea of “unique transformation up to equivalence” precise.

A category consists of two sorts of data: a collection of objects and, for each ordered pair of objects $A$ and $B$, a collection of arrows between them, denoted $\hom(A,B)$. These are required to satisfy the following criteria: given arrows $f$ and $g$, such that $f$ terminates at the object at which $g$ originates, there exists a unique arrow, $g\circ f$, called the composition of $f$ with $g$, originating at the same object as $f$ and terminating at the same object as $g$; composition of arrows is associative, in the sense that, given suitably composable arrows $f$, $g$, and $h$, we have $h\circ(g\circ f)=(h\circ g)\circ f$; and finally, for every object $A$, there exists a unique identity arrow $1_A$, originating and terminating at $A$, with the properties that for any arrow $f$ originating at $A$, $f\circ 1_A=f$, and for any arrow $g$ terminating at $A$, $1_A\circ g=g$. (For further details on category theory, see Mac Lane, 1998; Awodey, 2006; Leinster, 2014.)

On a first pass, it is helpful to think about so-called “concrete” categories, to get a sense of how they work. Consider, for instance, the category Set, which has, as objects, sets, and as arrows, functions between sets; or the category Group, which has, as objects, groups, and as arrows, group homomorphisms. (Note that there is a problem, here, related to “size”, since there is no set of all sets. We set this aside for present purposes.)

A functor is a map between categories that takes objects to objects, arrows to arrows, and which preserves domains, codomains, composition, and identity. Functors can have various properties, analogous to the way in which functions (between sets, say) may be “injective” or “surjective”. Fix a functor $F:\mathcal{C}\rightarrow\mathcal{D}$. We will say that $F$ is full if, given any two objects $A$ and $B$ of $\mathcal{C}$, the map that $F$ induces between $\hom(A,B)$ and $\hom(F(A),F(B))$ is surjective. It is faithful if that map is injective. And it is essentially surjective if, for every object $X$ of $\mathcal{D}$, there is an object $A$ of $\mathcal{C}$ such that $F(A)$ is isomorphic to $X$, where by “isomorphism”, here, we mean that there exist arrows $f$ and $f^{-1}$, from $F(A)$ to $X$ and vice versa, respectively, such that $f^{-1}\circ f=1_{F(A)}$ and $f\circ f^{-1}=1_X$. Now suppose we are given two categories, $\mathcal{C}$ and $\mathcal{D}$. We say that $\mathcal{C}$ and $\mathcal{D}$ are equivalent if there exists a functor $F:\mathcal{C}\rightarrow\mathcal{D}$ that is full, faithful, and essentially surjective. If this holds, then there exists a functor $G:\mathcal{D}\rightarrow\mathcal{C}$ that is also full, faithful, and essentially surjective that is “almost inverse” to $F$, in the sense that the composition $G\circ F$ takes objects of $\mathcal{C}$ to isomorphic objects of $\mathcal{C}$, and likewise for $F\circ G$.

What does this relationship capture? For one, it says that the two categories are “almost” isomorphic, in the sense that the objects of the two categories, $\mathcal{C}$ and $\mathcal{D}$, can be uniquely identified with one another “up to isomorphism”, in a way that preserves all of the relations between objects encoded in the arrows of the categories. This sort of “almost isomorphism” of categories turns out to be enormously fruitful in mathematics. Indeed, many deep theorems relating disparate areas of mathematics turn out to be expressible as assertions of equivalence (or other functorial relations) between categories.

Intuitively, the idea of categorical equivalence is close to the “up to isomorphism” relationship described between the formulations of electromagnetism discussed above. And indeed, this idea can be made precise, by defining two categories: one of which has as objects electromagnetic field tensors on Minkowski spacetime, and has as arrows “isomorphisms” of that structure; and another has as objects 4-potentials, and has as arrows “isomorphisms” of that structure, including gauge transformations. (See Weatherall, 2016a, b; Nguyen et al., 2018, for further details.) One then finds that these categories are equivalent, with the equivalence realized by functors whose action on objects is given by the relationship described above. If, on the other hand, one did not include the gauge transformations, the resulting categories would not be equivalent. Similarly for Newtonian gravitation.
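The role played by the gauge transformations can be illustrated with a deliberately tiny finite example. The sketch below is a toy stand-in, not the categories constructed in the works cited above: it assumes just two gauge-related potentials (`A1`, `A2`) and a single field (`F`), and every name in it (`FiniteCategory`, `functor_properties`, `gauge`, `gauge_inv`) is hypothetical. It checks whether the functor sending both potentials to the one field is full, faithful, and essentially surjective, first with the gauge arrows included and then with identities only.

```python
class FiniteCategory:
    """Toy finite category: named arrows plus a composition table.

    Assumption of this sketch: only identity arrows have names starting
    with "id_".
    """

    def __init__(self, objects, arrows, table=None):
        self.objects = list(objects)
        # arrows maps an arrow name to its (source, target) pair.
        self.arrows = dict(arrows)
        for obj in self.objects:
            self.arrows[f"id_{obj}"] = (obj, obj)  # add identity arrows
        # table maps (g, f) to the name of "g after f" for non-identity arrows.
        self.table = dict(table or {})

    def hom(self, a, b):
        """All arrows from object a to object b."""
        return {n for n, (s, t) in self.arrows.items() if (s, t) == (a, b)}

    def compose(self, g, f):
        """Return "g after f"; identity arrows act trivially."""
        if self.arrows[f][1] != self.arrows[g][0]:
            raise ValueError("arrows are not composable")
        if f.startswith("id_"):
            return g
        if g.startswith("id_"):
            return f
        return self.table[(g, f)]

    def isomorphic(self, a, b):
        """True if some arrows a -> b and b -> a are mutually inverse."""
        return any(
            self.compose(g, f) == f"id_{a}" and self.compose(f, g) == f"id_{b}"
            for f in self.hom(a, b)
            for g in self.hom(b, a)
        )


def functor_properties(C, D, on_objects, on_arrows):
    """Check fullness, faithfulness, and essential surjectivity of a functor
    C -> D given by the dictionaries on_objects and on_arrows."""
    pairs = [(a, b) for a in C.objects for b in C.objects]
    images = {p: [on_arrows[f] for f in sorted(C.hom(*p))] for p in pairs}
    full = all(
        set(images[(a, b)]) == D.hom(on_objects[a], on_objects[b])
        for (a, b) in pairs
    )
    faithful = all(len(set(images[p])) == len(images[p]) for p in pairs)
    essentially_surjective = all(
        any(D.isomorphic(on_objects[a], x) for a in C.objects)
        for x in D.objects
    )
    return {
        "full": full,
        "faithful": faithful,
        "essentially_surjective": essentially_surjective,
    }


# One field F; two potentials A1 and A2 that both determine F.
fields = FiniteCategory(["F"], {})
obj_map = {"A1": "F", "A2": "F"}

# Potentials with a gauge transformation and its inverse as arrows.
with_gauge = FiniteCategory(
    ["A1", "A2"],
    {"gauge": ("A1", "A2"), "gauge_inv": ("A2", "A1")},
    {("gauge_inv", "gauge"): "id_A1", ("gauge", "gauge_inv"): "id_A2"},
)
# Potentials with identity arrows only.
without_gauge = FiniteCategory(["A1", "A2"], {})


def to_field(category):
    """Send every arrow of the potential category to the identity on F."""
    return {name: "id_F" for name in category.arrows}


result_with = functor_properties(with_gauge, fields, obj_map, to_field(with_gauge))
result_without = functor_properties(
    without_gauge, fields, obj_map, to_field(without_gauge)
)
print(result_with)
print(result_without)
```

With the gauge arrows included, all three conditions hold. With identities only, the functor fails to be full, since there is no arrow from `A1` to `A2` to map onto the identity on `F`. This mirrors, in miniature, the contrast drawn above between including and omitting gauge transformations.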

Observe that the functors just described may also be said to preserve “empirical structure”, i.e., they take models of one formulation to models of the other that have the same observational significance. One can spell this out in several ways, but they all turn on the same idea: ultimately, the empirical significance of a model of electromagnetism is entirely encoded in the electromagnetic field strength. Since the functor described preserves the electromagnetic field strength associated with models in each formulation, the functor must preserve empirical structure, or “predictions”. Presumably, this is a condition we should demand on any functor that is a candidate for realizing an “equivalence” between theories, for the same reasons we considered above in connection with definitional equivalence.

We can extract from this discussion a candidate criterion of theoretical equivalence. Suppose we are given two theories, each represented as categories whose objects are the models of the theories and whose arrows preserve, in some suitable sense, the structure of the models. We say that the two theories are (categorically) equivalent if there exists an equivalence of these categories that preserves empirical structure in the sense just sketched (Weatherall, 2016a).

What considerations recommend this criterion? Many of our physical theories lend themselves to characterizations as collections of certain mathematical structures. If categorical equivalence is a fruitful notion of equivalence between mathematical theories, then it can presumably be used to capture a sense in which the mathematical structures used in those theories are equivalent (qua mathematics); and if, in addition, the physical theories are empirically equivalent, in the sense described above, one captures a sense in which categorically equivalent theories use equivalent mathematics to capture the same empirical regularities. (Of course, this argument depends on a prior acceptance of categorical equivalence as fruitful in mathematics, which has not been defended here.) Moreover, categorical equivalence is readily applicable to many cases (Weatherall, 2017): in addition to the cases already discussed, these methods have clarified the sense in which Hamiltonian mechanics is equivalent to Lagrangian mechanics (Barrett, 2017a) and general relativity is equivalent (actually, dual) to the theory of Einstein algebras (Rosenstock et al., 2015). Finally, categorical equivalence offers a fruitful guide to the ways in which two otherwise similar theories fail to be equivalent. In particular a given functor may fail to be full, to be faithful, or to be essentially surjective (or more than one of these conditions may fail); such functors are sometimes said to be forgetful (Baez et al., 2004). Studying the properties of such functors can allow one to say what is “forgotten” when moving from one theory to the other (Weatherall, 2016b; Rosenstock and Weatherall, 2016; Nguyen et al., 2018; Bradley et al., 2019).

But there are also reasons to be cautious about this criterion of equivalence. One problem is how to choose the right categories in the first place. Arguably, many physical theories can be expressed by describing a collection of models. But to construct a category, one needs to provide additional information: arrows between these models. And as we have already seen, there are often multiple choices available. How do we know we have constructed the correct category of models for a given theory? One reason why this problem may not be devastating is that, in the cases we have considered, different choices reflect different ways of understanding the theory itself, in a way that may draw attention to salient interpretational ambiguities. We saw this already in the cases of electromagnetism and Newtonian gravitation; Barrett (2017a) has made the point particularly clearly in the context of Lagrangian and Hamiltonian mechanics—two theories whose relations have been a matter of some dispute in the recent literature (North, 2009; Curiel, 2013; Barrett, 2014).

There is another version of this worry, however, that may have more teeth. Categorical equivalence makes sense only if we represent a theory by a category that adequately represents the structure of that theory. Under what circumstances can we be confident that we have done so? For instance, given a category of models, and no further information, can one reconstruct a theory? In particular, we usually think of the “internal” structure of models of a theory as representing physical situations—for instance, in general relativity it is the points of a manifold that represent events in space and time. But it is not always possible to reconstruct this internal structure from the categorical structure—that is, from the arrows between models. This worry has been sharpened and emphasized by Hudetz (2018), who has argued that simply identifying a functor between theories that is full, faithful, and essentially surjective is sometimes not sufficient to establish that the theories are equivalent, even if the functor preserves empirical equivalence. The reason is that such a functor may not take models to other models with suitably related internal structure. To see this worry, consider an extreme case: define a category of models of some physical theory, and then take another category whose objects are “structureless points” and whose arrows are chosen precisely so as to make the two categories equivalent. The latter category arguably does not represent a physical theory at all, much less one equivalent to what one began with.

To avoid this trivialization worry, Hudetz proposes that the functor needs to be what he calls a “reconstruction functor”, which means that the model-to-model mapping determined by the functor must take models to models that can be “reconstructed” from the model with which one began. The notion of “reconstruction” offered here is very similar to that used in section 3, in “defining” models of one theory from models of another in the context of definitional equivalence or generalized definitional equivalence.

Hudetz’s proposal may be thought of as a hybrid of (generalized) definitional equivalence and categorical equivalence, and it arguably has the advantages of both. On the other hand, it also inherits disadvantages of both. Moreover, it is not clear that the trivialization worry is real, at least if one has chosen the categories one begins with judiciously: for a sufficiently rich category, one may be able to “reconstruct” the internal structure of any object in a category, up to isomorphism, from the arrows of the category. (Indeed, one can do precisely this in, for instance, the category of sets and functions; see also Barrett (2017c) for a discussion of how much structure is captured by the isomorphisms between models of a first order theory.) From this perspective, the “structureless points” of the example above are not structureless after all, and the further restriction to reconstruction functors is unnecessary or, in other words, automatic. (For further discussion of worries along these lines, see Weatherall (2018).)

There are many open questions, both mathematical and methodological, related to these last two arguments. But they both point at the same worry: categorical equivalence may be too weak. (Indeed, Barrett and Halvorson (2016b) explicitly show that it is strictly weaker than generalized definitional equivalence in the first order case.) One would like to have better control over when categorical equivalence yields reliable guidance, and when it does not. But in the meantime, there is another attitude one can adopt, defended by Sarita Rosenstock. On her view, categorical equivalence is not a formal criterion of equivalence that one can use, with no background understanding of the theories involved, to individuate theories. Instead, it is a heuristic for evaluating proposed relationships between theories of prior interest. In other words, the right question to ask is: given two theories and some proposed relationship between them, can one capture that relationship as a certain functor between categories associated with the theories, and if so, is that functor full, faithful, and essentially surjective? (One could also extend this, and ask: is the functor a reconstruction functor in Hudetz’s sense, above?) If so, that suggests that the relationship may realize a sense in which the theories are equivalent; if not, then one can use the properties of the functor to clarify what is “forgotten” as one moves between the theories.

## 5 Interpretational Equivalence

(Generalized) definitional and categorical equivalence rely on formal relationships that might obtain between pairs of scientific theories, expressed in a particular way (e.g., as a first order theory). But some authors have recently argued that formal criteria of equivalence cannot succeed. In particular, Coffey (2014) has argued that whether two theories are equivalent (or perhaps better, whether two theories are different presentations of a single underlying theory) is a question of how we interpret the theories: that is, do these theories represent the same physical ontology, possibly structured in the same ways, governed by the same laws, etc.? (For a closely related view, see Maudlin (2018).) This is not a question about formulations of a theory, so much as a question about our intentions and interpretations; as such, the question cannot be settled by formal criteria. Coffey argues that recognizing the role of interpretation in judgments of equivalence and inequivalence can explain why there are disagreements about particular cases of putative equivalence, including the examples of electromagnetism and Newtonian gravitation discussed above, and why in some cases there are asymmetries in these judgments, such that one may need to “fix” one formulation of a theory, for instance by introducing gauge transformations. In such cases, interpretation of the formalism is surely playing an essential role in guiding our understanding of what “parts” of a theory to take seriously for purposes of judging their equivalence. (See also Butterfield, 2019; Le Bihan and Read, 2019, discussed in section 6.)

Along similar lines—but beginning with the literature on scientific representation—Nguyen (2017) has recently argued that whether two theories are equivalent should be understood as a question about whether they permit one to model the same “target systems”, and, if they do, whether the models they offer warrant the same claims or inferences about those target systems. He, too, suggests that “purely formal” accounts of theoretical equivalence cannot be sufficient, because they cannot capture the role that intention and interpretation play in the semantics of scientific theories and models. His alternative proposal is that two theories are equivalent precisely when they permit the same claims and inferences under the same circumstances.

Do these sorts of challenges pose threats to the formal criteria discussed above? Yes and no. On the one hand, there is clearly something right about Coffey and Nguyen’s arguments, and the earlier arguments of Sklar (1982), that any adequate account of theoretical equivalence will need to respect the semantics of physical theories, including what we interpret them to be saying about the world. If, when the dust settles, one wishes to say that two theories make substantially different claims or warrant different inferences in a given situation, one surely would not wish to say they are equivalent. On the other hand, it is not clear how much tension there is between these views and the criteria already discussed. In particular, it bears emphasizing that neither definitional equivalence nor categorical equivalence is a purely formal criterion, at least as applied in the recent literature, insofar as empirical equivalence is also taken as a necessary condition for both, and, as we noted above, specifying the empirical content of a theory is itself a difficult task that connects to (though does not exhaust) issues in the semantics and interpretation of scientific theories.

Where perhaps there is disagreement concerns how to undertake the project of interpreting physical theories in the first place, and whether questions of (formal) equivalence and inequivalence have a role to play in those discussions. If one has the view that interpretation is “easy”, in the sense that one can read off from either the formalism of a theory, or perhaps the formalism plus sociological data about what inferences are drawn from that formalism, what the theory “says” about the world, then presumably there is no need for formal notions of equivalence of theories: one can simply ask, of two theories, whether they “say” the same thing. Interpretational equivalence is the end of the story.

If, on the other hand, one thinks that there are challenges in extracting from a theory’s formalism the theoretical claims that it makes, or if one thinks that it is possible to say or represent the same things in apparently different ways, then even if one endorses the idea that theories are equivalent just in case they have the same interpretation, one might think that formal criteria for equivalence can and do play an important role in establishing what the interpretational options are. (Recall that in the case of categorical equivalence, at least, representing a theory by a category itself raised interpretational questions that may otherwise have been obscure; formulating a first-order formalization of a theory will generally do the same.) On this point, Barrett (2017b, a) has argued that there is a close relationship between one’s attitudes concerning what it means to interpret a theory and when two theories are equivalent, and suggests that exploring different criteria for equivalence is a proxy for exploring different strategies for interpreting physical theories in the first place. Finally, one might argue that interpretational or representational equivalence merely defers the difficult issues, since now one needs to establish when two interpretations are equivalent—and presumably interpretations, like theories, may be expressed in multiple ways, for example by using different languages.

## 6 Duality

The discussion above has concerned issues of theoretical equivalence as they have played out in philosophy of science, particularly over the last four decades. But over the same period, a parallel discussion of the meaning and significance of “distinct but (somehow) equivalent” theories has occurred within physics. Physicists often say that such theories are “dual”, or that pairs of such theories exhibit (or are) a “duality”. Dual descriptions of the same physical situations have become particularly important in the context of string theory, which is an ambitious program to unify gravitational physics with the Standard Model of particle physics (Polchinski, 2017). Still more recently, philosophers of physics have attempted to understand the character and interpretational significance of these dualities (Matsubara, 2013; Read, 2016; Rickles, 2011, 2017; Read and Moller-Nielsen, 2018; De Haro et al., 2016a; De Haro, 2017; De Haro et al., 2016b, 2017; De Haro and Butterfield, 2017; Butterfield, 2019; Le Bihan and Read, 2019). We will not give a complete review of the issues related to dualities here; instead, we will focus on how the literature on dualities relates to the issues of equivalence already introduced. (For a recent review of other philosophical issues related to dualities, see Le Bihan and Read (2019), though as noted above, they use the term “duality” to refer to any empirically equivalent theories, which is somewhat different from the usage adopted here.)

The contemporary notion of duality in physics arguably originates with the “wave-particle” duality discussed by physicists in what has come to be called the “old quantum theory” in the early part of the twentieth century. (Of course, one can also trace the concept back still further.) In that context, the idea was that microphysical systems (those systems described by quantum theory) admit of descriptions as constituted by particles and as constituted by waves. These two descriptions corresponded to distinct conceptual frameworks for understanding the quantum world: on their face, they were incompatible, and yet, there was a sense in which one could, by a change of perspective, reconceive a system in either of these two ways. This notion of duality was also related to Bohr’s ideas about complementarity, according to which there were certain properties of a physical system (position and momentum, say) that could not be simultaneously ascribed (Bokulich, 2017). One could conceive of a system as having a definite position, or as having a definite momentum, but not both.

More recent examples of dualities have had a different character (Polchinski, 2017). In these cases, one has two theories, each of which can be expressed separately, but which stand in some relationship to one another. This is to be contrasted with the wave-particle cases, where one had different “pictures” of the world, the apparent incompatibility of which was ultimately resolved by moving to a new theory—modern quantum mechanics—that had some of the qualitative features associated with each of the two pictures. One did not, however, have a well-defined “wave theory” and a well-defined “particle theory” that were then discovered to stand in some formal relationship to one another. (This situation regarding wave-particle duality should not be conflated with the equivalence of wave mechanics and matrix mechanics, as shown by Dirac (1930) and Von Neumann (1955 [1932]) and discussed in section 1; that example is much closer to the sort of duality relationships considered in contemporary physics.)

For example, consider the famous AdS/CFT correspondence (Maldacena, 1999; De Haro et al., 2016a; De Haro, 2017), also known as gauge-gravity duality. According to this duality, a particular (quantum) theory of gravity, satisfying certain geometrical constraints (namely, in which spacetime is asymptotically anti-de Sitter, hence “AdS”), bears a formal relationship to a (conformal) quantum field theory (the CFT) described on the boundary of the spacetime. (We note: although many physicists take this duality to be well-established, it is not a mathematical theorem, and indeed, it is not entirely clear how to give a precise mathematical statement of the duality.) In this case, one has a theory of gravity in $n$ dimensions that is taken to be “dual” to a theory without gravity in $n-1$ dimensions. The sense of “duality” here is given by a certain translation manual that allows one to take assertions about states and quantities in one theory and associate them with assertions about states and quantities in the other theory, and vice versa, in a way that bears a passing resemblance to definitional equivalence.

Other examples of dualities studied in string theory include T-duality, in which a theory characterized in a certain spatially compact spacetime with radius $R$ is shown to be dual to another theory in a spatially compact spacetime with radius $1/R$ (Giveon et al., 1994; Alvarez et al., 1995); and S-duality, in which one relates a theory with coupling constant $g$ to another theory in which the fields are permuted and the coupling constant becomes $1/g$ (Montonen and Olive, 1977; Seiberg and Witten, 1994; Alvarez-Gaumé and Hassan, 1997). The term “duality” is also used by physicists to describe relationships in, for instance, statistical physics, such as the Kramers-Wannier duality, which relates the free energy of Ising models at different temperatures (Kramers and Wannier, 1941a, b; Wegner, 1971). These cases, like gauge-gravity duality, involve translation manuals for associating states and quantities between the different theories, though it should be noted that the notion of “translation” here does not generally involve precise “definitions”, as in, for instance, definitional equivalence (recall section 3).

Dualities raise a number of questions that have been of interest to philosophers. Following the physicists who seem to take dual theories to be equivalent descriptions of the world, De Haro et al. (2016b, 2017), Rickles (2017), and Dawid (2017) have explored the interpretational consequences of such equivalences. Rickles (2017), for instance, argues that dual theories provide apparently different descriptions of the same physical “structure”, and suggests that any apparent incompatibilities between them should be viewed as illusory or physically insignificant, in the same sense that “gauge structure” may be taken to lack physical significance. He conjectures that given any pair of dual theories, one will always be able to find some third theory that captures the shared structural relationships, without exhibiting any of the apparent inconsistencies. De Haro et al. (2016b, 2017), meanwhile, have emphasized the similarities and dissimilarities between dualities and “gauge” structure.

On these and similar readings, dualities are examples of “equivalent theories” in the wild; they might even be considered as the sorts of test cases that could be used to adjudicate whether various notions of theoretical equivalence discussed by philosophers, such as those described above, are adequate. But this suggestion raises another question. The examples already mentioned form a diverse bestiary, and there are many other dualities described in the literature (e.g., Witten, 1994; Karch and Tong, 2016). In some cases, these dualities are precisely defined mathematical relationships; in other cases, they are conjectured relationships; and in still other cases, they have a yet more uncertain status. Given this situation, it is not clear that “duality” captures a unique or specific mathematical (or conceptual) relationship. Could it be that “duality” captures a range of different ways in which theories could be related?

In this vein, De Haro (2019) has proposed a “schema” for understanding duality, according to which a duality is an isomorphism between two different possible realizations or formulations of a single physical theory (see also De Haro et al., 2016b, 2017; De Haro and Butterfield, 2017). (De Haro and collaborators use the term “model” to refer to these realizations, but we avoid their usage here, since “model” has already been used in a different sense above, and we wish to avoid confusion. We emphasize that a model in De Haro’s sense is more like a “formulation” in the sense used above.) It is essential to this approach that one considers what De Haro calls a “bare” theory (or, a “bare realization” of that theory), which is a purely formal structure without physical interpretation (or, sometimes, with only partial physical interpretation). The reason is that dualities, he argues, do not necessarily preserve interpretation: that is, they may relate realizations of a theory that would generally be taken to make different assertions about the world.

More recently, Butterfield (2019), building on De Haro and Butterfield (2017), has suggested that De Haro’s schema, when applied in some cases, reveals that dualities are not necessarily examples of equivalent theories, at least in the sense that philosophers usually have in mind. The reason is precisely that dualities may not preserve interpretation: De Haro’s schema suggests that dualities may exist between theories that are formally equivalent in some sense (in particular, logically equivalent), but which nonetheless make incompatible claims about the world after all. Observe that this claim is not simply the observation that dual theories may have prima facie different interpretations. It also involves the claim that, once we translate between the theories using whatever manual establishes the existence of the duality, we do not preserve physical meaning. For instance, in the context of T-duality, one should not say that a theory in which space has radius $R$ is equivalent to one in which space has radius $1/R$, because the quantities $R$ and $1/R$ refer to different radii. This view amounts to rejecting Rickles’ suggestion that dual theories reflect the same underlying physical structure.

## 7 Conclusion

I have discussed several senses in which two physical theories might be said to be equivalent, including that they make the same empirical predictions; they are “inter-translatable”; they give rise to equivalent categories of models; and they have the same interpretation. I have also briefly drawn connections between the theoretical equivalence literature in philosophy of science and the literature on dualities in physics. But many open questions remain. From my perspective, the most pressing issues concern (a) understanding the character of the gap between generalized definitional / Morita equivalence and categorical equivalence; (b) understanding in what ways and under what circumstances a category of models adequately captures the structure of a physical theory; and (c) further clarifying the relationships between different senses of “duality” as they arise in the physics literature, and senses of equivalence as they arise in philosophy of science.

## Acknowledgments

I am grateful to Thomas Barrett, Clara Bradley, Jeremy Butterfield, Sebastian De Haro, David Malament, James Nguyen, and Nic Teh for comments and suggestions on previous drafts of this article.

## References

- Alvarez et al. (1995) Alvarez, E., Alvarez-Gaume, L., Lozano, Y., 1995. An introduction to t-duality in string theory. Nuclear Physics B-Proceedings Supplements 41 (1-3), 1–20.
- Alvarez-Gaumé and Hassan (1997) Alvarez-Gaumé, L., Hassan, S., 1997. Introduction to s-duality in n= 2 supersymmetric gauge theories (a pedagogical review of the work of seiberg and witten). Fortschritte der Physik 45 (3-4), 159–236.
- Andréka et al. (2002) Andréka, H., Madarász, J. X., Németi, I., 2002. On the logical structure of relativity theories. Tech. rep., Alfred Rényi Institute of Mathematics, Hungarian Academy of Science, https://old.renyi.hu/pub/algebraic-logic/Contents.html.
- Andréka and Németi (2014) Andréka, H., Németi, I., 2014. Comparing theories: the dynamics of changing vocabulary. In: Johan van Benthem on logic and information dynamics. Springer, pp. 143–172.
- Artigue et al. (1978) Artigue, M., Isambert, E., Perrin, M., Zalc, A., 1978. Some remarks on bicommutability. Fundamenta Mathematicae 101 (3), 207–226.
- Awodey (2006) Awodey, S., 2006. Category Theory. Oxford University Press, New York.
- Awodey and Forssell (2013) Awodey, S., Forssell, H., 2013. First-order logical duality. Annals of Pure and Applied Logic 164 (3), 319–348.
- Baez et al. (2004) Baez, J., Bartel, T., Dolan, J., 2004. Property, structure, and stuff, available at: http://math.ucr.edu/home/baez/qg-spring2004/discussion.html.
- Barrett (2003) Barrett, J. A., 2003. Are our best physical theories (probably and/or approximately) true? Philosophy of Science 70 (5), 1206–1218.
- Barrett (2014) Barrett, T., 2014. On the structure of classical mechanics. The British Journal for the Philosophy of Science 66 (4), 801–828.
- Barrett (2015) Barrett, T. W., 2015. Spacetime structure. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 51, 37–43.
- Barrett (2017a) Barrett, T. W., 2017a. Equivalent and inequivalent formulations of classical mechanics. British Journal for Philosophy of Science. Forthcoming. http://philsci-archive.pitt.edu/13092/.
- Barrett (2017b) Barrett, T. W., 2017b. On the structure and equivalence of theories. Ph.D. thesis, Princeton University, http://arks.princeton.edu/ark:/88435/dsp0112579v91j.
- Barrett (2017c) Barrett, T. W., 2017c. What do symmetries tell us about structure? Forthcoming in Philosophy of Science.
- Barrett and Halvorson (2016a) Barrett, T. W., Halvorson, H., 2016a. Glymour and Quine on theoretical equivalence. Journal of Philosophical Logic 45 (5), 467–483.
- Barrett and Halvorson (2016b) Barrett, T. W., Halvorson, H., 2016b. Morita equivalence. The Review of Symbolic Logic 9 (3), 556–582.
- Barrett and Halvorson (2017) Barrett, T. W., Halvorson, H., 2017. From geometry to conceptual relativity. Erkenntnis 82 (5), 1043–1063.
- Bokulich (2017) Bokulich, P., 2017. Complementarity, wave-particle duality, and domains of applicability. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 59, 136–142.
- Bradley (2018) Bradley, C., 2018. The non-equivalence of einstein and lorentz, unpublished ms.
- Bradley et al. (2019) Bradley, C., Rosenstock, S., Weatherall, J. O., 2019. Structure, stuff, and gauge, unpublished ms.
- Bridgman (1927) Bridgman, P. W., 1927. The Logic of Modern Physics. Macmillan, New York, NY.
- Butterfield (2019) Butterfield, J., 2019. On dualities and equivalences between physical theories. In: Huggett, N., Wüthrich, C. (Eds.), Spacetime After Quantum Gravity. Forthcoming.
- Button and Walsh (2018) Button, T., Walsh, S., 2018. Philosophy and model theory. Oxford University Press, Oxford.
- Carnap (1958) Carnap, R., 1958. Beobachtungssprache und theoretische sprache. Dialectica 12 (3-4), 236–248.
- Carnap (1966) Carnap, R., 1966. Philosophical Foundations of Physics: An Introduction to the Philosophy of Science. Basic Books, New York.
- Coffey (2014) Coffey, K., 2014. Theoretical equivalence as interpretive equivalence, forthcoming from The British Journal for the Philosophy of Science.
- Cruse (2005) Cruse, P., 2005. Ramsey sentences, structural realism and trivial realization. Studies in History and Philosophy of Science 36, 557–576.
- Curiel (2013) Curiel, E., 2013. Classical mechanics is Lagrangian; it is not Hamiltonian. The British Journal for Philosophy of Science 65 (2), 269–321.
- Cushing (1994) Cushing, J., 1994. Quantum Mechanics, Historical Contingency, and the Copenhagen Hegemony. University of Chicago Press, Chicago, IL.
- Dawid (2017) Dawid, R., 2017. String dualities and empirical equivalence. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 59, 21–29.
- de Bouvere (1965a) de Bouvere, K., 1965a. Logical synonymity. Indagationes mathematicae 27, 622–629.
- de Bouvere (1965b) de Bouvere, K., 1965b. Synonymous theories. In: Addison, J. W., Henkin, L., Tarski, A. (Eds.), The theory of models. North-Holland Pub. Co., Amsterdam, pp. 402–406.
- De Haro (2017) De Haro, S., 2017. Dualities and emergent gravity: Gauge/gravity duality. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 59, 109–125.
- De Haro (2019) De Haro, S., 2019. Spacetime and physical equivalence. In: Huggett, N., Wüthrich, C. (Eds.), Spacetime After Quantum Gravity. Forthcoming. arXiv:1707.06581.
- De Haro and Butterfield (2017) De Haro, S., Butterfield, J., 2017. A schema for duality, illustrated by bosonization. arXiv preprint arXiv:1707.06681.
- De Haro et al. (2016a) De Haro, S., Mayerson, D. R., Butterfield, J. N., 2016a. Conceptual aspects of gauge/gravity duality. Foundations of Physics 46 (11), 1381–1425.
- De Haro et al. (2016b) De Haro, S., Teh, N., Butterfield, J., 2016b. On the relation between dualities and gauge symmetries. Philosophy of Science 83 (5), 1059–1069.
- De Haro et al. (2017) De Haro, S., Teh, N., Butterfield, J., 2017. Comparing dualities and gauge symmetries. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 59, 68–80.
- Dewar (2018) Dewar, N., 2018. Ramsey equivalence. Erkenntnis. Forthcoming.
- Dewar and Eva (2017) Dewar, N., Eva, B., 2017. A categorical perspective on symmetry and equivalence.
- Dirac (1930) Dirac, P. A. M., 1930. The Principles of Quantum Mechanics. Oxford University Press, Oxford.
- Downes (1992) Downes, S. M., 1992. The importance of models in theorizing: A deflationary semantic view. In: PSA: Proceedings of the biennial meeting of the philosophy of science association. Vol. 1992. Philosophy of Science Association, pp. 142–153.
- Dürr et al. (2012) Dürr, D., Goldstein, S., Zanghì, N., 2012. Quantum physics without quantum philosophy. Springer Science & Business Media, Heidelberg.
- Giveon et al. (1994) Giveon, A., Porrati, M., Rabinovici, E., 1994. Target space duality in string theory. Physics Reports 244 (2-3), 77–202.
- Glymour (1970) Glymour, C., 1970. Theoretical equivalence and theoretical realism. PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association 1970, 275–288.
- Glymour (1980) Glymour, C., 1980. Theory and Evidence. Princeton University Press, Princeton, NJ.
- Glymour (2013) Glymour, C., 2013. Theoretical equivalence and the semantic view of theories. Philosophy of Science 80 (2), 286–297.
- Grünbaum (1962) Grünbaum, A., 1962. Geometry, chronometry and empiricism, 405–526.
- Halvorson (2012) Halvorson, H., 2012. What scientific theories could not be. Philosophy of Science 79 (2), 183–206.
- Halvorson (2013) Halvorson, H., 2013. The semantic view, if plausible, is syntactic. Philosophy of Science 80 (3), 475–478.
- Halvorson (2015a) Halvorson, H., 2015a. Categories of scientific theories. In: Landry, E. (Ed.), Categories for the Working Philosopher. Oxford University Press, Oxford, UK, this volume.
- Halvorson (2015b) Halvorson, H., 2015b. Scientific theories. In: Humphreys, P. (Ed.), The Oxford Handbook of the Philosophy of Science. Oxford University Press, Oxford, UK. Forthcoming. http://philsci-archive.pitt.edu/11347/.
- Halvorson and Tsementzis (2017) Halvorson, H., Tsementzis, D., 2017. Categories of scientific theories. In: Landry, E. (Ed.), Categories for the working philosopher. Oxford University Press, Oxford, pp. 402–429.
- Hempel (1958) Hempel, C. G., 1958. The theoretician’s dilemma: A study in the logic of theory construction. In: Feigl, H., Scriven, M., Maxwell, G. (Eds.), Concepts, Theories, and the Mind-Body Problem. University of Minnesota Press, Minneapolis, MN, pp. 37–98.
- Hempel (1973) Hempel, C. G., 1973. The meaning of theoretical terms: A critique of the standard empiricist construal. In: Suppes, P., Henkin, L., Joja, A., Moisil, G. C. (Eds.), Logic, Methodology and Philosophy of Science IV. North-Holland Publishing Co., Amsterdam, pp. 367–378.
- Hertz (1899 [1894]) Hertz, H., 1899 [1894]. The Principles of Mechanics Presented in a New Form. MacMillan & Co., London.
- Hodges (1993) Hodges, W., 1993. Model theory. Vol. 42. Cambridge University Press.
- Hodges (1997) Hodges, W., 1997. A shorter model theory. Cambridge university press, Cambridge, UK.
- Hudetz (2017) Hudetz, L., 2017. The semantic view of theories and higher-order languages. Synthese, 1–19.
- Hudetz (2018) Hudetz, L., 2018. Definable categorical equivalence. Philosophy of Science. Forthcoming. http://philsci-archive.pitt.edu/14297/.
- Kanger (1968) Kanger, S., 1968. Equivalent theories. Theoria 34 (1), 1–6.
- Karch and Tong (2016) Karch, A., Tong, D., 2016. Particle-vortex duality from 3d bosonization. Physical Review X 6 (3), 031043.
- Ketland (2004) Ketland, J., 2004. Empirical adequacy and ramsification. The British Journal for the Philosophy of Science 55 (2), 287–300.
- Kramers and Wannier (1941a) Kramers, H. A., Wannier, G. H., 1941a. Statistics of the two-dimensional ferromagnet. part i. Physical Review 60 (3), 252.
- Kramers and Wannier (1941b) Kramers, H. A., Wannier, G. H., 1941b. Statistics of the two-dimensional ferromagnet. part ii. Physical Review 60 (3), 263.
- Le Bihan and Read (2019) Le Bihan, B., Read, J., 2019. Duality and ontology. Philosophy Compass. Forthcoming.
- Lefever and Székely (2017) Lefever, K., Székely, G., 2017. Comparing classical and relativistic kinematics in first-order logic, arXiv:1707.05371.
- Lefever and Székely (2018) Lefever, K., Székely, G., 2018. On generalization of definitional equivalence to languages with non-disjoint signatures, arXiv:1802.06844.
- Leinster (2014) Leinster, T., 2014. Basic Category Theory. Cambridge University Press, Cambridge.
- Lewis (1970) Lewis, D., 1970. How to define theoretical terms. Journal of Philosophy 67 (13), 427–446.
- Lewis (1972) Lewis, D., 1972. Psychophysical and theoretical identifications. Australasian Journal of Philosophy 50 (3), 249–258.
- Lutz (2014) Lutz, S., 2014. What’s right with a syntactic approach to theories and models? Erkenntnis 79 (8), 1475–1492.
- Lutz (2017) Lutz, S., 2017. What was the syntax-semantics debate in the philosophy of science about? Philosophy and Phenomenological Research 95 (2), 319–352.
- Mac Lane (1998) Mac Lane, S., 1998. Categories for the Working Mathematician, 2nd Edition. Springer, New York.
- Madarász (2002) Madarász, J. X., 2002. Logic and relativity (in the light of definability theory). Ph.D. thesis, Eötvös Loránd University, Budapest.
- Magnus and Frost-Arnold (2010) Magnus, P., Frost-Arnold, G., 2010. The identical rivals response to underdetermination. In: Magnus, P., Busch, J. (Eds.), New Waves in Philosophy of Science. Palgrave Macmillan, London, pp. 112–130.
- Makkai (1993) Makkai, M., 1993. Duality and definability in first order logic. Vol. 503. American Mathematical Soc., Providence, RI.
- Malament (1995) Malament, D., 1995. Is Newtonian cosmology really inconsistent? Philosophy of Science 62 (4), 489–510.
- Malament (2012) Malament, D., 2012. Topics in the Foundations of General Relativity and Newtonian Gravitation Theory. University of Chicago Press, Chicago.
- Maldacena (1999) Maldacena, J., 1999. The large-n limit of superconformal field theories and supergravity. International journal of theoretical physics 38 (4), 1113–1133.
- Matsubara (2013) Matsubara, K., 2013. Realism, underdetermination and string theory dualities. Synthese 190 (3), 471–489.
- Maudlin (2018) Maudlin, T., 2018. Ontological clarity via canonical presentation: Electromagnetism and the aharonov–bohm effect. Entropy 20 (6), 465.
- Maxwell (1962) Maxwell, G., 1962. The ontological status of theoretical entities. In: Feigl, H., Maxwell, G. (Eds.), Scientific Explanation, Space, and Time. University of Minnesota Press, Minneapolis, MN, pp. 3–14.
- Maxwell (1970) Maxwell, G., 1970. Structural realism and the meaning of theoretical terms. In: Winokur, S., Radner, M. (Eds.), Analyses of Theories and Methods of Physics and Psychology. University of Minnesota Press, Minneapolis, pp. 181–192.
- Melia and Saatsi (2006) Melia, J., Saatsi, J., 2006. Ramseyfication and theoretical content. The British Journal for the Philosophy of Science 57 (3), 561–585.
- Montague (1957) Montague, R., 1957. Contributions to the axiomatic foundations of set theory. Ph.D. thesis, University of California, Berkeley.
- Montonen and Olive (1977) Montonen, C., Olive, D., 1977. Magnetic monopoles as gauge particles? Physics Letters B 72 (1), 117–120.
- Muller (1997a) Muller, F. A., 1997a. The equivalence myth of quantum mechanics-part i. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 28 (1), 35–61.
- Muller (1997b) Muller, F. A., 1997b. The equivalence myth of quantum mechanics-part ii. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 28 (2), 219–247.
- Nguyen (2017) Nguyen, J., 2017. Scientific representation and theoretical equivalence. Philosophy of Science 84 (5), 982–995.
- Nguyen et al. (2018) Nguyen, J., Teh, N. J., Wells, L., 2018. Why surplus structure is not superfluous. British Journal for Philosophy of Science. Forthcoming.
- North (2009) North, J., 2009. The ‘structure’ of physics: A case study. Journal of Philosophy 106 (2), 57–88.
- Norton (1992) Norton, J., 1992. A paradox in Newtonian gravitation theory. PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association 1992, 412–420.
- Norton (1995) Norton, J., 1995. The force of Newtonian cosmology: Acceleration is relative. Philosophy of Science 62 (4), 511–522.
- Norton (2008) Norton, J., 2008. Must evidence underdetermine theory. In: Kourany, J. A., Carrier, M., Howard, D. (Eds.), The challenge of the social and the pressure of practice: Science and values revisited. University of Pittsburgh Press, Pittsburgh, PA, pp. 17–44.
- O’Connor and Weatherall (2016) O’Connor, C., Weatherall, J. O., 2016. Black holes, black-scholes, and prairie voles: An essay review of simulation and similarity, by michael weisberg. Philosophy of Science 83 (4), 613–626.
- Polchinski (2017) Polchinski, J., 2017. Dualities of fields and strings. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 59, 6–20.
- Psillos (2000) Psillos, S., 2000. Carnap, the ramsey-sentence and realistic empiricism. Erkenntnis 52 (2), 253–279.
- Quine (1975) Quine, W. V., 1975. On empirically equivalent systems of the world. Erkenntnis 9 (3), 313–328.
- Ramsey (1931) Ramsey, F. P., 1931. The Foundations of Mathematics. Routledge & Kegan Paul, London, UK, Ch. Theories, pp. 212–236.
- Read (2016) Read, J., 2016. The interpretation of string-theoretic dualities. Foundations of Physics 46 (2), 209–235.
- Read and Moller-Nielsen (2018) Read, J., Moller-Nielsen, T., 2018. Motivating dualities. Synthese. Forthcoming.
- Reichenbach (1938) Reichenbach, H., 1938. Experience and prediction: An analysis of the foundations and the structure of knowledge. University of Chicago Press, Chicago, IL.
- Rickles (2011) Rickles, D., 2011. A philosopher looks at string dualities. Studies in History and Philosophy of Modern Physics 42 (1), 54–67.
- Rickles (2017) Rickles, D., 2017. Dual theories: ‘same but different’ or ‘different but same’? Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 59, 62–67.
- Rosenstock et al. (2015) Rosenstock, S., Barrett, T. W., Weatherall, J. O., 2015. On Einstein algebras and relativistic spacetimes. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 52, 309–316.
- Rosenstock and Weatherall (2016) Rosenstock, S., Weatherall, J. O., 2016. A categorical equivalence between generalized holonomy maps on a connected manifold and principal connections on bundles over that manifold. Journal of Mathematical Physics 57 (10), 102902.
- Rynasiewicz (1992) Rynasiewicz, R., 1992. Rings, holes and substantivalism: On the program of leibniz algebras. Philosophy of Science 59 (4), 572–589.
- Salmon (1966) Salmon, W. C., 1966. Verifiability and logic. In: Feyerabend, P. K., Maxwell, G. (Eds.), Mind, matter and method: essays in philosophy and science in honor of Herbert Feigl. University of Minnesota Press, Minneapolis, MN, pp. 354–366.
- Schreiber (2013) Schreiber, U., 2013. Differential cohomology in a cohesive infinity-topos, arXiv:1310.7930.
- Schreiber and Waldorf (2007) Schreiber, U., Waldorf, K., 2007. Parallel transport and functors, arXiv:0705.0452.
- Seiberg and Witten (1994) Seiberg, N., Witten, E., 1994. Electric-magnetic duality, monopole condensation, and confinement in n= 2 supersymmetric yang-mills theory. Nuclear Physics B 426 (1), 19–52.
- Sklar (1982) Sklar, L., 1982. Saving the noumena. Philosophical Topics 13 (1), 89–110.
- Trautman (1965) Trautman, A., 1965. Foundations and current problem of general relativity. In: Deser, S., Ford, K. W. (Eds.), Lectures on General Relativity. Prentice-Hall, Englewood Cliffs, NJ, pp. 1–248.
- Univalent Foundations Program (2013) Univalent Foundations Program, T., 2013. Homotopy Type Theory: Univalent Foundations of Mathematics. https://homotopytypetheory.org/book, Institute for Advanced Study.
- Van Fraassen (2014) Van Fraassen, B. C., 2014. One or two gentle remarks about hans halvorson’s critique of the semantic view. Philosophy of Science 81 (2), 276–283.
- Visser (2017) Visser, A., 2017. Categories of theories and interpretations. In: Enayat, A., Kalantari, I., Moniri, M. (Eds.), Logic in Tehran. Cambridge University Press, Cambridge, UK, pp. 284–341.
- Von Neumann (1955 [1932]) Von Neumann, J., 1955 [1932]. Mathematical foundations of quantum theory. Princeton University Press.
- Wallace (2017) Wallace, D., 2017. More problems for Newtonian cosmology. Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 57, 35–40.
- Weatherall (2016a) Weatherall, J. O., 2016a. Are Newtonian gravitation and geometrized Newtonian gravitation theoretically equivalent? Erkenntnis 81 (5), 1073–1091.
- Weatherall (2016b) Weatherall, J. O., 2016b. Understanding gauge. Philosophy of Science 83 (5), 1039–1049.
- Weatherall (2017) Weatherall, J. O., 2017. Category theory and the foundations of classical space-time theories. In: Landry, E. (Ed.), Categories for the Working Philosopher. Oxford University Press, Oxford, pp. 329–348.
- Weatherall (2018) Weatherall, J. O., 2018. Why not categorical equivalence?, unpublished ms.
- Wegner (1971) Wegner, F. J., 1971. Duality in generalized Ising models and phase transitions without local order parameters. Journal of Mathematical Physics 12 (10), 2259–2272.
- Weisberg (2012) Weisberg, M., 2012. Simulation and similarity: Using models to understand the world. Oxford University Press, New York.
- Witten (1994) Witten, E., 1994. Non-Abelian bosonization in two dimensions. In: Bosonization. World Scientific, pp. 201–218.
- Worrall (2007) Worrall, J., 2007. Miracles and models: Why reports of the death of structural realism may be exaggerated. Royal Institute of Philosophy Supplements 82 (61), 125–154.