Information equilibrium as an economic principle
A general information equilibrium model in the case of ideal information transfer is defined and then used to derive the relationship between supply (information destination) and demand (information source) with the price as the detector of information exchange between demand and supply. We recover the properties of the traditional economic supply-demand diagram. Information equilibrium is then applied to macroeconomic problems, recovering some common macroeconomic models in particular limits like the AD-AS model, IS-LM model (in a low inflation limit), the quantity theory of money (in a high inflation limit) and the Solow-Swan growth model. Information equilibrium results in empirically accurate models of inflation and interest rates, and can be used to motivate a “statistical economics”, analogous to statistical mechanics for thermodynamics.
Keywords: Information theory, macroeconomics, microeconomics
Journal of Economic Literature Classification: C00, E10, E30, E40.
In the natural sciences, complex non-linear systems composed of large numbers of smaller subunits provide an opportunity to apply the tools of statistical mechanics and information theory. From this intuition Lee Smolin (2009) suggested a new discipline of statistical economics to study the collective behavior of economies composed of large numbers of economic agents.
A serious impasse to this approach is the lack of well-defined or even definable constraints enabling the use of Lagrange multipliers, partition functions and the machinery of statistical mechanics for systems away from equilibrium or for non-physical systems. The latter – in particular economic systems – lack e.g. fundamental conservation laws like the conservation of energy to form the basis of these constraints. In order to address this impasse, Fielitz and Borchardt (2014) introduced the concept of natural information equilibrium. They produced a framework based on information equilibrium and showed it was applicable to several physical systems. The present paper seeks to apply that framework to economic systems.
The idea of applying mathematical frameworks used in the physical sciences to economic systems is an old one; even the idea of applying principles from thermodynamics is an old one. Willard Gibbs – who coined the term “statistical mechanics” – supervised Irving Fisher’s thesis [Fisher (1892)] in which he applied a rigorous approach to economic equilibrium. Samuelson later codified the Lagrange multiplier approach to utility maximization commonly used in economics today.
The specific thrust of Fielitz and Borchardt (2014) is that it looks at how far you can go with the maximum entropy or information theoretic arguments without having to specify constraints. This refers to partition function constraints optimized with the use of Lagrange multipliers. In thermodynamics language it’s a little more intuitive: basically the information transfer model allows you to look at thermodynamic systems without having defined a temperature (Lagrange multiplier) and without having the related constraint (that the system observables have some fixed value, i.e. equilibrium).
A word of caution before proceeding; the term “information” is somewhat overloaded across various technical fields. Our use of the word information differs from its more typical usage in economics, such as in “information economics” or “perfect information” in game theory. Instead of focusing on a board position in chess, we are assuming all possible board positions (even potentially some impossible ones such as those including three kings). The definition of information we use is the definition required when specifying a random chess board out of all possible chess positions, and it comes from Hartley and Shannon. It is a quantity measured in bits (or nats), and has a direct connection to probability. As stated in Shannon (1949), “information must not be confused with meaning”.
This is in contrast to Akerlof information asymmetry, for example, where knowledge (meaningful information) of the quality of a vehicle is better known to the seller than the buyer. We can see that this is a different use of the term information – how many bits this quality score requires to store (and hence how many available ‘quality states’ there are) is irrelevant to Akerlof’s argument. The perfect information in a chess board represents bits; this quantity is irrelevant in an analysis of chess strategies in game theory (except as a practical limit to computation of all possible chess moves).
We propose the idea that information equilibrium should be used as a guiding principle in economics and organize this paper as follows. We will begin in Section 2 by introducing and deriving the primary equations of the information equilibrium framework, and proceed to show how the information equilibrium framework can be understood in terms of the general market forces of supply and demand. This framework will also provide a definition of the regime where market forces fail to reach equilibrium through information loss.
Since the framework itself is agnostic about the goods and services sold or the behaviors of the relevant economic agents, the generalization from widgets in a single market to an economy composed of a large number of markets is straightforward. We will describe macroeconomics in Section 3, and demonstrate the effectiveness of the principle of information equilibrium both empirically an in derivations of standard macroeconomic models. In particular we will address the price level and the labor market where we show that information equilibrium leads to well-known stylized facts in economics. The quantity theory of money will be shown to be an approximation to information equilibrium when inflation is high, and Okun’s law will be shown to follow from information equilibrium. Lastly, we establish in Section 4 an economic partition function, define a concept of economic entropy and discuss how nominal rigidity and the so-called liquidity trap in Krugman (1998) may be best understood as entropic forces for which there are no microfoundations.
2 Information equilibrium
We will describe the economic laws of supply and demand as the result of an information transfer model. Much of the description of the information transfer model follows Fielitz and Borchardt (2014). Following Shannon (1948) we have a system that transfers information
We will follow Fielitz and Borchardt (2014) and use the Hartley definition
Let us define which we will call the information transfer index and rearrange so that
Compared to Fielitz and Borchardt (2014), we have changed some of the notation, e.g. becomes . We have set up the condition required by information theory for a signal measured by the stick of length to be received as a signal and measured by a stick of length . These signals will contain the same amount of information if .
Now we define a process signal detector that relates the process source signal emitted from the process source to a process destination signal that is detected at the process destination and delivers an output value:
If our source and destination are large compared to our signals () we can take , we can re-arrange the information transfer condition:
In the following, we will use the notation
2.1 Supply and demand
At this point we will take our information transfer process and apply it to the generic economic problem of supply and demand. We will drop the absolute values and use positive quantities. In that case, we will identify the information transfer process source as the demand , the information transfer process destination as the supply , and the process signal detector as the price . The price detector relates the demand signal emitted from the demand to a supply signal that is detected at the supply and delivers a price . We translate Condition 1 in Fielitz and Borchardt (2014) for the applicability of our information theoretical description into the language of supply and demand:
Condition 1: The considered economic process can be sufficiently described by only two independent process variables (supply and demand: ) and is able to transfer information.
We are now going to solve the differential equation 2.3. But first we assume ideal information transfer such that:
Note that Eq. (2.4) represents movement of the supply and demand curves where is a “floating-restriction” information source in the language of Fielitz and Borchardt (2014), as opposed to movement along the supply and demand curves where is a “constant-restriction information source”, again in the language of Fielitz and Borchardt (2014). The differential equation (2.5) can be solved by integration
and we can then solve for the price using Eq. (2.4)
These equations represent the general equilibrium solution where and change in response to each other.
If we hold the information source or destination effectively constant, responding only slowly to changes in the other variable, we can describe ‘partial equilibrium’ solutions that will lead to supply and demand diagrams. We will take to be a constant-restriction information source in the language of Fielitz and Borchardt (2014) and integrate the differential equation Eq. (2.5)
Analogously, we can define a supply curve by using a constant information destination and follow the above procedure to find:
plus terms of order such that
where , , and . This recovers a simple linear model of supply and demand (where you could add a time dependence to the price e.g. to produce a simple dynamic model).
We can explicitly show the supply and demand curves using equations (2.14,2.15) and (2.16, 2.17) and plotting price vs change in quantity or in Figure 2. In the figure we also show a shift in the supply curve (red) to the right. The new (lower) equilibrium price is the intersection of the new displaced supply curve and the unchanged demand curve.
from which we could measure the information transfer index .
There is a third way to solve Eq. (2.5) where both supply and demand are considered to vary slowly (i.e. be approximately constant). In that case the integral becomes
If we define
solving the integral shows us that the price is also constant
2.2 Physical analogy with ideal gases
In the original paper, Fielitz and Borchardt (2014) use the information transfer model to build the ideal gas law. This specific application gives us some analogies that are useful. In the model we have
the pressure is the price , volume is the supply and the energy
2.3 Alternative motivation
We would like to provide an alternative and more macro- and micro-economic motivation of Eq. (2.5) rooted in two economic principles: homogeneity of degree zero and marginalism. For example, according to Bennett McCallum (2004), the quantity theory of money (QTM) is the macroeconomic observation that the economy obeys long run neutrality of money which is captured in the assumption of homogeneity constraints. In particular, supply and demand functions will be homogeneous of degree zero, i.e. ratios of to such that if and then . The simplest differential equation
Fisher (1892) looks at the exchange of some number of gallons of for some number of bushels of and states: “the last increment is exchanged at the same rate for as was exchanged for ″. Fisher writes this as an equation on page 5:
Fisher notes that this marginalist argument was introduced by both Jevons and Marshall. Of course it is generally false. Many goods exhibit economies of scale, fixed costs or other effects so that either the last increments of and are cheaper (e.g. software) or more expensive (e.g. oil) than the first increments. The simplest way to account for this is by multiplying one side of Eq. (2.22) by a constant. Thus we can say using information equilibrium as an economic principle enforces a generalized marginal thinking. The information equilibrium approach can also be interpreted as an application of information theory to Irving Fisher’s measuring stick.
Since the information equilibrium framework depends on a large number of states for the information source and destination, it ostensibly would be better applied to the macroeconomic problem. Below we make a connection to some classic macroeconomic toy models and a macroeconomic relationship: AD-AS model, Okun’s law, the IS-LM model, the Solow growth model, and the quantity theory of money. A summary of the models described in Section 3 appears in Appendix A. The details of the Mathematica codes used to fit the parameters are provides in Appendix B.
3.1 AD-AS model
The AD-AS model uses the price level as the detector, aggregate demand (NGDP) as the information source and aggregate supply as the destination, or , which immediately allows us to write down the aggregate demand and (short run) aggregate supply (SRAS) curves for the case of partial equilibrium.
Positive shifts in the aggregate demand curve raise the price level along with negative shifts in the supply curve. Traveling along the aggregate demand curve lowers the price level (more aggregate supply at constant demand).
The long run aggregate supply (LRAS) curve would be vertical in Figure 3 representing the general equilibrium solution
with price .
Another interesting result in this model is that it can be used to illuminate the role of money in macroeconomics as a tool of information mediation. If we start with the AD-AS model information equilibrium condition
we can in general make the following transformation using a new variable (i.e. money):
If we take to be in information equilibrium with the intermediate quantity , which is in information equilibrium with , i.e.
then we can use the information equilibrium condition
to show that equation (3.1) can be re-written
where we have defined . The solution to the differential equation (3.5) defines a quantity theory of money where the price level goes as
We will discuss this more in Section 3.4 on the price level and inflation.
3.2 Labor market and Okun’s law
The description of the labor market uses the price level as the detector, aggregate demand as the information source and total hours worked
Re-arranging and taking the logarithmic derivative of both sides:
where is RGDP. The total hours worked (or total employed ) fluctuates with the change in RGDP growth. This is one form of Okun’s law, from Okun (1962). The model is shown in Figure 4. The model parameters are listed in Appendix A.
3.3 IS-LM and interest rates
The classical Hicksian Investment-Savings Liquidity-Money Supply (IS-LM) model uses two markets along with an information equilibrium relationship. Let be the price of money in the money market (LM market) where is aggregate demand and is the money supply. We have:
We assume that the interest rate is in information equilibrium with the price of money , so that we have the information equilibrium relationship (no need to define a detector at this point). Therefore the differential equation is:
with solution (we will not need the additional constants or ):
And we can write:
Already this is fairly empirically accurate as we can see in Figure 5.
We can now rewrite the money (LM) market and add the goods (IS) market as coupled markets with the same information source (aggregate demand) and same detector (interest rate, directly related to – i.e. in information equilibrium with – the price of money):
where is the aggregate supply. Changes in the LM market manifest as increases in the money supply as well as shifts in the information source , so we write the LM curve as a demand curve Eqs. (2.14, 2.14) with shifts:
The IS curve can be straight-forwardly be written as the demand curve in the IS market:
This model assumes that does not move strongly with , so only applies to a low inflation scenario. For high inflation, acquires a strong dependence on and the quantity theory of money in Section 3.4 becomes a more accurate description.
Long and short term interest rates
The short term interest rate is empirically given by the same model with the same parameters (see Fig. 8); the difference is that the full monetary base including central bank reserves is used instead of just the currency component. These are FRED (2015) series AMBSL (call this variable ) and MBCURRCIR, respectively. The full market for the long and short term interest rates would be:
where and (i.e. the parameters for both models are the same).
The theoretical reason both the long and short term interest rate are given by the same model simply by exchanging currency (monetary base minus reserves) for the full monetary base (including reserves) is not immediately obvious. As the relationship was observed in empirical data, we can only provide a hand-waving argument based on the properties of central bank reserves (which are purely electronic) as opposed to currency which manifests as physical pieces of paper. Reserves may be seen as temporary by the market - they only exist in the short run. Therefore they need to be included as part of the supply of so-called high powered money for short term interest rates. Physical currency in circulation may be seen as more permanent by the market, and therefore represent the proper supply of high powered money for long term interest rates. This argument is speculative and involves the expected path of the monetary base, something not empirically measurable.
Assumptions in the IS-LM model
One useful property of the information equilibrium approach is that is makes explicit several assumptions in the IS-LM model.
It is a partial equilibrium model and we use the partial equilibrium solutions to the information equilibrium equation Eq. (2.3).
No distinction is made between real and nominal quantities (all quantities are treated as nominal). Since we have partial equilibrium, is assumed to be slowly varying which implies that if , must be slowly varying unless and conspire to make slowly varying.
If the price of money is scaled by a constant factor , the only change to the model is a change in the value of the constant .
3.4 Price level and inflation
Let us begin our discussion of the price level with the market described in the AD-AS model in section 3.1 with being NGDP, the information source, and being the monetary base minus reserves and return to the differential equation (2.3). Assuming ideal information transfer we have
Let us allow to be a slowly varying function of and , i.e.
We can approximately solve the differential equation (3.13) by integration such that
so that, using Eq. (3.13) again, we obtain the price level as a function of and
where is an arbitrary constant (because the normalization of the price level is arbitrary).
Now the information transfer index is related to the number of symbols , used by the information source and information destination, specifically:
Let us posit a simple model where and are proportional to and
where we have introduced the new
for . The rationale for introducing such a model for a changing information transfer index is that the units of and are the same: the national unit of account. Therefore the information content of e.g. $ 1 billion of nominal output depends on the size of the monetary base – and vice versa, and so we should expect . However, we will see in Section 4 that this functional form is a good approximation to the case where we consider markets with a distribution of constant values of , meaning effectively describes emergent properties of the macroeconomy. There is an additional benefit of introducing this functional form and constant that may assist in cross-national comparisons that we discuss in Appendix C.
The full price level model is
with free dimensionless parameters and along with , which has dimensions of currency. If we fit these parameters using data from FRED (2015) for being so-called core price level of Personal Consumption Expenditures (PCE price level, less food and energy, series PCEPILFE), being nominal gross domestic product (series GDP), and being the currency component of the monetary base (series MBCURRCIR), performing a LOESS smoothing (of order 2, with smoothing parameter 1.0, see Appendix B) on the inputs and we arrive at Figures 9 and 10. The empirical accuracy of the model is on the order of the model of Hallman (1989) (see Appendix A for fit parameters).
If we look at Eq. (3.24) we can see that when , we have
so that price level grows proportionally with the monetary base, the essence of the quantity theory of money. Additionally, when we have, using Eq. (3.19),
If we use the fact that . If we take and to be exponentially growing with growth rates and (i.e. ), respectively, . In general, we have (introducing the inflation rate )
Defining a real growth rate , then for large we have
which implies large means high inflation. In contrast, means that . When , the IS-LM model becomes a better approximation since changes in do not result in strong changes in the price level since . We will discuss this more in Section 4.1.
3.5 Solow-Swan growth model
Let us assume two markets and :
The economics rationale for equations (3.32) are that the left hand sides are the marginal productivity of capital/labor which are assumed to be proportional to the right hand sides – the productivity per unit capital/labor. In the information transfer model, the relationship follows from a model of aggregate demand sending information to aggregate supply (capital and labor) where the information transfer is “ideal” i.e. no information loss. The solutions are:
and therefore we have
Equation (3.34) is the generic Cobb-Douglas form. In the information equilibrium model, the exponents are free to take on any value (not restricted to constant returns to scale, i.e. ). The resulting model is remarkably accurate as seen in Figure 11.
It also has no changes in so-called total factor productivity ( is constant). The results above use nominal capital and nominal GDP rather than the usual real capital and real output (RGDP, ). We use the FRED (2015) data series RKNANPUSA666NRUG for the real capital stock (capital stock at constant prices) and inflate to nominal capital stock via CPI less food and energy (CPILFESL).
Let us assume two additional information equilibrium relationships with capital being the information source and investment and depreciation (include population growth in here if desired) being information destinations. In the notation we have been using: and . This immediately leads to the solutions of the differential equation Eq. (2.5):
Therefore we have (the first relationship coming from the Cobb-Douglas production function)
If and we recover the original Solow model, but in general any allows there to be an equilibrium. Figure 12 represents a generic plot of the relationships above.
Assuming the relationships and hold simultaneously gives us the equilibrium value of :
This equilibrium value represents simultaneous information equilibrium in the two markets and . Fluctuations in the value of capital away from will experience an entropic force to return , so the equilibrium would be stable. Entropic forces will be discussed in more detail in Section 4.1.
As a side note, the small region in Figure 12 does not appear because it is not a valid region of the model. The information equilibrium model is not valid for small values of (or any process variable). That allows one to choose parameters for investment and depreciation that could be e.g. greater than output for small – a nonsense result in the traditional Solow model, but just an invalid region of the model in the information equilibrium framework. Another useful observation is that and have a supply and demand relationship in partial equilibrium with capital being demand and investment being supply since by transitivity (see Appendix D) they are in information equilibrium (i.e. ).
There might be more to the information equilibrium picture of the Solow model than just the basic mechanics – in particular we might be able to analyze dynamics of the savings rate relative to demand shocks. We have built the model:
Where is output, is capital and is investment. Since information equilibrium is an equivalence relation (see Appendix D), we have the model:
with abstract price . If we write down the differential equation resulting from that model
There are a few things we can glean from this that are described below using general equilibrium, partial equilibrium, and making a connection to interest rates.
General equilibrium in the Solow model
We can solve equation (3.35) under general equilibrium giving us . Empirically, we have . Combining that with the results from the Solow model, we have
which tells us that – one of the conditions that gave us the original Solow model result.
Partial equilibrium in the Solow model
Since we have a supply and demand relationship between output and investment in partial equilibrium. We can use equation (3.35) and to write
where we have defined the saving rate as to be (the inverse of) the abstract price in the investment market. A shock to aggregate demand would be associated in a fall in the abstract price and thus a rise in the savings rate. Overall, an economy does not always have pure supply or demand shocks, so there might be some deviations from a pure demand shock view. In particular, a “supply shock” (investment shock) should lead to a fall in the savings rate.
Interest rates in the Solow model
If we add the IS-LM model from section 3.3 to include the interest rate () model using written in terms of investment and the money supply/base money:
where is the abstract price of money (which is in IE with the interest rate), we have a pretty complete model of economic growth that combines the Solow model with the IS-LM model. The interest rate model in Figure 5 joins the empirically accurate Cobb-Douglas production function in this section Figure 11.
3.6 A note on constructing models
In the previous sections we have used simultaneous markets that look formally the same. However, they are interpreted differently:
In the Solow-Swan model, we used and to define the production function. These are taken to be independent equations in general equilibrium. The represent two channels of information flow to two destinations as shown in Figure 14.
In the Solow-Swan model, we also used and . These were taken to be simultaneous equations in general equilibrium. The information transfer figure would look like a single channel with two destinations as shown in Figure 13.
In the IS-LM model, we used and . These are taken to be simultaneous equations in partial equilibrium (i.e. moves slowly). The information transfer figure would look like the single channel Solow-Swan model diagram in Figure 13.
4 Statistical economics
Analogies between physics and economics only have merit inasmuch as they are useful. In this section we will take some initial steps toward defining the “statistical economics” of Smolin (2009) analogous to statistical mechanics. Consider a collection of individual market information sources . In the following we will work in “natural units” and take and . The are the demands in the individual markets and is the money supply (it does not matter which aggregate at this point). The individual markets are the solutions to the equations:
following from the introduction of the money-mediated information transfer model as was shown in Section 3.1. One interesting thing is that the defining quality of these individual markets – equation (4.1) leads to supply and demand diagrams – is homogeneity of degree zero in the supply and demand functions (as noted in Section 2.3), which is one of the few properties that survive aggregation in the Sonnenschein-Mantel-Debreu theorem.
Now consider the sum (defining aggregate nominal output or NGDP across all the markets)
This has a form similar to a partition function
Proceeding by analogy, we will define the macroeconomic partition function to be:
With this partition function, the ensemble average (or expectation value, denoted with angle brackets) of the exponent is:
which corresponds to the aggregate information transfer index . Additionally, the nominal economy will be the number of markets times the ensemble average of an individual market , i.e.
Equation (4.7) simplifies when () to
First there is an interesting new analogy with thermodynamics: is playing the role of , the Lagrange multiplier (thermodynamic temperature). As gets larger the states with higher (high growth markets) become less probable, meaning that a large economy (with a large money supply) is more like a cold thermodynamic system. The meaning of large here is measured by . As an economy grows, it cools, which leads to slower growth – going by the terms “the great stagnation” in Cowen (2011) or “secular stagnation” in Summers (2013) – and as we shall see a bending of the price level vs money curve (low inflation in economies with large money supplies).
Let us take 100 random markets with normally distributed with average and standard deviation and plot 500 Monte Carlo runs of the information transfer index , the price level and the nominal output .
In Figure 15 we can see the economies start out well described by the quantity theory () and move towards lower as the money supply increases. We can see the bending of the price level versus money supply in Figure 16. In Figure 17, we can see the trend towards lower growth relative to the growth in the money supply.
The question now is: how well does this oversimplified picture work with real data? After normalizing the price level and scaling the money supply, the function almost exactly matches the information transfer model for the price level in Section 3.4. The information transfer model of Section 3.4 and the partition function version above are graphed in Figure 18. There are only small deviations.
We apply the ensemble average result calculated using Eq. (4.7) and presented in Figure 17 to empirical data for the US and show it in Figure 19. This general trend is frequently encountered in the data for several countries as part of a growing survey, see Smith (2015a), and will be explored in future work.
With the partition function approach, we can see that reduced inflation with a large money supply (a thermodynamically colder system) as well as reduced growth in Figure 19 are emergent properties. They do not exist for the individual markets; it is important to emphasize this aspect of the model. An economy with a larger money supply is more likely to be realized as a large number of lower growth states (higher entropy) than a smaller number of high growth states
4.1 Entropic forces and emergent properties
There are several novel interpretations of observed or theorized macroeconomic effects that come from this partition function treatment. First, partition functions are maximum entropy distributions so macroeconomic equilibrium may be thought of as a maximum entropy state. Second, while the trend towards lower is apparent in the ensemble average, there is no microeconomic rationale. Lower growth as an economy increases in size is an “emergent” property of economies. Larger economies are “cold”, but small economies without asymptotically large do not have a well-defined “temperature” and individual markets may dominate output.
An additional emergent property is the slow decline in the response of the price level to changes in the monetary base since
Using the interest rate model of section 3.3 to connect to means that the impact of monetary policy is reduced for larger, “colder” economies. Lowering interest rates (expanding the base) has a smaller and smaller effect as falls. This idea of ineffective monetary policy is similar to the concept of the liquidity trap, see e.g. Krugman (1998), however there are some key differences:
This information trap does not depend on the zero lower bound for interest rates. However, lower interest rates are related. As , so that if , increasing will lower interest rates (this describes the liquidity effect). Therefore will tend to be associated with lower interest rates.
The information trap does not have a sudden onset, but is part of a gradual trend towards lower interest rates. The onset may appear sudden in economies that use monetary policy for macroeconomic stabilization when a large shock hits (for example, the global financial crisis of 2008) and monetary policy appears more ineffective than during previous shocks.
There is no microeconomic rationale for the information trap. One mechanism for the liquidity trap is based on the idea that at the zero lower bound for interest rates, there is no difference between short term treasury securities and zero-interest money. In the information equilibrium approach, the information trap is an emergent property dependent on an ensemble of markets.
Finally, interpreting the realized in an ensemble of markets as the occupied growth states in an economy lets us form a novel hypothesis: nominal rigidity is an entropic force. Entropic forces in thermodynamics are forces that have no microscopic analogy, yet have observable macroscopic effects. One of the most commonly encountered physical entropic forces is diffusion
However molecules behave in aggregate as if such a microscopic force existed, evening the distribution of molecules and producing a uniform distribution. Individual molecules feel no such force. If the distribution is perturbed away from a uniform distribution, it will feel an entropic force to return to the original uniform distribution. Thus in thermodynamics we say there exists an entropic force (diffusion) maintaining a uniform distribution.
Returning to our ensemble of markets we can imagine an equilibrium distribution of growth states . Analogous to the physical system, markets will feel an entropic force to maintain the distribution of growth states set by macroeconomic observables. Growth states will not spontaneously over-represent the negative (or simply sub-inflation) growth states and will behave as if there was a force keeping them in the distribution. Specifically, while the distribution of prices (or wages) in the economy may not adjust to adverse shocks as an aggregate (e.g. the price level will not fall), individual prices may fluctuate by a large amount, for example see Eichenbaum (2008). Microfounded mechanisms like Calvo pricing (e.g. menu costs) enforcing nominal price or wage rigidity would be analogous to the fictitious density dependent force mentioned above for diffusion.
The physical concept of entropic forces is similar to the economic concept of tâtonnement. In the case of entropic forces, individual molecules restore equilibrium by random chance because equilibrium is the most likely state. The molecules are “coordinated” by entropy. In the case of tâtonnement, individual agents restore equilibrium by announcing their guesses at equilibrium prices to the Walrasian auctioneer, who coordinates agent prices until equilibrium (zero excess demand/supply) is achieved. Jaynes (1991) referred to this process as “dither” and noted its relevance for economics.
We can take this entropic description further by analogy with physical systems. In the beginning of Section 4 we connected (where M is the currency supply) with in the partition function. If we take the thermodynamic definition of temperature:
as an analogy (where is entropy and is energy), we can write
where we have used the correspondence
then we can write down
So that, integrating both sides (with being a slowly varying function of ), we obtain
Using Stirling’s approximation for large allows us to write (dropping the integration constant )
If we compare this equation with the Boltzmann definition of entropy
We can identify with the number of microstates in the economy and being the ‘economic Boltzmann constant’. The factorial counts the number of permutations of objects and one possible interpretation is that adjusts for the distinguishability of given permutations – all the permutations where dollars are moved around in the same firm or industry are likely indistinguishable or approximately so. This could lend itself to an interpretation of the constant across countries discussed in Appendix C: large economies are diverse and likely have similar relative sizes of their manufacturing sectors and service sectors, for example. Once you set the scale of the money supply , the relative industry sizes (approximately the same in advanced economies) are set by . This picture provides the analogy that a larger economy () has larger entropy (economic growth produces entropy) and lower temperature ().
For small changes in , we can show
Economic growth represents a rise in economic entropy. If the second law of thermodynamics applied to economic systems, then one would expect that . However this is not true in real macroeconomic systems. In particular, one heuristic indicator for a recession is two consecutive quarters of falling NGDP. The second law of thermodynamics is statistically violated on small scales per the so-called fluctuation theorem, see e.g. Evans (2002), however this would imply a specific form of the violation in terms of the probabilities
The tail of the actual distribution of changes in NGDP is over-represented relative to a naive application
However there is another way an economic system could violate the second law of thermodynamics that is not available to a physical system composed of molecules: coordination among the constituents. An ideal gas that changes from a state where molecules have randomly oriented velocities to a state where velocities are aligned represents a large fall in the entropy of that ideal gas. This will not spontaneously happen with meaningful probability in large physical systems. In economic systems, agents will occasionally coordinate (for example, so-called “herd behavior”), and this may be the source of the fall in economic entropy – and hence output – associated with recessions. It is also extremely unlikely that economic agents will re-coordinate themselves in order to undo the fall in NGDP. Absent reactions from the central bank or central government (monetary or fiscal stimulus), the return to NGDP growth will continue at the previous growth rate.
5 Summary and conclusion
We have constructed a framework for economic theory based on the concept of generalized information equilibrium of Fielitz and Borchardt (2014) and used it to recover several macroeconomic toy models and show they are empirically accurate over post-war US economic data. A question that comes to the forefront: does the model work for other countries? The answer is generally yes
This framework gives us a new perspective from which to interpret macroeconomic observations and tells us that sometimes macroeconomic effects are emergent and may not have microeconomic rationales
In general, the information equilibrium approach is agnostic about what mediates macroeconomic activity at the agent level or precisely how it operates. This may be unsatisfying for much of the field. However a useful analogy may be seen in physics. When Boltzmann developed statistical mechanics, the atoms he was describing – although he believed they existed – had not been established scientifically. The present approach can be thought of as looking at the economy from a telescope on a distant planet and treating economic agents as invisible atoms.
Even if it does not lead any further than the models presented here, the information equilibrium framework may still have a pedagogical use in standardizing and simplifying the approach to Marshallian crossing diagrams, partial equilibrium models and common classroom examples. A future paper Smith (2015b) will look into the connection between the utility maximization approach and an entropy maximization approach including: re-framing utility maximization as entropy maximization and interpreting the Euler equation and the asset pricing equation as maximum entropy conditions.
We would like to thank Peter Fielitz, Guenter Borchardt and Tom Brown for helpful discussions and review of this manuscript.
Appendix A Appendix
We have shown that several macroeconomic relationships and toy models can be easily represented using the information equilibrium framework, and in fact are remarkably accurate empirically. Below we list a summary of the information equilibrium models in the notation
i.e. . Also the information equilibrium models that do not require detectors are shown as
All data for the US is available at FRED (2015), including the Solow model data for Mexico (real capital is inflated using the CPI less food and energy). The UK data is from the Bank of England website and FRED. The Japan data is from the Bank of Japan website and FRED. The Eurozone data is from the European Central Bank website and FRED. The models shown in Section 3 are:
Labor market (Okun’s law)
Model parameters for the US
Model parameters for the US interest rates (simultaneous fit)
Model parameters for the UK interest rates (separate long, short fit)
Solow growth model
Model parameters for Mexico
Model parameters for the US
Price level and inflation/quantity theory of money
Model parameters for the US, using the PCE price level
Model parameters for Japan, using the core CPI price level 2010 index
The definitions for the variables for all of these models are:
|nominal aggregate demand/output (NGDP)|
|monetary base minus reserves|
|total hours worked|
|total employed persons|
|price level (core CPI or core PCE)|
|nominal long term interest rate (10-year rate)|
|price of money|
|nominal capital stock|
Appendix B Appendix
In this appendix we show the numerical codes for the optimizations in Sections 3 and 4. They are written in Mathematica using versions 8, 9 and 10. Mathematica does not have its own local weighted regression (LOESS or LOWESS) smoothing function so we wrote one; the code is shown in Figure 22.
The parameter fits were accomplished by minimizing the residuals using the Mathematica function using the method , a derivative-free minimization method. The functions of the form are a Mathematica interpolating function with interpolation order set to linear using FRED (2015) data as input. is the monetary base minus reserves (currency component), FRED series MBCURRCIR. is nominal gross domestic product FRED series GDP. is the personal consumption expenditures price level, excluding food and energy. is the monetary base, FRED series AMBSL. is the three month treasury bill secondary market interest rate, FRED series TB3MS.
Appendix C Appendix
If we keep the parameter constant across countries, it can aid cross-national comparisons as we show in this appendix. First, set up the variables
setting up the constant . I call these the information transfer index (from the original theory) and the normalized monetary base, respectively. Defining the constant
we can write
Calculating the derivative above (after dividing by ), one obtains
The bracketed term must be zero since the piece outside the bracket is positive, so therefore, after some substitutions
And we arrive at