Market shape formation, statistical equilibrium
and neutral evolution theory
Abstract
Mathematical methods of population genetics and framework of exchangeability provide a Markov chain model for analysis and interpretation of stochastic behaviour of equity markets, explaining, in particular, market shape formation, statistical equilibrium and temporal stability of market weights.
all \setpapersizeA4 \setmargins1.1cm0.7cm19.0cm24.5cm12pt1cm0pt1.25cm
1 Introduction
Loglog plot of normalized stock market capitalizations ranked in descending order is called capital distribution curve. For example, figures below display distribution of capital on the NASDAQ market on three dates in 2014 (data source is http://www.google.com/finance#stockscreener). Ranked market weights experienced relatively small fluctuations, despite significant changes in overall capitalization of the NASDAQ market during that period of time.
One of the aims of this paper is to provide an example of a possible mechanism explaining temporal stability and statistical equilibrium of normalized stock capitalizations by means of the PolyaDirichlet Markov chain, analogous to the WrightFisher model of neutral theory of evolution. Classic and neutral evolution theory. Classic form of Darwinian theory suggests that forces of natural selection play central role in evolution of species. Theory of neutral evolution, proposed by Kimura, complements the classic theory by adding genetic dimension. Kimura observed that discrepancies in traits, such as small variations in colouring of beaks or feathers in a population of birds, occur at moleculargenetic level due to random effects in reproduction and majority of these variations are neutral with respect to fitness. According to the neutral theory, force of natural selection is still be important since it purges deleterious mutations. However, majority of surviving mutations are neutral, and possibly only few are advantageous.
Mutations and random combinations of genes in new generations lead to fluctuations of allelic frequencies or genetic drift. The WrightFisher and Moran models describe stochastic evolution of genetic frequencies as statistical equilibrium fluctuations, modelled by diffusion process with stationary Dirichlet distribution.
Evolution theory and finance. Application of evolutionary ideas in finance has a long history dating back to Malthus, Marshall and many others. Recently Evstigneev, Hens, and SchenkHoppé [4] developed descriptive model of Evolutionary Finance, which employs principle of natural selection for modeling dynamics of asset prices and analysis of investment strategies.
Kirman [14] considered version of the WrightFisher model with mutation in a context of economic interpretation of behavior of ants searching for a food source. He observed that proportion of ants choosing one of the possible food channels is better described by stationary distribution of a Markov chain, rather than by single point of equilibrium. He proposed that the ’herding’ behaviour on financial markets as well is better described by means of stochastic equilibrium, rather than by single or multiple equilibria.
Formation of market limit shape. Standard and nonlinear versions of the Polya process have been used by Arthur et al. [1] for illustration of appearance of market structure. Polya scheme has the following interesting property: proportions of balls converge to some limiting values, but these limits are random and described by the Dirichlet distribution.
Markov lattice and reversibility. PolyaDirichlet Markov chain with state space defined on lattice of ordered integer partitions provides a framework for analysis and modeling of stochastic equilibrium of market weights. Transitions on the lattice of partitions are described in terms of random up and dn operators proposed by Kerov [13], Fulman[8], Borodin and Olshanski [2] and Petrov [16]. Historically, Markov chains with dn/uptransitions in a context of Polya model were first studied by Costantini, Garibaldi, et al. in [3], [9].
Exchangeability and random fluctuations. Infinite exchangeability implies existence of up and dn random transitions, connecting adjacent levels of integer compositions. It is shown in Section 4 that probabilities of these transitions satisfy reversibility conditions and therefore induce a lattice of Markov chains. Random transitions on this lattice correspond to statistical equilibrium behaviour of market weights or allelic frequencies not only for fixed, but also for varying market or population sizes.
Neutral theory and financial markets. The PolyaDirichlet Markov lattice corresponds to the discrete version of the WrightFisher process with mutations and provides a toy model of equilibrium markets behaviour.

After initial phase of rapid expansion, in a same way as proportions of balls converge to random limits in Polya scheme, market weights settle down and form capital distribution curve.

Up and down changes in overall market capitalization lead to random drift of market weights fluctuating in stochastic equilibrium around limiting values, given by the capital distribution curve. The stationary distribution of market weights can be modeled by means of the up and dn Markov chain.

In general, increase of market capitalization enforces market structure and decrease of capitalization leads to weakening of the structure and higher volatility, which creates an opportunity for market reshaping. This mechanism is analogous to the socalled nearly neutral theory of evolution proposed by Ohta [15], in which smaller populations have faster moleculargenetic evolution and adaptation rate.

This theory provides interpretation of market crises as markets selfadaption to changing economic conditions, where capitalization decrease leads to market reshaping and faster adjustment to new econofinancial landscape.

Arbitrage opportunities can be considered as corresponding to deleterious mutations, eliminated by forces of natural selection.
Mechanics, economics and reversible equilibrium. As pointed out by Garibaldi and Scalas [11], equilibrium modeling in economics and finance was developed under strong influence of ideas of static or classical mechanics. Alternative approach is provided by framework of stochastic equilibrium and reversibility conditions, which have roots in Boltzmann’s work on statistical mechanics. Exhaustive treatment of econophysics from the point of view of exchangeability is contained in the book of Garibaldi and Scalas [10].
Excellent explanation of the framework of reversible equilibrium is contained in the classic book of Kelly [12].
2 Polya process with down/up transitions
In a classic form of Polya process colored balls are placed into a box with probabilities proportional to weights of balls of existing colors. The process provides a discrete counterpart of the Dirichlet distribution, since if vector represents initial/prior weights of balls of each color, limiting values of proportions of weights in Polya scheme have Dirichlet distribution with the same vector of parameters .
Modified Polya process, in which balls can also be removed illustrates important ideas of

appearance and temporal stability of ranked proportions, and

stochastic equilibrium of these weights.
Let us consider an artificial stock market with finite number of stocks represented by different colours. Initially in the box there are ’prior’ balls of each colour and the same weight , such that total weight of all balls is . In other words, all stocks start with the same initial conditions and colours (or tickers) are used only to distinguish the stocks. Vector represents stock capitalizations equal to number of placed balls of each color at stage , so at initial stage this vector is . All possible market configurations with overall capitalization are represented by compositions (ordered partitions) in the integervalued simplex
At the first step one of the prior balls is drawn with probability . The ball is placed back into the box together with a ball of the same color and unit weight. At stage probability to add a ball of color is
where denotes number of balls of color in the box. For instance, with colors, say red, green and blue, probability of drawing 3 red balls, 2 green ones and 1 blue ball in this particular sequence is
with raising or ascending factorial power defined as
By combinatorial argument probability of configuration at stage is
(1) 
For each level this formula establishes probability distribution in the simplex , moreover this distribution is exchangeable or symmetric, in a sense that probability of any sequence does not depend on the order of balls drawn and depends only on the number of balls of each color.
Using asymptotic
which corresponds to density of symmetric Dirichlet distribution
up and dn transitions. In a standard Polya model number of balls increases at each stage, such that in configuration component increases by one with conditional probability
(2) 
This can be considered as random uptransitions of configuration from simplex to . In financial terms, upmoves correspond to investment into particular stock and increase of capitalization. Clearly, stochastic dynamics of these transitions is of preferential attachment type, since conditional probability (2) of stock growth is proportional to its capitalization . As shown below upmoves preserve probability distributions (1) on simplexes .
It turns out, that upmoves also implicitly define dntransitions, which randomly move configuration backwards from simplex to . These dntransitions also preserve probability structure on simplexes. In terms of Polya’s model dnmove corresponds to removing a ball of some color at random and financially it has interpretation of decrease of capitalization of one of the stocks by one unit.
Structure of exchangeable probability distribution plays an important role connecting up and dn transitions, dual to each other. For the sake of illustration let us consider case of two stocks, labelled by two colours. Structure of connections of probabilities between simplexes with is shown below, where denotes probability of configuration with balls of the first color and balls of the second color.
Let us consider probability flows between levels and for
For instance, configuration can migrate to state with probability or go to state with probability , thus contribution or forward probability flow to configuration is
Similarly it can be shown that probability flow from state to state is
It is easy to see that upmoves preserve probability measure (1) on simplexes. For instance, total flow of probabilities into state is
In other words, fraction of probability comes from configuration and fraction comes from configuration . This means that given probability of state backward probability flow can be interpreted as random removal of one of the balls with :
In general, random downtransition moves configuration from simplex to , and in terms of Polya model it removes a ball of color with probability
(3) 
It is straightforward to show that dnmoves also preserve probability measure (1). In terms of artificial market model up and dn transitions correspond to increase and decrease of capitalization (buying/selling) of a stock by one unit of money.
3 Market shape formation
Classic Polya scheme (with uptransitions) provides a model for simulation of market growth. It turns out that in this model, even if initial conditions are the same for all stocks, over certain period of time, or after reaching certain market capitalization, powerlaw shaped market structure begins to appear.
Figures below illustrate shape formation of the capital distribution curve on the artificial market. Left side of figures contains realization of market weights, modeled by uptransitions and right side displays loglog plot of ranked weights at the terminal stage. All stocks have the same initial conditions modeled by equal prior . After rapid period of ’Big Bang’like chaotic expansion, market weights begin to settle down and after step 1500 do not change significantly.
Next figure illustrates that for smaller values of parameter there is greater variation of market weights. Particular choice of parameter corresponds to the uniform stickbreaking niche model.
4 Statistical equilibrium and PolyaDirichlet Markov process
One of the central ideas of neutral evolution theory is that proportions of genes in a population (allelic frequencies) experience random drift, in other words they fluctuate around some values. It is important that population size may increase or decrease (in reasonable amounts), but allelic frequencies remain approximately the same. The same phenomenon is observed on financial markets: ranked equity capitalizations, comprising capital distribution curve display remarkable temporal stability, despite significant fluctuations of overall capitalization.
WrightFisher model (WF) and its generalizations provide a framework for modeling evolution of proportions of genes fluctuating in stochastic equilibrium. Finite form of the allele WFmodel is based on a Markov chain with state space of ordered integer partitions (compositions) with elements. Since stationary distribution of proportions in limiting case of the WFmodel with mutation is given by Dirichlet distribution, for consistency it is assumed that finite version of WFmodel is approximated by the Polya distribution (1).
In Polya model with dn/uptransitions ordered partition may represent:

vector of stock capitalizations with overall market capitalization ,

number of genes of each specific type in a population of size .
As above it is assumed that vector of priors (or mutation rates, correspondingly), is a symmetric vector with all components equal to and denotes sum of all parameters. Stationary distribution with transitions in integer simplex can be constructed by combining dn and uptransitions, as proposed in [2],[8],[16] and [9]. For instance, transition where one item moves from category to category in dn/upscheme, is modeled in two steps.
with probability for , and with as probability of return. Reversibility and stochastic equilibrium.
If denotes conditional probability of migration from state to state , then reversibility (detailed balance) conditions imply for all states in state space
(4) 
Besides timereversibility, there are some other interesting interpretations of detailed balance conditions.

In mechanical systems if each process is matched by its reverse process, then the system is in equilibrium. Equilibrium may take place on a microlevel, while on a macrolevel the system may stand still.

In terms of probability flows, essentially (4) states that unconditional probability flow from to must be equal to probability flow from state to state .
It is easy to show that both dn/up and up/dn schemes satisfy detailed balance conditions and therefore they induce reversible and hence stationary Markov chain with a state space of ordered partitions for each .
More detailed analysis reveals that reversibility conditions connect probability distributions on all adjacent simplexes and
For instance, probability flows between compositions and satisfy
which also clarifies role of up and dn operators. In other words, probability distributions (1) on simplexes are pairwise connected. Since for sufficiently large Polya distribution approximates Dirichlet distribution, sequences of up and dnoperators between simplexes approximate equilibrium process with stationary Dirichlet distribution even with changing values of .
This provides a framework for modeling evolution of market weights for fixed and variables values of . It turns out that behavior of equilibrium process depends on the population size. When overall market capitalization is sufficiently large market structure becomes enforced. In contrast, decrease of market cap leads to higher variations of market weights.
For instance, figures below illustrate two phases of evolution of market weights. During the first phase, for (Figure 5) or (Figure 6), the market experiences period of growth and formation of limiting weights, representing capital distribution curve, displayed on the right subplots with blue circles. Once market value reaches threshold value of or market capitalization begin fluctuating in dn/uptransitions, which generates stochastic evolution of market weights. Corresponding capital distribution curve at the terminal period is shown with red circles. As it can be seen from the figures, stocks with larger capitalizations have higher trading activity.
Such behavior of weights, dependent on level is consistent with the socalled ’nearly neutral theory’, developed by Ohta [15], where it is proposed that smaller populations experience faster rate of genetic evolution. In financial terms, it suggests the market crises can be ’explained’ as markets selfadaption to changing economic conditions, where reduced market capitalization allows faster reshaping and adjustment of capitalization structure to new econofinancial landscape.
References
 [1] W. B. Arthur. Increasing returns and path dependence in the economy. University of Michigan Press, 1994.
 [2] A. Borodin and G. Olshanski. Infinitedimensional diffusions as limits of random walks on partitions. Probability theory and related fields, 144(12):281–318, 2009.
 [3] D. Costantini and U. Garibaldi. A purely probabilistic representation for the dynamics of a gas of particles. Foundations of Physics, 30(1):81–99, 2000.
 [4] I. V. Evstigneev, T. Hens, and K. R. SchenkHoppé. Evolutionary finance. Swiss Finance Institute Research Paper, (0814), 2008.
 [5] E. R. Fernholz. Stochastic Portfolio Theory. Springer, 2002.
 [6] R. Fernholz, T. Ichiba, and I. Karatzas. A secondorder stock market model. Annals of Finance, 9(3):439–454, 2013.
 [7] R. Fernholz and I. Karatzas. Stochastic Portfolio Theory: an overview. 2009.
 [8] J. Fulman. Stein’s method and Plancherel measure of the symmetric group. Transactions of the American Mathematical Society, 357(2):555–570, 2005.
 [9] U. Garibaldi, D. Costantini, and P. Viarengo. A finitary characterization of the Ewens sampling formula. Advances in Complex Systems, 7(02):265–284, 2004.
 [10] U. Garibaldi and E. Scalas. Finitary probabilistic methods in econophysics. Cambridge University Press, 2010.
 [11] U. Garibaldi and E. Scalas. Tolstoy’s dream and the quest for statistical equilibrium in economics and the social sciences. In Mathematical Modeling of Collective Behavior in SocioEconomic and Life Sciences, pages 115–133. Springer, 2010.
 [12] F. P. Kelly. Reversibility and stochastic networks. Cambridge University Press, 2011.
 [13] S. V. Kerov. Asymptotic representation theory of the symmetric group and its applications in analysis. American Mathematical Society Providence, 2003.
 [14] A. Kirman. Ants, rationality, and recruitment. The Quarterly Journal of Economics, 108(1):137–156, 1993.
 [15] T. Ohta. The nearly neutral theory of molecular evolution. Annual Review of Ecology and Systematics, pages 263–286, 1992.
 [16] L. Petrov. A twoparameter family of infinitedimensional diffusions in the Kingman simplex. arXiv preprint arXiv:0708.1930, 2007.