Trading Lightly: Cross-Impact and Optimal Portfolio Execution
Executing trading decisions in real markets is a difficult business. Moving around substantial amounts is often foiled by the lack of liquidity. When orders exceed the small volumes typically available at the best bid or ask in lit order books, the incurred slippage costs rapidly become detrimental to the trading strategy. Accurately predicting these trading costs is quite a non-trivial exercise, and one must often resort to statistical models. Most of the complexity of such models arises from price impact, i.e. the fact that trades tend to move market prices in their own direction. This effect has recently triggered a large amount of theoretical activity, due to its direct relation to supply and demand, and also because of the abundantly available data on financial transactions. As alluded to above, this interest is not purely academic, since reliable estimates of trading costs are crucial in order to judge whether one should enter into a position and – if the cost model is accurate enough – to attempt to optimize the execution strategy. For example, splitting large orders into a stream of smaller ones over time is one of the universally accepted ways to mitigate transaction cost. However, as always, the devil is in the details, and the precise nature and time scheduling of the orders can make a large difference to the final result.
Optimal execution problems are widespread in the literature and many different formulas and techniques have been developed to solve them. However, most of these problems are restricted to single asset execution. Recently in Benzaquen et al. (2017) we have shown that even a simple linear model of cross-asset price impact leads to a very rich phenomenology, in line with the results of Wang et al. (2015); Wang and Guhr (2016). In the present paper we provide a practical recipe to optimize the execution of a portfolio of trades, taking into account the cross-impact on the different underlying products within the multivariate framework of Schied et al. (2010). We will show that proper synchronization of the legs of the execution schedule is very important. To quantify the slippage incurred by the strategy we introduce the EigenLiquidity Model (ELM). The model is directly related to statistical risk factors which have been used for portfolio risk management for several decades. Based on a Principal Component Analysis of the correlation matrix, which provides a practical method to quantify the amount of different kinds of market risk (long the market, sectorial, etc.) one can trade while staying within a prescribed budget of transaction cost.
1 A quadratic cost model for a single stock
To set the stage, let us first discuss the execution of a single company’s stock over a trading day that starts at time and lasts until . The total volume we have to trade is shares, which is obtained over the day by a continuous execution schedule whose local speed is , normalized as . If we were to hold this position, our expected risk, defined as the standard deviation of the daily PnL, would be , where is the daily volatility of the stock, expressed in dollars. For convenience let us define the trading speed in risk units as , and the total risk exchanged over the day as , which naturally equals .
A standard model for estimating the cost incurred when trading a certain volume has first been introduced by Almgren and Chriss (2001). We will consider their framework in a setting in which the trader is risk-neutral, and impact is linear and transient (Gatheral et al., 2012; Gatheral and Schied, 2013; Busseti and Lillo, 2012; Alfonsi and Schied, 2013). This allows one to express the trading costs as
where is an impact kernel (Bouchaud et al., 2004), which quantifies the effect of a small trade on the price at a later time . is typically a decreasing function, dropping from a maximum value obtained at to zero after a slow decay. Consistently with the results of Bouchaud et al. (2004), it can be written as , with
has units of , and its inverse corresponds to the amount of risk one would have to trade, in the absence of decay, to move the stock’s price by its typical daily volatility .
In spite of some limitations (e.g. impact is empirically found to be a sub-linear function of volume), the model above provides a reasonable estimation of trading costs for large trades and it is able to capture the main effects of the trade schedule on costs (Gatheral et al., 2012; Gatheral and Schied, 2013; Alfonsi and Schied, 2013). Extensions of this model to account for risk-aversion have been considered in (Almgren and Chriss, 2001; Obizhaeva and Wang, 2013; Curato et al., 2016), and for bid-ask spread effects in (Obizhaeva and Wang, 2013; Curato et al., 2016).
2 Optimal trading of a single stock
Optimal trading schedules under the cost function (1) for a fixed total volume to execute have been extensively investigated in the finance literature (Gatheral et al., 2012; Gatheral and Schied, 2013; Alfonsi and Schied, 2013; Obizhaeva and Wang, 2013; Busseti and Lillo, 2012). The trajectory of minimum cost can be written as , where and can be determined by solving a linear integral equation (Gatheral et al., 2012). This yields the well-known symmetric bucket shape solution for that is depicted in Figure 1 (red curve).
The optimal solution indicates that after an initial period of faster trading, one should slow down the execution in order to limit the extra cost due the impact of one’s own trades, and then accelerate again the trading near the market close. Since trading does not extend beyond that point, strongly impacting the price in this final period does not penalize any further executions. As an example, the optimal policy is about 30% less expensive than a localized flat two-hour execution, and approximately 7% cheaper than the linear trading profile represented in Figure 1.
3 A quadratic cost model for portfolios
The problem of optimal execution across multiple instruments has been first considered in Schied et al. (2010); Kratz and Schöneborn (2014); Schöneborn (2016), whereas the cost model defined above has been generalized to the multivariate case in Alfonsi et al. (2016) (see also Schneider and Lillo (2016) for a non-linear generalization). Within that framework, our definition of risk above is readily extended to a portfolio of multiple stocks with risk positions as
where the correlation matrix is constructed from the price covariance matrix through
The daily volatilities are generalized to .
Similarly, Eq. (1) can be extended to this setting as
The interpretation of the matrix elements of is as before: After trading dollars of risk on the contract at time , we expect the price of contract to change by units of its daily dollar volatility . The terms with correspond to direct price impact, which was already described by earlier models where each stock is independent. In addition, the new terms with describe cross-impact between stocks which, as it was shown by Benzaquen et al. (2017), is a highly relevant effect since it explains an important fraction of the cross-correlation between stocks, a feature that we will use below.
Benzaquen et al. (2017) have further found that within a good degree of approximation, one can write the kernel in the factorized form , with given by Eq. (2), and denote respectively the symmetric and antisymmetric part of . Schneider and Lillo (2016) have shown that when the antisymmetric part of the propagator is large, then price manipulation is possible, leading to an ill-defined cost of optimal strategies. Since empirically is small, we set it to zero so that the cost associated with the execution of a portfolio of trades reads:
The condition of symmetry for is not the only one required in order not to have price manipulation. In fact, formula (6) is free of price manipulation if and only if is positive semidefinite. This amounts to saying that buying a portfolio always pushes its price up and vice versa, regardless of its composition, resulting in an impact cost that is always greater or equal to zero independently of the portfolio that is actually traded.
4 The EigenLiquidity Model
In principle, can be determined using simultaneous Trades and Quotes data for the corresponding pool of stocks. However, the empirical impact matrix is in general extremely noisy, so some cleaning scheme is necessary. By leveraging the empirical results of Benzaquen et al. (2017) (see Fig.7 therein), showing that the structure of the impact matrix is in a suitable statistical sense “close” to the one of the correlation matrix, we make the assumption that the impact matrix has the same set of eigenvectors as those of the correlation matrix . Intuitively, the eigenvectors of correspond to portfolios with uncorrelated returns. Our assumption means, quite naturally, that trading one of these portfolios will only impact (to first approximation) the returns of that portfolio, but not of any other orthogonal one. Besides being an empirically reasonable cleaning scheme for , this choice is motivated by the results illustrated in this section, showing that it leads to a cost function that satisfy three fundamental consistency requirements: symmetry, positive semi-definiteness and fragmentation invariance.
More precisely, one can write
where is an orthogonal matrix of eigenvectors and is a diagonal matrix of non-negative eigenvalues. Our assumption is that the matrix has the following structure:
where is a diagonal matrix, and .
An important property of this decomposition is that it leads to a fragmentation invariant cost formula in the following sense: When trading two completely correlated products and (i.e., when ), the impact of a trading trajectory does not depend on how the volume is split between the two instruments. More formally, a fragmentation invariant cost is left unchanged under the transformation
where is completely arbitrary. Intuitively, Eq. (8) fulfills this property because of the factor multiplying . If instruments and are completely correlated, then the relative mode “rel” is an eigenvector of zero risk, with . When used for estimating the cost of an execution trajectory , Eq. (8) will single out such relative modes through the projection , and it will weight them by the corresponding risk which is zero.
The impact model (8) will be called the EigenLiquidity Model (ELM). It can be seen as the most natural choice among all the models implementing fragmentation invariance. In fact, it continuously interpolates between small risk modes (that are thus expected to be characterized by small impact) and large risk modes (for which impact costs can be substantial).
Empirically, Eq. (8) has been shown to hold to a good degree of approximation in Benzaquen et al. (2017). On the other hand, the condition of positive semi-definiteness of , i.e. , , is not guaranteed from Eq. (8) and thus should be checked on empirical data. This is what we display in Figure 3 using real data, confirming that all the ’s are actually strictly positive. The quantity has the natural interpretation of a liquidity per mode. It expresses the amount of daily risk in dollars that one can trade on the eigen-portfolio to move its price by its daily volatility .
5 Optimal trading of portfolios
Under the ELM, the impact cost of any schedule admits an interpretation in terms of the modes of the correlation matrix of normalized returns through the decomposition
where . We have also introduced the notation
denoting the projection of the executed volumes on a set of uncorrelated, unit risk eigen-portfolios .
The notion of eigen-portfolios is very useful for intuitively characterizing the cost formula (11). The name comes from the fact that these portfolios are not only uncorrelated and unit-risk (i.e., ), but furthermore trading an amount of the basket according to the weights given by has no impact on the total value of the basket and vice versa (i.e., ). This is precisely the intuition behind our central assumption, Eq. (8).
This construction implies that the cost can be calculated by first projecting the strategy on the portfolios via Eq. (12) and then by taking a sum of an impact cost per mode with a weighting factor given by such projections.
Eq. (11) also shows clearly that the positivity of the matrix and the kernel makes the optimization problem convex, which always has a unique solution. The optimum under the terminal constraint is necessarily achieved under a synchronous execution schedule where at any given point in time all stocks are traded with the same time profile, i.e., , resembling the case without cross-impact.
Intuitively, an asynchronous execution strategy can be seen in the mode space
as the optimal one above, plus a round-trip along some of these modes.
The convexity of the cost function (6) implies that round-trips
always increase execution costs, so they should be avoided.
A toy example to explain the implications of the formalism is given in Box 5.
Box 5. A toy example with two stocks
A simple realistic case is stocks and an impact matrix of the form
with . Let us also suppose the target volumes to have the same magnitude:
After defining , the cost becomes
The interpretation of this result is the following:
The cost of trading is proportional to the eigenvalues and (where abs and rel stand for absolute and relative mode). It is obviously minimized by choosing , in which case the cost is
When trading directionally (i.e., if ), the minimum cost is proportional to , while the neutral strategy yields a smaller cost proportional to .
One could be tempted to locally trade the cheaper relative mode, but this would construct a long-short position which would have to be closed at a cost later. It is easy to check that the convexity of the cost and the terminal requirement prevent this from being optimal.
Which trajectory is cheaper
in terms of risk? If we assume the correlation matrix to be given by
its eigenvalues are equal to . The cost of trading per unit of risk can be written as
where is the liquidity in dollars of the absolute and relative mode, respectively. We can interpret this as follows:
The cost of trading per unit risk depends trivially on , but it has an implicit dependence on the correlation through . The more correlated are the stocks, the more expensive it is to obtain a target risk.
The liquidity per mode accounts for both effects, describing how expensive it is to obtain a given target risk by trading either the symmetric or the antisymmetric mode.
Box 5. Fitting the EigenLiquidity Model
We present a step-by-step procedure for calibrating our cost model to real data.
Compute the covariance matrix of prices
and extract the volatilities .
Standardize prices, their covariances, and market volumes:
(B2.2) (B2.3) (B2.4)
Compute the covariation of prices and volumes:
Compute the derivative of the kernel by solving
where and . Get the norm from the condition .
Extract indepedent portfolios from the correlation matrix by computing eigenvectors and eigenvalues according to Eq. (7).
Project the covariances and to indepedent portfolios:
Estimate with the maximum-likelihood estimator
6 Applications to real data
A visual representation of the impact matrix is shown in Figure 2. The inhomogeneity of captures the sectorial structure of the market, encoding the specific dependence of stock on its sector and/or its most correlated stocks .
The main difference of our ELM and a model without cross-impact is that the impact costs of a large buy program can no longer be reduced as effectively by spreading the orders across multiple correlated instruments. The impact kernel diffuses the interaction across markets and sectors through the modes. The transaction cost of trading dollars of risk in the mode is equal to dollars. Figure 3 shows the inverse of the eigenvalues , this is about 30M dollars of risk on the most liquid mode. Most of the cross-interaction between stocks is captured by this market mode, which accounts for the fact that when buying a dollar of risk of a stock picked at random in our pool, the other stocks in the pool rise on average by times their daily volatility, where is the average of the off-diagonal elements of . Smaller risk modes may be up to 30 times less liquid. Empirically, the liquidity per mode is well fitted by , see Figure 3. This finding is consistent with the assumption of fragmentation invariance, which implicitly requires the parameters when (see Box 5).
In order to illustrate the relevance of these findings when executing a portfolio of trades, let us study numerically a toy daily execution problem of a trader who has target volumes corresponding to a fraction 1%,5%, 10% of the daily liquidity of each of US stocks. We assume that the trader uses the optimal, synchronous policy derived above: .
To explore a variety of trading styles, we set the sign ( for buy, for sell) of the orders from biased coin tosses. We vary the bias parameter in the interval . The interpretation of this construction for or is very simple:
For , the order is a long or short directional one, and is strongly exposed to the market mode of risk.
For , the strategy is neutral, and its exposition to the market mode is therefore limited.
The average cost under such an execution policy can be expressed analytically, allowing to obtain a relation between the cost and :
where denotes average dollar risk exchanged by the market on the different stocks.
Comparing Eq. (12) between the above special cases, one can see that the dollar cost is higher for than for by a factor that can be estimated on the basis of Eq. (12) to roughly be . This makes sense, as this is the ratio of the top eigenvalue of , expressing the cost of trading the market directionally, versus the average of all eigenvalues selecting the direct impact contribution in the left term of Eq. (12). Figure 4 shows the full evolution of this ratio as a function of the bias .
However, this result holds for a fixed dollar volume traded, but the risk of the resulting position is in fact much higher for than for . The cost of trading per unit risk taken, expressed by is found to be approximately independent of . By generalizing Eq. (12) to the ratio , one can relate this finding to the numerical result .
In this paper we have shown how to leverage the recent quantitative results of Benzaquen et al. (2017) on cross-impact effects in order to estimate the execution cost of a basket of correlated instruments. We confirm empirically on a pool of 150 US stocks that cross-impact is a very substantial part of the impact of trades on prices. We show that neglecting cross-interactions leads to a distorted vision of the liquidity available on the market: it overestimates the liquidity on large risk modes and underestimates the liquidity of low-risk modes.
In order to distill these findings into a cost formula, we have assumed that the impact matrix has the same eigenvectors as the correlation matrix itself, and impact eigenvalues are proportional to the risk of the corresponding modes. This specification prevents arbitrage opportunities and price manipulation strategies. It also abides by the principle of fragmentation invariance, which states that trading zero risk portfolios should have no effect whatsoever on trading costs. We have provided the solution of the corresponding optimal trading problem, which leads to a synchronous U-shaped trading profile across products. This avoids round trips on unwanted positions at a potentially large cost.
In order to keep our approach as simple as possible, we have neglected other sources of cost (spread costs, fees), and considered no risk aversion effects nor intraday predictive signals. Moreover, we have deliberately disregarded the non-linear nature of the price impact function, which is known to be better represented by a square-root law (Grinold and Kahn, 2000; Tóth et al., 2011; Schneider and Lillo, 2016). These features could be progressively reintroduced into our framework by preserving the idea of an interaction that is diagonal in the space of correlation modes. Indeed, we believe our simper approach to be better suited in order to illustrate transparently the main effects of cross-impact between financial instruments.
- One may compare this with a related discussion by Wang (2017) investigating the case of round-trips on two stocks.
- A. Alfonsi and A. Schied. Optimal trade execution and absence of price manipulations in limit order book models. SIAM Journal on Financial Mathematics, 1(1):490–522, 2010.
- A. Alfonsi and A. Schied. Capacitary measures for completely monotone kernels via singular control. SIAM Journal on Control and Optimization, 51(2):1758–1780, 2013.
- A. Alfonsi, A. Schied, and A. Slynko. Order book resilience, price manipulation, and the positive portfolio problem. SIAM Journal on Financial Mathematics, 3(1):511–533, 2012.
- A. Alfonsi, F. Klöck, and A. Schied. Multivariate transient price impact and matrix-valued positive definite functions. Mathematics of Operations Research, 41(3):914–934, 2016.
- R. Almgren and N. Chriss. Optimal execution of portfolio transactions. Journal of Risk, 3:5–40, 2001.
- M. Benzaquen, I. Mastromatteo, Z. Eisler, and J.-P. Bouchaud. Dissecting cross-impact on stock markets: An empirical analysis. Journal of Statistical Mechanics: Theory and Experiment, 2017(2):023406, 2017.
- J.-P. Bouchaud, Y. Gefen, M. Potters, and M. Wyart. Fluctuations and response in financial markets: the subtle nature of “random” price changes. Quantitative Finance, 4(2):176–190, 2004.
- J. Bun, J. Bouchaud, and M. Potters. Cleaning correlation matrices. Risk Magazine, 2016.
- E. Busseti and F. Lillo. Calibration of optimal execution of financial transactions in the presence of transient market impact. Journal of Statistical Mechanics: Theory and Experiment, 2012(09):P09010, 2012.
- G. Curato, J. Gatheral, and F. Lillo. Optimal execution with non-linear transient market impact. Quantitative Finance, 17(1):41–54, 2016.
- J. Gatheral. No-dynamic-arbitrage and market impact. Quantitative Finance, 10(7):749–759, 2010.
- J. Gatheral and A. Schied. Dynamical models of market impact and algorithms for order execution. Handbook on Systemic Risk, Jean-Pierre Fouque, Joseph A. Langsam (eds.), pages 579–599, 2013.
- J. Gatheral, A. Schied, and A. Slynko. Transient linear price impact and fredholm integral equations. Mathematical Finance, 22(3):445–474, 2012.
- R. C. Grinold and R. N. Kahn. Active portfolio management. McGraw-Hill New York, NY, 2000.
- P. Kratz and T. Schöneborn. Optimal liquidation in dark pools. Quantitative Finance, 14(9):1519–1539, 2014.
- A. A. Obizhaeva and J. Wang. Optimal trading strategy and supply/demand dynamics. Journal of Financial Markets, 16(1):1–32, 2013.
- A. Schied, T. Schöneborn, and M. Tehranchi. Optimal basket liquidation for cara investors is deterministic. Applied Mathematical Finance, 17(6):471–489, 2010.
- M. Schneider and F. Lillo. Cross-impact and no-dynamic-arbitrage. arXiv:1612.07742, 2016.
- T. Schöneborn. Adaptive basket liquidation. Finance and Stochastics, 20(2):455–493, 2016.
- B. Tóth, Y. Lempérière, C. Deremble, J. De Lataillade, J. Kockelkoren, and J.-P. Bouchaud. Anomalous price impact and the critical nature of liquidity in financial markets. Physical Review X, 1(2):021006, 2011.
- S. Wang. Trading strategies for stock pairs regarding to the cross-impact cost. arXiv:1701.03098, 2017.
- S. Wang and T. Guhr. Microscopic understanding of cross-responses between stocks: a two-component price impact model. Available at https://ssrn.com/abstract=2892266, 2016.
- S. Wang, R. Schäfer, and T. Guhr. Price response in correlated financial markets: empirical results. arXiv:1510.03205, 2015.