Distributed Robust Output Regulation of Heterogeneous Uncertain Linear Agents by Adaptive Internal Model Principle
Abstract
We study a multiagent output regulation problem, where not all agents have access to the exosystem’s dynamics. We propose a distributed controller that solves the problem for linear, heterogeneous, and uncertain agent dynamics as well as timevarying directed networks. The distributed controller consists of two parts: (1) an exosystem generator that creates a local copy of the exosystem dynamics by using consensus protocols, and (2) a dynamic compensator that uses (again) consensus to approach the internal model of the exosystem and thereby achieves perfect output regulation. Our approach leverages methods from internal model based controller synthesis, multiagent consensus over directed networks, and stability of timevarying linear systems; the derived result is an adaptation of the (centralized) internal model principle to the distributed, networked setting.
I Introduction
Over the past decade, many distributed control problems of networked multiagent systems have been extensively studied; these include e.g. consensus, averaging, synchronization, coverage, and formation (e.g. [1, 2, 3]). Progressing beyond first/secondorder and homogeneous agent dynamics, the distributed output regulation problem with general linear (timeinvariant, finitedimensional) and heterogeneous agent dynamics has received much recent attention (e.g. [4, 5, 6, 7, 8, 9]). In this problem, a network of agents each tries to match its output with a reference signal, under the constraint that only a few agents can directly measure the reference. The reference signal itself is typically generated by an external (linear) dynamic system, called “exosystem”. The distributed output regulation problem not only subsumes some earlier problems like (leaderfollowing) consensus and synchronization, but also addresses issues of disturbance rejection and robustness to parameter uncertainty. Also see e.g. [10, 11] for further extensions of this problem to deal with nonlinear agent dynamics.
Output regulation has a wellstudied centralized version: A single plant tries to match its output with a reference signal (while maintaining the plant’s internal stability) [12, 13, 14]. In the absence of system parameter uncertainty, the solution of the “regulator equations”, embedding a copy of the exosystem dynamics, provides a solution to output regulation [14]. When system parameters are subject to uncertainty, a dynamic compensator/controller must be used embedding copy of the exosystem, where is the number of (independent) output variables to be regulated. The latter is wellknown as the internal model principle [13]. These methods for solving the centralized output regulation problem, however, cannot be applied directly to the distributed version, inasmuch as not all agents have direct access to the reference signal or the exosystem dynamics.
The distributed output regulation of networks of heterogeneous linear agents is studied in [5]. The proposed distributed controller consists of two parts: an exosystem generator and a controller based on regulator equation solutions. Specifically, the exosystem generator of each agent aims to (asymptotically) synchronize with the exosystem using consensus protocols, thereby creating a local copy of the exosystem. Meanwhile each agent independently tracks the signal of its local generator, by applying standard centralized methods (in [5] regulator equation solutions). This solution effectively separates the controller synthesis into two parts – distributed exosystem generators by network consensus and local output regulation by regulator equation solution.
One important limitation, however, of the above solution is: in both the exosystem generator design and the regulator equation solution, it is assumed that each agent uses exactly the same dynamic model as the exosystem. This assumption may be unreasonable in the distributed network setting, because those agents that cannot directly measure the reference signal are unlikely to know the precise dynamic model of the exosystem. To deal with this challenge, [8] proposes (in the case of static networks) an “adpative” exosystem generator and an adaptive solution to the regulator equations. In essence, each agent runs an additional consensus algorithm to update their “local estimates” of the exosystem dynamics.
All the regulatorequation based solutions above fall short in addressing the issue of system parameter uncertainty. In practice one may not have precise knowledge of some entries of the system matrices, or over time the values of some parameters drift. The distributed output regulation problem considering parameter uncertainty is studied in [4, 6]. The proposed controller is based on the internal model principle, but does not employ the twopart structure mentioned above. It seems to be for this reason that restrictive conditions (acyclic graph or homogeneous nominal agent dynamics) have to be imposed in order to ensure solving output regulation. Moreover, it is also assumed in [4, 6] that each agent knows the exact model of the exosystem dynamics.
In this paper, we provide a new solution to the distributed output regulation problem of heterogeneous linear systems (generally nonminimum phase), where the agents do not have an accurate dynamic model of the exosystem and the agent dynamics are subject to parameter uncertainty. In particular, we propose to use the twopart structure of the distributed controller in the following manner: The first part is an exosystem generator that works over timevarying networks ([15, 16])^{1}^{1}1This was apparently developed in [15] and in [16] independently. The first versions of [15] and [16] appeared on arXiv.org, with the former three months earlier than the latter. We thank Dr. Liu and Dr. Huang for in a correspondance bringing our attention to their work., and the second part is a dynamic compensator embedding an internal model of the exosystem that addresses parameter uncertainty. The challenge here is, in the design of the dynamic compensator, those agents that cannot directly measure the exosystem have no knowledge of the internal model of the exosystem; on the other hand, we know from [13] that a precise internal model is necessary to achieve perfect regulation with uncertain parameters. To deal with this problem, we propose an extra consensus protocol to update the agents’ local estimates of the internal model of the exosystem; in this process, a unique feature is to avoid certain transmission zeros of the agents’ dynamics in order to guarantee the existence of a dynamic compensator. Inasmuch as the agents gradually ‘learn’ the internal model of the exosystem in a purely distributed fashion, we call this control design ‘distributed internal model principle’ (with reference to the centralized version in [13]).
The main contribution of this paper is the proposed internalmodel based distributed controller, with the novel design of a dynamic compensator that does not require all agents to know the internal model (copy) of the exosystem a priori. This proposed controller provably solves the distributed output regulation problem in which the following constraints/conditions simultaneously hold:
In addition we note that [15] proposes a distributed controller to solve the consensus problem whose design idea is similar to ours. We point out, however, a few important differences. First, the consensus problem is different from the output regulation problem (the former is usually viewed as a special case of the latter with fullstate observation). Second, while [15] deals with a class of nonlinear systems, the eigenvalues of the exosystem are required to be distinct. We do not make such an assumption; thus (i) the set of signals that can be generated by the exosystem is a strict superset of that in [15], and (ii) the minimal polynomial of the exosystem is generally different from the characteristic polynomial. Third, our designed distributed controller is based on the internal model principle, which is different from the controller in [15]. Finally, while the parameter uncertainty considered in [15] is represented by a vector, the uncertainty in this paper is represented by matrices.
The paper is organized as follows. Section II introduces the concept of communication graphs and formulates the distributed output regulation problem. Section III presents the solution distributed controller, and Section IV states our main result. In Section V We design a more general distributed controller. Section VI illustrates our results by a simulation example. Finally, Section VII states our conclusions.
Ii Preliminaries
In this paper, we will use the following notation. Let , and be the identity matrix. For a complex number , denote its complex conjugate by . Write for the closed right half (complex) plane; for the set of all eigenvalues of . We say that a (square) matrix is stable if the real parts of its eigenvalues are negative.
Iia Agents and an exosystem
We consider a network of agents that are linear, timeinvariant, and finitedimensional. The dynamics of each agent is given by
(1)  
(2) 
where is the state vector, the control input, the output to be regulated, and the exogeneous signal generated by the exosystem
(3) 
Here and are real matrices of appropriate sizes. The signal represents reference to be tracked and/or disturbance to be rejected: in (1) represents disturbance acting on the agent ’s dynamics and in (2) represents reference signals to be tracked by agent .
Note that the agents are generally heterogeneous: Each of the matrices and may have different dimensions and entries. Furthermore, we consider that the matricies may have uncertainty; namely
(4) 
where are the nominal parts of agent and are the uncertain parts.
IiB Communication digraphs
Given a multiagent system with agents and an exosystem, we represent the timevarying interconnection among the agents and the exosystem by a digraph , where , , is the node set, and is the edge set. The node , , represents the th agent, and the node 0 the exosystem. Moreover, is the node set including the exosystem and is the node set except for the exosyetm. The th node receives information from the th node at time if and only if . Then the union digraph for a time interval is defined as .
Definition
The digraph uniformly contains a spanning tree if there is such that for every the union digraph contains a spanning tree.
We define the communication weight by if and if . We assume that is piecewise continuous and bounded for all (a technical assumption to be used in Lemma IV below). Note that the exosystem does not receive information from all the agents for all , and thus .
For time and digraph , the graph Laplacian is defined as
where .
IiC Problem statement
We represent by the timevarying interconnection among the agents and the exosystem again. We regard the exosystem as node 0. In particular, at any time only some agents (possibly different across time) can receive information from the exosystem. This differs the current problem from the traditional, centralized output regulation problem [12, 17, 13, 14].
Problem (Distributed Output Regulation Problem)
IiD Example
The literature [8] treat almost the same problem. The literature uses the regulator equations approach. This approach is successful to address noperturbation case. In this section, we show the example that the algorithm in [8] cannot solve a distributed output regulation ploblem if there are perturbation terms. We consider that the exosystem has , and 3 agents have and for all . We also consider the (timeinvariant) network as displayed in Fig. 2. The node numbered 0 is represented the exosystem. Then the initial states are selected uniformly at random from the interval .
Iii Structure of Distributed Controller
At the outset we make the following assumptions.
Assumption
The digraph uniformly contains a spanning tree and its root is node 0 (the exosystem).
Assumption
For each agent , is stabilizable.
Assumption
For each agent , is detectable.
Assumption
For each agent and for every eigenvalues of ,
(5) 
Assumption
The real parts of all eigenvalues of are zeros.
Remark
Assumption III and Assumptions IIIIII are necessary conditions for consensus over timevarying networks [18] and for output regulation [12], respectively. Only Assumption III is a sufficient condition for output regulation; the reason is the following. By [19, 3.4. Discussion], if contains exponentially unstable modes, then one needs to consider stronger connectedness assumptions on the digraph . The exogeneous signal with in the unstable mode diverge exponentially fast. In order to track the diverging signal, the agents must be connected to the exosystem with sufficient large weights. It is difficult to make assumptions satisfying the conditions that the agents can track the exogeneous signal. In this work, we make Assumption III for simplicity.
Remark
Because not all agents can access the exosystem (i.e. cannot be measured by all agents), we cannot use (2) directly. Instead we consider the following (estimated) error vector
(6) 
where is the estimated exogeneous signal. This is in (2) with replace by .
In order to solve Problem IIC, we describe two parts of the structure of the controller: (1) distributed exosystem generator and (2) distributed dynamic compensator.
Iiia Distributed exosystem generator
It is reasonable that each agent has a local estimate of the exosystem’s dynamics since not all agents can access the exosystem. Let be the estimete of and consider
(7)  
(8) 
By using (7) and (8), it is guaranteed under Assumption III that
We show this statement in detail in Section IV below.
IiiB Distributed dynamic compensator
We consider the following dynamic compensator
(9) 
where is defined in (6). Our strategy is the following:

Design and such that .

The desired will ensure.
In order to specify the matrices in (9), we extend the control law in [20, Section 1.3] to the multiagent system setting. Let be the roots of the minimal polynomial of . Note that . Then we define . Let , , be the coefficients of the polynomial satisfying
(10) 
For each agent , let be a local estimate of , and , , the estimated coefficients generated by that satisfy
(11) 
Consider the following consensus algorithm:
(12) 
It follows from Assumption III that as . As a result, the coefficient for each . Note that by Assumption III the entries of are purely imaginary, and hence we only need to consider the initial condition (thus for all ).
Since we consider that the agents may have uncertainty, the regulator equation approach (e.g. [8]) does not work (as shown in Fig. 2). For the robust output regulation problem, we consider the copy internal model as [20, Section 1.3]. Let be the copy internal model, where
(13) 
We state the following lemma using the above matrices.
Lemma
Proof
In Lemma IIIB the sufficient condition (14) means that every does not correspond to the transmission zeros of agent . In (14), is timevarying because it is updated according to (12). Since for all , if agent ’s dynamics has purely imaginary transmission zeros, it is possible that (14) is violated. In order to satisfy (14) for all , we make the following (simpifying) assumption.
Assumption
For every agent , there are no transmission zeros on the imaginary axis, i.e.
for all .
If every agent is minimumphase, then Assumption IIIB is satisfied. In addition, this assumption allows transmission zeros on the open right (complex) plane, thus admitting nonminimumphase system. In the case where Assumption IIIB does not hold, it is a challenge to ensure that (14) holds for all . Nevertheless, in Section V below we shall present a novel strategy to guarantee (14) even in the presence of purely imaginary transmission zeros.
From Lemma IIIB and Assumption IIIB, we may synthesize such that the matrix
(15) 
is stable for all . In addition, we choose such that the matrix is stable under Assumption III.
Now we are ready to present the matrices and in the dynamic compensator (9):
(16) 
Note that in (16), and are timevarying as is timevarying, while is timeinvariant; and by (12) there hold
Using the distributed dynamic compesator (9), we ensure that the estimated error and the output to be regulated converge to 0. For details, see next section.
Iv Main Result
Our main result is the following.
Theorem
Several remarks on Theorem IV are in order.
Remark
For the distributed output regulation problem, Theorem IV extends previous results in the literature in several aspects: The proposed distributed controller (i) employs (timevarying) internal model to deal with system parameter uncertainty (cf. [21, 5, 8]), (ii) needs no a priori knowledge of the exosystem (cf. [21, 4, 5, 6]), and (iii) deals with generally nonminimumphase systems (cf. [7]).
Remark
This algorithm is useful even for the singleagent system. In this case, our approach is the extension of the conventional method [12, 17, 13, 14][20, Section 1.3]. Even if the agent does not know the exosystem at first, the output regulation problem is solvable by the (disributed) exosystem generator and the (distributed) dynamic compensator.
Remark
If the exosystem is a leader agent that possesses computation and communication abilities, then the leader can compute the roots of its own minimal polynomial and send the information to other connected agents. If the exosystem is some entity that cannot compute or communicate, then those agents that can measure the exosystem (in particular know ) compute the corresponding minimal polynomial and the roots, and send the information to the rest of the network.
Remark
For each agent to “learn” the internal model of the exosystem, our strategy is to make the agents reach consensus by (12) for the roots of the exosystem’s minimal polynomial (i.e. eigenvalues of ). It might appear more straightforward to reach consensus for the coefficients of the exosystem’s minimal polynomial; the advantage of updating with (12), nevertheless, is that we may directly guarantee the equality in (5) in Assumption III.
Remark
In (7), there are entries to update and communicate. If the minimal polynomial of equals its characteristic polynomial () and is in the companion form
where , , and are as defined in (10), then each agent does not need to exchange and update the whole . Each agent only needs to exchange and update by (12) and make also in the companion form.
Remark
In the equation (12), we do not need to use all entries of , because the eigenvalues of the real matrices must be in conjugate pairs. Indeed, for all we may write in the following form
where . From this form, each agent can make their entire after exchanging and updating only .
The first lemma states a stability result for a particular type of timevarying systems. This lemma is generalized version of [15, Proof of Lemma 3.1].
Lemma
Consider
(17) 
where are piecewise continuous and bounded on . Suppose that the origin is a uniformly exponentially stable equilibrium of , and , as . Then as .
The second lemma asserts that the distributed exosystem generators proposed in Section IIIA synchronize with the exosystem. This lemma is the same as [15, Lemma 3.1].
Lemma
Now we are ready to prove Theorem IV.
Proof of Theorem IV: Let be the combined state. From (1), (2), (6) and (16), we derive
(18)  
(19) 
where
(20) 
First, we define
and obtain
where
If are all zeros, the upperleft submatrix equals (15) and thus the submatrix is stable. Also is stable. Since and are similar, is stable for all . Even if there exist sufficiently small perturbation terms, from the continuity of eigenvalues (with respect to matrix entries), remains stable for all . This implies that internal stability (the (i) in Problem IIC) is satisfied.
Next, from Assumption III and the above statement we have , and thus the following equations
(21) 
have a unique solution (see e.g. [22, Appendix A]). Let , . Then from (18) and (21) we obtain
From Lemma IV, . Moreover, is stable and . Therefore, from Lemma IV.
V Purely Imaginary Transmission Zeros
In this section, we generalize Theorem IV by designing a distributed controller for the case where Assumption IIIB does not hold, i.e. there exist transmisssion zeros of the agents on the imaginary axis. In this case, because each (vector) is updated continuously, entries of may coincide with the transmission zeros of agent , which would violate (14). Consequently we cannot design with Lemma IIIB.
In order to choose the local estimate satisfying the condition (14) and design , must converge to and at the same time avoid the transmission zeros of the agent . Fig. 3 shows examples of the trajectory of . The circles and the crosses represent respectively the transmission zeros of the agent and the eigenvalue of .
The initial value is in and moves toward . We divide the arrangement of transmission zeros into three cases:

If there is a purely imaginary transmission zero of agent , and there are also other transmission zeros on the open righthalfplane (see Fig. 3(iii)), the radius of semicircle should be smaller than (e.g. half of) the distance between these transmission zeros.
To formalize the above idea, we define several quantities. Let
(22) 
be the set of closed righthalfplane transmission zeros of agent and
(23) 
the subset of purely imaginary transmission zeros. We do not need to avoid open lefthalfplane transmission zeros because . Note that Assumption III and are equivalent, and Assumption IIIB holds if and only if .
We define a new function. For two sets of finite number of complex numbers, define the distance between and by
Then define the radius of the semicircle shown in Fig. 3 as
(24) 
This has three cases:

If there is no purely imaginary transmission zeros of agent , i.e. , then Assumption IIIB holds and simply the radius is zero.

If there are only purely imaginary transmission zeros of agent , i.e. , then the radius is half of the distance between and .

If there are transmission zeros of agent on both purely imaginary axis and righthalfplane, then the radius is half of the smaller of the distance between and and that between and .
In the definition of , we consider the coefficient for simplicity, but we can choose any coefficient from . By using this , we ensure the radius of the semicircle for the three cases as illustrated in Fig. 3.
Then we consider for all ,
(25)  
(26)  
(27) 
where
Note that means the distance between and its closest (purely imaginary) transmission zero, and for all . Moreover from Assumption III, for all . Finally, we update by (26) and (27) instead of (12). It follows immediately from these definitions the following result.
Lemma
Using the above method, we state the main result of this section.
Theorem
Given the multiagent system (1), (2), (4) and the exosystem (3), suppose that Assumptions IIIIII hold and , , , , , are sufficiently small. Then for each agent , the distributed exosystem generator (7), (8) and the distributed dynamic compensator (9) with (16), (25), (26), (27) solve Problem IIC.
Proof
Remark
As in Remark IV, we do not need to use all entries of