In this section, we consider the situation in which agents are allowed to predict the MF and adjust their strategies in real time. We then analyze the effect of estimation error on the results. Due to the existence of random terms, ’s estimation of the current MF based on its actual trajectory may be incorrect, so we consider the situation where ’s estimation of the MF and its strategy change over time.
Consider ’s behavior at any given moment . At time , estimates current MF-S at time as , predicts MF-S and MF-C after time as , , and gives its feedback optimal control corresponding to . ’s control input at time can be represented as .
5.1 Assumptions
A1: estimates ’s average estimation of MF-S at time as , and takes it as the agents’ actual average estimation to determine its strategy.
A2: takes as the actual MF-S to determine its strategy at time , and this criterion is known to all agents.
A3: takes to determine its strategy at time , and this criterion is known to all agents.
A4: believes that .
5.3 Predicted MF under Augmented Information
In this subsection, we consider the situation where ’s predicted MF and strategy , are available to at time , which means that agents share their predictions of the MF and their strategies with each other. Then gives its prediction of the MF based on the augmented information set .
Substituting the optimal control into the dynamics, for , we have
[Equation (58)]
Since , where , we have
[Equation (59)]
When , the actual MF-S satisfies
[Equation (60)]
where according to (57), the actual satisfies
[Equation (61)]
where .
According to A2 and (60), for given at time , , predicts as
[Equation (62)]
According to A3, at time , takes as , as , so , (62) changes to
[Equation (63)]
Thus, when agents share their predictions and strategies with each other, under A2 and A3 the agents’ average prediction of MF-S and average strategy satisfy
[Equation (64)]
We notice that , where satisfies a matrix Riccati differential equation
[Equation (65)]
and satisfies the backward ordinary differential equations (BODEs)
[Equation (66)]
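Since the display for (65) is not reproduced here, the following is only a minimal sketch of how a Riccati differential equation of this general kind can be integrated backward from its terminal condition. It assumes the scalar (1x1) linear-quadratic form -dP/dt = 2aP - (b^2/r)P^2 + q with P(T) = p_terminal. The coefficients `a`, `b`, `q`, `r` and the function name `solve_scalar_riccati` are hypothetical and are not taken from the paper.

```python
import math


def solve_scalar_riccati(a, b, q, r, p_terminal, horizon, steps=2000):
    """Integrate -dP/dt = 2*a*P - (b**2/r)*P**2 + q backward from P(T).

    Assumed scalar LQ form (not the paper's equation (65)).
    Returns [P(0), P(h), ..., P(T)] on a uniform grid with h = T/steps.
    """
    def rhs(p):
        # dP/dt in forward time
        return -(2.0 * a * p - (b * b / r) * p * p + q)

    h = horizon / steps
    p = p_terminal
    values = [p]
    for _ in range(steps):
        # classical RK4 step with step size -h (marching from T down to 0)
        k1 = rhs(p)
        k2 = rhs(p - 0.5 * h * k1)
        k3 = rhs(p - 0.5 * h * k2)
        k4 = rhs(p - h * k3)
        p -= (h / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
        values.append(p)
    values.reverse()  # reorder so that index 0 corresponds to t = 0
    return values


# Hypothetical coefficients chosen so that a closed form is available.
P = solve_scalar_riccati(a=0.0, b=1.0, q=1.0, r=1.0, p_terminal=0.0, horizon=1.0)
gain_at_0 = 1.0 * P[0] / 1.0  # feedback gain b*P(0)/r in the assumed LQ form
```

With a = 0 and b = q = r = 1 the assumed equation reduces to dP/dt = P^2 - 1 with P(T) = 0, whose exact solution is P(t) = tanh(T - t), which gives a convenient check on the integrator. A linear backward ODE such as the BODEs in (66) could be marched from the terminal time in the same way once P is available.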
Remark 5.1 Let . Then we have
[Equation (67)]
and .
5.4 Predicted MF under Restricted Information
In this subsection, we consider ’s strategy at based on the restricted information set and the above assumptions. We show that under A1, A2, A3, and A4, only needs to estimate to give its prediction of MF-S .
Note that (64) can be solved based only on and , so can compute using only . By substituting into (62), can compute , and can then solve (57) for its control. Thus can calculate its strategy under augmented information using only its own estimation of MF-S and the agents’ average estimation of MF-S.
Under A1 and A4, believes all agents have the same correct , so all agents can compute the same correct and under augmented information through (64). Then believes all agents can give their strategies under augmented information by solving (63) and (57), and the game under restricted information is consistent with that under augmented information. predicts and from
[Equation (68)]
We can then state the following theorem.
Theorem 5.1 Suppose A1, A2, A3, and A4 hold. At time , predicted by and can be computed based only on and the parameters .
5.5 Strategies under Restricted Information
Consider ’s strategy under restricted information. The following system gives ’s feedback control at time .
MF-S and MF-C
Predict
predicts by uniquely solving
[Equation (69)]
and can be given by .
can also solve (67) for
Predict -1
predicts by uniquely solving
[Equation (70)]
and can be given by .
Predict
predicts by uniquely solving
[Equation (71)]
and can be given by .
Feedback Control
For computed and , can solve (57) for . Its feedback optimal control is
[Equation (72)]
5.6 Effect of Estimation Error on Predicted Mean Field
In this subsection, we analyze the effect of estimation error on the mean field equilibrium predicted by . We denote the MF-S and MF-C under correct information by and .
We set , . Then according to (6) and (70), we have
[Equation (73)]
We have defined as a basis solution of (16), so can be solved according to . The solution of (73) is given by
[Equation (74)]
So we have . According to (71), we have
[Equation (75)]
The solution of the above equation is given by
[Equation (76)]
where .
Since can be calculated without knowledge of the initial states, has a linear relationship with that is known to all agents. We obtain the following theorem.
Theorem 5.2 has a linear relationship with , and this linear relationship can be computed by all agents without knowing , which is
[Equation (77)]
This theorem gives the deviation between the MF predicted by and the MF under correct information.
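To illustrate why a linear relationship of this type can be computed by every agent without knowing the initial states, the sketch below uses a generic homogeneous linear system de/dt = A(t) e as a stand-in; the paper's actual equations (73) to (77) are not reproduced here. The matrix function `A` and the helper names `transition_matrix` and `apply_map` are hypothetical. The map Phi(t) is obtained from the coefficients alone, and any initial error is then propagated as e(t) = Phi(t) e(0).

```python
import math


def matmul(X, Y):
    """Product of two matrices stored as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def add_scaled(X, c, Y):
    """Return X + c*Y elementwise."""
    return [[x + c * y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def transition_matrix(A, t_end, steps=1000):
    """RK4 integration of dPhi/dt = A(t) Phi with Phi(0) = I.

    Only the coefficient function A is needed, never an initial state.
    """
    n = len(A(0.0))
    phi = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    h = t_end / steps
    t = 0.0
    for _ in range(steps):
        k1 = matmul(A(t), phi)
        k2 = matmul(A(t + 0.5 * h), add_scaled(phi, 0.5 * h, k1))
        k3 = matmul(A(t + 0.5 * h), add_scaled(phi, 0.5 * h, k2))
        k4 = matmul(A(t + h), add_scaled(phi, h, k3))
        phi = [[phi[i][j] + (h / 6.0) * (k1[i][j] + 2.0 * k2[i][j]
                                          + 2.0 * k3[i][j] + k4[i][j])
                for j in range(n)] for i in range(n)]
        t += h
    return phi


def apply_map(M, v):
    """Matrix-vector product: propagate an initial error v through M."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


# Hypothetical coefficient matrix (a rotation generator) with a known answer.
phi = transition_matrix(lambda t: [[0.0, 1.0], [-1.0, 0.0]], 1.0)
exact = [[math.cos(1.0), math.sin(1.0)], [-math.sin(1.0), math.cos(1.0)]]
error_at_t = apply_map(phi, [0.1, -0.2])  # any initial error reuses the same phi
```

The design point is that `phi` is computed once from shared parameters, so agents holding different initial errors all apply the same linear map.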
5.7 Effect of Estimation Error on Feedback Control
In this subsection, we analyze the effect of estimation error on the feedback control law used by . We denote the feedback control under correct information by , and the actual trajectory of by .
We set , then according to (5) and (57), we have
[Equation (78)]
Define as a basis solution of (20); then can be solved according to . Using the method of variation of parameters, the solution of (78) is given by
[Equation (79)]
where .
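As a reminder of what the method of variation of parameters produces, the sketch below treats a generic scalar equation de/dt = a e + f(t) with constant `a`, for which the fundamental solution is Phi(t) = exp(a t) and the solution is e(t) = Phi(t) e(0) + the integral over [0, t] of Phi(t - s) f(s) ds. This is an assumed stand-in, not the paper's equation (78); the names `a`, `f`, `e0`, `variation_of_parameters`, and `integrate_directly` are hypothetical.

```python
import math


def variation_of_parameters(a, f, e0, t, steps=2000):
    """Evaluate e(t) = exp(a*t)*e0 + int_0^t exp(a*(t-s)) f(s) ds.

    The integral uses the composite Simpson rule, so steps must be even.
    """
    h = t / steps
    total = 0.0
    for k in range(steps + 1):
        s = k * h
        if k == 0 or k == steps:
            w = 1.0
        elif k % 2 == 1:
            w = 4.0
        else:
            w = 2.0
        total += w * math.exp(a * (t - s)) * f(s)
    return math.exp(a * t) * e0 + total * h / 3.0


def integrate_directly(a, f, e0, t, steps=2000):
    """RK4 integration of de/dt = a*e + f(t), used as an independent check."""
    h = t / steps
    e, s = e0, 0.0
    for _ in range(steps):
        k1 = a * e + f(s)
        k2 = a * (e + 0.5 * h * k1) + f(s + 0.5 * h)
        k3 = a * (e + 0.5 * h * k2) + f(s + 0.5 * h)
        k4 = a * (e + h * k3) + f(s + h)
        e += (h / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
        s += h
    return e


# Hypothetical data: decay rate -0.5, sinusoidal forcing, unit initial error.
by_formula = variation_of_parameters(-0.5, math.sin, 1.0, 2.0)
by_ode = integrate_directly(-0.5, math.sin, 1.0, 2.0)
```

The two values should agree up to discretization error. The first term of the formula is linear in the initial value, which is the structure the subsequent theorem relies on.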
Applying the conclusion of Theorem 4.1, we obtain the following theorem.
Theorem 5.3 has a linear relationship with , and this linear relationship can be computed by all agents without knowing , which is
[Equation (80)]
According to Theorem 5.3, we have .
Remark 5.2 The relationship between and can be represented as
[Equation (81)]
5.8 Effect of Estimation Error on Actual Mean Field
Let , , then when , we have
[Equation (82)]
Then according to (60), we have
[Equation (83)]
Substituting into the above equation, we have
[Equation (84)]
Using the method of variation of parameters, the solution of (84) is given by
[Equation (85)]
where .
We then obtain the following theorem.
Theorem 5.4 The relationship between and can be represented as
[Equation (86)]
and can be computed by all agents.
Remark 5.3 Notice that when , we have .