\subcaptionsetup

[figure]skip=-6mm,singlelinecheck=off \AtAppendix \AtAppendix \AtAppendix \AtAppendix

Quantum algorithm for large-scale market equilibrium computation

Po-Wei Huang1 and Patrick Rebentrost1, 2
1Centre for Quantum Technologies, National University of Singapore
2Department of Computer Science, National University of Singapore
[email protected], [email protected]
Abstract

Classical algorithms for market equilibrium computation such as proportional response dynamics face scalability issues with Internet-based applications such as auctions, recommender systems, and fair division, despite having an almost linear runtime in terms of the product of buyers and goods. In this work, we provide the first quantum algorithm for market equilibrium computation with sub-linear performance. Our algorithm provides a polynomial runtime speedup in terms of the product of the number of buyers and goods while reaching the same optimization objective value as the classical algorithm. Numerical simulations of a system with 16384 buyers and goods support our theoretical results that our quantum algorithm provides a significant speedup.

1 Introduction

The balance of supply and demand is a fundamental and well-known law that determines the price of goods in a market. In a market with a set of n𝑛nitalic_n buyers and m𝑚mitalic_m goods, the competitive equilibrium [1, 2] determines the optimal price and allocation of goods such that the supply equals the demand in the given market. The computation of the competitive equilibrium is known as the market equilibrium computation problem, whose unique solution was shown to exist under a general model of the economics in the seminal work of Arrow and Debreu [3]. The relevance of such problems in algorithmic game theory [4, 5] is substantiated by the first welfare theorem, which implies that the competitive equilibria are Pareto-efficient [6], where no allocation is available that makes one agent better without making another one worse. In competitive equilibrium from equal income (CEEI) scenarios, such equilibria are further known to by envy-free [7, 8], where no agent would prefer an allocation received by another agent over their own.

The market equilibrium computation problem has, in recent years, been extended to various large-scale Internet-based markets [9], including auction markets [10], fair item allocation/fair division [11, 12, 13], scheduling problems [14] and recommender systems [15]. Such developments call for the need to further develop algorithmic theories for markets and the computation of market equilibria.

We focus on a particular type of market known as the Fisher market [16, 17], where there is a set of n𝑛nitalic_n buyers interested in buying m𝑚mitalic_m infinitely-divisible goods, and where each buyer has their monetary budget that has no intrinsic value apart from being used to purchase goods. We mainly consider Fisher markets with linear utilities, where the total utility gained by purchasing goods is strictly linear to the value and proportion of the goods obtained.

While combinatorial algorithms that can obtain exact and approximate solutions to such solutions have been discovered [18, 19, 20, 21], these algorithms tend to scale poorly against the growing number of buyers and goods. One can otherwise formulate the market equilibrium computation problem as an optimization problem that maximizes a convex objective function known as the Eisenberg-Gale (EG) convex program [22, 23]. Such optimization algorithms can produce approximate solutions much faster than that of combinatorial algorithms. One such example that is commonly used for the market equilibrium problem is the proportional response (PR) dynamics [24, 25]. The PR dynamics is an iterative algorithm that converges with a rate of 1/T1𝑇\nicefrac{{1}}{{T}}/ start_ARG 1 end_ARG start_ARG italic_T end_ARG where T𝑇Titalic_T is the number of iterations. Each iteration of the PR dynamics has a cost of 𝒪(mn)𝒪𝑚𝑛\mathcal{O}(mn)caligraphic_O ( italic_m italic_n ) from proportionally updating individual bids that a buyer should make for different goods.

Given the high number of buyers and goods that can exist in Internet-based markets, the problem of further algorithmic speedups to the computation continues to be an active field of research. Gao and Kroer [26] discovered that by using projected gradient descent instead of PR dynamics, the market equilibrium can be found with linear convergence. Apart from the number of iterations, attempts to reduce the cost per iteration, such as using clustering to reduce the problem size [15], have also been made. However, it is not yet clear whether these methods can provide advantages beyond a constant-factor speedup.

In this work, we consider a Fisher market with n𝑛nitalic_n buyers and m𝑚mitalic_m goods, where the objective is to find an approximate market equilibrium whose EG objective function is within an additive error ϵitalic-ϵ\epsilonitalic_ϵ of the optimal EG objective value. We provide a method to reduce the cost per iteration by utilizing quantum norm estimation and quantum inner product estimation [27, 28] and provide the first quantum algorithm to achieve sublinear performance in terms of the product of the buyers and goods in market equilibrium computation. To arrive at the quantum algorithm, we show an alternate version of the PR dynamics with erroneous updates, which we term the faulty proportional response (FPR) dynamics. We then provide a quantum algorithm that provides a quadratic speedup in terms of the smaller dimension between buyers and goods, as well as less memory consumption, albeit being QRAM instead of classical RAM. We summarize our results in Table 1.

Algorithm Iterations Runtime Memory Result Prep.
PR dynamics [24] logmε𝑚𝜀\displaystyle\frac{\log m}{\varepsilon}divide start_ARG roman_log italic_m end_ARG start_ARG italic_ε end_ARG 𝒪~(mnε)~𝒪𝑚𝑛𝜀\displaystyle\tilde{\mathcal{O}}\left(\frac{mn}{\varepsilon}\right)over~ start_ARG caligraphic_O end_ARG ( divide start_ARG italic_m italic_n end_ARG start_ARG italic_ε end_ARG ) 𝒪(mn)𝒪𝑚𝑛\mathcal{O}(mn)caligraphic_O ( italic_m italic_n ) NA, in RAM
Quantum alg. 2logmε2𝑚𝜀\displaystyle\frac{2\log m}{\varepsilon}divide start_ARG 2 roman_log italic_m end_ARG start_ARG italic_ε end_ARG 𝒪~(mnmax(m,n)ε2)~𝒪𝑚𝑛𝑚𝑛superscript𝜀2\displaystyle\tilde{\mathcal{O}}\left(\frac{\sqrt{mn\max(m,n)}}{\varepsilon^{2% }}\right)over~ start_ARG caligraphic_O end_ARG ( divide start_ARG square-root start_ARG italic_m italic_n roman_max ( italic_m , italic_n ) end_ARG end_ARG start_ARG italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) 𝒪(m+n)𝒪superscript𝑚𝑛\mathcal{O}(m+n)^{*}caligraphic_O ( italic_m + italic_n ) start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT 𝒪(polylogmn)𝒪poly𝑚𝑛\mathcal{O}(\operatorname{poly}\log mn)caligraphic_O ( roman_poly roman_log italic_m italic_n )
Table 1: Main result. n𝑛nitalic_n is the number of buyers, m𝑚mitalic_m is the number of goods, and ε𝜀\varepsilonitalic_ε indicates the additive error of the computed values with minimal-achievable EG objective value. The memory complexity for the quantum algorithm (annotated with *) refers to the use of quantum access to classical memory, which QRAM can achieve (see Definition 1), instead of classical RAM. As the computed competitive equilibrium takes 𝒪(mn)𝒪𝑚𝑛\mathcal{O}(mn)caligraphic_O ( italic_m italic_n ) memory to store, our quantum algorithm does not provide the entire matrix, but instead provide query access to the result. The result preparation column refers to the runtime cost of preparing query access to the competitive equilibrium.

2 Preliminaries

Notations.

Let [n]:={0,1,,n1}assigndelimited-[]𝑛01𝑛1[n]:=\{0,1,\dots,n-1\}[ italic_n ] := { 0 , 1 , … , italic_n - 1 }. We use direct-product\odot to represent element-wise multiplication, as well as direct-sum\oplus for bit-wise XOR operation and tensor-product\otimes for tensor products. For vectors uN𝑢superscript𝑁u\in\mathbb{R}^{N}italic_u ∈ blackboard_R start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT, we denote a vector’s psubscript𝑝\ell_{p}roman_ℓ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT norm by vp:=i=1N|vi|ppassignsubscriptnorm𝑣𝑝𝑝superscriptsubscript𝑖1𝑁superscriptsubscript𝑣𝑖𝑝\|v\|_{p}:=\sqrt[p]{\sum_{i=1}^{N}|v_{i}|^{p}}∥ italic_v ∥ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT := nth-root start_ARG italic_p end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT | italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT end_ARG. Let M×N()subscript𝑀𝑁\mathcal{M}_{M\times N}(\mathbb{R})caligraphic_M start_POSTSUBSCRIPT italic_M × italic_N end_POSTSUBSCRIPT ( blackboard_R ) indicate the space of square matrices of size M×N𝑀𝑁M\times Nitalic_M × italic_N over \mathbb{R}blackboard_R. We denote the i𝑖iitalic_i-th row vector of A𝐴Aitalic_A by Ai,subscript𝐴𝑖A_{i,*}italic_A start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT and the j𝑗jitalic_j-th column vector of A𝐴Aitalic_A by A,jsubscript𝐴𝑗A_{*,j}italic_A start_POSTSUBSCRIPT ∗ , italic_j end_POSTSUBSCRIPT. We further define 𝕀𝕀\mathbb{I}blackboard_I as [0,1]01[0,1][ 0 , 1 ], and the n𝑛nitalic_n-unit simplex as 𝕊nsuperscript𝕊𝑛\mathbb{S}^{n}blackboard_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, i.e. 𝕊n={v𝕀n,v1=1}superscript𝕊𝑛formulae-sequence𝑣superscript𝕀𝑛subscriptnorm𝑣11\mathbb{S}^{n}=\{v\in\mathbb{I}^{n},\|v\|_{1}=1\}blackboard_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = { italic_v ∈ blackboard_I start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , ∥ italic_v ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 }. For sets of numbers, we add the subscript +subscript\cdot_{+}⋅ start_POSTSUBSCRIPT + end_POSTSUBSCRIPT to indicate a constraint on positivity for elements in the set. We use |kket𝑘\ket{k}| start_ARG italic_k end_ARG ⟩ to denote a binary encoding of a real number k𝑘kitalic_k up to arbitrary precision into a quantum state, and |0¯ket¯0\ket{\bar{0}}| start_ARG over¯ start_ARG 0 end_ARG end_ARG ⟩ to denote a multi-qubit zero state whose number of qubits can be inferred from the context. Lastly, we use 𝒪~()~𝒪\mathcal{\tilde{O}}(\cdot)over~ start_ARG caligraphic_O end_ARG ( ⋅ ) to omit polylogarithmic factors in asymptotic runtime/memory analysis.

Quantum computation.

Quantum algorithms are shown to be able to provide asymptotic speedups over classical counterparts [29, 30, 31] by utilizing characteristics of quantum mechanics such as superposition to access data all at once. In this work, the main quantum speedup stems from the fast computation of 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norms and inner products [27, 28], which is in turn powered by a technique known as quantum amplitude estimation (QAE) [32]. Classical approximation algorithms that use Monte Carlo methods for probability estimation up to precision ϵitalic-ϵ\epsilonitalic_ϵ have runtime 𝒪(1/ϵ2)𝒪1superscriptitalic-ϵ2\mathcal{O}(1/\epsilon^{2})caligraphic_O ( 1 / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) due to the concentration of precision being correlated to the variance. On the other hand, QAE can achieve the same precision by reading input at once in superposition and repeatedly amplifying the precision of our estimation, which takes 𝒪(1/ϵ)𝒪1italic-ϵ\mathcal{O}(1/\epsilon)caligraphic_O ( 1 / italic_ϵ ) runtime and provides a quadratic speedup. Many subtle improvements to the QAE algorithm have since been made after its discovery, such as simplifying subroutines [33, 34, 35], restoring the initial state [36, 37] and compensating for bias [38].

Theorem 2.1 (Quantum amplitude estimation; Theorem 2, [39]).

Let t𝑡t\in\mathbb{N}italic_t ∈ blackboard_N. We are given one copy of a quantum state |ψket𝜓\ket{\psi}| start_ARG italic_ψ end_ARG ⟩ as input, as well as a unitary transformation U=I2|ψψ|U=I-2\lvert\psi\rangle\langle\psi\rvertitalic_U = italic_I - 2 | italic_ψ ⟩ ⟨ italic_ψ |, and a unitary transformation V=I2P𝑉𝐼2𝑃V=I-2Pitalic_V = italic_I - 2 italic_P for some projector P𝑃Pitalic_P. There exists a quantum algorithm that outputs a~~𝑎\tilde{a}over~ start_ARG italic_a end_ARG, an estimate of a=P|ψ2𝑎superscriptnorm𝑃ket𝜓2a=\|P\ket{\psi}\|^{2}italic_a = ∥ italic_P | start_ARG italic_ψ end_ARG ⟩ ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, such that

|a~a|2πa(1a)M+π2M2~𝑎𝑎2𝜋𝑎1𝑎𝑀superscript𝜋2superscript𝑀2|\tilde{a}-a|\leq 2\pi\frac{\sqrt{a(1-a)}}{M}+\frac{\pi^{2}}{M^{2}}| over~ start_ARG italic_a end_ARG - italic_a | ≤ 2 italic_π divide start_ARG square-root start_ARG italic_a ( 1 - italic_a ) end_ARG end_ARG start_ARG italic_M end_ARG + divide start_ARG italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG

with probability at least 8/π28superscript𝜋28/\pi^{2}8 / italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, using M𝑀Mitalic_M applications of U𝑈Uitalic_U and V𝑉Vitalic_V each.

In this paper, we use QAE to estimate 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norms and inner products of vectors v𝕀N𝑣superscript𝕀𝑁v\in\mathbb{I}^{N}italic_v ∈ blackboard_I start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT up to a multiplicative error in M𝒪(Nϵln(1δ))𝑀𝒪𝑁italic-ϵ1𝛿M\in\mathcal{O}(\frac{\sqrt{N}}{\epsilon}\ln(\frac{1}{\delta}))italic_M ∈ caligraphic_O ( divide start_ARG square-root start_ARG italic_N end_ARG end_ARG start_ARG italic_ϵ end_ARG roman_ln ( divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ) ) runtime with probability 1δ1𝛿1-\delta1 - italic_δ, invoking a quadratic speedup in both the dimension and the error rate. We defer the formulation and details to Appendix A.

Apart from quantum subroutines that provide speedups, we also require the usage of arithmetic operations such as addition, subtraction, multiplication, and division on quantum computers. We assume the arithmetic model, which would allow us to ignore issues arising from the fixed point representation of numbers111If the fixed point representation with an additive error of μ𝜇\muitalic_μ is considered, the additional multiplicative cost required for operations is then 𝒪(polylog1/μ)𝒪poly1𝜇\mathcal{O}(\operatorname{poly}\log\nicefrac{{1}}{{\mu}})caligraphic_O ( roman_poly roman_log / start_ARG 1 end_ARG start_ARG italic_μ end_ARG ). Considering μΩ(1/poly(m,n))𝜇Ω1poly𝑚𝑛\mu\in\Omega(1/\operatorname{poly}(m,n))italic_μ ∈ roman_Ω ( 1 / roman_poly ( italic_m , italic_n ) ), the additional cost is 𝒪(polylog(m,n))𝒪poly𝑚𝑛\mathcal{O}(\operatorname{poly}\log(m,n))caligraphic_O ( roman_poly roman_log ( italic_m , italic_n ) ), which are polylogarithmic factors that we already omit in this paper.. We further assume that we have access to quantum arithmetic circuits [40, 41] that can perform such arithmetic operations in 𝒪(1)𝒪1\mathcal{O}(1)caligraphic_O ( 1 ) gates, and that by using such circuits, computation of the n𝑛nitalic_n-th power of a number, where n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N, can be achieved in 𝒪(polylogn)𝒪poly𝑛\mathcal{O}\left(\operatorname{poly}\log n\right)caligraphic_O ( roman_poly roman_log italic_n ) gates, using methods like binary exponentiation [42]. We note that quantum arithmetic circuits can be used to execute the same operation on multiple numbers in parallel if the numbers are held in superposition.

Lastly, we need to access the input matrices and intermediate vectors as a superposition of encoded quantum states. Such quantum access to the classical data in memory can be achieved by quantum random access memory (QRAM)222Our memory unit can be more precisely termed QRACM [43, 44] or QROM [45] as opposed to QRAQM [43, 44] or QRAG [46], whose memory registers store quantum states instead of classical numbers. However, both are more commonly and jointly referred to as QRAM in literature. as follows. We refer the reader to [44] for a survey on QRAM.

Definition 1 (Quantum random access memory; [47, 48]).

Let n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N and c𝒪(1)𝑐𝒪1c\in\mathcal{O}(1)italic_c ∈ caligraphic_O ( 1 ). Also let w𝑤witalic_w be a vector of bit strings such that i[n],wi{0,1}cformulae-sequencefor-all𝑖delimited-[]𝑛subscript𝑤𝑖superscript01𝑐\forall i\in[n],w_{i}\in\{0,1\}^{c}∀ italic_i ∈ [ italic_n ] , italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT. A quantum RAM provides access to wisubscript𝑤𝑖w_{i}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT in superposition after a one-time construction cost of 𝒪~(n)~𝒪𝑛\tilde{\mathcal{O}}\left(n\right)over~ start_ARG caligraphic_O end_ARG ( italic_n ), where each access costs 𝒪(polylogn)𝒪poly𝑛\mathcal{O}\left(\operatorname{poly}\log n\right)caligraphic_O ( roman_poly roman_log italic_n ).

Fisher market equilibrium.

In the Fisher market model [16, 17], we are given a market of m𝑚mitalic_m infinitely divisible goods to be divided among n𝑛nitalic_n buyers. Without loss of generality, we assume a unit supply for each good. Each buyer i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] has a budget of Bi>0subscript𝐵𝑖0B_{i}>0italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 that has no intrinsic value apart from being used to purchase goods where, again without loss of generality, we assume B𝕊n𝐵superscript𝕊𝑛B\in\mathbb{S}^{n}italic_B ∈ blackboard_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Each buyer also has a utility function ui:m+:subscript𝑢𝑖superscript𝑚subscriptu_{i}:\mathbb{R}^{m}\to\mathbb{R}_{+}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT that maps an allocation of portions of m𝑚mitalic_m items to a utility value. We can then define the allocation matrix xn×m(+)𝑥subscript𝑛𝑚subscriptx\in\mathcal{M}_{n\times m}(\mathbb{R}_{+})italic_x ∈ caligraphic_M start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT ( blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ) such that xijsubscript𝑥𝑖𝑗x_{ij}italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is the portion of item j𝑗jitalic_j allocated to buyer i𝑖iitalic_i, where ximsubscript𝑥𝑖superscript𝑚x_{i}\in\mathbb{R}^{m}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is the bundle of products allocated to buyer i𝑖iitalic_i. In this paper, we consider linear utility functions such that ui(xi)=j[m]vijxijsubscript𝑢𝑖subscript𝑥𝑖subscript𝑗delimited-[]𝑚subscript𝑣𝑖𝑗subscript𝑥𝑖𝑗u_{i}(x_{i})=\sum_{j\in[m]}v_{ij}x_{ij}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT, where vij>0subscript𝑣𝑖𝑗0v_{ij}>0italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT > 0 is the value for a unit of item j𝑗jitalic_j for buyer i𝑖iitalic_i.

Given the Fisher market, we want to compute its competitive equilibrium, which consists of the price vector pm𝑝superscript𝑚p\in\mathbb{R}^{m}italic_p ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT for each item j𝑗jitalic_j and allocation matrix x𝑥xitalic_x such that each buyer i𝑖iitalic_i exhausts their entire budget Bisubscript𝐵𝑖B_{i}italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to acquire a bundle of items xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT that maximizes each of their utility ui(xi)subscript𝑢𝑖subscript𝑥𝑖u_{i}(x_{i})italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ).

The market equilibrium of Fisher markets can be captured by solving the Eisenberg-Gale (EG) convex program [22, 23]. The program is derived from maximizing the budget-weighted geometric mean of the buyers’ utilities, which satisfies natural properties such as invariance of the optimal solution to rescaling and splitting [49]. Using the log\logroman_log on the geometric mean, the EG program is as follows:

maxx0i[n]Bilogui(xi) s.t. i[n]xij1,j[m].formulae-sequencesubscript𝑥0subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑢𝑖subscript𝑥𝑖 s.t. subscript𝑖delimited-[]𝑛subscript𝑥𝑖𝑗1for-all𝑗delimited-[]𝑚\max_{x\geq 0}\sum_{i\in[n]}B_{i}\log u_{i}(x_{i})\text{ s.t. }\sum_{i\in[n]}x% _{ij}\leq 1,\forall j\in[m].roman_max start_POSTSUBSCRIPT italic_x ≥ 0 end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) s.t. ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≤ 1 , ∀ italic_j ∈ [ italic_m ] . (2.1)

Such convex programs (maximization of a concave function subject to constraints) can be solved by interior point methods [50], but may not scale to large markets. We discuss this further in Section 6.

For the linear Fisher market, an alternative convex program that obtains the same market equilibrium was shown by Shmyrev [51]. Supposing that each buyer i𝑖iitalic_i submits a bid bijsubscript𝑏𝑖𝑗b_{ij}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT for item j𝑗jitalic_j such that the sum of the bid of the buyer matches their budget Bisubscript𝐵𝑖B_{i}italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT such that each buyer i𝑖iitalic_i is allocated xij=bij/pjsubscript𝑥𝑖𝑗subscript𝑏𝑖𝑗subscript𝑝𝑗x_{ij}=b_{ij}/p_{j}italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT of item j𝑗jitalic_j, we have the following convex program:

maxb0ijbijlogvijpj s.t. i[n]bij=pj,j[m];j[m]bij=Bi,i[n].formulae-sequencesubscript𝑏0subscript𝑖𝑗subscript𝑏𝑖𝑗subscript𝑣𝑖𝑗subscript𝑝𝑗 s.t. subscript𝑖delimited-[]𝑛subscript𝑏𝑖𝑗subscript𝑝𝑗formulae-sequencefor-all𝑗delimited-[]𝑚formulae-sequencesubscript𝑗delimited-[]𝑚subscript𝑏𝑖𝑗subscript𝐵𝑖for-all𝑖delimited-[]𝑛\max_{b\geq 0}\sum_{ij}b_{ij}\log\frac{v_{ij}}{p_{j}}\text{ s.t. }\sum_{i\in[n% ]}b_{ij}=p_{j},\forall j\in[m];\sum_{j\in[m]}b_{ij}=B_{i},\forall i\in[n].roman_max start_POSTSUBSCRIPT italic_b ≥ 0 end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG s.t. ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , ∀ italic_j ∈ [ italic_m ] ; ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ∀ italic_i ∈ [ italic_n ] . (2.2)

As the allocation matrix and price vector can be directly computed from and be used to compute the bid matrix, the bid matrix can be used as a direct representation of the market equilibrium itself, and hence, is the output of the algorithms we discuss in our paper.

Proportional response dynamics.

The proportional response (PR) dynamics is an iterative algorithm [24, 25, 52] that obtains the Fisher market equilibrium computation by updating the bids bijsubscript𝑏𝑖𝑗b_{ij}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT submitted by buyer i𝑖iitalic_i for item j𝑗jitalic_j. For each time step, the elements of the price vector pjsubscript𝑝𝑗p_{j}italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are computed by summing the bids for item j𝑗jitalic_j such that pj=ibijsubscript𝑝𝑗subscript𝑖subscript𝑏𝑖𝑗p_{j}=\sum_{i}b_{ij}italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT. The allocation xijsubscript𝑥𝑖𝑗x_{ij}italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is then obtained by taking xij=bij/pjsubscript𝑥𝑖𝑗subscript𝑏𝑖𝑗subscript𝑝𝑗x_{ij}=b_{ij}/p_{j}italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. The buyers then update the bids such that the new bid is proportional to the utility ui=jvijxijsubscript𝑢𝑖subscript𝑗subscript𝑣𝑖𝑗subscript𝑥𝑖𝑗u_{i}=\sum_{j}v_{ij}x_{ij}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT gained in the current time step such that bij(t+1)=Bivijxij(t)/ui(t)superscriptsubscript𝑏𝑖𝑗𝑡1subscript𝐵𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑥𝑖𝑗𝑡superscriptsubscript𝑢𝑖𝑡b_{ij}^{(t+1)}=B_{i}v_{ij}x_{ij}^{(t)}/u_{i}^{(t)}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT / italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. It was shown by Birnbaum et al. [53] that the PR dynamics is equivalent to mirror descent [54, 55] to a Bregman divergence [56] of the Shmyrev convex program.

For ease of discussion, we write the objective function of the EG and Shmyrev convex programs as functions of the bid matrix b𝑏bitalic_b, obtaining the EG objective function Φ(b)=i[n]BiloguiΦ𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑢𝑖\Phi(b)=-\sum_{i\in[n]}B_{i}\log u_{i}roman_Φ ( italic_b ) = - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Shmyrev objective function Ψ(b)=i[n],j[m]bijlogpjvijΨ𝑏subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚subscript𝑏𝑖𝑗subscript𝑝𝑗subscript𝑣𝑖𝑗\Psi(b)=\sum_{i\in[n],j\in[m]}b_{ij}\log\frac{p_{j}}{v_{ij}}roman_Ψ ( italic_b ) = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG. We denote the optimal bid b=argminb𝒮Φ(b)superscript𝑏subscriptargmin𝑏𝒮Φsuperscript𝑏b^{*}=\operatorname{arg\,min}_{b\in\mathcal{S}}\Phi(b^{*})italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = start_OPFUNCTION roman_arg roman_min end_OPFUNCTION start_POSTSUBSCRIPT italic_b ∈ caligraphic_S end_POSTSUBSCRIPT roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), where 𝒮={bn×m(𝕀):j[m]bi,=Bi}𝒮conditional-set𝑏subscript𝑛𝑚𝕀subscript𝑗delimited-[]𝑚subscript𝑏𝑖subscript𝐵𝑖\mathcal{S}=\left\{b\in\mathcal{M}_{n\times m}(\mathbb{I}):\sum_{j\in[m]}b_{i,% *}=B_{i}\right\}caligraphic_S = { italic_b ∈ caligraphic_M start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT ( blackboard_I ) : ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }.

The convergence bounds of the PR dynamics regarding the EG and Shmyrev objective functions for linear Fisher markets were found as follows:

Theorem 2.2 (Convergence of PR dynamics; [53]).

Considering a linear Fisher market, for bij(t)superscriptsubscript𝑏𝑖𝑗𝑡b_{ij}^{(t)}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT as iteratively defined by the proportional response dynamics where bij(0)=Bimsuperscriptsubscript𝑏𝑖𝑗0subscript𝐵𝑖𝑚b_{ij}^{(0)}=\frac{B_{i}}{m}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG, we have

Ψ(b(T))Ψ(b)logmT,Φ(b(T1))Φ(b)logmT.formulae-sequenceΨsuperscript𝑏𝑇Ψsuperscript𝑏𝑚𝑇Φsuperscript𝑏𝑇1Φsuperscript𝑏𝑚𝑇\Psi(b^{(T)})-\Psi(b^{*})\leq\frac{\log m}{T},\quad\Phi(b^{(T-1)})-\Phi(b^{*})% \leq\frac{\log m}{T}.roman_Ψ ( italic_b start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT ) - roman_Ψ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG roman_log italic_m end_ARG start_ARG italic_T end_ARG , roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_T - 1 ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG roman_log italic_m end_ARG start_ARG italic_T end_ARG . (2.3)

An alternate end-to-end proof of the convergence of both convex programs that varies from Birnbaum et al. [53]’s approach and centered around the EG function can be found in Appendix B, elements of which we use in the proof of later sections. Two notable results that we prove and utilize are: 1) Ψ(b(t+1))Φ(b(t))+i[n]BilogBiΨ(b(t))Ψsuperscript𝑏𝑡1Φsuperscript𝑏𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖Ψsuperscript𝑏𝑡\Psi(b^{(t+1)})\leq\Phi(b^{(t)})+\sum_{i\in[n]}B_{i}\log B_{i}\leq\Psi(b^{(t)})roman_Ψ ( italic_b start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) ≤ roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ roman_Ψ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ), and 2) the telescoping sum of the difference of the KL divergence of the optimal bid and the iterating bids can be lower bounded by the difference of the current EG objective function and the optimal EG function.

3 Faulty proportional response dynamics

Before moving on to our quantum algorithm, we propose the faulty proportional response (FPR) dynamics, which computes an erroneous update to compute a sequence of bids b^ij(t)superscriptsubscript^𝑏𝑖𝑗𝑡\hat{b}_{ij}^{(t)}over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT, which still retains a convergence guarantee, serving as a counterpart to Theorem 2.2. We first define a faulty update we use for the FPR dynamics:

Definition 2 (Faulty proportional response update).

Let t0𝑡0t\geq 0italic_t ≥ 0 and b^(t)n×m(+)superscript^𝑏𝑡subscript𝑛𝑚subscript\hat{b}^{(t)}\in\mathcal{M}_{n\times m}(\mathbb{R}_{+})over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∈ caligraphic_M start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT ( blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ). Given ϵp(0,0.5)subscriptitalic-ϵ𝑝00.5\epsilon_{p}\in(0,0.5)italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ∈ ( 0 , 0.5 ) such that j,t,p~j(t)p^j(t)p^j(t)ϵpfor-all𝑗𝑡normsuperscriptsubscript~𝑝𝑗𝑡superscriptsubscript^𝑝𝑗𝑡superscriptsubscript^𝑝𝑗𝑡subscriptitalic-ϵ𝑝\forall j,t,\|\tilde{p}_{j}^{(t)}-\hat{p}_{j}^{(t)}\|\leq\hat{p}_{j}^{(t)}% \epsilon_{p}∀ italic_j , italic_t , ∥ over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∥ ≤ over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT where p^j(t)=i[n]b^ij(t)superscriptsubscript^𝑝𝑗𝑡subscript𝑖delimited-[]𝑛superscriptsubscript^𝑏𝑖𝑗𝑡\hat{p}_{j}^{(t)}=\sum_{i\in[n]}\hat{b}_{ij}^{(t)}over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. Further, given ϵν(0,0.5)subscriptitalic-ϵ𝜈00.5\epsilon_{\nu}\in(0,0.5)italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ∈ ( 0 , 0.5 ) such that i,t,ν~i(t)ν^i(t)ν^j(t)ϵνfor-all𝑖𝑡normsuperscriptsubscript~𝜈𝑖𝑡superscriptsubscript^𝜈𝑖𝑡superscriptsubscript^𝜈𝑗𝑡subscriptitalic-ϵ𝜈\forall i,t,\|\tilde{\nu}_{i}^{(t)}-\hat{\nu}_{i}^{(t)}\|\leq\hat{\nu}_{j}^{(t% )}\epsilon_{\nu}∀ italic_i , italic_t , ∥ over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - over^ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∥ ≤ over^ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT where ν^i(t)=j[m]vijb^ij(t)/p~j(t)superscriptsubscript^𝜈𝑖𝑡subscript𝑗delimited-[]𝑚subscript𝑣𝑖𝑗superscriptsubscript^𝑏𝑖𝑗𝑡superscriptsubscript~𝑝𝑗𝑡\hat{\nu}_{i}^{(t)}=\sum_{j\in[m]}v_{ij}\hat{b}_{ij}^{(t)}/\tilde{p}_{j}^{(t)}over^ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT / over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. A faulty proportional response update of the bids from timestep t𝑡titalic_t to t+1𝑡1t+1italic_t + 1 is then expressed as follows:

x^ij(t)=b^ij(t)p~j(t),b^ij(t+1)=Bivijxij(t)ν~i(t).formulae-sequencesuperscriptsubscript^𝑥𝑖𝑗𝑡superscriptsubscript^𝑏𝑖𝑗𝑡superscriptsubscript~𝑝𝑗𝑡superscriptsubscript^𝑏𝑖𝑗𝑡1subscript𝐵𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑥𝑖𝑗𝑡superscriptsubscript~𝜈𝑖𝑡\hat{x}_{ij}^{(t)}=\frac{\hat{b}_{ij}^{(t)}}{\tilde{p}_{j}^{(t)}},\quad\hat{b}% _{ij}^{(t+1)}=B_{i}\frac{v_{ij}x_{ij}^{(t)}}{\tilde{\nu}_{i}^{(t)}}.over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG , over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG .

Note that p~jsubscript~𝑝𝑗\tilde{p}_{j}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT provides an estimation to the price p^j=i[n]b^ijsubscript^𝑝𝑗subscript𝑖delimited-[]𝑛subscript^𝑏𝑖𝑗\hat{p}_{j}=\sum_{i\in[n]}\hat{b}_{ij}over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT, ν~isubscript~𝜈𝑖\tilde{\nu}_{i}over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT does not provide an estimation to the exact utility u^i=j[m]vijb^ij/p^jsubscript^𝑢𝑖subscript𝑗delimited-[]𝑚subscript𝑣𝑖𝑗subscript^𝑏𝑖𝑗subscript^𝑝𝑗\hat{u}_{i}=\sum_{j\in[m]}v_{ij}\hat{b}_{ij}/\hat{p}_{j}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, but νisubscript𝜈𝑖\nu_{i}italic_ν start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, which replaces p^jsubscript^𝑝𝑗\hat{p}_{j}over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in the computation of uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT with p~jsubscript~𝑝𝑗\tilde{p}_{j}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT.

The convergence bounds of the FPR dynamics regarding the EG objective function for linear Fisher markets were found as follows:

Theorem 3.1 (Convergence of the FPR dynamics).

Considering a linear Fisher market, for bij(t)superscriptsubscript𝑏𝑖𝑗𝑡b_{ij}^{(t)}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT as iteratively defined by the faulty proportional response dynamics where b^ij(0)=Bimsuperscriptsubscript^𝑏𝑖𝑗0subscript𝐵𝑖𝑚\hat{b}_{ij}^{(0)}=\frac{B_{i}}{m}over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG, we have

mint[T]Φ(b^(t))Φ(b)2logmTsubscript𝑡delimited-[]𝑇Φsuperscript^𝑏𝑡Φsuperscript𝑏2𝑚𝑇\min_{t\in[T]}\Phi(\hat{b}^{(t)})-\Phi(b^{*})\leq\frac{2\log m}{T}roman_min start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG 2 roman_log italic_m end_ARG start_ARG italic_T end_ARG

when ϵνlogm8Tsubscriptitalic-ϵ𝜈𝑚8𝑇\epsilon_{\nu}\leq\frac{\log m}{8T}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ≤ divide start_ARG roman_log italic_m end_ARG start_ARG 8 italic_T end_ARG and ϵplogm6Tsubscriptitalic-ϵ𝑝𝑚6𝑇\epsilon_{p}\leq\frac{\log m}{6T}italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≤ divide start_ARG roman_log italic_m end_ARG start_ARG 6 italic_T end_ARG.

A high-level idea of the proof follows from the telescoping sum trick to upper bound the EG objective functions with KL divergence from our proof of PR dynamics but with the consideration of error. We show an end-to-end proof of the convergence of the EG objective function in Appendix C.

Notice that in the FPR dynamics, we do not enforce the monotonicity of the iterations, but instead simply take the minimum value over all iterations. The error terms ϵpsubscriptitalic-ϵ𝑝\epsilon_{p}italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and ϵνsubscriptitalic-ϵ𝜈\epsilon_{\nu}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT in the FPR dynamics are only upper bounded such that the total sum of objective values over T𝑇Titalic_T iterations (plus the original iteration) can be upper bounded by logm𝑚\log mroman_log italic_m plus an accumulated error over T𝑇Titalic_T iterations also within logm𝑚\log mroman_log italic_m. If we enforce the monotonicity of the iterations to take the last iteration, the error would require 𝒪(1/T2)𝒪1superscript𝑇2\mathcal{O}(1/T^{2})caligraphic_O ( 1 / italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) precision and would produce a 𝒪(T3)𝒪superscript𝑇3\mathcal{O}(T^{3})caligraphic_O ( italic_T start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) algorithmic dependency instead of 𝒪(T2)𝒪superscript𝑇2\mathcal{O}(T^{2})caligraphic_O ( italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ).

However, given the formulation of a faulty update, a problem that comes into question is whether the computation of the exact value of the function Φ(b)Φ𝑏\Phi(b)roman_Φ ( italic_b ) is supported, as we do not compute uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT in the process of updating. Without computation of Φ(b)Φ𝑏\Phi(b)roman_Φ ( italic_b ), one can not be sure which iteration of b^(t)superscript^𝑏𝑡\hat{b}^{(t)}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT is the minimum. However, we use the computed value of ν~i(t)superscriptsubscript~𝜈𝑖𝑡\tilde{\nu}_{i}^{(t)}over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT as an estimator for the function Φ(b)Φ𝑏\Phi(b)roman_Φ ( italic_b ). The following result is then obtained.

Theorem 3.2.

Considering a linear Fisher market, for bij(t)superscriptsubscript𝑏𝑖𝑗𝑡b_{ij}^{(t)}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT as iteratively defined by the faulty proportional response dynamics where b^ij(0)=Bimsuperscriptsubscript^𝑏𝑖𝑗0subscript𝐵𝑖𝑚\hat{b}_{ij}^{(0)}=\frac{B_{i}}{m}over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG. Let t=argmaxt[T]Bilogν~i(t)superscript𝑡subscriptargmax𝑡delimited-[]𝑇subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡t^{*}=\operatorname{arg\,max}_{t\in[T]}B_{i}\log\tilde{\nu}_{i}^{(t)}italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = start_OPFUNCTION roman_arg roman_max end_OPFUNCTION start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. Then

Φ(b^(t))Φ(b)2logmTΦsuperscript^𝑏superscript𝑡Φsuperscript𝑏2𝑚𝑇\Phi(\hat{b}^{(t^{*})})-\Phi(b^{*})\leq\frac{2\log m}{T}roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG 2 roman_log italic_m end_ARG start_ARG italic_T end_ARG

when ϵνlogm8Tsubscriptitalic-ϵ𝜈𝑚8𝑇\epsilon_{\nu}\leq\frac{\log m}{8T}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ≤ divide start_ARG roman_log italic_m end_ARG start_ARG 8 italic_T end_ARG and ϵplogm6Tsubscriptitalic-ϵ𝑝𝑚6𝑇\epsilon_{p}\leq\frac{\log m}{6T}italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≤ divide start_ARG roman_log italic_m end_ARG start_ARG 6 italic_T end_ARG.

The proof of this theorem can similarly be found in Appendix C, which has the same proof idea as Theorem 3.1 apart from some slight differences in error handling.

4 Quantum algorithm

We present our quantum algorithm for solving linear Fisher market equilibrium computation based on the FPR dynamics. Our quantum algorithm does not aim to provide speedups in terms of the number of iterations but provides speedups on the iteration cost of the PR dynamics algorithm. Our algorithm, while reducing the runtime in terms of the number of buyers n𝑛nitalic_n or goods m𝑚mitalic_m, increases runtime in terms of the number of iterations T𝑇Titalic_T but as the T𝑇Titalic_T is logarithmically dependent on m𝑚mitalic_m, there is an overall quadratic speedup provided in the smaller of the two dimensions.

In this section, we further assume that vn×m(𝕀)𝑣subscript𝑛𝑚𝕀v\in\mathcal{M}_{n\times m}(\mathbb{I})italic_v ∈ caligraphic_M start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT ( blackboard_I ). We note that the multiplicative scaling of vijsubscript𝑣𝑖𝑗v_{ij}italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT does not affect the bid matrix b𝑏bitalic_b generated in the FPR dynamics as errors are multiplicative. Hence if the values are larger than 1111, we scale down the values by dividing the queried vijsubscript𝑣𝑖𝑗v_{ij}italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT by a number that is larger than maxijvijsubscript𝑖𝑗subscript𝑣𝑖𝑗\max_{ij}v_{ij}roman_max start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT.

To compute the market equilibrium for the Fisher market by the FPR dynamics in the quantum setting, we require the data input of both the budget vector B𝕊n𝐵superscript𝕊𝑛B\in\mathbb{S}^{n}italic_B ∈ blackboard_S start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and the value matrix vn×m(𝕀)𝑣subscript𝑛𝑚𝕀v\in\mathcal{M}_{n\times m}(\mathbb{I})italic_v ∈ caligraphic_M start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT ( blackboard_I ). We assume quantum query access to the budget and vector and value matrix by the index is readily given to us as part of the problem input without having to load classical data into a quantum system. That is, given an index state and ancilla quantum registers we can store the value of the budget and value according to the index in the ancilla register. Note that these operations can be performed in superposition, such that 1mni,j|𝒊|𝒋|0¯1mni,j|𝒊|𝒋|vij1𝑚𝑛subscript𝑖𝑗ket𝒊ket𝒋ket¯01𝑚𝑛subscript𝑖𝑗ket𝒊ket𝒋ketsubscript𝑣𝑖𝑗\frac{1}{\sqrt{mn}}\sum_{i,j}\boldsymbol{\ket{i}}\boldsymbol{\ket{j}}\ket{\bar% {0}}\rightarrow\frac{1}{\sqrt{mn}}\sum_{i,j}\boldsymbol{\ket{i}}\boldsymbol{% \ket{j}}\ket{v_{ij}}divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_m italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT bold_| start_ARG bold_italic_i end_ARG bold_⟩ bold_| start_ARG bold_italic_j end_ARG bold_⟩ | start_ARG over¯ start_ARG 0 end_ARG end_ARG ⟩ → divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_m italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT bold_| start_ARG bold_italic_i end_ARG bold_⟩ bold_| start_ARG bold_italic_j end_ARG bold_⟩ | start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG ⟩.

We do not explicitly state how the data input of the budget and value entries are generated; they could be extracted from entries of a matrix already preloaded in QRAM, or generated/reconstructed from a low-rank approximation of the matrix [15], which takes 𝒪(kpolylogmn)𝒪𝑘poly𝑚𝑛\mathcal{O}\left(k\operatorname{poly}\log mn\right)caligraphic_O ( italic_k roman_poly roman_log italic_m italic_n ) cost to access k𝑘kitalic_k-rank approximations using quantum arithmetic circuits, but with much lower memory consumption333With low-rank approximations, the loading of classical data into QRAM would only take 𝒪~(k(m+n))~𝒪𝑘𝑚𝑛\tilde{\mathcal{O}}(k(m+n))over~ start_ARG caligraphic_O end_ARG ( italic_k ( italic_m + italic_n ) ) runtime.. We note that the low-rank approximation assumption of the value matrix has not yet been utilized to produce reductions in resource consumption in classical methods as the PR dynamics and other methods to compute market equilibrium [26] require all 𝒪(mn)𝒪𝑚𝑛\mathcal{O}(mn)caligraphic_O ( italic_m italic_n ) entries of the full value matrix.

Storing the results of the computed bids b^(t)n×m(𝕀)superscript^𝑏𝑡subscript𝑛𝑚𝕀\hat{b}^{(t)}\in\mathcal{M}_{n\times m}(\mathbb{I})over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∈ caligraphic_M start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT ( blackboard_I ) in QRAM would require a cost of 𝒪~(mn)~𝒪𝑚𝑛\tilde{\mathcal{O}}(mn)over~ start_ARG caligraphic_O end_ARG ( italic_m italic_n ) which would remove all possibility of potential speedups. The same applies to the allocation matrix x^(t)superscript^𝑥𝑡\hat{x}^{(t)}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. Hence, every time we require the usage of b^(t)superscript^𝑏𝑡\hat{b}^{(t)}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT oder x^(t)superscript^𝑥𝑡\hat{x}^{(t)}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT, we compute them on-the-fly as follows:

b^ij(T)=BiT+1vijTmt=0T1p~j(t)k=0T1ν~i(t),x^ij(T)=BiTvijT1mt=0Tp~j(t)k=0T1ν~i(t)formulae-sequencesuperscriptsubscript^𝑏𝑖𝑗𝑇superscriptsubscript𝐵𝑖𝑇1superscriptsubscript𝑣𝑖𝑗𝑇𝑚superscriptsubscriptproduct𝑡0𝑇1superscriptsubscript~𝑝𝑗𝑡superscriptsubscriptproduct𝑘0𝑇1superscriptsubscript~𝜈𝑖𝑡superscriptsubscript^𝑥𝑖𝑗𝑇superscriptsubscript𝐵𝑖𝑇superscriptsubscript𝑣𝑖𝑗𝑇1𝑚superscriptsubscriptproduct𝑡0𝑇superscriptsubscript~𝑝𝑗𝑡superscriptsubscriptproduct𝑘0𝑇1superscriptsubscript~𝜈𝑖𝑡\hat{b}_{ij}^{(T)}=\frac{B_{i}^{T+1}v_{ij}^{T}}{m\prod_{t=0}^{T-1}\tilde{p}_{j% }^{(t)}\prod_{k=0}^{T-1}\tilde{\nu}_{i}^{(t)}},\quad\hat{x}_{ij}^{(T)}=\frac{B% _{i}^{T}v_{ij}^{T-1}}{m\prod_{t=0}^{T}\tilde{p}_{j}^{(t)}\prod_{k=0}^{T-1}% \tilde{\nu}_{i}^{(t)}}over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T + 1 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_ARG start_ARG italic_m ∏ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG , over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_m ∏ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG (4.1)

Given quantum access to the values of Πp(T):=t=0Tp~j(t)assignsuperscriptsubscriptΠ𝑝𝑇superscriptsubscriptproduct𝑡0𝑇superscriptsubscript~𝑝𝑗𝑡\Pi_{p}^{(T)}:=\prod_{t=0}^{T}\tilde{p}_{j}^{(t)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT := ∏ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and Πν(T):=t=0Tν~i(t)assignsuperscriptsubscriptΠ𝜈𝑇superscriptsubscriptproduct𝑡0𝑇superscriptsubscript~𝜈𝑖𝑡\Pi_{\nu}^{(T)}:=\prod_{t=0}^{T}\tilde{\nu}_{i}^{(t)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT := ∏ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT, one can encode the values of b^(T)superscript^𝑏𝑇\hat{b}^{(T)}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT and x^(T)superscript^𝑥𝑇\hat{x}^{(T)}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT into a quantum state in superposition via quantum arithmetic circuits in runtime of 𝒪(polylogTmn)𝒪poly𝑇𝑚𝑛\mathcal{O}\left(\operatorname{poly}\log Tmn\right)caligraphic_O ( roman_poly roman_log italic_T italic_m italic_n ). The quantum access Πp(T)superscriptsubscriptΠ𝑝𝑇\Pi_{p}^{(T)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT and Πν(T)superscriptsubscriptΠ𝜈𝑇\Pi_{\nu}^{(T)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT cost 𝒪(polylogmn)𝒪poly𝑚𝑛\mathcal{O}\left(\operatorname{poly}\log mn\right)caligraphic_O ( roman_poly roman_log italic_m italic_n ) as they are obtained from QRAM, and the operation of taking the T𝑇Titalic_T-th power of the budget and value cost 𝒪(polylogT)𝒪poly𝑇\mathcal{O}\left(\operatorname{poly}\log T\right)caligraphic_O ( roman_poly roman_log italic_T ) by binary exponentiation [42].

The remaining steps are to compute the price vector p~~𝑝\tilde{p}over~ start_ARG italic_p end_ARG and utility vector u~~𝑢\tilde{u}over~ start_ARG italic_u end_ARG in each iteration. Each entry p~jsubscript~𝑝𝑗\tilde{p}_{j}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is the estimation of the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norm of b^,jsubscript^𝑏𝑗\hat{b}_{*,j}over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT ∗ , italic_j end_POSTSUBSCRIPT and each entry ν~isubscript~𝜈𝑖\tilde{\nu}_{i}over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the estimation of the inner product between x^i,subscript^𝑥𝑖\hat{x}_{i,*}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT and vi,subscript𝑣𝑖v_{i,*}italic_v start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT, which can both be obtained using amplitude estimation. Πp(t)superscriptsubscriptΠ𝑝𝑡\Pi_{p}^{(t)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and Πν(t)superscriptsubscriptΠ𝜈𝑡\Pi_{\nu}^{(t)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT can then be iteratively updated by multiplying by the values of p~jsubscript~𝑝𝑗\tilde{p}_{j}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and ν~isubscript~𝜈𝑖\tilde{\nu}_{i}over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT each iteration. The full algorithm is shown in Algorithm 1.

Input: Quantum access to B𝐵Bitalic_B and v𝑣vitalic_v, Timestep T𝑇Titalic_T, Price error ϵpsubscriptitalic-ϵ𝑝\epsilon_{p}italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, Utility error ϵνsubscriptitalic-ϵ𝜈\epsilon_{\nu}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT
Output: Query access to the values of a bid matrix estimator b^(t)superscript^𝑏superscript𝑡\hat{b}^{(t^{*})}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT constructed using values in QRAM and budget and value access
1 maxEGVal=infmaxEGValinfimum\texttt{maxEGVal}=-\infmaxEGVal = - roman_inf, bij(0)=Bimsuperscriptsubscript𝑏𝑖𝑗0subscript𝐵𝑖𝑚b_{ij}^{(0)}=\frac{B_{i}}{m}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG
2 for t=0𝑡0t=0italic_t = 0 to T𝑇Titalic_T do
3       for j=0𝑗0j=0italic_j = 0 to m𝑚mitalic_m do
4             p~j(t)=(1±ϵp)b^,j(t)1superscriptsubscript~𝑝𝑗𝑡plus-or-minus1subscriptitalic-ϵ𝑝subscriptnormsuperscriptsubscript^𝑏𝑗𝑡1\tilde{p}_{j}^{(t)}=(1\pm\epsilon_{p})\left\|\hat{b}_{*,j}^{(t)}\right\|_{1}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ( 1 ± italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ∥ over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT ∗ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT via q norm estimation with success prob. 1δ2mT1𝛿2𝑚𝑇1-\frac{\delta}{2mT}1 - divide start_ARG italic_δ end_ARG start_ARG 2 italic_m italic_T end_ARG
5            
6      Store vector Πp(t)=p~(t)Πp(t1)superscriptsubscriptΠ𝑝𝑡direct-productsuperscript~𝑝𝑡superscriptsubscriptΠ𝑝𝑡1\Pi_{p}^{(t)}=\tilde{p}^{(t)}\odot\Pi_{p}^{(t-1)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = over~ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ⊙ roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT into QRAM
7       Gain access to x^ij(t)superscriptsubscript^𝑥𝑖𝑗𝑡\hat{x}_{ij}^{(t)}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT via Πp(t)superscriptsubscriptΠ𝑝𝑡\Pi_{p}^{(t)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and Πν(t1)superscriptsubscriptΠ𝜈𝑡1\Pi_{\nu}^{(t-1)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT in QRAM
8       for i=0𝑖0i=0italic_i = 0 to n𝑛nitalic_n do
9             ν~i(t)=(1±ϵν)xi,(t),vi,superscriptsubscript~𝜈𝑖𝑡plus-or-minus1subscriptitalic-ϵ𝜈superscriptsubscript𝑥𝑖𝑡subscript𝑣𝑖\tilde{\nu}_{i}^{(t)}=(1\pm\epsilon_{\nu})\left\langle x_{i,*}^{(t)},v_{i,*}\right\rangleover~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ( 1 ± italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) ⟨ italic_x start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , italic_v start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT ⟩ via q inner product estimation with success prob. 1δ2nT1𝛿2𝑛𝑇1-\frac{\delta}{2nT}1 - divide start_ARG italic_δ end_ARG start_ARG 2 italic_n italic_T end_ARG
10            
11      Store vector Πν(t)=ν~(t)Πν(t1)superscriptsubscriptΠ𝜈𝑡direct-productsuperscript~𝜈𝑡superscriptsubscriptΠ𝜈𝑡1\Pi_{\nu}^{(t)}=\tilde{\nu}^{(t)}\odot\Pi_{\nu}^{(t-1)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = over~ start_ARG italic_ν end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ⊙ roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT into QRAM
12       Gain access to b^ij(t+1)superscriptsubscript^𝑏𝑖𝑗𝑡1\hat{b}_{ij}^{(t+1)}over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT via Πp(t)superscriptsubscriptΠ𝑝𝑡\Pi_{p}^{(t)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and Πν(t)superscriptsubscriptΠ𝜈𝑡\Pi_{\nu}^{(t)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT in QRAM
13       Classically compute Φ~(t)=i[n]Bilog(νi(t))superscript~Φ𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript𝜈𝑖𝑡\tilde{\Phi}^{(t)}=\sum_{i\in[n]}B_{i}\log(\nu_{i}^{(t)})over~ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ( italic_ν start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT )
14       if Φ~(t)>maxEGValsuperscript~Φ𝑡maxEGVal\tilde{\Phi}^{(t)}>\texttt{maxEGVal}over~ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT > maxEGVal then
15             maxEGVal=Φ~,bestPiP=Πp(t1),bestPiNu=Πν(t1)formulae-sequencemaxEGVal~Φformulae-sequencebestPiPsuperscriptsubscriptΠ𝑝𝑡1bestPiNusuperscriptsubscriptΠ𝜈𝑡1\texttt{maxEGVal}=\tilde{\Phi},\,\texttt{bestPiP}=\Pi_{p}^{(t-1)},\,\texttt{% bestPiNu}=\Pi_{\nu}^{(t-1)}maxEGVal = over~ start_ARG roman_Φ end_ARG , bestPiP = roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT , bestPiNu = roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT
16            
17      
18return bestPiP and bestPiNu in QRAM
Algorithm 1 Quantum faulty proportional response dynamics
Theorem 4.1 (Quantum algorithm for faulty proportional response dynamics).

Let δ(0,0.5),n,m,Tformulae-sequence𝛿00.5𝑛𝑚𝑇\delta\in(0,0.5),n,m,T\in\mathbb{N}italic_δ ∈ ( 0 , 0.5 ) , italic_n , italic_m , italic_T ∈ blackboard_N, ϵplogm8Tsubscriptitalic-ϵ𝑝𝑚8𝑇\epsilon_{p}\leq\frac{\log m}{8T}italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≤ divide start_ARG roman_log italic_m end_ARG start_ARG 8 italic_T end_ARG, and ϵν16Tsubscriptitalic-ϵ𝜈16𝑇\epsilon_{\nu}\leq\frac{1}{6T}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 6 italic_T end_ARG. Given quantum access to B𝐵Bitalic_B and v𝑣vitalic_v, and access to QRAM, with success probability 1δ1𝛿1-\delta1 - italic_δ, Algorithm 1 produces values stored in QRAM such that query access to the values of b^(t)superscript^𝑏superscript𝑡\hat{b}^{(t^{*})}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT can be constructed, where

Φ(b^(t))Φ(b)2logmT,Φsuperscript^𝑏superscript𝑡Φsuperscript𝑏2𝑚𝑇\Phi(\hat{b}^{(t^{*})})-\Phi(b^{*})\leq\frac{2\log m}{T},roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG 2 roman_log italic_m end_ARG start_ARG italic_T end_ARG ,

with 𝒪~(T2mnmax(m,n)log1/δ)~𝒪superscript𝑇2𝑚𝑛𝑚𝑛1𝛿\tilde{\mathcal{O}}(T^{2}\sqrt{mn\max(m,n)}\log\nicefrac{{1}}{{\delta}})over~ start_ARG caligraphic_O end_ARG ( italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT square-root start_ARG italic_m italic_n roman_max ( italic_m , italic_n ) end_ARG roman_log / start_ARG 1 end_ARG start_ARG italic_δ end_ARG ) runtime and 𝒪~(m+n)~𝒪𝑚𝑛\tilde{\mathcal{O}}(m+n)over~ start_ARG caligraphic_O end_ARG ( italic_m + italic_n ) QRAM space. To provide query access to b^(t)superscript^𝑏superscript𝑡\hat{b}^{(t^{*})}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT, an cost of 𝒪(polylogTmn)𝒪poly𝑇𝑚𝑛\mathcal{O}\left(\operatorname{poly}\log Tmn\right)caligraphic_O ( roman_poly roman_log italic_T italic_m italic_n ) is incurred from accessing Πp(t1)superscriptsubscriptΠ𝑝superscript𝑡1\Pi_{p}^{(t^{*}-1)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - 1 ) end_POSTSUPERSCRIPT and Πν(t1)superscriptsubscriptΠ𝜈superscript𝑡1\Pi_{\nu}^{(t^{*}-1)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - 1 ) end_POSTSUPERSCRIPT in QRAM.

Proof.

Per union bound [57], we find that the total success probability is at least 1δ1𝛿1-\delta1 - italic_δ. Note that the output of Algorithm 1 of bestPiP and bestPiNu corresponds to the values of Πp(t1)superscriptsubscriptΠ𝑝superscript𝑡1\Pi_{p}^{(t^{*}-1)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - 1 ) end_POSTSUPERSCRIPT and Πν(t1)superscriptsubscriptΠ𝜈superscript𝑡1\Pi_{\nu}^{(t^{*}-1)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - 1 ) end_POSTSUPERSCRIPT that can be used to construct b(t)superscript𝑏superscript𝑡b^{(t^{*})}italic_b start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT per Equation 4.1. This gives us the guarantee of convergence shown in Theorem 3.2.

Moving to the runtime analysis, the quantum norm estimation subroutine takes 𝒪(T2nlogmTδ)𝒪superscript𝑇2𝑛𝑚𝑇𝛿\mathcal{O}(T^{2}\sqrt{n}\log\frac{mT}{\delta})caligraphic_O ( italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT square-root start_ARG italic_n end_ARG roman_log divide start_ARG italic_m italic_T end_ARG start_ARG italic_δ end_ARG ) for mT𝑚𝑇mTitalic_m italic_T iterations, while quantum inner product estimation takes 𝒪(T2mlognTδ)𝒪superscript𝑇2𝑚𝑛𝑇𝛿\mathcal{O}(T^{2}\sqrt{m}\log\frac{nT}{\delta})caligraphic_O ( italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT square-root start_ARG italic_m end_ARG roman_log divide start_ARG italic_n italic_T end_ARG start_ARG italic_δ end_ARG ) for nT𝑛𝑇nTitalic_n italic_T iterations, resulting in a runtime of 𝒪~(T2mnmax(m,n)log1δ)~𝒪superscript𝑇2𝑚𝑛𝑚𝑛1𝛿\tilde{\mathcal{O}}(T^{2}\sqrt{mn\max(m,n)}\log\frac{1}{\delta})over~ start_ARG caligraphic_O end_ARG ( italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT square-root start_ARG italic_m italic_n roman_max ( italic_m , italic_n ) end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ). For uses of QRAM, the construction on Lines 5 and 9, is a one-time cost of 𝒪~(n)~𝒪𝑛\tilde{\mathcal{O}}(n)over~ start_ARG caligraphic_O end_ARG ( italic_n ) and 𝒪~(m)~𝒪𝑚\tilde{\mathcal{O}}(m)over~ start_ARG caligraphic_O end_ARG ( italic_m ), respectively, with a total runtime of 𝒪~(T(m+n))~𝒪𝑇𝑚𝑛\tilde{\mathcal{O}}(T(m+n))over~ start_ARG caligraphic_O end_ARG ( italic_T ( italic_m + italic_n ) ). The classical computation of the EG value in Line 11 costs 𝒪(Tn)𝒪𝑇𝑛\mathcal{O}(Tn)caligraphic_O ( italic_T italic_n ). We note that the quantum norm and inner product estimation subroutine is the main bottleneck of the algorithm, and hence the total runtime is then 𝒪~(T2mnmax(m,n)log1δ)~𝒪superscript𝑇2𝑚𝑛𝑚𝑛1𝛿\tilde{\mathcal{O}}(T^{2}\sqrt{mn\max(m,n)}\log\frac{1}{\delta})over~ start_ARG caligraphic_O end_ARG ( italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT square-root start_ARG italic_m italic_n roman_max ( italic_m , italic_n ) end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ).

For the memory complexity, for the t𝑡titalic_t-th iteration, we require 6 vectors in QRAM: the current iteration Πp(t)superscriptsubscriptΠ𝑝𝑡\Pi_{p}^{(t)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and Πν(t)superscriptsubscriptΠ𝜈𝑡\Pi_{\nu}^{(t)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT, the best iteration bestPiP and bestPiNu and the previous iteration Πp(t1)superscriptsubscriptΠ𝑝𝑡1\Pi_{p}^{(t-1)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT and Πν(t1)superscriptsubscriptΠ𝜈𝑡1\Pi_{\nu}^{(t-1)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT, in case we need to update bestPiP and bestPiNu. Note that to update the best iteration, we simply reroute the register of the previous iteration to being the best iteration. There is no need to copy data or reconstruct a new QRAM as the data from the previous iteration is no longer needed in the next iteration. Therefore, the memory is 𝒪~(m+n)~𝒪𝑚𝑛\tilde{\mathcal{O}}(m+n)over~ start_ARG caligraphic_O end_ARG ( italic_m + italic_n ) for storing the 6 vectors. ∎

5 Numerical simulations

We simulate the market equilibrium computation under PR dynamics and our quantum algorithm. To showcase the effects of quantum speedups, we fixed the number of queries to all bid matrices b(t)superscript𝑏𝑡b^{(t)}italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and observed the reduction of the objective value over the number of queries.

As an actual simulation of amplitude estimation using quantum gates over multiple qubits is costly, we directly compute the probability vector of Pr[Z=z]Pr𝑍𝑧\Pr[Z=z]roman_Pr [ italic_Z = italic_z ] for z[M]𝑧delimited-[]𝑀z\in[M]italic_z ∈ [ italic_M ] that one would obtain by amplitude estimation [32] for a target value a𝑎aitalic_a,

Pr[Z=z]=sin2(MΔzπ)M2sin2(Δzπ)Pr𝑍𝑧superscript2𝑀subscriptΔ𝑧𝜋superscript𝑀2superscript2subscriptΔ𝑧𝜋\Pr[Z=z]=\frac{\sin^{2}(M\Delta_{z}\pi)}{M^{2}\sin^{2}(\Delta_{z}\pi)}roman_Pr [ italic_Z = italic_z ] = divide start_ARG roman_sin start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_M roman_Δ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT italic_π ) end_ARG start_ARG italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_sin start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Δ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT italic_π ) end_ARG (5.1)

where Δz=min(|zsin1(a)/π|,|1z+sin1(a)/π|)subscriptΔ𝑧𝑧superscript1𝑎𝜋1𝑧superscript1𝑎𝜋\Delta_{z}=\min(\lvert z-\sin^{-1}(\sqrt{a})/\pi\rvert,\lvert 1-z+\sin^{-1}(% \sqrt{a})/\pi\rvert)roman_Δ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT = roman_min ( | italic_z - roman_sin start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( square-root start_ARG italic_a end_ARG ) / italic_π | , | 1 - italic_z + roman_sin start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( square-root start_ARG italic_a end_ARG ) / italic_π | ), and M𝑀Mitalic_M is the number of times that call the unitaries U𝑈Uitalic_U and V𝑉Vitalic_V in QAE (see Theorem 2.1), and is linearly correlated to the runtime We then sample the output according to the computed probabilities to obtain an estimator a~=sin2(πzM)~𝑎superscript2𝜋𝑧𝑀\tilde{a}=\sin^{2}(\pi\frac{z}{M})over~ start_ARG italic_a end_ARG = roman_sin start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_π divide start_ARG italic_z end_ARG start_ARG italic_M end_ARG ).

For our experiments, we generate data the input data v𝑣vitalic_v where the value v𝑣vitalic_v is sampled from a uniform distribution with range [0,1)01[0,1)[ 0 , 1 ) and a normal distribution 𝒩(0.5,0.25)𝒩0.50.25\mathcal{N}(0.5,0.25)caligraphic_N ( 0.5 , 0.25 ), where we resample values that fall outside the range of [0,1]01[0,1][ 0 , 1 ]. For the budget B𝐵Bitalic_B, we either sample from the same distribution as the value matrix or set the same budget for all buyers to simulate competitive equilibrium from equal income (CEEI) applications. Our simulation includes n=16384𝑛16384n=16384italic_n = 16384 buyers, m=16384𝑚16384m=16384italic_m = 16384 goods, and iterate for T=16𝑇16T=16italic_T = 16 iterations for the PR dynamics. For the quantum algorithm, note that the queries per iteration would be reduced by n𝑛\sqrt{n}square-root start_ARG italic_n end_ARG if we use an actual quantum computer, hence increasing the number of iterations to fix the number of queries. For amplitude estimation, we run for Tn=512𝑇𝑛512\sqrt{T\sqrt{n}}=512square-root start_ARG italic_T square-root start_ARG italic_n end_ARG end_ARG = 512 iterations and set M𝒪(Tn)𝑀𝒪𝑇𝑛M\in\mathcal{O}(\sqrt{T\sqrt{n}})italic_M ∈ caligraphic_O ( square-root start_ARG italic_T square-root start_ARG italic_n end_ARG end_ARG ). As the classical algorithms are deterministic, we rerun our quantum algorithm over 15151515 times with the same sample of B𝐵Bitalic_B and v𝑣vitalic_v to observe the variance of convergence progress. Experimental results are shown in Figure 1(a). Details on implementation and further experimental setup are found in Appendix D.

(a)
Refer to caption
(b)
Refer to caption
Figure 1: Experimental results. We perform a on n=16384𝑛16384n=16384italic_n = 16384 buyers and goods given the same amount of queries for all algorithms. We observe in Figure 1(a) that over different distributions, our quantum algorithm (green) significantly outperforms the PR dynamics (blue), which aligns with our theoretical results. Furthermore, our results also show that both our quantum algorithm and the PR dynamics outperform projected gradient descent (orange) in the mid-accuracy regime. Figure 1(b) shows the convergence of a single run of the quantum algorithm despite its instability from faulty updates, as well as the variance over the multiple runs (shaded in grey).

From the plots of Figure 1, we note that the results fit our theoretical results in that the quantum algorithm converges much faster than that of the PR dynamics [24]. Further, we also compare against the convergence of projected gradient descent, which supports empirical results by Gao and Kroer [26] that in the regime of mid-level accuracy and low iterations, PR dynamics-related algorithms, both classical and quantum, converge faster than projected gradient descent.

6 Discussion

Quasi-linear utilities.

For the bulk of our paper, we focus on the setting of linear utilities for Fisher markets. However, applications of market equilibrium computation in large-scale Fisher markets involve mostly quasi-linear utilities [9]. An approach for using PR dynamics for quasi-linear utilities proposed by Gao and Kroer [26]444There is another method proposed by Cheung et al. [58], which we find difficult to convert to quantum due to its use of thresholding, which would cause problems with faulty updates from the FPR dynamics. includes the usage of slack variables δ=(δ1,,δm)𝛿subscript𝛿1subscript𝛿𝑚\delta=(\delta_{1},\cdots,\delta_{m})italic_δ = ( italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) that represent the buyers’ leftover budgets. The PR updates are then modified as follows:

bij(t+1)=Bivijxij(t)jvijxij(t)+δi(t),δi(t+1)=Biδi(t)jvijxij(t)+δi(t).formulae-sequencesuperscriptsubscript𝑏𝑖𝑗𝑡1subscript𝐵𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑥𝑖𝑗𝑡subscriptsuperscript𝑗subscript𝑣𝑖superscript𝑗superscriptsubscript𝑥𝑖superscript𝑗𝑡superscriptsubscript𝛿𝑖𝑡superscriptsubscript𝛿𝑖𝑡1subscript𝐵𝑖superscriptsubscript𝛿𝑖𝑡subscriptsuperscript𝑗subscript𝑣𝑖superscript𝑗superscriptsubscript𝑥𝑖superscript𝑗𝑡superscriptsubscript𝛿𝑖𝑡b_{ij}^{(t+1)}=B_{i}\frac{v_{ij}x_{ij}^{(t)}}{\sum_{j^{\prime}}v_{ij^{\prime}}% x_{ij^{\prime}}^{(t)}+\delta_{i}^{(t)}},\;\delta_{i}^{(t+1)}=B_{i}\frac{\delta% _{i}^{(t)}}{\sum_{j^{\prime}}v_{ij^{\prime}}x_{ij^{\prime}}^{(t)}+\delta_{i}^{% (t)}}.italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT + italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG , italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT + italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG . (6.1)

Further, PR dynamics for quasi-linear utilities exhibit a convergence rate of 𝒪(log(m+1)/T)𝒪𝑚1𝑇\mathcal{O}(\log(m+1)/T)caligraphic_O ( roman_log ( italic_m + 1 ) / italic_T ). Using the methods discussed in previous sections, the quasi-linear version of PR dynamics can then be readily adapted to its quantum version by employing the same techniques of computing and storing in QRAM the values of Πp(t)superscriptsubscriptΠ𝑝𝑡\Pi_{p}^{(t)}roman_Π start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and Πν(t)superscriptsubscriptΠ𝜈𝑡\Pi_{\nu}^{(t)}roman_Π start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT in conjunction with on-the-fly computation of b^(t)superscript^𝑏𝑡\hat{b}^{(t)}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT, x^(t)superscript^𝑥𝑡\hat{x}^{(t)}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and δ^(t)superscript^𝛿𝑡\hat{\delta}^{(t)}over^ start_ARG italic_δ end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT.

Constant number of buyers.

Notice that our quantum algorithm provides a quadratic speedup on the smaller value in regards to the number of buyers n𝑛nitalic_n and number of goods m𝑚mitalic_m. Therefore given extreme cases where the number of buyers n𝒪(1)𝑛𝒪1n\in\mathcal{O}(1)italic_n ∈ caligraphic_O ( 1 ), our algorithm does not provide a speedup. However, in such cases, quantum speedups may still be obtained simply by removing the amplitude estimation step for estimating the price for each item and replacing it with using quantum arithmetic circuits to compute the exact sum. We use a total of 𝒪(nTpolylog(T,m,n))𝒪𝑛𝑇poly𝑇𝑚𝑛\mathcal{O}(nT\operatorname{poly}\log(T,m,n))caligraphic_O ( italic_n italic_T roman_poly roman_log ( italic_T , italic_m , italic_n ) ) qubits to compute the values of bi,subscript𝑏𝑖b_{i,*}italic_b start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT separately on the i𝑖iitalic_i-th set of qubits, and only conduct amplitude estimation when estimating the utility value for each buyer. Given that in this setting, n𝒪(1)𝑛𝒪1n\in\mathcal{O}(1)italic_n ∈ caligraphic_O ( 1 ), the total runtime would then be 𝒪(T2mlog1/δ)𝒪superscript𝑇2𝑚1𝛿\mathcal{O}\left(T^{2}\sqrt{m}\log\nicefrac{{1}}{{\delta}}\right)caligraphic_O ( italic_T start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT square-root start_ARG italic_m end_ARG roman_log / start_ARG 1 end_ARG start_ARG italic_δ end_ARG ), gaining a quadratic speedup over the number of goods m𝑚mitalic_m.

Dequantization.

Given the work in recent years towards the development of quantum-inspired classical algorithms [59, 60, 61, 62] that achieve similar performances as quantum algorithms using sampling-based techniques, a natural question that arises is whether our algorithm can be “de-quantized”. The main speedup in our algorithm stems from the usage of estimation of 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norms and inner products. While the use of sampling techniques can indeed provide inner product estimations, they retain the same 𝒪(1/ϵ2)𝒪1superscriptitalic-ϵ2\mathcal{O}(1/\epsilon^{2})caligraphic_O ( 1 / italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) dependency instead of the 𝒪(1/ϵ)𝒪1italic-ϵ\mathcal{O}(1/\epsilon)caligraphic_O ( 1 / italic_ϵ ) dependency of QAE. Hence, our algorithm performance may be hard to replicate in classical settings.

On the other hand, while it has been suggested that the computation of market equilibrium may benefit from low-rank approximations [15], methods of using such properties to accelerate the computation of gradients have not been proposed, given that the update of the PR dynamics rely on element-wise multiplication of matrices instead matrix multiplication. This would suggest that using sampling techniques to accelerate updates would be similarly difficult.

Potential and limitations for further quantum speedups.

Our quantum algorithm shares similarities to other quantum algorithms that are based on the multiplicative weight update (MWU) method [63, 64]. Such methods have found success in obtaining quantum speedups for LPs [65] and SDPs [66, 67, 68, 69], which have been extended to applications such as zero-sum games [65, 70], quadratic binary optimization [71], and financial applications [28, 72]. Apart from the MWU-esque PR dynamics, various other methods for computing market equilibrium have also been proposed. Can quantum speedups obtained from these methods exceed those of our quantum algorithm?

Tracing back to the roots of the EG convex program [22, 23] and Shmyrev convex program [51], it is well known that such programs can be solved in polynomial time with interior-point methods (IPM) [50]. However, as IPMs require using linear solvers as subroutines, and as there is no guarantee of well-conditioned systems, the quantum linear systems solver [73, 74] may not provide significant speedup. Therefore, it may be unlikely that quantum IPMs [75] can provide significant speedups.

First-order methods such as the Frank-Wolfe (FW) algorithm [76] and projected gradient descent (PGD) have also been discussed as candidates for solving market equilibrium [26], with PGD achieving linear convergence classically. While PGD obtains a superior asymptotic convergence rate in terms of the error ϵitalic-ϵ\epsilonitalic_ϵ compared to PR dynamics, as our quantum speedups stem from faster computations of results within a single iteration, it may be harder to find such speedups for PGD as there has been no evidence for quantum speedups in projections onto a simplex [77, 78] as required.

On the other hand, the FW algorithm has been shown to provide quantum speedups for regression [79, 80]. However, convergence results of FW [81, 82] show that Φ(b(T))Φ(b)CΦ/(T+2)Φsuperscript𝑏𝑇Φsuperscript𝑏subscript𝐶Φ𝑇2\Phi(b^{(T)})-\Phi(b^{*})\leq C_{\Phi}/(T+2)roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ italic_C start_POSTSUBSCRIPT roman_Φ end_POSTSUBSCRIPT / ( italic_T + 2 ), where CΦsubscript𝐶ΦC_{\Phi}italic_C start_POSTSUBSCRIPT roman_Φ end_POSTSUBSCRIPT can be shown to be 𝒪(n)𝒪𝑛\mathcal{O}(n)caligraphic_O ( italic_n ) by computing relevant values. The number of iterations T𝑇Titalic_T required for convergence to additive error ε𝜀\varepsilonitalic_ε is then 𝒪(n/ε)𝒪𝑛𝜀\mathcal{O}(n/\varepsilon)caligraphic_O ( italic_n / italic_ε ) as compared to 𝒪(logm/ε)𝒪𝑚𝜀\mathcal{O}(\log m/\varepsilon)caligraphic_O ( roman_log italic_m / italic_ε ) of PR dynamics. This matches the results of Gao and Kroer [26], which show that FW has slow convergence empirically for market equilibrium computation. Prior no-go results suggest that quantum algorithms cannot provide speedups for the number of iterations T𝑇Titalic_T when T𝑇Titalic_T is independent of the problem dimension [83, 84]. Assuming no quantum speedups in the number of iterations, given the 𝒪(n)𝒪𝑛\mathcal{O}(n)caligraphic_O ( italic_n ) upper bound in the FW algorithm, the quantum algorithm based on FW can potentially have a higher dependency on n𝑛nitalic_n than the classical PR dynamics.

Lastly, we ask whether random sampling of buyers or goods can provide further speedups. Classical results in first-order updates for randomly sampled buyers and goods [85] indicate that the number of iterations would increase multiplicatively by m𝑚mitalic_m and n𝑛nitalic_n, respectively, such that the total runtime cost of the algorithms remain at 𝒪~(mn)~𝒪𝑚𝑛\tilde{\mathcal{O}}(mn)over~ start_ARG caligraphic_O end_ARG ( italic_m italic_n ). Assuming no quantum speedups on the number of iterations, further quantum speedups by incorporating sampling may be difficult.

Acknowledgments and Disclosure of Funding

The authors thank Gregory Kang Ruey Lau for discussions. This work is supported by the National Research Foundation, Singapore, and A*STAR under its CQT Bridging Grant and its Quantum Engineering Programme under grant NRF2021-QEP2-02-P05.

References

Appendix A Quantum subroutines

In this section, we show prior results that obtain 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norms and inner products with quadratic speedups with amplitude estimation.

Lemma A.1 (Quantum state preparation and norm estimation; Lemma 5, [28]).

Let n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N. We are given quantum query access to non-zero vector w𝕀n𝑤superscript𝕀𝑛w\in\mathbb{I}^{n}italic_w ∈ blackboard_I start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, with maxjwj=1subscript𝑗subscript𝑤𝑗1\max_{j}w_{j}=1roman_max start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1.

  1. 1.

    There exists a quantum circuit that prepares the state 1nj=1n|𝒋(wj|0+1wj|1)1𝑛superscriptsubscript𝑗1𝑛ket𝒋subscript𝑤𝑗ket01subscript𝑤𝑗ket1\frac{1}{\sqrt{n}}\sum_{j=1}^{n}\boldsymbol{\ket{j}}\left(\sqrt{w_{j}}\ket{0}+% \sqrt{1-w_{j}}\ket{1}\right)divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT bold_| start_ARG bold_italic_j end_ARG bold_⟩ ( square-root start_ARG italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 0 end_ARG ⟩ + square-root start_ARG 1 - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 1 end_ARG ⟩ ) with two queries and 𝒪(logn)𝒪𝑛\mathcal{O}\left(\log n\right)caligraphic_O ( roman_log italic_n ) gates.

  2. 2.

    Let ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0 and δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ). There exists a quantum algorithm that provides an estimate ΓwsubscriptΓ𝑤\Gamma_{w}roman_Γ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT of the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm w1subscriptnorm𝑤1\|w\|_{1}∥ italic_w ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT such that |w1Γw|ϵw1subscriptnorm𝑤1subscriptΓ𝑤italic-ϵsubscriptnorm𝑤1\left|\|w\|_{1}-\Gamma_{w}\right|\leq\epsilon\|w\|_{1}| ∥ italic_w ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - roman_Γ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT | ≤ italic_ϵ ∥ italic_w ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, with probability at least 1δ1𝛿1-\delta1 - italic_δ. The algorithm requires 𝒪(nϵlog1δ)𝒪𝑛italic-ϵ1𝛿\mathcal{O}\left(\frac{\sqrt{n}}{\epsilon}\log\frac{1}{\delta}\right)caligraphic_O ( divide start_ARG square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_ϵ end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ) queries and 𝒪~(nϵlog1δ)~𝒪𝑛italic-ϵ1𝛿\tilde{\mathcal{O}}\left(\frac{\sqrt{n}}{\epsilon}\log\frac{1}{\delta}\right)over~ start_ARG caligraphic_O end_ARG ( divide start_ARG square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_ϵ end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG )quantum gates.

Proof.

We reiterate the proof of Lemma 5 in [28] for the convenience of the reader.

  1. 1.

    First, using 𝒪(logn)𝒪𝑛\mathcal{O}(\log n)caligraphic_O ( roman_log italic_n ) Hadamard gates, prepare the state 1nj=1n|𝒋1𝑛superscriptsubscript𝑗1𝑛ket𝒋\frac{1}{\sqrt{n}}\sum_{j=1}^{n}\boldsymbol{\ket{j}}divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT bold_| start_ARG bold_italic_j end_ARG bold_⟩. Then, by quantum query access to w𝑤witalic_w, obtain 1nj=1n|𝒋|wj1𝑛superscriptsubscript𝑗1𝑛ket𝒋ketsubscript𝑤𝑗\frac{1}{\sqrt{n}}\sum_{j=1}^{n}\boldsymbol{\ket{j}}\ket{w_{j}}divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT bold_| start_ARG bold_italic_j end_ARG bold_⟩ | start_ARG italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ⟩. By controlled rotation gates, we can then obtain 1nj=1n|𝒋|wj(wj|0+1wj|1)1𝑛superscriptsubscript𝑗1𝑛ket𝒋ketsubscript𝑤𝑗subscript𝑤𝑗ket01subscript𝑤𝑗ket1\frac{1}{\sqrt{n}}\sum_{j=1}^{n}\boldsymbol{\ket{j}}\ket{w_{j}}(\sqrt{w_{j}}% \ket{0}+\sqrt{1-w_{j}}\ket{1})divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT bold_| start_ARG bold_italic_j end_ARG bold_⟩ | start_ARG italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ⟩ ( square-root start_ARG italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 0 end_ARG ⟩ + square-root start_ARG 1 - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 1 end_ARG ⟩ ). By another quantum query access to w𝑤witalic_w, we can uncompute the intermediate registers and obtain 1nj=1n|𝒋(wj|0+1wj|1)1𝑛superscriptsubscript𝑗1𝑛ket𝒋subscript𝑤𝑗ket01subscript𝑤𝑗ket1\frac{1}{\sqrt{n}}\sum_{j=1}^{n}\boldsymbol{\ket{j}}(\sqrt{w_{j}}\ket{0}+\sqrt% {1-w_{j}}\ket{1})divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT bold_| start_ARG bold_italic_j end_ARG bold_⟩ ( square-root start_ARG italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 0 end_ARG ⟩ + square-root start_ARG 1 - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 1 end_ARG ⟩ ).

  2. 2.

    First observe that that with projector P=In|00|P=I_{n}\otimes\lvert 0\rangle\langle 0\rvertitalic_P = italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊗ | 0 ⟩ ⟨ 0 | and |ψ=1nj=1n|𝒋(wj|0+1wj|1)ket𝜓1𝑛superscriptsubscript𝑗1𝑛ket𝒋subscript𝑤𝑗ket01subscript𝑤𝑗ket1\ket{\psi}=\frac{1}{\sqrt{n}}\sum_{j=1}^{n}\boldsymbol{\ket{j}}(\sqrt{w_{j}}% \ket{0}+\sqrt{1-w_{j}}\ket{1})| start_ARG italic_ψ end_ARG ⟩ = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_n end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT bold_| start_ARG bold_italic_j end_ARG bold_⟩ ( square-root start_ARG italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 0 end_ARG ⟩ + square-root start_ARG 1 - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG | start_ARG 1 end_ARG ⟩ ), one can obtain a=P|ψ22=w1n𝑎superscriptsubscriptnorm𝑃ket𝜓22subscriptnorm𝑤1𝑛a=\|P\ket{\psi}\|_{2}^{2}=\frac{\|w\|_{1}}{n}italic_a = ∥ italic_P | start_ARG italic_ψ end_ARG ⟩ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = divide start_ARG ∥ italic_w ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG. Setting M6πϵN𝑀6𝜋italic-ϵ𝑁M\geq\frac{6\pi}{\epsilon}\sqrt{N}italic_M ≥ divide start_ARG 6 italic_π end_ARG start_ARG italic_ϵ end_ARG square-root start_ARG italic_N end_ARG, we obtain an estimate

    |a~esta|2πa(1a)M+π2M2ϵ6N(2a+ϵ12)3aϵ6Nw1ϵ2Na2ϵsubscript~𝑎est𝑎2𝜋𝑎1𝑎𝑀superscript𝜋2superscript𝑀2italic-ϵ6𝑁2𝑎italic-ϵ123𝑎italic-ϵ6𝑁subscriptnorm𝑤1italic-ϵ2𝑁𝑎2italic-ϵ|\tilde{a}_{\mathrm{est}}-a|\leq 2\pi\frac{\sqrt{a(1-a)}}{M}+\frac{\pi^{2}}{M^% {2}}\leq\frac{\epsilon}{6\sqrt{N}}\left(2\sqrt{a}+\frac{\epsilon}{12}\right)% \leq\frac{3\sqrt{a}\epsilon}{6\sqrt{N}}\leq\frac{\sqrt{\|w\|_{1}}\cdot\epsilon% }{2N}\leq\frac{a}{2}\cdot\epsilon| over~ start_ARG italic_a end_ARG start_POSTSUBSCRIPT roman_est end_POSTSUBSCRIPT - italic_a | ≤ 2 italic_π divide start_ARG square-root start_ARG italic_a ( 1 - italic_a ) end_ARG end_ARG start_ARG italic_M end_ARG + divide start_ARG italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ≤ divide start_ARG italic_ϵ end_ARG start_ARG 6 square-root start_ARG italic_N end_ARG end_ARG ( 2 square-root start_ARG italic_a end_ARG + divide start_ARG italic_ϵ end_ARG start_ARG 12 end_ARG ) ≤ divide start_ARG 3 square-root start_ARG italic_a end_ARG italic_ϵ end_ARG start_ARG 6 square-root start_ARG italic_N end_ARG end_ARG ≤ divide start_ARG square-root start_ARG ∥ italic_w ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ⋅ italic_ϵ end_ARG start_ARG 2 italic_N end_ARG ≤ divide start_ARG italic_a end_ARG start_ARG 2 end_ARG ⋅ italic_ϵ (A.1)

    with probability at least 8π28superscript𝜋2\frac{8}{\pi^{2}}divide start_ARG 8 end_ARG start_ARG italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. Using the powering lemma [86], we can boost the success probability to 1δ1𝛿1-\delta1 - italic_δ by taking the median of O(log1/δ)𝑂1𝛿O(\log 1/\delta)italic_O ( roman_log 1 / italic_δ ) runs of the QAE algorithm.

Remark A.1.1.

Note that Lemma A.1 has the requirement that maxjwj=1subscript𝑗subscript𝑤𝑗1\max_{j}w_{j}=1roman_max start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1. For cases where this is not the case, we can use a maximum finding algorithm to divide all entries by the largest value. Such can be achieved by the following quantum minimum/maximum finding algorithm in 𝒪(n)𝒪𝑛\mathcal{O}(\sqrt{n})caligraphic_O ( square-root start_ARG italic_n end_ARG ) runtime, which we introduce below. Recall that division takes 𝒪(1)𝒪1\mathcal{O}(1)caligraphic_O ( 1 ) runtime with quantum arithmetic circuits.

Lemma A.2 (Quantum minimum finding; Theorem 1, [87]).

Let n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N. Given quantum query access to non-zero vector w𝕀n𝑤superscript𝕀𝑛w\in\mathbb{I}^{n}italic_w ∈ blackboard_I start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, we can find the minimum wmin=minj[n]wjsubscript𝑤subscript𝑗delimited-[]𝑛subscript𝑤𝑗w_{\min}=\min_{j\in[n]}w_{j}italic_w start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT italic_j ∈ [ italic_n ] end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT with success probability 1δ1𝛿1-\delta1 - italic_δ with 𝒪(nlog1δ)𝒪𝑛1𝛿\mathcal{O}\left(\sqrt{n}\log\frac{1}{\delta}\right)caligraphic_O ( square-root start_ARG italic_n end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ) queries and 𝒪~(nlog1δ)~𝒪𝑛1𝛿\tilde{\mathcal{O}}\left(\sqrt{n}\log\frac{1}{\delta}\right)over~ start_ARG caligraphic_O end_ARG ( square-root start_ARG italic_n end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG )quantum gates.

Corollary A.2.1 (Quantum maximum finding).

Let n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N. Given quantum query access to non-zero vector w𝕀n𝑤superscript𝕀𝑛w\in\mathbb{I}^{n}italic_w ∈ blackboard_I start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, we can find the maximum wmax=minj[n]wjsubscript𝑤subscript𝑗delimited-[]𝑛subscript𝑤𝑗w_{\max}=\min_{j\in[n]}w_{j}italic_w start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT italic_j ∈ [ italic_n ] end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT with success probability 1δ1𝛿1-\delta1 - italic_δ with 𝒪(nlog1δ)𝒪𝑛1𝛿\mathcal{O}\left(\sqrt{n}\log\frac{1}{\delta}\right)caligraphic_O ( square-root start_ARG italic_n end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ) queries and 𝒪~(nlog1δ)~𝒪𝑛1𝛿\tilde{\mathcal{O}}\left(\sqrt{n}\log\frac{1}{\delta}\right)over~ start_ARG caligraphic_O end_ARG ( square-root start_ARG italic_n end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG )quantum gates.

Below we present a quantum inner product estimation algorithm simplified from Lemma 6 of [28].

Lemma A.3 (Quantum inner product estimation with relative accuracy).

Let n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N, ϵ<0italic-ϵ0\epsilon<0italic_ϵ < 0 and δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ). We are given quantum query access to two vectors u,v𝕀n𝑢𝑣superscript𝕀𝑛u,v\in\mathbb{I}^{n}italic_u , italic_v ∈ blackboard_I start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. An estimate Γu,vsubscriptΓ𝑢𝑣\Gamma_{u,v}roman_Γ start_POSTSUBSCRIPT italic_u , italic_v end_POSTSUBSCRIPT for the inner product can be provided such that |Γu,vuv|ϵuvsubscriptΓ𝑢𝑣𝑢𝑣italic-ϵ𝑢𝑣|\Gamma_{u,v}-u\cdot v|\leq\epsilon\ u\cdot v| roman_Γ start_POSTSUBSCRIPT italic_u , italic_v end_POSTSUBSCRIPT - italic_u ⋅ italic_v | ≤ italic_ϵ italic_u ⋅ italic_v with success probability 1δ1𝛿1-\delta1 - italic_δ. This estimate is obtained with 𝒪(nϵlog1δ)𝒪𝑛italic-ϵ1𝛿\mathcal{O}\left(\frac{\sqrt{n}}{\epsilon}\log\frac{1}{\delta}\right)caligraphic_O ( divide start_ARG square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_ϵ end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ) queries and 𝒪~(nϵlog1δ)~𝒪𝑛italic-ϵ1𝛿\tilde{\mathcal{O}}\left(\frac{\sqrt{n}}{\epsilon}\log\frac{1}{\delta}\right)over~ start_ARG caligraphic_O end_ARG ( divide start_ARG square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_ϵ end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG )quantum gates.

Proof.

Using quantum arithmetic circuits, we can obtain zj=ujvjsubscript𝑧𝑗subscript𝑢𝑗subscript𝑣𝑗z_{j}=u_{j}v_{j}italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, i.e., z=uv𝑧direct-product𝑢𝑣z=u\odot vitalic_z = italic_u ⊙ italic_v, by the following:

|𝒋|𝒋|uj|vj|𝒋|zj|vj|𝒋|zj|0¯ket𝒋ket𝒋ketsubscript𝑢𝑗ketsubscript𝑣𝑗ket𝒋ketsubscript𝑧𝑗ketsubscript𝑣𝑗ket𝒋ketsubscript𝑧𝑗ket¯0\boldsymbol{\ket{j}}\to\boldsymbol{\ket{j}}\ket{u_{j}}\ket{v_{j}}\to% \boldsymbol{\ket{j}}\ket{z_{j}}\ket{v_{j}}\to\boldsymbol{\ket{j}}\ket{z_{j}}% \ket{\bar{0}}bold_| start_ARG bold_italic_j end_ARG bold_⟩ → bold_| start_ARG bold_italic_j end_ARG bold_⟩ | start_ARG italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ⟩ | start_ARG italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ⟩ → bold_| start_ARG bold_italic_j end_ARG bold_⟩ | start_ARG italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ⟩ | start_ARG italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ⟩ → bold_| start_ARG bold_italic_j end_ARG bold_⟩ | start_ARG italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ⟩ | start_ARG over¯ start_ARG 0 end_ARG end_ARG ⟩ (A.2)

Using quantum maximum finding in Corollary A.2.1 to find zmaxsubscript𝑧z_{\max}italic_z start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT up to probability 1δ/21𝛿21-\delta/21 - italic_δ / 2, we can then obtain zj/zmaxsubscript𝑧𝑗subscript𝑧z_{j}/z_{\max}italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT / italic_z start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT. Lastly, using Lemma A.1, we can obtain Γu,v=ΓzsubscriptΓ𝑢𝑣subscriptΓ𝑧\Gamma_{u,v}=\Gamma_{z}roman_Γ start_POSTSUBSCRIPT italic_u , italic_v end_POSTSUBSCRIPT = roman_Γ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT such that |Γu,vuv|=|Γzz1|ϵuvsubscriptΓ𝑢𝑣𝑢𝑣subscriptΓ𝑧subscriptnorm𝑧1italic-ϵ𝑢𝑣|\Gamma_{u,v}-u\cdot v|=|\Gamma_{z}-\|z\|_{1}|\leq\epsilon u\cdot v| roman_Γ start_POSTSUBSCRIPT italic_u , italic_v end_POSTSUBSCRIPT - italic_u ⋅ italic_v | = | roman_Γ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT - ∥ italic_z ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ≤ italic_ϵ italic_u ⋅ italic_v up to probability 1δ/21𝛿21-\delta/21 - italic_δ / 2. Using a union bound [57], we find the total success probability of the entire process is 1δ1𝛿1-\delta1 - italic_δ. ∎

Appendix B Convergence guarantees for the PR dynamics

We show the convergence guarantee of the proportional response (PR) dynamics in regards to the Eisenberg-Gale convex program by Zhang [25] and improved upon by Birnbaum et al. [53], and the convergence in regards to the Shmyrev convex program, first shown also by Birnbaum et al. [53]. Recall that the negative target function from the Eisenberg-Gale convex program is

Φ(b)=i[n]Bilogui,Φ𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑢𝑖\Phi(b)=-\sum_{i\in[n]}B_{i}\log u_{i},roman_Φ ( italic_b ) = - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , (B.1)

and the negative target function from the Shmyrev convex program is

Ψ(b)=i[n],j[m]bijlogvijpj.Ψ𝑏subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚subscript𝑏𝑖𝑗subscript𝑣𝑖𝑗subscript𝑝𝑗\Psi(b)=-\sum_{i\in[n],j\in[m]}b_{ij}\log\frac{v_{ij}}{p_{j}}.roman_Ψ ( italic_b ) = - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG . (B.2)

We first set up the following convex set:

={bn×m(+):jbij=Bi}conditional-set𝑏subscript𝑛𝑚subscriptsubscript𝑗subscript𝑏𝑖𝑗subscript𝐵𝑖\mathcal{B}=\left\{b\in\mathcal{M}_{n\times m}(\mathbb{R}_{+}):\sum_{j}b_{ij}=% B_{i}\right\}caligraphic_B = { italic_b ∈ caligraphic_M start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT ( blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ) : ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } (B.3)

To show convergence of the PR dynamics, we first need the following inequalities:

Lemma B.1.

Let b=argminbΦ(b)superscript𝑏subscriptargmin𝑏Φ𝑏b^{*}=\operatorname{arg\,min}_{b\in\mathcal{B}}\Phi(b)italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = start_OPFUNCTION roman_arg roman_min end_OPFUNCTION start_POSTSUBSCRIPT italic_b ∈ caligraphic_B end_POSTSUBSCRIPT roman_Φ ( italic_b ). Then Φ(b)=Ψ(b)i[n]BilogBiΦsuperscript𝑏Ψsuperscript𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\Phi(b^{*})=\Psi(b^{*})-\sum_{i\in[n]}B_{i}\log B_{i}roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = roman_Ψ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

Proof.

By KKT optimality constraints of the Eisenberg-Gale convex program, we see that

Biui=pjvij,i,j,subscript𝐵𝑖superscriptsubscript𝑢𝑖superscriptsubscript𝑝𝑗subscript𝑣𝑖𝑗for-all𝑖𝑗\frac{B_{i}}{u_{i}^{*}}=\frac{p_{j}^{*}}{v_{ij}},\quad\forall i,j,divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG = divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG , ∀ italic_i , italic_j , (B.4)

which we use to show that Φ(b)=Ψ(b)i[n]BilogBiΦsuperscript𝑏Ψsuperscript𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\Phi(b^{*})=\Psi(b^{*})-\sum_{i\in[n]}B_{i}\log B_{i}roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = roman_Ψ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

Φ(b)Φsuperscript𝑏\displaystyle\Phi(b^{*})roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) =i[n]Bilogui=i[n],j[m]bijloguiabsentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript𝑢𝑖subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑢𝑖\displaystyle=-\sum_{i\in[n]}B_{i}\log u_{i}^{*}=-\sum_{i\in[n],j\in[m]}b_{ij}% ^{*}\log u_{i}^{*}= - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT (B.5)
=i[n],j[m]bijlogBivijpj=i[n],j[m]bijlogvijpji[n]BilogBiabsentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=-\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log B_{i}\frac{v_{ij}}{p_{j}^{% *}}=-\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{v_{ij}}{p_{j}^{*}}-\sum_{i\in[n% ]}B_{i}\log B_{i}= - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG = - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.6)
=Ψ(b)i[n]BilogBiabsentΨsuperscript𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=\Psi(b^{*})-\sum_{i\in[n]}B_{i}\log B_{i}= roman_Ψ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.7)

Lemma B.2 (Lemma 19, [53]).

b,Φ(b)Ψ(b)i[n]BilogBi.formulae-sequencefor-all𝑏Φ𝑏Ψ𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\forall b\in\mathcal{B},\Phi(b)\leq\Psi(b)-\sum_{i\in[n]}B_{i}\log B_{i}.∀ italic_b ∈ caligraphic_B , roman_Φ ( italic_b ) ≤ roman_Ψ ( italic_b ) - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT .

Proof.

We reiterate the proof of Lemma 19 in [53] for the convenience of the reader. By convexity of log-\log- roman_log, we see

Φ(b)Φ𝑏\displaystyle\Phi(b)roman_Φ ( italic_b ) =i[n]Bilogui=i[n]Bilogj[m]bijpjvijabsentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑢𝑖subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑗delimited-[]𝑚subscript𝑏𝑖𝑗subscript𝑝𝑗subscript𝑣𝑖𝑗\displaystyle=-\sum_{i\in[n]}B_{i}\log u_{i}=-\sum_{i\in[n]}B_{i}\log\sum_{j% \in[m]}\frac{b_{ij}}{p_{j}}v_{ij}= - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT (B.8)
=i[n]Bilogj[m]bijBivijpji[n]BilogBiabsentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑗delimited-[]𝑚subscript𝑏𝑖𝑗subscript𝐵𝑖subscript𝑣𝑖𝑗subscript𝑝𝑗subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=-\sum_{i\in[n]}B_{i}\log\sum_{j\in[m]}\frac{b_{ij}}{B_{i}}\frac{% v_{ij}}{p_{j}}-\sum_{i\in[n]}B_{i}\log B_{i}= - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.9)
i[n],j[m]bijBiBilogvijpji[n]BilogBiabsentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚subscript𝑏𝑖𝑗subscript𝐵𝑖subscript𝐵𝑖subscript𝑣𝑖𝑗subscript𝑝𝑗subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle\leq-\sum_{i\in[n],j\in[m]}\frac{b_{ij}}{B_{i}}B_{i}\log\frac{v_{% ij}}{p_{j}}-\sum_{i\in[n]}B_{i}\log B_{i}≤ - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.10)
=Ψ(b)i[n]BilogBiabsentΨ𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=\Psi(b)-\sum_{i\in[n]}B_{i}\log B_{i}= roman_Ψ ( italic_b ) - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.11)

Lemma B.3.

Let bij=Bibijvij/pj(b)ui(b)superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖subscript𝑏𝑖𝑗subscript𝑣𝑖𝑗subscript𝑝𝑗𝑏subscript𝑢𝑖𝑏\displaystyle b_{ij}^{\prime}=B_{i}\frac{b_{ij}v_{ij}/p_{j}(b)}{u_{i}(b)}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_b ) end_ARG start_ARG italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_b ) end_ARG. Then b,Ψ(b)Φ(b)+i[n]BilogBi.formulae-sequencefor-all𝑏Ψsuperscript𝑏Φ𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\forall b\in\mathcal{B},\Psi(b^{\prime})\leq\Phi(b)+\sum_{i\in[n]}B_{i}\log B_% {i}.∀ italic_b ∈ caligraphic_B , roman_Ψ ( italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ roman_Φ ( italic_b ) + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT .

Proof.

Let pj=ibijsuperscriptsubscript𝑝𝑗subscript𝑖superscriptsubscript𝑏𝑖𝑗p_{j}^{\prime}=\sum_{i}b_{ij}^{\prime}italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. By concavity of log\logroman_log:

Ψ(b)Ψsuperscript𝑏\displaystyle\Psi(b^{\prime})roman_Ψ ( italic_b start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) =i[n],j[m]bijlogpjvij=i[n],j[m]Bibijvij/pjuilogpj(t+1)vijabsentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑝𝑗subscript𝑣𝑖𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚subscript𝐵𝑖subscript𝑏𝑖𝑗subscript𝑣𝑖𝑗subscript𝑝𝑗subscript𝑢𝑖superscriptsubscript𝑝𝑗𝑡1subscript𝑣𝑖𝑗\displaystyle=\sum_{i\in[n],j\in[m]}b_{ij}^{\prime}\log\frac{p_{j}^{\prime}}{v% _{ij}}=\sum_{i\in[n],j\in[m]}B_{i}\frac{b_{ij}v_{ij}/p_{j}}{u_{i}}\log\frac{p_% {j}^{(t+1)}}{v_{ij}}= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG (B.12)
i[n]Bilogj[m]bijvij/pjuipjvij=i[n]Bilog(1uij[m]bijpjpj)absentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑗delimited-[]𝑚subscript𝑏𝑖𝑗subscript𝑣𝑖𝑗subscript𝑝𝑗subscript𝑢𝑖superscriptsubscript𝑝𝑗subscript𝑣𝑖𝑗subscript𝑖delimited-[]𝑛subscript𝐵𝑖1subscript𝑢𝑖subscript𝑗delimited-[]𝑚subscript𝑏𝑖𝑗superscriptsubscript𝑝𝑗subscript𝑝𝑗\displaystyle\leq\sum_{i\in[n]}B_{i}\log\sum_{j\in[m]}\frac{b_{ij}v_{ij}/p_{j}% }{u_{i}}\frac{p_{j}^{\prime}}{v_{ij}}=\sum_{i\in[n]}B_{i}\log\left(\frac{1}{u_% {i}}\sum_{j\in[m]}\frac{b_{ij}p_{j}^{\prime}}{p_{j}}\right)≤ ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ( divide start_ARG 1 end_ARG start_ARG italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) (B.13)
=i[n]Bilogj[m]bijpjBipji[n]Bilogui+i[n]BilogBiabsentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑗delimited-[]𝑚subscript𝑏𝑖𝑗superscriptsubscript𝑝𝑗subscript𝐵𝑖subscript𝑝𝑗subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑢𝑖subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=\sum_{i\in[n]}B_{i}\log\sum_{j\in[m]}\frac{b_{ij}p_{j}^{\prime}}% {B_{i}p_{j}}-\sum_{i\in[n]}B_{i}\log u_{i}+\sum_{i\in[n]}B_{i}\log B_{i}= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.14)
logi[n],j[m]bijpjpj(t)i[n]Bilogui+i[n]BilogBiabsentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚subscript𝑏𝑖𝑗superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑢𝑖subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle\leq\log\sum_{i\in[n],j\in[m]}\frac{b_{ij}p_{j}^{\prime}}{p_{j}^{% (t)}}-\sum_{i\in[n]}B_{i}\log u_{i}+\sum_{i\in[n]}B_{i}\log B_{i}≤ roman_log ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.15)
=logj[m]pji[n]Bilogui+i[n]BilogBiabsentsubscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑢𝑖subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=\log\sum_{j\in[m]}p_{j}^{\prime}-\sum_{i\in[n]}B_{i}\log u_{i}+% \sum_{i\in[n]}B_{i}\log B_{i}= roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.16)
=Φ(b)+i[n]BilogBi.absentΦ𝑏subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=\Phi(b)+\sum_{i\in[n]}B_{i}\log B_{i}.= roman_Φ ( italic_b ) + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT . (B.17)

From the above two lemmas, we gain the monotonically decreasing properties of iteratively updating b𝑏bitalic_b via the PR dynamics on the negative target functions of the Eisenberg-Gale and Shmyrev convex programs:

Lemma B.4.

t0,Φ(b(t+1))Φ(b(t))formulae-sequencefor-all𝑡0Φsuperscript𝑏𝑡1Φsuperscript𝑏𝑡\forall t\geq 0,\Phi(b^{(t+1)})\leq\Phi(b^{(t)})∀ italic_t ≥ 0 , roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) ≤ roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ).

Proof.

Apply Lemma B.2 and Lemma B.3 consequently. ∎

Corollary B.4.1 (Lemma 5, [53]).

t0,Ψ(b(t+1))Ψ(b(t))formulae-sequencefor-all𝑡0Ψsuperscript𝑏𝑡1Ψsuperscript𝑏𝑡\forall t\geq 0,\Psi(b^{(t+1)})\leq\Psi(b^{(t)})∀ italic_t ≥ 0 , roman_Ψ ( italic_b start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) ≤ roman_Ψ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ).

We now use the following lemmas to construct an end-to-end proof of the convergence of the PR dynamics. In a slight abuse of notation, we adapt the definition of KL divergence to matrices such that for u,v(+)p×q𝑢𝑣subscriptsubscript𝑝𝑞u,v\in\mathcal{M}(\mathbb{R}_{+})_{p\times q}italic_u , italic_v ∈ caligraphic_M ( blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_p × italic_q end_POSTSUBSCRIPT, let D(uv):=i[p],j[q]uijloguijvijassign𝐷conditional𝑢𝑣subscriptformulae-sequence𝑖delimited-[]𝑝𝑗delimited-[]𝑞subscript𝑢𝑖𝑗subscript𝑢𝑖𝑗subscript𝑣𝑖𝑗D(u\|v):=\sum_{i\in[p],j\in[q]}u_{ij}\log\frac{u_{ij}}{v_{ij}}italic_D ( italic_u ∥ italic_v ) := ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_p ] , italic_j ∈ [ italic_q ] end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT roman_log divide start_ARG italic_u start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG. The following can then be shown:

Lemma B.5.

t0,t=0T(Φ(b(t))Φ(b))D(bb(0))formulae-sequencefor-all𝑡0superscriptsubscript𝑡0𝑇Φsuperscript𝑏𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏0\forall t\geq 0,\sum_{t=0}^{T}\left(\Phi(b^{(t)})-\Phi(b^{*})\right)\leq D(b^{% *}\|b^{(0)})∀ italic_t ≥ 0 , ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ).

Proof.

Similar by the proof of Theorem 3 of [25], we first lower bound Δt=D(bb(t))D(bb(t+1))subscriptΔ𝑡𝐷conditionalsuperscript𝑏superscript𝑏𝑡𝐷conditionalsuperscript𝑏superscript𝑏𝑡1\Delta_{t}=D(b^{*}\|b^{(t)})-D(b^{*}\|b^{(t+1)})roman_Δ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) as follows:

ΔtsubscriptΔ𝑡\displaystyle\Delta_{t}roman_Δ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =D(bb(t))D(bb(t+1))absent𝐷conditionalsuperscript𝑏superscript𝑏𝑡𝐷conditionalsuperscript𝑏superscript𝑏𝑡1\displaystyle=D(b^{*}\|b^{(t)})-D(b^{*}\|b^{(t+1)})= italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) (B.18)
=i[n],j[m]bijlogbij(t+1)bij(t)=i[n],j[m]bijlogBivijpj(t)ui(t)absentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscriptsuperscript𝑏𝑡1𝑖𝑗subscriptsuperscript𝑏𝑡𝑖𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗𝑡superscriptsubscript𝑢𝑖𝑡\displaystyle=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b^{(t+1)}_{ij}}{b^{(t)% }_{ij}}=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{B_{i}v_{ij}}{p_{j}^{(t)}u_{i% }^{(t)}}= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG (B.19)
=i[n],j[m](bijlogvijpj+bijlogpjpj(t)bijlogui(t)+bijlogBi)absentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗𝑡superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑢𝑖𝑡superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖\displaystyle=\sum_{i\in[n],j\in[m]}\left(b_{ij}^{*}\log\frac{v_{ij}}{p_{j}^{*% }}+b_{ij}^{*}\log\frac{p_{j}^{*}}{p_{j}^{(t)}}-b_{ij}^{*}\log u_{i}^{(t)}+b_{% ij}^{*}\log B_{i}\right)= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT ( italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG + italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT + italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (B.20)
=i[n],j[m]bijlogvijpj+j[m]pjlogpjpj(t)i[n]Bilogui(t)+i[n]BilogBiabsentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗subscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript𝑢𝑖𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{v_{ij}}{p_{j}^{*}}+% \sum_{j\in[m]}p_{j}^{*}\log\frac{p_{j}^{*}}{p_{j}^{(t)}}-\sum_{i\in[n]}B_{i}% \log u_{i}^{(t)}+\sum_{i\in[n]}B_{i}\log B_{i}= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.21)
=Ψ(b)+D(pjpj(t))+Φ(b(t))+i[n]BilogBiabsentΨsuperscript𝑏𝐷conditionalsuperscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗𝑡Φsuperscript𝑏𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=-\Psi(b^{*})+D(p_{j}^{*}\|p_{j}^{(t)})+\Phi(b^{(t)})+\sum_{i\in[% n]}B_{i}\log B_{i}= - roman_Ψ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) + italic_D ( italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) + roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (B.22)
=D(pjpj(t))+Φ(b(t))Φ(b)absent𝐷conditionalsuperscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗𝑡Φsuperscript𝑏𝑡Φsuperscript𝑏\displaystyle=D(p_{j}^{*}\|p_{j}^{(t)})+\Phi(b^{(t)})-\Phi(b^{*})= italic_D ( italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) + roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) (B.23)
Φ(b(t))Φ(b)absentΦsuperscript𝑏𝑡Φsuperscript𝑏\displaystyle\geq\Phi(b^{(t)})-\Phi(b^{*})≥ roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) (B.24)

where the second-to-last equality is by Lemma B.1 and the inequality is by the positivity of KL divergence. Taking the telescoping sum of ΔtsubscriptΔ𝑡\Delta_{t}roman_Δ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, we see that

t=0TΔt=t=0TD(bb(t))D(bb(t+1))=D(bb(0))D(bb(T+1))t=0T(Φ(b(t))Φ(b))superscriptsubscript𝑡0𝑇subscriptΔ𝑡superscriptsubscript𝑡0𝑇𝐷conditionalsuperscript𝑏superscript𝑏𝑡𝐷conditionalsuperscript𝑏superscript𝑏𝑡1𝐷conditionalsuperscript𝑏superscript𝑏0𝐷conditionalsuperscript𝑏superscript𝑏𝑇1superscriptsubscript𝑡0𝑇Φsuperscript𝑏𝑡Φsuperscript𝑏\sum_{t=0}^{T}\Delta_{t}=\sum_{t=0}^{T}D(b^{*}\|b^{(t)})-D(b^{*}\|b^{(t+1)})=D% (b^{*}\|b^{(0)})-D(b^{*}\|b^{(T+1)})\geq\sum_{t=0}^{T}(\Phi(b^{(t)})-\Phi(b^{*% }))∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Δ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) = italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) - italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( italic_T + 1 ) end_POSTSUPERSCRIPT ) ≥ ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) (B.25)

Hence, we obtain t=0T(Φ(b(t))Φ(b))D(bb(0))superscriptsubscript𝑡0𝑇Φsuperscript𝑏𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏0\sum_{t=0}^{T}\left(\Phi(b^{(t)})-\Phi(b^{*})\right)\leq D(b^{*}\|b^{(0)})∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ). ∎

Proposition B.6.

t0,Φ(b(T1))Φ(b)D(b(0)b)Tformulae-sequencefor-all𝑡0Φsuperscript𝑏𝑇1Φsuperscript𝑏𝐷conditionalsuperscript𝑏0superscript𝑏𝑇\forall t\geq 0,\Phi(b^{(T-1)})-\Phi(b^{*})\leq\frac{D(b^{(0)}\|b^{*})}{T}∀ italic_t ≥ 0 , roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_T - 1 ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG italic_D ( italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T end_ARG.

Proof.

Combining Lemma B.4 and Lemma B.5, we can write

Φ(b(T1))Φ(b)1Tt=0T1Φ(b(t))Φ(b)D(bb(0))T.Φsuperscript𝑏𝑇1Φsuperscript𝑏1𝑇superscriptsubscript𝑡0𝑇1Φsuperscript𝑏𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏0𝑇\Phi(b^{(T-1)})-\Phi(b^{*})\leq\frac{1}{T}\sum_{t=0}^{T-1}\Phi(b^{(t)})-\Phi(b% ^{*})\leq\frac{D(b^{*}\|b^{(0)})}{T}.roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_T - 1 ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT roman_Φ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T end_ARG . (B.26)

Corollary B.6.1 (Lemma 3, [53]).

t0,Ψ(b(t))Ψ(b)D(b(0)b)Tformulae-sequencefor-all𝑡0Ψsuperscript𝑏𝑡Ψsuperscript𝑏𝐷conditionalsuperscript𝑏0superscript𝑏𝑇\forall t\geq 0,\Psi(b^{(t)})-\Psi(b^{*})\leq\frac{D(b^{(0)}\|b^{*})}{T}∀ italic_t ≥ 0 , roman_Ψ ( italic_b start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Ψ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG italic_D ( italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T end_ARG.

Proof.

Lastly, we can upper bound the value D(bb(0))𝐷conditionalsuperscript𝑏superscript𝑏0D(b^{*}\|b^{(0)})italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) in terms of dimensions m𝑚mitalic_m and n𝑛nitalic_n given that each buyer initially divides the budget equally between all items such that bij(0)=Bimsubscriptsuperscript𝑏0𝑖𝑗subscript𝐵𝑖𝑚b^{(0)}_{ij}=\frac{B_{i}}{m}italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG.

Lemma B.7 (Lemma 13, [53]; Theorem 7, [26]).

If bij(0)=Bimsuperscriptsubscript𝑏𝑖𝑗0subscript𝐵𝑖𝑚b_{ij}^{(0)}=\frac{B_{i}}{m}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG for all i𝑖iitalic_i and j𝑗jitalic_j, then D(bb(0))logm𝐷conditionalsuperscript𝑏superscript𝑏0𝑚D(b^{*}\|b^{(0)})\leq\log mitalic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) ≤ roman_log italic_m.

Proof.

Evaluating D(bb(0))𝐷conditionalsuperscript𝑏superscript𝑏0D(b^{*}\|b^{(0)})italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ), we have

D(bb(0))=ijbijlogbijbij(0)=ijbijlogmbijBi=logm+ijbijlogbijBilogm𝐷conditionalsuperscript𝑏superscript𝑏0subscript𝑖𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗0subscript𝑖𝑗superscriptsubscript𝑏𝑖𝑗𝑚superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖𝑚subscript𝑖𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖𝑚D(b^{*}\|b^{(0)})=\sum_{ij}b_{ij}^{*}\log\frac{b_{ij}^{*}}{b_{ij}^{(0)}}=\sum_% {ij}b_{ij}^{*}\log\frac{mb_{ij}^{*}}{B_{i}}=\log m+\sum_{ij}b_{ij}^{*}\log% \frac{b_{ij}^{*}}{B_{i}}\leq\log mitalic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_m italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG = roman_log italic_m + ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ≤ roman_log italic_m (B.27)

Plugging Lemma B.7 into Proposition B.6 and Corollary B.6.1, we obtain the convergence guarantee of Theorem 2.2.

Appendix C Convergence guarantees for the FPR dynamics

In this section, we prove the convergence guarantee of the faulty proportional response (FPR) dynamics. We first examine the immediate effects of allowing erroneous estimations of u𝑢uitalic_u and p𝑝pitalic_p in the FPR dynamics. Let B^i(t)=j[m]b^ij(t)superscriptsubscript^𝐵𝑖𝑡subscript𝑗delimited-[]𝑚superscriptsubscript^𝑏𝑖𝑗𝑡\hat{B}_{i}^{(t)}=\sum_{j\in[m]}\hat{b}_{ij}^{(t)}over^ start_ARG italic_B end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. Note that B^i(t)Bisuperscriptsubscript^𝐵𝑖𝑡subscript𝐵𝑖\hat{B}_{i}^{(t)}\neq B_{i}over^ start_ARG italic_B end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≠ italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as the normalization step of constructing b^ijsubscript^𝑏𝑖𝑗\hat{b}_{ij}over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is erroneous. By the construction of b^(t)superscript^𝑏𝑡\hat{b}^{(t)}over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT by the FPR dynamics,

b^ij(t)=Biν~i(t1)vijb^ij(t1)p~j(t1),superscriptsubscript^𝑏𝑖𝑗𝑡subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡1subscript𝑣𝑖𝑗superscriptsubscript^𝑏𝑖𝑗𝑡1superscriptsubscript~𝑝𝑗𝑡1\hat{b}_{ij}^{(t)}=\frac{B_{i}}{\tilde{\nu}_{i}^{(t-1)}}v_{ij}\frac{\hat{b}_{% ij}^{(t-1)}}{\tilde{p}_{j}^{(t-1)}},over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG , (C.1)

we can find that

B^i(t)=j[m]b^ij(t)=j[m]Biν~i(t1)vijb^ij(t1)p~j(t1)=Biν~i(t1)j[m]vijb^ij(t1)p~j(t1)=Biν^i(t1)ν~i(t1),superscriptsubscript^𝐵𝑖𝑡subscript𝑗delimited-[]𝑚superscriptsubscript^𝑏𝑖𝑗𝑡subscript𝑗delimited-[]𝑚subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡1subscript𝑣𝑖𝑗superscriptsubscript^𝑏𝑖𝑗𝑡1superscriptsubscript~𝑝𝑗𝑡1subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡1subscript𝑗delimited-[]𝑚subscript𝑣𝑖𝑗superscriptsubscript^𝑏𝑖𝑗𝑡1superscriptsubscript~𝑝𝑗𝑡1subscript𝐵𝑖superscriptsubscript^𝜈𝑖𝑡1superscriptsubscript~𝜈𝑖𝑡1\hat{B}_{i}^{(t)}=\sum_{j\in[m]}\hat{b}_{ij}^{(t)}=\sum_{j\in[m]}\frac{B_{i}}{% \tilde{\nu}_{i}^{(t-1)}}v_{ij}\frac{\hat{b}_{ij}^{(t-1)}}{\tilde{p}_{j}^{(t-1)% }}=\frac{B_{i}}{\tilde{\nu}_{i}^{(t-1)}}\sum_{j\in[m]}v_{ij}\frac{\hat{b}_{ij}% ^{(t-1)}}{\tilde{p}_{j}^{(t-1)}}=B_{i}\frac{\hat{\nu}_{i}^{(t-1)}}{\tilde{\nu}% _{i}^{(t-1)}},over^ start_ARG italic_B end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT end_ARG , (C.2)

where we can obtain the following inequality by definition of ν~isubscript~𝜈𝑖\tilde{\nu}_{i}over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT:

Bi1+ϵνB^i(t)Bi1ϵνsubscript𝐵𝑖1subscriptitalic-ϵ𝜈superscriptsubscript^𝐵𝑖𝑡subscript𝐵𝑖1subscriptitalic-ϵ𝜈\frac{B_{i}}{1+\epsilon_{\nu}}\leq\hat{B}_{i}^{(t)}\leq\frac{B_{i}}{1-\epsilon% _{\nu}}divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT end_ARG ≤ over^ start_ARG italic_B end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≤ divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT end_ARG (C.3)

By summing Bi^^subscript𝐵𝑖\hat{B_{i}}over^ start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG, we find that

11+ϵνi[n]B^i(t)=i[n],j[m]b^ij(t)=j[m]p^j(t)11ϵν.11subscriptitalic-ϵ𝜈subscript𝑖delimited-[]𝑛superscriptsubscript^𝐵𝑖𝑡subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript^𝑏𝑖𝑗𝑡subscript𝑗delimited-[]𝑚superscriptsubscript^𝑝𝑗𝑡11subscriptitalic-ϵ𝜈\frac{1}{1+\epsilon_{\nu}}\leq\sum_{i\in[n]}\hat{B}_{i}^{(t)}=\sum_{i\in[n],j% \in[m]}\hat{b}_{ij}^{(t)}=\sum_{j\in[m]}\hat{p}_{j}^{(t)}\leq\frac{1}{1-% \epsilon_{\nu}}.divide start_ARG 1 end_ARG start_ARG 1 + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT end_ARG ≤ ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT over^ start_ARG italic_B end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 1 - italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT end_ARG . (C.4)

We now prove Theorem 3.1.

See 3.1

Proof.

Similar to the proof of Lemma B.5, we first lower bound Δ^t=i[n],j[m]bijlogbijb^ij(t)i[n],j[m]bijlogbijb^ij(t+1)subscript^Δ𝑡subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscriptsuperscript^𝑏𝑡𝑖𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscriptsuperscript^𝑏𝑡1𝑖𝑗\hat{\Delta}_{t}=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b_{ij}^{*}}{\hat{b}% ^{(t)}_{ij}}-\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b_{ij}^{*}}{\hat{b}^{(t% +1)}_{ij}}over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG, where we use as follows:

Δ^tsubscript^Δ𝑡\displaystyle\hat{\Delta}_{t}over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =i[n],j[m]bijlogb^ij(t+1)b^ij(t)=i[n],j[m]bijlogBivijp~j(t)ν~i(t)absentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscriptsuperscript^𝑏𝑡1𝑖𝑗subscriptsuperscript^𝑏𝑡𝑖𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖subscript𝑣𝑖𝑗superscriptsubscript~𝑝𝑗𝑡superscriptsubscript~𝜈𝑖𝑡\displaystyle=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{\hat{b}^{(t+1)}_{ij}}{% \hat{b}^{(t)}_{ij}}=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{B_{i}v_{ij}}{% \tilde{p}_{j}^{(t)}\tilde{\nu}_{i}^{(t)}}= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG (C.5)
=i[n],j[m](bijlogvijpj+bijlogpjp~j(t)bijlogν~i(t)+bijlogBi)absentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑝𝑗superscriptsubscript~𝑝𝑗𝑡superscriptsubscript𝑏𝑖𝑗superscriptsubscript~𝜈𝑖𝑡superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖\displaystyle=\sum_{i\in[n],j\in[m]}\left(b_{ij}^{*}\log\frac{v_{ij}}{p_{j}^{*% }}+b_{ij}^{*}\log\frac{p_{j}^{*}}{\tilde{p}_{j}^{(t)}}-b_{ij}^{*}\log\tilde{% \nu}_{i}^{(t)}+b_{ij}^{*}\log B_{i}\right)= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT ( italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG + italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT + italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (C.6)
=i[n],j[m]bijlogvijpj+j[m]pjlogpjp~j(t)i[n]Bilogν~i(t)+i[n]BilogBiabsentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗subscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗superscriptsubscript~𝑝𝑗𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝐵𝑖\displaystyle=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{v_{ij}}{p_{j}^{*}}+% \sum_{j\in[m]}p_{j}^{*}\log\frac{p_{j}^{*}}{\tilde{p}_{j}^{(t)}}-\sum_{i\in[n]% }B_{i}\log\tilde{\nu}_{i}^{(t)}+\sum_{i\in[n]}B_{i}\log B_{i}= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (C.7)
=Φ(b)+j[m]pjlogpjp~j(t)i[n]Bilogν~i(t).absentΦsuperscript𝑏subscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗superscriptsubscript~𝑝𝑗𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡\displaystyle=-\Phi(b^{*})+\sum_{j\in[m]}p_{j}^{*}\log\frac{p_{j}^{*}}{\tilde{% p}_{j}^{(t)}}-\sum_{i\in[n]}B_{i}\log\tilde{\nu}_{i}^{(t)}.= - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT . (C.8)

We now lower bound the second and third terms from the above individually as follows. Starting with the second term,

j[m]pjlogpjp~j(t)subscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗superscriptsubscript~𝑝𝑗𝑡\displaystyle\sum_{j\in[m]}p_{j}^{*}\log\frac{p_{j}^{*}}{\tilde{p}_{j}^{(t)}}∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG =j[m]pjlogpjp^j(t)/j[m]p^j(t)+j[m]pjlogp^j(t)p~j(t)j[m]pjlogj[m]p^j(t)absentsubscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗superscriptsubscript^𝑝𝑗𝑡subscriptsuperscript𝑗delimited-[]𝑚superscriptsubscript^𝑝superscript𝑗𝑡subscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗superscriptsubscript^𝑝𝑗𝑡superscriptsubscript~𝑝𝑗𝑡subscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗subscriptsuperscript𝑗delimited-[]𝑚superscriptsubscript^𝑝superscript𝑗𝑡\displaystyle=\sum_{j\in[m]}p_{j}^{*}\log\frac{p_{j}^{*}}{\hat{p}_{j}^{(t)}/% \sum_{j^{\prime}\in[m]}\hat{p}_{j^{\prime}}^{(t)}}+\sum_{j\in[m]}p_{j}^{*}\log% \frac{\hat{p}_{j}^{(t)}}{\tilde{p}_{j}^{(t)}}-\sum_{j\in[m]}p_{j}^{*}\log\sum_% {j^{\prime}\in[m]}\hat{p}_{j^{\prime}}^{(t)}= ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT / ∑ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT (C.9)
=D(pjp^j(t)j[m]p^j(t))+j[m]pjlogp^j(t)p~j(t)logj[m]p^j(t)\displaystyle=D\left(p_{j}^{*}\middle\|\frac{\hat{p}_{j}^{(t)}}{\sum_{j^{% \prime}\in[m]}\hat{p}_{j^{\prime}}^{(t)}}\right)+\sum_{j\in[m]}p_{j}^{*}\log% \frac{\hat{p}_{j}^{(t)}}{\tilde{p}_{j}^{(t)}}-\log\sum_{j\in[m]}\hat{p}_{j}^{(% t)}= italic_D ( italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG ) + ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT (C.10)
D(pjp^j(t)j[m]p^j(t))+j[m]pjlog11+ϵplog11ϵν\displaystyle\geq D\left(p_{j}^{*}\middle\|\frac{\hat{p}_{j}^{(t)}}{\sum_{j^{% \prime}\in[m]}\hat{p}_{j^{\prime}}^{(t)}}\right)+\sum_{j\in[m]}p_{j}^{*}\log% \frac{1}{1+\epsilon_{p}}-\log\frac{1}{1-\epsilon_{\nu}}≥ italic_D ( italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG ) + ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG 1 end_ARG start_ARG 1 + italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG - roman_log divide start_ARG 1 end_ARG start_ARG 1 - italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT end_ARG (C.11)
D(pjp^j(t)j[m]p^j(t))ϵp2ϵνϵp2ϵν\displaystyle\geq D\left(p_{j}^{*}\middle\|\frac{\hat{p}_{j}^{(t)}}{\sum_{j^{% \prime}\in[m]}\hat{p}_{j^{\prime}}^{(t)}}\right)-\epsilon_{p}-2\epsilon_{\nu}% \geq-\epsilon_{p}-2\epsilon_{\nu}≥ italic_D ( italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG ) - italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - 2 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ≥ - italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - 2 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT (C.12)

Moving on the the third term,

i[n]Bilogν~i(t)subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡\displaystyle-\sum_{i\in[n]}B_{i}\log\tilde{\nu}_{i}^{(t)}- ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT =i[n]Bilogν^i(t)i[n]Bilogν~i(t)ν^i(t)absentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript^𝜈𝑖𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡superscriptsubscript^𝜈𝑖𝑡\displaystyle=-\sum_{i\in[n]}B_{i}\log\hat{\nu}_{i}^{(t)}-\sum_{i\in[n]}B_{i}% \log\frac{\tilde{\nu}_{i}^{(t)}}{\hat{\nu}_{i}^{(t)}}= - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over^ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log divide start_ARG over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG (C.13)
i[n]Bilogν^i(t)i[n]Bilog(1+ϵν)absentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript^𝜈𝑖𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖1subscriptitalic-ϵ𝜈\displaystyle\geq-\sum_{i\in[n]}B_{i}\log\hat{\nu}_{i}^{(t)}-\sum_{i\in[n]}B_{% i}\log(1+\epsilon_{\nu})≥ - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over^ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ( 1 + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) (C.14)
=i[n]Bilogj[m]p^j(t)p~j(t)vijb^ij(t)p^j(t)log(1+ϵν)absentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑗delimited-[]𝑚superscriptsubscript^𝑝𝑗𝑡superscriptsubscript~𝑝𝑗𝑡subscript𝑣𝑖𝑗superscriptsubscript^𝑏𝑖𝑗𝑡superscriptsubscript^𝑝𝑗𝑡1subscriptitalic-ϵ𝜈\displaystyle=-\sum_{i\in[n]}B_{i}\log{\sum_{j\in[m]}\frac{\hat{p}_{j}^{(t)}}{% \tilde{p}_{j}^{(t)}}}\frac{v_{ij}\hat{b}_{ij}^{(t)}}{\hat{p}_{j}^{(t)}}-\log(1% +\epsilon_{\nu})= - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - roman_log ( 1 + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) (C.15)
i[n]Bilogj[m]11ϵpvijb^ij(t)p^j(t)log(1+ϵν)absentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑗delimited-[]𝑚11subscriptitalic-ϵ𝑝subscript𝑣𝑖𝑗superscriptsubscript^𝑏𝑖𝑗𝑡superscriptsubscript^𝑝𝑗𝑡1subscriptitalic-ϵ𝜈\displaystyle\geq-\sum_{i\in[n]}B_{i}\log\sum_{j\in[m]}\frac{1}{1-\epsilon_{p}% }\frac{v_{ij}\hat{b}_{ij}^{(t)}}{\hat{p}_{j}^{(t)}}-\log(1+\epsilon_{\nu})≥ - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 1 - italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - roman_log ( 1 + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) (C.16)
=i[n]Bilogj[m]vijb^ij(t)p^j(t)+log(1ϵp)log(1+ϵν)absentsubscript𝑖delimited-[]𝑛subscript𝐵𝑖subscript𝑗delimited-[]𝑚subscript𝑣𝑖𝑗superscriptsubscript^𝑏𝑖𝑗𝑡superscriptsubscript^𝑝𝑗𝑡1subscriptitalic-ϵ𝑝1subscriptitalic-ϵ𝜈\displaystyle=-\sum_{i\in[n]}B_{i}\log\sum_{j\in[m]}\frac{v_{ij}\hat{b}_{ij}^{% (t)}}{\hat{p}_{j}^{(t)}}+\log(1-\epsilon_{p})-\log(1+\epsilon_{\nu})= - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT divide start_ARG italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG + roman_log ( 1 - italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) - roman_log ( 1 + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) (C.17)
Φ(b^(t))2ϵpϵνabsentΦsuperscript^𝑏𝑡2subscriptitalic-ϵ𝑝subscriptitalic-ϵ𝜈\displaystyle\geq\Phi(\hat{b}^{(t)})-2\epsilon_{p}-\epsilon_{\nu}≥ roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - 2 italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT (C.18)

Hence, in total, we find that

Δ^t=i[n],j[m]bijlogb^ij(t+1)b^ij(t)Φ(b^(t))Φ(b)3ϵp3ϵν.subscript^Δ𝑡subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscriptsuperscript^𝑏𝑡1𝑖𝑗subscriptsuperscript^𝑏𝑡𝑖𝑗Φsuperscript^𝑏𝑡Φsuperscript𝑏3subscriptitalic-ϵ𝑝3subscriptitalic-ϵ𝜈\hat{\Delta}_{t}=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{\hat{b}^{(t+1)}_{ij% }}{\hat{b}^{(t)}_{ij}}\geq\Phi(\hat{b}^{(t)})-\Phi(b^{*})-3\epsilon_{p}-3% \epsilon_{\nu}.over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG ≥ roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - 3 italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - 3 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT . (C.19)

Taking the telescoping sum of Δ^tsubscript^Δ𝑡\hat{\Delta}_{t}over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, we see that

t=0TΔ^t=i[n],j[m]bijlogbijbij(0)i[n],j[m]bijlogbijb^ij(t+1)t=0T(Φ(b^(t))Φ(b)3ϵp3ϵν).superscriptsubscript𝑡0𝑇subscript^Δ𝑡subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscriptsuperscript𝑏0𝑖𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscriptsuperscript^𝑏𝑡1𝑖𝑗superscriptsubscript𝑡0𝑇Φsuperscript^𝑏𝑡Φsuperscript𝑏3subscriptitalic-ϵ𝑝3subscriptitalic-ϵ𝜈\sum_{t=0}^{T}\hat{\Delta}_{t}=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b_{ij% }^{*}}{b^{(0)}_{ij}}-\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b_{ij}^{*}}{% \hat{b}^{(t+1)}_{ij}}\geq\sum_{t=0}^{T}\left(\Phi(\hat{b}^{(t)})-\Phi(b^{*})-3% \epsilon_{p}-3\epsilon_{\nu}\right).∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG ≥ ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - 3 italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - 3 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) . (C.20)

Taking the upper bound of t=0TΔ^tsuperscriptsubscript𝑡0𝑇subscript^Δ𝑡\sum_{t=0}^{T}\hat{\Delta}_{t}∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, we obtain

t=0TΔ^tsuperscriptsubscript𝑡0𝑇subscript^Δ𝑡\displaystyle\sum_{t=0}^{T}\hat{\Delta}_{t}∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =i[n],j[m]bijlogbijbij(0)i[n],j[m]bijlogb^ijb^ij(t+1)absentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscriptsuperscript𝑏0𝑖𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript^𝑏𝑖𝑗subscriptsuperscript^𝑏𝑡1𝑖𝑗\displaystyle=\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b_{ij}^{*}}{b^{(0)}_{% ij}}-\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{\hat{b}_{ij}^{*}}{\hat{b}^{(t+1% )}_{ij}}= ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG (C.21)
i[n],j[m]bijlogbijbij(0)i[n],j[m]bijlogbijb^ij(t+1)/i[n],j[m]b^ij(t+1)+i[n],j[m]bijlogi[n],j[m]b^ij(t+1)absentsubscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscriptsuperscript𝑏0𝑖𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscriptsuperscript^𝑏𝑡1𝑖𝑗subscriptformulae-sequencesuperscript𝑖delimited-[]𝑛superscript𝑗delimited-[]𝑚subscriptsuperscript^𝑏𝑡1superscript𝑖superscript𝑗subscriptformulae-sequence𝑖delimited-[]𝑛𝑗delimited-[]𝑚superscriptsubscript𝑏𝑖𝑗subscriptformulae-sequencesuperscript𝑖delimited-[]𝑛superscript𝑗delimited-[]𝑚subscriptsuperscript^𝑏𝑡1superscript𝑖superscript𝑗\displaystyle\leq\begin{multlined}\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b_% {ij}^{*}}{b^{(0)}_{ij}}-\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\frac{b_{ij}^{*}}{% \hat{b}^{(t+1)}_{ij}/\sum_{i^{\prime}\in[n],j^{\prime}\in[m]}\hat{b}^{(t+1)}_{% i^{\prime}j^{\prime}}}\\ +\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\sum_{i^{\prime}\in[n],j^{\prime}\in[m]}% \hat{b}^{(t+1)}_{i^{\prime}j^{\prime}}\end{multlined}\sum_{i\in[n],j\in[m]}b_{% ij}^{*}\log\frac{b_{ij}^{*}}{b^{(0)}_{ij}}-\sum_{i\in[n],j\in[m]}b_{ij}^{*}% \log\frac{b_{ij}^{*}}{\hat{b}^{(t+1)}_{ij}/\sum_{i^{\prime}\in[n],j^{\prime}% \in[m]}\hat{b}^{(t+1)}_{i^{\prime}j^{\prime}}}\\ +\sum_{i\in[n],j\in[m]}b_{ij}^{*}\log\sum_{i^{\prime}\in[n],j^{\prime}\in[m]}% \hat{b}^{(t+1)}_{i^{\prime}j^{\prime}}≤ start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / ∑ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_n ] , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL + ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] , italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log ∑ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_n ] , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_CELL end_ROW (C.24)
=D(bb(0))D(bb^(t+1)i[n],j[m]b^ij(t+1))+logi[n],j[m]b^ij(t+1)\displaystyle=D(b^{*}\|b^{(0)})-D\left(b^{*}\middle\|\frac{\hat{b}^{(t+1)}}{% \sum_{i^{\prime}\in[n],j^{\prime}\in[m]}\hat{b}^{(t+1)}_{i^{\prime}j^{\prime}}% }\right)+\log\sum_{i^{\prime}\in[n],j^{\prime}\in[m]}\hat{b}^{(t+1)}_{i^{% \prime}j^{\prime}}= italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) - italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_n ] , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG ) + roman_log ∑ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_n ] , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (C.25)
D(bb(0))D(bb^(t+1)i[n],j[m]b^ij(t+1))+log(1+ϵν)\displaystyle\leq D(b^{*}\|b^{(0)})-D\left(b^{*}\middle\|\frac{\hat{b}^{(t+1)}% }{\sum_{i^{\prime}\in[n],j^{\prime}\in[m]}\hat{b}^{(t+1)}_{i^{\prime}j^{\prime% }}}\right)+\log(1+\epsilon_{\nu})≤ italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) - italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ divide start_ARG over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_n ] , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_m ] end_POSTSUBSCRIPT over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG ) + roman_log ( 1 + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) (C.26)
D(bb(0))+ϵνabsent𝐷conditionalsuperscript𝑏superscript𝑏0subscriptitalic-ϵ𝜈\displaystyle\leq D(b^{*}\|b^{(0)})+\epsilon_{\nu}≤ italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) + italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT (C.27)

Hence, we obtain t=0T(Φ(b^(T))Φ(b))D(bb(0))+(3T+4)ϵν+(3T+3)ϵpsuperscriptsubscript𝑡0𝑇Φsuperscript^𝑏𝑇Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏03𝑇4subscriptitalic-ϵ𝜈3𝑇3subscriptitalic-ϵ𝑝\sum_{t=0}^{T}\left(\Phi(\hat{b}^{(T)})-\Phi(b^{*})\right)\leq D(b^{*}\|b^{(0)% })+(3T+4)\epsilon_{\nu}+(3T+3)\epsilon_{p}∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) + ( 3 italic_T + 4 ) italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT + ( 3 italic_T + 3 ) italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. Instead of T𝑇Titalic_T, we plug in T1𝑇1T-1italic_T - 1 to obtain

t=0T1(Φ(b^(t))Φ(b))D(bb(0))+(3T+1)ϵν+3Tϵp.superscriptsubscript𝑡0𝑇1Φsuperscript^𝑏𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏03𝑇1subscriptitalic-ϵ𝜈3𝑇subscriptitalic-ϵ𝑝\sum_{t=0}^{T-1}\left(\Phi(\hat{b}^{(t)})-\Phi(b^{*})\right)\leq D(b^{*}\|b^{(% 0)})+(3T+1)\epsilon_{\nu}+3T\epsilon_{p}.∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT ( roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) + ( 3 italic_T + 1 ) italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT + 3 italic_T italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT . (C.28)

With a simple observation that

Tmint[T](Φ(b^(t))Φ(b))t=0T1(Φ(b^(t))Φ(b)),𝑇subscript𝑡delimited-[]𝑇Φsuperscript^𝑏𝑡Φsuperscript𝑏superscriptsubscript𝑡0𝑇1Φsuperscript^𝑏𝑡Φsuperscript𝑏T\cdot\min_{t\in[T]}\left(\Phi(\hat{b}^{(t)})-\Phi(b^{*})\right)\leq\sum_{t=0}% ^{T-1}\left(\Phi(\hat{b}^{(t)})-\Phi(b^{*})\right),italic_T ⋅ roman_min start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT ( roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT ( roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) , (C.29)

we find

mint[T](Φ(b^(t))Φ(b))D(bb(0))T+4ϵν+3ϵp.subscript𝑡delimited-[]𝑇Φsuperscript^𝑏𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏0𝑇4subscriptitalic-ϵ𝜈3subscriptitalic-ϵ𝑝\min_{t\in[T]}\left(\Phi(\hat{b}^{(t)})-\Phi(b^{*})\right)\leq\frac{D(b^{*}\|b% ^{(0)})}{T}+4\epsilon_{\nu}+3\epsilon_{p}.roman_min start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT ( roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ divide start_ARG italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T end_ARG + 4 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT + 3 italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT . (C.30)

To upper bound D(bb(0))𝐷conditionalsuperscript𝑏superscript𝑏0D(b^{*}\|b^{(0)})italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ), we use the result of Lemma 13 of [53] and Theorem 7 of [26] as follows:

D(bb(0))=ijbijlogbijbij(0)=ijbijlogmbijBi=logm+ijbijlogbijBilogm𝐷conditionalsuperscript𝑏superscript𝑏0subscript𝑖𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗0subscript𝑖𝑗superscriptsubscript𝑏𝑖𝑗𝑚superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖𝑚subscript𝑖𝑗superscriptsubscript𝑏𝑖𝑗superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖𝑚D(b^{*}\|b^{(0)})=\sum_{ij}b_{ij}^{*}\log\frac{b_{ij}^{*}}{b_{ij}^{(0)}}=\sum_% {ij}b_{ij}^{*}\log\frac{mb_{ij}^{*}}{B_{i}}=\log m+\sum_{ij}b_{ij}^{*}\log% \frac{b_{ij}^{*}}{B_{i}}\leq\log mitalic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_m italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG = roman_log italic_m + ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ≤ roman_log italic_m (C.31)

where the last inequality is due to bijBi1superscriptsubscript𝑏𝑖𝑗subscript𝐵𝑖1\frac{b_{ij}^{*}}{B_{i}}\leq 1divide start_ARG italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ≤ 1.

Then by setting ϵν=18Tsubscriptitalic-ϵ𝜈18𝑇\epsilon_{\nu}=\frac{1}{8T}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 8 italic_T end_ARG and ϵν=16Tsubscriptitalic-ϵ𝜈16𝑇\epsilon_{\nu}=\frac{1}{6T}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 6 italic_T end_ARG, we obtain

mint[T]Φ(b^(t))Φ(b)2logmT.subscript𝑡delimited-[]𝑇Φsuperscript^𝑏𝑡Φsuperscript𝑏2𝑚𝑇\min_{t\in[T]}\Phi(\hat{b}^{(t)})-\Phi(b^{*})\leq\frac{2\log m}{T}.roman_min start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG 2 roman_log italic_m end_ARG start_ARG italic_T end_ARG . (C.32)

Next, we prove Theorem 3.2.

See 3.2

Proof.

We slightly modify the proof of Theorem 3.1, and note that by Equation C.8 and Equation C.12, we have

Δ^tsubscript^Δ𝑡\displaystyle\hat{\Delta}_{t}over^ start_ARG roman_Δ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =Φ(b)+j[m]pjlogpjp~j(t)i[n]Bilogν~i(t)absentΦsuperscript𝑏subscript𝑗delimited-[]𝑚superscriptsubscript𝑝𝑗superscriptsubscript𝑝𝑗superscriptsubscript~𝑝𝑗𝑡subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡\displaystyle=-\Phi(b^{*})+\sum_{j\in[m]}p_{j}^{*}\log\frac{p_{j}^{*}}{\tilde{% p}_{j}^{(t)}}-\sum_{i\in[n]}B_{i}\log\tilde{\nu}_{i}^{(t)}= - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_j ∈ [ italic_m ] end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT Φ(b)ϵp2ϵνi[n]Bilogν~i(t)absentΦsuperscript𝑏subscriptitalic-ϵ𝑝2subscriptitalic-ϵ𝜈subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡\displaystyle\geq-\Phi(b^{*})-\epsilon_{p}-2\epsilon_{\nu}-\sum_{i\in[n]}B_{i}% \log\tilde{\nu}_{i}^{(t)}≥ - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - 2 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT (C.33)

Taking the telescoping sum and the upper bound from Equation C.27, we obtain

t=0T1(i[n]Bilogν~i(t)Φ(b))D(bb(0))+(2T+1)ϵν+Tϵp,superscriptsubscript𝑡0𝑇1subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏02𝑇1subscriptitalic-ϵ𝜈𝑇subscriptitalic-ϵ𝑝\sum_{t=0}^{T-1}\left(-\sum_{i\in[n]}B_{i}\log\tilde{\nu}_{i}^{(t)}-\Phi(b^{*}% )\right)\leq D(b^{*}\|b^{(0)})+(2T+1)\epsilon_{\nu}+T\epsilon_{p},∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT ( - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) + ( 2 italic_T + 1 ) italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT + italic_T italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , (C.34)

where we can note

mint[T](i[n]Bilogν~i(t)Φ(b))D(bb(0))T+3ϵν+ϵp.subscript𝑡delimited-[]𝑇subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏0𝑇3subscriptitalic-ϵ𝜈subscriptitalic-ϵ𝑝\min_{t\in[T]}\left(-\sum_{i\in[n]}B_{i}\log\tilde{\nu}_{i}^{(t)}-\Phi(b^{*})% \right)\leq\frac{D(b^{*}\|b^{(0)})}{T}+3\epsilon_{\nu}+\epsilon_{p}.roman_min start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT ( - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ≤ divide start_ARG italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T end_ARG + 3 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT . (C.35)

Let t=argmint[T](i[n]Bilogν~i(t)Φ(b))superscript𝑡subscriptargmin𝑡delimited-[]𝑇subscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡Φsuperscript𝑏t^{*}=\operatorname{arg\,min}_{t\in[T]}\left(-\sum_{i\in[n]}B_{i}\log\tilde{% \nu}_{i}^{(t)}-\Phi(b^{*})\right)italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = start_OPFUNCTION roman_arg roman_min end_OPFUNCTION start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT ( - ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ). Then by Equation C.18, we have the following:

i[n]Bilogν~i(t)Φ(b^(t))2ϵpϵνsubscript𝑖delimited-[]𝑛subscript𝐵𝑖superscriptsubscript~𝜈𝑖𝑡Φsuperscript^𝑏𝑡2subscriptitalic-ϵ𝑝subscriptitalic-ϵ𝜈-\sum_{i\in[n]}B_{i}\log\tilde{\nu}_{i}^{(t)}\geq\Phi(\hat{b}^{(t)})-2\epsilon% _{p}-\epsilon_{\nu}- ∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log over~ start_ARG italic_ν end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≥ roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - 2 italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT (C.36)

Then we can obtain

Φ(b^(t))Φ(b)D(bb(0))T+4ϵν+3ϵp.Φsuperscript^𝑏superscript𝑡Φsuperscript𝑏𝐷conditionalsuperscript𝑏superscript𝑏0𝑇4subscriptitalic-ϵ𝜈3subscriptitalic-ϵ𝑝\Phi(\hat{b}^{(t^{*})})-\Phi(b^{*})\leq\frac{D(b^{*}\|b^{(0)})}{T}+4\epsilon_{% \nu}+3\epsilon_{p}.roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG italic_D ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T end_ARG + 4 italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT + 3 italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT . (C.37)

Lastly by setting ϵν=18Tsubscriptitalic-ϵ𝜈18𝑇\epsilon_{\nu}=\frac{1}{8T}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 8 italic_T end_ARG and ϵν=16Tsubscriptitalic-ϵ𝜈16𝑇\epsilon_{\nu}=\frac{1}{6T}italic_ϵ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 6 italic_T end_ARG, we obtain

Φ(b^(t))Φ(b)2logmT.Φsuperscript^𝑏superscript𝑡Φsuperscript𝑏2𝑚𝑇\Phi(\hat{b}^{(t^{*})})-\Phi(b^{*})\leq\frac{2\log m}{T}.roman_Φ ( over^ start_ARG italic_b end_ARG start_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ divide start_ARG 2 roman_log italic_m end_ARG start_ARG italic_T end_ARG . (C.38)

Appendix D Experimental and implementation details

Our experiments are conducted on a single NVIDIA P100 GPU and written with the PyTorch library [88]. The optimal objective value is approximately computed by taking the results of the 1000100010001000-th iteration of the PR dynamics.

For the projected gradient descent (PGD) algorithm, our implementation is unlike Gao and Kroer [26], whose task is based on the CEEI scenario where agents are given a unit of fake money and whose end goal is only the allocation. We require information on both the allocation x𝑥xitalic_x and price p𝑝pitalic_p, hence our algorithm output should be the bids b𝑏bitalic_b. Therefore, instead of formulating the problem after the EG objective function, we mirror555Pun intended. the PR dynamics in its equivalence to mirror descent [53] on the Shmyrev objective function and perform PGD on the latter (see Algorithm 2).

Input: Budget B𝐵Bitalic_B, Value v𝑣vitalic_v Learning rate γ𝛾\gammaitalic_γ, Iterations T𝑇Titalic_T
Output: Bids b𝑏bitalic_b
1 bij(0)=Bimsuperscriptsubscript𝑏𝑖𝑗0subscript𝐵𝑖𝑚b_{ij}^{(0)}=\frac{B_{i}}{m}italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = divide start_ARG italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG
2 for t=0𝑡0t=0italic_t = 0 to T𝑇Titalic_T do
       rij(t)=bij(t)γ(1logvij/pj(t))superscriptsubscript𝑟𝑖𝑗𝑡superscriptsubscript𝑏𝑖𝑗𝑡𝛾1subscript𝑣𝑖𝑗superscriptsubscript𝑝𝑗𝑡r_{ij}^{(t)}=b_{ij}^{(t)}-\gamma\cdot(1-\log v_{ij}/p_{j}^{(t)})italic_r start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = italic_b start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - italic_γ ⋅ ( 1 - roman_log italic_v start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) // Gradient step
3       for i=0𝑖0i=0italic_i = 0 to n𝑛nitalic_n do
4             bi,(t+1)=Proj(ri,(t){x+n,kxk=Bi})superscriptsubscript𝑏𝑖𝑡1Projsuperscriptsubscript𝑟𝑖𝑡formulae-sequence𝑥superscriptsubscript𝑛subscript𝑘subscript𝑥𝑘subscript𝐵𝑖b_{i,*}^{(t+1)}=\operatorname{Proj}(r_{i,*}^{(t)}\to\{x\in\mathbb{R}_{+}^{n},% \sum_{k}x_{k}=B_{i}\})italic_b start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = roman_Proj ( italic_r start_POSTSUBSCRIPT italic_i , ∗ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT → { italic_x ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , ∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } )
            // Projection step onto a B_i-simplex
5            
6      
return b(T)superscript𝑏𝑇b^{(T)}italic_b start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT
Algorithm 2 Projected Gradient Descent

We formulate the Shmyrev objective function into the following form to obtain convergence guarantees and the step size:

f(x)=h(Ax)+q,x𝑓𝑥𝐴𝑥𝑞𝑥f(x)=h(Ax)+\langle q,x\rangleitalic_f ( italic_x ) = italic_h ( italic_A italic_x ) + ⟨ italic_q , italic_x ⟩ (D.1)

where xn,Ad×n(),h:d,qn:formulae-sequence𝑥superscript𝑛𝐴subscript𝑑𝑛formulae-sequencesuperscript𝑑𝑞superscript𝑛x\in\mathbb{R}^{n},A\in\mathcal{M}_{d\times n}(\mathbb{R}),h:\mathbb{R}^{d}\to% \mathbb{R},q\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_A ∈ caligraphic_M start_POSTSUBSCRIPT italic_d × italic_n end_POSTSUBSCRIPT ( blackboard_R ) , italic_h : blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT → blackboard_R , italic_q ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Considering a flattened vector of the bids b𝑏bitalic_b, we note that if

A=n{(111)(100010001)m,q=(logv11logv12logvmn),h(x)=ixilogxiA=n\left\{\begin{pmatrix}1\\ 1\\ \vdots\\ 1\end{pmatrix}\right.\otimes\underbrace{\begin{pmatrix}1&0&\cdots&0\\ 0&1&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&1\end{pmatrix}}_{m},\quad q=\begin{pmatrix}-\log v_{11}\\ -\log v_{12}\\ \vdots\\ -\log v_{mn}\end{pmatrix},\quad h(x)=\sum_{i}x_{i}\log x_{i}italic_A = italic_n { ( start_ARG start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW end_ARG ) ⊗ under⏟ start_ARG ( start_ARG start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 1 end_CELL end_ROW end_ARG ) end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_q = ( start_ARG start_ROW start_CELL - roman_log italic_v start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL - roman_log italic_v start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL - roman_log italic_v start_POSTSUBSCRIPT italic_m italic_n end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) , italic_h ( italic_x ) = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (D.2)

then f=Ψ𝑓Ψf=\Psiitalic_f = roman_Ψ. Then by Theorem 3 of [26], by setting a learning rate of γ=1/LA2𝛾1𝐿superscriptnorm𝐴2\gamma=1/L\|A\|^{2}italic_γ = 1 / italic_L ∥ italic_A ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, where L=1/minj,tpj(t)𝐿1subscript𝑗𝑡superscriptsubscript𝑝𝑗𝑡L=1/\min_{j,t}p_{j}^{(t)}italic_L = 1 / roman_min start_POSTSUBSCRIPT italic_j , italic_t end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT, we get linear convergence. Note that A2=nsuperscriptnorm𝐴2𝑛\|A\|^{2}=n∥ italic_A ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_n. Gao and Kroer [26] further provide a line search procedure to set the constant multiplier in the learning rate as well as provide sharper convergence guarantees, but as we only run for 16161616 iterations, we do not perform the line search and fix the learning rate to the initial learning rate that Gao and Kroer [26] use in their empirical studies, which is 1000/LA21000𝐿superscriptnorm𝐴21000/L\|A\|^{2}1000 / italic_L ∥ italic_A ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT.

For amplitude estimation, we set M=Tn/16=32𝑀𝑇𝑛1632M=\sqrt{T\sqrt{n}}/16=32italic_M = square-root start_ARG italic_T square-root start_ARG italic_n end_ARG end_ARG / 16 = 32. We scale down M𝑀Mitalic_M by the constant factor of 16161616 to save memory consumption on the GPU, as we simulate amplitude estimation by computing the full probability distribution over [M]delimited-[]𝑀[M][ italic_M ]. We compensate for the loss in accuracy of the estimation by employing the median-of-means estimator [54], where we take the median of 3333 estimators constructed from the mean of 7777 samples from the amplitude estimation subroutine. We also assume that the maximum finding algorithm is always successful in our algorithm.