When NOMA Meets AIGC: Enhanced Wireless Federated Learning

Ding Xu, , Lingjie Duan, , and Hongbo Zhu Ding Xu is with the Jiangsu Key Laboratory of Wireless Communications, Nanjing University of Posts and Telecommunications, Nanjing 210003, China (E-mail: [email protected]). He is also with the Pillar of Engineering Systems and Design, Singapore University of Technology and Design, Singapore 487372, Singapore.Lingjie Duan is with the Pillar of Engineering Systems and Design, Singapore University of Technology and Design, Singapore 487372, Singapore (E-mail: [email protected]).Hongbo Zhu is with the Jiangsu Key Laboratory of Wireless Communications, Nanjing University of Posts and Telecommunications, Nanjing 210003, China (E-mail: [email protected]).
Abstract

Wireless federated learning (WFL) enables devices to collaboratively train a global model via local model training, uploading and aggregating. However, WFL faces the data scarcity/heterogeneity problem (i.e., data are limited and unevenly distributed among devices) that degrades the learning performance. In this regard, artificial intelligence generated content (AIGC) can synthesize various types of data to compensate for the insufficient local data. Nevertheless, downloading synthetic data or uploading local models iteratively takes a lot of time, especially for a large amount of devices. To address this issue, we propose to leverage non-orthogonal multiple access (NOMA) to achieve efficient synthetic data and local model transmission. This paper is the first to combine AIGC and NOMA with WFL to maximally enhance the learning performance. For the proposed NOMA+AIGC-enhanced WFL, the problem of jointly optimizing the synthetic data distribution, two-way communication and computation resource allocation to minimize the global learning error is investigated. The problem belongs to NP-hard mixed integer nonlinear programming, whose optimal solution is intractable to find. We first employ the block coordinate descent method to decouple the complicated-coupled variables, and then resort to our analytical method to derive an efficient low-complexity local optimal solution with partial closed-form results. Extensive simulations validate the superiority of the proposed scheme compared to the existing and benchmark schemes such as the frequency/time division multiple access based AIGC-enhanced schemes.

Index Terms:
Non-orthogonal multiple access, artificial intelligence generated content, wireless federated learning, synthetic data.

I Introduction

Nowadays, due to the explosive growth of Internet of Things (IoT) devices, the data collected by IoT devices are needed to be analyzed using machine learning techniques to support various IoT applications such as augmented reality and virtual reality [1]. However, because of the limited communication resources, it is unbearable to transmit all of the collected data to a data center to perform centralized machine learning. Luckily, the computing capability of devices is surging due to the fast development of chip technology, and it motivates to implement distributed machine learning that lets each device train a learning model locally using its collected data. Wireless federated learning (WFL) is one of the most famous distributed learning framework that allows devices to collaboratively learn a global model while protecting the data privacy [2]. In WFL, devices train local models using local data independently based on a received global model, then the trained local models are transmitted to the WFL server for aggregating to a global model, while the new global model is redistributed back to the devices, and the process repeated until the global model converges. Therefore, wireless transmission schemes affect the WFL performance greatly and need to be tailored for WFL.

Meanwhile, due to data scarcity and heterogeneity, data are limited and unevenly distributed among devices and some important portions of data are missing locally at some devices, which leads to poor learning convergence accuracy [3]. Therefore, to improve global model accuracy, some measures have to be taken. For example, device selection can be performed to select proper devices for global model aggregation [2]. However, this will introduce the fairness issue. Another method is to let devices collect the missing portions of data. However, due to physical limitations, sometimes it is difficult for the devices to collect the missing portions of local data by themselves. Even if collecting data is possible, it will certainly introduce much more latency and energy consumption, which is not desirable for energy-limited devices and delay-sensitive services. In this regard, artificial intelligence generated content (AIGC) [4], which is a promising technology for synthesizing data, can be adopted to generate synthetic data for device local training. Specifically, AIGC can automatically create various data such as texts, images, and videos as the devices request [5, 6], and can save time and resources that may otherwise be spent on the data collection. Since AIGC usually requires high computation capability, it can be deployed at an AIGC server in the cloud computing center, and devices can request the server for synthesizing specific data and then download the synthetic data for local model training.

I-A Related Works

In the context of WFL networks, many works such as [7, 8, 9, 10, 11] have studied various important problems regarding the implementation of WFL. Specifically, in [7], an algorithm for jointly optimizing the learning, radio resource allocation and device selection based on the Hungarian method was proposed to minimize the WFL loss function. In [8], the problem of joint learning and communication resource optimization to minimize the total energy consumption was investigated, and an iterative algorithm was proposed to achieve a local optimal solution. In [9], the computation and the communication resources as well as the number of the local model parameter quantization bits were jointly optimized to minimize the WFL convergence time. In [10], an iterative algorithm for jointly scheduling devices, local iterations and radio resources was developed based on the pointer network embedded deep reinforcement learning method and the breadth-first search method. In [11], an incentive mechanism based on the Stackelberg game was designed to motivate the devices to participate in collaborative model training.

In the above works, [7, 8, 10, 11] adopted the frequency division multiple access (FDMA) wireless transmission scheme, and [9] adopted the time division multiple access (TDMA) wireless transmission scheme. Both FDMA and TDMA belong to orthogonal multiple access (OMA) transmission scheme. However, OMA suffers from low transmission efficiency [12, 13]. To fully realize the potential capability of WFL, more efficient transmission schemes other than OMA are needed. In this context, non-orthogonal multiple access (NOMA) [13, 14], which supports multiple concurrent transmissions on the same spectrum band, is a promising technique to realize high transmission efficiency in WFL networks. Particularly, devices can upload their local models simultaneously using the superposition coding, and the WFL server can decode different device signals by using the successive interference cancellation (SIC) technique.

There have been some works on NOMA-enhanced WFL networks. Specifically, in [15], devices were assumed to be wirelessly powered by the base station (BS), and the problem of joint optimization of the NOMA communication and computation resources to minimize the system-wise cost was investigated. A layered algorithm based on the monotonic optimization was developed to solve the joint optimization problem. In [16], the problem of jointly optimizing device scheduling, transmit power and computation frequency allocation in a relay-assisted NOMA-enhanced WFL network to minimize the energy consumption was solved by graph theory. In [17], devices were grouped into different NOMA groups for local model uploading, and the transmit power and bandwidth of NOMA groups were jointly optimized to maximize the system convergence metric based on convex optimization. In [18], the joint optimization of device selection and resource allocation to minimize the total training latency in a NOMA-enhanced WFL network was carried by the monotonicity analysis and dual decomposition method.

Meanwhile, AIGC in wireless networks has attracted a lot of attention recently, such as the AIGC-enhanced semantic communications [19, 20, 21], blockchain-enabled AIGC [22], distributed AIGC [23], and AIGC service provider selection [24]. AIGC has also been integrated with WFL in recent works [25, 26]. Particularly, the work in [25] applied WFL to achieve efficient AIGC, and presented WFL-based techniques for AIGC to generate diverse and personalized contents. While [25] focused on how WFL can empower the AIGC, the work in [26] adopted AIGC to empower WFL. Specifically, in [26], AIGC was proposed to generate more training data for devices to minimize the device energy consumption under the learning performance constraint.

I-B Motivation and Contributions

Since devices are heterogeneous and the data are limited and unevenly distributed, AIGC can be used to generate the specified data missing at the devices for more efficient local training and improved global convergence performance in WFL. In this regard, the pioneer work [26] has considered to use AIGC to enhance the WFL performance. However, the work [26] ignored the procedure of synthetic data downloading from the AIGC server to the devices, the time of which cannot be neglected in practice. In addition, the work [26] used a low-efficient FDMA transmission scheme for local model uploading, which is inadequate for a large amount of devices. The above research gaps motivate the work in this paper to use the highly efficient NOMA transmission scheme both for synthetic data downloading and local model uploading in AIGC-enhanced WFL networks, and jointly design the synthetic data distribution, two-way communication resource and computation resource allocation to maximize the learning performance. Note that jointly designing resource optimization policy in NOMA+AIGC-enhanced WFL networks is non-trivial, since the optimization variables are highly coupled in such complicated non-convex optimization problems, while the SIC decoding policy in NOMA can further complicate the optimization.

The main contributions of this paper are summarized as follows:

(1) We are the first to propose a NOMA+AIGC-enhanced WFL system model, where devices can download synthetic data from the AIGC server based on NOMA, then train the local models based on the local data and synthetic data, and upload the local models to the WFL server based on NOMA for global model aggregation. The problem of jointly optimizing the synthetic data allocation, the time allocation, the transmit power allocation of the BS and the devices, the SIC decoding order, and the computing frequency allocation, is formulated with the objective of minimizing the global learning error, under various system constraints.

(2) We propose an efficient low-complexity algorithm with partial closed-form results to achieve a local optimal solution to the problem. First, we analytically derive the closed-form optimal computing frequency allocation to simplify the problem. Then, the block coordinate descent (BCD) method is adopted to decouple the complex problem into two simpler subproblems. The first subproblem optimizes the synthetic data allocation and the time allocation, where the optimal time allocation is analytical obtained in closed form, and the optimal synthetic data allocation is obtained via convex optimization. The second subproblem optimizes the transmit power allocation of the BS, the transmit power of the devices, and the SIC decoding order, where the optimal transmit power allocation of the BS and the transmit power of the devices are analytically obtained in a recursive form, and the optimal SIC decoding order is analytically obtained in closed form.

(3) Extensive simulation results are illustrated to demonstrate the superiority of the proposed NOMA+AIGC-enhanced scheme. The results show the advantages of combining NOMA and AIGC with WFL. It is shown that the proposed scheme outperforms the existing and benchmark schemes such as the FDMA or TDMA+AIGC-enhanced schemes, in terms of global learning accuracy, under various system configurations. Particularly, the learning performance improvement of our proposed scheme is more obvious when the data synthesizing capability of the AIGC server is stronger, the maximum latency requirement is more stringent, and the number of devices is larger.

The remainder of this paper is organized as follows. Section II presents the system model of the proposed NOMA+AIGC-enhanced WFL network. Section III formulates the optimization problem of maximizing the learning performance and presents an algorithm framework to solve the optimization problem. Section IV and V presents algorithms to optimally solve the two subproblems. Section VI provides simulation examples to demonstrate the effectiveness of the proposed NOMA+AIGC-enhanced scheme. Finally, the paper is concluded in Section VII.

II System Model

Refer to caption
Figure 1: NOMA+AIGC-enhanced WFL system model.

We consider a NOMA+AIGC-enhanced WFL network consisting of an AIGC server in the cloud, a BS and K𝐾Kitalic_K devices, as shown in Fig. 1. The BS is equipped with a WFL server for simple computation of global model aggregation, and connects with the upper AIGC server via high-speed wired backhaul. The AIGC server is assumed to be deployed at the cloud computing center and equipped with powerful computation capacity for data synthesis. Each device has a set of original local dataset for local model training. Let Dklocsuperscriptsubscript𝐷𝑘locD_{k}^{\mathrm{loc}}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT denote the number of samples in the local dataset of device k.𝑘k.italic_k . In order to improve the learning model accuracy, AIGC is adopted to synthesize the data required by the devices for performing local training. Specifically, each device can send the data synthesis request to the AIGC server indicating the specific data distributions/properties. After the AIGC server receives the requests from the devices, it will synthesize the requested data and then send the synthetic data to the BS and then the devices for local model training. Let Dkgensuperscriptsubscript𝐷𝑘genD_{k}^{\mathrm{gen}}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT denote the amount of synthetic data samples that are generated by a pre-trained AI generative model and are downloaded from the AIGC server to the device k𝑘kitalic_k. After the AIGC server pushes the synthetic data to the devices, the devices can train the local models based on both the local dataset and the synthetic dataset.

II-A AIGC-Enhanced WFL Model

The whole AIGC-enhanced WFL procedure consists of three phases [26]. In the first phase, all the devices send the data synthesis request to the AIGC server indicating the specific data distributions/properties, and then the AIGC server synthesizes the data required by all the devices. Due to data scarcity/heterogeneity, the local data available at the devices may lack particular types of data, which may hinder the convergence of the global model. Thus, the synthetic data can compensate for the missing portions of local data to improve the learning convergence [26]. We assume that due to physical limitations, the AIGC server can generate a maximum amount of Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT synthetic data [6]. Since the synthetic data received by all the devices cannot exceed the maximum amount to bear by the AIGC server, we have the following synthetic data constraint as given by

k=1KDkgenDgen.superscriptsubscript𝑘1𝐾superscriptsubscript𝐷𝑘gensuperscript𝐷gen\sum_{k=1}^{K}D_{k}^{\mathrm{gen}}\leq D^{\mathrm{gen}}.∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ≤ italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT . (1)

In the second phase, the synthetic data are transmitted from the AIGC server in the cloud to the BS and then the devices with time Tdownsuperscript𝑇downT^{\mathrm{down}}italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT. Since the AIGC server connects with the BS via high-speed wired backhaul, the time for transmission the synthetic data from the AIGC server to the BS is ignored. In the third phase, the WFL server first broadcasts the initial global model to the devices, then each device trains the local model using the local dataset and the synthetic dataset, and finally uploads the trained local model to the WFL server for global model aggregation and update. We assume that the number of global model iterations is fixed at N.𝑁N.italic_N . Let Dmod,superscript𝐷modD^{\mathrm{mod}},italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT , Tsyn,superscript𝑇synT^{\mathrm{syn}},italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT , Tbr,superscript𝑇brT^{\mathrm{br}},italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , Tlocsuperscript𝑇locT^{\mathrm{loc}}italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT and Tupsuperscript𝑇upT^{\mathrm{up}}italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT denote the local/global model data size in bits, the time required for generating the synthetic data for all the devices, the time for the global model broadcasting of the WFL server, the time for the local model training of all the devices, and the time for the local model uploading of all the devices, respectively. Then, the time of the third phase is N(Tbr+Tloc+Tup).𝑁superscript𝑇brsuperscript𝑇locsuperscript𝑇upN\left(T^{\mathrm{br}}+T^{\mathrm{loc}}+T^{\mathrm{up}}\right).italic_N ( italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ) . We assume that there is a pre-determined maximum latency Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT for the whole WFL procedure, i.e.,

Tsyn+Tdown+N(Tbr+Tloc+Tup)Tmax.superscript𝑇synsuperscript𝑇down𝑁superscript𝑇brsuperscript𝑇locsuperscript𝑇upsuperscript𝑇maxT^{\mathrm{syn}}+T^{\mathrm{down}}+N\left(T^{\mathrm{br}}+T^{\mathrm{loc}}+T^{% \mathrm{up}}\right)\leq T^{\mathrm{max}}.italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT + italic_N ( italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ) ≤ italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT . (2)

Since the synthetic data are generated by the pre-trained AI generative model in the AIGC server, we assume that Tsynsuperscript𝑇synT^{\mathrm{syn}}italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT is modeled as a linear function of the amount of synthetic data generated for all the devices [6], i.e., Tsyn=ϱk=1KDkgensuperscript𝑇synitalic-ϱsuperscriptsubscript𝑘1𝐾superscriptsubscript𝐷𝑘genT^{\mathrm{syn}}=\varrho\sum_{k=1}^{K}D_{k}^{\mathrm{gen}}italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT = italic_ϱ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT.

According to the results in [26], the global learning error (i.e., the global learning model accuracy) of the WFL depends on the local training dataset sizes of the devices and the number of global iterations, and can be modeled as

(𝐃gen)=eN(αKk=1K(Dkloc+Dkgen)βγ1)ζ,superscript𝐃gensuperscript𝑒𝑁𝛼𝐾superscriptsubscript𝑘1𝐾superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen𝛽𝛾1𝜁\triangle(\mathbf{D}^{\mathrm{gen}})=e^{\frac{N\left(\frac{\alpha}{K}\sum_{k=1% }^{K}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}\right)^{-\beta}-\gamma-1% \right)}{\zeta}},△ ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) = italic_e start_POSTSUPERSCRIPT divide start_ARG italic_N ( divide start_ARG italic_α end_ARG start_ARG italic_K end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT - italic_γ - 1 ) end_ARG start_ARG italic_ζ end_ARG end_POSTSUPERSCRIPT , (3)

where 𝐃gen={Dkgen,k=1,,K},\mathbf{D}^{\mathrm{gen}}=\{D_{k}^{\mathrm{gen}},k=1,\ldots,K\},bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT = { italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , italic_k = 1 , … , italic_K } , ζ𝜁\zetaitalic_ζ is a positive constant parameter, and α,𝛼\alpha,italic_α , β,𝛽\beta,italic_β , γ𝛾\gammaitalic_γ are positive hyper-parameters that can be obtained through curve fitting [26].

II-B Computation Model

Let fk,subscript𝑓𝑘f_{k},italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , w𝑤witalic_w and τ𝜏\tauitalic_τ denote the computing frequency of the device k,𝑘k,italic_k , the number of CPU cycles of the devices to locally train one data sample, and the local epoch, respectively. The computing frequency of each device is capped by its maximum value, i.e.,

fkfkmax,k,subscript𝑓𝑘superscriptsubscript𝑓𝑘maxfor-all𝑘f_{k}\leq f_{k}^{\mathrm{max}},\forall k,italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , ∀ italic_k , (4)

where fkmaxsuperscriptsubscript𝑓𝑘maxf_{k}^{\mathrm{max}}italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is the maximum computing frequency of the device k.𝑘k.italic_k . Then, the time of the device k𝑘kitalic_k for a single-round local model training is given by

Tkloc=wτ(Dkloc+Dkgen)fk.superscriptsubscript𝑇𝑘loc𝑤𝜏superscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gensubscript𝑓𝑘T_{k}^{\mathrm{loc}}=\frac{w\tau\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}% }\right)}{f_{k}}.italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT = divide start_ARG italic_w italic_τ ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG . (5)

Since each device has to finish the local model training within the required time Tloc,superscript𝑇locT^{\mathrm{loc}},italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , we have

TklocTloc,k.superscriptsubscript𝑇𝑘locsuperscript𝑇locfor-all𝑘T_{k}^{\mathrm{loc}}\leq T^{\mathrm{loc}},\forall k.italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT ≤ italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , ∀ italic_k . (6)

In addition, the energy consumption per single-round local model training of the device k𝑘kitalic_k is written as

Ekloc=wτϖkfk2(Dkloc+Dkgen),superscriptsubscript𝐸𝑘loc𝑤𝜏subscriptitalic-ϖ𝑘superscriptsubscript𝑓𝑘2superscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘genE_{k}^{\mathrm{loc}}=w\tau\varpi_{k}f_{k}^{2}\left(D_{k}^{\mathrm{loc}}+D_{k}^% {\mathrm{gen}}\right),italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT = italic_w italic_τ italic_ϖ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) , (7)

where ϖksubscriptitalic-ϖ𝑘\varpi_{k}italic_ϖ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the hardware energy coefficient [27].

II-C NOMA-Enhanced Communication Model

Let hksubscript𝑘h_{k}italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and gksubscript𝑔𝑘g_{k}italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the channel gains from the BS to the device k,𝑘k,italic_k , and from the device k𝑘kitalic_k to the BS, respectively. The system bandwidth is B.𝐵B.italic_B . There are three procedures that involve wireless transmissions, i.e., synthetic data downloading from the BS to the devices, global model broadcasting from the BS to the devices, and local model uploading from the devices to the BS.

For synthetic data downloading, downlink NOMA is applied. Specifically, according to the downlink NOMA principle, the messages intended for the devices are transmitted simultaneously based on the superposition coding. Each device can apply SIC to cancel the messages of other devices whose channel gains are smaller than its own channel gain. Without loss of the generality, we assume the devices are sorted in the ascending order of the channel gain hk,subscript𝑘h_{k},italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , i.e., h1<h2<<hKsubscript1subscript2subscript𝐾h_{1}<h_{2}<\ldots<h_{K}italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_h start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < … < italic_h start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT. Let pksubscript𝑝𝑘p_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the transmit power of the message for the device k𝑘kitalic_k. Thus, the achievable rate for synthetic data downloading of the device k𝑘kitalic_k is given by

Rkdown=Blog2(1+hkpkσ2B+hkj>kpj),superscriptsubscript𝑅𝑘down𝐵subscript21subscript𝑘subscript𝑝𝑘superscript𝜎2𝐵subscript𝑘subscript𝑗𝑘subscript𝑝𝑗R_{k}^{\mathrm{down}}=B\log_{2}\left(1+\frac{h_{k}p_{k}}{\sigma^{2}B+h_{k}\sum% _{j>k}p_{j}}\right),italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT = italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j > italic_k end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) , (8)

where σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is the noise power spectral density. Since the allocated synthetic data has to be finished transmitting within the time Tdown,superscript𝑇downT^{\mathrm{down}},italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT , we have

TdownRkdownΓDkgen,k,superscript𝑇downsuperscriptsubscript𝑅𝑘downΓsuperscriptsubscript𝐷𝑘genfor-all𝑘T^{\mathrm{down}}R_{k}^{\mathrm{down}}\geq\Gamma D_{k}^{\mathrm{gen}},\forall k,italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT ≥ roman_Γ italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , ∀ italic_k , (9)

where ΓΓ\Gammaroman_Γ is the size of one data sample in bits. Furthermore, the maximum transmit power of the BS is limited as

k=1KpkP,superscriptsubscript𝑘1𝐾subscript𝑝𝑘𝑃\sum_{k=1}^{K}p_{k}\leq P,∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_P , (10)

where P𝑃Pitalic_P is the maximum transmit power of the BS.

For global model broadcasting, the BS is assumed to transmit at its maximum power P,𝑃P,italic_P , and in order for all the devices to successfully receive the global model, the broadcasting rate is chosen as the minimum rate achievable for all the devices, i.e., Blog2(1+h1Pσ2B).𝐵subscript21subscript1𝑃superscript𝜎2𝐵B\log_{2}\left(1+\frac{h_{1}P}{\sigma^{2}B}\right).italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_P end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B end_ARG ) . Since the global model data has to be transmitted within the time Tbr,superscript𝑇brT^{\mathrm{br}},italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , we have

TbrBlog2(1+h1Pσ2B)Dmod.superscript𝑇br𝐵subscript21subscript1𝑃superscript𝜎2𝐵superscript𝐷modT^{\mathrm{br}}B\log_{2}\left(1+\frac{h_{1}P}{\sigma^{2}B}\right)\geq D^{% \mathrm{mod}}.italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_P end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B end_ARG ) ≥ italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT . (11)

For local model uploading, uplink NOMA is adopted. Specifically, the messages of all devices are transmitted to the BS simultaneously. Let qksubscript𝑞𝑘q_{k}italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the transmit power of the device k,𝑘k,italic_k , and Qksubscript𝑄𝑘Q_{k}italic_Q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the maximum transmit power of the device k.𝑘k.italic_k . Then, we have

qkQk,k.subscript𝑞𝑘subscript𝑄𝑘for-all𝑘q_{k}\leq Q_{k},\forall k.italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_Q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k . (12)

At the BS, the SIC is adopted to decode the messages of all the devices. Denote by πksubscript𝜋𝑘\pi_{k}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT the SIC decoding order of the device k𝑘kitalic_k. Let 𝝅={πk,k},𝝅subscript𝜋𝑘for-all𝑘\boldsymbol{\pi}=\{\pi_{k},\forall k\},bold_italic_π = { italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k } , and it belongs to the set ΠΠ\Piroman_Π of all possible SIC decoding orders of all K𝐾Kitalic_K messages. Thus, the achievable rate of the device k𝑘kitalic_k for local model uploading is expressed as

Rkup=Blog2(1+gkqkσ2B+j=1,πj>πkgjqj).superscriptsubscript𝑅𝑘up𝐵subscript21subscript𝑔𝑘subscript𝑞𝑘superscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋𝑘subscript𝑔𝑗subscript𝑞𝑗\displaystyle R_{k}^{\mathrm{up}}=B\log_{2}\left(1+\frac{g_{k}q_{k}}{\sigma^{2% }B+\sum_{j=1,\pi_{j}>\pi_{k}}g_{j}q_{j}}\right).italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT = italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) . (13)

In order to upload the local model to the BS within the time Tup,superscript𝑇upT^{\mathrm{up}},italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT , we have

TupRkupDmod,k.superscript𝑇upsuperscriptsubscript𝑅𝑘upsuperscript𝐷modfor-all𝑘T^{\mathrm{up}}R_{k}^{\mathrm{up}}\geq D^{\mathrm{mod}},\forall k.italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ≥ italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT , ∀ italic_k . (14)

In addition, the energy consumption of the device k𝑘kitalic_k for local model uploading per single-round training is given by

Ekup=Tupqk.superscriptsubscript𝐸𝑘upsuperscript𝑇upsubscript𝑞𝑘E_{k}^{\mathrm{up}}=T^{\mathrm{up}}q_{k}.italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT = italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (15)

III NOMA+AIGC-Enhanced WFL

III-A Problem Formulation

Based on the system description in the previous section, the energy consumption of the device k𝑘kitalic_k for a single-round training is written as

Ek=Ekloc+Ekup.subscript𝐸𝑘superscriptsubscript𝐸𝑘locsuperscriptsubscript𝐸𝑘upE_{k}=E_{k}^{\mathrm{loc}}+E_{k}^{\mathrm{up}}.italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT . (16)

We assume that there is an energy budget for each device, i.e.,

EkEkmax,k,subscript𝐸𝑘superscriptsubscript𝐸𝑘maxfor-all𝑘E_{k}\leq E_{k}^{\mathrm{max}},\forall k,italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , ∀ italic_k , (17)

where Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is the maximum energy consumption of the device k𝑘kitalic_k per single-round training.

Our focus is maximizing the learning performance under the constraints analyzed in the system description. Specifically, the optimization objective is minimizing the global learning error (i.e., to maximize the global learning model accuracy), and the optimization variables are the synthetic data allocation 𝐃gen,superscript𝐃gen\mathbf{D}^{\mathrm{gen}},bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , the time allocation 𝐓={Tdown,Tbr,Tloc,Tup},𝐓superscript𝑇downsuperscript𝑇brsuperscript𝑇locsuperscript𝑇up\mathbf{T}=\{T^{\mathrm{down}},T^{\mathrm{br}},T^{\mathrm{loc}},T^{\mathrm{up}% }\},bold_T = { italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT , italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT } , the transmit power allocation of the BS 𝐩={pk,k=1,,K},\mathbf{p}=\{p_{k},k=1,\ldots,K\},bold_p = { italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , … , italic_K } , the transmit power of the devices 𝐪={qk,k=1,,K},\mathbf{q}=\{q_{k},k=1,\ldots,K\},bold_q = { italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , … , italic_K } , the SIC decoding order 𝝅,𝝅\boldsymbol{\pi},bold_italic_π , and the computing frequency allocation 𝐟={fk,k=1,,K}\mathbf{f}=\{f_{k},k=1,\ldots,K\}bold_f = { italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , … , italic_K }. Mathematically, the optimization problem for the NOMA+AIGC-enhanced WFL is formulated as

min\displaystyle\minroman_min (𝐃gen)=eN(αKk=1K(Dkloc+Dkgen)βγ1)ζsuperscript𝐃gensuperscript𝑒𝑁𝛼𝐾superscriptsubscript𝑘1𝐾superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen𝛽𝛾1𝜁\displaystyle\>\triangle(\mathbf{D}^{\mathrm{gen}})=e^{\frac{N\left(\frac{% \alpha}{K}\sum_{k=1}^{K}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}\right)% ^{-\beta}-\gamma-1\right)}{\zeta}}△ ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) = italic_e start_POSTSUPERSCRIPT divide start_ARG italic_N ( divide start_ARG italic_α end_ARG start_ARG italic_K end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT - italic_γ - 1 ) end_ARG start_ARG italic_ζ end_ARG end_POSTSUPERSCRIPT (18a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . k=1KDkgenDgen,superscriptsubscript𝑘1𝐾superscriptsubscript𝐷𝑘gensuperscript𝐷gen\displaystyle\>\sum_{k=1}^{K}D_{k}^{\mathrm{gen}}\leq D^{\mathrm{gen}},∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ≤ italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , (18b)
Tsyn+Tdown+N(Tbr+Tloc+Tup)Tmax,superscript𝑇synsuperscript𝑇down𝑁superscript𝑇brsuperscript𝑇locsuperscript𝑇upsuperscript𝑇max\displaystyle\>T^{\mathrm{syn}}+T^{\mathrm{down}}+N\left(T^{\mathrm{br}}+T^{% \mathrm{loc}}+T^{\mathrm{up}}\right)\leq T^{\mathrm{max}},italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT + italic_N ( italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ) ≤ italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , (18c)
fkfkmax,k,subscript𝑓𝑘superscriptsubscript𝑓𝑘maxfor-all𝑘\displaystyle\>f_{k}\leq f_{k}^{\mathrm{max}},\forall k,italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , ∀ italic_k , (18d)
TklocTloc,ksuperscriptsubscript𝑇𝑘locsuperscript𝑇locfor-all𝑘\displaystyle\>T_{k}^{\mathrm{loc}}\leq T^{\mathrm{loc}},\forall kitalic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT ≤ italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , ∀ italic_k (18e)
TdownRkdownΓDkgen,k,superscript𝑇downsuperscriptsubscript𝑅𝑘downΓsuperscriptsubscript𝐷𝑘genfor-all𝑘\displaystyle\>T^{\mathrm{down}}R_{k}^{\mathrm{down}}\geq\Gamma D_{k}^{\mathrm% {gen}},\forall k,italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT ≥ roman_Γ italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , ∀ italic_k , (18f)
k=1KpkP,superscriptsubscript𝑘1𝐾subscript𝑝𝑘𝑃\displaystyle\>\sum_{k=1}^{K}p_{k}\leq P,∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_P , (18g)
TbrBlog2(1+h1Pσ2B)Dmod,superscript𝑇br𝐵subscript21subscript1𝑃superscript𝜎2𝐵superscript𝐷mod\displaystyle\>T^{\mathrm{br}}B\log_{2}\left(1+\frac{h_{1}P}{\sigma^{2}B}% \right)\geq D^{\mathrm{mod}},italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_P end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B end_ARG ) ≥ italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT , (18h)
qkQk,k,subscript𝑞𝑘subscript𝑄𝑘for-all𝑘\displaystyle\>q_{k}\leq Q_{k},\forall k,italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_Q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k , (18i)
TupRkupDmod,k,superscript𝑇upsuperscriptsubscript𝑅𝑘upsuperscript𝐷modfor-all𝑘\displaystyle\>T^{\mathrm{up}}R_{k}^{\mathrm{up}}\geq D^{\mathrm{mod}},\forall k,italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ≥ italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT , ∀ italic_k , (18j)
EkEkmax,k,subscript𝐸𝑘superscriptsubscript𝐸𝑘maxfor-all𝑘\displaystyle\>E_{k}\leq E_{k}^{\mathrm{max}},\forall k,italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , ∀ italic_k , (18k)
𝝅Π,𝝅Π\displaystyle\>\boldsymbol{\pi}\in\Pi,bold_italic_π ∈ roman_Π , (18l)
Dkgen0,k,superscriptsubscript𝐷𝑘gen0for-all𝑘\displaystyle\>D_{k}^{\mathrm{gen}}\geq 0,\forall k,italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ≥ 0 , ∀ italic_k , (18m)
pk0,k,subscript𝑝𝑘0for-all𝑘\displaystyle\>p_{k}\geq 0,\forall k,italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 , ∀ italic_k , (18n)
qk0,k,subscript𝑞𝑘0for-all𝑘\displaystyle\>q_{k}\geq 0,\forall k,italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 , ∀ italic_k , (18o)
Tdown0,Tbr0,Tloc0,Tup0,formulae-sequencesuperscript𝑇down0formulae-sequencesuperscript𝑇br0formulae-sequencesuperscript𝑇loc0superscript𝑇up0\displaystyle\>T^{\mathrm{down}}\geq 0,T^{\mathrm{br}}\geq 0,T^{\mathrm{loc}}% \geq 0,T^{\mathrm{up}}\geq 0,italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT ≥ 0 , italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT ≥ 0 , italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT ≥ 0 , italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ≥ 0 , (18p)
fk0,k,subscript𝑓𝑘0for-all𝑘\displaystyle\>f_{k}\geq 0,\forall k,italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 , ∀ italic_k , (18q)
o.v.formulae-sequenceov\displaystyle\mathrm{o.v.}roman_o . roman_v . 𝐃gen,𝐓,𝐩,𝐪,𝝅,𝐟.superscript𝐃gen𝐓𝐩𝐪𝝅𝐟\displaystyle\>\mathbf{D}^{\mathrm{gen}},\mathbf{T},\mathbf{p},\mathbf{q},% \boldsymbol{\pi},\mathbf{f}.bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T , bold_p , bold_q , bold_italic_π , bold_f . (18r)

where the abbreviation ‘s.t.formulae-sequencest\mathrm{s.t.}roman_s . roman_t .’ stands for ‘subject to’ and the abbreviation ‘o.v.formulae-sequenceov\mathrm{o.v.}roman_o . roman_v .’ stands for ‘optimization variables’. The constraint (18b) restricts that the total transmitted synthetic data is smaller than the totally synthetic data that can be generated in the AIGC server. The constraint (18c) requires that the total time for N𝑁Nitalic_N WFL rounds is less than the pre-defined maximum latency. The constraint (18d) restricts the maximum computing frequency of each device. The constraint (18e) requires that each device finishes the local model training within the required time. The constraint (18f) requires that the synthetic data can finish transmitting within the synthetic data transmission time. The constraint (18g) restricts the transmit power of the BS. The constraint (18h) requires that the global model data can finish transmitting within the global model broadcasting time. The constraint (18i) restricts the transmit power of each device. The constraint (18j) requires that the local model data can finish transmitting within the local model uploading time. The constraint (18k) restricts the energy consumption of each device. The constraint (18l) restricts the SIC decoding order. The constraints (18m)-(18q) restrict that the optimization variables are non-negative.

III-B Proposed Solution Framework

Since 𝝅𝝅\boldsymbol{\pi}bold_italic_π is discrete and the constraints (18f), (18j), (18k) are non-convex, the problem (18) is a mixed integer nonlinear programming (MINLP) problem, which is NP-hard in general, and its optimal solution is thus generally intractable to find. Before proposing an algorithm to solve the problem (18), we present analysis to equivalently simplify the problem (18).

Lemma 1.

The constraint (18e) is satisfied with strict equality by the optimal solution to the problem (18).

Proof:

See Appendix -A. ∎

Lemma 1 indicates that each device shall use up the allocated local model training time for saving energy consumption. From Lemma 1, the optimal 𝐟𝐟\mathbf{f}bold_f is derived as

fk=wτ(Dkloc+Dkgen)Tloc.subscript𝑓𝑘𝑤𝜏superscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gensuperscript𝑇locf_{k}=\frac{w\tau\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}\right)}{T^{% \mathrm{loc}}}.italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_w italic_τ ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT end_ARG . (19)

By inserting (19) into the constraints (18d) and (18k), we have

wτ(Dkloc+Dkgen)Tlocfkmax,k,𝑤𝜏superscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gensuperscript𝑇locsuperscriptsubscript𝑓𝑘maxfor-all𝑘\displaystyle\frac{w\tau\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}\right)% }{T^{\mathrm{loc}}}\leq f_{k}^{\mathrm{max}},\forall k,divide start_ARG italic_w italic_τ ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT end_ARG ≤ italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , ∀ italic_k , (20)
ϖkw3τ3(Dkloc+Dkgen)3(Tloc)2+EkupEkmax,k.subscriptitalic-ϖ𝑘superscript𝑤3superscript𝜏3superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen3superscriptsuperscript𝑇loc2superscriptsubscript𝐸𝑘upsuperscriptsubscript𝐸𝑘maxfor-all𝑘\displaystyle\frac{\varpi_{k}w^{3}\tau^{3}\left(D_{k}^{\mathrm{loc}}+D_{k}^{% \mathrm{gen}}\right)^{3}}{(T^{\mathrm{loc}})^{2}}+E_{k}^{\mathrm{up}}\leq E_{k% }^{\mathrm{max}},\forall k.divide start_ARG italic_ϖ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_w start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_τ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ≤ italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , ∀ italic_k . (21)

In addition, from (3), it can be shown that the objective function (𝐃gen)superscript𝐃gen\triangle(\mathbf{D}^{\mathrm{gen}})△ ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) is a monotonically increasing function of k=1K(Dkloc+Dkgen)βsuperscriptsubscript𝑘1𝐾superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen𝛽\sum_{k=1}^{K}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}\right)^{-\beta}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT. Thus, minimizing (𝐃gen)superscript𝐃gen\triangle(\mathbf{D}^{\mathrm{gen}})△ ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) is equivalent to minimizing k=1K(Dkloc+Dkgen)βsuperscriptsubscript𝑘1𝐾superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen𝛽\sum_{k=1}^{K}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}\right)^{-\beta}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT.

Thanks to Lemma 1, the number of optimization variables is reduced. After the above equivalent problem transformation, the problem (18) is rewritten as

min\displaystyle\minroman_min k=1K(Dkloc+Dkgen)βsuperscriptsubscript𝑘1𝐾superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen𝛽\displaystyle\>\sum_{k=1}^{K}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}% \right)^{-\beta}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT (22a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (18b),(18c),(18f)(18j),(18l)(18p),(20),(21),italic-(18bitalic-)italic-(18citalic-)italic-(18fitalic-)italic-(18jitalic-)italic-(18litalic-)italic-(18pitalic-)italic-(20italic-)italic-(21italic-)\displaystyle\>\eqref{eq:p1-c1},\eqref{eq:p1-c2},\eqref{eq:p1-c6}-\eqref{eq:p1% -c10},\eqref{eq:p1-c12}-\eqref{eq:p1-c16},\eqref{eq:new-p1-c3},\eqref{eq:new-p% 1-c11},italic_( italic_) , italic_( italic_) , italic_( italic_) - italic_( italic_) , italic_( italic_) - italic_( italic_) , italic_( italic_) , italic_( italic_) ,
o.v.formulae-sequenceov\displaystyle\mathrm{o.v.}roman_o . roman_v . 𝐃gen,𝐓,𝐩,𝐪,𝝅.superscript𝐃gen𝐓𝐩𝐪𝝅\displaystyle\>\mathbf{D}^{\mathrm{gen}},\mathbf{T},\mathbf{p},\mathbf{q},% \boldsymbol{\pi}.bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T , bold_p , bold_q , bold_italic_π . (22b)

The problem (22) is still a MINLP problem, and its optimal solution is hard to obtain. Therefore, we develop an efficient low-complexity algorithm based on the BCD method to achieve a local optimal solution. The overall flowchart of solving the problem (18) is shown in Fig. 2. Specifically, the problem (22) is decoupled into two subproblems. One subproblem on the left-hand-side optimizes 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T with given 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π as shown in the left part of Fig. 2, and the other one on the right-hand-side optimizes 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π with given 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T as shown in the right part of Fig. 2. The two subproblems are solved iteratively until the objective function value in (22a) converges. We will optimally solve the two subproblems in the next two sections.

Refer to caption
Figure 2: Overall flowchart of solving the problem (18).

IV Efficient Algorithm for Optimally Optimizing 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T

In this subsection, the subproblem of optimizing 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T with given 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π (the left part of Fig. 2) is investigated as

min\displaystyle\minroman_min k=1K(Dkloc+Dkgen)βsuperscriptsubscript𝑘1𝐾superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen𝛽\displaystyle\>\sum_{k=1}^{K}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}% \right)^{-\beta}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT (23a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (18b),(18c),(18f),(18h),(18j),(18m),(18p),(20),(21),italic-(18bitalic-)italic-(18citalic-)italic-(18fitalic-)italic-(18hitalic-)italic-(18jitalic-)italic-(18mitalic-)italic-(18pitalic-)italic-(20italic-)italic-(21italic-)\displaystyle\>\eqref{eq:p1-c1},\eqref{eq:p1-c2},\eqref{eq:p1-c6},\eqref{eq:p1% -c8},\eqref{eq:p1-c10},\eqref{eq:p1-c13},\eqref{eq:p1-c16},\eqref{eq:new-p1-c3% },\eqref{eq:new-p1-c11},italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) ,
o.v.formulae-sequenceov\displaystyle\mathrm{o.v.}roman_o . roman_v . 𝐃gen,𝐓.superscript𝐃gen𝐓\displaystyle\>\mathbf{D}^{\mathrm{gen}},\mathbf{T}.bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T . (23b)

In what follows, we present important properties of the problem (23).

Lemma 2.

The optimal solution to the problem (23) satisfies the constraint (18c) with strict equality.

Proof:

See Appendix -B. ∎

Lemma 2 indicates that the total allocated time for the whole WFL procedure shall use up the maximum latency for saving the energy consumption of the devices. From Lemma 2, we get

Tsyn+Tdown+N(Tbr+Tloc+Tup)=Tmax.superscript𝑇synsuperscript𝑇down𝑁superscript𝑇brsuperscript𝑇locsuperscript𝑇upsuperscript𝑇maxT^{\mathrm{syn}}+T^{\mathrm{down}}+N\left(T^{\mathrm{br}}+T^{\mathrm{loc}}+T^{% \mathrm{up}}\right)=T^{\mathrm{max}}.italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT + italic_N ( italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ) = italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT . (24)
Lemma 3.

The optimal solution to the problem (23) satisfies the constraint (18f) with strict equality for a given j=argmaxkDkgenRkdown𝑗subscript𝑘superscriptsubscript𝐷𝑘gensuperscriptsubscript𝑅𝑘downj=\arg\max_{k}\frac{D_{k}^{\mathrm{gen}}}{R_{k}^{\mathrm{down}}}italic_j = roman_arg roman_max start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG.

Proof:

See Appendix -C. ∎

Lemma 3 indicates that the synthetic data transmission time is determined by the device with the maximum DkgenRkdown.superscriptsubscript𝐷𝑘gensuperscriptsubscript𝑅𝑘down\frac{D_{k}^{\mathrm{gen}}}{R_{k}^{\mathrm{down}}}.divide start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG .

Lemma 4.

The optimal solution to the problem (23) satisfies the constraint (18h) with strict equality.

Proof:

It can be proved similar to Lemma 3, and is thus omitted here for brevity. ∎

Lemma 4 indicates that the BS shall use up the allocated global model downloading time for saving the energy consumption of the devices.

Lemma 5.

The optimal solution to the problem (23) satisfies the constraint (18j) with strict equality for a given j=minkRkup.𝑗subscript𝑘superscriptsubscript𝑅𝑘upj=\min_{k}R_{k}^{\mathrm{up}}.italic_j = roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT .

Proof:

It can be proved similar to Lemma 3, and is thus omitted here for brevity. ∎

Lemma 5 indicates that only the device with the minimum local model uploading rate uses up the allocated local model uploading time, while the other devices with higher local model uploading rate shall wait until the end of the local model uploading time even though they finish the local model uploading early.

Theorem 1.

The optimal 𝐓𝐓\mathbf{T}bold_T for the problem (23) given 𝐃gensuperscript𝐃gen\mathbf{D}^{\mathrm{gen}}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT is given by

Tdown=superscript𝑇downabsent\displaystyle T^{\mathrm{down}}=italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT = ΓmaxkDkgenRkdown,Γsubscript𝑘superscriptsubscript𝐷𝑘gensuperscriptsubscript𝑅𝑘down\displaystyle\Gamma\max_{k}\frac{D_{k}^{\mathrm{gen}}}{R_{k}^{\mathrm{down}}},roman_Γ roman_max start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG , (25)
Tbr=superscript𝑇brabsent\displaystyle T^{\mathrm{br}}=italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT = DmodBlog2(1+h1Pσ2B),superscript𝐷mod𝐵subscript21subscript1𝑃superscript𝜎2𝐵\displaystyle\frac{D^{\mathrm{mod}}}{B\log_{2}\left(1+\frac{h_{1}P}{\sigma^{2}% B}\right)},divide start_ARG italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT end_ARG start_ARG italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_P end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B end_ARG ) end_ARG , (26)
Tup=superscript𝑇upabsent\displaystyle T^{\mathrm{up}}=italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT = DmodminkRkup,superscript𝐷modsubscript𝑘superscriptsubscript𝑅𝑘up\displaystyle\frac{D^{\mathrm{mod}}}{\min_{k}R_{k}^{\mathrm{up}}},divide start_ARG italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT end_ARG start_ARG roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT end_ARG , (27)
Tloc=superscript𝑇locabsent\displaystyle T^{\mathrm{loc}}=italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT = Tˇlocϱk=1KDkgenNΓNmaxkDkgenRkdown,superscriptˇ𝑇locitalic-ϱsuperscriptsubscript𝑘1𝐾superscriptsubscript𝐷𝑘gen𝑁Γ𝑁subscript𝑘superscriptsubscript𝐷𝑘gensuperscriptsubscript𝑅𝑘down\displaystyle\check{T}^{\mathrm{loc}}-\frac{\varrho\sum_{k=1}^{K}D_{k}^{% \mathrm{gen}}}{N}-\frac{\Gamma}{N}\max_{k}\frac{D_{k}^{\mathrm{gen}}}{R_{k}^{% \mathrm{down}}},overroman_ˇ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT - divide start_ARG italic_ϱ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_N end_ARG - divide start_ARG roman_Γ end_ARG start_ARG italic_N end_ARG roman_max start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG , (28)

where

Tˇloc=TmaxNDmodminkRkupDmodBlog2(1+h1Pσ2B).superscriptˇ𝑇locsuperscript𝑇max𝑁superscript𝐷modsubscript𝑘superscriptsubscript𝑅𝑘upsuperscript𝐷mod𝐵subscript21subscript1𝑃superscript𝜎2𝐵\check{T}^{\mathrm{loc}}=\frac{T^{\mathrm{max}}}{N}-\frac{D^{\mathrm{mod}}}{% \min_{k}R_{k}^{\mathrm{up}}}-\frac{D^{\mathrm{mod}}}{B\log_{2}\left(1+\frac{h_% {1}P}{\sigma^{2}B}\right)}.overroman_ˇ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT = divide start_ARG italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG start_ARG italic_N end_ARG - divide start_ARG italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT end_ARG start_ARG roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT end_ARG - divide start_ARG italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT end_ARG start_ARG italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_P end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B end_ARG ) end_ARG . (29)
Proof:

The optimal 𝐓𝐓\mathbf{T}bold_T can be easily derived based on Lemmas 2-5. Thus the details are omitted here for brevity. ∎

Using Theorem 1, the problem (23) is simplified as

min𝐃gensubscriptsuperscript𝐃gen\displaystyle\min_{\mathbf{D}^{\mathrm{gen}}}roman_min start_POSTSUBSCRIPT bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_POSTSUBSCRIPT k=1K(Dkloc+Dkgen)βsuperscriptsubscript𝑘1𝐾superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen𝛽\displaystyle\>\sum_{k=1}^{K}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen}}% \right)^{-\beta}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT (30a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . Dkgen+ΓfkmaxNwτmaxjDjgenRjdown+fkmaxϱk=1KDkgenNwτsuperscriptsubscript𝐷𝑘genΓsuperscriptsubscript𝑓𝑘max𝑁𝑤𝜏subscript𝑗superscriptsubscript𝐷𝑗gensuperscriptsubscript𝑅𝑗downsuperscriptsubscript𝑓𝑘maxitalic-ϱsuperscriptsubscript𝑘1𝐾superscriptsubscript𝐷𝑘gen𝑁𝑤𝜏\displaystyle\>D_{k}^{\mathrm{gen}}+\frac{\Gamma f_{k}^{\mathrm{max}}}{Nw\tau}% \max_{j}\frac{D_{j}^{\mathrm{gen}}}{R_{j}^{\mathrm{down}}}+\frac{f_{k}^{% \mathrm{max}}\varrho\sum_{k=1}^{K}D_{k}^{\mathrm{gen}}}{Nw\tau}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT + divide start_ARG roman_Γ italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG start_ARG italic_N italic_w italic_τ end_ARG roman_max start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT divide start_ARG italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_ϱ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_N italic_w italic_τ end_ARG
fkmaxTˇlocwτDkloc,k,absentsuperscriptsubscript𝑓𝑘maxsuperscriptˇ𝑇loc𝑤𝜏superscriptsubscript𝐷𝑘locfor-all𝑘\displaystyle\>\leq\frac{f_{k}^{\mathrm{max}}\check{T}^{\mathrm{loc}}}{w\tau}-% D_{k}^{\mathrm{loc}},\forall k,≤ divide start_ARG italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT overroman_ˇ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT end_ARG start_ARG italic_w italic_τ end_ARG - italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , ∀ italic_k , (30b)
ϖkw3τ3(Dkloc+Dkgen)3EkmaxEkup+ΓNmaxjDjgenRjdownsubscriptitalic-ϖ𝑘superscript𝑤3superscript𝜏3superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen3superscriptsubscript𝐸𝑘maxsuperscriptsubscript𝐸𝑘upΓ𝑁subscript𝑗superscriptsubscript𝐷𝑗gensuperscriptsubscript𝑅𝑗down\displaystyle\>\sqrt{\!\frac{\varpi_{k}w^{3}\tau^{3}\left(D_{k}^{\mathrm{loc}}% +D_{k}^{\mathrm{gen}}\right)^{3}}{E_{k}^{\mathrm{max}}-E_{k}^{\mathrm{up}}}}+% \frac{\Gamma}{N}\max_{j}\frac{D_{j}^{\mathrm{gen}}}{R_{j}^{\mathrm{down}}}square-root start_ARG divide start_ARG italic_ϖ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_w start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_τ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT - italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT end_ARG end_ARG + divide start_ARG roman_Γ end_ARG start_ARG italic_N end_ARG roman_max start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT divide start_ARG italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG
+ϱk=1KDkgenNTˇloc,k,italic-ϱsuperscriptsubscript𝑘1𝐾superscriptsubscript𝐷𝑘gen𝑁superscriptˇ𝑇locfor-all𝑘\displaystyle\>+\frac{\varrho\sum_{k=1}^{K}D_{k}^{\mathrm{gen}}}{N}\leq\!% \check{T}^{\mathrm{loc}},\forall k,+ divide start_ARG italic_ϱ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_N end_ARG ≤ overroman_ˇ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , ∀ italic_k , (30c)
(18b),(18m).italic-(18bitalic-)italic-(18mitalic-)\displaystyle\>\eqref{eq:p1-c1},\eqref{eq:p1-c13}.italic_( italic_) , italic_( italic_) .

It can be verified that the objective function in (30a) is a convex function of 𝐃gensuperscript𝐃gen\mathbf{D}^{\mathrm{gen}}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT and all the constraints are convex or linear with respect to 𝐃gensuperscript𝐃gen\mathbf{D}^{\mathrm{gen}}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT. Thus, the problem (30) is a convex optimization problem and can be optimally solved via CVX [28].

The proposed algorithm to optimally optimize 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T is summarized in Algorithm 1. Since the complexity of solving the problem (30) is 𝒪(K3)𝒪superscript𝐾3\mathcal{O}(K^{3})caligraphic_O ( italic_K start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) [29], the complexity of Algorithm 1 is 𝒪(K3).𝒪superscript𝐾3\mathcal{O}(K^{3}).caligraphic_O ( italic_K start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) .

1:  Solve the problem (30) using CVX to obtain 𝐃gensuperscript𝐃gen\mathbf{D}^{\mathrm{gen}}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT.
2:  Obtain the closed-form expressions for Tdown,superscript𝑇downT^{\mathrm{down}},italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT , Tbr,superscript𝑇brT^{\mathrm{br}},italic_T start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , Tup,superscript𝑇upT^{\mathrm{up}},italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT , and Tlocsuperscript𝑇locT^{\mathrm{loc}}italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT from (25), (26), (27), and (28), respectively.
Algorithm 1 Proposed algorithm to optimally optimize 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T based on the convex optimization.

V Efficient Algorithm for Optimally Optimizing 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π

In this section, the subproblem of optimizing 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π with given 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T (the right part of Fig. 2) is investigated. It can be shown that the constraints related to 𝐩𝐩\mathbf{p}bold_p and the constraints related to 𝐪,𝝅𝐪𝝅\mathbf{q},\boldsymbol{\pi}bold_q , bold_italic_π are different. Therefore, the subproblem of optimizing 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π can be decoupled into two problems, given by

FindFind\displaystyle\mathrm{Find}roman_Find 𝐩𝐩\displaystyle\>\mathbf{p}bold_p (31)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (18f),(18g),(18n),italic-(18fitalic-)italic-(18gitalic-)italic-(18nitalic-)\displaystyle\>\eqref{eq:p1-c6},\eqref{eq:p1-c7},\eqref{eq:p1-c14},italic_( italic_) , italic_( italic_) , italic_( italic_) ,

and

FindFind\displaystyle\mathrm{Find}roman_Find 𝐪,𝝅𝐪𝝅\displaystyle\>\mathbf{q},\boldsymbol{\pi}bold_q , bold_italic_π (32)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (18i),(18j),(18l),(18o),(21).italic-(18iitalic-)italic-(18jitalic-)italic-(18litalic-)italic-(18oitalic-)italic-(21italic-)\displaystyle\>\eqref{eq:p1-c9},\eqref{eq:p1-c10},\eqref{eq:p1-c12},\eqref{eq:% p1-c15},\eqref{eq:new-p1-c11}.italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) .

Both the problems in (31) and (32) try to find feasible solutions, since the objective function in the original problem (22) does not depend on the optimization variables 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π. However, in order to find feasible solutions which can lead to more favorable results such that the subproblem (23) has higher objective function value, the objective functions in the problems (31) and (32) are modified respectively as

max𝐩subscript𝐩\displaystyle\max_{\mathbf{p}}roman_max start_POSTSUBSCRIPT bold_p end_POSTSUBSCRIPT minkRkdownDkgensubscript𝑘superscriptsubscript𝑅𝑘downsuperscriptsubscript𝐷𝑘gen\displaystyle\>\min_{k}\frac{R_{k}^{\mathrm{down}}}{D_{k}^{\mathrm{gen}}}roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG (33)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (18f),(18g),(18n),italic-(18fitalic-)italic-(18gitalic-)italic-(18nitalic-)\displaystyle\>\eqref{eq:p1-c6},\eqref{eq:p1-c7},\eqref{eq:p1-c14},italic_( italic_) , italic_( italic_) , italic_( italic_) ,

and

max𝐪,𝝅subscript𝐪𝝅\displaystyle\max_{\mathbf{q},\boldsymbol{\pi}}roman_max start_POSTSUBSCRIPT bold_q , bold_italic_π end_POSTSUBSCRIPT minkRkupsubscript𝑘superscriptsubscript𝑅𝑘up\displaystyle\>\min_{k}R_{k}^{\mathrm{up}}roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT (34)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (18i),(18j),(18l),(18o),(21).italic-(18iitalic-)italic-(18jitalic-)italic-(18litalic-)italic-(18oitalic-)italic-(21italic-)\displaystyle\>\eqref{eq:p1-c9},\eqref{eq:p1-c10},\eqref{eq:p1-c12},\eqref{eq:% p1-c15},\eqref{eq:new-p1-c11}.italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) , italic_( italic_) .

It is shown that by selecting the objective function as expressed in (33), the constraint (18f) can be more relaxed in the subproblem (23), which can lead to higher objective function value. It is also shown that by selecting the objective function as expressed in (34), the constraint (18j) can be more relaxed in the subproblem (23), which can also lead to higher objective function value. In what follows, we first solve the problem (33), and then solve the problem (34).

V-A Proposed algorithm for optimally optimizing 𝐩𝐩\mathbf{p}bold_p

By introducing an auxiliary variable η=minkRkdownDkgen𝜂subscript𝑘superscriptsubscript𝑅𝑘downsuperscriptsubscript𝐷𝑘gen\eta=\min_{k}\frac{R_{k}^{\mathrm{down}}}{D_{k}^{\mathrm{gen}}}italic_η = roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG, the problem (33) can be reformulated as

max𝐩,ηsubscript𝐩𝜂\displaystyle\max_{\mathbf{p},\eta}roman_max start_POSTSUBSCRIPT bold_p , italic_η end_POSTSUBSCRIPT η𝜂\displaystyle\>\etaitalic_η (35a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . RkdownηDkgen,k,superscriptsubscript𝑅𝑘down𝜂superscriptsubscript𝐷𝑘genfor-all𝑘\displaystyle\>R_{k}^{\mathrm{down}}\geq\eta D_{k}^{\mathrm{gen}},\forall k,italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT ≥ italic_η italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , ∀ italic_k , (35b)
(18f),(18g),(18n).italic-(18fitalic-)italic-(18gitalic-)italic-(18nitalic-)\displaystyle\>\eqref{eq:p1-c6},\eqref{eq:p1-c7},\eqref{eq:p1-c14}.italic_( italic_) , italic_( italic_) , italic_( italic_) .

It is noted that the constraint (18f) is inactive as long as the problem is feasible, i.e., ηΓTdown.𝜂Γsuperscript𝑇down\eta\geq\frac{\Gamma}{T^{\mathrm{down}}}.italic_η ≥ divide start_ARG roman_Γ end_ARG start_ARG italic_T start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG . Next, we present an important property of the optimal solution to the problem (35).

Lemma 6.

The optimal solution to the problem (35) satisfies the constraint (35b) with strict equality.

Proof:

See Appendix -D

From Lemma 6, the optimal 𝐩𝐩\mathbf{p}bold_p given η𝜂\etaitalic_η satisfies the following equalities

Blog2(1+hkpkσ2B+hkj>kpj)=ηDkgen,k,𝐵subscript21subscript𝑘subscript𝑝𝑘superscript𝜎2𝐵subscript𝑘subscript𝑗𝑘subscript𝑝𝑗𝜂superscriptsubscript𝐷𝑘genfor-all𝑘B\log_{2}\left(1+\frac{h_{k}p_{k}}{\sigma^{2}B+h_{k}\sum_{j>k}p_{j}}\right)=% \eta D_{k}^{\mathrm{gen}},\forall k,italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j > italic_k end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) = italic_η italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , ∀ italic_k , (36)
Theorem 2.

The optimal pk,k=1,,Kformulae-sequencesubscript𝑝𝑘𝑘1𝐾p_{k},k=1,\ldots,Kitalic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , … , italic_K for the problem (35) given η𝜂\etaitalic_η can be sequentially determined from pKsubscript𝑝𝐾p_{K}italic_p start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT to p1subscript𝑝1p_{1}italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT according to

pk=(2ηDkgenB1)(σ2Bhk+j>kpj),k.subscript𝑝𝑘superscript2𝜂superscriptsubscript𝐷𝑘gen𝐵1superscript𝜎2𝐵subscript𝑘subscript𝑗𝑘subscript𝑝𝑗for-all𝑘p_{k}=\left(2^{\frac{\eta D_{k}^{\mathrm{gen}}}{B}}-1\right)\left(\frac{\sigma% ^{2}B}{h_{k}}+\sum_{j>k}p_{j}\right),\forall k.italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_η italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B end_ARG start_ARG italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT italic_j > italic_k end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , ∀ italic_k . (37)
Proof:

This theorem is a direct result from Lemma 6. By rewriting (36), we can have (37), where the optimal pksubscript𝑝𝑘p_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is determined by pj,j>ksubscript𝑝𝑗𝑗𝑘p_{j},j>kitalic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_j > italic_k. This completes the proof. ∎

After obtaining the optimal 𝐩𝐩\mathbf{p}bold_p, what remains is optimizing η𝜂\etaitalic_η in the problem (35). From (37), it shows that pksubscript𝑝𝑘p_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is an increasing function of η.𝜂\eta.italic_η . Therefore, if η𝜂\etaitalic_η is larger than the optimal η,𝜂\eta,italic_η , the constraint (18g) will be violated. Accordingly, the optimal η𝜂\etaitalic_η for the problem (35) can be derived by a bisection search of η𝜂\etaitalic_η, where η𝜂\etaitalic_η is increased if the constraint (18g) with given η𝜂\etaitalic_η is obeyed, and is decreased if the constraint (18g) with given η𝜂\etaitalic_η is violated.

The proposed algorithm to optimize 𝐩𝐩\mathbf{p}bold_p is summarized in Algorithm 2. Since the bisection search method converges in finite number of iterations which is independent of the number of devices [30], the complexity of Algorithm 2 is merely linear in the number of devices, i.e., 𝒪(K).𝒪𝐾\mathcal{O}(K).caligraphic_O ( italic_K ) .

1:  Initialize ηminsubscript𝜂𝑚𝑖𝑛\eta_{min}italic_η start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT and ηmaxsubscript𝜂𝑚𝑎𝑥\eta_{max}italic_η start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT.
2:  repeat
3:     η=ηmin+ηmax2.𝜂subscript𝜂𝑚𝑖𝑛subscript𝜂𝑚𝑎𝑥2\eta=\frac{\eta_{min}+\eta_{max}}{2}.italic_η = divide start_ARG italic_η start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG .
4:     for all k=K𝑘𝐾k=Kitalic_k = italic_K to 1111 do
5:        Obtain pksubscript𝑝𝑘p_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT from (37).
6:     end for
7:     if the constraint (18g) is obeyed then
8:        ηmin=η.subscript𝜂𝑚𝑖𝑛𝜂\eta_{min}=\eta.italic_η start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT = italic_η .
9:     else
10:        ηmax=η.subscript𝜂𝑚𝑎𝑥𝜂\eta_{max}=\eta.italic_η start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT = italic_η .
11:     end if
12:  until η𝜂\etaitalic_η converges.
Algorithm 2 Proposed algorithm to obtain the optimal 𝐩𝐩\mathbf{p}bold_p.

V-B Proposed algorithm for optimally optimizing 𝐪,𝛑𝐪𝛑\mathbf{q},\boldsymbol{\pi}bold_q , bold_italic_π

By integrating the constraints (18i), (18o) and (21), it follows that

0qkqkmax,k,formulae-sequence0subscript𝑞𝑘superscriptsubscript𝑞𝑘maxfor-all𝑘0\leq q_{k}\leq q_{k}^{\mathrm{max}},\forall k,0 ≤ italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , ∀ italic_k , (38)

where

qkmax=min(Qk,EkmaxTupϖkw3τ3(Dkloc+Dkgen)3(Tloc)2Tup).superscriptsubscript𝑞𝑘maxsubscript𝑄𝑘superscriptsubscript𝐸𝑘maxsuperscript𝑇upsubscriptitalic-ϖ𝑘superscript𝑤3superscript𝜏3superscriptsuperscriptsubscript𝐷𝑘locsuperscriptsubscript𝐷𝑘gen3superscriptsuperscript𝑇loc2superscript𝑇upq_{k}^{\mathrm{max}}=\min\left(Q_{k},\frac{E_{k}^{\mathrm{max}}}{T^{\mathrm{up% }}}-\frac{\varpi_{k}w^{3}\tau^{3}\left(D_{k}^{\mathrm{loc}}+D_{k}^{\mathrm{gen% }}\right)^{3}}{(T^{\mathrm{loc}})^{2}T^{\mathrm{up}}}\right).italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT = roman_min ( italic_Q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , divide start_ARG italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG start_ARG italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT end_ARG - divide start_ARG italic_ϖ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_w start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_τ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT end_ARG ) . (39)

Furthermore, it shows that the constraint (18j) is redundant given the objective function formulated in (34), as long as the problem (34) is feasible. Thus, the problem (34) is rewritten as

max𝐪,𝝅subscript𝐪𝝅\displaystyle\max_{\mathbf{q},\boldsymbol{\pi}}roman_max start_POSTSUBSCRIPT bold_q , bold_italic_π end_POSTSUBSCRIPT minkRkupsubscript𝑘superscriptsubscript𝑅𝑘up\displaystyle\>\min_{k}R_{k}^{\mathrm{up}}roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT (40)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (18l),(38).italic-(18litalic-)italic-(38italic-)\displaystyle\>\eqref{eq:p1-c12},\eqref{eq:p15-c1}.italic_( italic_) , italic_( italic_) .

By defining 𝐪={qk,k=1,,K}\mathbf{q}^{{}^{\prime}}=\{q_{k}^{{}^{\prime}},k=1,\ldots,K\}bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = { italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , italic_k = 1 , … , italic_K }, where qk=qkqkmaxsuperscriptsubscript𝑞𝑘subscript𝑞𝑘superscriptsubscript𝑞𝑘maxq_{k}^{{}^{\prime}}=\frac{q_{k}}{q_{k}^{\mathrm{max}}}italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = divide start_ARG italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG, the problem (40) can be reformulated as

max𝐪,𝝅subscriptsuperscript𝐪𝝅\displaystyle\max_{\mathbf{q}^{{}^{\prime}},\boldsymbol{\pi}}roman_max start_POSTSUBSCRIPT bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , bold_italic_π end_POSTSUBSCRIPT minkRkupsubscript𝑘superscriptsubscript𝑅𝑘up\displaystyle\>\min_{k}R_{k}^{\mathrm{up}}roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT (41a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t .  0qk1,k,formulae-sequence 0superscriptsubscript𝑞𝑘1for-all𝑘\displaystyle\>0\leq q_{k}^{{}^{\prime}}\leq 1,\forall k,0 ≤ italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ≤ 1 , ∀ italic_k , (41b)
(18l),italic-(18litalic-)\displaystyle\>\eqref{eq:p1-c12},italic_( italic_) ,

where Rkupsuperscriptsubscript𝑅𝑘upR_{k}^{\mathrm{up}}italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT is rewritten as

Rkup=Blog2(1+gkqkmaxqkσ2B+j=1,πj>πkgjqjmaxqj).superscriptsubscript𝑅𝑘up𝐵subscript21subscript𝑔𝑘superscriptsubscript𝑞𝑘maxsuperscriptsubscript𝑞𝑘superscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋𝑘subscript𝑔𝑗superscriptsubscript𝑞𝑗maxsuperscriptsubscript𝑞𝑗R_{k}^{\mathrm{up}}=B\log_{2}\left(1+\frac{g_{k}q_{k}^{\mathrm{max}}q_{k}^{{}^% {\prime}}}{\sigma^{2}B+\sum_{j=1,\pi_{j}>\pi_{k}}g_{j}q_{j}^{\mathrm{max}}q_{j% }^{{}^{\prime}}}\right).italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT = italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) . (42)

By introducing an auxiliary variable θ=minkRkup𝜃subscript𝑘superscriptsubscript𝑅𝑘up\theta=\min_{k}R_{k}^{\mathrm{up}}italic_θ = roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT, the problem (41) can be reformulated as

max𝐪,𝝅,θsubscriptsuperscript𝐪𝝅𝜃\displaystyle\max_{\mathbf{q}^{{}^{\prime}},\boldsymbol{\pi},\theta}roman_max start_POSTSUBSCRIPT bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , bold_italic_π , italic_θ end_POSTSUBSCRIPT θ𝜃\displaystyle\>\thetaitalic_θ (43a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . Rkupθ,k,superscriptsubscript𝑅𝑘up𝜃for-all𝑘\displaystyle\>R_{k}^{\mathrm{up}}\geq\theta,\forall k,italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ≥ italic_θ , ∀ italic_k , (43b)
(18l),(41b).italic-(18litalic-)italic-(41bitalic-)\displaystyle\>\eqref{eq:p1-c12},\eqref{eq:p16-c1}.italic_( italic_) , italic_( italic_) .

Due to the discrete constraint (18l), it is intractable to find the optimal solution to the problem (43a) by standard optimization methods. In what follows, we derive the optimal SIC decoding order 𝝅𝝅\boldsymbol{\pi}bold_italic_π by exploring the problem structure.

Theorem 3.

The optimal 𝛑𝛑\boldsymbol{\pi}bold_italic_π for the problem (43) is in the descending order of gkqkmax.subscript𝑔𝑘superscriptsubscript𝑞𝑘maxg_{k}q_{k}^{\mathrm{max}}.italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT .

Proof:

See Appendix -E. ∎

From Theorem 3, the optimal decoding order 𝝅𝝅\boldsymbol{\pi}bold_italic_π for the problem (43) can be obtained, and the problem (43) is simplified as

max𝐪,θsubscriptsuperscript𝐪𝜃\displaystyle\max_{\mathbf{q}^{{}^{\prime}},\theta}roman_max start_POSTSUBSCRIPT bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , italic_θ end_POSTSUBSCRIPT θ𝜃\displaystyle\>\thetaitalic_θ (44)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (41b),(43b).italic-(41bitalic-)italic-(43bitalic-)\displaystyle\>\eqref{eq:p16-c1},\eqref{eq:p17-c1}.italic_( italic_) , italic_( italic_) .

Since the above problem given θ𝜃\thetaitalic_θ is feasible only when θ𝜃\thetaitalic_θ is smaller than or equal to the optimal θ𝜃\thetaitalic_θ, the optimal θ𝜃\thetaitalic_θ can be obtained by a simple bisection search of θ𝜃\thetaitalic_θ, where in each search, the problem of optimizing 𝐪superscript𝐪\mathbf{q}^{{}^{\prime}}bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT given θ𝜃\thetaitalic_θ is investigated as

find 𝐪superscript𝐪\displaystyle\>\mathbf{q}^{{}^{\prime}}bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT (45)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . (41b),(43b).italic-(41bitalic-)italic-(43bitalic-)\displaystyle\>\eqref{eq:p16-c1},\eqref{eq:p17-c1}.italic_( italic_) , italic_( italic_) .
Lemma 7.

A feasible solution to the problem (45) satisfies the constraint (43b) with strict equality.

Proof:

See Appendix -F. ∎

From Lemma 7, the feasible solution to the problem (45) satisfies the following equalities

Blog2(1+gkqkmaxqkσ2B+j=1,πj>πkgjqjmaxqj)=θ,k.𝐵subscript21subscript𝑔𝑘superscriptsubscript𝑞𝑘maxsuperscriptsubscript𝑞𝑘superscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋𝑘subscript𝑔𝑗superscriptsubscript𝑞𝑗maxsuperscriptsubscript𝑞𝑗𝜃for-all𝑘\displaystyle B\log_{2}\left(1+\frac{g_{k}q_{k}^{\mathrm{max}}q_{k}^{{}^{% \prime}}}{\sigma^{2}B+\sum_{j=1,\pi_{j}>\pi_{k}}g_{j}q_{j}^{\mathrm{max}}q_{j}% ^{{}^{\prime}}}\right)=\theta,\forall k.italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) = italic_θ , ∀ italic_k . (46)
Theorem 4.

The feasible qk,k=1,,Kformulae-sequencesuperscriptsubscript𝑞𝑘𝑘1𝐾q_{k}^{{}^{\prime}},k=1,\ldots,Kitalic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , italic_k = 1 , … , italic_K for the problem (45) can be recursively obtained from the last decoding device to the first decoding device according to

qk=2θB1gkqkmax(σ2B+j=1,πj>πkgjqjmaxqj),k.superscriptsubscript𝑞𝑘superscript2𝜃𝐵1subscript𝑔𝑘superscriptsubscript𝑞𝑘maxsuperscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋𝑘subscript𝑔𝑗superscriptsubscript𝑞𝑗maxsuperscriptsubscript𝑞𝑗for-all𝑘q_{k}^{{}^{\prime}}=\frac{2^{\frac{\theta}{B}}-1}{g_{k}q_{k}^{\mathrm{max}}}% \left(\sigma^{2}B+\sum_{j=1,\pi_{j}>\pi_{k}}g_{j}q_{j}^{\mathrm{max}}q_{j}^{{}% ^{\prime}}\right),\forall k.italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = divide start_ARG 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 end_ARG start_ARG italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) , ∀ italic_k . (47)
Proof:

This theorem is a direct result from Lemma 7. After rewritting (46), we can have (47), which shows that qksuperscriptsubscript𝑞𝑘q_{k}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT depends only on qj,πj>πksuperscriptsubscript𝑞𝑗subscript𝜋𝑗subscript𝜋𝑘q_{j}^{{}^{\prime}},\pi_{j}>\pi_{k}italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. This completes the proof. ∎

After 𝐪superscript𝐪\mathbf{q}^{{}^{\prime}}bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT has been obtained from (47), we can check whether the constraint (41b) is satisfied by the obtained 𝐪superscript𝐪\mathbf{q}^{{}^{\prime}}bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT. Specifically, the problem (45) with the given θ𝜃\thetaitalic_θ is infeasible if the constraint (41b) is not satisfied, and is feasible otherwise.

The proposed algorithm to optimally optimize 𝐪,𝝅𝐪𝝅\mathbf{q},\boldsymbol{\pi}bold_q , bold_italic_π is summarized in Algorithm 3. Since the optimal decoding order is derived in closed form and the convergence of the bisection search method does not depend on the number of devices [30], the complexity of Algorithm 3 is only linear in the number of devices, i.e., 𝒪(K).𝒪𝐾\mathcal{O}(K).caligraphic_O ( italic_K ) .

1:  Obtain the optimal decoding order 𝝅𝝅\boldsymbol{\pi}bold_italic_π in the descending order of gkqkmax.subscript𝑔𝑘superscriptsubscript𝑞𝑘maxg_{k}q_{k}^{\mathrm{max}}.italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT .
2:  repeat
3:     θ=θmin+θmax2.𝜃subscript𝜃𝑚𝑖𝑛subscript𝜃𝑚𝑎𝑥2\theta=\frac{\theta_{min}+\theta_{max}}{2}.italic_θ = divide start_ARG italic_θ start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG .
4:     Obtain 𝐪superscript𝐪\mathbf{q}^{{}^{\prime}}bold_q start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT recursively from the last decoding device to the first decoding device from (47).
5:     if the constraint (41b) is obeyed then
6:        θmin=θ.subscript𝜃𝑚𝑖𝑛𝜃\theta_{min}=\theta.italic_θ start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT = italic_θ .
7:     else
8:        θmax=θ.subscript𝜃𝑚𝑎𝑥𝜃\theta_{max}=\theta.italic_θ start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT = italic_θ .
9:     end if
10:  until θ𝜃\thetaitalic_θ converges.
11:  Obtain qk=qkmaxqk,k=1,,K.formulae-sequencesubscript𝑞𝑘superscriptsubscript𝑞𝑘maxsuperscriptsubscript𝑞𝑘𝑘1𝐾q_{k}=q_{k}^{\mathrm{max}}q_{k}^{{}^{\prime}},k=1,\ldots,K.italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , italic_k = 1 , … , italic_K .
Algorithm 3 Proposed algorithm to obtain the optimal 𝐪,𝝅𝐪𝝅\mathbf{q},\boldsymbol{\pi}bold_q , bold_italic_π.

V-C Convergence and Complexity Analysis of the Overall Proposed Algorithm

1:  Initialize 𝐃gen,𝐓,𝐩,𝐪,𝝅superscript𝐃gen𝐓𝐩𝐪𝝅\mathbf{D}^{\mathrm{gen}},\mathbf{T},\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T , bold_p , bold_q , bold_italic_π.
2:  repeat
3:     Optimizes 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T with given 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π using Algorithm 1.
4:     Optimizes 𝐩𝐩\mathbf{p}bold_p with given 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T using Algorithm 2.
5:     Optimizes 𝐪,𝝅𝐪𝝅\mathbf{q},\boldsymbol{\pi}bold_q , bold_italic_π with given 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T using Algorithm 3.
6:  until the objective function value in (22a) converges.
7:  Obtain 𝐟𝐟\mathbf{f}bold_f from (19).
Algorithm 4 Overall proposed algorithm for solving the problem (18) based on the BCD method.

The overall proposed algorithm for solving the problem (18) is summarized in Algorithm 4. In this subsection, we provide convergence and complexity analysis of the overall proposed algorithm.

The convergence of the proposed algorithm is affected by the BCD method used for iteratively solving the problem (22), i.e., the algorithm 1 for optimizing 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T and the algorithms 2 and 3 for optimizing 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π are iteratively performed until convergence. The following proposition presents the convergence analysis of the adopted BCD method.

Proposition 1.

The BCD method in Algorithm 4 used for iteratively solving the problem (22) converges to a local optimal solution the problem (22).

Proof:

See Appendix -G. ∎

The above Proposition has shown the convergence of the proposed algorithm to a locally optimal solution. Then, we provide the complexity analysis. Note that the BCD method converges in finite number of iterations which is independent of the number of devices. Thus, based on the complexity analysis in Section IV, Section V-A and Section V-B, the total complexity of Algorithm 4 is 𝒪(K3+2K)𝒪superscript𝐾32𝐾\mathcal{O}(K^{3}+2K)caligraphic_O ( italic_K start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + 2 italic_K ), which is only polynomial in the number of devices.

VI Simulation Results

This section provides illustrative simulation results to demonstrate the effectiveness of the proposed NOMA+AIGC-enhanced WFL scheme. Unless otherwise noted, the simulation parameters are set as follows. The setting of the AIGC server is similar to [26], where eight RTX A5000 GPUs are equipped by the AIGC server and synthesizing one data sample requires approximately 0.06460.06460.06460.0646 s, i.e., ϱ=0.0646italic-ϱ0.0646\varrho=0.0646italic_ϱ = 0.0646. The number of devices is K=15𝐾15K=15italic_K = 15, where the devices are randomly distributed around the BS within the distance range [150,300]150300[150,300][ 150 , 300 ] m. The wireless channels are assumed to follow Rayleigh fading, where the channel gains hksubscript𝑘h_{k}italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and gksubscript𝑔𝑘g_{k}italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT are modeled as hk=h^kh~ksubscript𝑘subscript^𝑘subscript~𝑘h_{k}=\hat{h}_{k}\tilde{h}_{k}italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over~ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and gk=g^kg~ksubscript𝑔𝑘subscript^𝑔𝑘subscript~𝑔𝑘g_{k}=\hat{g}_{k}\tilde{g}_{k}italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, respectively. Specifically, the h^ksubscript^𝑘\hat{h}_{k}over^ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (or g^ksubscript^𝑔𝑘\hat{g}_{k}over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT) is the mean value of hksubscript𝑘h_{k}italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (or gksubscript𝑔𝑘g_{k}italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT) and is modeled as 128.1+37.6log10(d)128.137.6subscript10𝑑128.1+37.6\log_{10}(d)128.1 + 37.6 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_d ) in dB [31], where d𝑑ditalic_d is the distance in km. The h~ksubscript~𝑘\tilde{h}_{k}over~ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (or g~k)\tilde{g}_{k})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) is an exponentially distributed random variable with unit mean. The Dklocsuperscriptsubscript𝐷𝑘locD_{k}^{\mathrm{loc}}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT, Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT, Dmod,superscript𝐷modD^{\mathrm{mod}},italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT , w,𝑤w,italic_w , fkmaxsuperscriptsubscript𝑓𝑘maxf_{k}^{\mathrm{max}}italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT are assumed to be uniformly distributed within [300,500]300500[300,500][ 300 , 500 ] samples, [3000,5000]30005000[3000,5000][ 3000 , 5000 ] samples, [1.5,2.5]1.52.5[1.5,2.5][ 1.5 , 2.5 ] Mbits, [1,2]×10612superscript106[1,2]\times 10^{6}[ 1 , 2 ] × 10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT cycles, and [1,2]12[1,2][ 1 , 2 ] GHz, respectively. In addition, we set B=1𝐵1B=1italic_B = 1 MHz, σ2=160superscript𝜎2160\sigma^{2}=-160italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = - 160 dBm/Hz, N=100,𝑁100N=100,italic_N = 100 , Tmax=900superscript𝑇max900T^{\mathrm{max}}=900italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT = 900 s, τ=1𝜏1\tau=1italic_τ = 1, ζ=50,𝜁50\zeta=50,italic_ζ = 50 , α=3.819,𝛼3.819\alpha=3.819,italic_α = 3.819 , β=0.198,𝛽0.198\beta=0.198,italic_β = 0.198 , γ=0.231𝛾0.231\gamma=0.231italic_γ = 0.231 [26], P=35𝑃35P=35italic_P = 35 dBm, Qk=20subscript𝑄𝑘20Q_{k}=20italic_Q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 20 dBm, ϖ=1027italic-ϖsuperscript1027\varpi=10^{-27}italic_ϖ = 10 start_POSTSUPERSCRIPT - 27 end_POSTSUPERSCRIPT [32], Γ=20Γ20\Gamma=20roman_Γ = 20 Kbits, and Ekmax=1.2superscriptsubscript𝐸𝑘𝑚𝑎𝑥1.2E_{k}^{max}=1.2italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT = 1.2 Joule.

For the purpose of comparison, the following four schemes developed in existing literature or coined for benchmarking are considered:

  • FDMA+AIGC: In this scheme, FDMA is used for both synthetic data downloading and local model uploading as in [26]. Note that since [26] did not consider the synthetic data downloading phase, we modify the algorithm developed in [26] to suit our considered model.

  • TDMA+AIGC: In this scheme, TDMA is used for both synthetic data downloading and local model uploading.

  • NOMA-w/o-AIGC: In this scheme, synthetic data are not transmitted to the devices, and the proposed NOMA scheme is used for the local model uploading.

  • FDMA-w/o-AIGC: In this scheme, synthetic data are not transmitted to the devices, and FDMA is used for local model uploading as in [26].

Refer to caption
Figure 3: Impact of P𝑃Pitalic_P on the learning performance.

Fig. 3 illustrates the impact of the maximum transmit power of the BS P𝑃Pitalic_P on the learning performance. It shows that the learning performance of the proposed NOMA+AIGC scheme outperforms all the other schemes including FDMA+AIGC. This indicates that our proposed scheme is more effective in improving the WFL performance. Meanwhile, it shows that the learning performance with AIGC improves a lot compared to that without AIGC. This is because by downloading synthetic data from the server, more training data are available for local training to improve the training performance. It is also shown that TDMA+AIGC even underperforms the NOMA and the FDMA schemes without AIGC. This is due to the fact that the devices that are unscheduled for synthetic data downloading in TDMA shall wait and this significantly lowers the time resource utilization and makes the maximum latency constraint vulnerable.

As P𝑃Pitalic_P increases, Fig. 3 shows that the learning performance improves. The reason for this is that a higher P𝑃Pitalic_P can let the server transmit more synthetic data to the devices to improve the learning performance for the schemes with AIGC. Meanwhile, for the schemes without AIGC, a higher P𝑃Pitalic_P can let the BS broadcast the global model to the devices with less time, and thus it is easier to satisfy the maximum latency constraint and make the investigated problem feasible, since we set the learning error to one when the problem is infeasible. When P𝑃Pitalic_P is very large, it shows that the learning performance saturates. This is since the system will be restricted by the other factors such as the energy consumption constraint and the latency constraint, and no more synthetic data can be transmitted to the devices if P𝑃Pitalic_P is very high.

Refer to caption
Figure 4: Impact of Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT on the learning performance.

Fig. 4 illustrates the impact of the total amount of synthetic data Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT on the learning performance. It shows that the learning performance of the schemes without AIGC remains unchanged as Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT increases. This is since the schemes without AIGC do not use synthetic data for local training. It also shows that the schemes with AIGC achieve better learning performance as Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT increases. This is because higher amount of synthetic data available at the server can let the server transmit more synthetic data to the devices. Whereas when Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT is very large, a larger Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT cannot improve the learning performance further. This is since the energy consumption constraint and the latency constraint may restrict the learning performance and the synthetic data received by the devices cannot be increased even if the server has more synthetic data available. Furthermore, it shows that the learning performance of the proposed NOMA+AIGC scheme is the best among all the schemes and the performance improvement is more obvious when Dgensuperscript𝐷genD^{\mathrm{gen}}italic_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT is larger. This means that the proposed scheme is more effective in utilizing the synthetic data if more synthetic data are available at the server for improving the learning performance.

Refer to caption
Figure 5: Impact of Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT on the learning performance.

Fig. 5 illustrates the impact of the maximum latency constraint Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT on the learning performance. It shows that all the schemes can achieve better learning performance as Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT increases. This is because a larger value of Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT can let the server transmit more synthetic data to the devices for better learning performance of the schemes with AIGC, while the maximum latency constraint is easier to be satisfied for improving the learning performance of the schemes without AIGC. It also shows that when Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is very large, the schemes with/without AIGC converge to the same learning performance. This is since a very large Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT can let the server transmit as much data as possible under other constraints such as the energy consumption constraint, and the efficiencies of different multiple access schemes will have no impact on the learning performance. There is still a gap between the schemes with and without AIGC, since the schemes without AIGC are only easier to satisfy the problem constraints when Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is larger, and the learning performance will remain the same if the problem is feasible.

In addition, Fig. 5 shows that the performance improvement of the proposed scheme compared to FDMA+AIGC is more obvious when Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is smaller. This is due to the fact that the proposed scheme can utilize the time resource more efficiently than FDMA+AIGC. It also shows that the performance improvement of TDMA+AIGC with the increases of Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is more obvious than the other schemes. This is due to the fact that the time resource utilization efficiency of TDMA+AIGC is low and a larger Tmaxsuperscript𝑇maxT^{\mathrm{max}}italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT can efficiently offset the impact of low time resource utilization efficiency on the learning performance.

Refer to caption
Figure 6: Impact of Dmodsuperscript𝐷modD^{\mathrm{mod}}italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT on the learning performance.

Fig. 6 illustrates the impact of the model data size Dmodsuperscript𝐷modD^{\mathrm{mod}}italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT on the learning performance. It shows that the learning performance degrades as Dmodsuperscript𝐷modD^{\mathrm{mod}}italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT increases. This is because a larger Dmodsuperscript𝐷modD^{\mathrm{mod}}italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT requires more time for local model uploading and global model broadcasting, which leads to less time for synthetic data downloading and may also render the problem to be infeasible. It also shows that TDMA+AIGC quickly saturates to the worst learning performance. This is since TDMA+AIGC has the lowest time resource utilization efficiency, and a larger Dmodsuperscript𝐷modD^{\mathrm{mod}}italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT will make the required time for local model uploading and global model broadcasting much larger. Moreover, it shows that the proposed NOMA+AIGC scheme achieves the highest learning performance and the performance degradation due to the increase of Dmodsuperscript𝐷modD^{\mathrm{mod}}italic_D start_POSTSUPERSCRIPT roman_mod end_POSTSUPERSCRIPT is the least among all the schemes. This means that the proposed scheme can minimize the impact of the increased time needed for local model uploading and global model broadcasting to guarantee the learning performance.

Refer to caption
Figure 7: Impact of Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT on the learning performance.

Fig. 7 illustrates the impact of the energy budget Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT on the learning performance. It shows that the learning performance improves with the increase of Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT. This is because higher energy budget can let the device train with more local data for better learning performance, and can also let the device finish the local model uploading with less time consumption to satisfy the maximum latency constraint. It also shows that the learning performance saturates with the increase of Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT when Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is very large. This is since when Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is very large, the other constraints such as the transmit power constraint and the maximum latency constraint restrict the amount of synthetic data that can be downloaded from the server and become the main bottleneck of the learning performance.

Furthermore, Fig. 7 shows that as Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT increases, the learning performance of the schemes without AIGC saturates earlier than the schemes with AIGC except TDMA+AIGC. The reasons for this are explained as follows. A larger Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT can let the devices upload the local model to the server more quickly or train the local model in time such that the constraints of the problem for the schemes without AIGC can be satisfied more easily. However, since the training data cannot be increased, the learning performance of the schemes without AIGC is capped. While for the schemes with AIGC, besides the benefit mentioned above, more synthetic data can be downloaded from the server for local training with a higher Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT such that the learning performance can be further improved. It also shows that the learning performance improvement of the proposed NOMA+AIGC scheme compared to other schemes is obvious and does not change as Ekmaxsuperscriptsubscript𝐸𝑘maxE_{k}^{\mathrm{max}}italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT varies.

Refer to caption
Figure 8: Impact of K𝐾Kitalic_K on the learning performance.

Fig. 8 illustrates the impact of the number of devices K𝐾Kitalic_K on the learning performance. It shows that the learning performance degrades as K𝐾Kitalic_K increases. This is due to the fact that under the maximum latency constraint, more devices lead to a higher probability of violating this constraint, since the global model broadcasting rate decreases as K𝐾Kitalic_K increases and the required time increases. Besides, more devices also lead to less synthetic data available for each device, which will degrade the learning performance, since the objective function in (22a) is a convex function of the amount of the synthetic data. It also shows that the impact of K𝐾Kitalic_K on the learning performance of TDMA+AIGC is the severest. This is because less time resource can be allocated to each device for a larger K𝐾Kitalic_K by TDMA+AIGC, and thus the constraints of the problem are much harder to be satisfied. The proposed NOMA+AIGC scheme is shown to achieve the best learning performance, and such performance improvement is still impressive when K𝐾Kitalic_K is large.

VII Conclusions

In this paper, AIGC and NOMA are jointly adopted to enhance the WFL performance. The synthetic data distribution, two-way communication and computation resource allocation are jointly optimized to minimize the global learning error, under various system constraints. Specifically, an efficient low-complexity local optimal solution to the problem with partial closed-form results is proposed based on the BCD method and the analytical method. Extensive simulation results verify the superiority of the proposed NOMA+AIGC-enhanced scheme compared to the existing and benchmark schemes such as FDMA or TDMA+AIGC-enhanced schemes, under various system configurations. Our results have demonstrated the effectiveness of jointly combining NOMA and AIGC to enhance the WFL performance.

-A Proof of Lemma 1

Suppose that the optimal computing frequency allocation is 𝐟={fk,k=1,,K},\mathbf{f}^{*}=\{f_{k}^{*},k=1,\ldots,K\},bold_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = { italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_k = 1 , … , italic_K } , where the constraint (18e) is satisfied with strict inequality by fjsuperscriptsubscript𝑓𝑗f_{j}^{*}italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for a given j,𝑗j,italic_j , i.e., fj>wτ(Djloc+Djgen)Tloc.superscriptsubscript𝑓𝑗𝑤𝜏superscriptsubscript𝐷𝑗locsuperscriptsubscript𝐷𝑗gensuperscript𝑇locf_{j}^{*}>\frac{w\tau\left(D_{j}^{\mathrm{loc}}+D_{j}^{\mathrm{gen}}\right)}{T% ^{\mathrm{loc}}}.italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT > divide start_ARG italic_w italic_τ ( italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT end_ARG . Then, we consider another computing frequency allocation 𝐟={fk,k=1,,K}\mathbf{f}^{\star}=\{f_{k}^{\star},k=1,\ldots,K\}bold_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = { italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_k = 1 , … , italic_K } with fk=fk,kjformulae-sequencesuperscriptsubscript𝑓𝑘superscriptsubscript𝑓𝑘𝑘𝑗f_{k}^{\star}=f_{k}^{*},k\neq jitalic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_k ≠ italic_j and fj=wτ(Djloc+Djgen)Tlocsuperscriptsubscript𝑓𝑗𝑤𝜏superscriptsubscript𝐷𝑗locsuperscriptsubscript𝐷𝑗gensuperscript𝑇locf_{j}^{\star}=\frac{w\tau\left(D_{j}^{\mathrm{loc}}+D_{j}^{\mathrm{gen}}\right% )}{T^{\mathrm{loc}}}italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = divide start_ARG italic_w italic_τ ( italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT end_ARG. It is clear that fj<fjsuperscriptsubscript𝑓𝑗superscriptsubscript𝑓𝑗f_{j}^{\star}<f_{j}^{*}italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT < italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and Tjloc=Tlocsuperscriptsubscript𝑇𝑗locsuperscript𝑇locT_{j}^{\mathrm{loc}}=T^{\mathrm{loc}}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT = italic_T start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT with fj=fj.subscript𝑓𝑗superscriptsubscript𝑓𝑗f_{j}=f_{j}^{\star}.italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT . Since 𝐟𝐟\mathbf{f}bold_f is only related with the constraints (18d), (18e), (18k), and (18q) in the problem (18), it can be easily verified that 𝐟superscript𝐟\mathbf{f}^{\star}bold_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is a feasible solution to the problem (18) and achieves the same objective function value as the optimal solution 𝐟superscript𝐟\mathbf{f}^{*}bold_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. This means that the solution 𝐟superscript𝐟\mathbf{f}^{\star}bold_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is also optimal. From (7) and (16), it follows that Ejsubscript𝐸𝑗E_{j}italic_E start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT with fj=fjsubscript𝑓𝑗superscriptsubscript𝑓𝑗f_{j}=f_{j}^{\star}italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is smaller than that with fj=fjsubscript𝑓𝑗superscriptsubscript𝑓𝑗f_{j}=f_{j}^{*}italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. This means that 𝐟superscript𝐟\mathbf{f}^{\star}bold_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT has lower energy consumption than 𝐟superscript𝐟\mathbf{f}^{*}bold_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Thus, the optimal solution 𝐟superscript𝐟\mathbf{f}^{\star}bold_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is more desirable than 𝐟superscript𝐟\mathbf{f}^{*}bold_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. This completes the proof.

-B Proof of Lemma 2

Suppose that the optimal time allocation solution is 𝐓^={T^down,T^br,T^loc,T^up},^𝐓superscript^𝑇downsuperscript^𝑇brsuperscript^𝑇locsuperscript^𝑇up\hat{\mathbf{T}}=\{\hat{T}^{\mathrm{down}},\hat{T}^{\mathrm{br}},\hat{T}^{% \mathrm{loc}},\hat{T}^{\mathrm{up}}\},over^ start_ARG bold_T end_ARG = { over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT } , where Tsyn+T^down+N(T^br+T^loc+T^up)<Tmax.superscript𝑇synsuperscript^𝑇down𝑁superscript^𝑇brsuperscript^𝑇locsuperscript^𝑇upsuperscript𝑇maxT^{\mathrm{syn}}+\hat{T}^{\mathrm{down}}+N\left(\hat{T}^{\mathrm{br}}+\hat{T}^% {\mathrm{loc}}+\hat{T}^{\mathrm{up}}\right)<T^{\mathrm{max}}.italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT + over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT + italic_N ( over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT + over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ) < italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT . Then, we construct another time allocation solution 𝐓~={T~down,T~br,T~loc,T~up}~𝐓superscript~𝑇downsuperscript~𝑇brsuperscript~𝑇locsuperscript~𝑇up\tilde{\mathbf{T}}=\{\tilde{T}^{\mathrm{down}},\tilde{T}^{\mathrm{br}},\tilde{% T}^{\mathrm{loc}},\tilde{T}^{\mathrm{up}}\}over~ start_ARG bold_T end_ARG = { over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT } with 𝐓~=𝐓^~𝐓^𝐓\tilde{\mathbf{T}}=\hat{\mathbf{T}}over~ start_ARG bold_T end_ARG = over^ start_ARG bold_T end_ARG except that T~loc=TmaxTsynT^downNT^brT^upsuperscript~𝑇locsuperscript𝑇maxsuperscript𝑇synsuperscript^𝑇down𝑁superscript^𝑇brsuperscript^𝑇up\tilde{T}^{\mathrm{loc}}=\frac{T^{\mathrm{max}}-T^{\mathrm{syn}}-\hat{T}^{% \mathrm{down}}}{N}-\hat{T}^{\mathrm{br}}-\hat{T}^{\mathrm{up}}over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT = divide start_ARG italic_T start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT - italic_T start_POSTSUPERSCRIPT roman_syn end_POSTSUPERSCRIPT - over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG start_ARG italic_N end_ARG - over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT - over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT. It is shown that the (18c) is satisfied with equality by 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG and T~loc>T^loc.superscript~𝑇locsuperscript^𝑇loc\tilde{T}^{\mathrm{loc}}>\hat{T}^{\mathrm{loc}}.over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT > over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT . It can be shown that all the remaining constraints of the problem (23) are still satisfied by the solution 𝐓~,~𝐓\tilde{\mathbf{T}},over~ start_ARG bold_T end_ARG , and the objective function value achieved by 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG is the same as the optimal solution 𝐓^.^𝐓\hat{\mathbf{T}}.over^ start_ARG bold_T end_ARG . Thus, 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG is not only a feasible solution but also an optimal solution. From (21), it is shown that the energy consumption of each device achieved by the solution 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG is lower than that achieved by the solution 𝐓^,^𝐓\hat{\mathbf{T}},over^ start_ARG bold_T end_ARG , which means that the optimal solution 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG is more desirable than the optimal solution 𝐓^.^𝐓\hat{\mathbf{T}}.over^ start_ARG bold_T end_ARG . This completes the proof.

-C Proof of Lemma 3

Suppose that the optimal time allocation solution is 𝐓^={T^down,T^br,T^loc,T^up},^𝐓superscript^𝑇downsuperscript^𝑇brsuperscript^𝑇locsuperscript^𝑇up\hat{\mathbf{T}}=\{\hat{T}^{\mathrm{down}},\hat{T}^{\mathrm{br}},\hat{T}^{% \mathrm{loc}},\hat{T}^{\mathrm{up}}\},over^ start_ARG bold_T end_ARG = { over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT } , where T^downRjdown>ΓDjgensuperscript^𝑇downsuperscriptsubscript𝑅𝑗downΓsuperscriptsubscript𝐷𝑗gen\hat{T}^{\mathrm{down}}R_{j}^{\mathrm{down}}>\Gamma D_{j}^{\mathrm{gen}}over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT > roman_Γ italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT for j=argmaxkDkgenRkdown𝑗subscript𝑘superscriptsubscript𝐷𝑘gensuperscriptsubscript𝑅𝑘downj=\arg\max_{k}\frac{D_{k}^{\mathrm{gen}}}{R_{k}^{\mathrm{down}}}italic_j = roman_arg roman_max start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG. This means that T^downRkdown>ΓDkgen,k.superscript^𝑇downsuperscriptsubscript𝑅𝑘downΓsuperscriptsubscript𝐷𝑘genfor-all𝑘\hat{T}^{\mathrm{down}}R_{k}^{\mathrm{down}}>\Gamma D_{k}^{\mathrm{gen}},% \forall k.over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT > roman_Γ italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , ∀ italic_k . Then, we construct another solution 𝐓~={T~down,T~br,T~loc,T~up}~𝐓superscript~𝑇downsuperscript~𝑇brsuperscript~𝑇locsuperscript~𝑇up\tilde{\mathbf{T}}=\{\tilde{T}^{\mathrm{down}},\tilde{T}^{\mathrm{br}},\tilde{% T}^{\mathrm{loc}},\tilde{T}^{\mathrm{up}}\}over~ start_ARG bold_T end_ARG = { over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_br end_POSTSUPERSCRIPT , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT } with 𝐓~=𝐓^~𝐓^𝐓\tilde{\mathbf{T}}=\hat{\mathbf{T}}over~ start_ARG bold_T end_ARG = over^ start_ARG bold_T end_ARG except that T~down=ΓmaxkDkgenRkdown,T~loc=T^loc+T^downT~down.formulae-sequencesuperscript~𝑇downΓsubscript𝑘superscriptsubscript𝐷𝑘gensuperscriptsubscript𝑅𝑘downsuperscript~𝑇locsuperscript^𝑇locsuperscript^𝑇downsuperscript~𝑇down\tilde{T}^{\mathrm{down}}=\Gamma\max_{k}\frac{D_{k}^{\mathrm{gen}}}{R_{k}^{% \mathrm{down}}},\tilde{T}^{\mathrm{loc}}=\hat{T}^{\mathrm{loc}}+\hat{T}^{% \mathrm{down}}-\tilde{T}^{\mathrm{down}}.over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT = roman_Γ roman_max start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT end_ARG , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT = over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT + over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT - over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT . This means that T~downRjdown=ΓDjgen,T~downRkdown>ΓDkgen,kjformulae-sequencesuperscript~𝑇downsuperscriptsubscript𝑅𝑗downΓsuperscriptsubscript𝐷𝑗genformulae-sequencesuperscript~𝑇downsuperscriptsubscript𝑅𝑘downΓsuperscriptsubscript𝐷𝑘gen𝑘𝑗\tilde{T}^{\mathrm{down}}R_{j}^{\mathrm{down}}=\Gamma D_{j}^{\mathrm{gen}},% \tilde{T}^{\mathrm{down}}R_{k}^{\mathrm{down}}>\Gamma D_{k}^{\mathrm{gen}},k\neq jover~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT = roman_Γ italic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT > roman_Γ italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , italic_k ≠ italic_j, T~down<T^downsuperscript~𝑇downsuperscript^𝑇down\tilde{T}^{\mathrm{down}}<\hat{T}^{\mathrm{down}}over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT < over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_down end_POSTSUPERSCRIPT and T~loc>T^loc.superscript~𝑇locsuperscript^𝑇loc\tilde{T}^{\mathrm{loc}}>\hat{T}^{\mathrm{loc}}.over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT > over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT . It can be verified that 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG satisfies all the constraints of the problem (23), and achieves the same objective function value as the optimal solution 𝐓^^𝐓\hat{\mathbf{T}}over^ start_ARG bold_T end_ARG. Thus, 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG is also an optimal solution. Since T~loc>T^loc,superscript~𝑇locsuperscript^𝑇loc\tilde{T}^{\mathrm{loc}}>\hat{T}^{\mathrm{loc}},over~ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT > over^ start_ARG italic_T end_ARG start_POSTSUPERSCRIPT roman_loc end_POSTSUPERSCRIPT , it can be shown from (21) that the solution 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG achieves lower energy consumption compared to 𝐓^.^𝐓\hat{\mathbf{T}}.over^ start_ARG bold_T end_ARG . This means that the optimal solution 𝐓~~𝐓\tilde{\mathbf{T}}over~ start_ARG bold_T end_ARG is more desirable than the optimal solution 𝐓^.^𝐓\hat{\mathbf{T}}.over^ start_ARG bold_T end_ARG . This completes the proof.

-D Proof of Lemma 6

Suppose that the optimal solution to the problem (35) is 𝐩^={p^k,k=1,,K},η^,\hat{\mathbf{p}}=\{\hat{p}_{k},k=1,\ldots,K\},\hat{\eta},over^ start_ARG bold_p end_ARG = { over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , … , italic_K } , over^ start_ARG italic_η end_ARG , where Blog2(1+hk^p^k^σ2B+hk^j>k^p^j)>ηDk^gen𝐵subscript21subscript^𝑘subscript^𝑝^𝑘superscript𝜎2𝐵subscript^𝑘subscript𝑗^𝑘subscript^𝑝𝑗𝜂superscriptsubscript𝐷^𝑘genB\log_{2}\left(1+\frac{h_{\hat{k}}\hat{p}_{\hat{k}}}{\sigma^{2}B+h_{\hat{k}}% \sum_{j>\hat{k}}\hat{p}_{j}}\right)>\eta D_{\hat{k}}^{\mathrm{gen}}italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_h start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j > over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) > italic_η italic_D start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT for a given k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG, and Blog2(1+hkp^kσ2B+hkj>kp^j)=ηDkgen𝐵subscript21subscript𝑘subscript^𝑝𝑘superscript𝜎2𝐵subscript𝑘subscript𝑗𝑘subscript^𝑝𝑗𝜂superscriptsubscript𝐷𝑘genB\log_{2}\left(1+\frac{h_{k}\hat{p}_{k}}{\sigma^{2}B+h_{k}\sum_{j>k}\hat{p}_{j% }}\right)=\eta D_{k}^{\mathrm{gen}}italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j > italic_k end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) = italic_η italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT for kk^𝑘^𝑘k\neq\hat{k}italic_k ≠ over^ start_ARG italic_k end_ARG. Then, we can consider another solution 𝐩~={p~k,k=1,,K},η~\tilde{\mathbf{p}}=\{\tilde{p}_{k},k=1,\ldots,K\},\tilde{\eta}over~ start_ARG bold_p end_ARG = { over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , … , italic_K } , over~ start_ARG italic_η end_ARG with 𝐩~=𝐩^,η~=η^formulae-sequence~𝐩^𝐩~𝜂^𝜂\tilde{\mathbf{p}}=\hat{\mathbf{p}},\tilde{\eta}=\hat{\eta}over~ start_ARG bold_p end_ARG = over^ start_ARG bold_p end_ARG , over~ start_ARG italic_η end_ARG = over^ start_ARG italic_η end_ARG, except that p~ksubscript~𝑝𝑘\tilde{p}_{k}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is determined by Blog2(1+hkp~kσ2B+hkj>kp~j)=ηDkgen𝐵subscript21subscript𝑘subscript~𝑝𝑘superscript𝜎2𝐵subscript𝑘subscript𝑗𝑘subscript~𝑝𝑗𝜂superscriptsubscript𝐷𝑘genB\log_{2}\left(1+\frac{h_{k}\tilde{p}_{k}}{\sigma^{2}B+h_{k}\sum_{j>k}\tilde{p% }_{j}}\right)=\eta D_{k}^{\mathrm{gen}}italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j > italic_k end_POSTSUBSCRIPT over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) = italic_η italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT. It is shown that p~k<p^ksubscript~𝑝𝑘subscript^𝑝𝑘\tilde{p}_{k}<\hat{p}_{k}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT < over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and thus all the constraints of the problem (35) are satisfied by the solution 𝐩~,η~~𝐩~𝜂\tilde{\mathbf{p}},\tilde{\eta}over~ start_ARG bold_p end_ARG , over~ start_ARG italic_η end_ARG. Therefore, the solution 𝐩~,η~~𝐩~𝜂\tilde{\mathbf{p}},\tilde{\eta}over~ start_ARG bold_p end_ARG , over~ start_ARG italic_η end_ARG is a feasible solution to the problem (35) and consumes less power than the optimal solution, which indicates that the solution 𝐩~,η~~𝐩~𝜂\tilde{\mathbf{p}},\tilde{\eta}over~ start_ARG bold_p end_ARG , over~ start_ARG italic_η end_ARG is more desirable than the optimal solution 𝐩^,η^.^𝐩^𝜂\hat{\mathbf{p}},\hat{\eta}.over^ start_ARG bold_p end_ARG , over^ start_ARG italic_η end_ARG . This completes the proof.

-E Proof of Theorem 3

We consider two devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG with adjacent decoding orders and gk^qk^max<gk~qk~maxsubscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsubscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxg_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}<g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT < italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT. There are two possible decoding orders for the devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG, i.e., the device k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG is decoded first or the device k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG is decoded first. Let I𝐼Iitalic_I denote the interference caused by the devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG to the devices whose decoding orders are smaller than the devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG, and Isuperscript𝐼I^{{}^{\prime}}italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT denote the interference caused by the devices whose decoding orders are larger than the devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG to the devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG. To guarantee the performance of other devices, the interference caused by the devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG is restricted as gk^qk^maxqk^+gk~qk~maxqk~I.subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘𝐼g_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}q_{\hat{k}}^{{}^{\prime}}+g_{\tilde{k}}q_% {\tilde{k}}^{\mathrm{max}}q_{\tilde{k}}^{{}^{\prime}}\leq I.italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ≤ italic_I . The problem of optimizing qk^superscriptsubscript𝑞^𝑘q_{\hat{k}}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT and qk~superscriptsubscript𝑞~𝑘q_{\tilde{k}}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT is formulated as

maxqk^,qk~,θsubscriptsuperscriptsubscript𝑞^𝑘superscriptsubscript𝑞~𝑘𝜃\displaystyle\max_{q_{\hat{k}}^{{}^{\prime}},q_{\tilde{k}}^{{}^{\prime}},\theta}roman_max start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT , italic_θ end_POSTSUBSCRIPT θ𝜃\displaystyle\>\thetaitalic_θ (48a)
s.t.formulae-sequencest\displaystyle\mathrm{s.t.}roman_s . roman_t . Rkupθ,k{k^,k~},formulae-sequencesuperscriptsubscript𝑅𝑘up𝜃𝑘^𝑘~𝑘\displaystyle\>R_{k}^{\mathrm{up}}\geq\theta,k\in\{\hat{k},\tilde{k}\},italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT ≥ italic_θ , italic_k ∈ { over^ start_ARG italic_k end_ARG , over~ start_ARG italic_k end_ARG } , (48b)
gk^qk^maxqk^+gk~qk~maxqk~I,subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘𝐼\displaystyle\>g_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}q_{\hat{k}}^{{}^{\prime}}+% g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}q_{\tilde{k}}^{{}^{\prime}}\leq I,italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ≤ italic_I , (48c)
 0qk1,k{k^,k~}.formulae-sequence 0superscriptsubscript𝑞𝑘1𝑘^𝑘~𝑘\displaystyle\>0\leq q_{k}^{{}^{\prime}}\leq 1,k\in\{\hat{k},\tilde{k}\}.0 ≤ italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ≤ 1 , italic_k ∈ { over^ start_ARG italic_k end_ARG , over~ start_ARG italic_k end_ARG } . (48d)

Suppose that the device k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG is decoded first. Then, we have

Rk^upsuperscriptsubscript𝑅^𝑘up\displaystyle R_{\hat{k}}^{\mathrm{up}}italic_R start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT =Blog2(1+gk^qk^maxqk^σ2B+j=1,πj>πk^gjqjmaxqj)absent𝐵subscript21subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘superscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋^𝑘subscript𝑔𝑗superscriptsubscript𝑞𝑗maxsuperscriptsubscript𝑞𝑗\displaystyle=B\log_{2}\left(1+\frac{g_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}q_{% \hat{k}}^{{}^{\prime}}}{\sigma^{2}B+\sum_{j=1,\pi_{j}>\pi_{\hat{k}}}g_{j}q_{j}% ^{\mathrm{max}}q_{j}^{{}^{\prime}}}\right)= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG )
=Blog2(σ2B+gk^qk^maxqk^+gk~qk~maxqk~+Iσ2B+gk~qk~maxqk~+I),absent𝐵subscript2superscript𝜎2𝐵subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘superscript𝐼superscript𝜎2𝐵subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘superscript𝐼\displaystyle=B\log_{2}\left(\frac{\sigma^{2}B+g_{\hat{k}}q_{\hat{k}}^{\mathrm% {max}}q_{\hat{k}}^{{}^{\prime}}+g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}q_{% \tilde{k}}^{{}^{\prime}}+I^{{}^{\prime}}}{\sigma^{2}B+g_{\tilde{k}}q_{\tilde{k% }}^{\mathrm{max}}q_{\tilde{k}}^{{}^{\prime}}+I^{{}^{\prime}}}\right),= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) , (49)
Rk~upsuperscriptsubscript𝑅~𝑘up\displaystyle R_{\tilde{k}}^{\mathrm{up}}italic_R start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT =Blog2(1+gk~qk~maxqk~σ2B+j=1,πj>πk~gjqjmaxqj)absent𝐵subscript21subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘superscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋~𝑘subscript𝑔𝑗superscriptsubscript𝑞𝑗maxsuperscriptsubscript𝑞𝑗\displaystyle=B\log_{2}\left(1+\frac{g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}% q_{\tilde{k}}^{{}^{\prime}}}{\sigma^{2}B+\sum_{j=1,\pi_{j}>\pi_{\tilde{k}}}g_{% j}q_{j}^{\mathrm{max}}q_{j}^{{}^{\prime}}}\right)= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG )
=Blog2(σ2B+gk~qk~maxqk~+Iσ2B+I).absent𝐵subscript2superscript𝜎2𝐵subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘superscript𝐼superscript𝜎2𝐵superscript𝐼\displaystyle=B\log_{2}\left(\frac{\sigma^{2}B+g_{\tilde{k}}q_{\tilde{k}}^{% \mathrm{max}}q_{\tilde{k}}^{{}^{\prime}}+I^{{}^{\prime}}}{\sigma^{2}B+I^{{}^{% \prime}}}\right).= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) . (50)

The optimal solution to the problem (48) can satisfy the constraint (48b) with strict equality, otherwise qk^superscriptsubscript𝑞^𝑘q_{\hat{k}}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT oder qk~superscriptsubscript𝑞~𝑘q_{\tilde{k}}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT can be decreased to let the constraint (48b) be satisfied with equality to save energy consumption, or θ𝜃\thetaitalic_θ can be increased to achieve a higher objective function value. Thus, from Rkup=θ,k{k^,k~},formulae-sequencesuperscriptsubscript𝑅𝑘up𝜃𝑘^𝑘~𝑘R_{k}^{\mathrm{up}}=\theta,k\in\{\hat{k},\tilde{k}\},italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT = italic_θ , italic_k ∈ { over^ start_ARG italic_k end_ARG , over~ start_ARG italic_k end_ARG } , we have

qk^=(2θB1)(σ2B+I)2θBgk^qk^max,superscriptsubscript𝑞^𝑘superscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼superscript2𝜃𝐵subscript𝑔^𝑘superscriptsubscript𝑞^𝑘max\displaystyle q_{\hat{k}}^{{}^{\prime}}=\frac{\left(2^{\frac{\theta}{B}}-1% \right)\left(\sigma^{2}B+I^{{}^{\prime}}\right)2^{\frac{\theta}{B}}}{g_{\hat{k% }}q_{\hat{k}}^{\mathrm{max}}},italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = divide start_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG , (51)
qk~=(2θB1)(σ2B+I)gk~qk~max.superscriptsubscript𝑞~𝑘superscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼subscript𝑔~𝑘superscriptsubscript𝑞~𝑘max\displaystyle q_{\tilde{k}}^{{}^{\prime}}=\frac{\left(2^{\frac{\theta}{B}}-1% \right)\left(\sigma^{2}B+I^{{}^{\prime}}\right)}{g_{\tilde{k}}q_{\tilde{k}}^{% \mathrm{max}}}.italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = divide start_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG . (52)

By inserting (51) and (52) into the constraints (48c) and (48d), we have

θB2log2(1+Iσ2B+I),𝜃𝐵2subscript21𝐼superscript𝜎2𝐵superscript𝐼\displaystyle\theta\leq\frac{B}{2}\log_{2}\left(1+\frac{I}{\sigma^{2}B+I^{{}^{% \prime}}}\right),italic_θ ≤ divide start_ARG italic_B end_ARG start_ARG 2 end_ARG roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_I end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) , (53)
gk^qk^max(2θB1)(σ2B+I)2θB,subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼superscript2𝜃𝐵\displaystyle g_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}\geq\left(2^{\frac{\theta}{% B}}-1\right)\left(\sigma^{2}B+I^{{}^{\prime}}\right)2^{\frac{\theta}{B}},italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≥ ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT , (54)
gk~qk~max(2θB1)(σ2B+I).subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼\displaystyle g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}\geq\left(2^{\frac{% \theta}{B}}-1\right)\left(\sigma^{2}B+I^{{}^{\prime}}\right).italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≥ ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) . (55)

The inequality (53) restricts the maximum θ𝜃\thetaitalic_θ that can be supported under the given interference constraint I,𝐼I,italic_I , while inequalities (54) and (55) also restrict the maximum θ𝜃\thetaitalic_θ that can be supported under the given gk^qk^maxsubscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxg_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT and gk~qk~max.subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxg_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}.italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT . Since gk^qk^max<gk~qk~max,subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsubscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxg_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}<g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}},italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT < italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , the inequality (54) is tighter than the inequality (55). Thus, the inequality (55) is dumb, and the maximum θ𝜃\thetaitalic_θ can be derived from (53) and (54) accordingly.

Then, assume that the device k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG is decoded first, and we get

Rk^upsuperscriptsubscript𝑅^𝑘up\displaystyle R_{\hat{k}}^{\mathrm{up}}italic_R start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT =Blog2(1+gk^qk^maxqk^σ2B+j=1,πj>πk^gjqjmaxqj)absent𝐵subscript21subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘superscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋^𝑘subscript𝑔𝑗superscriptsubscript𝑞𝑗maxsuperscriptsubscript𝑞𝑗\displaystyle=B\log_{2}\left(1+\frac{g_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}q_{% \hat{k}}^{{}^{\prime}}}{\sigma^{2}B+\sum_{j=1,\pi_{j}>\pi_{\hat{k}}}g_{j}q_{j}% ^{\mathrm{max}}q_{j}^{{}^{\prime}}}\right)= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG )
=Blog2(σ2B+gk^qk^maxqk^+Iσ2B+I),absent𝐵subscript2superscript𝜎2𝐵subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘superscript𝐼superscript𝜎2𝐵superscript𝐼\displaystyle=B\log_{2}\left(\frac{\sigma^{2}B+g_{\hat{k}}q_{\hat{k}}^{\mathrm% {max}}q_{\hat{k}}^{{}^{\prime}}+I^{{}^{\prime}}}{\sigma^{2}B+I^{{}^{\prime}}}% \right),= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) , (56)
Rk~upsuperscriptsubscript𝑅~𝑘up\displaystyle R_{\tilde{k}}^{\mathrm{up}}italic_R start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT =Blog2(1+gk~qk~maxqk~σ2B+j=1,πj>πk~gjqjmaxqj)absent𝐵subscript21subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘superscript𝜎2𝐵subscriptformulae-sequence𝑗1subscript𝜋𝑗subscript𝜋~𝑘subscript𝑔𝑗superscriptsubscript𝑞𝑗maxsuperscriptsubscript𝑞𝑗\displaystyle=B\log_{2}\left(1+\frac{g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}% q_{\tilde{k}}^{{}^{\prime}}}{\sigma^{2}B+\sum_{j=1,\pi_{j}>\pi_{\tilde{k}}}g_{% j}q_{j}^{\mathrm{max}}q_{j}^{{}^{\prime}}}\right)= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + ∑ start_POSTSUBSCRIPT italic_j = 1 , italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG )
=Blog2(σ2B+gk^qk^maxqk^+gk~qk~maxqk~+Iσ2B+gk^qk^maxqk^+I).absent𝐵subscript2superscript𝜎2𝐵subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscriptsubscript𝑞~𝑘superscript𝐼superscript𝜎2𝐵subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscriptsubscript𝑞^𝑘superscript𝐼\displaystyle=B\log_{2}\left(\frac{\sigma^{2}B+g_{\hat{k}}q_{\hat{k}}^{\mathrm% {max}}q_{\hat{k}}^{{}^{\prime}}+g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}q_{% \tilde{k}}^{{}^{\prime}}+I^{{}^{\prime}}}{\sigma^{2}B+g_{\hat{k}}q_{\hat{k}}^{% \mathrm{max}}q_{\hat{k}}^{{}^{\prime}}+I^{{}^{\prime}}}\right).= italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) . (57)

Similarly, the optimal qk^superscriptsubscript𝑞^𝑘q_{\hat{k}}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT and qk~superscriptsubscript𝑞~𝑘q_{\tilde{k}}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT for the problem (48) when the device k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG is decoded first shall let the constraint (48b) be satisfied with equality. Thus, from Rkup=θ,k{k^,k~},formulae-sequencesuperscriptsubscript𝑅𝑘up𝜃𝑘^𝑘~𝑘R_{k}^{\mathrm{up}}=\theta,k\in\{\hat{k},\tilde{k}\},italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT = italic_θ , italic_k ∈ { over^ start_ARG italic_k end_ARG , over~ start_ARG italic_k end_ARG } , we get

qk^=(2θB1)(σ2B+I)gk^qk^max,superscriptsubscript𝑞^𝑘superscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼subscript𝑔^𝑘superscriptsubscript𝑞^𝑘max\displaystyle q_{\hat{k}}^{{}^{\prime}}=\frac{\left(2^{\frac{\theta}{B}}-1% \right)\left(\sigma^{2}B+I^{{}^{\prime}}\right)}{g_{\hat{k}}q_{\hat{k}}^{% \mathrm{max}}},italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = divide start_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG , (58)
qk~=(2θB1)(σ2B+I)2θBgk~qk~max.superscriptsubscript𝑞~𝑘superscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼superscript2𝜃𝐵subscript𝑔~𝑘superscriptsubscript𝑞~𝑘max\displaystyle q_{\tilde{k}}^{{}^{\prime}}=\frac{\left(2^{\frac{\theta}{B}}-1% \right)\left(\sigma^{2}B+I^{{}^{\prime}}\right)2^{\frac{\theta}{B}}}{g_{\tilde% {k}}q_{\tilde{k}}^{\mathrm{max}}}.italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = divide start_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_ARG . (59)

By substituting (58) and (59) into the constraints (48c) and (48d), we get

θB2log2(1+Iσ2B+I),𝜃𝐵2subscript21𝐼superscript𝜎2𝐵superscript𝐼\displaystyle\>\theta\leq\frac{B}{2}\log_{2}\left(1+\frac{I}{\sigma^{2}B+I^{{}% ^{\prime}}}\right),italic_θ ≤ divide start_ARG italic_B end_ARG start_ARG 2 end_ARG roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_I end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG ) , (60)
gk^qk^max(2θB1)(σ2B+I),subscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsuperscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼\displaystyle\>g_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}\geq\left(2^{\frac{\theta}% {B}}-1\right)\left(\sigma^{2}B+I^{{}^{\prime}}\right),italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≥ ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) , (61)
gk~qk~max(2θB1)(σ2B+I)2θB.subscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxsuperscript2𝜃𝐵1superscript𝜎2𝐵superscript𝐼superscript2𝜃𝐵\displaystyle\>g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}\geq\left(2^{\frac{% \theta}{B}}-1\right)\left(\sigma^{2}B+I^{{}^{\prime}}\right)2^{\frac{\theta}{B% }}.italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≥ ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT - 1 ) ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_B + italic_I start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) 2 start_POSTSUPERSCRIPT divide start_ARG italic_θ end_ARG start_ARG italic_B end_ARG end_POSTSUPERSCRIPT . (62)

It is noted that the expressions (53) and (60) are the same. It is shown the constraint (61) is looser than the constraint (54), while the constraint (62) is also looser than the constraint (54) due to the fact that gk^qk^max<gk~qk~maxsubscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsubscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxg_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}<g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT < italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT. Thus, the maximum allowable θ𝜃\thetaitalic_θ under the constraints (60), (61) and (62) when the device k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG is decoded first is equal to or larger than that when the device k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG is decoded first. Therefore, the optimal decoding order is decoding the device k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG first.

In the above, we have proved that the optimal decoding order for the two devices k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG and k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG with adjacent decoding orders and gk^qk^max<gk~qk~maxsubscript𝑔^𝑘superscriptsubscript𝑞^𝑘maxsubscript𝑔~𝑘superscriptsubscript𝑞~𝑘maxg_{\hat{k}}q_{\hat{k}}^{\mathrm{max}}<g_{\tilde{k}}q_{\tilde{k}}^{\mathrm{max}}italic_g start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT < italic_g start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT over~ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT is decoding the device k~~𝑘\tilde{k}over~ start_ARG italic_k end_ARG first. For multiple devices, since any two adjacent devices shall satisfy this decoding criterion, we can conclude that the optimal decoding order is in the descending order of gkqkmax.subscript𝑔𝑘superscriptsubscript𝑞𝑘maxg_{k}q_{k}^{\mathrm{max}}.italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT . This completes the proof.

-F Proof of Lemma 7

Suppose that a feasible solution to the problem (35) satisfies Rk^up>θsuperscriptsubscript𝑅^𝑘up𝜃R_{\hat{k}}^{\mathrm{up}}>\thetaitalic_R start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT > italic_θ for a given k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG. Then, we can easily decrease the value of qksuperscriptsubscript𝑞𝑘q_{k}^{{}^{\prime}}italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT until Rk^up=θsuperscriptsubscript𝑅^𝑘up𝜃R_{\hat{k}}^{\mathrm{up}}=\thetaitalic_R start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_up end_POSTSUPERSCRIPT = italic_θ such that all the constraints are still satisfied. Thus, the constraint (43b) can be satisfied with strict equality by the feasible solution to the problem (35). This completes the proof.

-G Proof of Proposition 1

Let F(𝐃gen,𝐓,𝐩,𝐪,𝝅)𝐹superscript𝐃gen𝐓𝐩𝐪𝝅F(\mathbf{D}^{\mathrm{gen}},\mathbf{T},\mathbf{p},\mathbf{q},\boldsymbol{\pi})italic_F ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T , bold_p , bold_q , bold_italic_π ) denote the objective function in (22a). Then, F(𝐃gen,𝐓,𝐩,𝐪,𝝅)𝐹superscript𝐃gen𝐓𝐩𝐪𝝅F(\mathbf{D}^{\mathrm{gen}},\mathbf{T},\mathbf{p},\mathbf{q},\boldsymbol{\pi})italic_F ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T , bold_p , bold_q , bold_italic_π ) in the l𝑙litalic_l-th iteration of the BCD method is given by

F(𝐃gen(l),𝐓(l),𝐩(l),𝐪(l),𝝅(l))𝐹superscript𝐃gen𝑙𝐓𝑙𝐩𝑙𝐪𝑙𝝅𝑙\displaystyle F(\mathbf{D}^{\mathrm{gen}}(l),\mathbf{T}(l),\mathbf{p}(l),% \mathbf{q}(l),\boldsymbol{\pi}(l))italic_F ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ( italic_l ) , bold_T ( italic_l ) , bold_p ( italic_l ) , bold_q ( italic_l ) , bold_italic_π ( italic_l ) )
\displaystyle\leq F(𝐃gen(l1),𝐓(l1),𝐩(l),𝐪(l),𝝅(l))𝐹superscript𝐃gen𝑙1𝐓𝑙1𝐩𝑙𝐪𝑙𝝅𝑙\displaystyle F(\mathbf{D}^{\mathrm{gen}}(l-1),\mathbf{T}(l-1),\mathbf{p}(l),% \mathbf{q}(l),\boldsymbol{\pi}(l))italic_F ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ( italic_l - 1 ) , bold_T ( italic_l - 1 ) , bold_p ( italic_l ) , bold_q ( italic_l ) , bold_italic_π ( italic_l ) )
\displaystyle\leq F(𝐃gen(l1),𝐓(l1),𝐩(l1),𝐪(l1),𝝅(l1)).𝐹superscript𝐃gen𝑙1𝐓𝑙1𝐩𝑙1𝐪𝑙1𝝅𝑙1\displaystyle F(\mathbf{D}^{\mathrm{gen}}(l-1),\mathbf{T}(l-1),\mathbf{p}(l-1)% ,\mathbf{q}(l-1),\boldsymbol{\pi}(l-1)).italic_F ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ( italic_l - 1 ) , bold_T ( italic_l - 1 ) , bold_p ( italic_l - 1 ) , bold_q ( italic_l - 1 ) , bold_italic_π ( italic_l - 1 ) ) . (63)

The first inequality in (63) holds because we optimally solve the problem of optimizing 𝐃gen,𝐓superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T}bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T with given 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π, and 𝐃gen(l),𝐓(l)superscript𝐃gen𝑙𝐓𝑙\mathbf{D}^{\mathrm{gen}}(l),\mathbf{T}(l)bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ( italic_l ) , bold_T ( italic_l ) optimally maximizes the objective function as compared to 𝐃gen(l1),𝐓(l1)superscript𝐃gen𝑙1𝐓𝑙1\mathbf{D}^{\mathrm{gen}}(l-1),\mathbf{T}(l-1)bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT ( italic_l - 1 ) , bold_T ( italic_l - 1 ). The second inequality in (63) holds because we optimally solve the problem of optimizing 𝐩,𝐪,𝝅𝐩𝐪𝝅\mathbf{p},\mathbf{q},\boldsymbol{\pi}bold_p , bold_q , bold_italic_π with given 𝐃gen,𝐓,superscript𝐃gen𝐓\mathbf{D}^{\mathrm{gen}},\mathbf{T},bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T , and 𝐩(l),𝐪(l),𝝅(l)𝐩𝑙𝐪𝑙𝝅𝑙\mathbf{p}(l),\mathbf{q}(l),\boldsymbol{\pi}(l)bold_p ( italic_l ) , bold_q ( italic_l ) , bold_italic_π ( italic_l ) optimally maximizes the objective function as compared to 𝐩(l1),𝐪(l1),𝝅(l1).𝐩𝑙1𝐪𝑙1𝝅𝑙1\mathbf{p}(l-1),\mathbf{q}(l-1),\boldsymbol{\pi}(l-1).bold_p ( italic_l - 1 ) , bold_q ( italic_l - 1 ) , bold_italic_π ( italic_l - 1 ) . Since F(𝐃gen,𝐓,𝐩,𝐪,𝝅)𝐹superscript𝐃gen𝐓𝐩𝐪𝝅F(\mathbf{D}^{\mathrm{gen}},\mathbf{T},\mathbf{p},\mathbf{q},\boldsymbol{\pi})italic_F ( bold_D start_POSTSUPERSCRIPT roman_gen end_POSTSUPERSCRIPT , bold_T , bold_p , bold_q , bold_italic_π ) is clearly lower-bounded, the BCD method used for iteratively solving the problem (22) converges to a local optimal solution. This completes the proof.

References

  • [1] F. Javed, M. K. Afzal, M. Sharif, and B.-S. Kim, “Internet of things (IoT) operating systems support, networking technologies, applications, and challenges: A comparative review,” IEEE Commun. Surveys Tuts., vol. 20, no. 3, pp. 2062–2100, 2018.
  • [2] L. U. Khan, W. Saad, Z. Han, E. Hossain, and C. S. Hong, “Federated learning for internet of things: Recent advances, taxonomy, and open challenges,” IEEE Commun. Surveys Tuts., vol. 23, no. 3, pp. 1759–1799, 2021.
  • [3] S. Wang, Y.-C. Wu, M. Xia, R. Wang, and H. V. Poor, “Machine intelligence at the edge with learning centric power allocation,” IEEE Trans. Wireless Commun., vol. 19, no. 11, pp. 7293–7308, 2020.
  • [4] A. Celik and A. M. Eltawil, “At the dawn of generative AI era: A tutorial-cum-survey on new frontiers in 6G wireless intelligence,” IEEE Open J. Commun. Soc., vol. 5, pp. 2433–2489, 2024.
  • [5] A. Karapantelakis, P. Alizadeh, A. Alabassi, K. Dey, and A. Nikou, “Generative AI in mobile networks: A survey,” Ann. Telecommun., vol. 79, no. 1, pp. 15–33, 2024.
  • [6] M. Xu et al., “Unleashing the power of edge-cloud generative AI in mobile networks: A survey of AIGC services,” IEEE Commun. Surveys Tuts., 2024.
  • [7] M. Chen, Z. Yang, W. Saad, C. Yin, H. V. Poor, and S. Cui, “A joint learning and communications framework for federated learning over wireless networks,” IEEE Trans. Wireless Commun., vol. 20, no. 1, pp. 269–283, 2020.
  • [8] Z. Yang, M. Chen, W. Saad, C. S. Hong, and M. Shikh-Bahaei, “Energy efficient federated learning over wireless communication networks,” IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1935–1949, 2021.
  • [9] P. S. Bouzinis, P. D. Diamantoulakis, and G. K. Karagiannidis, “Wireless quantized federated learning: a joint computation and communication design,” IEEE Trans. Commun., vol. 71, no. 5, pp. 2756–2770, 2023.
  • [10] J. Zhang, S. Chen, X. Zhou, X. Wang, and Y.-B. Lin, “Joint scheduling of participants, local iterations, and radio resources for fair federated learning over mobile edge networks,” IEEE Trans. Mobile Comput., vol. 22, no. 07, pp. 3985–3999, 2023.
  • [11] H. Zhao, M. Zhou, W. Xia, Y. Ni, G. Gui, and H. Zhu, “Economic and energy-efficient wireless federated learning based on stackelberg game,” IEEE Trans. Veh. Technol., vol. 73, no. 2, pp. 2995–2999, 2024.
  • [12] D. Xu and H. Zhu, “Sum-rate maximization of wireless powered primary users for cooperative CRNs: NOMA or TDMA at cognitive users?” IEEE Trans. Commun., vol. 69, no. 7, pp. 4862–4876, 2021.
  • [13] Y. Liu et al., “Evolution of NOMA toward next generation multiple access (NGMA) for 6G,” IEEE J. Sel. Areas Commun., vol. 40, no. 4, pp. 1037–1071, 2022.
  • [14] D. Xu, “Device scheduling and computation offloading in mobile edge computing networks: A novel NOMA scheme,” IEEE Trans. Veh. Technol., early access, Jan. 10, 2024, doi: 10.1109/TVT.2024.3352262.
  • [15] Y. Wu, Y. Song, T. Wang, L. Qian, and T. Q. Quek, “Non-orthogonal multiple access assisted federated learning via wireless power transfer: A cost-efficient approach,” IEEE Trans. Commun., vol. 70, no. 4, pp. 2853–2869, 2022.
  • [16] M. S. Al-Abiad, M. Z. Hassan, and M. J. Hossain, “Energy-efficient resource allocation for federated learning in NOMA-enabled and relay-assisted Internet of Things networks,” IEEE Internet Things J., vol. 9, no. 24, pp. 24 736–24 753, 2022.
  • [17] W. Li, T. Lv, Y. Cao, W. Ni, and M. Peng, “Multi-carrier NOMA-empowered wireless federated learning with optimal power and bandwidth allocation,” IEEE Trans. Wireless Commun., vol. 22, no. 12, pp. 9762–9777, 2023.
  • [18] B. Wu, F. Fang, and X. Wang, “Joint age-based client selection and resource allocation for communication-efficient federated learning over NOMA networks,” IEEE Trans. Commun., vol. 72, no. 1, pp. 179–192, 2024.
  • [19] G. Liu et al., “Semantic communications for artificial intelligence generated content (AIGC) toward effective content creation,” IEEE Netw., early access, Jan. 11, 2024, doi: 10.1109/MNET.2024.3352917.
  • [20] R. Cheng, Y. Sun, D. Niyato, L. Zhang, L. Zhang, and M. A. Imran, “A wireless AI-generated content (AIGC) provisioning framework empowered by semantic communication,” arXiv preprint arXiv:2310.17705, 2023.
  • [21] T. Wu et al., “CDDM: Channel denoising diffusion models for wireless semantic communications,” IEEE Trans. Wireless Commun., early access, Mar. 28, 2024, doi: 10.1109/TWC.2024.3379244.
  • [22] Y. Liu et al., “Blockchain-empowered lifecycle management for AI-generated content products in edge networks,” IEEE Wireless Commun., early access, Feb. 05, 2024, doi: 10.1109/MWC.003.2300053.
  • [23] H. Du et al., “Exploring collaborative distributed diffusion-based AI-generated content (AIGC) in wireless networks,” IEEE Netw., early access, Jul. 03, 2024, doi: 10.1109/MNET.006.2300223.
  • [24] ——, “Diffusion-based reinforcement learning for edge-enabled AI-generated content services,” IEEE Trans. Mobile Comput., early access, Jan. 19, 2024, doi: 10.1109/TMC.2024.3356178.
  • [25] X. Huang et al., “Federated learning-empowered AI-generated content in wireless networks,” IEEE Netw., early access, Jan. 12, 2024, doi: 10.1109/MNET.2024.3353377.
  • [26] P. Li et al., “Filling the missing: Exploring generative AI for enhanced federated learning over heterogeneous mobile edge devices,” IEEE Trans. Mobile Comput., early access, Feb. 29, 2024, doi: 10.1109/TMC.2024.3371772.
  • [27] D. Xu, “Latency minimization for TDMA-based wireless federated learning networks,” IEEE Trans. Veh. Technol., early access, Apr. 16, 2024, doi: 10.1109/TVT.2024.3389972.
  • [28] M. Grant and S. Boyd, “CVX: Matlab software for disciplined convex programming, version 2.1,” http://cvxr.com/cvx, Mar. 2014.
  • [29] Z. Yang, M. Chen, W. Saad, W. Xu, and M. Shikh-Bahaei, “Sum-rate maximization of uplink rate splitting multiple access (RSMA) communication,” IEEE Trans. Mobile Comput., vol. 21, no. 07, pp. 2596–2609, 2022.
  • [30] S. Boyd and L. Vandenberghe, Convex Optimization.   Cambridge, U.K.: Cambridge Univ. Press, 2004.
  • [31] “Further advancements for E-UTRA physical layer aspects (release 9),” 3GPP, TS 36.814 (V9.0.0), Mar. 2010.
  • [32] P. S. Bouzinis, P. D. Diamantoulakis, and G. K. Karagiannidis, “Wireless federated learning (WFL) for 6G networks–part I: Research challenges and future trends,” IEEE Commun. Lett., vol. 26, no. 1, pp. 3–7, 2022.