-
Optimizing Highway Ramp Merge Safety and Efficiency via Spatio-Temporal Cooperative Control and Vehicle-Road Coordination
Authors:
Ting Peng,
Xiaoxue Xu,
Yuan Li,
Jie Wu,
Tao Li,
Xiang Dong,
Yincai Cai,
Peng Wu
Abstract:
In view of existing automatic driving, it is difficult to accurately and timely obtain the status and driving intention of other vehicles. The safety risk and urgency of autonomous vehicles in the absence of collision are evaluated. To ensure safety and improve road efficiency, a method of pre-compiling the spatio-temporal trajectory of vehicles is established to eliminate conflicts between vehicl…
▽ More
In view of existing automatic driving, it is difficult to accurately and timely obtain the status and driving intention of other vehicles. The safety risk and urgency of autonomous vehicles in the absence of collision are evaluated. To ensure safety and improve road efficiency, a method of pre-compiling the spatio-temporal trajectory of vehicles is established to eliminate conflicts between vehicles in advance. The calculation method of the safe distance under spatio-temporal conditions is studied, considering vehicle speed differences, vehicle positioning errors, and clock errors. By combining collision acceleration and urgent acceleration, an evaluation model for vehicle conflict risk is constructed. Mainline vehicles that may have conflicts with on-ramp vehicles are identified, and the target gap for on-ramp vehicles is determined. Finally, a cooperative control method is established based on the selected target gap, preparing the vehicle travel path in advance. Taking highway ramp merge as an example, the mainline priority spatio-temporal cooperative control method is proposed and verified through simulation. Using SUMO and Python co-simulation, mainline traffic volumes of 800 veh*h-1*lane-1
△ Less
Submitted 15 August, 2024;
originally announced August 2024.
-
Design and Testing for Steel Support Axial Force Servo System
Authors:
Sana Ullah,
Yonghong Zhou,
Maokai Lai,
Xiang Dong,
Tao Li,
Xiaoxue Xu,
Yuan Li,
Ting Peng
Abstract:
Foundation excavations are deepening, expanding, and approaching structures. Steel supports measure and manage axial force. The study regulates steel support structure power during deep excavation using a novel axial force management system for safety, efficiency, and structural integrity. Closed-loop control changes actuator output to maintain axial force based on force. In deep excavation, the s…
▽ More
Foundation excavations are deepening, expanding, and approaching structures. Steel supports measure and manage axial force. The study regulates steel support structure power during deep excavation using a novel axial force management system for safety, efficiency, and structural integrity. Closed-loop control changes actuator output to maintain axial force based on force. In deep excavation, the servo system regulates unstable soil, side pressure, and structural demands. Modern engineering and tech are used. Temperature changes automatically adjust the jack to maintain axial force. Includes hydraulic jacks, triple-acting cylinders, temperature, and deformation sensors, and automatic control. Foundation pit excavation is dynamic, yet structure tension is constant. There is no scientific way to regulate axial force foundation pit excavation. The revolutionary Servo system adjusts temperature, compression, and axial force to deform pits. System control requires foundation pit direction detection and modification. This engineering method has performed effectively for deep foundation pit excavation at railway crossings and other infrastructure projects. The surrounding protective structure may reduce the steel support's axial stress, making deep foundation excavation safe and efficient. Keywords: Servo systems, Steel strut support design, Deformation control, Monitoring and control, Deep excavation projects.
△ Less
Submitted 29 July, 2024;
originally announced July 2024.
-
Design and Optimization on Successive RIS-assisted Multi-hop Wireless Communications
Authors:
Rujing Xiong,
Jialong Lu,
Jianan Zhang,
Minggang Liu,
Xuehui Dong,
Tiebin Mi,
Robert Caiming Qiu
Abstract:
As an emerging wireless communication technology, reconfigurable intelligent surface (RIS) has become a basic choice for providing signal coverage services in scenarios with dense obstacles or long tunnels through multi-hop configurations. Conventional works of literature mainly focus on alternating optimization or single-beam calculation in RIS phase configuration, which is limited in considering…
▽ More
As an emerging wireless communication technology, reconfigurable intelligent surface (RIS) has become a basic choice for providing signal coverage services in scenarios with dense obstacles or long tunnels through multi-hop configurations. Conventional works of literature mainly focus on alternating optimization or single-beam calculation in RIS phase configuration, which is limited in considering energy efficiency, and often suffers from inaccurate channel state information (CSI), poor convergence, and high computational complexity. This paper addresses the design and optimization challenges for successive RIS-assisted multi-hop systems. Specifically, we establish a general model for multi-hop communication based on the relationship between the input and output electric fields within each RIS. Meanwhile, we derive the half-power beamwidth of the RIS-reflected beams, considering the beam direction. Leveraging these models and derivations, we propose deployment optimization and beam optimization strategies for multi-hop systems, which feature high aperture efficiency and significant gains in signal power. Simulation and prototype experiment results validate the effectiveness and superiority of the proposed systems and methods.
△ Less
Submitted 14 July, 2024;
originally announced July 2024.
-
Arbitrary Waveform Generated Metasurface: A New Paradigm for Direct Modulation and Beamforming Decoupling
Authors:
Xuehui Dong,
Bokai Lai,
Rujing Xiong,
Jianan Zhang,
Miyu Feng,
Tiebin Mi,
Robert Caiming Qiu
Abstract:
Information Metasurface, also known as reconfigurable intelligent surface (RIS) has gained significant attention owing to its impressive abilities in electromagnetic (EM) wave manipulation with simple structures. Numerous studies focus on achieving efficient and versatile information transmission using RIS across various fields like wireless communication, radar detection, integrated sensing, and…
▽ More
Information Metasurface, also known as reconfigurable intelligent surface (RIS) has gained significant attention owing to its impressive abilities in electromagnetic (EM) wave manipulation with simple structures. Numerous studies focus on achieving efficient and versatile information transmission using RIS across various fields like wireless communication, radar detection, integrated sensing, and communications, among others. Previous studies demonstrate diverse approaches to achieve reflection modulation by utilizing the superposition of the quantified reflection coefficient (RC) of each unit but suffer from the computing complexity of codebook sequence, the safety of communication, and the flexibility of modulation. To address these challenges, we introduce a novel concept of information metasurface, namely AWG-RIS, which is capable of independently producing arbitrary baseband waveforms and beam patterns through a design that decouples magnitude and phase, without changing the beam pattern. The AWG-RIS functions as a reflection mixer, directly embedding the intended signal into the incoming EM waves. Subsequently, we developed an analysis framework and introduced the waveform factor and beamforming factor into the new model, offering theoretical support for the transition from the control signal to the outgoing electromagnetic wave. Additionally, we unveil the world's first prototype showcasing passive arbitrary waveform generation while maintaining the beam pattern unaltered. Leveraging the decoupling of direct modulation and beamforming, we explore additional applications in several domains relative to traditional RISs. Finally, we present experiments that confirm the generation of arbitrary waveforms and particular spectrograms.
△ Less
Submitted 24 July, 2024; v1 submitted 5 July, 2024;
originally announced July 2024.
-
Spatio-temporal cooperative control Method of Highway Ramp Merge Based on Vehicle-road Coordination
Authors:
Xiaoxue Xu,
Maokai Lai,
Haitao Zhang,
Xiang Dong,
Tao Li,
Jie Wu,
Yuan Li,
Ting Peng
Abstract:
The merging area of highway ramps faces multiple challenges, including traffic congestion, collision risks, speed mismatches, driver behavior uncertainties, limited visibility, and bottleneck effects. However, autonomous vehicles engaging in depth coordination between vehicle and road in merging zones, by pre-planning and uploading travel trajectories, can significantly enhance the safety and effi…
▽ More
The merging area of highway ramps faces multiple challenges, including traffic congestion, collision risks, speed mismatches, driver behavior uncertainties, limited visibility, and bottleneck effects. However, autonomous vehicles engaging in depth coordination between vehicle and road in merging zones, by pre-planning and uploading travel trajectories, can significantly enhance the safety and efficiency of merging zones.In this paper,we mainly introduce mainline priority cooperation method to achieve the time and space cooperative control of highway merge.Vehicle-mounted intelligent units share real-time vehicle status and driving intentions with Road Section Management Units, which pre-plan the spatiotemporal trajectories of vehicle travel. After receiving these trajectories, Vehicle Intelligent Units strictly adhere to them. Through this deep collaboration between vehicles and roads, conflicts in time and space during vehicle travel are eliminated in advance.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
On the Impact of Sample Size in Reconstructing Noisy Graph Signals: A Theoretical Characterisation
Authors:
Baskaran Sripathmanathan,
Xiaowen Dong,
Michael Bronstein
Abstract:
Reconstructing a signal on a graph from noisy observations of a subset of the vertices is a fundamental problem in the field of graph signal processing. This paper investigates how sample size affects reconstruction error in the presence of noise via an in-depth theoretical analysis of the two most common reconstruction methods in the literature, least-squares reconstruction (LS) and graph-Laplaci…
▽ More
Reconstructing a signal on a graph from noisy observations of a subset of the vertices is a fundamental problem in the field of graph signal processing. This paper investigates how sample size affects reconstruction error in the presence of noise via an in-depth theoretical analysis of the two most common reconstruction methods in the literature, least-squares reconstruction (LS) and graph-Laplacian regularised reconstruction (GLR). Our theorems show that at sufficiently low signal-to-noise ratios (SNRs), under these reconstruction methods we may simultaneously decrease sample size and decrease average reconstruction error. We further show that at sufficiently low SNRs, for LS reconstruction we have a $Λ$-shaped error curve and for GLR reconstruction, a sample size of $ \mathcal{O}(\sqrt{N})$, where $N$ is the total number of vertices, results in lower reconstruction error than near full observation. We present thresholds on the SNRs, $τ$ and $τ_{GLR}$, below which the error is non-monotonic, and illustrate these theoretical results with experiments across multiple random graph models, sampling schemes and SNRs. These results demonstrate that any decision in sample-size choice has to be made in light of the noise levels in the data.
△ Less
Submitted 11 July, 2024; v1 submitted 24 June, 2024;
originally announced June 2024.
-
Frequency stabilization of self-sustained oscillations in a sideband-driven electromechanical resonator
Authors:
B. Zhang,
Yingming Yan,
X. Dong,
M. I. Dykman,
H. B. Chan
Abstract:
We present a method to stabilize the frequency of self-sustained vibrations in micro- and nanomechanical resonators. The method refers to a two-mode system with the vibrations at significantly different frequencies. The signal from one mode is used to control the other mode. In the experiment, self-sustained oscillations of micromechanical modes are excited by pumping at the blue-detuned sideband…
▽ More
We present a method to stabilize the frequency of self-sustained vibrations in micro- and nanomechanical resonators. The method refers to a two-mode system with the vibrations at significantly different frequencies. The signal from one mode is used to control the other mode. In the experiment, self-sustained oscillations of micromechanical modes are excited by pumping at the blue-detuned sideband of the higher-frequency mode. Phase fluctuations of the two modes show near perfect anti-correlation. They can be compensated in either one of the modes by a stepwise change of the pump phase. The phase change of the controlled mode is proportional to the pump phase change, with the proportionality constant independent of the pump amplitude and frequency. This finding allows us to stabilize the phase of one mode against phase diffusion using the measured phase of the other mode. We demonstrate that phase fluctuations of either the high or low frequency mode can be significantly reduced. The results open new opportunities in generating stable vibrations in a broad frequency range via parametric downconversion in nonlinear resonators.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
STAR-RIS Aided Secure MIMO Communication Systems
Authors:
Xiequn Dong,
Zesong Fei,
Xinyi Wang,
Meng Hua,
Qingqing Wu
Abstract:
This paper investigates simultaneous transmission and reflection reconfigurable intelligent surface (STAR-RIS) aided physical layer security (PLS) in multiple-input multiple-output (MIMO) systems, where the base station (BS) transmits secrecy information with the aid of STAR-RIS against multiple eavesdroppers equipped with multiple antennas. We aim to maximize the secrecy rate by jointly optimizin…
▽ More
This paper investigates simultaneous transmission and reflection reconfigurable intelligent surface (STAR-RIS) aided physical layer security (PLS) in multiple-input multiple-output (MIMO) systems, where the base station (BS) transmits secrecy information with the aid of STAR-RIS against multiple eavesdroppers equipped with multiple antennas. We aim to maximize the secrecy rate by jointly optimizing the active beamforming at the BS and passive beamforming at the STAR-RIS, subject to the hardware constraint for STAR-RIS. To handle the coupling variables, a minimum mean-square error (MMSE) based alternating optimization (AO) algorithm is applied. In particular, the amplitudes and phases of STAR-RIS are divided into two blocks to simplify the algorithm design. Besides, by applying the Majorization-Minimization (MM) method, we derive a closed-form expression of the STAR-RIS's phase shifts. Numerical results show that the proposed scheme significantly outperforms various benchmark schemes, especially as the number of STAR-RIS elements increases.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis
Authors:
Shaojie Li,
Haichen Qu,
Xinqi Dong,
Bo Dang,
Hengyi Zang,
Yulu Gong
Abstract:
Exploring the application of deep learning technologies in the field of medical diagnostics, Magnetic Resonance Imaging (MRI) provides a unique perspective for observing and diagnosing complex neurodegenerative diseases such as Alzheimer Disease (AD). With advancements in deep learning, particularly in Convolutional Neural Networks (CNNs) and the Xception network architecture, we are now able to a…
▽ More
Exploring the application of deep learning technologies in the field of medical diagnostics, Magnetic Resonance Imaging (MRI) provides a unique perspective for observing and diagnosing complex neurodegenerative diseases such as Alzheimer Disease (AD). With advancements in deep learning, particularly in Convolutional Neural Networks (CNNs) and the Xception network architecture, we are now able to analyze and classify vast amounts of MRI data with unprecedented accuracy. The progress of this technology not only enhances our understanding of brain structural changes but also opens up new avenues for monitoring disease progression through non-invasive means and potentially allows for precise diagnosis in the early stages of the disease.
This study aims to classify MRI images using deep learning models to identify different stages of Alzheimer Disease through a series of innovative data processing and model construction steps. Our experimental results show that the deep learning framework based on the Xception model achieved a 99.6% accuracy rate in the multi-class MRI image classification task, demonstrating its potential application value in assistive diagnosis. Future research will focus on expanding the dataset, improving model interpretability, and clinical validation to further promote the application of deep learning technology in the medical field, with the hope of bringing earlier diagnosis and more personalized treatment plans to Alzheimer Disease patients.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation
Authors:
Shuangrui Ding,
Zihan Liu,
Xiaoyi Dong,
Pan Zhang,
Rui Qian,
Conghui He,
Dahua Lin,
Jiaqi Wang
Abstract:
We present SongComposer, an innovative LLM designed for song composition. It could understand and generate melodies and lyrics in symbolic song representations, by leveraging the capability of LLM. Existing music-related LLM treated the music as quantized audio signals, while such implicit encoding leads to inefficient encoding and poor flexibility. In contrast, we resort to symbolic song represen…
▽ More
We present SongComposer, an innovative LLM designed for song composition. It could understand and generate melodies and lyrics in symbolic song representations, by leveraging the capability of LLM. Existing music-related LLM treated the music as quantized audio signals, while such implicit encoding leads to inefficient encoding and poor flexibility. In contrast, we resort to symbolic song representation, the mature and efficient way humans designed for music, and enable LLM to explicitly compose songs like humans. In practice, we design a novel tuple design to format lyric and three note attributes (pitch, duration, and rest duration) in the melody, which guarantees the correct LLM understanding of musical symbols and realizes precise alignment between lyrics and melody. To impart basic music understanding to LLM, we carefully collected SongCompose-PT, a large-scale song pretraining dataset that includes lyrics, melodies, and paired lyrics-melodies in either Chinese or English. After adequate pre-training, 10K carefully crafted QA pairs are used to empower the LLM with the instruction-following capability and solve diverse tasks. With extensive experiments, SongComposer demonstrates superior performance in lyric-to-melody generation, melody-to-lyric generation, song continuation, and text-to-song creation, outperforming advanced LLMs like GPT-4.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Simplifying Hypergraph Neural Networks
Authors:
Bohan Tang,
Zexi Liu,
Keyue Jiang,
Siheng Chen,
Xiaowen Dong
Abstract:
Hypergraphs are crucial for modeling higher-order interactions in real-world data. Hypergraph neural networks (HNNs) effectively utilise these structures by message passing to generate informative node features for various downstream tasks like node classification. However, the message passing block in existing HNNs typically requires a computationally intensive training process, which limits thei…
▽ More
Hypergraphs are crucial for modeling higher-order interactions in real-world data. Hypergraph neural networks (HNNs) effectively utilise these structures by message passing to generate informative node features for various downstream tasks like node classification. However, the message passing block in existing HNNs typically requires a computationally intensive training process, which limits their practical use. To tackle this challenge, we propose an alternative approach by decoupling the usage of the hypergraph structural information from the model training stage. The proposed model, simplified hypergraph neural network (SHNN), contains a training-free message-passing block that can be precomputed before the training of SHNN, thereby reducing the computational burden. We theoretically support the efficiency and effectiveness of SHNN by showing that: 1) It is more training-efficient compared to existing HNNs; 2) It utilises as much information as existing HNNs for node feature generation; and 3) It is robust against the oversmoothing issue while using long-range interactions. Experiments based on six real-world hypergraph benchmarks in node classification and hyperlink prediction present that, compared to state-of-the-art HNNs, SHNN shows both competitive performance and superior training efficiency. Specifically, on Cora-CA, SHNN achieves the highest node classification accuracy with just 2% training time of the best baseline.
△ Less
Submitted 22 May, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Hypergraph-MLP: Learning on Hypergraphs without Message Passing
Authors:
Bohan Tang,
Siheng Chen,
Xiaowen Dong
Abstract:
Hypergraphs are vital in modelling data with higher-order relations containing more than two entities, gaining prominence in machine learning and signal processing. Many hypergraph neural networks leverage message passing over hypergraph structures to enhance node representation learning, yielding impressive performances in tasks like hypergraph node classification. However, these message-passing-…
▽ More
Hypergraphs are vital in modelling data with higher-order relations containing more than two entities, gaining prominence in machine learning and signal processing. Many hypergraph neural networks leverage message passing over hypergraph structures to enhance node representation learning, yielding impressive performances in tasks like hypergraph node classification. However, these message-passing-based models face several challenges, including oversmoothing as well as high latency and sensitivity to structural perturbations at inference time. To tackle those challenges, we propose an alternative approach where we integrate the information about hypergraph structures into training supervision without explicit message passing, thus also removing the reliance on it at inference. Specifically, we introduce Hypergraph-MLP, a novel learning framework for hypergraph-structured data, where the learning model is a straightforward multilayer perceptron (MLP) supervised by a loss function based on a notion of signal smoothness on hypergraphs. Experiments on hypergraph node classification tasks demonstrate that Hypergraph-MLP achieves competitive performance compared to existing baselines, and is considerably faster and more robust against structural perturbations at inference.
△ Less
Submitted 2 June, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
Wireless Communications in Cavity: A Reconfigurable Boundary Modulation based Approach
Authors:
Xuehui Dong,
Xiang Ren,
Bokai Lai,
Rujing Xiong,
Tiebin Mi,
Robert Caiming Qiu
Abstract:
This paper explores the potential wireless communication applications of Reconfigurable Intelligent Surfaces (RIS) in reverberant wave propagation environments. Unlike in free space, we utilize the sensitivity to boundaries of the enclosed electromagnetic (EM) field and the equivalent perturbation of RISs. For the first time, we introduce the framework of reconfigurable boundary modulation in the…
▽ More
This paper explores the potential wireless communication applications of Reconfigurable Intelligent Surfaces (RIS) in reverberant wave propagation environments. Unlike in free space, we utilize the sensitivity to boundaries of the enclosed electromagnetic (EM) field and the equivalent perturbation of RISs. For the first time, we introduce the framework of reconfigurable boundary modulation in the cavities . We have proposed a robust boundary modulation scheme that exploits the continuity of object motion and the mutation of the codebook switch, which achieves pulse position modulation (PPM) by RIS-generated equivalent pulses for wireless communication in cavities. This approach achieves around 2 Mbps bit rate in the prototype and demonstrates strong resistance to channel's frequency selectivity resulting in an extremely low bit error rate (BER).
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Hypergraph Structure Inference From Data Under Smoothness Prior
Authors:
Bohan Tang,
Siheng Chen,
Xiaowen Dong
Abstract:
Hypergraphs are important for processing data with higher-order relationships involving more than two entities. In scenarios where explicit hypergraphs are not readily available, it is desirable to infer a meaningful hypergraph structure from the node features to capture the intrinsic relations within the data. However, existing methods either adopt simple pre-defined rules that fail to precisely…
▽ More
Hypergraphs are important for processing data with higher-order relationships involving more than two entities. In scenarios where explicit hypergraphs are not readily available, it is desirable to infer a meaningful hypergraph structure from the node features to capture the intrinsic relations within the data. However, existing methods either adopt simple pre-defined rules that fail to precisely capture the distribution of the potential hypergraph structure, or learn a mapping between hypergraph structures and node features but require a large amount of labelled data, i.e., pre-existing hypergraph structures, for training. Both restrict their applications in practical scenarios. To fill this gap, we propose a novel smoothness prior that enables us to design a method to infer the probability for each potential hyperedge without labelled data as supervision. The proposed prior indicates features of nodes in a hyperedge are highly correlated by the features of the hyperedge containing them. We use this prior to derive the relation between the hypergraph structure and the node features via probabilistic modelling. This allows us to develop an unsupervised inference method to estimate the probability for each potential hyperedge via solving an optimisation problem that has an analytical solution. Experiments on both synthetic and real-world data demonstrate that our method can learn meaningful hypergraph structures from data more efficiently than existing hypergraph structure inference methods.
△ Less
Submitted 31 August, 2023; v1 submitted 27 August, 2023;
originally announced August 2023.
-
Network Momentum across Asset Classes
Authors:
Xingyue Pu,
Stephen Roberts,
Xiaowen Dong,
Stefan Zohren
Abstract:
We investigate the concept of network momentum, a novel trading signal derived from momentum spillover across assets. Initially observed within the confines of pairwise economic and fundamental ties, such as the stock-bond connection of the same company and stocks linked through supply-demand chains, momentum spillover implies a propagation of momentum risk premium from one asset to another. The s…
▽ More
We investigate the concept of network momentum, a novel trading signal derived from momentum spillover across assets. Initially observed within the confines of pairwise economic and fundamental ties, such as the stock-bond connection of the same company and stocks linked through supply-demand chains, momentum spillover implies a propagation of momentum risk premium from one asset to another. The similarity of momentum risk premium, exemplified by co-movement patterns, has been spotted across multiple asset classes including commodities, equities, bonds and currencies. However, studying the network effect of momentum spillover across these classes has been challenging due to a lack of readily available common characteristics or economic ties beyond the company level. In this paper, we explore the interconnections of momentum features across a diverse range of 64 continuous future contracts spanning these four classes. We utilise a linear and interpretable graph learning model with minimal assumptions to reveal the intricacies of the momentum spillover network. By leveraging the learned networks, we construct a network momentum strategy that exhibits a Sharpe ratio of 1.5 and an annual return of 22%, after volatility scaling, from 2000 to 2022. This paper pioneers the examination of momentum spillover across multiple asset classes using only pricing data, presents a multi-asset investment strategy based on network momentum, and underscores the effectiveness of this strategy through robust empirical analysis.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
On the Impact of Sample Size in Reconstructing Graph Signals
Authors:
Baskaran Sripathmanathan,
Xiaowen Dong,
Michael Bronstein
Abstract:
Reconstructing a signal on a graph from observations on a subset of the vertices is a fundamental problem in the field of graph signal processing. It is often assumed that adding additional observations to an observation set will reduce the expected reconstruction error. We show that under the setting of noisy observation and least-squares reconstruction this is not always the case, characterising…
▽ More
Reconstructing a signal on a graph from observations on a subset of the vertices is a fundamental problem in the field of graph signal processing. It is often assumed that adding additional observations to an observation set will reduce the expected reconstruction error. We show that under the setting of noisy observation and least-squares reconstruction this is not always the case, characterising the behaviour both theoretically and experimentally.
△ Less
Submitted 1 July, 2023;
originally announced July 2023.
-
View Adaptive Light Field Deblurring Networks with Depth Perception
Authors:
Zeqi Shen,
Shuo Zhang,
Zhuhao Zhang,
Qihua Chen,
Xueyao Dong,
Youfang Lin
Abstract:
The Light Field (LF) deblurring task is a challenging problem as the blur images are caused by different reasons like the camera shake and the object motion. The single image deblurring method is a possible way to solve this problem. However, since it deals with each view independently and cannot effectively utilize and maintain the LF structure, the restoration effect is usually not ideal. Beside…
▽ More
The Light Field (LF) deblurring task is a challenging problem as the blur images are caused by different reasons like the camera shake and the object motion. The single image deblurring method is a possible way to solve this problem. However, since it deals with each view independently and cannot effectively utilize and maintain the LF structure, the restoration effect is usually not ideal. Besides, the LF blur is more complex because the degree is affected by the views and depth. Therefore, we carefully designed a novel LF deblurring network based on the LF blur characteristics. On one hand, since the blur degree varies a lot in different views, we design a novel view adaptive spatial convolution to deblur blurred LFs, which calculates the exclusive convolution kernel for each view. On the other hand, because the blur degree also varies with the depth of the object, a depth perception view attention is designed to deblur different depth areas by selectively integrating information from different views. Besides, we introduce an angular position embedding to maintain the LF structure better, which ensures the model correctly restores the view information. Quantitative and qualitative experimental results on synthetic and real images show that the deblurring effect of our method is better than other state-of-the-art methods.
△ Less
Submitted 13 March, 2023;
originally announced March 2023.
-
Multi-RIS-aided Wireless Communications in Real-world: Prototyping and Field Trials
Authors:
Rujing Xiong,
Jianan Zhang,
Xuehui Dong,
Zhengyu Wang,
Junshuo Liu,
Wei Yang,
Tiebin Mi,
Wenbo Huang,
Robert Caiming Qiu
Abstract:
The performance of multiple reconfigurable intelligent surfaces (RISs) receives limited attention in previous studies. This article fills this research gap by investigating the capabilities of multiple RISs in real-world networks. We propose a simplified yet highly scalable sandwich architecture for implementing one-bit unit cells, with the flexibility to accommodate multi-bit unit cells. To effec…
▽ More
The performance of multiple reconfigurable intelligent surfaces (RISs) receives limited attention in previous studies. This article fills this research gap by investigating the capabilities of multiple RISs in real-world networks. We propose a simplified yet highly scalable sandwich architecture for implementing one-bit unit cells, with the flexibility to accommodate multi-bit unit cells. To effectively control multiple RISs, we present a cost-effective remote-controlling scheme and develop a cloud-based RIS management system. Through a series of four field trials, we demonstrate the effectiveness of multi-hop routing schemes in establishing reliable links. Our experiments reveal significant improvements in signal strength and data transmission in multi-RIS-aided Wi-Fi and commercial 5G networks. Furthermore, we investigate the power scaling law of RIS-aided beamforming and provide insights into the roles of the later nodes in multi-hop relay chains.
△ Less
Submitted 1 July, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
High speed free-space optical communication using standard fiber communication component without optical amplification
Authors:
Yao Zhang,
Hua-Ying Liu,
Xiaoyi Liu,
Peng Xu,
Xiang Dong,
Pengfei Fan,
Xiaohui Tian,
Hua Yu,
Dong Pan,
Zhijun Yin,
Guilu Long,
Shi-Ning Zhu,
Zhenda Xie
Abstract:
Free-space optical communication (FSO) can achieve fast, secure and license-free communication without need for physical cables, making it a cost-effective, energy-efficient and flexible solution when the fiber connection is unavailable. To establish FSO connection on-demand, it is essential to build portable FSO devices with compact structure and light weight. Here, we develop a miniaturized FSO…
▽ More
Free-space optical communication (FSO) can achieve fast, secure and license-free communication without need for physical cables, making it a cost-effective, energy-efficient and flexible solution when the fiber connection is unavailable. To establish FSO connection on-demand, it is essential to build portable FSO devices with compact structure and light weight. Here, we develop a miniaturized FSO system and realize 9.16 Gbps FSO between two nodes that is 1 km apart, using a commercial single-mode-fiber-coupled optical transceiver module without optical amplification. Using our 4-stage acquisition, pointing and tracking (APT) systems, the tracking error is within 3 μrad and results an average link loss of 13.7 dB, which is the key for this high-bandwidth FSO demonstration without optical amplification. Our FSO link has been tested up to 4 km, with link loss of 18 dB that is limited by the foggy weather during the test. Longer FSO distances can be expected with better weather condition and optical amplification. With single FSO device weight of only 9.5 kg, this result arouses massive applications of field-deployable high-speed wireless communication.
△ Less
Submitted 16 April, 2023; v1 submitted 27 February, 2023;
originally announced February 2023.
-
Transforming RIS-Assisted Passive Beamforming from Tedious to Simple: A Relaxation Algorithm for Rician Channel
Authors:
Xuehui Dong,
Rujing Xiong,
Tiebin Mi,
Yuan Xie,
Robert Caiming Qiu
Abstract:
This paper investigates the problem of maximizing the signal-to-noise ratio (SNR) in reconfigurable intelligent surface (RIS)-assisted MISO communication systems. The problem will be reformulated as a complex quadratic form problem with unit circle constraints. We proved that the SNR maximizing problem has a closed-form global optimal solution when it is a rank-one problem, whereas the former rese…
▽ More
This paper investigates the problem of maximizing the signal-to-noise ratio (SNR) in reconfigurable intelligent surface (RIS)-assisted MISO communication systems. The problem will be reformulated as a complex quadratic form problem with unit circle constraints. We proved that the SNR maximizing problem has a closed-form global optimal solution when it is a rank-one problem, whereas the former researchers regarded it as an optimization problem. Moreover, We propose a relaxation algorithm (RA) that relaxes the constraints to that of Rayleigh's quotient problem and then projects the solution back, where the SNR obtained by RA achieves much the same SNR as the upper bound but with significantly low time consumption. Then we asymptotically analyze its performance when the transmitter antennas n_t and the number of units of RIS N grow large together, with N/n_t -> c. Finally, our numerical simulations show that RA achieves over 98% of the performance of the upper bound and takes below 1% time consumption of manifold optimization (MO) and 0.1% of semidefinite relaxation (SDR).
△ Less
Submitted 21 November, 2022; v1 submitted 11 November, 2022;
originally announced November 2022.
-
Optimal Discrete Beamforming of RIS-Aided Wireless Communications: An Inner Product Maximization Approach
Authors:
Rujing Xiong,
Xuihui Dong,
Tiebin Mi,
Kai Wan,
Robert Caiming Qiu
Abstract:
This paper studies the beamforming optimization challenge in reconfigurable intelligent surface (RIS)-aided multiple-input single-output (MISO) systems, where the RIS phase configuration is discrete. Conventional optimization methods for this discrete optimization problem necessitate resource-intensive exponential search and thus fall within the universal (NP-hard) category. We formally define thi…
▽ More
This paper studies the beamforming optimization challenge in reconfigurable intelligent surface (RIS)-aided multiple-input single-output (MISO) systems, where the RIS phase configuration is discrete. Conventional optimization methods for this discrete optimization problem necessitate resource-intensive exponential search and thus fall within the universal (NP-hard) category. We formally define this task as a discrete inner product maximization problem. Leveraging the inherent structure of this problem, we propose an efficient divide-and-sort (DaS) search algorithm to reach the global optimality for the maximization problem. The complexity of the proposed algorithm can be minimized to $\mathcal{O}(2^BN)$, a linear correlation with the count of phase discrete levels $2^B$ and reflecting units $N$. This is notably lower than the exhaustive search complexity of $\mathcal{O}(2^{BN})$. Numerical evaluations and experiments over real prototype also demonstrate the efficiency of the proposed DaS algorithm. Finally, by using the proposed algorithm, we show that over some resolution quantization level on each RIS unit (4-bit and above), there is no noticeable difference in power gains between continuous and discrete phase configurations.
△ Less
Submitted 14 January, 2024; v1 submitted 8 November, 2022;
originally announced November 2022.
-
A Large-Scale Study of a Sleep Tracking and Improving Device with Closed-loop and Personalized Real-time Acoustic Stimulation
Authors:
Anh Nguyen,
Galen Pogoncheff,
Ban Xuan Dong,
Nam Bui,
Hoang Truong,
Nhat Pham,
Linh Nguyen,
Hoang Huu Nguyen,
Sy Duong-Quy,
Sangtae Ha,
Tam Vu
Abstract:
Various intervention therapies ranging from pharmaceutical to hi-tech tailored solutions have been available to treat difficulty in falling asleep commonly caused by insomnia in modern life. However, current techniques largely remain ill-suited, ineffective, and unreliable due to their lack of precise real-time sleep tracking, in-time feedback on the therapies, an ability to keep people asleep dur…
▽ More
Various intervention therapies ranging from pharmaceutical to hi-tech tailored solutions have been available to treat difficulty in falling asleep commonly caused by insomnia in modern life. However, current techniques largely remain ill-suited, ineffective, and unreliable due to their lack of precise real-time sleep tracking, in-time feedback on the therapies, an ability to keep people asleep during the night, and a large-scale effectiveness evaluation. Here, we introduce a novel sleep aid system, called Earable, that can continuously sense multiple head-based physiological signals and simultaneously enable closed-loop auditory stimulation to entrain brain activities in time for effective sleep promotion. We develop the system in a lightweight, comfortable, and user-friendly headband with a comprehensive set of algorithms and dedicated own-designed audio stimuli. We conducted multiple protocols from 883 sleep studies on 377 subjects (241 women, 119 men) wearing either a gold-standard device (PSG), Earable, or both concurrently. We demonstrate that our system achieves (1) a strong correlation (0.89 +/- 0.03) between the physiological signals acquired by Earable and those from the gold-standard PSG, (2) an 87.8 +/- 5.3% agreement on sleep scoring using our automatic real-time sleep staging algorithm with the consensus scored by three sleep technicians, and (3) a successful non-pharmacological stimulation alternative to effectively shorten the duration of sleep falling by 24.1 +/- 0.1 minutes. These results show that the efficacy of Earable exceeds existing techniques in intentions to promote fast falling asleep, track sleep state accurately, and achieve high social acceptance for real-time closed-loop personalized neuromodulation-based home sleep care.
△ Less
Submitted 4 November, 2022;
originally announced November 2022.
-
Learning Hypergraphs From Signals With Dual Smoothness Prior
Authors:
Bohan Tang,
Siheng Chen,
Xiaowen Dong
Abstract:
Hypergraph structure learning, which aims to learn the hypergraph structures from the observed signals to capture the intrinsic high-order relationships among the entities, becomes crucial when a hypergraph topology is not readily available in the datasets. There are two challenges that lie at the heart of this problem: 1) how to handle the huge search space of potential hyperedges, and 2) how to…
▽ More
Hypergraph structure learning, which aims to learn the hypergraph structures from the observed signals to capture the intrinsic high-order relationships among the entities, becomes crucial when a hypergraph topology is not readily available in the datasets. There are two challenges that lie at the heart of this problem: 1) how to handle the huge search space of potential hyperedges, and 2) how to define meaningful criteria to measure the relationship between the signals observed on nodes and the hypergraph structure. In this paper, for the first challenge, we adopt the assumption that the ideal hypergraph structure can be derived from a learnable graph structure that captures the pairwise relations within signals. Further, we propose a hypergraph structure learning framework HGSL with a novel dual smoothness prior that reveals a mapping between the observed node signals and the hypergraph structure, whereby each hyperedge corresponds to a subgraph with both node signal smoothness and edge signal smoothness in the learnable graph structure. Finally, we conduct extensive experiments to evaluate HGSL on both synthetic and real world datasets. Experiments show that HGSL can efficiently infer meaningful hypergraph topologies from observed signals.
△ Less
Submitted 14 March, 2023; v1 submitted 3 November, 2022;
originally announced November 2022.
-
Distributed Reconfigurable Intelligent Surfaces for Energy Efficient Indoor Terahertz Wireless Communications
Authors:
Yiming Huo,
Xiaodai Dong,
Nuwan Ferdinand
Abstract:
With the fifth-generation (5G) networks widely commercialized and fast deployed, the sixth-generation (6G) wireless communication is envisioned to provide competitive quality of service (QoS) in multiple aspects to global users. The critical and underlying research of the 6G is, firstly, highly dependent on the precise modeling and characterization of the wireless propagation when the spectrum is…
▽ More
With the fifth-generation (5G) networks widely commercialized and fast deployed, the sixth-generation (6G) wireless communication is envisioned to provide competitive quality of service (QoS) in multiple aspects to global users. The critical and underlying research of the 6G is, firstly, highly dependent on the precise modeling and characterization of the wireless propagation when the spectrum is believed to expand to the terahertz (THz) domain. Moreover, future networks' power consumption and energy efficiency are critical factors to consider. In this research, based on a review of the fundamental mechanisms of reconfigurable intelligent surface (RIS) assisted wireless communications, we utilize the 3D ray-tracing method to analyze a realistic indoor THz propagation environment with the existence of human blockers. Furthermore, we propose a distributed RISs framework (DRF) to assist the indoor THz wireless communication to achieve overall energy efficiency. The numerical analysis of simulation results based on more than 2,900 indoor THz wireless communication sub-scenarios has demonstrated the significant efficacy of applying distributed RISs to overcome the mobile human blockage issue, improve the THz signal coverage, increase signal-to-noise ratios (SNRs), and QoS. With practical hardware design constraints investigated, we eventually envision how to utilize the existing integrated sensing and communication techniques to deploy and operate such a system in reality. Such a distributed RISs framework can also lay the foundation of efficient THz communications for Internet-of-Things (IoT) networks.
△ Less
Submitted 12 October, 2022;
originally announced October 2022.
-
RIS-aided Wireless Communication with $1$-bit Discrete Optimization for Signal Enhancement
Authors:
Rujing Xiong,
Xuehui Dong,
Tiebin Mi,
Robert caiming Qiu
Abstract:
In recent years, a brand-new technology, reconfigurable intelligent surface (RIS) has been widely studied for reconfiguring the wireless propagation environment. RIS is an artificial surface of electromagnetic material that is capable of customizing the propagation of the wave impinging upon it. Utilizing RIS for communication service like signal enhancement usually lead to non-convex optimization…
▽ More
In recent years, a brand-new technology, reconfigurable intelligent surface (RIS) has been widely studied for reconfiguring the wireless propagation environment. RIS is an artificial surface of electromagnetic material that is capable of customizing the propagation of the wave impinging upon it. Utilizing RIS for communication service like signal enhancement usually lead to non-convex optimization problems. Existing optimization methods either suffers from scalability issues for $N$ number of RIS elements large, or may lead to suboptimal solutions in some scenario. In this paper, we propose a divide-and-sort (DaS) discrete optimization approach, that is guaranteed to find the global optimal phase shifts for $1$-bit RIS, and has time complexity $\mathcal{O}(N \log(N))$. Numerical experiments show that the proposed approach achieves a better ``performance--complexity tradeoff'' over other methods for $1$-bit RIS.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
TFN: An Interpretable Neural Network with Time-Frequency Transform Embedded for Intelligent Fault Diagnosis
Authors:
Qian Chen,
Xingjian Dong,
Guowei Tu,
Dong Wang,
Baoxuan Zhao,
Zhike Peng
Abstract:
Convolutional Neural Networks (CNNs) are widely used in fault diagnosis of mechanical systems due to their powerful feature extraction and classification capabilities. However, the CNN is a typical black-box model, and the mechanism of CNN's decision-making are not clear, which limits its application in high-reliability-required fault diagnosis scenarios. To tackle this issue, we propose a novel i…
▽ More
Convolutional Neural Networks (CNNs) are widely used in fault diagnosis of mechanical systems due to their powerful feature extraction and classification capabilities. However, the CNN is a typical black-box model, and the mechanism of CNN's decision-making are not clear, which limits its application in high-reliability-required fault diagnosis scenarios. To tackle this issue, we propose a novel interpretable neural network termed as Time-Frequency Network (TFN), where the physically meaningful time-frequency transform (TFT) method is embedded into the traditional convolutional layer as an adaptive preprocessing layer. This preprocessing layer named as time-frequency convolutional (TFconv) layer, is constrained by a well-designed kernel function to extract fault-related time-frequency information. It not only improves the diagnostic performance but also reveals the logical foundation of the CNN prediction in the frequency domain. Different TFT methods correspond to different kernel functions of the TFconv layer. In this study, four typical TFT methods are considered to formulate the TFNs and their effectiveness and interpretability are proved through three mechanical fault diagnosis experiments. Experimental results also show that the proposed TFconv layer can be easily generalized to other CNNs with different depths. The code of TFN is available on https://github.com/ChenQian0618/TFN.
△ Less
Submitted 19 June, 2023; v1 submitted 5 September, 2022;
originally announced September 2022.
-
Calibrated Bagging Deep Learning for Image Semantic Segmentation: A Case Study on COVID-19 Chest X-ray Image
Authors:
Lucy Nwosu,
Xiangfang Li,
Lijun Qian,
Seungchan Kim,
Xishuang Dong
Abstract:
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) causes coronavirus disease 2019 (COVID-19). Imaging tests such as chest X-ray (CXR) and computed tomography (CT) can provide useful information to clinical staff for facilitating a diagnosis of COVID-19 in a more efficient and comprehensive manner. As a breakthrough of artificial intelligence (AI), deep learning has been applied to perfo…
▽ More
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) causes coronavirus disease 2019 (COVID-19). Imaging tests such as chest X-ray (CXR) and computed tomography (CT) can provide useful information to clinical staff for facilitating a diagnosis of COVID-19 in a more efficient and comprehensive manner. As a breakthrough of artificial intelligence (AI), deep learning has been applied to perform COVID-19 infection region segmentation and disease classification by analyzing CXR and CT data. However, prediction uncertainty of deep learning models for these tasks, which is very important to safety-critical applications like medical image processing, has not been comprehensively investigated. In this work, we propose a novel ensemble deep learning model through integrating bagging deep learning and model calibration to not only enhance segmentation performance, but also reduce prediction uncertainty. The proposed method has been validated on a large dataset that is associated with CXR image segmentation. Experimental results demonstrate that the proposed method can improve the segmentation performance, as well as decrease prediction uncertainties.
△ Less
Submitted 27 May, 2022;
originally announced June 2022.
-
Flying-Qubit Control via a Three-level Atom with Tunable Waveguide Couplings
Authors:
Wenlong Li,
Xue Dong,
Guofeng Zhang,
Re-Bing Wu
Abstract:
The control of flying qubits is at the core of quantum networks. As often carried by single-photon fields, the flying-qubit control involves not only their logical states but also their shapes. In this paper, we explore a variety of flying-qubit control problems using a three-level atom with time-varying tunable couplings to two input-output channels. It is shown that one can tune the couplings of…
▽ More
The control of flying qubits is at the core of quantum networks. As often carried by single-photon fields, the flying-qubit control involves not only their logical states but also their shapes. In this paper, we explore a variety of flying-qubit control problems using a three-level atom with time-varying tunable couplings to two input-output channels. It is shown that one can tune the couplings of a $Λ$-type atom to distribute a single photon into the two channels with arbitrary shapes, or use a $V$-type atom to catch an arbitrary-shape distributed single photon. The $Λ$-type atom can also be designed to transfer a flying qubit from one channel to the other, with both the central frequency and the photon shape being converted. With a $Ξ$-type atom, one can use the tunable coupling to shape a pair of correlated photons via cascaded emission. In all cases, analytical formulas are derived for the coupling functions to fulfil these control tasks, and their physical limitations are discussed as well. These results provide useful control protocols for high-fidelity quantum information transmission over complex quantum networks.
△ Less
Submitted 25 May, 2022; v1 submitted 24 May, 2022;
originally announced May 2022.
-
Penetration trajectory optimization for the hypersonic gliding vehicle encountering two interceptors
Authors:
Zhipeng Shen,
Jianglong Yu,
Xiwang Dong,
Yongzhao Hua,
Zhang Ren
Abstract:
The penetration trajectory optimization problem for the hypersonic gliding vehicle (HGV) encountering two interceptors is investigated. The HGV penetration trajectory optimization problem considering the terminal target area is formulated as a nonconvex optimal control problem. The nonconvex optimal control problem is transformed into a second-order cone programming (SOCP) problem, which can be so…
▽ More
The penetration trajectory optimization problem for the hypersonic gliding vehicle (HGV) encountering two interceptors is investigated. The HGV penetration trajectory optimization problem considering the terminal target area is formulated as a nonconvex optimal control problem. The nonconvex optimal control problem is transformed into a second-order cone programming (SOCP) problem, which can be solved by state-of-the-art interior-point methods. In addition, a penetration strategy that only requires the initial line-of-sight angle information of the interceptors is proposed. The convergent trajectory obtained by the proposed method allows the HGV to evade two interceptors and reach the target area successfully. Furthermore, a successive SOCP method with a variable trust region is presented, which is critical to balancing the trade-off between time consumption and optimality. Finally, the effectiveness and performance of the proposed method are verified by numerical simulations.
△ Less
Submitted 17 April, 2022;
originally announced April 2022.
-
Wi-Fi and Bluetooth Contact Tracing Without User Intervention
Authors:
Brosnan Yuen,
Yifeng Bie,
Duncan Cairns,
Geoffrey Harper,
Jason Xu,
Charles Chang,
Xiaodai Dong,
Tao Lu
Abstract:
Previous contact tracing systems required the users to perform many manual actions, such as installing smartphone applications, joining wireless networks, or carrying custom user devices. This increases the barrier to entry and lowers the user adoption rate. As a result, the contact tracing effectiveness is reduced. Unlike the systems above, we propose a new privacy preserving Wi-Fi and Bluetooth…
▽ More
Previous contact tracing systems required the users to perform many manual actions, such as installing smartphone applications, joining wireless networks, or carrying custom user devices. This increases the barrier to entry and lowers the user adoption rate. As a result, the contact tracing effectiveness is reduced. Unlike the systems above, we propose a new privacy preserving Wi-Fi and Bluetooth (BLE) contact tracing system that does not require smartphone applications, joining wireless networks, or custom user devices. Our specially built routers seamlessly track smartphones, laptops, smartwatches, BLE headphones, and tablets without any user action, but do not trace user identity. Mapping between devices and users is only carried out for confirmed cases and suspected contacts. Moreover, we can track the absolute positions of user devices within 1.0 m due to using bidirectional long short-term memory neural networks that are trained with data pre-collected by an autonomous robot. This allows public health authorities to track indirect droplet and surface transmissions that other contact tracing systems often overlook.
△ Less
Submitted 23 July, 2022; v1 submitted 30 March, 2022;
originally announced April 2022.
-
Abandoning the Bayer-Filter to See in the Dark
Authors:
Xingbo Dong,
Wanyan Xu,
Zhihui Miao,
Lan Ma,
Chao Zhang,
Jiewen Yang,
Zhe Jin,
Andrew Beng Jin Teoh,
Jiajun Shen
Abstract:
Low-light image enhancement - a pervasive but challenging problem, plays a central role in enhancing the visibility of an image captured in a poor illumination environment. Due to the fact that not all photons can pass the Bayer-Filter on the sensor of the color camera, in this work, we first present a De-Bayer-Filter simulator based on deep neural networks to generate a monochrome raw image from…
▽ More
Low-light image enhancement - a pervasive but challenging problem, plays a central role in enhancing the visibility of an image captured in a poor illumination environment. Due to the fact that not all photons can pass the Bayer-Filter on the sensor of the color camera, in this work, we first present a De-Bayer-Filter simulator based on deep neural networks to generate a monochrome raw image from the colored raw image. Next, a fully convolutional network is proposed to achieve the low-light image enhancement by fusing colored raw data with synthesized monochrome raw data. Channel-wise attention is also introduced to the fusion process to establish a complementary interaction between features from colored and monochrome raw images. To train the convolutional networks, we propose a dataset with monochrome and color raw pairs named Mono-Colored Raw paired dataset (MCR) collected by using a monochrome camera without Bayer-Filter and a color camera with Bayer-Filter. The proposed pipeline take advantages of the fusion of the virtual monochrome and the color raw images and our extensive experiments indicate that significant improvement can be achieved by leveraging raw sensor data and data-driven learning.
△ Less
Submitted 22 March, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
ROMNet: Renovate the Old Memories
Authors:
Runsheng Xu,
Zhengzhong Tu,
Yuanqi Du,
Xiaoyu Dong,
Jinlong Li,
Zibo Meng,
Jiaqi Ma,
Hongkai Yu
Abstract:
Renovating the memories in old photos is an intriguing research topic in computer vision fields. These legacy images often suffer from severe and commingled degradations such as cracks, noise, and color-fading, while lack of large-scale paired old photo datasets makes this restoration task very challenging. In this work, we present a novel reference-based end-to-end learning framework that can joi…
▽ More
Renovating the memories in old photos is an intriguing research topic in computer vision fields. These legacy images often suffer from severe and commingled degradations such as cracks, noise, and color-fading, while lack of large-scale paired old photo datasets makes this restoration task very challenging. In this work, we present a novel reference-based end-to-end learning framework that can jointly repair and colorize the degraded legacy pictures. Specifically, the proposed framework consists of three modules: a restoration sub-network for degradation restoration, a similarity sub-network for color histogram matching and transfer, and a colorization subnet that learns to predict the chroma elements of the images conditioned on chromatic reference signals. The whole system takes advantage of the color histogram priors in a given reference image, which vastly reduces the dependency on large-scale training data. Apart from the proposed method, we also create, to our knowledge, the first public and real-world old photo dataset with paired ground truth for evaluating old photo restoration models, wherein each old photo is paired with a manually restored pristine image by PhotoShop experts. Our extensive experiments conducted on both synthetic and real-world datasets demonstrate that our method significantly outperforms state-of-the-arts both quantitatively and qualitatively.
△ Less
Submitted 27 April, 2022; v1 submitted 5 February, 2022;
originally announced February 2022.
-
Passive Indoor Localization with WiFi Fingerprints
Authors:
Minh Tu Hoang,
Brosnan Yuen,
Kai Ren,
Ahmed Elmoogy,
Xiaodai Dong,
Tao Lu,
Robert Westendorp,
Kishore Reddy Tarimala
Abstract:
This paper proposes passive WiFi indoor localization. Instead of using WiFi signals received by mobile devices as fingerprints, we use signals received by routers to locate the mobile carrier. Consequently, software installation on the mobile device is not required. To resolve the data insufficiency problem, flow control signals such as request to send (RTS) and clear to send (CTS) are utilized. I…
▽ More
This paper proposes passive WiFi indoor localization. Instead of using WiFi signals received by mobile devices as fingerprints, we use signals received by routers to locate the mobile carrier. Consequently, software installation on the mobile device is not required. To resolve the data insufficiency problem, flow control signals such as request to send (RTS) and clear to send (CTS) are utilized. In our model, received signal strength indicator (RSSI) and channel state information (CSI) are used as fingerprints for several algorithms, including deterministic, probabilistic and neural networks localization algorithms. We further investigated localization algorithms performance through extensive on-site experiments with various models of phones at hundreds of testing locations. We demonstrate that our passive scheme achieves an average localization error of 0.8 m when the phone is actively transmitting data frames and 1.5 m when it is not transmitting data frames.
△ Less
Submitted 28 November, 2021;
originally announced November 2021.
-
Recognizing Vector Graphics without Rasterization
Authors:
Xinyang Jiang,
Lu Liu,
Caihua Shan,
Yifei Shen,
Xuanyi Dong,
Dongsheng Li
Abstract:
In this paper, we consider a different data format for images: vector graphics. In contrast to raster graphics which are widely used in image recognition, vector graphics can be scaled up or down into any resolution without aliasing or information loss, due to the analytic representation of the primitives in the document. Furthermore, vector graphics are able to give extra structural information o…
▽ More
In this paper, we consider a different data format for images: vector graphics. In contrast to raster graphics which are widely used in image recognition, vector graphics can be scaled up or down into any resolution without aliasing or information loss, due to the analytic representation of the primitives in the document. Furthermore, vector graphics are able to give extra structural information on how low-level elements group together to form high level shapes or structures. These merits of graphic vectors have not been fully leveraged in existing methods. To explore this data format, we target on the fundamental recognition tasks: object localization and classification. We propose an efficient CNN-free pipeline that does not render the graphic into pixels (i.e. rasterization), and takes textual document of the vector graphics as input, called YOLaT (You Only Look at Text). YOLaT builds multi-graphs to model the structural and spatial information in vector graphics, and a dual-stream graph neural network is proposed to detect objects from the graph. Our experiments show that by directly operating on vector graphics, YOLaT out-performs raster-graphic based object detection baselines in terms of both average precision and efficiency.
△ Less
Submitted 23 December, 2021; v1 submitted 5 November, 2021;
originally announced November 2021.
-
Learning to Learn Graph Topologies
Authors:
Xingyue Pu,
Tianyue Cao,
Xiaoyun Zhang,
Xiaowen Dong,
Siheng Chen
Abstract:
Learning a graph topology to reveal the underlying relationship between data entities plays an important role in various machine learning and data analysis tasks. Under the assumption that structured data vary smoothly over a graph, the problem can be formulated as a regularised convex optimisation over a positive semidefinite cone and solved by iterative algorithms. Classic methods require an exp…
▽ More
Learning a graph topology to reveal the underlying relationship between data entities plays an important role in various machine learning and data analysis tasks. Under the assumption that structured data vary smoothly over a graph, the problem can be formulated as a regularised convex optimisation over a positive semidefinite cone and solved by iterative algorithms. Classic methods require an explicit convex function to reflect generic topological priors, e.g. the $\ell_1$ penalty for enforcing sparsity, which limits the flexibility and expressiveness in learning rich topological structures. We propose to learn a mapping from node data to the graph structure based on the idea of learning to optimise (L2O). Specifically, our model first unrolls an iterative primal-dual splitting algorithm into a neural network. The key structural proximal projection is replaced with a variational autoencoder that refines the estimated graph with enhanced topological properties. The model is trained in an end-to-end fashion with pairs of node data and graph samples. Experiments on both synthetic and real-world data demonstrate that our model is more efficient than classic iterative algorithms in learning a graph with specific topological properties.
△ Less
Submitted 19 October, 2021;
originally announced October 2021.
-
A Variational Bayes Moving Horizon Estimation Adaptive Filter with Guaranteed Stability
Authors:
Xiangxiang Dong,
Giorgio Battistelli,
Luigi Chisci,
Yunze Cai
Abstract:
This paper addresses state estimation of linear systems with special attention on unknown process and measurement noise covariances, aiming to enhance estimation accuracy while preserving the stability guarantee of the Kalman filter. To this end, the full information estimation problem over a finite interval is firstly addressed. Then, a novel adaptive variational Bayesian (VB) moving horizon esti…
▽ More
This paper addresses state estimation of linear systems with special attention on unknown process and measurement noise covariances, aiming to enhance estimation accuracy while preserving the stability guarantee of the Kalman filter. To this end, the full information estimation problem over a finite interval is firstly addressed. Then, a novel adaptive variational Bayesian (VB) moving horizon estimation (MHE) method is proposed, exploiting VB inference, MHE and Monte Carlo integration with importance sampling for joint estimation of the unknown process and measurement noise covariances, along with the state trajectory over a moving window of fixed length. Further, it is proved that the proposed adaptive VB MHE filter ensures mean-square boundedness of the estimation error with any number of importance samples and VB iterations, as well as for any window length. Finally, simulation results on a target tracking example demonstrate the effectiveness of the VB MHE filter with enhanced estimation accuracy and convergence properties compared to the conventional non-adaptive Kalman filter and other existing adaptive filters.
△ Less
Submitted 10 October, 2021;
originally announced October 2021.
-
Fast query-by-example speech search using separable model
Authors:
Yuguang Yang,
Yu Pan,
Xin Dong,
Minqiang Xu
Abstract:
Traditional Query-by-Example (QbE) speech search approaches usually use methods based on frame-level features, while state-of-the-art approaches tend to use models based on acoustic word embeddings (AWEs) to transform variable length audio signals into fixed length feature vector representations. However, these approaches cannot meet the requirements of the search quality as well as speed at the s…
▽ More
Traditional Query-by-Example (QbE) speech search approaches usually use methods based on frame-level features, while state-of-the-art approaches tend to use models based on acoustic word embeddings (AWEs) to transform variable length audio signals into fixed length feature vector representations. However, these approaches cannot meet the requirements of the search quality as well as speed at the same time. In this paper, we propose a novel fast QbE speech search method based on separable models to fix this problem. First, a QbE speech search training framework is introduced. Second, we design a novel model inference scheme based on RepVGG which can efficiently improve the QbE search quality. Third, we modify and improve our QbE speech search model according to the proposed model inference scheme. Experiments on keywords dataset shows that our proposed method can improve the GPU Real-time Factor (RTF) from 1/150 to 1/2300 by just applying separable model scheme and outperforms other state-of-the-art methods.
△ Less
Submitted 18 September, 2021;
originally announced September 2021.
-
Vision-Based Target Localization for a Flapping-Wing Aerial Vehicle
Authors:
Xinghao Dong,
Qiang Fu,
Chunhua Zhang,
Wei He
Abstract:
The flapping-wing aerial vehicle (FWAV) is a new type of flying robot that mimics the flight mode of birds and insects. However, FWAVs have their special characteristics of less load capacity and short endurance time, so that most existing systems of ground target localization are not suitable for them. In this paper, a vision-based target localization algorithm is proposed for FWAVs based on a ge…
▽ More
The flapping-wing aerial vehicle (FWAV) is a new type of flying robot that mimics the flight mode of birds and insects. However, FWAVs have their special characteristics of less load capacity and short endurance time, so that most existing systems of ground target localization are not suitable for them. In this paper, a vision-based target localization algorithm is proposed for FWAVs based on a generic camera model. Since sensors exist measurement error and the camera exists jitter and motion blur during flight, Gaussian noises are introduced in the simulation experiment, and then a first-order low-pass filter is used to stabilize the localization values. Moreover, in order to verify the feasibility and accuracy of the target localization algorithm, we design a set of simulation experiments where various noises are added. From the simulation results, it is found that the target localization algorithm has a good performance.
△ Less
Submitted 14 July, 2021;
originally announced July 2021.
-
Joint Time and Power Allocation for 5G NR Unlicensed Systems
Authors:
Haizhou Bao,
Yiming Huo,
Xiaodai Dong,
Chuanhe Huang
Abstract:
The fifth-generation (5G) and beyond networks are designed to efficiently utilize the spectrum resources to meet various quality of service (QoS) requirements. The unlicensed frequency bands used by WiFi are mainly deployed for indoor applications and are not always fully occupied. The cellular industry has been working to enable cellular and WiFi coexistence. In particular, 5G New Radio in unlice…
▽ More
The fifth-generation (5G) and beyond networks are designed to efficiently utilize the spectrum resources to meet various quality of service (QoS) requirements. The unlicensed frequency bands used by WiFi are mainly deployed for indoor applications and are not always fully occupied. The cellular industry has been working to enable cellular and WiFi coexistence. In particular, 5G New Radio in unlicensed channel spectrum (NR-U) supports the uplink and downlink transmission on the maximum channel occupation time (MCOT) duration. In this paper, we consider maximizing the total throughput of both downlink and uplink in NR-U by jointly optimizing the time and power allocation during MCOT while ensuring fair coexistence with WiFi. Fairness is guaranteed in two steps: 1) tuning the access related parameters of NR-U to achieve proportional fairness, and 2) including 3GPP fairness from the throughput perspective as a constraint in NR-U throughput maximization. Numerical analysis and simulation have demonstrated the superior performance of the proposed resource allocation algorithm compared to conventional deployment strategies.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Meteorologically Introduced Impacts on Aerial Channels and UAV Communications
Authors:
Mengan Song,
Yiming Huo,
Tao Lu,
Xiaodai Dong,
Zhonghua Liang
Abstract:
As 5G wireless systems and networks are now being globally commercialized and deployed, more diversified application scenarios are emerging, quickly reshaping our societies and paving the road to the beyond 5G (6G) era when terahertz (THz) and unmanned aerial vehicle (UAV) communications may play critical roles. In this paper, aerial channel models under multiple meteorological conditions such as…
▽ More
As 5G wireless systems and networks are now being globally commercialized and deployed, more diversified application scenarios are emerging, quickly reshaping our societies and paving the road to the beyond 5G (6G) era when terahertz (THz) and unmanned aerial vehicle (UAV) communications may play critical roles. In this paper, aerial channel models under multiple meteorological conditions such as rain, fog and snow, have been investigated at frequencies of interest (from 2 GHz to 900 GHz) for UAV communications. Furthermore, the link budget and the received signal-to-noise ratio (SNR) performance under the existing air-to-ground (A2G) channel models are studied with antenna(s) system considered. The relationship between the 3D coverage radius and UAV altitude under the influence of multiple weather (MW) conditions is simulated. Numerical results show that medium rain has the most effects on the UAV's coverage for UAV communications at millimeter wave (mmWave) bands, while snow has the largest impacts at near THz bands. In addition, when the frequency increases, the corresponding increase in the number of antennas can effectively compensate for the propagation loss introduced by weather factors, while its form factor and weight can be kept to maintain the UAV's payload.
△ Less
Submitted 14 April, 2021;
originally announced April 2021.
-
Semi-supervised Learning for COVID-19 Image Classification via ResNet
Authors:
Lucy Nwosu,
Xiangfang Li,
Lijun Qian,
Seungchan Kim,
Xishuang Dong
Abstract:
Coronavirus disease 2019 (COVID-19) is an ongoing global pandemic in over 200 countries and territories, which has resulted in a great public health concern across the international community. Analysis of X-ray imaging data can play a critical role in timely and accurate screening and fighting against COVID-19. Supervised deep learning has been successfully applied to recognize COVID-19 pathology…
▽ More
Coronavirus disease 2019 (COVID-19) is an ongoing global pandemic in over 200 countries and territories, which has resulted in a great public health concern across the international community. Analysis of X-ray imaging data can play a critical role in timely and accurate screening and fighting against COVID-19. Supervised deep learning has been successfully applied to recognize COVID-19 pathology from X-ray imaging datasets. However, it requires a substantial amount of annotated X-ray images to train models, which is often not applicable to data analysis for emerging events such as COVID-19 outbreak, especially in the early stage of the outbreak. To address this challenge, this paper proposes a two-path semi-supervised deep learning model, ssResNet, based on Residual Neural Network (ResNet) for COVID-19 image classification, where two paths refer to a supervised path and an unsupervised path, respectively. Moreover, we design a weighted supervised loss that assigns higher weight for the minority classes in the training process to resolve the data imbalance. Experimental results on a large-scale of X-ray image dataset COVIDx demonstrate that the proposed model can achieve promising performance even when trained on very few labeled training images.
△ Less
Submitted 31 March, 2021; v1 submitted 26 February, 2021;
originally announced March 2021.
-
Superresolving second-order correlation imaging using synthesized colored noise speckles
Authors:
Zheng Li,
Xiaoyu Nie,
Fan Yang,
Xiangpei Liu,
Dongyu Liu,
Xiaolong Dong,
Xingchen Zhao,
Tao Peng,
M. Suhail Zubairy,
Marlan O. Scully
Abstract:
We present a novel method to synthesize non-trivial speckles that can enable superresolving second-order correlation imaging. The speckles acquire a unique anti-correlation in the spatial intensity fluctuation by introducing the blue noise spectrum to the input light fields through amplitude modulation. Illuminating objects with the blue noise speckle patterns can lead to a sub-diffraction limit i…
▽ More
We present a novel method to synthesize non-trivial speckles that can enable superresolving second-order correlation imaging. The speckles acquire a unique anti-correlation in the spatial intensity fluctuation by introducing the blue noise spectrum to the input light fields through amplitude modulation. Illuminating objects with the blue noise speckle patterns can lead to a sub-diffraction limit imaging system with a resolution more than three times higher than first-order imaging, which is comparable to the resolving power of ninth order correlation imaging with thermal light. Our method opens a new route towards non-trivial speckle generation by tailoring amplitudes of the input light fields and provides a versatile scheme for constructing superresolving imaging and microscopy systems without invoking complicated higher-order correlations.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
Learning to Fuse Asymmetric Feature Maps in Siamese Trackers
Authors:
Wencheng Han,
Xingping Dong,
Fahad Shahbaz Khan,
Ling Shao,
Jianbing Shen
Abstract:
Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, ha…
▽ More
Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, has fewer activated channels, and provides weak discrimination of object boundaries. Further, DW-XCorr is a handcrafted parameter-free module and cannot fully benefit from offline learning on large-scale data. We propose a learnable module, called the asymmetric convolution (ACM), which learns to better capture the semantic correlation information in offline training on large-scale data. Different from DW-XCorr and its predecessor(XCorr), which regard a single feature map as the convolution kernel, our ACM decomposes the convolution operation on a concatenated feature map into two mathematically equivalent operations, thereby avoiding the need for the feature maps to be of the same size (width and height)during concatenation. Our ACM can incorporate useful prior information, such as bounding-box size, with standard visual features. Furthermore, ACM can easily be integrated into existing Siamese trackers based on DW-XCorror XCorr. To demonstrate its generalization ability, we integrate ACM into three representative trackers: SiamFC, SiamRPN++, and SiamBAN. Our experiments reveal the benefits of the proposed ACM, which outperforms existing methods on six tracking benchmarks. On the LaSOT test set, our ACM-based tracker obtains a significant improvement of 5.8% in terms of success (AUC), over the baseline.
△ Less
Submitted 30 March, 2021; v1 submitted 4 December, 2020;
originally announced December 2020.
-
LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification
Authors:
Benlei Cui,
XueMei Dong,
Qiaoqiao Zhan,
Jiangtao Peng,
Weiwei Sun
Abstract:
Deep learning methods have shown considerable potential for hyperspectral image (HSI) classification, which can achieve high accuracy compared with traditional methods. However, they often need a large number of training samples and have a lot of parameters and high computational overhead. To solve these problems, this paper proposes a new network architecture, LiteDepthwiseNet, for HSI classifica…
▽ More
Deep learning methods have shown considerable potential for hyperspectral image (HSI) classification, which can achieve high accuracy compared with traditional methods. However, they often need a large number of training samples and have a lot of parameters and high computational overhead. To solve these problems, this paper proposes a new network architecture, LiteDepthwiseNet, for HSI classification. Based on 3D depthwise convolution, LiteDepthwiseNet can decompose standard convolution into depthwise convolution and pointwise convolution, which can achieve high classification performance with minimal parameters. Moreover, we remove the ReLU layer and Batch Normalization layer in the original 3D depthwise convolution, which significantly improves the overfitting phenomenon of the model on small sized datasets. In addition, focal loss is used as the loss function to improve the model's attention on difficult samples and unbalanced data, and its training performance is significantly better than that of cross-entropy loss or balanced cross-entropy loss. Experiment results on three benchmark hyperspectral datasets show that LiteDepthwiseNet achieves state-of-the-art performance with a very small number of parameters and low computational cost.
△ Less
Submitted 15 October, 2020;
originally announced October 2020.
-
Under Water Waste Cleaning by Mobile Edge Computing and Intelligent Image Processing Based Robotic Fish
Authors:
Subhadeep Sahoo,
Xiao Han Dong,
Zi Qian Liu,
Joydeep Sahoo
Abstract:
As water pollution is a serious threat to underwater resources, i.e., underwater plants and species, we focus on protecting the resources by cleaning the non-biodegradable waste from the water. The waste can be recycled for further usage. Here we design a robotic fish which mainly comprises optical biosensor, camera module, piston module, and wireless transceiver. By exploiting the LTE and 5G netw…
▽ More
As water pollution is a serious threat to underwater resources, i.e., underwater plants and species, we focus on protecting the resources by cleaning the non-biodegradable waste from the water. The waste can be recycled for further usage. Here we design a robotic fish which mainly comprises optical biosensor, camera module, piston module, and wireless transceiver. By exploiting the LTE and 5G network architecture, the fish stores the information about the underwater waste in the nearest mobile edge computing server as well as in the centralized cloud server. Finally, when the fish clears the underwater waste, it offloads the captured image of the located object to the mobile edge computing server or sometimes to the cloud server for making a decision. The servers employ intelligent image processing technology and an adaptive learning process to make a decision. However, if the servers fail to make a decision, then the fish utilizes its optical biosensor. By this scheme, the time delay for clearing any water body is minimized and the waste collection capacity of the fish is maximized. This technique can effectively help the government or municipal personnel for making clean water without manual efforts.
△ Less
Submitted 31 August, 2020;
originally announced September 2020.
-
Kernel-based Graph Learning from Smooth Signals: A Functional Viewpoint
Authors:
Xingyue Pu,
Siu Lun Chau,
Xiaowen Dong,
Dino Sejdinovic
Abstract:
The problem of graph learning concerns the construction of an explicit topological structure revealing the relationship between nodes representing data entities, which plays an increasingly important role in the success of many graph-based representations and algorithms in the field of machine learning and graph signal processing. In this paper, we propose a novel graph learning framework that inc…
▽ More
The problem of graph learning concerns the construction of an explicit topological structure revealing the relationship between nodes representing data entities, which plays an increasingly important role in the success of many graph-based representations and algorithms in the field of machine learning and graph signal processing. In this paper, we propose a novel graph learning framework that incorporates the node-side and observation-side information, and in particular the covariates that help to explain the dependency structures in graph signals. To this end, we consider graph signals as functions in the reproducing kernel Hilbert space associated with a Kronecker product kernel, and integrate functional learning with smoothness-promoting graph learning to learn a graph representing the relationship between nodes. The functional learning increases the robustness of graph learning against missing and incomplete information in the graph signals. In addition, we develop a novel graph-based regularisation method which, when combined with the Kronecker product kernel, enables our model to capture both the dependency explained by the graph and the dependency due to graph signals observed under different but related circumstances, e.g. different points in time. The latter means the graph signals are free from the i.i.d. assumptions required by the classical graph learning models. Experiments on both synthetic and real-world data show that our methods outperform the state-of-the-art models in learning a meaningful graph topology from graph signals, in particular under heavy noise, missing values, and multiple dependency.
△ Less
Submitted 23 August, 2020;
originally announced August 2020.
-
Deep Q-Network Based Dynamic Movement Strategy in a UAV-Assisted Network
Authors:
Xukai Zhong,
Yiming Huo,
Xiaodai Dong,
Zhonghua Liang
Abstract:
Unmanned aerial vehicle (UAV)-assisted communications is a promising solution to improve the performance of future wireless networks, where UAVs are deployed as base stations for enhancing the quality of service (QoS) provided to ground users when traditional terrestrial base stations are unavailable or not sufficient. An effective framework is proposed in this paper to manage the dynamic movement…
▽ More
Unmanned aerial vehicle (UAV)-assisted communications is a promising solution to improve the performance of future wireless networks, where UAVs are deployed as base stations for enhancing the quality of service (QoS) provided to ground users when traditional terrestrial base stations are unavailable or not sufficient. An effective framework is proposed in this paper to manage the dynamic movement of multiple unmanned aerial vehicles (UAVs) in response to ground user mobility, with the objective to maximize the sum data rate of the ground users. First, we discuss the relationship between the air-to-ground (A2G) path loss (PL) and the location of UAVs. Then a deep Q-network (DQN) based method is proposed to adjust the locations of UAVs to maximize the sum data rate of the user equipment (UE). Finally, simulation results show that the proposed method is capable of adjusting UAV locations in a real-time condition to improve the QoS of the entire network.
△ Less
Submitted 7 August, 2020;
originally announced August 2020.
-
QoS-Compliant 3D Deployment Optimization Strategy for UAV Base Stations
Authors:
Xukai Zhong,
Yiming Huo,
Xiaodai Dong,
Zhonghua Liang
Abstract:
Unmanned aerial vehicle (UAV) is being integrated as an active element in 5G and beyond networks. Because of its flexibility and mobility, UAV base stations (UAV-BSs) can be deployed according to the ground user distributions and their quality of service (QoS) requirement. Although there has been quite some prior research on the UAV deployment, no work has studied this problem in a 3 dimensional (…
▽ More
Unmanned aerial vehicle (UAV) is being integrated as an active element in 5G and beyond networks. Because of its flexibility and mobility, UAV base stations (UAV-BSs) can be deployed according to the ground user distributions and their quality of service (QoS) requirement. Although there has been quite some prior research on the UAV deployment, no work has studied this problem in a 3 dimensional (3D) setting and taken into account the UAV-BS capacity limit and the quality of service (QoS) requirements of ground users. Therefore, in this paper, we focus on the problem of deploying UAV-BSs to provide satisfactory wireless communication services, with the aim to maximize the total number of covered user equipment (UE) subject to user data rate requirements and UAV-BSs' capacity limit. First, we model the relationship between the air-to-ground (A2G) path loss (PL) and the location of UAV-BSs in both horizontal and vertical dimensions which has not been considered in previous works. Unlike the conventional UAV deployment problem formulation, the 3D deployment problem is decoupled into a 2D horizontal placement and altitude determination connected by path loss requirement and minimization. Then, we propose a novel genetic algorithm (GA) based 2D placement approach in which UAV-BSs are placed to have maximum coverage of the users with consideration of data rate distribution. Finally, numerical and simulation results show that the proposed approach has enabled a better coverage percentage comparing with other schemes.
△ Less
Submitted 7 August, 2020;
originally announced August 2020.
-
Axiom-based Grad-CAM: Towards Accurate Visualization and Explanation of CNNs
Authors:
Ruigang Fu,
Qingyong Hu,
Xiaohu Dong,
Yulan Guo,
Yinghui Gao,
Biao Li
Abstract:
To have a better understanding and usage of Convolution Neural Networks (CNNs), the visualization and interpretation of CNNs has attracted increasing attention in recent years. In particular, several Class Activation Mapping (CAM) methods have been proposed to discover the connection between CNN's decision and image regions. In spite of the reasonable visualization, lack of clear and sufficient th…
▽ More
To have a better understanding and usage of Convolution Neural Networks (CNNs), the visualization and interpretation of CNNs has attracted increasing attention in recent years. In particular, several Class Activation Mapping (CAM) methods have been proposed to discover the connection between CNN's decision and image regions. In spite of the reasonable visualization, lack of clear and sufficient theoretical support is the main limitation of these methods. In this paper, we introduce two axioms -- Conservation and Sensitivity -- to the visualization paradigm of the CAM methods. Meanwhile, a dedicated Axiom-based Grad-CAM (XGrad-CAM) is proposed to satisfy these axioms as much as possible. Experiments demonstrate that XGrad-CAM is an enhanced version of Grad-CAM in terms of conservation and sensitivity. It is able to achieve better visualization performance than Grad-CAM, while also be class-discriminative and easy-to-implement compared with Grad-CAM++ and Ablation-CAM. The code is available at https://github.com/Fu0511/XGrad-CAM.
△ Less
Submitted 19 August, 2020; v1 submitted 5 August, 2020;
originally announced August 2020.
-
Graph signal processing for machine learning: A review and new perspectives
Authors:
Xiaowen Dong,
Dorina Thanou,
Laura Toni,
Michael Bronstein,
Pascal Frossard
Abstract:
The effective representation, processing, analysis, and visualization of large-scale structured data, especially those related to complex domains such as networks and graphs, are one of the key questions in modern machine learning. Graph signal processing (GSP), a vibrant branch of signal processing models and algorithms that aims at handling data supported on graphs, opens new paths of research t…
▽ More
The effective representation, processing, analysis, and visualization of large-scale structured data, especially those related to complex domains such as networks and graphs, are one of the key questions in modern machine learning. Graph signal processing (GSP), a vibrant branch of signal processing models and algorithms that aims at handling data supported on graphs, opens new paths of research to address this challenge. In this article, we review a few important contributions made by GSP concepts and tools, such as graph filters and transforms, to the development of novel machine learning algorithms. In particular, our discussion focuses on the following three aspects: exploiting data structure and relational priors, improving data and computational efficiency, and enhancing model interpretability. Furthermore, we provide new perspectives on future development of GSP techniques that may serve as a bridge between applied mathematics and signal processing on one side, and machine learning and network science on the other. Cross-fertilization across these different disciplines may help unlock the numerous challenges of complex data analysis in the modern age.
△ Less
Submitted 31 July, 2020;
originally announced July 2020.