-
Statistical Tests for Replacing Human Decision Makers with Algorithms
Authors:
Kai Feng,
Han Hong,
Ke Tang,
Jingyuan Wang
Abstract:
This paper proposes a statistical framework with which artificial intelligence can improve human decision making. The performance of each human decision maker is first benchmarked against machine predictions; we then replace the decisions made by a subset of the decision makers with the recommendation from the proposed artificial intelligence algorithm. Using a large nationwide dataset of pregnanc…
▽ More
This paper proposes a statistical framework with which artificial intelligence can improve human decision making. The performance of each human decision maker is first benchmarked against machine predictions; we then replace the decisions made by a subset of the decision makers with the recommendation from the proposed artificial intelligence algorithm. Using a large nationwide dataset of pregnancy outcomes and doctor diagnoses from prepregnancy checkups of reproductive age couples, we experimented with both a heuristic frequentist approach and a Bayesian posterior loss function approach with an application to abnormal birth detection. We find that our algorithm on a test dataset results in a higher overall true positive rate and a lower false positive rate than the diagnoses made by doctors only. We also find that the diagnoses of doctors from rural areas are more frequently replaceable, suggesting that artificial intelligence assisted decision making tends to improve precision more in less developed regions.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
The Weight Distributions of Two Classes of Linear Codes From Perfect Nonlinear Functions
Authors:
Huawei Wu,
Jing Yang,
Keqin Feng
Abstract:
In this paper, we employ general results on the value distributions of perfect nonlinear functions from $\mathbb{F}_{p^m}$ to $\mathbb{F}_p$ together with a specific group action to give a unified approach to determining the weight distributions of two classes of linear codes over $\mathbb{F}_p$ constructed from perfect nonlinear functions, where $p$ is an odd prime number and $m\in\mathbb{N}_+$.
In this paper, we employ general results on the value distributions of perfect nonlinear functions from $\mathbb{F}_{p^m}$ to $\mathbb{F}_p$ together with a specific group action to give a unified approach to determining the weight distributions of two classes of linear codes over $\mathbb{F}_p$ constructed from perfect nonlinear functions, where $p$ is an odd prime number and $m\in\mathbb{N}_+$.
△ Less
Submitted 16 November, 2023; v1 submitted 10 June, 2023;
originally announced June 2023.
-
Robust Active and Passive Beamforming for RIS-Assisted Full-Duplex Systems under Imperfect CSI
Authors:
Li-Hsiang Shen,
Chia-Jou Ku,
Kai-Ten Feng
Abstract:
The sixth-generation (6G) wireless technology recognizes the potential of reconfigurable intelligent surfaces (RIS) as an effective technique for intelligently manipulating channel paths through reflection to serve desired users. Full-duplex (FD) systems, enabling simultaneous transmission and reception from a base station (BS), offer the theoretical advantage of doubled spectrum efficiency. Howev…
▽ More
The sixth-generation (6G) wireless technology recognizes the potential of reconfigurable intelligent surfaces (RIS) as an effective technique for intelligently manipulating channel paths through reflection to serve desired users. Full-duplex (FD) systems, enabling simultaneous transmission and reception from a base station (BS), offer the theoretical advantage of doubled spectrum efficiency. However, the presence of strong self-interference (SI) in FD systems significantly degrades performance, which can be mitigated by leveraging the capabilities of RIS. Moreover, accurately obtaining channel state information (CSI) from RIS poses a critical challenge. Our objective is to maximize downlink (DL) user data rates while ensuring quality-of-service (QoS) for uplink (UL) users under imperfect CSI from reflected channels. To address this, we propose a robust active BS and passive RIS beamforming (RAPB) scheme for RIS-FD, accounting for both SI and imperfect CSI. RAPB incorporates distributionally robust design, conditional value-at-risk (CVaR), and penalty convex-concave programming (PCCP) techniques. Simulation results demonstrate the UL/DL rate improvement are achieved by considering different levels of imperfect CSI. The proposed RAPB schemes validate their effectiveness across different RIS deployments and RIS/BS configurations. Benefited from robust beamforming, RAPB outperforms the existing methods in terms of non-robustness, deployment without RIS, conventional approximation, and half-duplex systems.
△ Less
Submitted 19 November, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
How Do UX Practitioners Communicate AI as a Design Material? Artifacts, Conceptions, and Propositions
Authors:
K. J. Kevin Feng,
Maxwell James Coppock,
David W. McDonald
Abstract:
UX practitioners (UXPs) face novel challenges when working with and communicating artificial intelligence (AI) as a design material. We explore how UXPs communicate AI concepts when given hands-on experience training and experimenting with AI models. To do so, we conducted a task-based design study with 27 UXPs in which they prototyped and created a design presentation for a AI-enabled interface w…
▽ More
UX practitioners (UXPs) face novel challenges when working with and communicating artificial intelligence (AI) as a design material. We explore how UXPs communicate AI concepts when given hands-on experience training and experimenting with AI models. To do so, we conducted a task-based design study with 27 UXPs in which they prototyped and created a design presentation for a AI-enabled interface while having access to a simple AI model training tool. Through analyzing UXPs' design presentations and post-activity interviews, we found that although UXPs struggled to clearly communicate some AI concepts, tinkering with AI broadened common ground when communicating with technical stakeholders. UXPs also identified key risks and benefits of AI in their designs, and proposed concrete next steps for both UX and AI work. We conclude with a sensitizing concept and recommendations for design and AI tools to enhance multi-stakeholder communication and collaboration when crafting human-centered AI experiences.
△ Less
Submitted 27 May, 2023;
originally announced May 2023.
-
Constructions of $k$-uniform states in heterogeneous systems
Authors:
Keqin Feng,
Lingfei Jin,
Chaoping Xing,
Chen Yuan
Abstract:
A pure quantum state of $n$ parties associated with the Hilbert space $\CC^{d_1}\otimes \CC^{d_2}\otimes\cdots\otimes \CC^{d_n}$ is called $k$-uniform if all the reductions to $k$-parties are maximally mixed. The $n$ partite system is called homogenous if the local dimension $d_1=d_2=\cdots=d_n$, while it is called heterogeneous if the local dimension are not all equal. $k$-uniform sates play an i…
▽ More
A pure quantum state of $n$ parties associated with the Hilbert space $\CC^{d_1}\otimes \CC^{d_2}\otimes\cdots\otimes \CC^{d_n}$ is called $k$-uniform if all the reductions to $k$-parties are maximally mixed. The $n$ partite system is called homogenous if the local dimension $d_1=d_2=\cdots=d_n$, while it is called heterogeneous if the local dimension are not all equal. $k$-uniform sates play an important role in quantum information theory. There are many progress in characterizing and constructing $k$-uniform states in homogeneous systems. However, the study of entanglement for heterogeneous systems is much more challenging than that for the homogeneous case. There are very few results known for the $k$-uniform states in heterogeneous systems for $k>3$. We present two general methods to construct $k$-uniform states in the heterogeneous systems for general $k$. The first construction is derived from the error correcting codes by establishing a connection between irredundant mixed orthogonal arrays and error correcting codes. We can produce many new $k$-uniform states such that the local dimension of each subsystem can be a prime power. The second construction is derived from a matrix $H$ meeting the condition that $H_{A\times \bar{A}}+H^T_{\bar{A}\times A}$ has full rank for any row index set $A$ of size $k$. These matrix construction can provide more flexible choices for the local dimensions, i.e., the local dimensions can be any integer (not necessarily prime power) subject to some constraints. Our constructions imply that for any positive integer $k$, one can construct $k$-uniform states of a heterogeneous system in many different Hilbert spaces.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
Degradation-Noise-Aware Deep Unfolding Transformer for Hyperspectral Image Denoising
Authors:
Haijin Zeng,
Jiezhang Cao,
Kai Feng,
Shaoguang Huang,
Hongyan Zhang,
Hiep Luong,
Wilfried Philips
Abstract:
Hyperspectral imaging (HI) has emerged as a powerful tool in diverse fields such as medical diagnosis, industrial inspection, and agriculture, owing to its ability to detect subtle differences in physical properties through high spectral resolution. However, hyperspectral images (HSIs) are often quite noisy because of narrow band spectral filtering. To reduce the noise in HSI data cubes, both mode…
▽ More
Hyperspectral imaging (HI) has emerged as a powerful tool in diverse fields such as medical diagnosis, industrial inspection, and agriculture, owing to its ability to detect subtle differences in physical properties through high spectral resolution. However, hyperspectral images (HSIs) are often quite noisy because of narrow band spectral filtering. To reduce the noise in HSI data cubes, both model-driven and learning-based denoising algorithms have been proposed. However, model-based approaches rely on hand-crafted priors and hyperparameters, while learning-based methods are incapable of estimating the inherent degradation patterns and noise distributions in the imaging procedure, which could inform supervised learning. Secondly, learning-based algorithms predominantly rely on CNN and fail to capture long-range dependencies, resulting in limited interpretability. This paper proposes a Degradation-Noise-Aware Unfolding Network (DNA-Net) that addresses these issues. Firstly, DNA-Net models sparse noise, Gaussian noise, and explicitly represent image prior using transformer. Then the model is unfolded into an end-to-end network, the hyperparameters within the model are estimated from the noisy HSI and degradation model and utilizes them to control each iteration. Additionally, we introduce a novel U-Shaped Local-Non-local-Spectral Transformer (U-LNSA) that captures spectral correlation, local contents, and non-local dependencies simultaneously. By integrating U-LNSA into DNA-Net, we present the first Transformer-based deep unfolding HSI denoising method. Experimental results show that DNA-Net outperforms state-of-the-art methods, and the modeling of noise distributions helps in cases with heavy noise.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
Federated Deep Reinforcement Learning for THz-Beam Search with Limited CSI
Authors:
Po-Chun Hsu,
Li-Hsiang Shen,
Chun-Hung Liu,
Kai-Ten Feng
Abstract:
Terahertz (THz) communication with ultra-wide available spectrum is a promising technique that can achieve the stringent requirement of high data rate in the next-generation wireless networks, yet its severe propagation attenuation significantly hinders its implementation in practice. Finding beam directions for a large-scale antenna array to effectively overcome severe propagation attenuation of…
▽ More
Terahertz (THz) communication with ultra-wide available spectrum is a promising technique that can achieve the stringent requirement of high data rate in the next-generation wireless networks, yet its severe propagation attenuation significantly hinders its implementation in practice. Finding beam directions for a large-scale antenna array to effectively overcome severe propagation attenuation of THz signals is a pressing need. This paper proposes a novel approach of federated deep reinforcement learning (FDRL) to swiftly perform THz-beam search for multiple base stations (BSs) coordinated by an edge server in a cellular network. All the BSs conduct deep deterministic policy gradient (DDPG)-based DRL to obtain THz beamforming policy with limited channel state information (CSI). They update their DDPG models with hidden information in order to mitigate inter-cell interference. We demonstrate that the cell network can achieve higher throughput as more THz CSI and hidden neurons of DDPG are adopted. We also show that FDRL with partial model update is able to nearly achieve the same performance of FDRL with full model update, which indicates an effective means to reduce communication load between the edge server and the BSs by partial model uploading. Moreover, the proposed FDRL outperforms conventional non-learning-based and existing non-FDRL benchmark optimization methods.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
Time-Selective RNN for Device-Free Multi-Room Human Presence Detection Using WiFi CSI
Authors:
Li-Hsiang Shen,
An-Hung Hsiao,
Fang-Yu Chu,
Kai-Ten Feng
Abstract:
Device-free human presence detection is a crucial technology for various applications, including home automation, security, and healthcare. While camera-based systems have traditionally been used for this purpose, they raise privacy concerns. To address this issue, recent research has explored the use of wireless channel state information (CSI) extracted from commercial WiFi access points (APs) to…
▽ More
Device-free human presence detection is a crucial technology for various applications, including home automation, security, and healthcare. While camera-based systems have traditionally been used for this purpose, they raise privacy concerns. To address this issue, recent research has explored the use of wireless channel state information (CSI) extracted from commercial WiFi access points (APs) to provide detailed channel characteristics. In this paper, we propose a device-free human presence detection system for multi-room scenarios using a time-selective conditional dual feature extract recurrent network (TCD-FERN). Our system is designed to capture significant time features on current human features using a dynamic and static data preprocessing technique. We extract both moving and spatial features of people and differentiate between line-of-sight (LoS) and non-line-of-sight (NLoS) cases. Subcarrier fusion is carried out in order to provide more objective variation of each sample while reducing the computational complexity. A voting scheme is further adopted to mitigate the feature attenuation problem caused by room partitions, with around 3% improvement of human presence detection accuracy. Experimental results have revealed the significant improvement of leveraging subcarrier fusion, dual-feature recurrent network, time selection and condition mechanisms. Compared to the existing works in open literature, our proposed TCD-FERN system can achieve above 97% of human presence detection accuracy for multi-room scenarios with the adoption of fewer WiFi APs.
△ Less
Submitted 11 December, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Attention-Enhanced Deep Learning for Device-Free Through-the-Wall Presence Detection Using Indoor WiFi Systems
Authors:
Li-Hsiang Shen,
An-Hung Hsiao,
Kuan-I Lu,
Kai-Ten Feng
Abstract:
Accurate detection of human presence in indoor environments is important for various applications, such as energy management and security. In this paper, we propose a novel system for human presence detection using the channel state information (CSI) of WiFi signals. Our system named attention-enhanced deep learning for presence detection (ALPD) employs an attention mechanism to automatically sele…
▽ More
Accurate detection of human presence in indoor environments is important for various applications, such as energy management and security. In this paper, we propose a novel system for human presence detection using the channel state information (CSI) of WiFi signals. Our system named attention-enhanced deep learning for presence detection (ALPD) employs an attention mechanism to automatically select informative subcarriers from the CSI data and a bidirectional long short-term memory (LSTM) network to capture temporal dependencies in CSI. Additionally, we utilize a static feature to improve the accuracy of human presence detection in static states. We evaluate the proposed ALPD system by deploying a pair of WiFi access points (APs) for collecting CSI dataset, which is further compared with several benchmarks. The results demonstrate that our ALPD system outperforms the benchmarks in terms of accuracy, especially in the presence of interference. Moreover, bidirectional transmission data is beneficial to training improving stability and accuracy, as well as reducing the costs of data collection for training. To elaborate a little further, we have also evaluated the potential of ALPD for detecting more challenging human activities in multi-rooms. Overall, our proposed ALPD system shows promising results for human presence detection using WiFi CSI signals.
△ Less
Submitted 8 February, 2024; v1 submitted 25 April, 2023;
originally announced April 2023.
-
A New Paradigm for Device-free Indoor Localization: Deep Learning with Error Vector Spectrum in Wi-Fi Systems
Authors:
Wen Liu,
An-Hung Hsiao,
Li-Hsiang Shen,
Kai-Ten Feng
Abstract:
The demand for device-free indoor localization using commercial Wi-Fi devices has rapidly increased in various fields due to its convenience and versatile applications. However, random frequency offset (RFO) in wireless channels poses challenges to the accuracy of indoor localization when using fluctuating channel state information (CSI). To mitigate the RFO problem, an error vector spectrum (EVS)…
▽ More
The demand for device-free indoor localization using commercial Wi-Fi devices has rapidly increased in various fields due to its convenience and versatile applications. However, random frequency offset (RFO) in wireless channels poses challenges to the accuracy of indoor localization when using fluctuating channel state information (CSI). To mitigate the RFO problem, an error vector spectrum (EVS) is conceived thanks to its higher resolution of signal and robustness to RFO. To address these challenges, this paper proposed a novel error vector assisted learning (EVAL) for device-free indoor localization. The proposed EVAL scheme employs deep neural networks to classify the location of a person in the indoor environment by extracting ample channel features from the physical layer signals. We conducted realistic experiments based on OpenWiFi project to extract both EVS and CSI to examine the performance of different device-free localization techniques. Experimental results show that our proposed EVAL scheme outperforms conventional machine learning methods and benchmarks utilizing either CSI amplitude or phase information. Compared to most existing CSI-based localization schemes, a new paradigm with higher positioning accuracy by adopting EVS is revealed by our proposed EVAL system.
△ Less
Submitted 25 March, 2023;
originally announced April 2023.
-
WiRiS: Transformer for RIS-Assisted Device-Free Sensing for Joint People Counting and Localization using Wi-Fi CSI
Authors:
Wei-Yu Chung,
Li-Hsiang Shen,
Kai-Ten Feng,
Yuan-Chun Lin,
Shih-Cheng Lin,
Sheng-Fuh Chang
Abstract:
Channel State Information (CSI) is widely adopted as a feature for indoor localization. Taking advantage of the abundant information from the CSI, people can be accurately sensed even without equipped devices. However, the positioning error increases severely in non-line-of-sight (NLoS) regions. Reconfigurable intelligent surface (RIS) has been introduced to improve signal coverage in NLoS areas,…
▽ More
Channel State Information (CSI) is widely adopted as a feature for indoor localization. Taking advantage of the abundant information from the CSI, people can be accurately sensed even without equipped devices. However, the positioning error increases severely in non-line-of-sight (NLoS) regions. Reconfigurable intelligent surface (RIS) has been introduced to improve signal coverage in NLoS areas, which can re-direct and enhance reflective signals with massive meta-material elements. In this paper, we have proposed a Transformer-based RIS-assisted device-free sensing for joint people counting and localization (WiRiS) system to precisely predict the number of people and their corresponding locations through configuring RIS. A series of predefined RIS beams is employed to create inputs of fingerprinting CSI features as sequence-to-sequence learning database for Transformer. We have evaluated the performance of proposed WiRiS system in both ray-tracing simulators and experiments. Both simulation and real-world experiments demonstrate that people counting accuracy exceeds 90\%, and the localization error can achieve the centimeter-level, which outperforms the existing benchmarks without employment of RIS.
△ Less
Submitted 9 November, 2023; v1 submitted 25 March, 2023;
originally announced April 2023.
-
Attention-based Learning for Sleep Apnea and Limb Movement Detection using Wi-Fi CSI Signals
Authors:
Chi-Che Chang,
An-Hung Hsiao,
Li-Hsiang Shen,
Kai-Ten Feng,
Chia-Yu Chen
Abstract:
Wi-Fi channel state information (CSI) has become a promising solution for non-invasive breathing and body motion monitoring during sleep. Sleep disorders of apnea and periodic limb movement disorder (PLMD) are often unconscious and fatal. The existing researches detect abnormal sleep disorders in impractically controlled environments. Moreover, it leads to compelling challenges to classify complex…
▽ More
Wi-Fi channel state information (CSI) has become a promising solution for non-invasive breathing and body motion monitoring during sleep. Sleep disorders of apnea and periodic limb movement disorder (PLMD) are often unconscious and fatal. The existing researches detect abnormal sleep disorders in impractically controlled environments. Moreover, it leads to compelling challenges to classify complex macro- and micro-scales of sleep movements as well as entangled similar waveforms of cases of apnea and PLMD. In this paper, we propose the attention-based learning for sleep apnea and limb movement detection (ALESAL) system that can jointly detect sleep apnea and PLMD under different sleep postures across a variety of patients. ALESAL contains antenna-pair and time attention mechanisms for mitigating the impact of modest antenna pairs and emphasizing the duration of interest, respectively. Performance results show that our proposed ALESAL system can achieve a weighted F1-score of 84.33, outperforming the other existing non-attention based methods of support vector machine and deep multilayer perceptron.
△ Less
Submitted 26 March, 2023;
originally announced April 2023.
-
Distributed Multi-Agent Deep Q-Learning for Fast Roaming in IEEE 802.11ax Wi-Fi Systems
Authors:
Ting-Hui Wang,
Li-Hsiang Shen,
Kai-Ten Feng
Abstract:
The innovation of Wi-Fi 6, IEEE 802.11ax, was be approved as the next sixth-generation (6G) technology of wireless local area networks (WLANs) by improving the fundamental performance of latency, throughput, and so on. The main technical feature of orthogonal frequency division multiple access (OFDMA) supports multi-users to transmit respective data concurrently via the corresponding access points…
▽ More
The innovation of Wi-Fi 6, IEEE 802.11ax, was be approved as the next sixth-generation (6G) technology of wireless local area networks (WLANs) by improving the fundamental performance of latency, throughput, and so on. The main technical feature of orthogonal frequency division multiple access (OFDMA) supports multi-users to transmit respective data concurrently via the corresponding access points (APs). However, the conventional IEEE 802.11 protocol for Wi-Fi roaming selects the target AP only depending on received signal strength indication (RSSI) which is obtained by the received Response frame from the APs. In the long term, it may lead to congestion in a single channel under the scenarios of dense users further increasing the association delay and packet drop rate, even reducing the quality of service (QoS) of the overall system. In this paper, we propose a multi-agent deep Q-learning for fast roaming (MADAR) algorithm to effectively minimize the latency during the station roaming for Smart Warehouse in Wi-Fi 6 system. The MADAR algorithm considers not only RSSI but also channel state information (CSI), and through online neural network learning and weighting adjustments to maximize the reward of the action selected from Epsilon-Greedy. Compared to existing benchmark methods, the MADAR algorithm has been demonstrated for improved roaming latency by analyzing the simulation result and realistic dataset.
△ Less
Submitted 25 March, 2023;
originally announced April 2023.
-
Edge Selection and Clustering for Federated Learning in Optical Inter-LEO Satellite Constellation
Authors:
Chih-Yu Chen,
Li-Hsiang Shen,
Kai-Ten Feng,
Lie-Liang Yang,
Jen-Ming Wu
Abstract:
Low-Earth orbit (LEO) satellites have been prosperously deployed for various Earth observation missions due to its capability of collecting a large amount of image or sensor data. However, traditionally, the data training process is performed in the terrestrial cloud server, which leads to a high transmission overhead. With the recent development of LEO, it is more imperative to provide ultra-dens…
▽ More
Low-Earth orbit (LEO) satellites have been prosperously deployed for various Earth observation missions due to its capability of collecting a large amount of image or sensor data. However, traditionally, the data training process is performed in the terrestrial cloud server, which leads to a high transmission overhead. With the recent development of LEO, it is more imperative to provide ultra-dense LEO constellation with enhanced on-board computation capability. Benefited from it, we have proposed a collaborative federated learning for low Earth orbit (FELLO). We allocate the entire process on LEOs with low payload inter-satellite transmissions, whilst the low-delay terrestrial gateway server (GS) only takes care for initial signal controlling. The GS initially selects an LEO server, whereas its LEO clients are all determined by clustering mechanism and communication capability through the optical inter-satellite links (ISLs). The re-clustering of changing LEO server will be executed once with low communication quality of FELLO. In the simulations, we have numerically analyzed the proposed FELLO under practical Walker-based LEO constellation configurations along with MNIST training dataset for classification mission. The proposed FELLO outperforms the conventional centralized and distributed architectures with higher classification accuracy as well as comparably lower latency of joint communication and computing.
△ Less
Submitted 10 April, 2023; v1 submitted 25 March, 2023;
originally announced March 2023.
-
Towards Open Temporal Graph Neural Networks
Authors:
Kaituo Feng,
Changsheng Li,
Xiaolu Zhang,
Jun Zhou
Abstract:
Graph neural networks (GNNs) for temporal graphs have recently attracted increasing attentions, where a common assumption is that the class set for nodes is closed. However, in real-world scenarios, it often faces the open set problem with the dynamically increased class set as the time passes by. This will bring two big challenges to the existing dynamic GNN methods: (i) How to dynamically propag…
▽ More
Graph neural networks (GNNs) for temporal graphs have recently attracted increasing attentions, where a common assumption is that the class set for nodes is closed. However, in real-world scenarios, it often faces the open set problem with the dynamically increased class set as the time passes by. This will bring two big challenges to the existing dynamic GNN methods: (i) How to dynamically propagate appropriate information in an open temporal graph, where new class nodes are often linked to old class nodes. This case will lead to a sharp contradiction. This is because typical GNNs are prone to make the embeddings of connected nodes become similar, while we expect the embeddings of these two interactive nodes to be distinguishable since they belong to different classes. (ii) How to avoid catastrophic knowledge forgetting over old classes when learning new classes occurred in temporal graphs. In this paper, we propose a general and principled learning approach for open temporal graphs, called OTGNet, with the goal of addressing the above two challenges. We assume the knowledge of a node can be disentangled into class-relevant and class-agnostic one, and thus explore a new message passing mechanism by extending the information bottleneck principle to only propagate class-agnostic knowledge between nodes of different classes, avoiding aggregating conflictive information. Moreover, we devise a strategy to select both important and diverse triad sub-graph structures for effective class-incremental learning. Extensive experiments on three real-world datasets of different domains demonstrate the superiority of our method, compared to the baselines.
△ Less
Submitted 25 May, 2023; v1 submitted 27 March, 2023;
originally announced March 2023.
-
Intelligent Load Balancing and Resource Allocation in O-RAN: A Multi-Agent Multi-Armed Bandit Approach
Authors:
Chia-Hsiang Lai,
Li-Hsiang Shen,
Kai-Ten Feng
Abstract:
The open radio access network (O-RAN) architecture offers a cost-effective and scalable solution for internet service providers to optimize their networks using machine learning algorithms. The architecture's open interfaces enable network function virtualization, with the O-RAN serving as the primary communication device for users. However, the limited frequency resources and information explosio…
▽ More
The open radio access network (O-RAN) architecture offers a cost-effective and scalable solution for internet service providers to optimize their networks using machine learning algorithms. The architecture's open interfaces enable network function virtualization, with the O-RAN serving as the primary communication device for users. However, the limited frequency resources and information explosion make it difficult to achieve an optimal network experience without effective traffic control or resource allocation. To address this, we consider mobility-aware load balancing to evenly distribute loads across the network, preventing network congestion and user outages caused by excessive load concentration on open radio unit (O-RU) governed by a single open distributed unit (O-DU). We have proposed a multi-agent multi-armed bandit for load balancing and resource allocation (mmLBRA) scheme, designed to both achieve load balancing and improve the effective sum-rate performance of the O-RAN network. We also present the mmLBRA-LB and mmLBRA-RA sub-schemes that can operate independently in non-realtime RAN intelligent controller (Non-RT RIC) and near-RT RIC, respectively, providing a solution with moderate loads and high-rate in O-RUs. Simulation results show that the proposed mmLBRA scheme significantly increases the effective network sum-rate while achieving better load balancing across O-RUs compared to rule-based and other existing heuristic methods in open literature.
△ Less
Submitted 25 March, 2023;
originally announced March 2023.
-
Hierarchical Multi-Agent Multi-Armed Bandit for Resource Allocation in Multi-LEO Satellite Constellation Networks
Authors:
Li-Hsiang Shen,
Yun Ho,
Kai-Ten Feng,
Lie-Liang Yang,
Sau-Hsuan Wu,
Jen-Ming Wu
Abstract:
Low Earth orbit (LEO) satellite constellation is capable of providing global coverage area with high-rate services in the next sixth-generation (6G) non-terrestrial network (NTN). Due to limited onboard resources of operating power, beams, and channels, resilient and efficient resource management has become compellingly imperative under complex interference cases. However, different from conventio…
▽ More
Low Earth orbit (LEO) satellite constellation is capable of providing global coverage area with high-rate services in the next sixth-generation (6G) non-terrestrial network (NTN). Due to limited onboard resources of operating power, beams, and channels, resilient and efficient resource management has become compellingly imperative under complex interference cases. However, different from conventional terrestrial base stations, LEO is deployed at considerable height and under high mobility, inducing substantially long delay and interference during transmission. As a result, acquiring the accurate channel state information between LEOs and ground users is challenging. Therefore, we construct a framework with a two-way transmission under unknown channel information and no data collected at long-delay ground gateway. In this paper, we propose hierarchical multi-agent multi-armed bandit resource allocation for LEO constellation (mmRAL) by appropriately assigning available radio resources. LEOs are considered as collaborative multiple macro-agents attempting unknown trials of various actions of micro-agents of respective resources, asymptotically achieving suitable allocation with only throughput information. In simulations, we evaluate mmRAL in various cases of LEO deployment, serving numbers of users and LEOs, hardware cost and outage probability. Benefited by efficient and resilient allocation, the proposed mmRAL system is capable of operating in homogeneous or heterogeneous orbital planes or constellations, achieving the highest throughput performance compared to the existing benchmarks in open literature.
△ Less
Submitted 25 March, 2023;
originally announced March 2023.
-
Inheriting Bayer's Legacy-Joint Remosaicing and Denoising for Quad Bayer Image Sensor
Authors:
Haijin Zeng,
Kai Feng,
Jiezhang Cao,
Shaoguang Huang,
Yongqiang Zhao,
Hiep Luong,
Jan Aelterman,
Wilfried Philips
Abstract:
Pixel binning based Quad sensors have emerged as a promising solution to overcome the hardware limitations of compact cameras in low-light imaging. However, binning results in lower spatial resolution and non-Bayer CFA artifacts. To address these challenges, we propose a dual-head joint remosaicing and denoising network (DJRD), which enables the conversion of noisy Quad Bayer and standard noise-fr…
▽ More
Pixel binning based Quad sensors have emerged as a promising solution to overcome the hardware limitations of compact cameras in low-light imaging. However, binning results in lower spatial resolution and non-Bayer CFA artifacts. To address these challenges, we propose a dual-head joint remosaicing and denoising network (DJRD), which enables the conversion of noisy Quad Bayer and standard noise-free Bayer pattern without any resolution loss. DJRD includes a newly designed Quad Bayer remosaicing (QB-Re) block, integrated denoising modules based on Swin-transformer and multi-scale wavelet transform. The QB-Re block constructs the convolution kernel based on the CFA pattern to achieve a periodic color distribution in the perceptual field, which is used to extract exact spectral information and reduce color misalignment. The integrated Swin-Transformer and multi-scale wavelet transform capture non-local dependencies, frequency and location information to effectively reduce practical noise. By identifying challenging patches utilizing Moire and zipper detection metrics, we enable our model to concentrate on difficult patches during the post-training phase, which enhances the model's performance in hard cases. Our proposed model outperforms competing models by approximately 3dB, without additional complexity in hardware or software.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
MSFA-Frequency-Aware Transformer for Hyperspectral Images Demosaicing
Authors:
Haijin Zeng,
Kai Feng,
Shaoguang Huang,
Jiezhang Cao,
Yongyong Chen,
Hongyan Zhang,
Hiep Luong,
Wilfried Philips
Abstract:
Hyperspectral imaging systems that use multispectral filter arrays (MSFA) capture only one spectral component in each pixel. Hyperspectral demosaicing is used to recover the non-measured components. While deep learning methods have shown promise in this area, they still suffer from several challenges, including limited modeling of non-local dependencies, lack of consideration of the periodic MSFA…
▽ More
Hyperspectral imaging systems that use multispectral filter arrays (MSFA) capture only one spectral component in each pixel. Hyperspectral demosaicing is used to recover the non-measured components. While deep learning methods have shown promise in this area, they still suffer from several challenges, including limited modeling of non-local dependencies, lack of consideration of the periodic MSFA pattern that could be linked to periodic artifacts, and difficulty in recovering high-frequency details. To address these challenges, this paper proposes a novel de-mosaicing framework, the MSFA-frequency-aware Transformer network (FDM-Net). FDM-Net integrates a novel MSFA-frequency-aware multi-head self-attention mechanism (MaFormer) and a filter-based Fourier zero-padding method to reconstruct high pass components with greater difficulty and low pass components with relative ease, separately. The advantage of Maformer is that it can leverage the MSFA information and non-local dependencies present in the data. Additionally, we introduce a joint spatial and frequency loss to transfer MSFA information and enhance training on frequency components that are hard to recover. Our experimental results demonstrate that FDM-Net outperforms state-of-the-art methods with 6dB PSNR, and reconstructs high-fidelity details successfully.
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
Examining the Impact of Provenance-Enabled Media on Trust and Accuracy Perceptions
Authors:
K. J. Kevin Feng,
Nick Ritchie,
Pia Blumenthal,
Andy Parsons,
Amy X. Zhang
Abstract:
In recent years, industry leaders and researchers have proposed to use technical provenance standards to address visual misinformation spread through digitally altered media. By adding immutable and secure provenance information such as authorship and edit date to media metadata, social media users could potentially better assess the validity of the media they encounter. However, it is unclear how…
▽ More
In recent years, industry leaders and researchers have proposed to use technical provenance standards to address visual misinformation spread through digitally altered media. By adding immutable and secure provenance information such as authorship and edit date to media metadata, social media users could potentially better assess the validity of the media they encounter. However, it is unclear how end users would respond to provenance information, or how to best design provenance indicators to be understandable to laypeople. We conducted an online experiment with 595 participants from the US and UK to investigate how provenance information altered users' accuracy perceptions and trust in visual content shared on social media. We found that provenance information often lowered trust and caused users to doubt deceptive media, particularly when it revealed that the media was composited. We additionally tested conditions where the provenance information itself was shown to be incomplete or invalid, and found that these states have a significant impact on participants' accuracy perceptions and trust in media, leading them, in some cases, to disbelieve honest media. Our findings show that provenance, although enlightening, is still not a concept well-understood by users, who confuse media credibility with the orthogonal (albeit related) concept of provenance credibility. We discuss how design choices may contribute to provenance (mis)understanding, and conclude with implications for usable provenance systems, including clearer interfaces and user education.
△ Less
Submitted 10 September, 2023; v1 submitted 21 March, 2023;
originally announced March 2023.
-
A Light Weight Model for Active Speaker Detection
Authors:
Junhua Liao,
Haihan Duan,
Kanghui Feng,
Wanbing Zhao,
Yanbing Yang,
Liangyin Chen
Abstract:
Active speaker detection is a challenging task in audio-visual scenario understanding, which aims to detect who is speaking in one or more speakers scenarios. This task has received extensive attention as it is crucial in applications such as speaker diarization, speaker tracking, and automatic video editing. The existing studies try to improve performance by inputting multiple candidate informati…
▽ More
Active speaker detection is a challenging task in audio-visual scenario understanding, which aims to detect who is speaking in one or more speakers scenarios. This task has received extensive attention as it is crucial in applications such as speaker diarization, speaker tracking, and automatic video editing. The existing studies try to improve performance by inputting multiple candidate information and designing complex models. Although these methods achieved outstanding performance, their high consumption of memory and computational power make them difficult to be applied in resource-limited scenarios. Therefore, we construct a lightweight active speaker detection architecture by reducing input candidates, splitting 2D and 3D convolutions for audio-visual feature extraction, and applying gated recurrent unit (GRU) with low computational complexity for cross-modal modeling. Experimental results on the AVA-ActiveSpeaker dataset show that our framework achieves competitive mAP performance (94.1% vs. 94.2%), while the resource costs are significantly lower than the state-of-the-art method, especially in model parameters (1.0M vs. 22.5M, about 23x) and FLOPs (0.6G vs. 2.6G, about 4x). In addition, our framework also performs well on the Columbia dataset showing good robustness. The code and model weights are available at https://github.com/Junhua-Liao/Light-ASD.
△ Less
Submitted 8 March, 2023;
originally announced March 2023.
-
Understanding Collaborative Practices and Tools of Professional UX Practitioners in Software Organizations
Authors:
K. J. Kevin Feng,
Tony W. Li,
Amy X. Zhang
Abstract:
User experience (UX) has undergone a revolution in collaborative practices, due to tools that enable quick feedback and continuous collaboration with a varied team across a design's lifecycle. However, it is unclear how this shift in collaboration has been received in professional UX practice, and whether new pain points have arisen. To this end, we conducted a survey (N=114) with UX practitioners…
▽ More
User experience (UX) has undergone a revolution in collaborative practices, due to tools that enable quick feedback and continuous collaboration with a varied team across a design's lifecycle. However, it is unclear how this shift in collaboration has been received in professional UX practice, and whether new pain points have arisen. To this end, we conducted a survey (N=114) with UX practitioners at software organizations based in the U.S. to better understand their collaborative practices and tools used throughout the design process. We found that while an increase in collaborative activity enhanced many aspects of UX work, some long-standing challenges -- such as handing off designs to developers -- still persist. Moreover, we observed new challenges emerging from activities enabled by collaborative tools such as design system management. Based on our findings, we discuss how UX practices can improve collaboration moving forward and provide concrete design implications for collaborative UX tools.
△ Less
Submitted 26 February, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Addressing UX Practitioners' Challenges in Designing ML Applications: an Interactive Machine Learning Approach
Authors:
K. J. Kevin Feng,
David W. McDonald
Abstract:
UX practitioners face novel challenges when designing user interfaces for machine learning (ML)-enabled applications. Interactive ML paradigms, like AutoML and interactive machine teaching, lower the barrier for non-expert end users to create, understand, and use ML models, but their application to UX practice is largely unstudied. We conducted a task-based design study with 27 UX practitioners wh…
▽ More
UX practitioners face novel challenges when designing user interfaces for machine learning (ML)-enabled applications. Interactive ML paradigms, like AutoML and interactive machine teaching, lower the barrier for non-expert end users to create, understand, and use ML models, but their application to UX practice is largely unstudied. We conducted a task-based design study with 27 UX practitioners where we asked them to propose a proof-of-concept design for a new ML-enabled application. During the task, our participants were given opportunities to create, test, and modify ML models as part of their workflows. Through a qualitative analysis of our post-task interview, we found that direct, interactive experimentation with ML allowed UX practitioners to tie ML capabilities and underlying data to user goals, compose affordances to enhance end-user interactions with ML, and identify ML-related ethical risks and challenges. We discuss our findings in the context of previously established human-AI guidelines. We also identify some limitations of interactive ML in UX processes and propose research-informed machine teaching as a supplement to future design tools alongside interactive ML.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Self-solidifying active droplets showing memory-induced chirality
Authors:
Kai Feng,
José Carlos Ureña Marcos,
Aritra K. Mukhopadhyay,
Ran Niu,
Qiang Zhao,
Jinping Qu,
Benno Liebchen
Abstract:
Most synthetic microswimmers do not reach the autonomy of their biological counterparts in terms of energy supply and diversity of motion. Here we report the first all-aqueous droplet swimmer powered by self-generated polyelectrolyte gradients, which shows memory-induced chirality while self-solidifying. An aqueous solution of surface tension-lowering polyelectrolytes self-solidifies on the surfac…
▽ More
Most synthetic microswimmers do not reach the autonomy of their biological counterparts in terms of energy supply and diversity of motion. Here we report the first all-aqueous droplet swimmer powered by self-generated polyelectrolyte gradients, which shows memory-induced chirality while self-solidifying. An aqueous solution of surface tension-lowering polyelectrolytes self-solidifies on the surface of acidic water, during which polyelectrolytes are gradually emitted into the surrounding water and induce linear self-propulsion via spontaneous symmetry breaking. The low diffusion coefficient of the polyelectrolytes leads to long-lived chemical trails which cause memory effects that drive a transition from linear to chiral motion without requiring any imposed symmetry breaking. The droplet swimmer is capable of highly efficient removal (up to 85%) of uranium from aqueous solutions within 90 min, benefiting from self-propulsion and flow-induced mixing. Our results provide a route to fueling self-propelled agents which can autonomously perform chiral motion and collect toxins.
△ Less
Submitted 24 October, 2023; v1 submitted 8 February, 2023;
originally announced February 2023.
-
Spatial Network Calculus and Performance Guarantees in Wireless Networks
Authors:
Ke Feng,
François Baccelli
Abstract:
This work develops a novel approach toward performance guarantees for all links in arbitrarily large wireless networks. It introduces a spatial network calculus, consisting of spatial regulation properties for stationary point processes and the first steps of a calculus for this regulation, which can be seen as an extension to space of the classical network calculus. Specifically, two classes of r…
▽ More
This work develops a novel approach toward performance guarantees for all links in arbitrarily large wireless networks. It introduces a spatial network calculus, consisting of spatial regulation properties for stationary point processes and the first steps of a calculus for this regulation, which can be seen as an extension to space of the classical network calculus. Specifically, two classes of regulations are defined: one includes ball regulation and shot-noise regulation, which are shown to be equivalent and upper constraint interference; the other one includes void regulation, which lower constraints the signal power. These regulations are defined both in the strong and weak sense: the former requires the regulations to hold everywhere in space, whereas the latter only requires the regulations to hold as observed by a jointly stationary point process. Using this approach, we derive performance guarantees in device-to-device, ad hoc, and cellular networks under proper regulations. We give universal bounds on the SINR for all links, which gives link service guarantees based on information-theoretic achievability. They are combined with classical network calculus to provide end-to-end latency guarantees for all packets in wireless queuing networks. Such guarantees do not exist in networks that are not spatially regulated, e.g., Poisson networks.
△ Less
Submitted 6 September, 2023; v1 submitted 3 February, 2023;
originally announced February 2023.
-
VAFER: Signal Decomposition based Mutual Interference Suppression in FMCW Radars
Authors:
Abhilash Gaur,
Po-Hsuan Tseng,
Kai-Ten Feng,
Seshan Srirangarajan
Abstract:
With increasing application of frequency-modulated continuous wave (FMCW) radars in autonomous vehicles, mutual interference among FMCW radars poses a serious threat. Through this paper, we present a novel approach to effectively and elegantly suppress mutual interference in FMCW radars. We first decompose the received signal into modes using variational mode decomposition (VMD) and perform time-f…
▽ More
With increasing application of frequency-modulated continuous wave (FMCW) radars in autonomous vehicles, mutual interference among FMCW radars poses a serious threat. Through this paper, we present a novel approach to effectively and elegantly suppress mutual interference in FMCW radars. We first decompose the received signal into modes using variational mode decomposition (VMD) and perform time-frequency analysis using Fourier synchrosqueezed transform (FSST). The interference-suppressed signal is then reconstructed by applying a proposed energy-entropy-based thresholding operation on the time-frequency spectra of VMD modes. The effectiveness of proposed method is measured in terms of signal-to-interference plus noise ratio (SINR) and correlation coefficient for both simulated and experimental automotive radar data in the presence of FMCW interference. Compared to other existing literature, our proposed method demonstrates significant improvement in the output SINR by at least 14.07 dB for simulated data and 9.87 dB for experimental data.
△ Less
Submitted 29 December, 2022; v1 submitted 28 December, 2022;
originally announced December 2022.
-
BTS: Bifold Teacher-Student in Semi-Supervised Learning for Indoor Two-Room Presence Detection Under Time-Varying CSI
Authors:
Li-Hsiang Shen,
Kai-Jui Chen,
An-Hung Hsiao,
Kai-Ten Feng
Abstract:
In recent years, indoor human presence detection based on supervised learning (SL) and channel state information (CSI) has attracted much attention. However, existing studies that rely on spatial information of CSI are susceptible to environmental changes which degrade prediction accuracy. Moreover, SL-based methods require time-consuming data labeling for retraining models. Therefore, it is imper…
▽ More
In recent years, indoor human presence detection based on supervised learning (SL) and channel state information (CSI) has attracted much attention. However, existing studies that rely on spatial information of CSI are susceptible to environmental changes which degrade prediction accuracy. Moreover, SL-based methods require time-consuming data labeling for retraining models. Therefore, it is imperative to design a continuously monitored model using a semi-supervised learning (SSL) based scheme. In this paper, we conceive a bifold teacher-student (BTS) learning approach for indoor human presence detection in an adjoining two-room scenario. The proposed SSL-based primal-dual teacher-student network intelligently learns spatial and temporal features from labeled and unlabeled CSI datasets. Additionally, the enhanced penalized loss function leverages entropy and distance measures to distinguish drifted data, i.e., features of new datasets affected by time-varying effects and altered from the original distribution. Experimental results demonstrate that the proposed BTS system sustains asymptotic accuracy after retraining the model with unlabeled data. Furthermore, BTS outperforms existing SSL-based models in terms of the highest detection accuracy while achieving the asymptotic performance of SL-based methods.
△ Less
Submitted 6 June, 2023; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Five Facets of 6G: Research Challenges and Opportunities
Authors:
Li-Hsiang Shen,
Kai-Ten Feng,
Lajos Hanzo
Abstract:
Whilst the fifth-generation (5G) systems are being rolled out across the globe, researchers have turned their attention to the exploration of radical next-generation solutions. At this early evolutionary stage we survey five main research facets of this field, namely {\em Facet~1: next-generation architectures, spectrum and services, Facet~2: next-generation networking, Facet~3: Internet of Things…
▽ More
Whilst the fifth-generation (5G) systems are being rolled out across the globe, researchers have turned their attention to the exploration of radical next-generation solutions. At this early evolutionary stage we survey five main research facets of this field, namely {\em Facet~1: next-generation architectures, spectrum and services, Facet~2: next-generation networking, Facet~3: Internet of Things (IoT), Facet~4: wireless positioning and sensing, as well as Facet~5: applications of deep learning in 6G networks.} In this paper, we have provided a critical appraisal of the literature of promising techniques ranging from the associated architectures, networking, applications as well as designs. We have portrayed a plethora of heterogeneous architectures relying on cooperative hybrid networks supported by diverse access and transmission mechanisms. The vulnerabilities of these techniques are also addressed and carefully considered for highlighting the most of promising future research directions. Additionally, we have listed a rich suite of learning-driven optimization techniques. We conclude by observing the evolutionary paradigm-shift that has taken place from pure single-component bandwidth-efficiency, power-efficiency or delay-optimization towards multi-component designs, as exemplified by the twin-component ultra-reliable low-latency mode of the 5G system. We advocate a further evolutionary step towards multi-component Pareto optimization, which requires the exploration of the entire Pareto front of all optiomal solutions, where none of the components of the objective function may be improved without degrading at least one of the other components.
△ Less
Submitted 7 November, 2022;
originally announced December 2022.
-
Arithmetic autocorrelation distribution of binary $m$-sequences
Authors:
Xiaoyan Jing,
Aixian Zhang,
Keqin Feng
Abstract:
Binary $m$-sequences are ones with the largest period $n=2^m-1$ among the binary sequences produced by linear shift registers with length $m$. They have a wide range of applications in communication since they have several desirable pseudorandomness such as balance, uniform pattern distribution and ideal (classical) autocorrelation. In his reseach on arithmetic codes, Mandelbaum \cite{9Mand} intro…
▽ More
Binary $m$-sequences are ones with the largest period $n=2^m-1$ among the binary sequences produced by linear shift registers with length $m$. They have a wide range of applications in communication since they have several desirable pseudorandomness such as balance, uniform pattern distribution and ideal (classical) autocorrelation. In his reseach on arithmetic codes, Mandelbaum \cite{9Mand} introduces a 2-adic version of classical autocorrelation of binary sequences, called arithmetic autocorrelation. Later, Goresky and Klapper \cite{3G1,4G2,5G3,6G4} generalize this notion to nonbinary case and develop several properties of arithmetic autocorrelation related to linear shift registers with carry. Recently, Z. Chen et al. \cite{1C1} show an upper bound on arithmetic autocorrelation of binary $m$-sequences and raise a conjecture on absolute value distribution on arithmetic autocorrelation of binary $m$-sequences.
△ Less
Submitted 30 November, 2022;
originally announced November 2022.
-
CRONOS: Colorization and Contrastive Learning for Device-Free NLoS Human Presence Detection using Wi-Fi CSI
Authors:
Li-Hsiang Shen,
Chia-Che Hsieh,
An-Hung Hsiao,
Kai-Ten Feng
Abstract:
In recent years, the demand for pervasive smart services and applications has increased rapidly. Device-free human detection through sensors or cameras has been widely adopted, but it comes with privacy issues as well as misdetection for motionless people. To address these drawbacks, channel state information (CSI) captured from commercialized Wi-Fi devices provides rich signal features for accura…
▽ More
In recent years, the demand for pervasive smart services and applications has increased rapidly. Device-free human detection through sensors or cameras has been widely adopted, but it comes with privacy issues as well as misdetection for motionless people. To address these drawbacks, channel state information (CSI) captured from commercialized Wi-Fi devices provides rich signal features for accurate detection. However, existing systems suffer from inaccurate classification under a non-line-of-sight (NLoS) and stationary scenario, such as when a person is standing still in a room corner. In this work, we propose a system called CRONOS (Colorization and Contrastive Learning Enhanced NLoS Human Presence Detection), which generates dynamic recurrence plots (RPs) and color-coded CSI ratios to distinguish mobile and stationary people from vacancy in a room, respectively. We also incorporate supervised contrastive learning to retrieve substantial representations, where consultation loss is formulated to differentiate the representative distances between dynamic and stationary cases. Furthermore, we propose a self-switched static feature enhanced classifier (S3FEC) to determine the utilization of either RPs or color-coded CSI ratios. Our comprehensive experimental results show that CRONOS outperforms existing systems that either apply machine learning or non-learning based methods, as well as non-CSI based features in open literature. CRONOS achieves the highest human presence detection accuracy in vacancy, mobility, line-of-sight (LoS), and NLoS scenarios.
△ Less
Submitted 16 August, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Dimensional homogeneity constrained gene expression programming for discovering governing equations
Authors:
Wenjun Ma,
Jun Zhang,
Kaikai Feng,
Haoyun Xing,
Dongsheng Wen
Abstract:
Data-driven discovery of governing equations is of great significance for helping us understand intrinsic mechanisms and build physical models. Recently, numerous highly innovative algorithms have emerged, aimed at inversely discovering the underlying governing equations from data, such as sparse regression-based methods and symbolic regression-based methods. Along this direction, a novel dimensio…
▽ More
Data-driven discovery of governing equations is of great significance for helping us understand intrinsic mechanisms and build physical models. Recently, numerous highly innovative algorithms have emerged, aimed at inversely discovering the underlying governing equations from data, such as sparse regression-based methods and symbolic regression-based methods. Along this direction, a novel dimensional homogeneity constrained gene expression programming (DHC-GEP) method is proposed in this work. DHC-GEP simultaneously discovers the forms and coefficients of functions using basic mathematical operators and physical variables, without requiring pre-assumed candidate functions. The constraint of dimensional homogeneity is capable of filtering out the overfitting equations effectively. The key advantages of DHC-GEP compared to Original-GEP, including being more robust to hyperparameters, the noise level and the size of datasets, are demonstrated on two benchmark studies. Furthermore, DHC-GEP is employed to discover the unknown constitutive relations of two representative non-equilibrium flows. Galilean invariance and the second law of thermodynamics are imposed as constraints to enhance the reliability of the discovered constitutive relations. Comparisons, both quantitative and qualitative, indicate that the derived constitutive relations are more accurate than the conventional Burnett equations in a wide range of Knudsen number and Mach number, and are also applicable to the cases beyond the parameter space of the training data.
△ Less
Submitted 28 March, 2024; v1 submitted 15 November, 2022;
originally announced November 2022.
-
MARS: Message Passing for Antenna and RF Chain Selection for Hybrid Beamforming in MIMO Communication Systems
Authors:
Li-Hsiang Shen,
Yen-Chun Lo,
Kai-Ten Feng,
Sau-Hsuan Wu,
Lie-Liang Yang
Abstract:
In this paper, we consider a prospective receiving hybrid beamforming structure consisting of several radio frequency (RF) chains and abundant antenna elements in multi-input multi-output (MIMO) systems. Due to conventional costly full connections, we design an enhanced partially connected beamformer employing a low-density parity-check (LDPC)-based structure. As a benefit of the LDPC-based struct…
▽ More
In this paper, we consider a prospective receiving hybrid beamforming structure consisting of several radio frequency (RF) chains and abundant antenna elements in multi-input multi-output (MIMO) systems. Due to conventional costly full connections, we design an enhanced partially connected beamformer employing a low-density parity-check (LDPC)-based structure. As a benefit of the LDPC-based structure, information can be exchanged among clustered RF/antenna groups, which results in a low computational complexity order. Advanced message passing (MP) capable of inferring and transferring information among different paths is designed to support the LDPC-based hybrid beamformer. We propose a message-passing enhanced antenna and RF chain selection (MARS) scheme for minimizing the operational power of antennas and RF chains of the receiver as well as hybrid beamforming. Furthermore, sequential and parallel MP schemes for MARS are designed, namely, MARS-S and MARS-P, respectively, to address the convergence speed issue. A heuristic genetic algorithm is designed for receiving hybrid beamforming, comprising gene generation initialization, elite selection, crossover, and mutation. Simulations validate the convergence of both the MARS-P and the MARS-S algorithms. Due to the asynchronous information transfer of MARS-P, it requires higher power than MARS-S, which strikes a compelling balance among power consumption, convergence, and computational complexity. It is also demonstrated that the proposed MARS scheme outperforms the existing benchmarks using the heuristic method of fully/partially connected architectures in the open literature by requiring the lowest power and realizing the highest energy efficiency.
△ Less
Submitted 20 May, 2024; v1 submitted 7 November, 2022;
originally announced November 2022.
-
A knowledge-driven vowel-based approach of depression classification from speech using data augmentation
Authors:
Kexin Feng,
Theodora Chaspari
Abstract:
We propose a novel explainable machine learning (ML) model that identifies depression from speech, by modeling the temporal dependencies across utterances and utilizing the spectrotemporal information at the vowel level. Our method first models the variable-length utterances at the local-level into a fixed-size vowel-based embedding using a convolutional neural network with a spatial pyramid pooli…
▽ More
We propose a novel explainable machine learning (ML) model that identifies depression from speech, by modeling the temporal dependencies across utterances and utilizing the spectrotemporal information at the vowel level. Our method first models the variable-length utterances at the local-level into a fixed-size vowel-based embedding using a convolutional neural network with a spatial pyramid pooling layer ("vowel CNN"). Following that, the depression is classified at the global-level from a group of vowel CNN embeddings that serve as the input of another 1D CNN ("depression CNN"). Different data augmentation methods are designed for both the training of vowel CNN and depression CNN. We investigate the performance of the proposed system at various temporal granularities when modeling short, medium, and long analysis windows, corresponding to 10, 21, and 42 utterances, respectively. The proposed method reaches comparable performance with previous state-of-the-art approaches and depicts explainable properties with respect to the depression outcome. The findings from this work may benefit clinicians by providing additional intuitions during joint human-ML decision-making tasks.
△ Less
Submitted 27 October, 2022;
originally announced October 2022.
-
A few-shot learning approach with domain adaptation for personalized real-life stress detection in close relationships
Authors:
Kexin Feng,
Jacqueline B. Duong,
Kayla E. Carta,
Sierra Walters,
Gayla Margolin,
Adela C. Timmons,
Theodora Chaspari
Abstract:
We design a metric learning approach that aims to address computational challenges that yield from modeling human outcomes from ambulatory real-life data. The proposed metric learning is based on a Siamese neural network (SNN) that learns the relative difference between pairs of samples from a target user and non-target users, thus being able to address the scarcity of labelled data from the targe…
▽ More
We design a metric learning approach that aims to address computational challenges that yield from modeling human outcomes from ambulatory real-life data. The proposed metric learning is based on a Siamese neural network (SNN) that learns the relative difference between pairs of samples from a target user and non-target users, thus being able to address the scarcity of labelled data from the target. The SNN further minimizes the Wasserstein distance of the learned embeddings between target and non-target users, thus mitigating the distribution mismatch between the two. Finally, given the fact that the base rate of focal behaviors is different per user, the proposed method approximates the focal base rate based on labelled samples that lay closest to the target, based on which further minimizes the Wasserstein distance. Our method is exemplified for the purpose of hourly stress classification using real-life multimodal data from 72 dating couples. Results in few-shot and one-shot learning experiments indicate that proposed formulation benefits stress classification and can help mitigate the aforementioned challenges.
△ Less
Submitted 27 October, 2022;
originally announced October 2022.
-
ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Authors:
Minheng Ni,
Zitong Huang,
Kailai Feng,
Wangmeng Zuo
Abstract:
Without the demand of training in reality, humans can easily detect a known concept simply based on its language description. Empowering deep learning with this ability undoubtedly enables the neural network to handle complex vision tasks, e.g., object detection, without collecting and annotating real images. To this end, this paper introduces a novel challenging learning paradigm Imaginary-Superv…
▽ More
Without the demand of training in reality, humans can easily detect a known concept simply based on its language description. Empowering deep learning with this ability undoubtedly enables the neural network to handle complex vision tasks, e.g., object detection, without collecting and annotating real images. To this end, this paper introduces a novel challenging learning paradigm Imaginary-Supervised Object Detection (ISOD), where neither real images nor manual annotations are allowed for training object detectors. To resolve this challenge, we propose ImaginaryNet, a framework to synthesize images by combining pretrained language model and text-to-image synthesis model. Given a class label, the language model is used to generate a full description of a scene with a target object, and the text-to-image model deployed to generate a photo-realistic image. With the synthesized images and class labels, weakly supervised object detection can then be leveraged to accomplish ISOD. By gradually introducing real images and manual annotations, ImaginaryNet can collaborate with other supervision settings to further boost detection performance. Experiments show that ImaginaryNet can (i) obtain about 70% performance in ISOD compared with the weakly supervised counterpart of the same backbone trained on real data, (ii) significantly improve the baseline while achieving state-of-the-art or comparable performance by incorporating ImaginaryNet with other supervision settings.
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
Toward Knowledge-Driven Speech-Based Models of Depression: Leveraging Spectrotemporal Variations in Speech Vowels
Authors:
Kexin Feng,
Theodora Chaspari
Abstract:
Psychomotor retardation associated with depression has been linked with tangible differences in vowel production. This paper investigates a knowledge-driven machine learning (ML) method that integrates spectrotemporal information of speech at the vowel-level to identify the depression. Low-level speech descriptors are learned by a convolutional neural network (CNN) that is trained for vowel classi…
▽ More
Psychomotor retardation associated with depression has been linked with tangible differences in vowel production. This paper investigates a knowledge-driven machine learning (ML) method that integrates spectrotemporal information of speech at the vowel-level to identify the depression. Low-level speech descriptors are learned by a convolutional neural network (CNN) that is trained for vowel classification. The temporal evolution of those low-level descriptors is modeled at the high-level within and across utterances via a long short-term memory (LSTM) model that takes the final depression decision. A modified version of the Local Interpretable Model-agnostic Explanations (LIME) is further used to identify the impact of the low-level spectrotemporal vowel variation on the decisions and observe the high-level temporal change of the depression likelihood. The proposed method outperforms baselines that model the spectrotemporal information in speech without integrating the vowel-based information, as well as ML models trained with conventional prosodic and spectrotemporal features. The conducted explainability analysis indicates that spectrotemporal information corresponding to non-vowel segments less important than the vowel-based information. Explainability of the high-level information capturing the segment-by-segment decisions is further inspected for participants with and without depression. The findings from this work can provide the foundation toward knowledge-driven interpretable decision-support systems that can assist clinicians to better understand fine-grain temporal changes in speech data, ultimately augmenting mental health diagnosis and care.
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
The 4-Adic Complexity of Interleaved Quaternary Sequences of Even Length with Optimal Autocorrelation
Authors:
Xiaoyan Jing,
Zhefeng Xu,
Minghui Yang,
Keqin Feng
Abstract:
Su et al. proposed several new classes of quaternary sequences of even length with optimal autocorrelation interleaved by twin-prime sequences pairs, GMW sequences pairs or binary cyclotomic sequences of order four in \cite{S1}. In this paper, we determine the 4-adic complexity of these quaternary sequences with period $2n$ by using correlation function and the "Gauss periods" of order four and "q…
▽ More
Su et al. proposed several new classes of quaternary sequences of even length with optimal autocorrelation interleaved by twin-prime sequences pairs, GMW sequences pairs or binary cyclotomic sequences of order four in \cite{S1}. In this paper, we determine the 4-adic complexity of these quaternary sequences with period $2n$ by using correlation function and the "Gauss periods" of order four and "quadratic Gauss sums" on finite field $\mathbb{F}_n$ and valued in $\mathbb{Z}^{*}_{4^{2n}-1}$. Our results show that they are safe enough to resist the attack of the rational approximation algorithm.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
Entropy Induced Pruning Framework for Convolutional Neural Networks
Authors:
Yiheng Lu,
Ziyu Guan,
Yaming Yang,
Maoguo Gong,
Wei Zhao,
Kaiyuan Feng
Abstract:
Structured pruning techniques have achieved great compression performance on convolutional neural networks for image classification task. However, the majority of existing methods are weight-oriented, and their pruning results may be unsatisfactory when the original model is trained poorly. That is, a fully-trained model is required to provide useful weight information. This may be time-consuming,…
▽ More
Structured pruning techniques have achieved great compression performance on convolutional neural networks for image classification task. However, the majority of existing methods are weight-oriented, and their pruning results may be unsatisfactory when the original model is trained poorly. That is, a fully-trained model is required to provide useful weight information. This may be time-consuming, and the pruning results are sensitive to the updating process of model parameters. In this paper, we propose a metric named Average Filter Information Entropy (AFIE) to measure the importance of each filter. It is calculated by three major steps, i.e., low-rank decomposition of the "input-output" matrix of each convolutional layer, normalization of the obtained eigenvalues, and calculation of filter importance based on information entropy. By leveraging the proposed AFIE, the proposed framework is able to yield a stable importance evaluation of each filter no matter whether the original model is trained fully. We implement our AFIE based on AlexNet, VGG-16, and ResNet-50, and test them on MNIST, CIFAR-10, and ImageNet, respectively. The experimental results are encouraging. We surprisingly observe that for our methods, even when the original model is only trained with one epoch, the importance evaluation of each filter keeps identical to the results when the model is fully-trained. This indicates that the proposed pruning strategy can perform effectively at the beginning stage of the training process for the original model.
△ Less
Submitted 13 August, 2022;
originally announced August 2022.
-
SBPF: Sensitiveness Based Pruning Framework For Convolutional Neural Network On Image Classification
Authors:
Yiheng Lu,
Maoguo Gong,
Wei Zhao,
Kaiyuan Feng,
Hao Li
Abstract:
Pruning techniques are used comprehensively to compress convolutional neural networks (CNNs) on image classification. However, the majority of pruning methods require a well pre-trained model to provide useful supporting parameters, such as C1-norm, BatchNorm value and gradient information, which may lead to inconsistency of filter evaluation if the parameters of the pre-trained model are not well…
▽ More
Pruning techniques are used comprehensively to compress convolutional neural networks (CNNs) on image classification. However, the majority of pruning methods require a well pre-trained model to provide useful supporting parameters, such as C1-norm, BatchNorm value and gradient information, which may lead to inconsistency of filter evaluation if the parameters of the pre-trained model are not well optimized. Therefore, we propose a sensitiveness based method to evaluate the importance of each layer from the perspective of inference accuracy by adding extra damage for the original model. Because the performance of the accuracy is determined by the distribution of parameters across all layers rather than individual parameter, the sensitiveness based method will be robust to update of parameters. Namely, we can obtain similar importance evaluation of each convolutional layer between the imperfect-trained and fully trained models. For VGG-16 on CIFAR-10, even when the original model is only trained with 50 epochs, we can get same evaluation of layer importance as the results when the model is trained fully. Then we will remove filters proportional from each layer by the quantified sensitiveness. Our sensitiveness based pruning framework is verified efficiently on VGG-16, a customized Conv-4 and ResNet-18 with CIFAR-10, MNIST and CIFAR-100, respectively.
△ Less
Submitted 9 August, 2022;
originally announced August 2022.
-
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Authors:
Hanjie Li,
Changsheng Li,
Kaituo Feng,
Ye Yuan,
Guoren Wang,
Hongyuan Zha
Abstract:
Graph structured data often possess dynamic characters in nature. Recent years have witnessed the increasing attentions paid to dynamic graph neural networks for modelling graph data. However, almost all existing approaches operate under the assumption that, upon the establishment of a new link, the embeddings of the neighboring nodes should undergo updates to learn temporal dynamics. Nevertheless…
▽ More
Graph structured data often possess dynamic characters in nature. Recent years have witnessed the increasing attentions paid to dynamic graph neural networks for modelling graph data. However, almost all existing approaches operate under the assumption that, upon the establishment of a new link, the embeddings of the neighboring nodes should undergo updates to learn temporal dynamics. Nevertheless, these approaches face the following limitation: If the node introduced by a new connection contains noisy information, propagating its knowledge to other nodes becomes unreliable and may even lead to the collapse of the model. In this paper, we propose Ada-DyGNN: a robust knowledge Adaptation framework via reinforcement learning for Dynamic Graph Neural Networks. In contrast to previous approaches, which update the embeddings of the neighbor nodes immediately after adding a new link, Ada-DyGNN adaptively determines which nodes should be updated. Considering that the decision to update the embedding of one neighbor node can significantly impact other neighbor nodes, we conceptualize the node update selection as a sequence decision problem and employ reinforcement learning to address it effectively. By this means, we can adaptively propagate knowledge to other nodes for learning robust node embedding representations. To the best of our knowledge, our approach constitutes the first attempt to explore robust knowledge adaptation via reinforcement learning specifically tailored for dynamic graph neural networks. Extensive experiments on three benchmark datasets demonstrate that Ada-DyGNN achieves the state-of-the-art performance. In addition, we conduct experiments by introducing different degrees of noise into the dataset, quantitatively and qualitatively illustrating the robustness of Ada-DyGNN.
△ Less
Submitted 11 April, 2024; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Field-Induced Magnetic States in the Metallic Rare-Earth Layered Triangular Antiferromagnet TbAuAl$_4$Ge$_2$
Authors:
Ian A. Leahy,
Keke Feng,
Roei Dey,
Ryan Baumbach,
Minhyea Lee
Abstract:
Magnetic frustration in metallic rare earth lanthanides ($Ln$) with $4f$-electrons is crucial for producing interesting magnetic phases with high magnetic anisotropy where intertwined charge and spin degrees of freedom lead to novel phenomena. Here we report on the magnetic, thermodynamic, and electrical transport properties of TbAuAl$_4$Ge$_2$. Tb ions form 2-dimensional triangular lattice layers…
▽ More
Magnetic frustration in metallic rare earth lanthanides ($Ln$) with $4f$-electrons is crucial for producing interesting magnetic phases with high magnetic anisotropy where intertwined charge and spin degrees of freedom lead to novel phenomena. Here we report on the magnetic, thermodynamic, and electrical transport properties of TbAuAl$_4$Ge$_2$. Tb ions form 2-dimensional triangular lattice layers which stack along the crystalline $c$-axis. The magnetic phase diagram reveals multiple nearly degenerate ordered states upon applying field along the magnetically easy $ab$-plane before saturation. The magnetoresistance in this configuration exhibits intricate field dependence that closely follows that of the magnetization while the specific heat reveals a region of highly enhanced entropy, suggesting the possibility of a non-trivial spin textured phase. For fields applied along the $c$-axis (hard axis), we find linear magnetoresistance over a wide range of fields. We compare the magnetic properties and magnetoresistance with an isostructral GdAuAl$_4$Ge$_2$ single crystals. These results identify TbAuAl$_4$Ge$_2$ as an environment for complex quantum spin states and pave the way for further investigations of the broader $Ln$AuAl$_4$Ge$_2$ family of materials.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks
Authors:
Kaituo Feng,
Changsheng Li,
Ye Yuan,
Guoren Wang
Abstract:
Knowledge distillation (KD) has demonstrated its effectiveness to boost the performance of graph neural networks (GNNs), where its goal is to distill knowledge from a deeper teacher GNN into a shallower student GNN. However, it is actually difficult to train a satisfactory teacher GNN due to the well-known over-parametrized and over-smoothing issues, leading to invalid knowledge transfer in practi…
▽ More
Knowledge distillation (KD) has demonstrated its effectiveness to boost the performance of graph neural networks (GNNs), where its goal is to distill knowledge from a deeper teacher GNN into a shallower student GNN. However, it is actually difficult to train a satisfactory teacher GNN due to the well-known over-parametrized and over-smoothing issues, leading to invalid knowledge transfer in practical applications. In this paper, we propose the first Free-direction Knowledge Distillation framework via Reinforcement learning for GNNs, called FreeKD, which is no longer required to provide a deeper well-optimized teacher GNN. The core idea of our work is to collaboratively build two shallower GNNs in an effort to exchange knowledge between them via reinforcement learning in a hierarchical way. As we observe that one typical GNN model often has better and worse performances at different nodes during training, we devise a dynamic and free-direction knowledge transfer strategy that consists of two levels of actions: 1) node-level action determines the directions of knowledge transfer between the corresponding nodes of two networks; and then 2) structure-level action determines which of the local structures generated by the node-level actions to be propagated. In essence, our FreeKD is a general and principled framework which can be naturally compatible with GNNs of different architectures. Extensive experiments on five benchmark datasets demonstrate our FreeKD outperforms two base GNNs in a large margin, and shows its efficacy to various GNNs. More surprisingly, our FreeKD has comparable or even better performance than traditional KD algorithms that distill knowledge from a deeper and stronger teacher GNN.
△ Less
Submitted 27 March, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Electrically pumped quantum-dot lasers grown on 300 mm patterned Si photonic wafers
Authors:
Chen Shang,
Kaiyin Feng,
Eamonn T. Hughes,
Andrew Clark,
Mukul Debnath,
Rosalyn Koscica,
Gerald Leake,
Joshua Herman,
David Harame,
Peter Ludewig,
Yating Wan,
John E. Bowers
Abstract:
Monolithic integration of quantum dot (QD) gain materials onto Si photonic platforms via direct epitaxial growth is a promising solution for on-chip light sources. Recent developments have demonstrated superior device reliability in blanket hetero-epitaxy of III-V devices on Si at elevated temperatures. Yet, thick, defect management epi designs prevent vertical light coupling from the gain region…
▽ More
Monolithic integration of quantum dot (QD) gain materials onto Si photonic platforms via direct epitaxial growth is a promising solution for on-chip light sources. Recent developments have demonstrated superior device reliability in blanket hetero-epitaxy of III-V devices on Si at elevated temperatures. Yet, thick, defect management epi designs prevent vertical light coupling from the gain region to the Si-on-Insulator (SOI) waveguides. Here, we demonstrate the first electrically pumped QD lasers grown on a 300 mm patterned (001) Si wafer with a butt-coupled configuration by molecular beam epitaxy (MBE). Unique growth and fabrication challenges imposed by the template architecture have been resolved, contributing to continuous wave lasing to 60 °C and a maximum double-side output power of 126.6 mW at 20 °C with a double-side wall plug efficiency of 8.6%. The potential for robust on-chip laser operation and efficient low-loss light coupling to Si photonic circuits makes this heteroepitaxial integration platform on Si promising for scalable and low-cost mass production.
△ Less
Submitted 2 June, 2022;
originally announced June 2022.
-
Magnetic Ordering in GdAuAl$_4$Ge$_2$ and TbAuAl$_4$Ge$_2$: layered compounds with triangular lanthanide nets
Authors:
Keke Feng,
Ian Andreas Leahy,
Olatunde Oladehin,
Kaya Wei,
Minhyea Lee,
Ryan Baumbach
Abstract:
We report the synthesis of the entire $Ln$AuAl$_4$Ge$_2$ ($Ln$ = Y, Pr, Nd, Sm, Gd, Tb, Dy, Ho, Er, and Tm) series and focus on the magnetic properties of GdAuAl$_4$Ge$_2$ and TbAuAl$_4$Ge$_2$. Temperature and magnetic field dependent magnetization, heat capacity, and electrical resistivity measurements reveal that both compounds exhibit several magnetically ordered states at low temperatures, wit…
▽ More
We report the synthesis of the entire $Ln$AuAl$_4$Ge$_2$ ($Ln$ = Y, Pr, Nd, Sm, Gd, Tb, Dy, Ho, Er, and Tm) series and focus on the magnetic properties of GdAuAl$_4$Ge$_2$ and TbAuAl$_4$Ge$_2$. Temperature and magnetic field dependent magnetization, heat capacity, and electrical resistivity measurements reveal that both compounds exhibit several magnetically ordered states at low temperatures, with evidence for magnetic fluctuations extending into the paramagnetic temperature region. For magnetic fields applied in the $ab$-plane there are several ordered state regions that are associated with metamagnetic phase transitions, consistent with there being multiple nearly degenerate ground states. Despite Gd being an isotropic $S$-state ion and Tb having an anisotropic $J$-state, there are similarities in the phase diagrams for the two compounds, suggesting that factors such as the symmetry of the crystalline lattice, which features well separated triangular planes of lanthanide ions, or the Ruderman-Kittel-Kasuya-Yosida interaction as defined by the Fermi surface topography control the magnetism. We also point out similarities to other centrosymmetric compounds that host skyrmion lattices such as Gd$_2$PdSi$_3$, and propose that the $Ln$AuAl$_4$Ge$_2$ family of compounds are of interest as reservoirs for complex magnetism and electronic behaviors such as the topological Hall effect.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.
-
Sound attenuation in the hyperhoneycomb Kitaev spin liquid
Authors:
Kexin Feng,
Aysel Shiralieva,
Natalia B. Perkins
Abstract:
In recent years, it has been shown that the phonon dynamics may serve as an indirect probe of fractionalization of spin degrees of freedom. Here we propose that the sound attenuation measurements allows for the characterization and identification of the Kitaev quantum spin liquid on the hyperhoneycomb lattice, which is particularly interesting since the strong Kitaev interaction was observed in th…
▽ More
In recent years, it has been shown that the phonon dynamics may serve as an indirect probe of fractionalization of spin degrees of freedom. Here we propose that the sound attenuation measurements allows for the characterization and identification of the Kitaev quantum spin liquid on the hyperhoneycomb lattice, which is particularly interesting since the strong Kitaev interaction was observed in the the hyperhoneycomb magnet $β$-Li$_2$IrO$_3$. To this end we consider the low-temperature scattering between acoustic phonons and gapless Majorana fermions with nodal-line band structure. We find that the sound attenuation has a characteristic angular dependence, which is explicitly shown for the high-symmetry planes at temperatures below the flux gap energy.
△ Less
Submitted 16 May, 2022;
originally announced May 2022.
-
Sparse Regularized Correlation Filter for UAV Object Tracking with adaptive Contextual Learning and Keyfilter Selection
Authors:
Zhangjian Ji,
Kai Feng,
Yuhua Qian,
Jiye Liang
Abstract:
Recently, correlation filter has been widely applied in unmanned aerial vehicle (UAV) tracking due to its high frame rates, robustness and low calculation resources. However, it is fragile because of two inherent defects, i.e, boundary effect and filter corruption. Some methods by enlarging the search area can mitigate the boundary effect, yet introducing the undesired background distractors. Anot…
▽ More
Recently, correlation filter has been widely applied in unmanned aerial vehicle (UAV) tracking due to its high frame rates, robustness and low calculation resources. However, it is fragile because of two inherent defects, i.e, boundary effect and filter corruption. Some methods by enlarging the search area can mitigate the boundary effect, yet introducing the undesired background distractors. Another approaches can alleviate the temporal degeneration of learned filters by introducing the temporal regularizer, which depends on the assumption that the filers between consecutive frames should be coherent. In fact, sometimes the filers at the ($t-1$)th frame is vulnerable to heavy occlusion from backgrounds, which causes that the assumption does not hold. To handle them, in this work, we propose a novel $\ell_{1}$ regularization correlation filter with adaptive contextual learning and keyfilter selection for UAV tracking. Firstly, we adaptively detect the positions of effective contextual distractors by the aid of the distribution of local maximum values on the response map of current frame which is generated by using the previous correlation filter model. Next, we eliminate inconsistent labels for the tracked target by removing one on each distractor and develop a new score scheme for each distractor. Then, we can select the keyfilter from the filters pool by finding the maximal similarity between the target at the current frame and the target template corresponding to each filter in the filters pool. Finally, quantitative and qualitative experiments on three authoritative UAV datasets show that the proposed method is superior to the state-of-the-art tracking methods based on correlation filter framework.
△ Less
Submitted 12 October, 2022; v1 submitted 7 May, 2022;
originally announced May 2022.
-
Magnetic properties of equiatomic CrMnFeCoNi
Authors:
Timothy A. Elmslie,
Jacob Startt,
Sujeily Soto-Medina,
Yang Yang,
Keke Feng,
Ryan E. Baumbach,
Emma Zappala,
Gerald D. Morris,
Benjamin A. Frandsen,
Mark W. Meisel,
Michele V. Manuel,
Rémi Dingreville,
James J. Hamlin
Abstract:
Magnetic, specific heat, and structural properties of the equiatomic Cantor alloy system are reported for temperatures between 5 kelvin and 300 kelvin, and up to fields of 70 kilo-oersted. Magnetization measurements performed on as-cast, annealed, and cold-worked samples reveal a strong processing history dependence and that high-temperature annealing after cold-working does not restore the alloy…
▽ More
Magnetic, specific heat, and structural properties of the equiatomic Cantor alloy system are reported for temperatures between 5 kelvin and 300 kelvin, and up to fields of 70 kilo-oersted. Magnetization measurements performed on as-cast, annealed, and cold-worked samples reveal a strong processing history dependence and that high-temperature annealing after cold-working does not restore the alloy to a pristine state. Measurements on known precipitates show that the two transitions, detected at 43 kelvin and 85 kelvin, are intrinsic to the Cantor alloy and not the result of an impurity phase. Experimental and ab initio density functional theory (DFT) computational results suggest that these transitions are a weak ferrimagnetic transition and a spin-glass-like transition, respectively, and magnetic and specific heat measurements provide evidence of significant Stoner enhancement and electron-electron interactions within the material.
△ Less
Submitted 22 February, 2022;
originally announced February 2022.
-
Optimal Combinatorial Neural Codes with Matched Metric $δ_{r}$: Characterization and Constructions
Authors:
Aixian Zhang,
Xiaoyan Jin,
Keqin Feng
Abstract:
Based on the theoretical neuroscience, G. Cotardo and A. Ravagnavi in \cite{CR} introduced a kind of asymmetric binary codes called combinatorial neural codes (CN codes for short), with a "matched metric" $δ_{r}$ called asymmetric discrepancy, instead of the Hamming distance $d_{H}$ for usual error-correcting codes. They also presented the Hamming, Singleton and Plotkin bounds for CN codes with re…
▽ More
Based on the theoretical neuroscience, G. Cotardo and A. Ravagnavi in \cite{CR} introduced a kind of asymmetric binary codes called combinatorial neural codes (CN codes for short), with a "matched metric" $δ_{r}$ called asymmetric discrepancy, instead of the Hamming distance $d_{H}$ for usual error-correcting codes. They also presented the Hamming, Singleton and Plotkin bounds for CN codes with respect to $δ_{r}$ and asked how to construct the CN codes $\cC$ with large size $|\cC|$ and $δ_{r}(\cC).$ In this paper we firstly show that a binary code $\cC$ reaches one of the above bounds for $δ_{r}(\cC)$ if and only if $\cC$ reaches the corresponding bounds for $d_H$ and $r$ is sufficiently closed to 1. This means that all optimal CN codes come from the usual optimal codes. %(perfect codes, MDS codes or the codes meet the usual Plotkin bound). Secondly we present several constructions of CN codes with nice and flexible parameters $(n,K, δ_r(\cC))$ by using bent functions.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
Reconfigurable Intelligent Surface-Empowered Self-Interference Cancellation for 6G Full-Duplex MIMO Communication Systems
Authors:
Chia-Jou Ku,
Li-Hsiang Shen,
Kai-Ten Feng
Abstract:
Substantially increasing wireless traffic and extending serving coverage is required with the advent of sixth-generation (6G) wireless communication networks. Reconfigurable intelligent surface (RIS) is widely considered as a promising technique which is capable of improving the system sum rate and energy efficiency. Moreover, full-duplex (FD) multi-input-multi-output (MIMO) transmission provides…
▽ More
Substantially increasing wireless traffic and extending serving coverage is required with the advent of sixth-generation (6G) wireless communication networks. Reconfigurable intelligent surface (RIS) is widely considered as a promising technique which is capable of improving the system sum rate and energy efficiency. Moreover, full-duplex (FD) multi-input-multi-output (MIMO) transmission provides simultaneous transmit and received signals, which theoretically provides twice of spectrum efficiency. However, the self-interference (SI) in FD system is a challenging task requiring high-overhead cancellation, which can be resolved by configuring appropriate phase shifts of RIS. This paper has proposed an RIS-empowered full-duplex interference cancellation (RFIC) scheme in order to alleviate the severe interference in an RIS-FD system. We consider the interference minimization of RIS-FD MIMO while guaranteeing quality-of-service (QoS) of whole system. The closed-form solution of RIS phase shifts is theoretically derived with the discussion of different numbers of RIS elements and receiving antennas. Simulation results reveal that the proposed RFIC scheme outperforms existing benchmarks with more than 50% of performance gain of sum rate.
△ Less
Submitted 29 March, 2023; v1 submitted 14 December, 2021;
originally announced December 2021.
-
CoMP-Enhanced Flexible Functional Split for Mixed Services in Beyond 5G Wireless Networks
Authors:
Li-Hsiang Shen,
Yung-Ting Huang,
Kai-Ten Feng
Abstract:
With explosively escalating service demands, beyond fifth generation (B5G) aims to realize various requirements for multi-service networks, i.e., higher performance of mixed enhanced mobile broadband (eMBB) and ultra-reliable low-latency communication (URLLC) services than 5G. To flexibly serve diverse traffic, various functional split options (FSOs) are specified by 5G protocols enabling differen…
▽ More
With explosively escalating service demands, beyond fifth generation (B5G) aims to realize various requirements for multi-service networks, i.e., higher performance of mixed enhanced mobile broadband (eMBB) and ultra-reliable low-latency communication (URLLC) services than 5G. To flexibly serve diverse traffic, various functional split options (FSOs) are specified by 5G protocols enabling different network functions. In order to improve signal qualities for edge users, we consider FSO-based coordinated multi-point (CoMP) transmission as a prominent technique capable of supporting high traffic demands. However, due to conventional confined hardware processing capability, a processor sharing (PS) model is introduced to deal with high latency for multi-service FSO-based networks. Therefore, it becomes essential to assign CoMP-enhanced functional split modes under PS model. A more tractable FSO-based network in terms of ergodic rate and reliability is derived by stochastic geometry approach. Moreover, we have proposed CoMP-enhanced functional split mode allocation (CFSMA) scheme to adaptively assign FSOs to provide enhanced mixed throughput and latency-aware services. The simulation results have validated analytical derivation and demonstrated that the proposed CFSMA scheme optimizes system spectrum efficiency while guaranteeing stringent latency requirement. The proposed CFSMA scheme with the designed PS FFS-CoMP system outperforms the benchmarks of conventional FCFS scheduling, non-FSO network, fixed FSOs, and limited available FSO selections in open literature.
△ Less
Submitted 16 April, 2023; v1 submitted 7 December, 2021;
originally announced December 2021.