
Showing 1–50 of 118 results for author: Pan, G

Searching in archive cs.
  1. arXiv:2408.16564  [pdf, other]

    cs.MM cs.SD eess.AS

    Human-Inspired Audio-Visual Speech Recognition: Spike Activity, Cueing Interaction and Causal Processing

    Authors: Qianhui Liu, Jiadong Wang, Yang Wang, Xin Yang, Gang Pan, Haizhou Li

    Abstract: Humans naturally perform audiovisual speech recognition (AVSR), enhancing the accuracy and robustness by integrating auditory and visual information. Spiking neural networks (SNNs), which mimic the brain's information-processing mechanisms, are well-suited for emulating the human capability of AVSR. Despite their potential, research on SNNs for AVSR is scarce, with most existing audio-visual multi…

    Submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2407.20947  [pdf, other]

    cs.NE

    An Asynchronous Multi-core Accelerator for SNN inference

    Authors: Zhuo Chen, De Ma, Xiaofei Jin, Qinghui Xing, Ouwen Jin, Xin Du, Shuibing He, Gang Pan

    Abstract: Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy of SNNs often necessitates frequent explicit synchronization among all cores, which presents a challenge to overall efficiency. In this paper, we propo…

    Submitted 30 July, 2024; originally announced July 2024.

  3. arXiv:2407.20852  [pdf, other]

    cs.NI cs.MM eess.SY

    Optimizing 5G-Advanced Networks for Time-critical Applications: The Role of L4S

    Authors: Guangjin Pan, Shugong Xu, Pin Jiang

    Abstract: As 5G networks strive to support advanced time-critical applications, such as immersive Extended Reality (XR), cloud gaming, and autonomous driving, the demand for Real-time Broadband Communication (RTBC) grows. In this article, we present the main mechanisms of Low Latency, Low Loss, and Scalable Throughput (L4S). Subsequently, we investigate the support and challenges of L4S technology in the la…

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. arXiv:2407.19271  [pdf, other]

    cs.CV eess.IV

    Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network

    Authors: Gang Pan, Chen Wang, Zhijie Sui, Shuai Guo, Yaozhi Lv, Honglie Li, Di Sun, Zixia Xia

    Abstract: The Quick-view (QV) technique serves as a primary method for detecting defects within sewerage systems. However, the effectiveness of QV is impeded by the limited visual range of its hardware, resulting in suboptimal image quality for distant portions of the sewer network. Image super-resolution is an effective way to improve image quality and has been applied in a variety of scenes. However, rese…

    Submitted 27 August, 2024; v1 submitted 27 July, 2024; originally announced July 2024.

  5. arXiv:2407.15170  [pdf, other]

    cs.CV

    Semi-Supervised Pipe Video Temporal Defect Interval Localization

    Authors: Zhu Huang, Gang Pan, Chao Kang, YaoZhi Lv

    Abstract: In sewer pipe Closed-Circuit Television (CCTV) inspection, accurate temporal defect localization is essential for effective defect classification, detection, segmentation and quantification. Industry standards typically do not require time-interval annotations, even though they are more informative than time-point annotations for defect localization, resulting in additional annotation costs when f…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: 13 pages, 3 figures

  6. arXiv:2407.07314  [pdf, ps, other]

    cs.IT

    Proactive Eavesdropping in Relay Systems via Trajectory and Power Optimization

    Authors: Qian Dan, Hongjiang Lei, Ki-Hong Park, Weijia Lei, Gaofeng Pan

    Abstract: Wireless relays can effectively extend the transmission range of information. However, if relay technology is utilized unlawfully, it can amplify potential harm. Effectively surveilling illegitimate relay links poses a challenging problem. Unmanned aerial vehicles (UAVs) can proactively surveil wireless relay systems due to their flexible mobility. This work focuses on maximizing the eavesdropping…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages, 8 figures, submitted to IEEE Journal for review

  7. arXiv:2407.06521  [pdf, ps, other]

    cs.IT eess.SP

    Beamforming Design for Joint Target Sensing and Proactive Eavesdropping

    Authors: Qian Dan, Hongjiang Lei, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: This work studies the beamforming design in the joint target sensing and proactive eavesdropping (JTSAPE) system. The JTSAPE base station (BS) receives the information transmitted by the illegal transmitter and transmits the waveform for target sensing. The shared waveform also serves as artificial noise to interfere with the illegal receiver, thereby achieving proactive eavesdropping. We firstly…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 26 pages, 6 figures, submitted to IEEE Journal for review

  8. arXiv:2406.07268  [pdf, other]

    cs.MM cs.CL cs.CV

    Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation

    Authors: Jinyuan Li, Ziyan Li, Han Li, Jianfei Yu, Rui Xia, Di Sun, Gang Pan

    Abstract: Grounded Multimodal Named Entity Recognition (GMNER) task aims to identify named entities, entity types and their corresponding visual regions. GMNER task exhibits two challenging attributes: 1) The tenuous correlation between images and text on social media contributes to a notable proportion of named entities being ungroundable. 2) There exists a distinction between coarse-grained noun phrases u…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Extension of our Findings of EMNLP 2023 & ACL 2024 paper

  9. arXiv:2406.06842  [pdf, ps, other]

    cs.IT eess.SP

    Aerial Relay to Achieve Covertness and Security

    Authors: Jiacheng Jiang, Hongjiang Lei, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: In this work, a delay-tolerant unmanned aerial vehicle (UAV) relayed covert and secure communication framework is investigated. In this framework, a legitimate UAV serves as an aerial relay to realize communication when the direct link between the terrestrial transmitter and receiver is blocked and also acts as a friendly jammer to suppress the malicious nodes presented on the ground. Subsequently…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, submitted to IEEE Journal for review

  10. arXiv:2406.05936  [pdf, ps, other]

    cs.IT

    Multi-UAV Trajectory Design for Fair and Secure Communication

    Authors: Hongjiang Lei, Dongyang Meng, Haoxiang Ran, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) play an essential role in future wireless communication networks due to their high mobility, low cost, and on-demand deployment. In air-to-ground links, UAVs are widely used to enhance the performance of wireless communication systems due to the presence of high-probability line-of-sight (LoS) links. However, the high probability of LoS links also increases the risk…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 14 pages, 10 figures, submitted to IEEE Journal for review

  11. arXiv:2406.01883  [pdf, other]

    cs.NE cs.HC

    Context Gating in Spiking Neural Networks: Achieving Lifelong Learning through Integration of Local and Global Plasticity

    Authors: Jiangrong Shen, Wenyao Ni, Qi Xu, Gang Pan, Huajin Tang

    Abstract: Humans learn multiple tasks in succession with minimal mutual interference, through the context gating mechanism in the prefrontal cortex (PFC). The brain-inspired models of spiking neural networks (SNN) have drawn massive attention for their energy efficiency and biological plausibility. To overcome catastrophic forgetting when learning multiple tasks in sequence, current SNN models for lifelong…

    Submitted 3 June, 2024; originally announced June 2024.

  12. arXiv:2406.01313  [pdf, ps, other]

    cs.IT eess.SP

    3D Trajectory Design for Energy-constrained Aerial CRNs Under Probabilistic LoS Channel

    Authors: Hongjiang Lei, Xiaqiu Wu, Ki-Hong Park, Gaofeng Pan

    Abstract: Unmanned aerial vehicles (UAVs) have been attracting significant attention because there is a high probability of line-of-sight links being obtained between them and terrestrial nodes in high-rise urban areas. In this work, we investigate cognitive radio networks (CRNs) by jointly designing three-dimensional (3D) trajectory, the transmit power of the UAV, and user scheduling. Considering the UAV's…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures, submitted to the IEEE journal for review

  13. arXiv:2406.01072  [pdf, other]

    cs.NE cs.AI

    Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning

    Authors: Yaxin Li, Qi Xu, Jiangrong Shen, Hongming Xu, Long Chen, Gang Pan

    Abstract: The emergence of deep and large-scale spiking neural networks (SNNs) exhibiting high performance across diverse complex datasets has led to a need for compressing network models due to the presence of a significant number of redundant structural units, aiming to more effectively leverage their low-power consumption and biological interpretability advantages. Currently, most model compression techn…

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2405.19194  [pdf, other]

    cs.CV

    LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model

    Authors: Hongen Liu, Di Sun, Jiahao Wang, Yi Liu, Gang Pan

    Abstract: Video text spotting (VTS) aims to simultaneously localize, recognize and track text instances in videos. To address the limited recognition capability of end-to-end methods, recent methods track the zero-shot results of state-of-the-art image text spotters directly, and achieve impressive performance. However, owing to the domain gap between different datasets, these methods usually obtain limited…

    Submitted 10 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  15. arXiv:2405.17879  [pdf, other]

    cs.LG cs.AI

    Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree

    Authors: Lang Feng, Pengjie Gu, Bo An, Gang Pan

    Abstract: Diffusion planners have shown promise in handling long-horizon and sparse-reward tasks due to the non-autoregressive plan generation. However, their inherent stochastic risk of generating infeasible trajectories presents significant challenges to their reliability and stability. We introduce a novel approach, the Trajectory Aggregation Tree (TAT), to address this issue in diffusion planners. Compa…

    Submitted 7 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICML 2024 (Spotlight)

  16. arXiv:2405.07689  [pdf, other]

    cs.MM cs.NI eess.SY

    Quality of Experience Optimization for Real-time XR Video Transmission with Energy Constraints

    Authors: Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaojing Chen, Yanzan Sun

    Abstract: Extended Reality (XR) is an important service in the 5G network and in future 6G networks. In contrast to traditional video on demand services, real-time XR video is transmitted frame-by-frame, requiring low latency and being highly sensitive to network fluctuations. In this paper, we model the quality of experience (QoE) for real-time XR video transmission on a frame-by-frame basis. Based on the…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 6 pages, 5 figures

  17. arXiv:2405.02572  [pdf, other]

    cs.LG cs.AI

    Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline

    Authors: Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan

    Abstract: Policy-based methods have achieved remarkable success in solving challenging reinforcement learning problems. Among these methods, off-policy policy gradient methods are particularly important due to that they can benefit from off-policy data. However, these methods suffer from the high variance of the off-policy policy gradient (OPPG) estimator, which results in poor sample efficiency during trai…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 12 pages, 3 figures

  18. arXiv:2404.19582  [pdf, other]

    cs.LG cs.CR

    Leveraging Label Information for Stealthy Data Stealing in Vertical Federated Learning

    Authors: Duanyi Yao, Songze Li, Xueluan Gong, Sizai Hou, Gaoning Pan

    Abstract: We develop DMAVFL, a novel attack strategy that evades current detection mechanisms. The key idea is to integrate a discriminator with auxiliary classifier that takes a full advantage of the label information (which was completely ignored in previous attacks): on one hand, label information helps to better characterize embeddings of samples from distinct classes, yielding an improved reconstructio…

    Submitted 30 April, 2024; originally announced April 2024.

  19. arXiv:2404.18225  [pdf, other]

    cs.RO

    Quadruped robot traversing 3D complex environments with limited perception

    Authors: Yi Cheng, Hang Liu, Guoping Pan, Linqi Ye, Houde Liu, Bin Liang

    Abstract: Traversing 3-D complex environments has always been a significant challenge for legged locomotion. Existing methods typically rely on external sensors such as vision and lidar to preemptively react to obstacles by acquiring environmental information. However, in scenarios like nighttime or dense forests, external sensors often fail to function properly, necessitating robots to rely on propriocepti…

    Submitted 14 July, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 10 pages, 8 figures, submitted to IROS 2024

  20. arXiv:2404.09905  [pdf, other]

    cs.NI cs.MM eess.IV eess.SY

    Quality of Experience Oriented Cross-layer Optimization for Real-time XR Video Transmission

    Authors: Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaojing Chen, Yanzan Sun

    Abstract: Extended reality (XR) is one of the most important applications of beyond 5G and 6G networks. Real-time XR video transmission presents challenges in terms of data rate and delay. In particular, the frame-by-frame transmission mode of XR video makes real-time XR video very sensitive to dynamic network environments. To improve the users' quality of experience (QoE), we design a cross-layer transmiss…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 14 pages, 13 figures. arXiv admin note: text overlap with arXiv:2402.01180

  21. arXiv:2404.01612  [pdf, other]

    cs.CV

    Spin-UP: Spin Light for Natural Light Uncalibrated Photometric Stereo

    Authors: Zongrui Li, Zhan Lu, Haojie Yan, Boxin Shi, Gang Pan, Qian Zheng, Xudong Jiang

    Abstract: Natural Light Uncalibrated Photometric Stereo (NaUPS) relieves the strict environment and light assumptions in classical Uncalibrated Photometric Stereo (UPS) methods. However, due to the intrinsic ill-posedness and high-dimensional ambiguities, addressing NaUPS is still an open question. Existing works impose strong assumptions on the environment lights and objects' material, restricting the effe…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Paper accepted by CVPR 2024

  22. arXiv:2403.17367  [pdf, other]

    cs.RO

    RoboDuet: A Framework Affording Mobile-Manipulation and Cross-Embodiment

    Authors: Guoping Pan, Qingwei Ben, Zhecheng Yuan, Guangqi Jiang, Yandong Ji, Jiangmiao Pang, Houde Liu, Huazhe Xu

    Abstract: Combining the mobility of legged robots with the manipulation skills of arms has the potential to significantly expand the operational range and enhance the capabilities of robotic systems in performing various mobile manipulation tasks. Existing approaches are confined to imprecise six degrees of freedom (DoF) manipulation and possess a limited arm workspace. In this paper, we propose a novel fra…

    Submitted 13 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  23. arXiv:2402.09989  [pdf, other]

    cs.CV cs.CL

    LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition

    Authors: Jinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Pan

    Abstract: Grounded Multimodal Named Entity Recognition (GMNER) is a nascent multimodal task that aims to identify named entities, entity types and their corresponding visual regions. GMNER task exhibits two challenging properties: 1) The weak correlation between image-text pairs in social media results in a significant portion of named entities being ungroundable. 2) There exists a distinction between coars…

    Submitted 29 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted to Findings of ACL 2024

  24. On Secure mmWave RSMA Systems

    Authors: Hongjiang Lei, Sha Zhou, Xinhu Chen, Imran Shafique Ansari, Yun Li, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: This work considers a multiple-input-single-output mmWave RSMA system wherein a base station serves two users in the presence of a passive eavesdropper. Different eavesdropping scenarios are considered corresponding to the overlapped resolvable paths between the main and the wiretap channels under the considered transmission schemes. The analytical expressions for the secrecy outage probability ar…

    Submitted 25 February, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: 12 pages, 8 figures, accepted by IEEE Internet of Things Journal

  25. arXiv:2402.01180  [pdf, other]

    cs.NI cs.MM eess.SP

    Real-time Extended Reality Video Transmission Optimization Based on Frame-priority Scheduling

    Authors: Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaojing Chen, Yanzan Sun

    Abstract: Extended reality (XR) is one of the most important applications of 5G. For real-time XR video transmission in 5G networks, a low latency and high data rate are required. In this paper, we propose a resource allocation scheme based on frame-priority scheduling to meet these requirements. The optimization problem is modelled as a frame-priority-based radio resource scheduling problem to improve tran…

    Submitted 7 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 6 pages, 7 figures

  26. arXiv:2401.15628  [pdf, other]

    cs.GR

    WetSpongeCake: a Surface Appearance Model Considering Porosity and Saturation

    Authors: Gaole Pan, Yuang Cui, Jian Yang, Beibei Wang

    Abstract: Wet powdered materials, such as wet ground or moist walls, are common in the real world. Despite their particle size being larger than the wavelength, they remain invisible from a macro view. Reproducing these appearances accurately is crucial for various applications. Existing methods use different approaches, such as Monte Carlo path tracing on implicit shapes, which is accurate but computationa…

    Submitted 6 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  27. arXiv:2401.14652  [pdf, other]

    cs.NE

    LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization

    Authors: Qianhui Liu, Jiaqi Yan, Malu Zhang, Gang Pan, Haizhou Li

    Abstract: Spiking Neural Networks (SNNs) mimic the information-processing mechanisms of the human brain and are highly energy-efficient, making them well-suited for low-power edge devices. However, the pursuit of accuracy in current studies leads to large, long-timestep SNNs, conflicting with the resource constraints of these devices. In order to design lightweight and efficient SNNs, we propose a new appro…

    Submitted 13 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  28. arXiv:2401.05363  [pdf, other]

    eess.SP cs.LG

    Generalizable Sleep Staging via Multi-Level Domain Alignment

    Authors: Jiquan Wang, Sha Zhao, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan

    Abstract: Automatic sleep staging is essential for sleep assessment and disorder diagnosis. Most existing methods depend on one specific dataset and are limited to be generalized to other unseen datasets, for which the training data and testing data are from the same dataset. In this paper, we introduce domain generalization into automatic sleep staging and propose the task of generalizable sleep staging wh…

    Submitted 11 July, 2024; v1 submitted 13 December, 2023; originally announced January 2024.

    Comments: Accepted by the Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)

  29. arXiv:2401.03719  [pdf, other]

    cs.NE

    Enhancing Adaptive History Reserving by Spiking Convolutional Block Attention Module in Recurrent Neural Networks

    Authors: Qi Xu, Yuyuan Gao, Jiangrong Shen, Yaxin Li, Xuming Ran, Huajin Tang, Gang Pan

    Abstract: Spiking neural networks (SNNs) serve as one type of efficient model to process spatio-temporal patterns in time series, such as the Address-Event Representation data collected from Dynamic Vision Sensor (DVS). Although convolutional SNNs have achieved remarkable performance on these AER datasets, benefiting from the predominant spatial feature extraction ability of convolutional structure, they ig…

    Submitted 8 January, 2024; originally announced January 2024.

  30. arXiv:2312.17582  [pdf, other]

    cs.NE cs.AR

    Darwin3: A large-scale neuromorphic chip with a Novel ISA and On-Chip Learning

    Authors: De Ma, Xiaofei Jin, Shichun Sun, Yitao Li, Xundong Wu, Youneng Hu, Fangchao Yang, Huajin Tang, Xiaolei Zhu, Peng Lin, Gang Pan

    Abstract: Spiking Neural Networks (SNNs) are gaining increasing attention for their biological plausibility and potential for improved computational efficiency. To match the high spatial-temporal dynamics in SNNs, neuromorphic chips are highly desired to execute SNNs in hardware-based neuron and synapse circuits directly. This paper presents a large-scale neuromorphic chip named Darwin3 with a novel instruc…

    Submitted 29 December, 2023; originally announced December 2023.

  31. arXiv:2311.09077  [pdf, other]

    cs.CV

    Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation

    Authors: Zhanfeng Liao, Qian Zheng, Yan Liu, Gang Pan

    Abstract: A crucial reason for the success of existing NeRF-based methods is to build a neural density field for the geometry representation via multiple perceptron layers (MLPs). MLPs are continuous functions, however, real geometry or density field is frequently discontinuous at the interface between the air and the surface. Such a contrary brings the problem of unfaithful geometry representation. To this…

    Submitted 23 August, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  32. arXiv:2311.06825  [pdf, ps, other]

    cs.IT eess.SP

    Secure Rate-Splitting Multiple Access Transmissions in LMS Systems

    Authors: Minjue He, Hui Zhao, Xiaqing Miao, Shuai Wang, Gaofeng Pan

    Abstract: This letter investigates the secure delivery performance of the rate-splitting multiple access scheme in land mobile satellite (LMS) systems, considering that the private messages intended by a terminal can be eavesdropped by any others from the broadcast signals. Specifically, the considered system has an N-antenna satellite and numerous single-antenna land users. Maximum ratio transmission (MRT)…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, 1 table

  33. arXiv:2310.13932  [pdf, ps, other]

    cs.IT eess.SP

    Trajectory and Power Design for Aerial Multi-User Covert Communications

    Authors: Hongjiang Lei, Jiacheng Jiang, Imran Shafique Ansari, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) can provide wireless access to terrestrial users, regardless of geographical constraints, and will be an important part of future communication systems. In this paper, a multi-user downlink dual-UAVs enabled covert communication system was investigated, in which a UAV transmits secure information to ground users in the presence of multiple wardens as well as a frien…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 30 pages, 9 figures, submitted to the IEEE journal for review

  34. arXiv:2310.13931  [pdf, ps, other]

    cs.IT eess.SP

    Trajectory and power design for aerial CRNs with colluding eavesdroppers

    Authors: Hongjiang Lei, Jiacheng Jiang, Haosi Yang, Ki-Hong Park, Imran Shafique Ansari, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) can provide wireless access services to terrestrial users without geographical limitations and will become an essential part of the future communication system. However, the openness of wireless channels and the mobility of UAVs make the security of UAV-based communication systems particularly challenging. This work investigates the security of aerial cognitive radi…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 10 pages, 7 figures, submitted to the IEEE journal for review

  35. arXiv:2310.12937  [pdf, other]

    cs.DC

    End-to-End Delay Minimization based on Joint Optimization of DNN Partitioning and Resource Allocation for Cooperative Edge Inference

    Authors: Xinrui Ye, Yanzan Sun, Dingzhu Wen, Guanjin Pan, Shunqing Zhang

    Abstract: Cooperative inference in Mobile Edge Computing (MEC), achieved by deploying partitioned Deep Neural Network (DNN) models between resource-constrained user equipments (UEs) and edge servers (ESs), has emerged as a promising paradigm. Firstly, we consider scenarios of continuous Artificial Intelligence (AI) task arrivals, like the object detection for video streams, and utilize a serial queuing mode…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 7 pages, 9 figures, 1 table, 1 algorithm, to be published in IEEE 98th Vehicular Technology Conference (VTC2023-Fall)

  36. arXiv:2310.05053  [pdf, other]

    cs.LG cs.AI cs.MA

    FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility

    Authors: Lang Feng, Dong Xing, Junru Zhang, Gang Pan

    Abstract: Existing multi-agent PPO algorithms lack compatibility with different types of parameter sharing when extending the theoretical guarantee of PPO to cooperative multi-agent reinforcement learning (MARL). In this paper, we propose a novel and versatile multi-agent PPO algorithm for cooperative MARL to overcome this limitation. Our approach is achieved upon the proposed full-pipeline paradigm, which…

    Submitted 8 October, 2023; originally announced October 2023.

  37. arXiv:2309.17334  [pdf, other]

    eess.IV cs.CV

    Multi-Depth Branch Network for Efficient Image Super-Resolution

    Authors: Huiyuan Tian, Li Zhang, Shijian Li, Min Yao, Gang Pan

    Abstract: A longstanding challenge in Super-Resolution (SR) is how to efficiently enhance high-frequency details in Low-Resolution (LR) images while maintaining semantic coherence. This is particularly crucial in practical applications where SR models are often deployed on low-power devices. To address this issue, we propose an innovative asymmetric SR architecture featuring Multi-Depth Branch Module (MDBM)…

    Submitted 15 January, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  38. arXiv:2309.15729  [pdf, other]

    cs.CV cs.AI

    MindGPT: Interpreting What You See with Non-invasive Brain Recordings

    Authors: Jiaxuan Chen, Yu Qi, Yueming Wang, Gang Pan

    Abstract: Decoding of seen visual contents with non-invasive brain recordings has important scientific and practical values. Efforts have been made to recover the seen images from brain signals. However, most existing approaches cannot faithfully reflect the visual contents due to insufficient image quality or semantic mismatches. Compared with reconstructing pixel-level visual images, speaking is a more ef…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 13 pages, 6 figures, submitted to anonymous conference

  39. arXiv:2309.14742  [pdf, other]

    cs.CR

    SyzTrust: State-aware Fuzzing on Trusted OS Designed for IoT Devices

    Authors: Qinying Wang, Boyu Chang, Shouling Ji, Yuan Tian, Xuhong Zhang, Binbin Zhao, Gaoning Pan, Chenyang Lyu, Mathias Payer, Wenhai Wang, Raheem Beyah

    Abstract: Trusted Execution Environments (TEEs) embedded in IoT devices provide a deployable solution to secure IoT applications at the hardware level. By design, in TEEs, the Trusted Operating System (Trusted OS) is the primary component. It enables the TEE to use security-based design techniques, such as data encryption and identity authentication. Once a Trusted OS has been exploited, the TEE can no long…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: To appear in the IEEE Symposium on Security and Privacy (IEEE S&P) 2024, San Francisco, CA, USA

  40. arXiv:2309.03209  [pdf, other]

    cs.HC cs.AI

    A Human-Machine Joint Learning Framework to Boost Endogenous BCI Training

    Authors: Hanwen Wang, Yu Qi, Lin Yao, Yueming Wang, Dario Farina, Gang Pan

    Abstract: Brain-computer interfaces (BCIs) provide a direct pathway from the brain to external devices and have demonstrated great potential for assistive and rehabilitation technologies. Endogenous BCIs based on electroencephalogram (EEG) signals, such as motor imagery (MI) BCIs, can provide some level of control. However, mastering spontaneous BCI control requires the users to generate discriminative and…

    Submitted 24 August, 2023; originally announced September 2023.

  41. arXiv:2307.00035  [pdf, other]

    cs.LG math.NA

    Parameter Identification for Partial Differential Equations with Spatiotemporal Varying Coefficients

    Authors: Guangtao Zhang, Yiting Duan, Guanyu Pan, Qijing Chen, Huiyu Yang, Zhikun Zhang

    Abstract: To comprehend complex systems with multiple states, it is imperative to reveal the identity of these states by system outputs. Nevertheless, the mathematical models describing these systems often exhibit nonlinearity so that render the resolution of the parameter inverse problem from the observed spatiotemporal data a challenging endeavor. Starting from the observed data obtained from such systems…

    Submitted 30 June, 2023; originally announced July 2023.

  42. arXiv:2306.11950  [pdf, other]

    cs.NE cs.LG q-bio.NC

    Mitigating Communication Costs in Neural Networks: The Role of Dendritic Nonlinearity

    Authors: Xundong Wu, Pengfei Zhao, Zilin Yu, Lei Ma, Ka-Wa Yip, Huajin Tang, Gang Pan, Tiejun Huang

    Abstract: Our comprehension of biological neuronal networks has profoundly influenced the evolution of artificial neural networks (ANNs). However, the neurons employed in ANNs exhibit remarkable deviations from their biological analogs, mainly due to the absence of complex dendritic trees encompassing local nonlinearity. Despite such disparities, previous investigations have demonstrated that point neurons…

    Submitted 20 June, 2023; originally announced June 2023.

  43. arXiv:2306.10944  [pdf, other]

    cs.MA

    Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification

    Authors: Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan

    Abstract: Ad hoc teamwork requires an agent to cooperate with unknown teammates without prior coordination. Many works propose to abstract teammate instances into high-level representation of types and then pre-train the best response for each type. However, most of them do not consider the distribution of teammate instances within a type. This could expose the agent to the hidden risk of type confoun…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2023

  44. arXiv:2306.03693  [pdf, other]

    cs.NE cs.AI

    ESL-SNNs: An Evolutionary Structure Learning Strategy for Spiking Neural Networks

    Authors: Jiangrong Shen, Qi Xu, Jian K. Liu, Yueming Wang, Gang Pan, Huajin Tang

    Abstract: Spiking neural networks (SNNs) have shown remarkable advantages in power consumption and event-driven processing during inference. To take full advantage of low power consumption and further improve the efficiency of these models, pruning methods have been explored to find sparse SNNs without redundant connections after training. However, parameter redundancy still hinders the…

    Submitted 6 June, 2023; originally announced June 2023.

  45. arXiv:2305.19868  [pdf, other]

    cs.NE

    Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN

    Authors: Yangfan Hu, Qian Zheng, Xudong Jiang, Gang Pan

    Abstract: Spiking neural networks (SNNs) have shown advantages in computation and energy efficiency over traditional artificial neural networks (ANNs) thanks to their event-driven representations. SNNs also replace weight multiplications in ANNs with additions, which are more energy-efficient and less computationally intensive. However, it remains a challenge to train deep SNNs due to the discrete spike fun…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  46. arXiv:2305.14757  [pdf, other]

    cs.CL

    Psychological Metrics for Dialog System Evaluation

    Authors: Salvatore Giorgi, Shreya Havaldar, Farhan Ahmed, Zuhaib Akhtar, Shalaka Vaidya, Gary Pan, Lyle H. Ungar, H. Andrew Schwartz, Joao Sedoc

    Abstract: We present metrics for evaluating dialog systems through a psychologically-grounded "human" lens in which conversational agents express a diversity of both states (e.g., emotion) and traits (e.g., personality), just as people do. We present five interpretable metrics from established psychology that are fundamental to human communication and relationships: emotional entropy, linguistic style and e…

    Submitted 15 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  47. arXiv:2305.12212  [pdf, other]

    cs.CL

    Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge

    Authors: Jinyuan Li, Han Li, Zhuo Pan, Di Sun, Jiahao Wang, Wenkun Zhang, Gang Pan

    Abstract: Multimodal Named Entity Recognition (MNER) on social media aims to enhance textual entity prediction by incorporating image-based clues. Existing studies mainly focus on maximizing the utilization of pertinent image information or incorporating external knowledge from explicit knowledge bases. However, these methods either neglect the necessity of providing the model with external knowledge, or en…

    Submitted 18 October, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of EMNLP 2023

  48. NeuSort: An Automatic Adaptive Spike Sorting Approach with Neuromorphic Models

    Authors: Hang Yu, Yu Qi, Gang Pan

    Abstract: Objective. Spike sorting, a critical step in neural data processing, aims to classify spiking events from single electrode recordings based on different waveforms. This study aims to develop a novel online spike sorter, NeuSort, using neuromorphic models, with the ability to adaptively adjust to changes in neural signals, including waveform deformations and the appearance of new neurons. Approach.…

    Submitted 17 September, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Journal ref: J. Neural Eng. 20 056006 (2023)

  49. arXiv:2304.09500  [pdf, other]

    cs.NE cs.AI

    Biologically inspired structure learning with reverse knowledge distillation for spiking neural networks

    Authors: Qi Xu, Yaxin Li, Xuanye Fang, Jiangrong Shen, Jian K. Liu, Huajin Tang, Gang Pan

    Abstract: Spiking neural networks (SNNs) have superb characteristics in sensory information recognition tasks due to their biological plausibility. However, the performance of some current spiking-based models is limited by their structures: both fully connected and overly deep structures introduce too much redundancy. This redundancy in both connections and neurons is one of the key factors hindering…

    Submitted 19 April, 2023; originally announced April 2023.

  50. arXiv:2304.08817  [pdf, other]

    cs.DB

    DILI: A Distribution-Driven Learned Index (Extended version)

    Authors: Pengfei Li, Hua Lu, Rong Zhu, Bolin Ding, Long Yang, Gang Pan

    Abstract: Targeting in-memory one-dimensional search keys, we propose a novel DIstribution-driven Learned Index tree (DILI), where a concise and computation-efficient linear regression model is used for each node. An internal node's key range is equally divided by its child nodes such that a key search enjoys perfect model prediction accuracy to find the relevant leaf node. A leaf node uses machine learning…

    Submitted 18 May, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: PVLDB Volume 16