Zum Hauptinhalt springen

Showing 1–50 of 107 results for author: Hu, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.14156  [pdf, other

    eess.SP

    Integrated Sensing, Communication, and Powering over Multi-antenna OFDM Systems

    Authors: Yilong Chen, Chao Hu, Zixiang Ren, Han Hu, Jie Xu, Lexi Xu, Lei Liu, Shuguang Cui

    Abstract: This paper considers a multi-functional orthogonal frequency division multiplexing (OFDM) system with integrated sensing, communication, and powering (ISCAP), in which a multi-antenna base station (BS) transmits OFDM signals to simultaneously deliver information to multiple information receivers (IRs), provide energy supply to multiple energy receivers (ERs), and sense potential targets based on t… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 13 pages, 12 figures

  2. arXiv:2408.02095  [pdf, other

    cs.IT eess.SP

    Secure Semantic Communications: From Perspective of Physical Layer Security

    Authors: Yongkang Li, Zheng Shi, Han Hu, Yaru Fu, Hong Wang, Hongjiang Lei

    Abstract: Semantic communications have been envisioned as a potential technique that goes beyond Shannon paradigm. Unlike modern communications that provide bit-level security, the eaves-dropping of semantic communications poses a significant risk of potentially exposing intention of legitimate user. To address this challenge, a novel deep neural network (DNN) enabled secure semantic communication (DeepSSC)… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

  3. arXiv:2407.20532  [pdf, other

    eess.SY

    Scalable Synthesis of Formally Verified Neural Value Function for Hamilton-Jacobi Reachability Analysis

    Authors: Yujie Yang, Hanjiang Hu, Tianhao Wei, Shengbo Eben Li, Changliu Liu

    Abstract: Hamilton-Jacobi (HJ) reachability analysis provides a formal method for guaranteeing safety in constrained control problems. It synthesizes a value function to represent a long-term safe set called feasible region. Early synthesis methods based on state space discretization cannot scale to high-dimensional problems, while recent methods that use neural networks to approximate value functions resul… ▽ More

    Submitted 31 July, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

  4. arXiv:2407.08961  [pdf

    eess.IV cs.CV

    Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT

    Authors: Jie Zheng, Ru Wen, Haiqin Hu, Lina Wei, Kui Su, Wei Chen, Chen Liu, Jun Wang

    Abstract: Existing Masked Image Modeling (MIM) depends on a spatial patch-based masking-reconstruction strategy to perceive objects'features from unlabeled images, which may face two limitations when applied to chest CT: 1) inefficient feature learning due to complex anatomical details presented in CT images, and 2) suboptimal knowledge transfer owing to input disparity between upstream and downstream model… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  5. arXiv:2407.05407  [pdf, other

    cs.SD cs.AI eess.AS

    CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

    Authors: Zhihao Du, Qian Chen, Shiliang Zhang, Kai Hu, Heng Lu, Yexin Yang, Hangrui Hu, Siqi Zheng, Yue Gu, Ziyang Ma, Zhifu Gao, Zhijie Yan

    Abstract: Recent years have witnessed a trend that large language model (LLM) based text-to-speech (TTS) emerges into the mainstream due to their high naturalness and zero-shot capacity. In this paradigm, speech signals are discretized into token sequences, which are modeled by an LLM with text as prompts and reconstructed by a token-based vocoder to waveforms. Obviously, speech tokens play a critical role… ▽ More

    Submitted 9 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

    Comments: work in progress. arXiv admin note: substantial text overlap with arXiv:2407.04051

  6. arXiv:2407.04051  [pdf, other

    cs.SD cs.AI eess.AS

    FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

    Authors: Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang , et al. (8 additional authors not shown)

    Abstract: This report introduces FunAudioLLM, a model family designed to enhance natural voice interactions between humans and large language models (LLMs). At its core are two innovative models: SenseVoice, which handles multilingual speech recognition, emotion recognition, and audio event detection; and CosyVoice, which facilitates natural speech generation with control over multiple languages, timbre, sp… ▽ More

    Submitted 10 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Work in progress. Authors are listed in alphabetical order by family name

  7. arXiv:2406.19608  [pdf, other

    eess.SY

    Multi-service collaboration and composition of cloud manufacturing customized production based on problem decomposition

    Authors: Hao Yue, Yingtao Wu, Min Wang, Hesuan Hu, Weimin Wu, Jihui Zhang

    Abstract: Cloud manufacturing system is a service-oriented and knowledge-based one, which can provide solutions for the large-scale customized production. The service resource allocation is the primary factor that restricts the production time and cost in the cloud manufacturing customized production (CMCP). In order to improve the efficiency and reduce the cost in CMCP, we propose a new framework which con… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 12 pages, 8 figures

    ACM Class: J.0

  8. arXiv:2406.09810  [pdf, other

    cs.RO eess.SY

    Think Deep and Fast: Learning Neural Nonlinear Opinion Dynamics from Inverse Dynamic Games for Split-Second Interactions

    Authors: Haimin Hu, Jonathan DeCastro, Deepak Gopinath, Guy Rosman, Naomi Ehrich Leonard, Jaime Fernández Fisac

    Abstract: Non-cooperative interactions commonly occur in multi-agent scenarios such as car racing, where an ego vehicle can choose to overtake the rival, or stay behind it until a safe overtaking "corridor" opens. While an expert human can do well at making such time-sensitive decisions, the development of safe and efficient game-theoretic trajectory planners capable of rapidly reasoning discrete options is… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  9. arXiv:2406.08038  [pdf, other

    eess.SP

    Interference Analysis for Coexistence of UAVs and Civil Aircrafts Based on Automatic Dependent Surveillance-Broadcast

    Authors: Yiyang Liao, Ziye Jia, Chao Dong, Lei Zhang, Qihui Wu, Huiling Hu, Zhu Han

    Abstract: Due to the advantages of high mobility and easy deployment, unmanned aerial vehicles (UAVs) are widely applied in both military and civilian fields. In order to strengthen the flight surveillance of UAVs and guarantee the airspace safety, UAVs can be equipped with the automatic dependent surveillance-broadcast (ADS-B) system, which periodically sends flight information to other aircrafts and groun… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  10. arXiv:2405.13636  [pdf, other

    cs.SD cs.AI eess.AS

    Audio Mamba: Pretrained Audio State Space Model For Audio Tagging

    Authors: Jiaju Lin, Haoxuan Hu

    Abstract: Audio tagging is an important task of mapping audio samples to their corresponding categories. Recently endeavours that exploit transformer models in this field have achieved great success. However, the quadratic self-attention cost limits the scaling of audio transformer models and further constrains the development of more universal audio models. In this paper, we attempt to solve this problem b… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  11. arXiv:2405.07994  [pdf

    eess.IV cs.AI cs.CV cs.LG

    BubbleID: A Deep Learning Framework for Bubble Interface Dynamics Analysis

    Authors: Christy Dunlap, Changgen Li, Hari Pandey, Ngan Le, Han Hu

    Abstract: This paper presents BubbleID, a sophisticated deep learning architecture designed to comprehensively identify both static and dynamic attributes of bubbles within sequences of boiling images. By amalgamating segmentation powered by Mask R-CNN with SORT-based tracking techniques, the framework is capable of analyzing each bubble's location, dimensions, interface shape, and velocity over its lifetim… ▽ More

    Submitted 20 March, 2024; originally announced May 2024.

    Comments: 16 pages, 4 figures

  12. arXiv:2404.13456  [pdf, other

    cs.LG cs.RO eess.SY

    Real-Time Safe Control of Neural Network Dynamic Models with Sound Approximation

    Authors: Hanjiang Hu, Jianglin Lan, Changliu Liu

    Abstract: Safe control of neural network dynamic models (NNDMs) is important to robotics and many applications. However, it remains challenging to compute an optimal safe control in real time for NNDM. To enable real-time computation, we propose to use a sound approximation of the NNDM in the control synthesis. In particular, we propose Bernstein over-approximated neural dynamics (BOND) based on the Bernste… ▽ More

    Submitted 20 May, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: Camera-ready version of L4DC 2024, 12 pages, 3 figures, 4 tables

  13. arXiv:2403.16361  [pdf, other

    eess.IV cs.CV

    RSTAR: Rotational Streak Artifact Reduction in 4D CBCT using Separable and Circular Convolutions

    Authors: Ziheng Deng, Hua Chen, Haibo Hu, Zhiyong Xu, Jiayuan Sun, Tianling Lyu, Yan Xi, Yang Chen, Jun Zhao

    Abstract: Four-dimensional cone-beam computed tomography (4D CBCT) provides respiration-resolved images and can be used for image-guided radiation therapy. However, the ability to reveal respiratory motion comes at the cost of image artifacts. As raw projection data are sorted into multiple respiratory phases, the cone-beam projections become much sparser and the reconstructed 4D CBCT images will be covered… ▽ More

    Submitted 22 August, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  14. arXiv:2402.18070  [pdf, other

    cs.AR eess.SP

    A Hierarchical Dataflow-Driven Heterogeneous Architecture for Wireless Baseband Processing

    Authors: Limin Jiang, Yi Shi, Haiqin Hu, Qingyu Deng, Siyi Xu, Yintao Liu, Feng Yuan, Si Wang, Yihao Shen, Fangfang Ye, Shan Cao, Zhiyuan Jiang

    Abstract: Wireless baseband processing (WBP) is a key element of wireless communications, with a series of signal processing modules to improve data throughput and counter channel fading. Conventional hardware solutions, such as digital signal processors (DSPs) and more recently, graphic processing units (GPUs), provide various degrees of parallelism, yet they both fail to take into account the cyclical and… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 7 figures, conference

  15. arXiv:2402.14174  [pdf, other

    cs.RO cs.AI eess.SY math.OC

    Blending Data-Driven Priors in Dynamic Games

    Authors: Justin Lidard, Haimin Hu, Asher Hancock, Zixu Zhang, Albert Gimó Contreras, Vikash Modi, Jonathan DeCastro, Deepak Gopinath, Guy Rosman, Naomi Ehrich Leonard, María Santos, Jaime Fernández Fisac

    Abstract: As intelligent robots like autonomous vehicles become increasingly deployed in the presence of people, the extent to which these systems should leverage model-based game-theoretic planners versus data-driven policies for safe, interaction-aware motion planning remains an open question. Existing dynamic game formulations assume all agents are task-driven and behave optimally. However, in reality, h… ▽ More

    Submitted 6 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 20 pages, 12 figures

  16. arXiv:2402.09246  [pdf, other

    cs.RO cs.AI eess.SY math.OC

    Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots

    Authors: Haimin Hu, Gabriele Dragotto, Zixu Zhang, Kaiqu Liang, Bartolomeo Stellato, Jaime F. Fisac

    Abstract: We consider the multi-agent spatial navigation problem of computing the socially optimal order of play, i.e., the sequence in which the agents commit to their decisions, and its associated equilibrium in an N-player Stackelberg trajectory game. We model this problem as a mixed-integer optimization problem over the space of all possible Stackelberg games associated with the order of play's permutat… ▽ More

    Submitted 24 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Robotics: Science and Systems (RSS) 2024

  17. arXiv:2401.13766  [pdf, ps, other

    eess.AS cs.SD

    Bayesian adaptive learning to latent variables via Variational Bayes and Maximum a Posteriori

    Authors: Hu Hu, Sabato Marco Siniscalchi, Chin-Hui Lee

    Abstract: In this work, we aim to establish a Bayesian adaptive learning framework by focusing on estimating latent variables in deep neural network (DNN) models. Latent variables indeed encode both transferable distributional information and structural relationships. Thus the distributions of the source latent variables (prior) can be combined with the knowledge learned from the target data (likelihood) to… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: ASRU2023 Bayesian Symposium. arXiv admin note: text overlap with arXiv:2110.08598

  18. arXiv:2401.09455  [pdf, other

    cs.NI cs.AI cs.LG eess.SY

    Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

    Authors: Yifeng Lyu, Han Hu, Rongfei Fan, Zhi Liu, Jianping An, Shiwen Mao

    Abstract: The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to sat… ▽ More

    Submitted 22 December, 2023; originally announced January 2024.

  19. arXiv:2401.03664  [pdf

    eess.IV cs.CV cs.LG

    Dual-Channel Reliable Breast Ultrasound Image Classification Based on Explainable Attribution and Uncertainty Quantification

    Authors: Shuge Lei, Haonan Hu, Dasheng Sun, Huabin Zhang, Kehong Yuan, Jian Dai, Jijun Tang, Yan Tong

    Abstract: This paper focuses on the classification task of breast ultrasound images and researches on the reliability measurement of classification results. We proposed a dual-channel evaluation framework based on the proposed inference reliability and predictive reliability scores. For the inference reliability evaluation, human-aligned and doctor-agreed inference rationales based on the improved feature a… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  20. arXiv:2312.15721  [pdf, ps, other

    eess.SY

    UAV Trajectory Tracking via RNN-enhanced IMM-KF with ADS-B Data

    Authors: Yian Zhu, Ziye Jia, Qihui Wu, Chao Dong, Zirui Zhuang, Huiling Hu, Qi Cai

    Abstract: With the increasing use of autonomous unmanned aerial vehicles (UAVs), it is critical to ensure that they are continuously tracked and controlled, especially when UAVs operate beyond the communication range of ground stations (GSs). Conventional surveillance methods for UAVs, such as satellite communications, ground mobile networks and radars are subject to high costs and latency. The automatic de… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  21. arXiv:2312.14563  [pdf, other

    eess.SP

    AI Generated Signal for Wireless Sensing

    Authors: Hanxiang He, Han Hu, Xintao Huan, Heng Liu, Jianping An, Shiwen Mao

    Abstract: Deep learning has significantly advanced wireless sensing technology by leveraging substantial amounts of high-quality training data. However, collecting wireless sensing data encounters diverse challenges, including unavoidable data noise, limited data scale due to significant collection overhead, and the necessity to reacquire data in new environments. Taking inspiration from the achievements of… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 6 pages, 6 figures, published to Globecom2023

  22. arXiv:2312.04786  [pdf, other

    cs.IT cs.LG eess.SP

    Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications

    Authors: Zhaolong Ning, Hao Hu, Xiaojie Wang, Qingqing Wu, Chau Yuen, F. Richard Yu, Yan Zhang

    Abstract: Intelligent reflecting surface (IRS)-assisted unmanned aerial vehicle (UAV) communications are expected to alleviate the load of ground base stations in a cost-effective way. Existing studies mainly focus on the deployment and resource allocation of a single IRS instead of multiple IRSs, whereas it is extremely challenging for joint multi-IRS multi-user association in UAV communications with const… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  23. arXiv:2311.00567  [pdf

    eess.IV cs.CV cs.LG physics.med-ph q-bio.QM

    A Robust Deep Learning Method with Uncertainty Estimation for the Pathological Classification of Renal Cell Carcinoma based on CT Images

    Authors: Ni Yao, Hang Hu, Kaicong Chen, Chen Zhao, Yuan Guo, Boya Li, Jiaofen Nan, Yanting Li, Chuang Han, Fubao Zhu, Weihua Zhou, Li Tian

    Abstract: Objectives To develop and validate a deep learning-based diagnostic model incorporating uncertainty estimation so as to facilitate radiologists in the preoperative differentiation of the pathological subtypes of renal cell carcinoma (RCC) based on CT images. Methods Data from 668 consecutive patients, pathologically proven RCC, were retrospectively collected from Center 1. By using five-fold cross… ▽ More

    Submitted 12 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 16 pages, 6 figures

  24. arXiv:2310.20289  [pdf

    physics.optics eess.IV physics.app-ph

    C-Silicon-based metasurfaces for aperture-robust spectrometer/imaging with angle integration

    Authors: Weizhu Xu, Qingbin Fan, Peicheng Lin, Jiarong Wang, Hao Hu, Tao Yue, Xuemei Hu, Ting Xu

    Abstract: Compared with conventional grating-based spectrometers, reconstructive spectrometers based on spectrally engineered filtering have the advantage of miniaturization because of the less demand for dispersive optics and free propagation space. However, available reconstructive spectrometers fail to balance the performance on operational bandwidth, spectral diversity and angular stability. In this wor… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  25. arXiv:2310.06678  [pdf, other

    cs.IT eess.SP eess.SY

    Modelling and Performance Analysis of the Over-the-Air Computing in Cellular IoT Networks

    Authors: Ying Dong, Haonan Hu, Qiaoshou Liu, Tingwei Lv, Qianbin Chen, Jie Zhang

    Abstract: Ultra-fast wireless data aggregation (WDA) of distributed data has emerged as a critical design challenge in the ultra-densely deployed cellular internet of things network (CITN) due to limited spectral resources. Over-the-air computing (AirComp) has been proposed as an effective solution for ultra-fast WDA by exploiting the superposition property of wireless channels. However, the effect of acces… ▽ More

    Submitted 11 August, 2023; originally announced October 2023.

  26. arXiv:2309.16077  [pdf, other

    cs.RO cs.LG eess.SY

    Task-Oriented Koopman-Based Control with Contrastive Encoder

    Authors: Xubo Lyu, Hanyang Hu, Seth Siriya, Ye Pu, Mo Chen

    Abstract: We present task-oriented Koopman-based control that utilizes end-to-end reinforcement learning and contrastive encoder to simultaneously learn the Koopman latent embedding, operator, and associated linear controller within an iterative loop. By prioritizing the task cost as the main objective for controller learning, we reduce the reliance of controller design on a well-identified model, which, fo… ▽ More

    Submitted 1 November, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted by the 7th Annual Conference on Robot Learning (CoRL), 2023 (oral spotlight)

  27. arXiv:2309.13155  [pdf, other

    eess.SY

    Multi-Agent Reach-Avoid Games: Two Attackers Versus One Defender and Mixed Integer Programming

    Authors: Hanyang Hu, Minh Bui, Mo Chen

    Abstract: We propose a hybrid approach that combines Hamilton-Jacobi (HJ) reachability and mixed-integer optimization for solving a reach-avoid game with multiple attackers and defenders. The reach-avoid game is an important problem with potential applications in air traffic control and multi-agent motion planning; however, solving this game for many attackers and defenders is intractable due to the adversa… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  28. arXiv:2309.05837  [pdf, other

    eess.SY cs.LG cs.RO

    The Safety Filter: A Unified View of Safety-Critical Control in Autonomous Systems

    Authors: Kai-Chieh Hsu, Haimin Hu, Jaime Fernández Fisac

    Abstract: Recent years have seen significant progress in the realm of robot autonomy, accompanied by the expanding reach of robotic technologies. However, the emergence of new deployment domains brings unprecedented challenges in ensuring safe operation of these systems, which remains as crucial as ever. While traditional model-based safe control methods struggle with generalizability and scalability, emerg… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted for publication in Annual Review of Control, Robotics, and Autonomous Systems

  29. arXiv:2309.04335  [pdf, ps, other

    cs.IT eess.SP

    On the performance of an integrated communication and localization system: an analytical framework

    Authors: Yuan Gao, Haonan Hu, Jiliang Zhang, Yanliang Jin, Shugong Xu, Xiaoli Chu

    Abstract: Quantifying the performance bound of an integrated localization and communication (ILAC) system and the trade-off between communication and localization performance is critical. In this letter, we consider an ILAC system that can perform communication and localization via time-domain or frequency-domain resource allocation. We develop an analytical framework to derive the closed-form expression of… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures

  30. arXiv:2309.03900  [pdf, other

    eess.IV cs.CV

    Learning Continuous Exposure Value Representations for Single-Image HDR Reconstruction

    Authors: Su-Kai Chen, Hung-Lin Yen, Yu-Lun Liu, Min-Hung Chen, Hou-Ning Hu, Wen-Hsiao Peng, Yen-Yu Lin

    Abstract: Deep learning is commonly used to reconstruct HDR images from LDR images. LDR stack-based methods are used for single-image HDR reconstruction, generating an HDR image from a deep learning-generated LDR stack. However, current methods generate the stack with predetermined exposure values (EVs), which may limit the quality of HDR reconstruction. To address this, we propose the continuous exposure v… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: ICCV 2023. Project page: https://skchen1993.github.io/CEVR_web/

  31. arXiv:2309.01267  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Deception Game: Closing the Safety-Learning Loop in Interactive Robot Autonomy

    Authors: Haimin Hu, Zixu Zhang, Kensuke Nakamura, Andrea Bajcsy, Jaime F. Fisac

    Abstract: An outstanding challenge for the widespread deployment of robotic systems like autonomous vehicles is ensuring safe interaction with humans without sacrificing performance. Existing safety methods often neglect the robot's ability to learn and adapt at runtime, leading to overly conservative behavior. This paper proposes a new closed-loop paradigm for synthesizing safe control policies that explic… ▽ More

    Submitted 1 November, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: Conference on Robot Learning 2023

  32. arXiv:2309.00514  [pdf

    cs.CV eess.IV

    A Machine Vision Method for Correction of Eccentric Error: Based on Adaptive Enhancement Algorithm

    Authors: Fanyi Wang, Pin Cao, Yihui Zhang, Haotian Hu, Yongying Yang

    Abstract: In the procedure of surface defects detection for large-aperture aspherical optical elements, it is of vital significance to adjust the optical axis of the element to be coaxial with the mechanical spin axis accurately. Therefore, a machine vision method for eccentric error correction is proposed in this paper. Focusing on the severe defocus blur of reference crosshair image caused by the imaging… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  33. arXiv:2307.01534  [pdf, other

    eess.SP

    Impact of UAVs Equipped with ADS-B on the Civil Aviation Monitoring System

    Authors: Yiyang Liao, Lei Zhang, Ziye Jia, Chao Dong, Yifan Zhang, Qihui Wu, Huiling Hu, Bin Wang

    Abstract: In recent years, there is an increasing demand for unmanned aerial vehicles (UAVs) to complete multiple applications. However, as unmanned equipments, UAVs lead to some security risks to general civil aviations. In order to strengthen the flight management of UAVs and guarantee the safety, UAVs can be equipped with automatic dependent surveillance-broadcast (ADS-B) devices. In addition, as an auto… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  34. arXiv:2306.16696  [pdf

    eess.AS cs.SD

    Computationally-efficient and perceptually-motivated rendering of diffuse reflections in room acoustics simulation

    Authors: Stephan D. Ewert, Nico Gößling, Oliver Buttler, Steven van de Par, Hongmei Hu

    Abstract: Geometrical acoustics is well suited for simulating room reverberation in interactive real-time applications. While the image source model (ISM) is exceptionally fast, the restriction to specular reflections impacts its perceptual plausibility. To account for diffuse late reverberation, hybrid approaches have been proposed, e.g., using a feedback delay network (FDN) in combination with the ISM. He… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: This work has been submitted to Forum Acusticum 2023 for publication

  35. arXiv:2305.12107  [pdf, other

    cs.SD cs.CL eess.AS

    EE-TTS: Emphatic Expressive TTS with Linguistic Information

    Authors: Yi Zhong, Chen Zhang, Xule Liu, Chenxi Sun, Weishan Deng, Haifeng Hu, Zhongqian Sun

    Abstract: While Current TTS systems perform well in synthesizing high-quality speech, producing highly expressive speech remains a challenge. Emphasis, as a critical factor in determining the expressiveness of speech, has attracted more attention nowadays. Previous works usually enhance the emphasis by adding intermediate features, but they can not guarantee the overall expressiveness of the speech. To reso… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech 2023, fix some typos

  36. arXiv:2305.09236  [pdf, other

    cs.CV eess.IV

    One-shot neural band selection for spectral recovery

    Authors: Hai-Miao Hu, Zhenbo Xu, Wenshuai Xu, You Song, YiTao Zhang, Liu Liu, Zhilin Han, Ajin Meng

    Abstract: Band selection has a great impact on the spectral recovery quality. To solve this ill-posed inverse problem, most band selection methods adopt hand-crafted priors or exploit clustering or sparse regularization constraints to find most prominent bands. These methods are either very slow due to the computational cost of repeatedly training with respect to different selection frequencies or different… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted by ICASSP 2023, any questions contact [email protected]

  37. arXiv:2305.04294   

    eess.IV cs.CV

    PELE scores: Pelvic X-ray Landmark Detection by Pelvis Extraction and Enhancement

    Authors: Zhen Huang, Han Li, Shitong Shao, Heqin Zhu, Huijie Hu, Zhiwei Cheng, Jianji Wang, S. Kevin Zhou

    Abstract: The pelvis, the lower part of the trunk, supports and balances the trunk. Landmark detection from a pelvic X-ray (PXR) facilitates downstream analysis and computer-assisted diagnosis and treatment of pelvic diseases. Although PXRs have the advantages of low radiation and reduced cost compared to computed tomography (CT) images, their 2D pelvis-tissue superposition of 3D structures confuses clinica… ▽ More

    Submitted 7 June, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: will revise it and resubmit it again later

  38. arXiv:2304.02687  [pdf, other

    eess.SY cs.RO

    Emergent Coordination through Game-Induced Nonlinear Opinion Dynamics

    Authors: Haimin Hu, Kensuke Nakamura, Kai-Chieh Hsu, Naomi Ehrich Leonard, Jaime Fernández Fisac

    Abstract: We present a multi-agent decision-making framework for the emergent coordination of autonomous agents whose intents are initially undecided. Dynamic non-cooperative games have been used to encode multi-agent interaction, but ambiguity arising from factors such as goal preference or the presence of multiple equilibria may lead to coordination issues, ranging from the "freezing robot" problem to uns… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  39. Domain-knowledge Inspired Pseudo Supervision (DIPS) for Unsupervised Image-to-Image Translation Models to Support Cross-Domain Classification

    Authors: Firas Al-Hindawi, Md Mahfuzur Rahman Siddiquee, Teresa Wu, Han Hu, Ying Sun

    Abstract: The ability to classify images is dependent on having access to large labeled datasets and testing on data from the same domain that the model can train on. Classification becomes more challenging when dealing with new data from a different domain, where gathering and especially labeling a larger image dataset for retraining a classification model requires a labor-intensive human effort. Cross-dom… ▽ More

    Submitted 30 September, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.09107

  40. arXiv:2303.03625  [pdf, other

    eess.IV cs.CV

    SGDA: Towards 3D Universal Pulmonary Nodule Detection via Slice Grouped Domain Attention

    Authors: Rui Xu, Zhi Liu, Yong Luo, Han Hu, Li Shen, Bo Du, Kaiming Kuang, Jiancheng Yang

    Abstract: Lung cancer is the leading cause of cancer death worldwide. The best solution for lung cancer is to diagnose the pulmonary nodules in the early stage, which is usually accomplished with the aid of thoracic computed tomography (CT). As deep learning thrives, convolutional neural networks (CNNs) have been introduced into pulmonary nodule detection to help doctors in this labor-intensive task and dem… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE/ACM Transactions on Computational Biology and Bioinformatics

  41. arXiv:2302.00171  [pdf, other

    cs.RO cs.LG eess.SY math.OC

    Active Uncertainty Reduction for Safe and Efficient Interaction Planning: A Shielding-Aware Dual Control Approach

    Authors: Haimin Hu, David Isele, Sangjae Bae, Jaime F. Fisac

    Abstract: The ability to accurately predict others' behavior is central to the safety and efficiency of interactive robotics. Unfortunately, robots often lack access to key information on which these predictions may hinge, such as other agents' goals, attention, and willingness to cooperate. Dual control theory addresses this challenge by treating unknown parameters of a predictive model as stochastic hidde… ▽ More

    Submitted 1 November, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: The International Journal of Robotics Research. arXiv admin note: text overlap with arXiv:2202.07720

  42. arXiv:2301.01446  [pdf, other

    eess.SP

    Radio Frequency Fingerprints Extraction for LTE-V2X: A Channel Estimation Based Methodology

    Authors: Tianshu Chen, Hong Shen, Aiqun Hu, Weihang He, Jie Xu, Hongxing Hu

    Abstract: The vehicular-to-everything (V2X) technology has recently drawn a number of attentions from both academic and industrial areas. However, the openness of the wireless communication system makes it more vulnerable to identity impersonation and information tampering. How to employ the powerful radio frequency fingerprint (RFF) identification technology in V2X systems turns out to be a vital and also… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: To be published in 2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)

  43. A Framework for Generalizing Critical Heat Flux Detection Models Using Unsupervised Image-to-Image Translation

    Authors: Firas Al-Hindawi, Tejaswi Soori, Han Hu, Md Mahfuzur Rahman Siddiquee, Hyunsoo Yoon, Teresa Wu, Ying Sun

    Abstract: The detection of critical heat flux (CHF) is crucial in heat boiling applications as failure to do so can cause rapid temperature ramp leading to device failures. Many machine learning models exist to detect CHF, but their performance reduces significantly when tested on data from different domains. To deal with datasets from new domains a model needs to be trained from scratch. Moreover, the data… ▽ More

    Submitted 17 March, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: This work has been submitted to the Expert Systems With Applications Journal on Sep 25, 2022

  44. arXiv:2212.08653  [pdf, other

    cs.CV eess.IV

    Attentive Mask CLIP

    Authors: Yifan Yang, Weiquan Huang, Yixuan Wei, Houwen Peng, Xinyang Jiang, Huiqiang Jiang, Fangyun Wei, Yin Wang, Han Hu, Lili Qiu, Yuqing Yang

    Abstract: Image token removal is an efficient augmentation strategy for reducing the cost of computing image features. However, this efficient augmentation strategy has been found to adversely affect the accuracy of CLIP-based training. We hypothesize that removing a large portion of image tokens may improperly discard the semantic content associated with a given text description, thus constituting an incor… ▽ More

    Submitted 9 October, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 2771-2781

  45. arXiv:2211.14314  [pdf, other

    cs.CV cs.LG cs.SD eess.AS eess.IV physics.med-ph q-bio.QM

    The applicability of transperceptual and deep learning approaches to the study and mimicry of complex cartilaginous tissues

    Authors: J. Waghorne, C. Howard, H. Hu, J. Pang, W. J. Peveler, L. Harris, O. Barrera

    Abstract: Complex soft tissues, for example the knee meniscus, play a crucial role in mobility and joint health, but when damaged are incredibly difficult to repair and replace. This is due to their highly hierarchical and porous nature which in turn leads to their unique mechanical properties. In order to design tissue substitutes, the internal architecture of the native tissue needs to be understood and r… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  46. arXiv:2210.14645  [pdf, other

    eess.IV cs.CV

    Super-Resolution Based Patch-Free 3D Image Segmentation with High-Frequency Guidance

    Authors: Hongyi Wang, Lanfen Lin, Hongjie Hu, Qingqing Chen, Yinhao Li, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen, Ruofeng Tong

    Abstract: High resolution (HR) 3D images are widely used nowadays, such as medical images like Magnetic Resonance Imaging (MRI) and Computed Tomography (CT). However, segmentation of these 3D images remains a challenge due to their high spatial resolution and dimensionality in contrast to currently limited GPU memory. Therefore, most existing 3D image segmentation methods use patch-based models, which have… ▽ More

    Submitted 10 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Version #2 uploaded in Jul 10, 2023

  47. arXiv:2210.13415  [pdf

    eess.IV cs.CV cs.LG eess.SP

    Deep Learning Approach for Dynamic Sampling for Multichannel Mass Spectrometry Imaging

    Authors: David Helminiak, Hang Hu, Julia Laskin, Dong Hye Ye

    Abstract: Mass Spectrometry Imaging (MSI), using traditional rectilinear scanning, takes hours to days for high spatial resolution acquisitions. Given that most pixels within a sample's field of view are often neither relevant to underlying biological structures nor chemically informative, MSI presents as a prime candidate for integration with sparse and dynamic sampling algorithms. During a scan, stochasti… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  48. arXiv:2209.11171  [pdf, other

    eess.SP

    A Cooperative Deception Strategy for Covert Communication in Presence of a Multi-antenna Adversary

    Authors: Jiangbo Si, Zizhen Liu, Zan Li, Hang Hu, Lei Guan, Chao Wang, Naofal Al-Dhahir

    Abstract: Covert transmission is investigated for a cooperative deception strategy, where a cooperative jammer (Jammer) tries to attract a multi-antenna adversary (Willie) and degrade the adversary's reception ability for the signal from a transmitter (Alice). For this strategy, we formulate an optimization problem to maximize the covert rate when three different types of channel state information (CSI) are… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 33 pages, 8 Figures

    MSC Class: 14J60 (Primary) 14F05; 14J26 (Secondary) ACM Class: F.2.2; I.2.7

  49. arXiv:2209.05234  [pdf, other

    cs.CV eess.IV

    Low rank prior and l0 norm to remove impulse noise in images

    Authors: Haijuan Hu

    Abstract: Patch-based low rank is an important prior assumption for image processing. Moreover, according to our calculation, the optimization of l0 norm corresponds to the maximum likelihood estimation under random-valued impulse noise. In this article, we thus combine exact rank and l0 norm for removing the noise. It is solved formally using the alternating direction method of multipliers (ADMM), with our… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  50. arXiv:2209.00353  [pdf, other

    cs.SD cs.IR cs.MM eess.AS

    AccoMontage2: A Complete Harmonization and Accompaniment Arrangement System

    Authors: Li Yi, Haochen Hu, Jingwei Zhao, Gus Xia

    Abstract: We propose AccoMontage2, a system capable of doing full-length song harmonization and accompaniment arrangement based on a lead melody. Following AccoMontage, this study focuses on generating piano arrangements for popular/folk songs and it carries on the generalized template-based retrieval method. The novelties of this study are twofold. First, we invent a harmonization module (which AccoMontage… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: Accepted by ISMIR 2022