Zum Hauptinhalt springen

Showing 1–50 of 69 results for author: Huang, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.08485  [pdf, other

    eess.SP

    Generalized code index modulation-aided frequency offset realign multiple-antenna spatial modulation approach for next-generation green communication systems

    Authors: Bang Huang, Jiajie Xu, Mohamed-Slim Alouini

    Abstract: For next-generation green communication systems, this article proposes an innovative communication system based on frequency-diverse array-multiple-input multiple-output (FDA-MIMO) technology, which aims to achieve high data rates while maintaining low power consumption. This system utilizes frequency offset index realign modulation, multiple-antenna spatial index modulation, and spreading code in… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2406.10365  [pdf, other

    eess.SY math.OC

    Multi-Objective Control Co-design Using Graph-Based Optimization for Offshore Wind Farm Grid Integration

    Authors: Himanshu Sharma, Wei Wang, Bowen Huang, Thiagarajan Ramachandran, Veronica Adetola

    Abstract: Offshore wind farms have emerged as a popular renewable energy source that can generate substantial electric power with a low environmental impact. However, integrating these farms into the grid poses significant complexities. To address these issues, optimal-sized energy storage can provide potential solutions and help improve the reliability, efficiency, and flexibility of the grid. Nevertheless… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.00492  [pdf, other

    eess.IV cs.CV cs.LG

    SAM-VMNet: Deep Neural Networks For Coronary Angiography Vessel Segmentation

    Authors: Xueying Zeng, Baixiang Huang, Yu Luo, Guangyu Wei, Songyan He, Yushuang Shao

    Abstract: Coronary artery disease (CAD) is one of the most prevalent diseases in the cardiovascular field and one of the major contributors to death worldwide. Computed Tomography Angiography (CTA) images are regarded as the authoritative standard for the diagnosis of coronary artery disease, and by performing vessel segmentation and stenosis detection on CTA images, physicians are able to diagnose coronary… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2405.17167  [pdf

    eess.IV cs.CV

    Partitioned Hankel-based Diffusion Models for Few-shot Low-dose CT Reconstruction

    Authors: Wenhao Zhang, Bin Huang, Shuyue Chen, Xiaoling Xu, Weiwen Wu, Qiegen Liu

    Abstract: Low-dose computed tomography (LDCT) plays a vital role in clinical applications by mitigating radiation risks. Nevertheless, reducing radiation doses significantly degrades image quality. Concurrently, common deep learning methods demand extensive data, posing concerns about privacy, cost, and time constraints. Consequently, we propose a few-shot low-dose CT reconstruction method using Partitioned… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.05814  [pdf

    eess.IV cs.CV

    MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction

    Authors: Pinhuang Tan, Mengxiao Geng, Jingya Lu, Liu Shi, Bin Huang, Qiegen Liu

    Abstract: Computed Tomography (CT) technology reduces radiation haz-ards to the human body through sparse sampling, but fewer sampling angles pose challenges for image reconstruction. Score-based generative models are widely used in sparse-view CT re-construction, performance diminishes significantly with a sharp reduction in projection angles. Therefore, we propose an ultra-sparse view CT reconstruction me… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  7. arXiv:2404.00247  [pdf, ps, other

    eess.SY cs.AI cs.LG

    Facilitating Reinforcement Learning for Process Control Using Transfer Learning: Perspectives

    Authors: Runze Lin, Junghui Chen, Lei Xie, Hongye Su, Biao Huang

    Abstract: This paper provides insights into deep reinforcement learning (DRL) for process control from the perspective of transfer learning. We analyze the challenges of applying DRL in the field of process industries and the necessity of introducing transfer learning. Furthermore, recommendations and prospects are provided for future research directions on how transfer learning can be integrated with DRL t… ▽ More

    Submitted 1 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Final Version of Asian Control Conference (ASCC 2024)

  8. arXiv:2403.19238  [pdf, other

    cs.CV cs.AI eess.IV

    Taming Lookup Tables for Efficient Image Retouching

    Authors: Sidi Yang, Binxiao Huang, Mingdeng Cao, Yatai Ji, Hanzhong Guo, Ngai Wong, Yujiu Yang

    Abstract: The widespread use of high-definition screens in edge devices, such as end-user cameras, smartphones, and televisions, is spurring a significant demand for image enhancement. Existing enhancement models often optimize for high performance while falling short of reducing hardware inference time and power consumption, especially on edge devices with constrained computing and storage resources. To th… ▽ More

    Submitted 13 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted by ECCV2024

  9. arXiv:2403.14180  [pdf, ps, other

    eess.SP

    Adaptive Target Detection for FDA-MIMO Radar with Training Data in Gaussian noise

    Authors: Ping Li, Bang Huang, Wen-Qin Wang

    Abstract: This paper addresses the problem of detecting a moving target embedded in Gaussian noise with an unknown covariance matrix for frequency diverse array multiple-input multiple-output (FDA-MIMO) radar. To end it, assume that obtaining a set of training data is available. Moreover, we propose three adaptive detectors in accordance with the one-step generalized likelihood ratio test (GLRT), two-step G… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  10. arXiv:2403.08337  [pdf, other

    eess.SY cs.AI cs.LG

    LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments

    Authors: Maonan Wang, Aoyu Pang, Yuheng Kan, Man-On Pun, Chung Shue Chen, Bo Huang

    Abstract: Traffic congestion in metropolitan areas presents a formidable challenge with far-reaching economic, environmental, and societal ramifications. Therefore, effective congestion management is imperative, with traffic signal control (TSC) systems being pivotal in this endeavor. Conventional TSC systems, designed upon rule-based algorithms or reinforcement learning (RL), frequently exhibit deficiencie… ▽ More

    Submitted 12 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 20 pages, 11 figures

  11. Machine learning for industrial sensing and control: A survey and practical perspective

    Authors: Nathan P. Lawrence, Seshu Kumar Damarla, Jong Woo Kim, Aditya Tulsyan, Faraz Amjad, Kai Wang, Benoit Chachuat, Jong Min Lee, Biao Huang, R. Bhushan Gopaluni

    Abstract: With the rise of deep learning, there has been renewed interest within the process industries to utilize data on large-scale nonlinear sensing and control problems. We identify key statistical and machine learning techniques that have seen practical success in the process industries. To do so, we start with hybrid modeling to provide a methodological framework underlying core application areas: so… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 48 pages

    Journal ref: Control Engineering Practice 2024

  12. arXiv:2401.02662  [pdf, other

    cs.NI eess.SP

    GainNet: Coordinates the Odd Couple of Generative AI and 6G Networks

    Authors: Ning Chen, Jie Yang, Zhipeng Cheng, Xuwei Fan, Zhang Liu, Bangzhen Huang, Yifeng Zhao, Lianfen Huang, Xiaojiang Du, Mohsen Guizani

    Abstract: The rapid expansion of AI-generated content (AIGC) reflects the iteration from assistive AI towards generative AI (GAI) with creativity. Meanwhile, the 6G networks will also evolve from the Internet-of-everything to the Internet-of-intelligence with hybrid heterogeneous network architectures. In the future, the interplay between GAI and the 6G will lead to new opportunities, where GAI can learn th… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 10 pages, 5 figures, 1 table

  13. arXiv:2312.17004  [pdf, other

    eess.IV cs.CV

    Continual Learning in Medical Image Analysis: A Comprehensive Review of Recent Advancements and Future Prospects

    Authors: Pratibha Kumari, Joohi Chauhan, Afshin Bozorgpour, Boqiang Huang, Reza Azad, Dorit Merhof

    Abstract: Medical imaging analysis has witnessed remarkable advancements even surpassing human-level performance in recent years, driven by the rapid development of advanced deep-learning algorithms. However, when the inference dataset slightly differs from what the model has seen during one-time training, the model performance is greatly compromised. The situation requires restarting the training process u… ▽ More

    Submitted 22 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  14. arXiv:2312.14468  [pdf, ps, other

    eess.SP

    FDA-MIMO-based Integrated Sensing and Communication System with Frequency Offset Permutation Index Modulation

    Authors: Jiangwei Jian, Qimao Huang, Bang Huang, Wen-Qin Wang

    Abstract: Considering that frequency diverse array multiple-input multiple-output (FDA-MIMO) possesses extra range information to enhance sensing performance, this paper explores the FDA-MIMO-based integrated sensing and communication (ISAC) system. To reinforce the system communication capability, we propose the frequency offset permutation index modulation (FOPIM) scheme, which conveys extra information b… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  15. arXiv:2312.06101  [pdf, other

    eess.IV cs.CV

    Hundred-Kilobyte Lookup Tables for Efficient Single-Image Super-Resolution

    Authors: Binxiao Huang, Jason Chun Lok Li, Jie Ran, Boyu Li, Jiajun Zhou, Dahai Yu, Ngai Wong

    Abstract: Conventional super-resolution (SR) schemes make heavy use of convolutional neural networks (CNNs), which involve intensive multiply-accumulate (MAC) operations, and require specialized hardware such as graphics processing units. This contradicts the regime of edge AI that often runs on devices strained by power, computing, and storage resources. Such a challenge has motivated a series of lookup ta… ▽ More

    Submitted 8 May, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  16. arXiv:2311.11209  [pdf, other

    eess.IV cs.CV

    3D Guidewire Shape Reconstruction from Monoplane Fluoroscopic Images

    Authors: Tudor Jianu, Baoru Huang, Pierre Berthet-Rayne, Sebastiano Fichera, Anh Nguyen

    Abstract: Endovascular navigation, essential for diagnosing and treating endovascular diseases, predominantly hinges on fluoroscopic images due to the constraints in sensory feedback. Current shape reconstruction techniques for endovascular intervention often rely on either a priori information or specialized equipment, potentially subjecting patients to heightened radiation exposure. While deep learning ho… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 11 pages

  17. arXiv:2311.11205  [pdf, other

    eess.IV cs.CV

    Shape-Sensitive Loss for Catheter and Guidewire Segmentation

    Authors: Chayun Kongtongvattana, Baoru Huang, Jingxuan Kang, Hoan Nguyen, Olajide Olufemi, Anh Nguyen

    Abstract: We introduce a shape-sensitive loss function for catheter and guidewire segmentation and utilize it in a vision transformer network to establish a new state-of-the-art result on a large-scale X-ray images dataset. We transform network-derived predictions and their corresponding ground truths into signed distance maps, thereby enabling any networks to concentrate on the essential boundaries rather… ▽ More

    Submitted 19 January, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: 13 pages

  18. arXiv:2311.08823  [pdf, other

    physics.med-ph eess.IV

    Ultrafast 3-D Super Resolution Ultrasound using Row-Column Array specific Coherence-based Beamforming and Rolling Acoustic Sub-aperture Processing: In Vitro, In Vivo and Clinical Study

    Authors: Joseph Hansen-Shearer, Jipeng Yan, Marcelo Lerendegui, Biao Huang, Matthieu Toulemonde, Kai Riemer, Qingyuan Tan, Johanna Tonko, Peter D. Weinberg, Chris Dunsby, Meng-Xing Tang

    Abstract: The row-column addressed array is an emerging probe for ultrafast 3-D ultrasound imaging. It achieves this with far fewer independent electronic channels and a wider field of view than traditional 2-D matrix arrays, of the same channel count, making it a good candidate for clinical translation. However, the image quality of row-column arrays is generally poor, particularly when investigating tissu… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  19. arXiv:2311.03815  [pdf, other

    cs.NI eess.SP

    Integrated Sensing, Communication, and Computing for Cost-effective Multimodal Federated Perception

    Authors: Ning Chen, Zhipeng Cheng, Xuwei Fan, Bangzhen Huang, Yifeng Zhao, Lianfen Huang, Xiaojiang Du, Mohsen Guizani

    Abstract: Federated learning (FL) is a classic paradigm of 6G edge intelligence (EI), which alleviates privacy leaks and high communication pressure caused by traditional centralized data processing in the artificial intelligence of things (AIoT). The implementation of multimodal federated perception (MFP) services involves three sub-processes, including sensing-based multimodal data generation, communicati… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  20. arXiv:2310.13177  [pdf

    eess.SY

    Enhancing Building Energy Efficiency through Advanced Sizing and Dispatch Methods for Energy Storage

    Authors: Min Gyung Yu, Xu Ma, Bowen Huang, Karthik Devaprasad, Fredericka Brown, Di Wu

    Abstract: Energy storage and electrification of buildings hold great potential for future decarbonized energy systems. However, there are several technical and economic barriers that prevent large-scale adoption and integration of energy storage in buildings. These barriers include integration with building control systems, high capital costs, and the necessity to identify and quantify value streams for dif… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  21. arXiv:2308.15942  [pdf

    eess.IV cs.CV

    Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction

    Authors: Kai Xu, Shiyu Lu, Bin Huang, Weiwen Wu, Qiegen Liu

    Abstract: Diffusion models have emerged as potential tools to tackle the challenge of sparse-view CT reconstruction, displaying superior performance compared to conventional methods. Nevertheless, these prevailing diffusion models predominantly focus on the sinogram or image domains, which can lead to instability during model training, potentially culminating in convergence towards local minimal solutions.… ▽ More

    Submitted 3 September, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  22. arXiv:2308.02765  [pdf

    eess.SY cs.AI

    Surrogate Empowered Sim2Real Transfer of Deep Reinforcement Learning for ORC Superheat Control

    Authors: Runze Lin, Yangyang Luo, Xialai Wu, Junghui Chen, Biao Huang, Lei Xie, Hongye Su

    Abstract: The Organic Rankine Cycle (ORC) is widely used in industrial waste heat recovery due to its simple structure and easy maintenance. However, in the context of smart manufacturing in the process industry, traditional model-based optimization control methods are unable to adapt to the varying operating conditions of the ORC system or sudden changes in operating modes. Deep reinforcement learning (DRL… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  23. arXiv:2307.03662  [pdf, other

    eess.IV cs.CV

    Detecting the Sensing Area of A Laparoscopic Probe in Minimally Invasive Cancer Surgery

    Authors: Baoru Huang, Yicheng Hu, Anh Nguyen, Stamatia Giannarou, Daniel S. Elson

    Abstract: In surgical oncology, it is challenging for surgeons to identify lymph nodes and completely resect cancer even with pre-operative imaging systems like PET and CT, because of the lack of reliable intraoperative visualization tools. Endoscopic radio-guided cancer detection and resection has recently been evaluated whereby a novel tethered laparoscopic gamma detector is used to localize a preoperativ… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI 2023

  24. arXiv:2305.15188  [pdf, other

    cs.LG eess.SY

    Policy Learning based on Deep Koopman Representation

    Authors: Wenjian Hao, Paulo C. Heredia, Bowen Huang, Zehui Lu, Zihao Liang, Shaoshuai Mou

    Abstract: This paper proposes a policy learning algorithm based on the Koopman operator theory and policy gradient approach, which seeks to approximate an unknown dynamical system and search for optimal policy simultaneously, using the observations gathered through interaction with the environment. The proposed algorithm has two innovations: first, it introduces the so-called deep Koopman representation int… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  25. arXiv:2305.11385  [pdf, other

    eess.SY math.DS

    Robust MPC with Zone Tracking

    Authors: Zhiyinan Huang, Jinfeng Liu, Biao Huang

    Abstract: We propose a robust nonlinear model predictive control design with generalized zone tracking (ZMPC) in this work. The proposed ZMPC has guaranteed convergence into the target zone in the presence of bounded disturbance. The proposed approach achieves this by modifying the actual target zone such that the effect of disturbances is rejected. A control invariant set (CIS) inside the modified target z… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  26. arXiv:2304.10780  [pdf, other

    cs.CV eess.IV

    Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction

    Authors: Binbin Huang, Xingyue Peng, Siyuan Shen, Suan Xia, Ruiqian Li, Yanhua Yu, Yuehan Wang, Shenghua Gao, Wenzheng Chen, Shiying Li, Jingyi Yu

    Abstract: We introduce Omni-LOS, a neural computational imaging method for conducting holistic shape reconstruction (HSR) of complex objects utilizing a Single-Photon Avalanche Diode (SPAD)-based time-of-flight sensor. As illustrated in Fig. 1, our method enables new capabilities to reconstruct near-$360^\circ$ surrounding geometry of an object from a single scan spot. In such a scenario, traditional line-o… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  27. arXiv:2304.07693  [pdf, other

    eess.IV cs.CV

    Translating Simulation Images to X-ray Images via Multi-Scale Semantic Matching

    Authors: Jingxuan Kang, Tudor Jianu, Baoru Huang, Binod Bhattarai, Ngan Le, Frans Coenen, Anh Nguyen

    Abstract: Endovascular intervention training is increasingly being conducted in virtual simulators. However, transferring the experience from endovascular simulators to the real world remains an open problem. The key challenge is the virtual environments are usually not realistically simulated, especially the simulation images. In this paper, we propose a new method to translate simulation images from an en… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: 11 pages

  28. arXiv:2304.03821  [pdf, other

    eess.SY

    A Computational Efficient Pumped Storage Hydro Optimization in the Look-ahead Unit Commitment and Real-time Market Dispatch Under Uncertainty

    Authors: Bing Huang, Arezou Ghesmati, Yonghong Chen, Ross Baldick

    Abstract: Pumped storage hydro units (PSHU) are great sources of flexibility in power systems. This is especially valuable in modern systems with increasing shares of intermittent renewable resources. However, the flexibility from PSHUs, particularly in the real-time market, has not been thoroughly studied. The storage optimization in a real-time market hasn't been well addressed. To enhance the use of PSH… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 10 pages, 8 figures

  29. arXiv:2304.00819  [pdf, other

    eess.IV

    Acceleration-Based Kalman Tracking for Super-Resolution Ultrasound Imaging in vivo

    Authors: Biao Huang, Jipeng Yan, Megan Morris, Victoria Sinnett, Navita Somaiah, Meng-Xing Tang

    Abstract: Super-resolution ultrasound can image microvascular structure and flow at sub-wave-diffraction resolution based on localising and tracking microbubbles. Currently, tracking microbubbles accurately under limited imaging frame rates and high microbubble concentrations remains a challenge, especially under the effect of cardiac pulsatility and in highly curved vessels. In this study, an acceleration-… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 15 pages, 10 figures

  30. arXiv:2303.14003  [pdf

    eess.IV eess.SP

    Transthoracic super-resolution ultrasound localisation microscopy of myocardial vasculature in patients

    Authors: Jipeng Yan, Biao Huang, Johanna Tonko, Matthieu Toulemonde, Joseph Hansen-Shearer, Qingyuan Tan, Kai Riemer, Konstantinos Ntagiantas, Rasheda A Chowdhury, Pier Lambiase, Roxy Senior, Meng-Xing Tang

    Abstract: Micro-vascular flow in the myocardium is of significant importance clinically but remains poorly understood. Up to 25% of patients with symptoms of coronary heart diseases have no obstructive coronary arteries and have suspected microvascular diseases. However, such microvasculature is difficult to image in vivo with existing modalities due to the lack of resolution and sensitivity. Here, we demon… ▽ More

    Submitted 28 March, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: 22 pages, 10 figures

  31. Distributed Data-driven Predictive Control via Dissipative Behavior Synthesis

    Authors: Yitao Yan, Jie Bao, Biao Huang

    Abstract: This paper presents a distributed data-driven predictive control (DDPC) approach using the behavioral framework. It aims to design a network of controllers for an interconnected system with linear time-invariant (LTI) subsystems such that a given global (network-wide) cost function is minimized while desired control performance (e.g., network stability and disturbance rejection) is achieved using… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Journal ref: IEEE Transactions on Automatic Control, 2023

  32. arXiv:2212.03630  [pdf

    eess.IV cs.CV

    One Sample Diffusion Model in Projection Domain for Low-Dose CT Imaging

    Authors: Bin Huang, Liu Zhang, Shiyu Lu, Boyu Lin, Weiwen Wu, Qiegen Liu

    Abstract: Low-dose computed tomography (CT) plays a significant role in reducing the radiation risk in clinical applications. However, lowering the radiation dose will significantly degrade the image quality. With the rapid development and wide application of deep learning, it has brought new directions for the development of low-dose CT imaging algorithms. Therefore, we propose a fully unsupervised one sam… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 11 pages, 11 figures. arXiv admin note: text overlap with arXiv:2211.13926

  33. arXiv:2210.06272  [pdf, other

    eess.SY

    Deep Koopman Learning of Nonlinear Time-Varying Systems

    Authors: Wenjian Hao, Bowen Huang, Wei Pan, Di Wu, Shaoshuai Mou

    Abstract: This paper presents a data-driven approach to approximate the dynamics of a nonlinear time-varying system (NTVS) by a linear time-varying system (LTVS), which is resulted from the Koopman operator and deep neural networks. Analysis of the approximation error between states of the NTVS and the resulting LTVS is presented. Simulations on a representative NTVS show that the proposed method achieves s… ▽ More

    Submitted 21 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  34. Modern Machine Learning Tools for Monitoring and Control of Industrial Processes: A Survey

    Authors: R. Bhushan Gopaluni, Aditya Tulsyan, Benoit Chachuat, Biao Huang, Jong Min Lee, Faraz Amjad, Seshu Kumar Damarla, Jong Woo Kim, Nathan P. Lawrence

    Abstract: Over the last ten years, we have seen a significant increase in industrial data, tremendous improvement in computational power, and major theoretical advances in machine learning. This opens up an opportunity to use modern machine learning tools on large-scale nonlinear monitoring and control problems. This article provides a survey of recent results with applications in the process industry.

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: IFAC World Congress 2020

  35. arXiv:2208.06449  [pdf, other

    eess.IV cs.CV

    When CNN Meet with ViT: Towards Semi-Supervised Learning for Multi-Class Medical Image Semantic Segmentation

    Authors: Ziyang Wang, Tianze Li, Jian-Qing Zheng, Baoru Huang

    Abstract: Due to the lack of quality annotation in medical imaging community, semi-supervised learning methods are highly valued in image semantic segmentation tasks. In this paper, an advanced consistency-aware pseudo-label-based self-ensembling approach is presented to fully utilize the power of Vision Transformer(ViT) and Convolutional Neural Network(CNN) in semi-supervised learning. Our proposed framewo… ▽ More

    Submitted 8 February, 2024; v1 submitted 12 August, 2022; originally announced August 2022.

  36. arXiv:2207.07370  [pdf, other

    eess.IV cs.CV

    CKD-TransBTS: Clinical Knowledge-Driven Hybrid Transformer with Modality-Correlated Cross-Attention for Brain Tumor Segmentation

    Authors: Jianwei Lin, Jiatai Lin, Cheng Lu, Hao Chen, Huan Lin, Bingchao Zhao, Zhenwei Shi, Bingjiang Qiu, Xipeng Pan, Zeyan Xu, Biao Huang, Changhong Liang, Guoqiang Han, Zaiyi Liu, Chu Han

    Abstract: Brain tumor segmentation (BTS) in magnetic resonance image (MRI) is crucial for brain tumor diagnosis, cancer management and research purposes. With the great success of the ten-year BraTS challenges as well as the advances of CNN and Transformer algorithms, a lot of outstanding BTS models have been proposed to tackle the difficulties of BTS in different technical aspects. However, existing studie… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  37. arXiv:2206.13437  [pdf

    eess.SY

    A Generalized Probabilistic Monitoring Model with Both Random and Sequential Data

    Authors: Wanke Yu, Min Wu, Biao Huang, Chengda Lu

    Abstract: Many multivariate statistical analysis methods and their corresponding probabilistic counterparts have been adopted to develop process monitoring models in recent decades. However, the insightful connections between them have rarely been studied. In this study, a generalized probabilistic monitoring model (GPMM) is developed with both random and sequential data. Since GPMM can be reduced to variou… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: 12 pages, 4 figures, 3 tables

  38. arXiv:2204.12694  [pdf, other

    eess.SY math.DS

    Model predictive control of agro-hydrological systems based on a two-layer neural network modeling framework

    Authors: Zhiyinan Huang, Jinfeng Liu, Biao Huang

    Abstract: Water scarcity is an urgent issue to be resolved and improving irrigation water-use efficiency through closed-loop control is essential. The complex agro-hydrological system dynamics, however, often pose challenges in closed-loop control applications. In this work, we propose a two-layer neural network (NN) framework to approximate the dynamics of the agro-hydrological system. To minimize the pred… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  39. Residual Aligner Network

    Authors: Jian-Qing Zheng, Ziyang Wang, Baoru Huang, Ngee Han Lim, Bartlomiej W. Papiez

    Abstract: Image registration is important for medical imaging, the estimation of the spatial transformation between different images. Many previous studies have used learning-based methods for coarse-to-fine registration to efficiently perform 3D image registration. The coarse-to-fine approach, however, is limited when dealing with the different motions of nearby objects. Here we propose a novel Motion-Awar… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  40. arXiv:2202.13560  [pdf, other

    eess.IV cs.CV

    ConvNeXt-backbone HoVerNet for nuclei segmentation and classification

    Authors: Jiachen Li, Chixin Wang, Banban Huang, Zekun Zhou

    Abstract: This manuscript gives a brief description of the algorithm used to participate in CoNIC Challenge 2022. After the baseline was made available, we follow the method in it and replace the ResNet baseline with ConvNeXt one. Moreover, we propose to first convert RGB space to Haematoxylin-Eosin-DAB(HED) space, then use Haematoxylin composition of origin image to smooth semantic one hot label. Afterward… ▽ More

    Submitted 28 March, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

  41. arXiv:2112.14460  [pdf, other

    cs.DB cs.AI eess.SY

    Baihe: SysML Framework for AI-driven Databases

    Authors: Andreas Pfadler, Rong Zhu, Wei Chen, Botong Huang, Tianjing Zeng, Bolin Ding, Jingren Zhou

    Abstract: We present Baihe, a SysML Framework for AI-driven Databases. Using Baihe, an existing relational database system may be retrofitted to use learned components for query optimization or other common tasks, such as e.g. learned structure for indexing. To ensure the practicality and real world applicability of Baihe, its high level architecture is based on the following requirements: separation from t… ▽ More

    Submitted 29 December, 2021; originally announced December 2021.

  42. arXiv:2111.01544  [pdf

    eess.IV cs.CV physics.med-ph

    Comprehensive and Clinically Accurate Head and Neck Organs at Risk Delineation via Stratified Deep Learning: A Large-scale Multi-Institutional Study

    Authors: Dazhou Guo, Jia Ge, Xianghua Ye, Senxiang Yan, Yi Xin, Yuchen Song, Bing-shen Huang, Tsung-Min Hung, Zhuotun Zhu, Ling Peng, Yanping Ren, Rui Liu, Gong Zhang, Mengyuan Mao, Xiaohua Chen, Zhongjie Lu, Wenxiang Li, Yuzhen Chen, Lingyun Huang, Jing Xiao, Adam P. Harrison, Le Lu, Chien-Yu Lin, Dakai Jin, Tsung-Ying Ho

    Abstract: Accurate organ at risk (OAR) segmentation is critical to reduce the radiotherapy post-treatment complications. Consensus guidelines recommend a set of more than 40 OARs in the head and neck (H&N) region, however, due to the predictable prohibitive labor-cost of this task, most institutions choose a substantially simplified protocol by delineating a smaller subset of OARs and neglecting the dose di… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

  43. arXiv:2109.11115  [pdf

    cs.SD eess.AS

    Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning

    Authors: Rui Li, Dong Pu, Minnie Huang, Bill Huang

    Abstract: One-shot voice cloning aims to transform speaker voice and speaking style in speech synthesized from a text-to-speech (TTS) system, where only a shot recording from the target reference speech can be used. Out-of-domain transfer is still a challenging task, and one important aspect that impacts the accuracy and similarity of synthetic speech is the conditional representations carrying speaker or s… ▽ More

    Submitted 24 February, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: 6 pages, 5 figures, Accepted to IEEE ICASSP 2022

  44. arXiv:2108.01522  [pdf, other

    eess.IV

    CSMCNet: Scalable Video Compressive Sensing Reconstruction with Interpretable Motion Estimation

    Authors: Bowen Huang, Xiao Yan, Jinjia Zhou, Yibo Fan

    Abstract: Most deep network methods for compressive sensing reconstruction suffer from the black-box characteristic of DNN. In this paper, a deep neural network with interpretable motion estimation named CSMCNet is proposed. The network is able to realize high-quality reconstruction of video compressive sensing by unfolding the iterative steps of optimization based algorithms. A DNN based, multi-hypothesis… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: 12 pages, 10 pages, 5 tables

  45. arXiv:2107.04644  [pdf, other

    eess.IV cs.CV

    Self-Supervised Generative Adversarial Network for Depth Estimation in Laparoscopic Images

    Authors: Baoru Huang, Jianqing Zheng, Anh Nguyen, David Tuch, Kunal Vyas, Stamatia Giannarou, Daniel S. Elson

    Abstract: Dense depth estimation and 3D reconstruction of a surgical scene are crucial steps in computer assisted surgery. Recent work has shown that depth estimation from a stereo images pair could be solved with convolutional neural networks. However, most recent depth estimation models were trained on datasets with per-pixel ground truth. Such data is especially rare for laparoscopic imaging, making it h… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

  46. arXiv:2106.11258  [pdf, ps, other

    eess.SY math.OC

    A comparative study of model approximation methods applied to economic MPC

    Authors: Zhiyinan Huang, Qinyao Liu, Jinfeng Liu, Biao Huang

    Abstract: Economic model predictive control (EMPC) has attracted significant attention in recent years and is recognized as a promising advanced process control method for the next generation smart manufacturing. It can lead to improving economic performance but at the same time increases the computational complexity significantly. Model approximation has been a standard approach for reducing computational… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  47. arXiv:2105.08629  [pdf, other

    eess.IV cs.CV cs.LG

    Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang , et al. (7 additional authors not shown)

    Abstract: Image denoising is one of the most critical problems in mobile photo processing. While many solutions have been proposed for this task, they are usually working with synthetic data and are too computationally expensive to run on mobile devices. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based image denoising solut… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.07809, arXiv:2105.07825

  48. arXiv:2103.10063  [pdf, other

    eess.SY

    Behavioural Approach to Distributed Control of Interconnected Systems

    Authors: Yitao Yan, Jie Bao, Biao Huang

    Abstract: This paper formulates a framework for the analysis and distributed control of interconnected systems from the behavioural perspective. The discussions are carried out from the viewpoint of set theory and the results are completely representation-free. The core of a dynamical system can be represented as the set of all trajectories admissible through the system and interconnections are interpreted… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  49. arXiv:2012.13684  [pdf, other

    eess.SY

    An Evidential Reasoning Based Approach to Building Node Selection Criterion for Network Reduction

    Authors: Bin Huang, Jiayong Li, Jianhui Wang

    Abstract: A reasonable node selection criterion (NSC) is crucial for the network reduction in power systems. In contrast to the previous works that only consider structure property, this paper proposes a comprehensive and quantitative NSC considering both structural and electrical properties. The proposed NSC is developed by employing the evidential reasoning approach, in which the quasi-one-hot encoding is… ▽ More

    Submitted 26 December, 2020; originally announced December 2020.

  50. arXiv:2012.11803  [pdf, other

    cs.CV cs.CR eess.IV

    Modeling Deep Learning Based Privacy Attacks on Physical Mail

    Authors: Bingyao Huang, Ruyi Lian, Dimitris Samaras, Haibin Ling

    Abstract: Mail privacy protection aims to prevent unauthorized access to hidden content within an envelope since normal paper envelopes are not as safe as we think. In this paper, for the first time, we show that with a well designed deep learning model, the hidden content may be largely recovered without opening the envelope. We start by modeling deep learning-based privacy attacks on physical mail content… ▽ More

    Submitted 25 March, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Source code: https://github.com/BingyaoHuang/Neural-STE