Skip to main content

Showing 1–50 of 100 results for author: Wei, B

Searching in archive cs. Search in all archives.
.
  1. Fusion Flow-enhanced Graph Pooling Residual Networks for Unmanned Aerial Vehicles Surveillance in Day and Night Dual Visions

    Authors: Alam Noor, Kai Li, Eduardo Tovar, Pei Zhang, Bo Wei

    Abstract: Recognizing unauthorized Unmanned Aerial Vehicles (UAVs) within designated no-fly zones throughout the day and night is of paramount importance, where the unauthorized UAVs pose a substantial threat to both civil and military aviation safety. However, recognizing UAVs day and night with dual-vision cameras is nontrivial, since red-green-blue (RGB) images suffer from a low detection rate under an i… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: The article is accepted at July 08, 2024 with 13 pages and 10 figures in the Journal of Engineering Applications of Artificial Intelligence, Elsevier

  2. arXiv:2407.11414  [pdf, other

    cs.CV

    SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models

    Authors: Yang Zhou, Yongjian Wu, Jiya Saiyin, Bingzheng Wei, Maode Lai, Eric Chang, Yan Xu

    Abstract: Prompt tuning methods have achieved remarkable success in parameter-efficient fine-tuning on large pre-trained models. However, their application to dual-modal fusion-based visual-language pre-trained models (VLPMs), such as GLIP, has encountered issues. Existing prompt tuning methods have not effectively addressed the modal mapping and aligning problem for tokens in different modalities, leading… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  3. arXiv:2407.11087  [pdf, other

    eess.IV cs.CV

    Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV

    Authors: Zhiwen Yang, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu

    Abstract: Transformers have revolutionized medical image restoration, but the quadratic complexity still poses limitations for their application to high-resolution medical images. The recent advent of RWKV in the NLP field has attracted much attention as it can process long sequences efficiently. To leverage its advanced design, we propose Restore-RWKV, the first RWKV-based model for medical image restorati… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: This paper introduces the first RWKV-based model for image restoration

  4. arXiv:2407.09268  [pdf, other

    eess.IV cs.CV

    Region Attention Transformer for Medical Image Restoration

    Authors: Zhiwen Yang, Haowei Chen, Ziniu Qian, Yang Zhou, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu

    Abstract: Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmen… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by MICCAI 2024

  5. arXiv:2407.08739  [pdf, other

    cs.CV

    MAVIS: Mathematical Visual Instruction Tuning

    Authors: Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li

    Abstract: Multi-modal Large Language Models (MLLMs) have recently emerged as a significant focus in academia and industry. Despite their proficiency in general multi-modal scenarios, the mathematical problem-solving capabilities in visual contexts remain insufficiently explored. We identify three key areas within MLLMs that need to be improved: visual encoding of math diagrams, diagram-language alignment, a… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Work in progress. Data and Models are released at https://github.com/ZrrSkywalker/MAVIS

  6. arXiv:2407.08174  [pdf, other

    cs.HC q-bio.NC

    An Adaptively Weighted Averaging Method for Regional Time Series Extraction of fMRI-based Brain Decoding

    Authors: Jianfei Zhu, Baichun Wei, Jiaru Tian, Feng Jiang, Chunzhi Yi

    Abstract: Brain decoding that classifies cognitive states using the functional fluctuations of the brain can provide insightful information for understanding the brain mechanisms of cognitive functions. Among the common procedures of decoding the brain cognitive states with functional magnetic resonance imaging (fMRI), extracting the time series of each brain region after brain parcellation traditionally av… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 17 pages, 4 figures

    ACM Class: J.3

  7. arXiv:2406.18664  [pdf, other

    cs.CL cs.LG

    Evaluating Copyright Takedown Methods for Language Models

    Authors: Boyi Wei, Weijia Shi, Yangsibo Huang, Noah A. Smith, Chiyuan Zhang, Luke Zettlemoyer, Kai Li, Peter Henderson

    Abstract: Language models (LMs) derive their capabilities from extensive training on diverse data, including potentially copyrighted material. These models can memorize and generate content similar to their training data, posing potential concerns. Therefore, model creators are motivated to develop mitigation methods that prevent generating protected content. We term this procedure as copyright takedowns fo… ▽ More

    Submitted 11 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 31 pages, 9 figures, 14 tables

  8. arXiv:2406.18364  [pdf

    cs.CL cs.AI

    Research on Information Extraction of LCSTS Dataset Based on an Improved BERTSum-LSTM Model

    Authors: Yiming Chen, Haobin Chen, Simin Liu, Yunyun Liu, Fanhao Zhou, Bing Wei

    Abstract: With the continuous advancement of artificial intelligence, natural language processing technology has become widely utilized in various fields. At the same time, there are many challenges in creating Chinese news summaries. First of all, the semantics of Chinese news is complex, and the amount of information is enormous. Extracting critical information from Chinese news presents a significant cha… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: submitted to ICMIII 2024

  9. arXiv:2406.15485  [pdf, other

    cs.CL cs.CV

    SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection

    Authors: Xingjian Hu, Baole Wei, Liangcai Gao, Jun Wang

    Abstract: Text line detection is a key task in historical document analysis facing many challenges of arbitrary-shaped text lines, dense texts, and text lines with high aspect ratios, etc. In this paper, we propose a general framework for historical document text detection (SegHist), enabling existing segmentation-based text detection methods to effectively address the challenges, especially text lines with… ▽ More

    Submitted 8 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by ICDAR2024 (poster)

  10. arXiv:2406.14598  [pdf, other

    cs.AI

    SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

    Authors: Tinghao Xie, Xiangyu Qi, Yi Zeng, Yangsibo Huang, Udari Madhushani Sehwag, Kaixuan Huang, Luxi He, Boyi Wei, Dacheng Li, Ying Sheng, Ruoxi Jia, Bo Li, Kai Li, Danqi Chen, Peter Henderson, Prateek Mittal

    Abstract: Evaluating aligned large language models' (LLMs) ability to recognize and reject unsafe user requests is crucial for safe, policy-compliant deployments. Existing evaluation efforts, however, face three limitations that we address with SORRY-Bench, our proposed benchmark. First, existing methods often use coarse-grained taxonomies of unsafe topics, and are over-representing some fine-grained topics… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  11. arXiv:2406.10101  [pdf, other

    cs.SE

    Requirements are All You Need: From Requirements to Code with LLMs

    Authors: Bingyang Wei

    Abstract: The pervasive use of textual formats in the documentation of software requirements presents a great opportunity for applying large language models (LLMs) to software engineering tasks. High-quality software requirements not only enhance the manual software development process but also position organizations to fully harness the potential of the emerging LLMs technology. This paper introduces a tai… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  12. arXiv:2406.08909  [pdf, other

    cs.CV

    A Label-Free and Non-Monotonic Metric for Evaluating Denoising in Event Cameras

    Authors: Chenyang Shi, Shasha Guo, Boyi Wei, Hanxiao Liu, Yibo Zhang, Ningfang Song, Jing Jin

    Abstract: Event cameras are renowned for their high efficiency due to outputting a sparse, asynchronous stream of events. However, they are plagued by noisy events, especially in low light conditions. Denoising is an essential task for event cameras, but evaluating denoising performance is challenging. Label-dependent denoising metrics involve artificially adding noise to clean sequences, complicating evalu… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  13. arXiv:2406.05746  [pdf

    cs.AI cs.HC cs.LG

    Methodology and Real-World Applications of Dynamic Uncertain Causality Graph for Clinical Diagnosis with Explainability and Invariance

    Authors: Zhan Zhang, Qin Zhang, Yang Jiao, Lin Lu, Lin Ma, Aihua Liu, Xiao Liu, Juan Zhao, Yajun Xue, Bing Wei, Mingxia Zhang, Ru Gao, Hong Zhao, Jie Lu, Fan Li, Yang Zhang, Yiming Wang, Lei Zhang, Fengwei Tian, Jie Hu, Xin Gou

    Abstract: AI-aided clinical diagnosis is desired in medical care. Existing deep learning models lack explainability and mainly focus on image analysis. The recently developed Dynamic Uncertain Causality Graph (DUCG) approach is causality-driven, explainable, and invariant across different application scenarios, without problems of data collection, labeling, fitting, privacy, bias, generalization, high cost… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Journal ref: Artificaial Intelligence Review, (2024) 57:151

  14. arXiv:2406.05707  [pdf, other

    cs.CL cs.AI

    QGEval: A Benchmark for Question Generation Evaluation

    Authors: Weiping Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu

    Abstract: Automatically generated questions often suffer from problems such as unclear expression or factual inaccuracies, requiring a reliable and comprehensive evaluation of their quality. Human evaluation is frequently used in the field of question generation (QG) and is one of the most accurate evaluation methods. It also serves as the standard for automatic metrics. However, there is a lack of unified… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  15. arXiv:2405.19769  [pdf, other

    cs.CV

    All-In-One Medical Image Restoration via Task-Adaptive Routing

    Authors: Zhiwen Yang, Haowei Chen, Ziniu Qian, Yang Yi, Hui Zhang, Dan Zhao, Bingzheng Wei, Yan Xu

    Abstract: Although single-task medical image restoration (MedIR) has witnessed remarkable success, the limited generalizability of these methods poses a substantial obstacle to wider application. In this paper, we focus on the task of all-in-one medical image restoration, aiming to address multiple distinct MedIR tasks with a single universal model. Nonetheless, due to significant differences between differ… ▽ More

    Submitted 28 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: This article has been early accepted by MICCAI 2024

  16. arXiv:2405.19524  [pdf, other

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  17. arXiv:2405.15914  [pdf, other

    cs.CV

    ExactDreamer: High-Fidelity Text-to-3D Content Creation via Exact Score Matching

    Authors: Yumin Zhang, Xingyu Miao, Haoran Duan, Bo Wei, Tejal Shah, Yang Long, Rajiv Ranjan

    Abstract: Text-to-3D content creation is a rapidly evolving research area. Given the scarcity of 3D data, current approaches often adapt pre-trained 2D diffusion models for 3D synthesis. Among these approaches, Score Distillation Sampling (SDS) has been widely adopted. However, the issue of over-smoothing poses a significant limitation on the high-fidelity generation of 3D models. To address this challenge,… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  18. arXiv:2405.15544  [pdf, other

    q-bio.QM cs.AI cs.LG

    Knowledge-enhanced Relation Graph and Task Sampling for Few-shot Molecular Property Prediction

    Authors: Zeyu Wang, Tianyi Jiang, Yao Lu, Xiaoze Bao, Shanqing Yu, Bin Wei, Qi Xuan

    Abstract: Recently, few-shot molecular property prediction (FSMPP) has garnered increasing attention. Despite impressive breakthroughs achieved by existing methods, they often overlook the inherent many-to-many relationships between molecules and properties, which limits their performance. For instance, similar substructures of molecules can inspire the exploration of new compounds. Additionally, the relati… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  19. arXiv:2405.10674  [pdf, other

    cs.CV cs.AI

    From Sora What We Can See: A Survey of Text-to-Video Generation

    Authors: Rui Sun, Yumin Zhang, Tejal Shah, Jiahao Sun, Shuoying Zhang, Wenqi Li, Haoran Duan, Bo Wei, Rajiv Ranjan

    Abstract: With impressive achievements made, artificial intelligence is on the path forward to artificial general intelligence. Sora, developed by OpenAI, which is capable of minute-level world-simulative abilities can be considered as a milestone on this developmental path. However, despite its notable successes, Sora still encounters various obstacles that need to be resolved. In this survey, we embark fr… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: A comprehensive list of text-to-video generation studies in this survey is available at https://github.com/soraw-ai/Awesome-Text-to-Video-Generation

  20. arXiv:2403.14374  [pdf, other

    cs.CL cs.IR

    FIT-RAG: Black-Box RAG with Factual Information and Token Reduction

    Authors: Yuren Mao, Xuemei Dong, Wenyi Xu, Yunjun Gao, Bin Wei, Ying Zhang

    Abstract: Due to the extraordinarily large number of parameters, fine-tuning Large Language Models (LLMs) to update long-tail or out-of-date knowledge is impractical in lots of applications. To avoid fine-tuning, we can alternatively treat a LLM as a black-box (i.e., freeze the parameters of the LLM) and augment it with a Retrieval-Augmented Generation (RAG) system, namely black-box RAG. Recently, black-box… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  21. arXiv:2402.05162  [pdf, other

    cs.LG cs.AI cs.CL

    Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

    Authors: Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson

    Abstract: Large language models (LLMs) show inherent brittleness in their safety mechanisms, as evidenced by their susceptibility to jailbreaking and even non-malicious fine-tuning. This study explores this brittleness of safety alignment by leveraging pruning and low-rank modifications. We develop methods to identify critical regions that are vital for safety guardrails, and that are disentangled from util… ▽ More

    Submitted 1 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 22 pages, 9 figures. Project page is available at https://boyiwei.com/alignment-attribution/

  22. arXiv:2401.08185  [pdf

    cs.CV cs.AI eess.IV

    DPAFNet:Dual Path Attention Fusion Network for Single Image Deraining

    Authors: Bingcai Wei

    Abstract: Rainy weather will have a significant impact on the regular operation of the imaging system. Based on this premise, image rain removal has always been a popular branch of low-level visual tasks, especially methods using deep neural networks. However, most neural networks are but-branched, such as only using convolutional neural networks or Transformers, which is unfavourable for the multidimension… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  23. arXiv:2311.13317  [pdf, other

    cs.CV

    Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution

    Authors: Yuxuan Zhou, Liangcai Gao, Zhi Tang, Baole Wei

    Abstract: Scene Text Image Super-Resolution (STISR) aims to enhance the resolution and legibility of text within low-resolution (LR) images, consequently elevating recognition accuracy in Scene Text Recognition (STR). Previous methods predominantly employ discriminative Convolutional Neural Networks (CNNs) augmented with diverse forms of text guidance to address this issue. Nevertheless, they remain deficie… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  24. arXiv:2309.09984  [pdf

    q-bio.NC cs.NE

    BDEC:Brain Deep Embedded Clustering model

    Authors: Xiaoxiao Ma, Chunzhi Yi, Zhicai Zhong, Hui Zhou, Baichun Wei, Haiqi Zhu, Feng Jiang

    Abstract: An essential premise for neuroscience brain network analysis is the successful segmentation of the cerebral cortex into functionally homogeneous regions. Resting-state functional magnetic resonance imaging (rs-fMRI), capturing the spontaneous activities of the brain, provides the potential for cortical parcellation. Previous parcellation methods can be roughly categorized into three groups, mainly… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  25. arXiv:2309.07170  [pdf, other

    eess.SP cs.LG

    Overview of Human Activity Recognition Using Sensor Data

    Authors: Rebeen Ali Hamad, Wai Lok Woo, Bo Wei, Longzhi Yang

    Abstract: Human activity recognition (HAR) is an essential research field that has been used in different applications including home and workplace automation, security and surveillance as well as healthcare. Starting from conventional machine learning methods to the recently developing deep learning techniques and the Internet of things, significant contributions have been shown in the HAR area in the last… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  26. arXiv:2307.05249  [pdf, other

    eess.IV cs.CV cs.LG

    DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis

    Authors: Zhiwen Yang, Yang Zhou, Hui Zhang, Bingzheng Wei, Yubo Fan, Yan Xu

    Abstract: Multi-center positron emission tomography (PET) image synthesis aims at recovering low-dose PET images from multiple different centers. The generalizability of existing methods can still be suboptimal for a multi-center study due to domain shifts, which result from non-identical data distribution among centers with different imaging systems/protocols. While some approaches address domain shifts by… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: This article has been early accepted by MICCAI 2023,but has not been fully edited. Content may change prior to final publication

  27. arXiv:2306.17659  [pdf, other

    cs.CV

    Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

    Authors: Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

    Abstract: Large-scale visual-language pre-trained models (VLPM) have proven their excellent performance in downstream object detection for natural scenes. However, zero-shot nuclei detection on H\&E images via VLPMs remains underexplored. The large gap between medical images and the web-originated text-image pairs used for pre-training makes it a challenging task. In this paper, we attempt to explore the po… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: This article has been accepted by MICCAI 2023,but has not been fully edited. Content may change prior to final publication

  28. Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation

    Authors: Yang Zhou, Yongjian Wu, Zihua Wang, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

    Abstract: Nuclei instance segmentation on histopathology images is of great clinical value for disease analysis. Generally, fully-supervised algorithms for this task require pixel-wise manual annotations, which is especially time-consuming and laborious for the high nuclei density. To alleviate the annotation burden, we seek to solve the problem through image-level weakly supervised learning, which is under… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI https://doi.org/10.1109/TMI.2023.3275609, IEEE Transactions on Medical Imaging. Code: https://github.com/wuyongjianCODE/Cyclic

  29. arXiv:2305.10198  [pdf, other

    cs.CV eess.IV

    IDO-VFI: Identifying Dynamics via Optical Flow Guidance for Video Frame Interpolation with Events

    Authors: Chenyang Shi, Hanxiao Liu, Jing Jin, Wenzhuo Li, Yuzhen Li, Boyi Wei, Yibo Zhang

    Abstract: Video frame interpolation aims to generate high-quality intermediate frames from boundary frames and increase frame rate. While existing linear, symmetric and nonlinear models are used to bridge the gap from the lack of inter-frame motion, they cannot reconstruct real motions. Event cameras, however, are ideal for capturing inter-frame dynamics with their extremely high temporal resolution. In thi… ▽ More

    Submitted 18 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  30. arXiv:2305.03270  [pdf, other

    cs.RO

    Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

    Authors: Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montserrat Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, Keerthana Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Kelly Fu, Bob Wei, Sangeetha Ramesh, Khem Holden, Kim Kleiven, David Rendleman, Sean Kirmani, Jeff Bingham , et al. (15 additional authors not shown)

    Abstract: We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Published at Robotics: Science and Systems 2023

  31. arXiv:2303.17614  [pdf, other

    cs.HC cs.AI eess.SP

    Estimating Continuous Muscle Fatigue For Multi-Muscle Coordinated Exercise: A Pilot Study

    Authors: Chunzhi Yi, Baichun Wei, Wei Jin, Jianfei Zhu, Seungmin Rho, Zhiyuan Chen, Feng Jiang

    Abstract: Assessing the progression of muscle fatigue for daily exercises provides vital indicators for precise rehabilitation, personalized training dose, especially under the context of Metaverse. Assessing fatigue of multi-muscle coordination-involved daily exercises requires the neuromuscular features that represent the fatigue-induced characteristics of spatiotemporal adaptions of multiple muscles and… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: submitted to IEEE JBHI

  32. arXiv:2303.15107  [pdf, other

    cs.HC

    ActiveSelfHAR: Incorporating Self Training into Active Learning to Improve Cross-Subject Human Activity Recognition

    Authors: Baichun Wei, Chunzhi Yi, Qi Zhang, Haiqi Zhu, Jianfei Zhu, Feng Jiang

    Abstract: Deep learning-based human activity recognition (HAR) methods have shown great promise in the applications of smart healthcare systems and wireless body sensor network (BSN). Despite their demonstrated performance in laboratory settings, the real-world implementation of such methods is still hindered by the cross-subject issue when adapting to new users. To solve this issue, we propose ActiveSelfHA… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  33. arXiv:2303.04365  [pdf, other

    cs.CV

    SANDFORMER: CNN and Transformer under Gated Fusion for Sand Dust Image Restoration

    Authors: Jun Shi, Bingcai Wei, Gang Zhou, Liye Zhang

    Abstract: Although Convolutional Neural Networks (CNN) have made good progress in image restoration, the intrinsic equivalence and locality of convolutions still constrain further improvements in image quality. Recent vision transformer and self-attention have achieved promising results on various computer vision tasks. However, directly utilizing Transformer for image restoration is a challenging task. In… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: ICASSP 2023

  34. arXiv:2302.11095  [pdf, other

    cs.CV

    MM-SFENet: Multi-scale Multi-task Localization and Classification of Bladder Cancer in MRI with Spatial Feature Encoder Network

    Authors: Yu Ren, Guoli Wang, Pingping Wang, Kunmeng Liu, Quanjin Liu, Hongfu Sun, Xiang Li, Benzheng Wei

    Abstract: Background and Objective: Bladder cancer is a common malignant urinary carcinoma, with muscle-invasive and non-muscle-invasive as its two major subtypes. This paper aims to achieve automated bladder cancer invasiveness localization and classification based on MRI. Method: Different from previous efforts that segment bladder wall and tumor, we propose a novel end-to-end multi-scale multi-task spati… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  35. arXiv:2302.11082  [pdf, other

    cs.CV

    BB-GCN: A Bi-modal Bridged Graph Convolutional Network for Multi-label Chest X-Ray Recognition

    Authors: Guoli Wang, Pingping Wang, Jinyu Cong, Kunmeng Liu, Benzheng Wei

    Abstract: Multi-label chest X-ray (CXR) recognition involves simultaneously diagnosing and identifying multiple labels for different pathologies. Since pathological labels have rich information about their relationship to each other, modeling the co-occurrence dependencies between pathological labels is essential to improve recognition performance. However, previous methods rely on state variable coding and… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: under Computers in Biology and Medicine submission

  36. arXiv:2302.03222  [pdf, other

    cs.CL

    Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support

    Authors: Stephen Obadinma, Faiza Khan Khattak, Shirley Wang, Tania Sidhom, Elaine Lau, Sean Robertson, Jingcheng Niu, Winnie Au, Alif Munim, Karthik Raja K. Bhaskar, Bencheng Wei, Iris Ren, Waqar Muhammad, Erin Li, Bukola Ishola, Michael Wang, Griffin Tanner, Yu-Jia Shiah, Sean X. Zhang, Kwesi P. Apponsah, Kanishk Patel, Jaswinder Narain, Deval Pandya, Xiaodan Zhu, Frank Rudzicz , et al. (1 additional authors not shown)

    Abstract: Building Agent Assistants that can help improve customer service support requires inputs from industry users and their customers, as well as knowledge about state-of-the-art Natural Language Processing (NLP) technology. We combine expertise from academia and industry to bridge the gap and build task/domain-specific Neural Agent Assistants (NAA) with three high-level components for: (1) Intent Iden… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Camera Ready Version of Paper Published in EMNLP 2022 Industry Track

  37. arXiv:2212.14479  [pdf, other

    cs.NI cs.LG

    Pensieve 5G: Implementation of RL-based ABR Algorithm for UHD 4K/8K Content Delivery on Commercial 5G SA/NR-DC Network

    Authors: Kasidis Arunruangsirilert, Bo Wei, Hang Song, Jiro Katto

    Abstract: While the rollout of the fifth-generation mobile network (5G) is underway across the globe with the intention to deliver 4K/8K UHD videos, Augmented Reality (AR), and Virtual Reality (VR) content to the mass amounts of users, the coverage and throughput are still one of the most significant issues, especially in the rural areas, where only 5G in the low-frequency band are being deployed. This call… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: 2023 IEEE Wireless Communications and Networking Conference (WCNC), 26-29 March 2023, Glasgow, Scotland, UK

  38. arXiv:2212.10866  [pdf

    cs.CR

    CyberEye: Obtaining Data from Virtual Desktop by Video

    Authors: Bin Wei

    Abstract: VDI is no longer safe and reliable anymore. VDI(Virtual Desktop Infrastructure, also called Cloud Desktop) is being widely used as working interface to avoid data exfiltration. With VDI client, end users can access internal data without obtaining data actually. In this paper, we present a new approach named CyberEye, to extract data from VDI by video even data transmission has been forbidden. By e… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Open source code: https://github.com/bin-will/cybereye This paper contains 17 pages, 12 figures

  39. arXiv:2212.08418  [pdf, other

    cs.NI

    rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments

    Authors: Bo Wei, Mingcen Gao, Chengwen Luo, Sen Wang, Jin Zhang

    Abstract: In this paper, we propose rWiFiSLAM, an indoor localisation system based on WiFi ranging measurements. Indoor localisation techniques play an important role in mobile robots when they cannot access good quality GPS signals in indoor environments. Indoor localisation also has many other applications, such as rescue, smart buildings, etc. Inertial Measurement Units (IMU) have been used for Pedestria… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  40. arXiv:2209.12029  [pdf, other

    cs.LG cs.AI

    Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation

    Authors: Kang Xu, Yan Ma, Bingsheng Wei, Wei Li

    Abstract: While Reinforcement Learning can achieve impressive results for complex tasks, the learned policies are generally prone to fail in downstream tasks with even minor model mismatch or unexpected perturbations. Recent works have demonstrated that a policy population with diverse behavior characteristics can generalize to downstream environments with various discrepancies. However, such policies might… ▽ More

    Submitted 20 May, 2023; v1 submitted 24 September, 2022; originally announced September 2022.

  41. arXiv:2209.02916  [pdf, other

    cs.LG cs.AR

    Hardware Acceleration of Sampling Algorithms in Sample and Aggregate Graph Neural Networks

    Authors: Yuchen Gui, Boyi Wei, Wei Yuan, Xi Jin

    Abstract: Sampling is an important process in many GNN structures in order to train larger datasets with a smaller computational complexity. However, compared to other processes in GNN (such as aggregate, backward propagation), the sampling process still costs tremendous time, which limits the speed of training. To reduce the time of sampling, hardware acceleration is an ideal choice. However, state of the… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  42. arXiv:2207.12744  [pdf, other

    cs.CV cs.AI

    Distribution Learning Based on Evolutionary Algorithm Assisted Deep Neural Networks for Imbalanced Image Classification

    Authors: Yudi Zhao, Kuangrong Hao, Chaochen Gu, Bing Wei

    Abstract: To address the trade-off problem of quality-diversity for the generated images in imbalanced classification tasks, we research on over-sampling based methods at the feature level instead of the data level and focus on searching the latent feature space for optimal distributions. On this basis, we propose an iMproved Estimation Distribution Algorithm based Latent featUre Distribution Evolution (MED… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  43. arXiv:2206.09427  [pdf

    cs.NI cs.MM eess.IV

    QuDASH: Quantum-inspired rate adaptation approach for DASH video streaming

    Authors: Bo Wei, Hang Song, Makoto Nakamura, Koichi Kimura, Nozomu Togawa, Jiro Katto

    Abstract: Internet traffic is dramatically increasing with the development of network technologies and video streaming traffic accounts for large amount within the total traffic, which reveals the importance to guarantee the quality of content delivery service. Based on the network conditions, adaptive bitrate (ABR) control is utilized as a common technique which can choose the proper bitrate to ensure the… ▽ More

    Submitted 21 October, 2023; v1 submitted 19 June, 2022; originally announced June 2022.

    Comments: Accepted Version

    Journal ref: IEEE Access, 2023

  44. arXiv:2205.08878  [pdf, other

    cs.CV

    Transformer based multiple instance learning for weakly supervised histopathology image segmentation

    Authors: Ziniu Qian, Kailu Li, Maode Lai, Eric I-Chao Chang, Bingzheng Wei, Yubo Fan, Yan Xu

    Abstract: Hispathological image segmentation algorithms play a critical role in computer aided diagnosis technology. The development of weakly supervised segmentation algorithm alleviates the problem of medical image annotation that it is time-consuming and labor-intensive. As a subset of weakly supervised learning, Multiple Instance Learning (MIL) has been proven to be effective in segmentation. However, t… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Provisional accepted for MICCAI 2022

  45. Semi-Cycled Generative Adversarial Networks for Real-World Face Super-Resolution

    Authors: Hao Hou, Jun Xu, Yingkun Hou, Xiaotao Hu, Benzheng Wei, Dinggang Shen

    Abstract: Real-world face super-resolution (SR) is a highly ill-posed image restoration task. The fully-cycled Cycle-GAN architecture is widely employed to achieve promising performance on face SR, but prone to produce artifacts upon challenging cases in real-world scenarios, since joint participation in the same degradation branch will impact final performance due to huge domain gap between real-world and… ▽ More

    Submitted 25 January, 2023; v1 submitted 8 May, 2022; originally announced May 2022.

  46. arXiv:2203.10435  [pdf

    cs.CV

    Vision Transformer with Convolutions Architecture Search

    Authors: Haichao Zhang, Kuangrong Hao, Witold Pedrycz, Lei Gao, Xuesong Tang, Bing Wei

    Abstract: Transformers exhibit great advantages in handling computer vision tasks. They model image classification tasks by utilizing a multi-head attention mechanism to process a series of patches consisting of split images. However, for complex tasks, Transformer in computer vision not only requires inheriting a bit of dynamic attention and global context, but also needs to introduce features concerning n… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

  47. arXiv:2201.04286  [pdf, other

    cs.NE cs.LG

    Evolutionary Action Selection for Gradient-based Policy Learning

    Authors: Yan Ma, Tianxing Liu, Bingsheng Wei, Yi Liu, Kang Xu, Wei Li

    Abstract: Evolutionary Algorithms (EAs) and Deep Reinforcement Learning (DRL) have recently been integrated to take the advantage of the both methods for better exploration and exploitation.The evolutionary part in these hybrid methods maintains a population of policy networks.However, existing methods focus on optimizing the parameters of policy network, which is usually high-dimensional and tricky for EA.… ▽ More

    Submitted 16 September, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

  48. arXiv:2111.12618  [pdf, other

    cs.IR

    Group based Personalized Search by Integrating Search Behaviour and Friend Network

    Authors: Yujia Zhou, Zhicheng Dou, Bingzheng Wei, Ruobing Xievand Ji-Rong Wen

    Abstract: The key to personalized search is to build the user profile based on historical behaviour. To deal with the users who lack historical data, group based personalized models were proposed to incorporate the profiles of similar users when re-ranking the results. However, similar users are mostly found based on simple lexical or topical similarity in search behaviours. In this paper, we propose a neur… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 10 pages

  49. arXiv:2109.12293  [pdf

    cs.NI cs.MM eess.IV

    Adaptive video transmission using QUBO method and Digital Annealer based on Ising machine

    Authors: Bo Wei, Hang Song, Jiro Katto

    Abstract: With the dramatically increasing video streaming in the total network traffic, it is critical to develop effective algorithms to promote the content delivery service of high quality. Adaptive bitrate (ABR) control is the most essential technique which determines the proper bitrate to be chosen based on network conditions, thus realize high-quality video streaming. In this paper, a novel ABR strate… ▽ More

    Submitted 25 September, 2021; originally announced September 2021.

  50. arXiv:2109.10485  [pdf, other

    cs.CL

    The NiuTrans Machine Translation Systems for WMT21

    Authors: Shuhan Zhou, Tao Zhou, Binghao Wei, Yingfeng Luo, Yongyu Mu, Zefan Zhou, Chenglong Wang, Xuanjun Zhou, Chuanhao Lv, Yi Jing, Laohu Wang, Jingnan Zhang, Canan Huang, Zhongxiang Yan, Chi Hu, Bei Li, Tong Xiao, Jingbo Zhu

    Abstract: This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. We made submissions to 9 language directions, including English$\leftrightarrow$$\{$Chinese, Japanese, Russian, Icelandic$\}$ and English$\rightarrow$Hausa tasks. Our primary systems are built on several effective variants of Transformer, e.g., Transformer-DLCL, ODE-Transformer. We also utilize… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.