Zum Hauptinhalt springen

Showing 1–50 of 189 results for author: Jia, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.17207  [pdf, other

    cs.CV cs.RO

    NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar

    Authors: Runwei Guan, Jianan Liu, Liye Jia, Haocheng Zhao, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Eng Gee Lim, Jeremy Smith, Yutao Yue

    Abstract: Recently, visual grounding and multi-sensors setting have been incorporated into perception system for terrestrial autonomous driving systems and Unmanned Surface Vehicles (USVs), yet the high complexity of modern learning-based visual grounding model using multi-sensors prevents such model to be deployed on USVs in the real-life. To this end, we design a low-power multi-task model named NanoMVG f… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 8 pages, 6 figures

  2. arXiv:2408.15263  [pdf, other

    cs.CV cs.AI

    S4DL: Shift-sensitive Spatial-Spectral Disentangling Learning for Hyperspectral Image Unsupervised Domain Adaptation

    Authors: Jie Feng, Tianshu Zhang, Junpeng Zhang, Ronghua Shang, Weisheng Dong, Guangming Shi, Licheng Jiao

    Abstract: Unsupervised domain adaptation techniques, extensively studied in hyperspectral image (HSI) classification, aim to use labeled source domain data and unlabeled target domain data to learn domain invariant features for cross-scene classification. Compared to natural images, numerous spectral bands of HSIs provide abundant semantic information, but they also increase the domain shift significantly.… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  3. arXiv:2408.13582  [pdf, other

    cs.CV

    CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track

    Authors: Jinming Chai, Qin Ma, Junpei Zhang, Licheng Jiao, Fang Liu

    Abstract: Video object segmentation is a challenging task that serves as the cornerstone of numerous downstream applications, including video editing and autonomous driving. In this technical report, we briefly introduce the solution of our team "yuanjie" for video object segmentation in the 6-th LSVOS Challenge VOS Track at ECCV 2024. We believe that our proposed CSS-Segment will perform better in videos o… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

  4. arXiv:2408.01946  [pdf, other

    cs.CV

    Masked Angle-Aware Autoencoder for Remote Sensing Images

    Authors: Zhihao Li, Biao Hou, Siteng Ma, Zitong Wu, Xianpeng Guo, Bo Ren, Licheng Jiao

    Abstract: To overcome the inherent domain gap between remote sensing (RS) images and natural images, some self-supervised representation learning methods have made promising progress. However, they have overlooked the diverse angles present in RS objects. This paper proposes the Masked Angle-Aware Autoencoder (MA3E) to perceive and learn angles during pre-training. We design a \textit{scaling center crop} o… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted by ECCV 2024

  5. arXiv:2407.19428  [pdf, other

    cs.LG cs.CR cs.CV

    Reputation-Driven Asynchronous Federated Learning for Enhanced Trajectory Prediction with Blockchain

    Authors: Weiliang Chen, Li Jia, Yang Zhou, Qianqian Ren

    Abstract: Federated learning combined with blockchain empowers secure data sharing in autonomous driving applications. Nevertheless, with the increasing granularity and complexity of vehicle-generated data, the lack of data quality audits raises concerns about multi-party mistrust in trajectory prediction tasks. In response, this paper proposes an asynchronous federated learning data sharing method based on… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  6. arXiv:2407.09162  [pdf, other

    cs.LG cs.AI

    Exploring State Space and Reasoning by Elimination in Tsetlin Machines

    Authors: Ahmed K. Kadhim, Ole-Christoffer Granmo, Lei Jiao, Rishad Shafik

    Abstract: The Tsetlin Machine (TM) has gained significant attention in Machine Learning (ML). By employing logical fundamentals, it facilitates pattern learning and representation, offering an alternative approach for developing comprehensible Artificial Intelligence (AI) with a specific focus on pattern classification in the form of conjunctive clauses. In the domain of Natural Language Processing (NLP), T… ▽ More

    Submitted 17 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 8 pages, 8 figures

  7. arXiv:2407.05347  [pdf, other

    cs.NI

    A Queueing Theoretic Perspective on Low-Latency LLM Inference with Variable Token Length

    Authors: Yuqing Yang, Yuedong Xu, Lei Jiao

    Abstract: Large language models (LLMs) propel the prosperity of interactive AI applications showcased by ChatGPT that demand timely response of inference services. However, LLM inference is computation intensive and memory intensive, and improper parameter configuration at LLM platforms may exacerbate the inference time. In this paper, we analyze the impact of LLM output token distribution on the inference… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 8 pages

  8. arXiv:2407.01220  [pdf, other

    cs.CV

    Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation

    Authors: Zihan Gao, Lingling Li, Licheng Jiao, Fang Liu, Xu Liu, Wenping Ma, Yuwei Guo, Shuyuan Yang

    Abstract: Understanding 3D scenes is a crucial challenge in computer vision research with applications spanning multiple domains. Recent advancements in distilling 2D vision-language foundation models into neural fields, like NeRF and 3DGS, enables open-vocabulary segmentation of 3D scenes from 2D multi-view images without the need for precise 3D annotations. While effective, however, the per-pixel distilla… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 16 pages, 7 figures

  9. arXiv:2406.17005  [pdf, other

    cs.CV

    PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

    Authors: Henghui Ding, Chang Liu, Yunchao Wei, Nikhila Ravi, Shuting He, Song Bai, Philip Torr, Deshui Miao, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang, Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu, Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang, Mingqi Gao, Jingnan Luo , et al. (12 additional authors not shown)

    Abstract: Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object Segmentation Track based on MOSE dataset and Motion Expression guided Video Segmentation track based on MeViS dataset. In the two new tracks, we provide additional videos and annotations that feature challenging elements, such as… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: MOSE Challenge: https://henghuiding.github.io/MOSE/ChallengeCVPR2024, MeViS Challenge: https://henghuiding.github.io/MeViS/ChallengeCVPR2024

  10. arXiv:2406.13984  [pdf, other

    cs.DC cs.LG

    Reducing Memory Contention and I/O Congestion for Disk-based GNN Training

    Authors: Qisheng Jiang, Lei Jia, Chundong Wang

    Abstract: Graph neural networks (GNNs) gain wide popularity. Large graphs with high-dimensional features become common and training GNNs on them is non-trivial on an ordinary machine. Given a gigantic graph, even sample-based GNN training cannot work efficiently, since it is difficult to keep the graph's entire data in memory during the training process. Leveraging a solid-state drive (SSD) or other storage… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: This is a full version for the paper with almost the same title accepted by the 53rd International Conference on Parallel Processing (ICPP 2024)

  11. arXiv:2406.11739  [pdf, other

    cs.CV

    V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

    Authors: Jiaqi Wang, Yuhang Zang, Pan Zhang, Tao Chu, Yuhang Cao, Zeyi Sun, Ziyu Liu, Xiaoyi Dong, Tong Wu, Dahua Lin, Zeming Chen, Zhi Wang, Lingchen Meng, Wenhao Yao, Jianwei Yang, Sihong Wu, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Peixi Wu, Bosong Chai, Xuan Nie, Longquan Yan, Zeyu Wang, Qifan Zhou , et al. (9 additional authors not shown)

    Abstract: Detecting objects in real-world scenes is a complex task due to various challenges, including the vast range of object categories, and potential encounters with previously unknown or unseen objects. The challenges necessitate the development of public benchmarks and challenges to advance the field of object detection. Inspired by the success of previous COCO and LVIS Challenges, we organize the V3… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  12. arXiv:2406.10744  [pdf, other

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu , et al. (75 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a… ▽ More

    Submitted 12 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 PBDL Challenges: https://pbdl-ws.github.io/pbdl2024/challenge/index.html

  13. arXiv:2406.08829  [pdf, other

    cs.CV cs.CR

    Improving Adversarial Robustness via Feature Pattern Consistency Constraint

    Authors: Jiacong Hu, Jingwen Ye, Zunlei Feng, Jiazhen Yang, Shunyu Liu, Xiaotian Yu, Lingxiang Jia, Mingli Song

    Abstract: Convolutional Neural Networks (CNNs) are well-known for their vulnerability to adversarial attacks, posing significant security concerns. In response to these threats, various defense methods have emerged to bolster the model's robustness. However, most existing methods either focus on learning from adversarial perturbations, leading to overfitting to the adversarial examples, or aim to eliminate… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  14. arXiv:2406.07949  [pdf, other

    cs.CV

    Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection

    Authors: Jie Feng, Xiaojian Zhong, Di Li, Weisheng Dong, Ronghua Shang, Licheng Jiao

    Abstract: Band selection plays a crucial role in hyperspectral image classification by removing redundant and noisy bands and retaining discriminative ones. However, most existing deep learning-based methods are aimed at dealing with a specific band selection dataset, and need to retrain parameters for new datasets, which significantly limits their generalizability.To address this issue, a novel multi-teach… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  15. arXiv:2406.06813  [pdf, other

    cs.CV

    Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation

    Authors: Dong Zhao, Shuang Wang, Qi Zang, Licheng Jiao, Nicu Sebe, Zhun Zhong

    Abstract: We study source-free unsupervised domain adaptation (SFUDA) for semantic segmentation, which aims to adapt a source-trained model to the target domain without accessing the source data. Many works have been proposed to address this challenging problem, among which uncertainty-based self-training is a predominant approach. However, without comprehensive denoising mechanisms, they still largely fall… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 2024 Conference on Computer Vision and Pattern Recognition

    Journal ref: (2024 Conference on Computer Vision and Pattern Recognition)

  16. arXiv:2406.05055  [pdf, other

    cs.AI

    Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions

    Authors: Shi-Yu Tian, Zhi Zhou, Lin-Han Jia, Lan-Zhe Guo, Yu-Feng Li

    Abstract: Large language models (LLMs) have demonstrated impressive performance on reasoning tasks, which can be further improved through few-shot prompting techniques. However, the current evaluation primarily focuses on carefully constructed benchmarks and neglects the consideration of real-world reasoning problems that present missing and contradictory conditions, known as ill-defined problems. Our obser… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Preprint. arXiv admin note: text overlap with arXiv:2304.09797

  17. arXiv:2406.04961  [pdf, other

    cs.CV

    Multiplane Prior Guided Few-Shot Aerial Scene Rendering

    Authors: Zihan Gao, Licheng Jiao, Lingling Li, Xu Liu, Fang Liu, Puhua Chen, Yuwei Guo

    Abstract: Neural Radiance Fields (NeRF) have been successfully applied in various aerial scenes, yet they face challenges with sparse views due to limited supervision. The acquisition of dense aerial views is often prohibitive, as unmanned aerial vehicles (UAVs) may encounter constraints in perspective range and energy constraints. In this work, we introduce Multiplane Prior guided NeRF (MPNeRF), a novel ap… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 17 pages, 8 figures, accepted at CVPR 2024

    Journal ref: CVPR 2024

  18. arXiv:2406.03668  [pdf, other

    cs.CV cs.AI

    3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation

    Authors: Xinyu Liu, Jing Zhang, Kexin Zhang, Yuting Yang, Licheng Jiao, Shuyuan Yang

    Abstract: Video Object Segmentation (VOS) is a vital task in computer vision, focusing on distinguishing foreground objects from the background across video frames. Our work draws inspiration from the Cutie model, and we investigate the effects of object memory, the total number of memory frames, and input resolution on segmentation performance. This report validates the effectiveness of our inference metho… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  19. arXiv:2406.02648  [pdf, other

    cs.LG cs.AI

    Exploring Effects of Hyperdimensional Vectors for Tsetlin Machines

    Authors: Vojtech Halenka, Ahmed K. Kadhim, Paul F. A. Clarke, Bimal Bhattarai, Rupsa Saha, Ole-Christoffer Granmo, Lei Jiao, Per-Arne Andersen

    Abstract: Tsetlin machines (TMs) have been successful in several application domains, operating with high efficiency on Boolean representations of the input data. However, Booleanizing complex data structures such as sequences, graphs, images, signal spectra, chemical compounds, and natural language is not trivial. In this paper, we propose a hypervector (HV) based method for expressing arbitrarily large se… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 9 pages, 17 figures

  20. arXiv:2406.01918  [pdf, other

    cs.CR

    Image steganography based on generative implicit neural representation

    Authors: Zhong Yangjie, Liu Jia, Ke Yan, Liu Meiqi

    Abstract: In the realm of advanced steganography, the scale of the model typically correlates directly with the resolution of the fundamental grid, necessitating the training of a distinct neural network for message extraction. This paper proposes an image steganography based on generative implicit neural representation. This approach transcends the constraints of image resolution by portraying data as cont… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 33 pages, 15 figures and 5 tables

    MSC Class: 68T07 ACM Class: E.3

  21. arXiv:2405.19779  [pdf, other

    cs.NE cs.GR cs.LG

    Automatic Graph Topology-Aware Transformer

    Authors: Chao Wang, Jiaxuan Zhao, Lingling Li, Licheng Jiao, Fang Liu, Shuyuan Yang

    Abstract: Existing efforts are dedicated to designing many topologies and graph-aware strategies for the graph Transformer, which greatly improve the model's representation capabilities. However, manually determining the suitable Transformer architecture for a specific graph dataset or task requires extensive expert knowledge and laborious trials. This paper proposes an evolutionary graph Transformer archit… ▽ More

    Submitted 5 August, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: This work has been accepted by IEEE Transactions on Neural Networks and Learning Systems. Copyright may be transferred without notice, after which this version may no longer be accessible

  22. arXiv:2405.18959  [pdf, other

    cs.CV cs.MM

    Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing Image-Text Retrieval

    Authors: Rui Yang, Shuang Wang, Yingping Han, Yuanheng Li, Dong Zhao, Dou Quan, Yanhe Guo, Licheng Jiao

    Abstract: Remote Sensing Image-Text Retrieval (RSITR) is pivotal for knowledge services and data mining in the remote sensing (RS) domain. Considering the multi-scale representations in image content and text vocabulary can enable the models to learn richer representations and enhance retrieval. Current multi-scale RSITR approaches typically align multi-scale fused image features with text features, but ove… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures

  23. arXiv:2405.16956  [pdf, other

    cs.LG cs.AI cs.CE cs.PL cs.SE

    Functional Programming Paradigm of Python for Scientific Computation Pipeline Integration

    Authors: Chen Zhang, Lecheng Jia, Wei Zhang, Ning Wen

    Abstract: The advent of modern data processing has led to an increasing tendency towards interdisciplinarity, which frequently involves the importation of different technical approaches. Consequently, there is an urgent need for a unified data control system to facilitate the integration of varying libraries. This integration is of profound significance in accelerating prototype verification, optimising alg… ▽ More

    Submitted 3 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: 16 pages

  24. arXiv:2405.04496  [pdf, other

    cs.CV

    Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing

    Authors: Yi Zuo, Lingling Li, Licheng Jiao, Fang Liu, Xu Liu, Wenping Ma, Shuyuan Yang, Yuwei Guo

    Abstract: Existing diffusion-based video editing methods have achieved impressive results in motion editing. Most of the existing methods focus on the motion alignment between the edited video and the reference video. However, these methods do not constrain the background and object content of the video to remain unchanged, which makes it possible for users to generate unexpected videos. In this paper, we p… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  25. Swipe2Pair: Secure and Fast In-Band Wireless Device Pairing

    Authors: Yaqi He, Kai Zeng, Long Jiao, Brian L. Mark, Khaled N. Khasawneh

    Abstract: Wireless device pairing is a critical security mechanism to bootstrap the secure communication between two devices without a pre-shared secret. It has been widely used in many Internet of Things (IoT) applications, such as smart-home and smart-health. Most existing device pairing mechanisms are based on out-of-band channels, e.g., extra sensors or hardware, to validate the proximity of pairing dev… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  26. arXiv:2404.18213  [pdf, other

    cs.CV cs.AI

    S$^2$Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification

    Authors: Guanchun Wang, Xiangrong Zhang, Zelin Peng, Tianyang Zhang, Licheng Jiao

    Abstract: Land cover analysis using hyperspectral images (HSI) remains an open problem due to their low spatial resolution and complex spectral information. Recent studies are primarily dedicated to designing Transformer-based architectures for spatial-spectral long-range dependencies modeling, which is computationally expensive with quadratic complexity. Selective structured state space model (Mamba), whic… ▽ More

    Submitted 13 August, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 12 pages, 7 figures

  27. arXiv:2404.17173  [pdf, other

    cs.CV cs.AI

    Exploring Beyond Logits: Hierarchical Dynamic Labeling Based on Embeddings for Semi-Supervised Classification

    Authors: Yanbiao Ma, Licheng Jiao, Fang Liu, Lingling Li, Shuyuan Yang, Xu Liu

    Abstract: In semi-supervised learning, methods that rely on confidence learning to generate pseudo-labels have been widely proposed. However, increasing research finds that when faced with noisy and biased data, the model's representation network is more reliable than the classification network. Additionally, label generation methods based on model predictions often show poor adaptability across different d… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  28. arXiv:2404.13859  [pdf, other

    cs.CV cs.AI

    Unveiling and Mitigating Generalized Biases of DNNs through the Intrinsic Dimensions of Perceptual Manifolds

    Authors: Yanbiao Ma, Licheng Jiao, Fang Liu, Lingling Li, Wenping Ma, Shuyuan Yang, Xu Liu, Puhua Chen

    Abstract: Building fair deep neural networks (DNNs) is a crucial step towards achieving trustworthy artificial intelligence. Delving into deeper factors that affect the fairness of DNNs is paramount and serves as the foundation for mitigating model biases. However, current methods are limited in accurately predicting DNN biases, relying solely on the number of training samples and lacking more precise measu… ▽ More

    Submitted 17 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 8pages, 6figures, Submitted to TPAMI

  29. arXiv:2403.12686  [pdf, other

    cs.CV cs.MM cs.RO

    WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar

    Authors: Runwei Guan, Liye Jia, Fengyufan Yang, Shanliang Yao, Erick Purwanto, Xiaohui Zhu, Eng Gee Lim, Jeremy Smith, Ka Lok Man, Xuming Hu, Yutao Yue

    Abstract: The perception of waterways based on human intent is significant for autonomous navigation and operations of Unmanned Surface Vehicles (USVs) in water environments. Inspired by visual grounding, we introduce WaterVG, the first visual grounding dataset designed for USV-based waterway perception based on human prompts. WaterVG encompasses prompts describing multiple targets, with annotations at the… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 10 pages, 10 figures

  30. arXiv:2403.01381  [pdf, other

    cs.CV

    SA-MixNet: Structure-aware Mixup and Invariance Learning for Scribble-supervised Road Extraction in Remote Sensing Images

    Authors: Jie Feng, Hao Huang, Junpeng Zhang, Weisheng Dong, Dingwen Zhang, Licheng Jiao

    Abstract: Mainstreamed weakly supervised road extractors rely on highly confident pseudo-labels propagated from scribbles, and their performance often degrades gradually as the image scenes tend various. We argue that such degradation is due to the poor model's invariance to scenes with different complexities, whereas existing solutions to this problem are commonly based on crafted priors that cannot be der… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  31. arXiv:2401.11436  [pdf, other

    cs.CV

    Geometric Prior Guided Feature Representation Learning for Long-Tailed Classification

    Authors: Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Puhua Chen

    Abstract: Real-world data are long-tailed, the lack of tail samples leads to a significant limitation in the generalization ability of the model. Although numerous approaches of class re-balancing perform well for moderate class imbalance problems, additional knowledge needs to be introduced to help the tail class recover the underlying true distribution when the observed distribution from a few tail sample… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: This work was accepted by the IJCV

  32. arXiv:2401.10510  [pdf, other

    cs.NE cs.AI cs.CL cs.LG

    When large language models meet evolutionary algorithms

    Authors: Wang Chao, Jiaxuan Zhao, Licheng Jiao, Lingling Li, Fang Liu, Shuyuan Yang

    Abstract: Pre-trained large language models (LLMs) have powerful capabilities for generating creative natural text. Evolutionary algorithms (EAs) can discover diverse solutions to complex real-world problems. Motivated by the common collective and directionality of text generation and evolution, this paper illustrates the parallels between LLMs and EAs, which includes multiple one-to-one key characteristics… ▽ More

    Submitted 29 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: A review article under two review

  33. arXiv:2401.02940  [pdf, other

    quant-ph cs.LG

    Digital-analog quantum learning on Rydberg atom arrays

    Authors: Jonathan Z. Lu, Lucy Jiao, Kristina Wolinski, Milan Kornjača, Hong-Ye Hu, Sergio Cantu, Fangli Liu, Susanne F. Yelin, Sheng-Tao Wang

    Abstract: We propose hybrid digital-analog learning algorithms on Rydberg atom arrays, combining the potentially practical utility and near-term realizability of quantum learning with the rapidly scaling architectures of neutral atoms. Our construction requires only single-qubit operations in the digital setting and global driving according to the Rydberg Hamiltonian in the analog setting. We perform a comp… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 22 pages, 20 figures

  34. arXiv:2312.07021  [pdf, other

    cs.CV cs.AI

    Transferring Modality-Aware Pedestrian Attentive Learning for Visible-Infrared Person Re-identification

    Authors: Yuwei Guo, Wenhao Zhang, Licheng Jiao, Shuang Wang, Shuo Wang, Fang Liu

    Abstract: Visible-infrared person re-identification (VI-ReID) aims to search the same pedestrian of interest across visible and infrared modalities. Existing models mainly focus on compensating for modality-specific information to reduce modality variation. However, these methods often lead to a higher computational overhead and may introduce interfering information when generating the corresponding images… ▽ More

    Submitted 18 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  35. arXiv:2312.06331  [pdf, other

    cs.CV

    Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation

    Authors: Dong Zhao, Ruizhi Yang, Shuang Wang, Qi Zang, Yang Hu, Licheng Jiao, Nicu Sebe, Zhun Zhong

    Abstract: Presently, self-training stands as a prevailing approach in cross-domain semantic segmentation, enhancing model efficacy by training with pixels assigned with reliable pseudo-labels. However, we find two critical limitations in this paradigm. (1) The majority of reliable pixels exhibit a speckle-shaped pattern and are primarily located in the central semantic region. This presents challenges for t… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  36. arXiv:2311.13445  [pdf, other

    cs.LG cs.CR

    Transfer Attacks and Defenses for Large Language Models on Coding Tasks

    Authors: Chi Zhang, Zifan Wang, Ravi Mangal, Matt Fredrikson, Limin Jia, Corina Pasareanu

    Abstract: Modern large language models (LLMs), such as ChatGPT, have demonstrated impressive capabilities for coding tasks including writing and reasoning about code. They improve upon previous neural network models of code, such as code2seq or seq2seq, that already demonstrated competitive results when performing tasks such as code summarization and identifying code vulnerabilities. However, these previous… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  37. arXiv:2311.01744  [pdf, other

    cs.CV

    Data-Centric Long-Tailed Image Recognition

    Authors: Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Puhua Chen

    Abstract: In the context of the long-tail scenario, models exhibit a strong demand for high-quality data. Data-centric approaches aim to enhance both the quantity and quality of data to improve model performance. Among these approaches, information augmentation has been progressively introduced as a crucial category. It achieves a balance in model performance by augmenting the richness and quantity of sampl… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 13 pages

  38. arXiv:2310.11481  [pdf, other

    cs.FL cs.AI

    Contracting Tsetlin Machine with Absorbing Automata

    Authors: Bimal Bhattarai, Ole-Christoffer Granmo, Lei Jiao, Per-Arne Andersen, Svein Anders Tunheim, Rishad Shafik, Alex Yakovlev

    Abstract: In this paper, we introduce a sparse Tsetlin Machine (TM) with absorbing Tsetlin Automata (TA) states. In brief, the TA of each clause literal has both an absorbing Exclude- and an absorbing Include state, making the learning scheme absorbing instead of ergodic. When a TA reaches an absorbing state, it will never leave that state again. If the absorbing state is an Exclude state, both the automato… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted to ISTM2023. 7 pages, 8 figures

  39. arXiv:2310.10090  [pdf, other

    cs.LG cs.AI

    Orthogonal Uncertainty Representation of Data Manifold for Robust Long-Tailed Learning

    Authors: Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Lingling Li

    Abstract: In scenarios with long-tailed distributions, the model's ability to identify tail classes is limited due to the under-representation of tail samples. Class rebalancing, information augmentation, and other techniques have been proposed to facilitate models to learn the potential distribution of tail classes. The disadvantage is that these methods generally pursue models with balanced class accuracy… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 10pages,Accepted by ACM MM 2023

  40. arXiv:2310.02005  [pdf, other

    cs.AI

    Generalized Convergence Analysis of Tsetlin Machines: A Probabilistic Approach to Concept Learning

    Authors: Mohamed-Bachir Belaid, Jivitesh Sharma, Lei Jiao, Ole-Christoffer Granmo, Per-Arne Andersen, Anis Yazidi

    Abstract: Tsetlin Machines (TMs) have garnered increasing interest for their ability to learn concepts via propositional formulas and their proven efficiency across various application domains. Despite this, the convergence proof for the TMs, particularly for the AND operator (\emph{conjunction} of literals), in the generalized case (inputs greater than two bits) remains an open problem. This paper aims to… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  41. arXiv:2309.15478  [pdf, other

    cs.CV cs.LG

    The Robust Semantic Segmentation UNCV2023 Challenge Results

    Authors: Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli , et al. (12 additional authors not shown)

    Abstract: This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty q… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures, accepted at ICCV 2023 UNCV workshop

  42. arXiv:2309.13619  [pdf, other

    cs.CV

    Changes-Aware Transformer: Learning Generalized Changes Representation

    Authors: Dan Wang, Licheng Jiao, Jie Chen, Shuyuan Yang, Fang Liu

    Abstract: Difference features obtained by comparing the images of two periods play an indispensable role in the change detection (CD) task. However, a pair of bi-temporal images can exhibit diverse changes, which may cause various difference features. Identifying changed pixels with differ difference features to be the same category is thus a challenge for CD. Most nowadays' methods acquire distinctive diff… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  43. arXiv:2309.06751  [pdf, other

    cs.CV

    Remote Sensing Object Detection Meets Deep Learning: A Meta-review of Challenges and Advances

    Authors: Xiangrong Zhang, Tianyang Zhang, Guanchun Wang, Peng Zhu, Xu Tang, Xiuping Jia, Licheng Jiao

    Abstract: Remote sensing object detection (RSOD), one of the most fundamental and challenging tasks in the remote sensing field, has received longstanding attention. In recent years, deep learning techniques have demonstrated robust feature representation capabilities and led to a big leap in the development of RSOD techniques. In this era of rapid technical evolution, this review aims to present a comprehe… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted with IEEE Geoscience and Remote Sensing Magazine. More than 300 papers relevant to the RSOD filed were reviewed in this survey

  44. arXiv:2309.06315  [pdf, other

    cs.LG cs.AI

    Learning Minimalistic Tsetlin Machine Clauses with Markov Boundary-Guided Pruning

    Authors: Ole-Christoffer Granmo, Per-Arne Andersen, Lei Jiao, Xuan Zhang, Christian Blakely, Tor Tveit

    Abstract: A set of variables is the Markov blanket of a random variable if it contains all the information needed for predicting the variable. If the blanket cannot be reduced without losing useful information, it is called a Markov boundary. Identifying the Markov boundary of a random variable is advantageous because all variables outside the boundary are superfluous. Hence, the Markov boundary provides an… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted to ISTM2023, 8 pages, 6 figures

  45. arXiv:2309.06006  [pdf, ps, other

    cs.CV cs.AI

    SoccerNet 2023 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim , et al. (77 additional authors not shown)

    Abstract: The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, fo… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  46. arXiv:2308.14064  [pdf

    cs.CV

    Multi-model fusion for Aerial Vision and Dialog Navigation based on human attention aids

    Authors: Xinyi Wang, Xuan Cui, Danxu Li, Fang Liu, Licheng Jiao

    Abstract: Drones have been widely used in many areas of our daily lives. It relieves people of the burden of holding a controller all the time and makes drone control easier to use for people with disabilities or occupied hands. However, the control of aerial robots is more complicated compared to normal robots due to factors such as uncontrollable height. Therefore, it is crucial to develop an intelligent… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: 4 pages, 1 figures

  47. arXiv:2308.13884  [pdf, ps, other

    cs.NI

    Location Privacy and Spectrum Efficiency Enhancement in Spectrum Sharing Systems

    Authors: Long Jiao, Yao Ge, Kai Zeng, B. C. Hilburn

    Abstract: In this work, we investigate the benefits of secondary user (SU) network beamforming on improving primary user (PU) location privacy in spectrum sharing systems, where the beamformer in the SU network is designed to suppress the aggregate interference to improve the location privacy of PUs. We consider two problems: improving SU network communication throughput subject to the specified PU location… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

  48. arXiv:2308.08204  [pdf, other

    cs.CL

    MoCoSA: Momentum Contrast for Knowledge Graph Completion with Structure-Augmented Pre-trained Language Models

    Authors: Jiabang He, Liu Jia, Lei Wang, Xiyao Li, Xing Xu

    Abstract: Knowledge Graph Completion (KGC) aims to conduct reasoning on the facts within knowledge graphs and automatically infer missing links. Existing methods can mainly be categorized into structure-based or description-based. On the one hand, structure-based methods effectively represent relational facts in knowledge graphs using entity embeddings. However, they struggle with semantically rich real-wor… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  49. arXiv:2307.16693  [pdf, other

    cs.DB

    AisLSM: Revolutionizing the Compaction with Asynchronous I/Os for LSM-tree

    Authors: Yanpeng Hu, Li Zhu, Lei Jia, Chundong Wang

    Abstract: The log-structured merge tree (LSM-tree) is widely employed to build key-value (KV) stores. LSM-tree organizes multiple levels in memory and on disk. The compaction of LSM-tree, which is used to redeploy KV pairs between on-disk levels in the form of SST files, severely stalls its foreground service. We overhaul and analyze the procedure of compaction. Writing and persisting files with fsyncs for… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 14 pages

  50. arXiv:2307.09042  [pdf

    cs.AI

    Emotional Intelligence of Large Language Models

    Authors: Xuena Wang, Xueting Li, Zi Yin, Yue Wu, Liu Jia

    Abstract: Large Language Models (LLMs) have demonstrated remarkable abilities across numerous disciplines, primarily assessed through tasks in language generation, knowledge utilization, and complex reasoning. However, their alignment with human emotions and values, which is critical for real-world applications, has not been systematically evaluated. Here, we assessed LLMs' Emotional Intelligence (EI), enco… ▽ More

    Submitted 28 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: 36 pages, 5 figures