Skip to main content

Showing 1–50 of 204 results for author: Jiang, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08848  [pdf, other

    cs.RO

    GCS*: Forward Heuristic Search on Implicit Graphs of Convex Sets

    Authors: Shao Yuan Chew Chia, Rebecca H. Jiang, Bernhard Paus Graesdal, Leslie Pack Kaelbling, Russ Tedrake

    Abstract: We consider large-scale, implicit-search-based solutions to the Shortest Path Problems on Graphs of Convex Sets (GCS). We propose GCS*, a forward heuristic search algorithm that generalizes A* search to the GCS setting, where a continuous-valued decision is made at each graph vertex, and constraints across graph edges couple these decisions, influencing costs and feasibility. Such mixed discrete-c… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.08529  [pdf, other

    cs.CR

    Enhancing Privacy of Spatiotemporal Federated Learning against Gradient Inversion Attacks

    Authors: Lele Zheng, Yang Cao, Renhe Jiang, Kenjiro Taura, Yulong Shen, Sheng Li, Masatoshi Yoshikawa

    Abstract: Spatiotemporal federated learning has recently raised intensive studies due to its ability to train valuable models with only shared gradients in various location-based services. On the other hand, recent studies have shown that shared gradients may be subject to gradient inversion attacks (GIA) on images or texts. However, so far there has not been any systematic study of the gradient inversion a… ▽ More

    Submitted 15 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by DASFAA 2024, 16 pages

  3. arXiv:2407.05639  [pdf

    cs.LG cs.CR

    Deep Learning-based Anomaly Detection and Log Analysis for Computer Networks

    Authors: Shuzhan Wang, Ruxue Jiang, Zhaoqi Wang, Yan Zhou

    Abstract: Computer network anomaly detection and log analysis, as an important topic in the field of network security, has been a key task to ensure network security and system reliability. First, existing network anomaly detection and log analysis methods are often challenged by high-dimensional data and complex network topologies, resulting in unstable performance and high false-positive rates. In additio… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 38 pages

  4. arXiv:2407.01846  [pdf, other

    cs.CV

    Investigating the Segment Anything Foundation Model for Mapping Smallholder Agriculture Field Boundaries Without Training Labels

    Authors: Pratyush Tripathy, Kathy Baylis, Kyle Wu, Jyles Watson, Ruizhe Jiang

    Abstract: Accurate mapping of agricultural field boundaries is crucial for enhancing outcomes like precision agriculture, crop monitoring, and yield estimation. However, extracting these boundaries from satellite images is challenging, especially for smallholder farms and data-scarce environments. This study explores the Segment Anything Model (SAM) to delineate agricultural field boundaries in Bihar, India… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages, 6 main figures, 7 supplementary figures

  5. arXiv:2406.12709  [pdf, other

    cs.LG cs.AI

    Enhancing Spatio-temporal Quantile Forecasting with Curriculum Learning: Lessons Learned

    Authors: Du Yin, Jinliang Deng, Shuang Ao, Zechen Li, Hao Xue, Arian Prabowo, Renhe Jiang, Xuan Song, Flora Salim

    Abstract: Training models on spatio-temporal (ST) data poses an open problem due to the complicated and diverse nature of the data itself, and it is challenging to ensure the model's performance directly trained on the original ST data. While limiting the variety of training data can make training easier, it can also lead to a lack of knowledge and information for the model, resulting in a decrease in perfo… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.12208  [pdf, other

    cs.CL cs.AI cs.CV cs.NE

    Knowledge Fusion By Evolving Weights of Language Models

    Authors: Guodong Du, Jing Li, Hanting Liu, Runhua Jiang, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang

    Abstract: Fine-tuning pre-trained language models, particularly large language models, demands extensive computing resources and can result in varying performance outcomes across different domains and datasets. This paper examines the approach of integrating multiple models from diverse training scenarios into a unified model. This unified model excels across various data domains and exhibits the ability to… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL2024 Findings

  7. arXiv:2406.11191  [pdf, other

    cs.CL

    A Survey on Human Preference Learning for Large Language Models

    Authors: Ruili Jiang, Kehai Chen, Xuefeng Bai, Zhixuan He, Juntao Li, Muyun Yang, Tiejun Zhao, Liqiang Nie, Min Zhang

    Abstract: The recent surge of versatile large language models (LLMs) largely depends on aligning increasingly capable foundation models with human intentions by preference learning, enhancing LLMs with excellent applicability and effectiveness in a wide range of contexts. Despite the numerous related studies conducted, a perspective on how human preferences are introduced into LLMs remains limited, which ma… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: IEEE copyright statement added (also applied to the former version)

  8. arXiv:2406.04592  [pdf, ps, other

    math.OC cs.LG stat.ML

    Convergence Analysis of Adaptive Gradient Methods under Refined Smoothness and Noise Assumptions

    Authors: Devyani Maladkar, Ruichen Jiang, Aryan Mokhtari

    Abstract: Adaptive gradient methods are arguably the most successful optimization algorithms for neural network training. While it is well-known that adaptive gradient methods can achieve better dimensional dependence than stochastic gradient descent (SGD) under favorable geometry for stochastic convex optimization, the theoretical justification for their success in stochastic non-convex optimization remain… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 21 pages

  9. arXiv:2406.02349  [pdf, other

    cs.NE cs.AI cs.CV

    CADE: Cosine Annealing Differential Evolution for Spiking Neural Network

    Authors: Runhua Jiang, Guodong Du, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang

    Abstract: Spiking neural networks (SNNs) have gained prominence for their potential in neuromorphic computing and energy-efficient artificial intelligence, yet optimizing them remains a formidable challenge for gradient-based methods due to their discrete, spike-based computation. This paper attempts to tackle the challenges by introducing Cosine Annealing Differential Evolution (CADE), designed to modulate… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  10. arXiv:2406.02016  [pdf, other

    math.OC cs.LG stat.ML

    Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization

    Authors: Ruichen Jiang, Ali Kavis, Qiujiang Jin, Sujay Sanghavi, Aryan Mokhtari

    Abstract: We propose adaptive, line search-free second-order methods with optimal rate of convergence for solving convex-concave min-max problems. By means of an adaptive step size, our algorithms feature a simple update rule that requires solving only one linear system per iteration, eliminating the need for line search or backtracking mechanisms. Specifically, we base our algorithms on the optimistic meth… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 33 pages, 2 figures

  11. arXiv:2406.01478  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic Newton Proximal Extragradient Method

    Authors: Ruichen Jiang, Michał Dereziński, Aryan Mokhtari

    Abstract: Stochastic second-order methods achieve fast local convergence in strongly convex optimization by using noisy Hessian estimates to precondition the gradient. However, these methods typically reach superlinear convergence only when the stochastic Hessian noise diminishes, increasing per-iteration costs over time. Recent work in [arXiv:2204.09266] addressed this with a Hessian averaging scheme that… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 32 pages, 1 figure

  12. arXiv:2405.18322  [pdf, other

    cs.CV cs.AI

    SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

    Authors: Kejia Yin, Varshanth R. Rao, Ruowei Jiang, Xudong Liu, Parham Aarabi, David B. Lindell

    Abstract: Self-supervised landmark estimation is a challenging task that demands the formation of locally distinct feature representations to identify sparse facial landmarks in the absence of annotated data. To tackle this task, existing state-of-the-art (SOTA) methods (1) extract coarse features from backbones that are trained with instance-level self-supervised learning (SSL) paradigms, which neglect the… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024

  13. arXiv:2405.16075  [pdf, other

    cs.LG cs.AI

    Continuous Temporal Domain Generalization

    Authors: Zekun Cai, Guangji Bai, Renhe Jiang, Xuan Song, Liang Zhao

    Abstract: Temporal Domain Generalization (TDG) addresses the challenge of training predictive models under temporally varying data distributions. Traditional TDG approaches typically focus on domain data collected at fixed, discrete time intervals, which limits their capability to capture the inherent dynamics within continuous-evolving and irregularly-observed temporal domains. To overcome this, this work… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  14. arXiv:2405.10800  [pdf, other

    cs.LG

    Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting

    Authors: Zheng Dong, Renhe Jiang, Haotian Gao, Hangchen Liu, Jinliang Deng, Qingsong Wen, Xuan Song

    Abstract: Spatiotemporal time series forecasting plays a key role in a wide range of real-world applications. While significant progress has been made in this area, fully capturing and leveraging spatiotemporal heterogeneity remains a fundamental challenge. Therefore, we propose a novel Heterogeneity-Informed Meta-Parameter Learning scheme. Specifically, our approach implicitly captures spatiotemporal heter… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD'24 Research Track

  15. arXiv:2405.04976  [pdf, other

    cs.IT eess.SP

    RF-based Energy Harvesting: Nonlinear Models, Applications and Challenges

    Authors: Ruihong Jiang

    Abstract: So far, various aspects associated with wireless energy harvesting (EH) have been investigated from diverse perspectives, including energy sources and models, usage protocols, energy scheduling and optimization, and EH implementation in different wireless communication systems. However, a comprehensive survey specifically focusing on models of radio frequency (RF)-based EH behaviors has not yet be… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  16. arXiv:2405.03255  [pdf, other

    cs.LG

    Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning

    Authors: Jiewen Deng, Renhe Jiang, Jiaqi Zhang, Xuan Song

    Abstract: Multi-modality spatio-temporal (MoST) data extends spatio-temporal (ST) data by incorporating multiple modalities, which is prevalent in monitoring systems, encompassing diverse traffic demands and air quality assessments. Despite significant strides in ST modeling in recent years, there remains a need to emphasize harnessing the potential of information from different modalities. Robust MoST fore… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024 Main Track

  17. arXiv:2405.01350  [pdf, other

    cs.LG cs.SI

    Community-Invariant Graph Contrastive Learning

    Authors: Shiyin Tan, Dongyuan Li, Renhe Jiang, Ying Zhang, Manabu Okumura

    Abstract: Graph augmentation has received great attention in recent years for graph contrastive learning (GCL) to learn well-generalized node/graph representations. However, mainstream GCL methods often favor randomly disrupting graphs for augmentation, which shows limited generalization and inevitably leads to the corruption of high-level graph information, i.e., the graph community. Moreover, current know… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by ICML-2024

  18. arXiv:2405.00334  [pdf, other

    cs.LG

    A Survey on Deep Active Learning: Recent Advances and New Frontiers

    Authors: Dongyuan Li, Zhen Wang, Yankai Chen, Renhe Jiang, Weiping Ding, Manabu Okumura

    Abstract: Active learning seeks to achieve strong performance with fewer training samples. It does this by iteratively asking an oracle to label new selected samples in a human-in-the-loop manner. This technique has gained increasing popularity due to its broad applicability, yet its survey papers, especially for deep learning-based active learning (DAL), remain scarce. Therefore, we conduct an advanced and… ▽ More

    Submitted 15 July, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by IEEE Transactions on Neural Networks and Learning Systems

  19. arXiv:2404.15597  [pdf, other

    cs.NE cs.AI cs.LG cs.MA

    GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL

    Authors: Lang Qin, Ziming Wang, Runhao Jiang, Rui Yan, Huajin Tang

    Abstract: Spiking neural networks (SNNs) are widely applied in various fields due to their energy-efficient and fast-inference capabilities. Applying SNNs to reinforcement learning (RL) can significantly reduce the computational resource requirements for agents and improve the algorithm's performance under resource-constrained conditions. However, in current spiking reinforcement learning (SRL) algorithms,… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  20. arXiv:2404.10947  [pdf, other

    cs.CV

    Residual Connections Harm Abstract Feature Learning in Masked Autoencoders

    Authors: Xiao Zhang, Ruoxi Jiang, William Gao, Rebecca Willett, Michael Maire

    Abstract: We demonstrate that adding a weighting factor to decay the strength of identity shortcuts within residual networks substantially improves semantic feature learning in the state-of-the-art self-supervised masked autoencoding (MAE) paradigm. Our modification to the identity shortcuts within a VIT-B/16 backbone of an MAE boosts linear probing accuracy on ImageNet from 67.8% to 72.7%. This significant… ▽ More

    Submitted 20 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  21. arXiv:2404.09679  [pdf, other

    cs.DC cs.LG

    AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

    Authors: Youshao Xiao, Lin Ju, Zhenglei Zhou, Siyuan Li, Zhaoxin Huan, Dalong Zhang, Rujie Jiang, Lin Wang, Xiaolu Zhang, Lei Liang, Jun Zhou

    Abstract: Many distributed training techniques like Parameter Server and AllReduce have been proposed to take advantage of the increasingly large data and rich features. However, stragglers frequently occur in distributed training due to resource contention and hardware heterogeneity, which significantly hampers the training efficiency. Previous works only address part of the stragglers and could not adapti… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  22. arXiv:2403.12574  [pdf, other

    cs.CV cs.AI cs.NE

    EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks

    Authors: Ziming Wang, Ziling Wang, Huaning Li, Lang Qin, Runhao Jiang, De Ma, Huajin Tang

    Abstract: Event cameras, with their high dynamic range and temporal resolution, are ideally suited for object detection, especially under scenarios with motion blur and challenging lighting conditions. However, while most existing approaches prioritize optimizing spatiotemporal representations with advanced detection backbones and early aggregation functions, the crucial issue of adaptive event sampling rem… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  23. arXiv:2403.11087  [pdf, other

    cs.LG cs.SI

    Incorporating Higher-order Structural Information for Graph Clustering

    Authors: Qiankun Li, Haobing Liu, Ruobing Jiang, Tingting Wang

    Abstract: Clustering holds profound significance in data mining. In recent years, graph convolutional network (GCN) has emerged as a powerful tool for deep clustering, integrating both graph structural information and node attributes. However, most existing methods ignore the higher-order structural information of the graph. Evidently, nodes within the same cluster can establish distant connections. Besides… ▽ More

    Submitted 19 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Journal ref: DASFAA 2024

  24. arXiv:2403.10568  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    MoPE: Parameter-Efficient and Scalable Multimodal Fusion via Mixture of Prompt Experts

    Authors: Ruixiang Jiang, Lingbo Liu, Changwen Chen

    Abstract: Prompt-tuning has demonstrated parameter-efficiency in fusing unimodal foundation models for multimodal tasks. However, its limited adaptivity and expressiveness lead to suboptimal performance when compared with other tuning methods. In this paper, we address this issue by disentangling the vanilla prompts to adaptively capture dataset-level and instance-level features. Building upon this disentan… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Extended version of arxiv:2312.03734

  25. arXiv:2403.05886  [pdf, other

    cs.CV

    Generalizing to Out-of-Sample Degradations via Model Reprogramming

    Authors: Runhua Jiang, Yahong Han

    Abstract: Existing image restoration models are typically designed for specific tasks and struggle to generalize to out-of-sample degradations not encountered during training. While zero-shot methods can address this limitation by fine-tuning model parameters on testing samples, their effectiveness relies on predefined natural priors and physical models of specific degradations. Nevertheless, determining ou… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  26. arXiv:2403.02566  [pdf, other

    eess.IV cs.CV

    Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning

    Authors: Zhaoxin Fan, Runmin Jiang, Junhao Wu, Xin Huang, Tianyang Wang, Heng Huang, Min Xu

    Abstract: 3D medical image segmentation is a challenging task with crucial implications for disease diagnosis and treatment planning. Recent advances in deep learning have significantly enhanced fully supervised medical image segmentation. However, this approach heavily relies on labor-intensive and time-consuming fully annotated ground-truth labels, particularly for 3D volumes. To overcome this limitation,… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  27. arXiv:2403.01636  [pdf, other

    stat.ML cs.LG

    Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

    Authors: Ziping Xu, Zifan Xu, Runxuan Jiang, Peter Stone, Ambuj Tewari

    Abstract: Multitask Reinforcement Learning (MTRL) approaches have gained increasing attention for its wide applications in many important Reinforcement Learning (RL) tasks. However, while recent advancements in MTRL theory have focused on the improved statistical efficiency by assuming a shared structure across tasks, exploration--a crucial aspect of RL--has been largely overlooked. This paper addresses thi… ▽ More

    Submitted 5 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  28. arXiv:2402.19004  [pdf, other

    cs.CV eess.IV

    RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation

    Authors: Jie Zhang, Xubing Yang, Rui Jiang, Wei Shao, Li Zhang

    Abstract: The development of high-resolution remote sensing satellites has provided great convenience for research work related to remote sensing. Segmentation and extraction of specific targets are essential tasks when facing the vast and complex remote sensing images. Recently, the introduction of Segment Anything Model (SAM) provides a universal pre-training model for image segmentation tasks. While the… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 12 pages, 11 figures

  29. Label Informed Contrastive Pretraining for Node Importance Estimation on Knowledge Graphs

    Authors: Tianyu Zhang, Chengbin Hou, Rui Jiang, Xuegong Zhang, Chenghu Zhou, Ke Tang, Hairong Lv

    Abstract: Node Importance Estimation (NIE) is a task of inferring importance scores of the nodes in a graph. Due to the availability of richer data and knowledge, recent research interests of NIE have been dedicating to knowledge graphs for predicting future or missing node importance scores. Existing state-of-the-art NIE methods train the model by available labels, and they consider every interested node e… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE TNNLS

  30. arXiv:2402.17732  [pdf, other

    math.ST cs.LG stat.ML

    Batched Nonparametric Contextual Bandits

    Authors: Rong Jiang, Cong Ma

    Abstract: We study nonparametric contextual bandits under batch constraints, where the expected reward for each action is modeled as a smooth function of covariates, and the policy updates are made at the end of each batch of observations. We establish a minimax regret lower bound for this setting and propose a novel batch learning algorithm that achieves the optimal regret (up to logarithmic factors). In e… ▽ More

    Submitted 10 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Add lower bound when grid is adaptively chosen; add results on adaptivity to margin parameter

  31. arXiv:2402.16190  [pdf

    cs.CE cond-mat.mtrl-sci

    Accurate predictions of keyhole depths using machine learning-aided simulations

    Authors: Jiahui Zhang, Runbo Jiang, Kangming Li, Pengyu Chen, Xiao Shang, Zhiying Liu, Jason Hattrick-Simpers, Brian J. Simonds, Qianglong Wei, Hongze Wang, Tao Sun, Anthony D. Rollett, Yu Zou

    Abstract: The keyhole phenomenon is widely observed in laser materials processing, including laser welding, remelting, cladding, drilling, and additive manufacturing. Keyhole-induced defects, primarily pores, dramatically affect the performance of final products, impeding the broad use of these laser-based technologies. The formation of these pores is typically associated with the dynamic behavior of the ke… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  32. arXiv:2402.14744  [pdf, other

    cs.AI cs.CL cs.CY cs.LG

    Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation

    Authors: Jiawei Wang, Renhe Jiang, Chuang Yang, Zengqing Wu, Makoto Onizuka, Ryosuke Shibasaki, Noboru Koshizuka, Chuan Xiao

    Abstract: This paper introduces a novel approach using Large Language Models (LLMs) integrated into an agent framework for flexible and effective personal mobility generation. LLMs overcome the limitations of previous models by effectively processing semantic data and offering versatility in modeling various tasks. Our approach addresses three research questions: aligning LLMs with real-world urban mobility… ▽ More

    Submitted 23 May, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Source codes are available at https://github.com/Wangjw6/LLMob/

  33. arXiv:2402.11764  [pdf, other

    cs.CL cs.AI cs.CY

    ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

    Authors: Pengrui Han, Rafal Kocielnik, Adhithya Saravanan, Roy Jiang, Or Sharir, Anima Anandkumar

    Abstract: Large Language models (LLMs), while powerful, exhibit harmful social biases. Debiasing is often challenging due to computational costs, data constraints, and potential degradation of multi-task language capabilities. This work introduces a novel approach utilizing ChatGPT to generate synthetic training data, aiming to enhance the debiasing of LLMs. We propose two strategies: Targeted Prompting, wh… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024 Workshop on Language Technology for Equality, Diversity, Inclusion (LT-EDI-2024)

    MSC Class: 68T50 ACM Class: I.2.7; K.4.1

  34. arXiv:2402.08097  [pdf, ps, other

    math.OC cs.LG stat.ML

    An Accelerated Gradient Method for Convex Smooth Simple Bilevel Optimization

    Authors: Jincheng Cao, Ruichen Jiang, Erfan Yazdandoost Hamedani, Aryan Mokhtari

    Abstract: In this paper, we focus on simple bilevel optimization problems, where we minimize a convex smooth objective function over the optimal solution set of another convex smooth constrained optimization problem. We present a novel bilevel optimization method that locally approximates the solution set of the lower-level problem using a cutting plane approach and employs an accelerated gradient-based upd… ▽ More

    Submitted 31 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  35. arXiv:2402.06673  [pdf, other

    cs.AI

    Advancing Explainable AI Toward Human-Like Intelligence: Forging the Path to Artificial Brain

    Authors: Yongchen Zhou, Richard Jiang

    Abstract: The intersection of Artificial Intelligence (AI) and neuroscience in Explainable AI (XAI) is pivotal for enhancing transparency and interpretability in complex decision-making processes. This paper explores the evolution of XAI methodologies, ranging from feature-based to human-centric approaches, and delves into their applications in diverse domains, including healthcare and finance. The challeng… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  36. arXiv:2401.10402  [pdf, other

    cs.CV

    Reconstructing the Invisible: Video Frame Restoration through Siamese Masked Conditional Variational Autoencoder

    Authors: Yongchen Zhou, Richard Jiang

    Abstract: In the domain of computer vision, the restoration of missing information in video frames is a critical challenge, particularly in applications such as autonomous driving and surveillance systems. This paper introduces the Siamese Masked Conditional Variational Autoencoder (SiamMCVAE), leveraging a siamese architecture with twin encoders based on vision transformers. This innovative design enhances… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  37. arXiv:2401.09475  [pdf, other

    cs.CV cs.LG

    Triamese-ViT: A 3D-Aware Method for Robust Brain Age Estimation from MRIs

    Authors: Zhaonian Zhang, Richard Jiang

    Abstract: The integration of machine learning in medicine has significantly improved diagnostic precision, particularly in the interpretation of complex structures like the human brain. Diagnosing challenging conditions such as Alzheimer's disease has prompted the development of brain age estimation techniques. These methods often leverage three-dimensional Magnetic Resonance Imaging (MRI) scans, with recen… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  38. arXiv:2401.04570  [pdf, other

    eess.IV cs.CV

    An Automatic Cascaded Model for Hemorrhagic Stroke Segmentation and Hemorrhagic Volume Estimation

    Authors: Weijin Xu, Zhuang Sha, Huihua Yang, Rongcai Jiang, Zhanying Li, Wentao Liu, Ruisheng Su

    Abstract: Hemorrhagic Stroke (HS) has a rapid onset and is a serious condition that poses a great health threat. Promptly and accurately delineating the bleeding region and estimating the volume of bleeding in Computer Tomography (CT) images can assist clinicians in treatment planning, leading to improved treatment outcomes for patients. In this paper, a cascaded 3D model is constructed based on UNet to per… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted by SWITCH2023: Stroke Workshop on Imaging and Treatment CHallenges, a workshop at MICCAI 2023

  39. arXiv:2401.03058  [pdf, other

    math.OC cs.LG stat.ML

    Krylov Cubic Regularized Newton: A Subspace Second-Order Method with Dimension-Free Convergence Rate

    Authors: Ruichen Jiang, Parameswaran Raman, Shoham Sabach, Aryan Mokhtari, Mingyi Hong, Volkan Cevher

    Abstract: Second-order optimization methods, such as cubic regularized Newton methods, are known for their rapid convergence rates; nevertheless, they become impractical in high-dimensional problems due to their substantial memory requirements and computational costs. One promising approach is to execute second-order updates within a lower-dimensional subspace, giving rise to subspace second-order methods.… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 27 pages, 2 figures

  40. arXiv:2312.15993  [pdf

    cs.AI cs.RO eess.SY

    Adaptive Kalman-based hybrid car following strategy using TD3 and CACC

    Authors: Yuqi Zheng, Ruidong Yan, Bin Jia, Rui Jiang, Adriana TAPUS, Xiaojing Chen, Shiteng Zheng, Ying Shang

    Abstract: In autonomous driving, the hybrid strategy of deep reinforcement learning and cooperative adaptive cruise control (CACC) can fully utilize the advantages of the two algorithms and significantly improve the performance of car following. However, it is challenging for the traditional hybrid strategy based on fixed coefficients to adapt to mixed traffic flow scenarios, which may decrease the performa… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 32pages,13figures

  41. arXiv:2312.13875  [pdf, other

    stat.ML cs.LG stat.ME

    Best Arm Identification in Batched Multi-armed Bandit Problems

    Authors: Shengyu Cao, Simai He, Ruoqing Jiang, Jin Xu, Hongsong Yuan

    Abstract: Recently multi-armed bandit problem arises in many real-life scenarios where arms must be sampled in batches, due to limited time the agent can wait for the feedback. Such applications include biological experimentation and online marketing. The problem is further complicated when the number of arms is large and the number of batches is small. We consider pure exploration in a batched multi-armed… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  42. arXiv:2312.10065  [pdf, other

    cs.CY cs.AI

    Exploring Social Bias in Downstream Applications of Text-to-Image Foundation Models

    Authors: Adhithya Prakash Saravanan, Rafal Kocielnik, Roy Jiang, Pengrui Han, Anima Anandkumar

    Abstract: Text-to-image diffusion models have been adopted into key commercial workflows, such as art generation and image editing. Characterising the implicit social biases they exhibit, such as gender and racial stereotypes, is a necessary first step in avoiding discriminatory outcomes. While existing studies on social bias focus on image generation, the biases exhibited in alternate applications of diffu… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    ACM Class: F.2.2; I.2.7

  43. Multiple Instance Learning for Uplift Modeling

    Authors: Yao Zhao, Haipeng Zhang, Shiwei Lyu, Ruiying Jiang, Jinjie Gu, Guannan Zhang

    Abstract: Uplift modeling is widely used in performance marketing to estimate effects of promotion campaigns (e.g., increase of customer retention rate). Since it is impossible to observe outcomes of a recipient in treatment (e.g., receiving a certain promotion) and control (e.g., without promotion) groups simultaneously (i.e., counter-factual), uplift models are mainly trained on instances of treatment and… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: short paper of CIKM22(full version)

    Journal ref: Proceedings of the 31st ACM International Conference on Information and Knowledge Management (2022) 4727-4731

  44. arXiv:2312.03734  [pdf, other

    cs.CL cs.AI

    Conditional Prompt Tuning for Multimodal Fusion

    Authors: Ruixiang Jiang, Lingbo Liu, Changwen Chen

    Abstract: We show that the representation of one modality can effectively guide the prompting of another modality for parameter-efficient multimodal fusion. Specifically, we first encode one modality and use its representation as a prior to conditionally prompt all frozen layers of the other modality. This is achieved by disentangling the vanilla prompt vectors into three types of specialized prompts that a… ▽ More

    Submitted 28 November, 2023; originally announced December 2023.

    Comments: under review

  45. arXiv:2312.00516  [pdf, other

    cs.LG

    Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting

    Authors: Haotian Gao, Renhe Jiang, Zheng Dong, Jinliang Deng, Yuxin Ma, Xuan Song

    Abstract: Spatiotemporal forecasting techniques are significant for various domains such as transportation, energy, and weather. Accurate prediction of spatiotemporal series remains challenging due to the complex spatiotemporal heterogeneity. In particular, current end-to-end models are limited by input length and thus often fall into spatiotemporal mirage, i.e., similar input time series followed by dissim… ▽ More

    Submitted 28 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted at IJCAI-2024 Main Track

  46. arXiv:2311.16191  [pdf, other

    cs.LG cs.AI

    Learning Multi-Pattern Normalities in the Frequency Domain for Efficient Time Series Anomaly Detection

    Authors: Feiyi Chen, Yingying zhang, Zhen Qin, Lunting Fan, Renhe Jiang, Yuxuan Liang, Qingsong Wen, Shuiguang Deng

    Abstract: Anomaly detection significantly enhances the robustness of cloud systems. While neural network-based methods have recently demonstrated strong advantages, they encounter practical challenges in cloud environments: the contradiction between the impracticality of maintaining a unique model for each service and the limited ability to deal with diverse normal patterns by a unified model, as well as is… ▽ More

    Submitted 18 March, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE 40th International Conference on Data Engineering (ICDE 2024)

  47. arXiv:2311.05139  [pdf, other

    cs.LG

    On neural and dimensional collapse in supervised and unsupervised contrastive learning with hard negative sampling

    Authors: Ruijie Jiang, Thuan Nguyen, Shuchin Aeron, Prakash Ishwar

    Abstract: For a widely-studied data model and general loss and sample-hardening functions we prove that the Supervised Contrastive Learning (SCL), Hard-SCL (HSCL), and Unsupervised Contrastive Learning (UCL) risks are minimized by representations that exhibit Neural Collapse (NC), i.e., the class means form an Equianglular Tight Frame (ETF) and data from the same class are mapped to the same representation.… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  48. Explainable Artificial Intelligence (XAI) 2.0: A Manifesto of Open Challenges and Interdisciplinary Research Directions

    Authors: Luca Longo, Mario Brcic, Federico Cabitza, Jaesik Choi, Roberto Confalonieri, Javier Del Ser, Riccardo Guidotti, Yoichi Hayashi, Francisco Herrera, Andreas Holzinger, Richard Jiang, Hassan Khosravi, Freddy Lecue, Gianclaudio Malgieri, Andrés Páez, Wojciech Samek, Johannes Schneider, Timo Speith, Simone Stumpf

    Abstract: As systems based on opaque Artificial Intelligence (AI) continue to flourish in diverse real-world applications, understanding these black box models has become paramount. In response, Explainable AI (XAI) has emerged as a field of research with practical and ethical benefits across various domains. This paper not only highlights the advancements in XAI and its application in real-world scenarios… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    ACM Class: F.2.0; H.1.2; I.2; I.2.6; K.4; K.5

    Journal ref: Information Fusion 2024

  49. arXiv:2310.18803  [pdf, other

    cs.LG

    Weakly Coupled Deep Q-Networks

    Authors: Ibrahim El Shar, Daniel R. Jiang

    Abstract: We propose weakly coupled deep Q-networks (WCDQN), a novel deep reinforcement learning algorithm that enhances performance in a class of structured problems called weakly coupled Markov decision processes (WCMDP). WCMDPs consist of multiple independent subproblems connected by an action space constraint, which is a structural property that frequently emerges in practice. Despite this appealing str… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: To appear in proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  50. arXiv:2310.18425  [pdf, other

    cs.RO

    Parallel-Jaw Gripper and Grasp Co-Optimization for Sets of Planar Objects

    Authors: Rebecca H. Jiang, Neel Doshi, Ravi Gondhalekar, Alberto Rodriguez

    Abstract: We propose a framework for optimizing a planar parallel-jaw gripper for use with multiple objects. While optimizing general-purpose grippers and contact locations for grasps are both well studied, co-optimizing grasps and the gripper geometry to execute them receives less attention. As such, our framework synthesizes grippers optimized to stably grasp sets of polygonal objects. Given a fixed numbe… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE IROS conference