
Showing 1–50 of 440 results for author: Jiang, R

  1. arXiv:2408.16103  [pdf, other]

    cond-mat.mes-hall

    Orbital magnetoelectric coupling of three dimensional Chern insulators

    Authors: Xin Lu, Renwen Jiang, Jianpeng Liu

    Abstract: Orbital magnetoelectric effect is closely related to the band topology of bulk crystalline insulators. Typical examples include the half quantized Chern-Simons orbital magnetoelectric coupling in three dimensional (3D) axion insulators and topological insulators, which are the hallmarks of their nontrivial bulk band topology. While the Chern-Simons coupling is well defined only for insulators with…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: main text: 5 pages, 1 figure and 5 tables; SI: 15 pages, 5 figures and 2 tables

  2. arXiv:2408.12594  [pdf, other]

    cs.LG

    Non-Homophilic Graph Pre-Training and Prompt Learning

    Authors: Xingtong Yu, Jie Zhang, Yuan Fang, Renhe Jiang

    Abstract: Graphs are ubiquitous for modeling complex relationships between objects across various fields. Graph neural networks (GNNs) have become a mainstream technique for graph-based applications, but their performance heavily relies on abundant labeled data. To reduce labeling requirement, pre-training and prompt learning have become a popular alternative. However, most existing prompt methods do not dif…

    Submitted 30 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

    Comments: Under review

  3. arXiv:2408.09667  [pdf, other]

    cs.CL

    BLADE: Benchmarking Language Model Agents for Data-Driven Science

    Authors: Ken Gu, Ruoxi Shang, Ruien Jiang, Keying Kuang, Richard-John Lin, Donghe Lyu, Yue Mao, Youran Pan, Teng Wu, Jiaqian Yu, Yikun Zhang, Tianmai M. Zhang, Lanyi Zhu, Mike A. Merrill, Jeffrey Heer, Tim Althoff

    Abstract: Data-driven scientific discovery requires the iterative integration of scientific domain knowledge, statistical expertise, and an understanding of data semantics to make nuanced analytical decisions, e.g., about which variables, transformations, and statistical models to consider. LM-based agents equipped with planning, memory, and code execution capabilities have the potential to support data-dri…

    Submitted 20 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

  4. arXiv:2408.06966  [pdf, other]

    cs.LG

    DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs

    Authors: Dongyuan Li, Shiyin Tan, Ying Zhang, Ming Jin, Shirui Pan, Manabu Okumura, Renhe Jiang

    Abstract: Dynamic graph learning aims to uncover evolutionary laws in real-world systems, enabling accurate social recommendation (link prediction) or early detection of cancer cells (classification). Inspired by the success of state space models, e.g., Mamba, for efficiently capturing long-term dependencies in language modeling, we propose DyG-Mamba, a new continuous state space model (SSM) for dynamic gra…

    Submitted 13 August, 2024; originally announced August 2024.

  5. arXiv:2408.05563  [pdf, other]

    cs.NE cs.AI cs.CV

    Impacts of Darwinian Evolution on Pre-trained Deep Neural Networks

    Authors: Guodong Du, Runhua Jiang, Senqiao Yang, Haoyang Li, Wei Chen, Keren Li, Sim Kuan Goh, Ho-Kin Tang

    Abstract: Darwinian evolution of the biological brain is documented through multiple lines of evidence, although the modes of evolutionary changes remain unclear. Drawing inspiration from the evolved neural systems (e.g., visual cortex), deep learning models have demonstrated superior performance in visual tasks, among others. While the success of training deep neural networks has been relying on back-propa…

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  6. arXiv:2408.05109  [pdf, other]

    cs.DB

    A Survey of NL2SQL with Large Language Models: Where are we, and where are we going?

    Authors: Xinyu Liu, Shuyu Shen, Boyan Li, Peixian Ma, Runzhi Jiang, Yuyu Luo, Yuxin Zhang, Ju Fan, Guoliang Li, Nan Tang

    Abstract: Translating users' natural language queries (NL) into SQL queries (i.e., NL2SQL) can significantly reduce barriers to accessing relational databases and support various commercial applications. The performance of NL2SQL has been greatly enhanced with the emergence of Large Language Models (LLMs). In this survey, we provide a comprehensive review of NL2SQL techniques powered by LLMs, covering its e…

    Submitted 9 August, 2024; originally announced August 2024.

  7. arXiv:2408.04570  [pdf, other]

    cs.LG

    Mathematical Programming For Adaptive Experiments

    Authors: Ethan Che, Daniel R. Jiang, Hongseok Namkoong, Jimmy Wang

    Abstract: Adaptive experimentation can significantly improve statistical power, but standard algorithms overlook important practical issues including batched and delayed feedback, personalization, non-stationarity, multiple objectives, and constraints. To address these issues, the current algorithm design paradigm crafts tailored methods for each problem instance. Since it is infeasible to devise novel algo…

    Submitted 8 August, 2024; originally announced August 2024.

  8. arXiv:2408.04531  [pdf, other]

    cs.LG

    AExGym: Benchmarks and Environments for Adaptive Experimentation

    Authors: Jimmy Wang, Ethan Che, Daniel R. Jiang, Hongseok Namkoong

    Abstract: Innovations across science and industry are evaluated using randomized trials (a.k.a. A/B tests). While simple and robust, such static designs are inefficient or infeasible for testing many hypotheses. Adaptive designs can greatly improve statistical power in theory, but they have seen limited adoption due to their fragility in practice. We present a benchmark for adaptive experimentation based on…

    Submitted 8 August, 2024; originally announced August 2024.

  9. arXiv:2408.03841  [pdf, other]

    cs.SE cs.AI

    MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models

    Authors: Yuchen Dong, XiaoXiang Fang, Yuchen Hu, Renshuang Jiang, Zhe Jiang

    Abstract: The application of large language models to facilitate automated software operations and tool generation (SOTG), thus augmenting software productivity, mirrors the early stages of human evolution when the ability to create and use tools accelerated the progress of civilization. These complex tasks require AI to continuously summarize and improve. Current research often overlooks the importance of…

    Submitted 7 August, 2024; originally announced August 2024.

  10. arXiv:2407.16914  [pdf, other]

    math.OC

    Learning to Solve Bilevel Programs with Binary Tender

    Authors: Bo Zhou, Ruiwei Jiang, Siqian Shen

    Abstract: Bilevel programs (BPs) find a wide range of applications in fields such as energy, transportation, and machine learning. As compared to BPs with continuous (linear/convex) optimization problems in both levels, the BPs with discrete decision variables have received much less attention, largely due to the ensuing computational intractability and the incapability of gradient-based algorithms for hand…

    Submitted 23 July, 2024; originally announced July 2024.

  11. arXiv:2407.16725  [pdf, other]

    cs.CV

    Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions

    Authors: Kai Liu, Zhihang Fu, Chao Chen, Sheng Jin, Ze Chen, Mingyuan Tao, Rongxin Jiang, Jieping Ye

    Abstract: The key to OOD detection has two aspects: generalized feature representation and precise category description. Recently, vision-language models such as CLIP provide significant advances in both two issues, but constructing precise category descriptions is still in its infancy due to the absence of unseen categories. This work introduces two hierarchical contexts, namely perceptual context and spur…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted by 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  12. arXiv:2407.16724  [pdf, other]

    cs.CL

    Educating LLMs like Human Students: Structure-aware Injection of Domain Knowledge

    Authors: Kai Liu, Ze Chen, Zhihang Fu, Rongxin Jiang, Fan Zhou, Yaowu Chen, Yue Wu, Jieping Ye

    Abstract: This paper presents a pioneering methodology, termed StructTuning, to efficiently transform foundation Large Language Models (LLMs) into domain specialists. It significantly minimizes the training corpus requirement to a mere 0.3% while achieving an impressive 50% of traditional knowledge injection performance. Our method is inspired by the educational processes for human students, particularly ho…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: N/A

  13. arXiv:2407.16434  [pdf, other]

    cs.CL

    Enhancing LLM's Cognition via Structurization

    Authors: Kai Liu, Zhihang Fu, Chao Chen, Wei Zhang, Rongxin Jiang, Fan Zhou, Yaowu Chen, Yue Wu, Jieping Ye

    Abstract: When reading long-form text, human cognition is complex and structurized. While large language models (LLMs) process input contexts through a causal and sequential perspective, this approach can potentially limit their ability to handle intricate and complex inputs effectively. To enhance LLM's cognition capability, this paper presents a novel concept of context structurization. Specifically, we t…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: N/A

  14. arXiv:2407.16430  [pdf, other]

    cs.CV

    Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution

    Authors: Kai Liu, Zhihang Fu, Sheng Jin, Chao Chen, Ze Chen, Rongxin Jiang, Fan Zhou, Yaowu Chen, Jieping Ye

    Abstract: Detecting and rejecting unknown out-of-distribution (OOD) samples is critical for deployed neural networks to avoid unreliable predictions. In real-world scenarios, however, the efficacy of existing OOD detection methods is often impeded by the inherent imbalance of in-distribution (ID) data, which causes significant performance decline. Through statistical observations, we have identified two comm…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: N/A

  15. arXiv:2407.16424  [pdf, other]

    cs.CV

    ESOD: Efficient Small Object Detection on High-Resolution Images

    Authors: Kai Liu, Zhihang Fu, Sheng Jin, Ze Chen, Fan Zhou, Rongxin Jiang, Yaowu Chen, Jieping Ye

    Abstract: Enlarging input images is a straightforward and effective approach to promote small object detection. However, simple image enlargement is significantly expensive on both computations and GPU memory. In fact, small objects are usually sparsely distributed and locally clustered. Therefore, massive feature extraction computations are wasted on the non-target background area of images. Recent works h…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: N/A

  16. arXiv:2407.16351  [pdf, other]

    cs.HC

    Datasets of Visualization for Machine Learning

    Authors: Can Liu, Ruike Jiang, Shaocong Tan, Jiacheng Yu, Chaofan Yang, Hanning Shao, Xiaoru Yuan

    Abstract: Datasets of visualization play a crucial role in automating data-driven visualization pipelines, serving as the foundation for supervised model training and algorithm benchmarking. In this paper, we survey the literature on visualization datasets and provide a comprehensive overview of existing visualization datasets, including their data types, formats, supported tasks, and openness. We propose a…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 15 pages

  17. arXiv:2407.15842  [pdf, other]

    cs.CV cs.GR

    Artist: Aesthetically Controllable Text-Driven Stylization without Training

    Authors: Ruixiang Jiang, Changwen Chen

    Abstract: Diffusion models entangle content and style generation during the denoising process, leading to undesired content modification when directly applied to stylization tasks. Existing methods struggle to effectively control the diffusion model to meet the aesthetic-level requirements for stylization. In this paper, we introduce Artist, a training-free approach that aesthetically controls the…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: WIP, webpage: https://DiffusionArtist.github.io

  18. arXiv:2407.14618  [pdf, other]

    math.OC cs.LG

    SOREL: A Stochastic Algorithm for Spectral Risks Minimization

    Authors: Yuze Ge, Rujun Jiang

    Abstract: The spectral risk has wide applications in machine learning, especially in real-world decision-making, where people are not only concerned with models' average performance. By assigning different weights to the losses of different sample points, rather than the same weights as in the empirical risk, it allows the model's performance to lie between the average performance and the worst-case perform…

    Submitted 19 July, 2024; originally announced July 2024.

  19. arXiv:2407.08848  [pdf, other]

    cs.RO

    GCS*: Forward Heuristic Search on Implicit Graphs of Convex Sets

    Authors: Shao Yuan Chew Chia, Rebecca H. Jiang, Bernhard Paus Graesdal, Leslie Pack Kaelbling, Russ Tedrake

    Abstract: We consider large-scale, implicit-search-based solutions to the Shortest Path Problems on Graphs of Convex Sets (GCS). We propose GCS*, a forward heuristic search algorithm that generalizes A* search to the GCS setting, where a continuous-valued decision is made at each graph vertex, and constraints across graph edges couple these decisions, influencing costs and feasibility. Such mixed discrete-c…

    Submitted 11 July, 2024; originally announced July 2024.

  20. arXiv:2407.08529  [pdf, other]

    cs.CR

    Enhancing Privacy of Spatiotemporal Federated Learning against Gradient Inversion Attacks

    Authors: Lele Zheng, Yang Cao, Renhe Jiang, Kenjiro Taura, Yulong Shen, Sheng Li, Masatoshi Yoshikawa

    Abstract: Spatiotemporal federated learning has recently raised intensive studies due to its ability to train valuable models with only shared gradients in various location-based services. On the other hand, recent studies have shown that shared gradients may be subject to gradient inversion attacks (GIA) on images or texts. However, so far there has not been any systematic study of the gradient inversion a…

    Submitted 15 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by DASFAA 2024, 16 pages

  21. arXiv:2407.05639  [pdf]

    cs.LG cs.CR

    Deep Learning-based Anomaly Detection and Log Analysis for Computer Networks

    Authors: Shuzhan Wang, Ruxue Jiang, Zhaoqi Wang, Yan Zhou

    Abstract: Computer network anomaly detection and log analysis, as an important topic in the field of network security, has been a key task to ensure network security and system reliability. First, existing network anomaly detection and log analysis methods are often challenged by high-dimensional data and complex network topologies, resulting in unstable performance and high false-positive rates. In additio…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 38 pages

  22. arXiv:2407.03319  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci physics.comp-ph

    `Interaction annealing' to determine effective quantized valence and orbital structure: an illustration with ferro-orbital order in WTe$_2$

    Authors: Ruoshi Jiang, Fangyuan Gu, Wei Ku

    Abstract: Strongly correlated materials are known to display qualitatively distinct emergent behaviors at low energy. Conveniently, the superposition principle of quantum mechanics ensures that, upon absorbing quantum fluctuation, these rich low-energy behaviors can always be effectively described by dressed particles with fully quantized charge, spin, and orbitals structure. Such a powerful and simple desc…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures

  23. arXiv:2407.01846  [pdf, other]

    cs.CV

    Investigating the Segment Anything Foundation Model for Mapping Smallholder Agriculture Field Boundaries Without Training Labels

    Authors: Pratyush Tripathy, Kathy Baylis, Kyle Wu, Jyles Watson, Ruizhe Jiang

    Abstract: Accurate mapping of agricultural field boundaries is crucial for enhancing outcomes like precision agriculture, crop monitoring, and yield estimation. However, extracting these boundaries from satellite images is challenging, especially for smallholder farms and data-scarce environments. This study explores the Segment Anything Model (SAM) to delineate agricultural field boundaries in Bihar, India…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages, 6 main figures, 7 supplementary figures

  24. arXiv:2406.12709  [pdf, other]

    cs.LG cs.AI

    Enhancing Spatio-temporal Quantile Forecasting with Curriculum Learning: Lessons Learned

    Authors: Du Yin, Jinliang Deng, Shuang Ao, Zechen Li, Hao Xue, Arian Prabowo, Renhe Jiang, Xuan Song, Flora Salim

    Abstract: Training models on spatio-temporal (ST) data poses an open problem due to the complicated and diverse nature of the data itself, and it is challenging to ensure the model's performance directly trained on the original ST data. While limiting the variety of training data can make training easier, it can also lead to a lack of knowledge and information for the model, resulting in a decrease in perfo…

    Submitted 18 June, 2024; originally announced June 2024.

  25. arXiv:2406.12208  [pdf, other]

    cs.CL cs.AI cs.CV cs.NE

    Knowledge Fusion By Evolving Weights of Language Models

    Authors: Guodong Du, Jing Li, Hanting Liu, Runhua Jiang, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang

    Abstract: Fine-tuning pre-trained language models, particularly large language models, demands extensive computing resources and can result in varying performance outcomes across different domains and datasets. This paper examines the approach of integrating multiple models from diverse training scenarios into a unified model. This unified model excels across various data domains and exhibits the ability to…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL2024 Findings

  26. arXiv:2406.11191  [pdf, other]

    cs.CL

    A Survey on Human Preference Learning for Large Language Models

    Authors: Ruili Jiang, Kehai Chen, Xuefeng Bai, Zhixuan He, Juntao Li, Muyun Yang, Tiejun Zhao, Liqiang Nie, Min Zhang

    Abstract: The recent surge of versatile large language models (LLMs) largely depends on aligning increasingly capable foundation models with human intentions by preference learning, enhancing LLMs with excellent applicability and effectiveness in a wide range of contexts. Despite the numerous related studies conducted, a perspective on how human preferences are introduced into LLMs remains limited, which ma…

    Submitted 18 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: IEEE copyright statement added (also applied to the former version)

  27. arXiv:2406.04592  [pdf, ps, other]

    math.OC cs.LG stat.ML

    Convergence Analysis of Adaptive Gradient Methods under Refined Smoothness and Noise Assumptions

    Authors: Devyani Maladkar, Ruichen Jiang, Aryan Mokhtari

    Abstract: Adaptive gradient methods are arguably the most successful optimization algorithms for neural network training. While it is well-known that adaptive gradient methods can achieve better dimensional dependence than stochastic gradient descent (SGD) under favorable geometry for stochastic convex optimization, the theoretical justification for their success in stochastic non-convex optimization remain…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 21 pages

  28. arXiv:2406.02349  [pdf, other]

    cs.NE cs.AI cs.CV

    CADE: Cosine Annealing Differential Evolution for Spiking Neural Network

    Authors: Runhua Jiang, Guodong Du, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang

    Abstract: Spiking neural networks (SNNs) have gained prominence for their potential in neuromorphic computing and energy-efficient artificial intelligence, yet optimizing them remains a formidable challenge for gradient-based methods due to their discrete, spike-based computation. This paper attempts to tackle the challenges by introducing Cosine Annealing Differential Evolution (CADE), designed to modulate…

    Submitted 4 June, 2024; originally announced June 2024.

  29. arXiv:2406.02016  [pdf, other]

    math.OC cs.LG stat.ML

    Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization

    Authors: Ruichen Jiang, Ali Kavis, Qiujiang Jin, Sujay Sanghavi, Aryan Mokhtari

    Abstract: We propose adaptive, line search-free second-order methods with optimal rate of convergence for solving convex-concave min-max problems. By means of an adaptive step size, our algorithms feature a simple update rule that requires solving only one linear system per iteration, eliminating the need for line search or backtracking mechanisms. Specifically, we base our algorithms on the optimistic meth…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 33 pages, 2 figures

  30. arXiv:2406.01478  [pdf, other]

    math.OC cs.LG stat.ML

    Stochastic Newton Proximal Extragradient Method

    Authors: Ruichen Jiang, Michał Dereziński, Aryan Mokhtari

    Abstract: Stochastic second-order methods achieve fast local convergence in strongly convex optimization by using noisy Hessian estimates to precondition the gradient. However, these methods typically reach superlinear convergence only when the stochastic Hessian noise diminishes, increasing per-iteration costs over time. Recent work in [arXiv:2204.09266] addressed this with a Hessian averaging scheme that…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 32 pages, 1 figure

  31. arXiv:2405.18322  [pdf, other]

    cs.CV cs.AI

    SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

    Authors: Kejia Yin, Varshanth R. Rao, Ruowei Jiang, Xudong Liu, Parham Aarabi, David B. Lindell

    Abstract: Self-supervised landmark estimation is a challenging task that demands the formation of locally distinct feature representations to identify sparse facial landmarks in the absence of annotated data. To tackle this task, existing state-of-the-art (SOTA) methods (1) extract coarse features from backbones that are trained with instance-level self-supervised learning (SSL) paradigms, which neglect the…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024

  32. arXiv:2405.16075  [pdf, other]

    cs.LG cs.AI

    Continuous Temporal Domain Generalization

    Authors: Zekun Cai, Guangji Bai, Renhe Jiang, Xuan Song, Liang Zhao

    Abstract: Temporal Domain Generalization (TDG) addresses the challenge of training predictive models under temporally varying data distributions. Traditional TDG approaches typically focus on domain data collected at fixed, discrete time intervals, which limits their capability to capture the inherent dynamics within continuous-evolving and irregularly-observed temporal domains. To overcome this, this work…

    Submitted 25 May, 2024; originally announced May 2024.

  33. arXiv:2405.15344  [pdf, other]

    math.NA

    Adaptive Finite Element Method for a Nonlinear Helmholtz Equation with High Wave Number

    Authors: Run Jiang, Haijun Wu, Yifeng Xu, Jun Zou

    Abstract: A nonlinear Helmholtz (NLH) equation with high frequencies and corner singularities is discretized by the linear finite element method (FEM). After deriving some wave-number-explicit stability estimates and the singularity decomposition for the NLH problem, a priori stability and error estimates are established for the FEM on shape regular meshes including the case of locally refined meshes. Then…

    Submitted 27 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  34. arXiv:2405.10800  [pdf, other]

    cs.LG

    Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting

    Authors: Zheng Dong, Renhe Jiang, Haotian Gao, Hangchen Liu, Jinliang Deng, Qingsong Wen, Xuan Song

    Abstract: Spatiotemporal time series forecasting plays a key role in a wide range of real-world applications. While significant progress has been made in this area, fully capturing and leveraging spatiotemporal heterogeneity remains a fundamental challenge. Therefore, we propose a novel Heterogeneity-Informed Meta-Parameter Learning scheme. Specifically, our approach implicitly captures spatiotemporal heter…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD'24 Research Track

  35. arXiv:2405.04976  [pdf, other]

    cs.IT eess.SP

    RF-based Energy Harvesting: Nonlinear Models, Applications and Challenges

    Authors: Ruihong Jiang

    Abstract: So far, various aspects associated with wireless energy harvesting (EH) have been investigated from diverse perspectives, including energy sources and models, usage protocols, energy scheduling and optimization, and EH implementation in different wireless communication systems. However, a comprehensive survey specifically focusing on models of radio frequency (RF)-based EH behaviors has not yet be…

    Submitted 8 May, 2024; originally announced May 2024.

  36. arXiv:2405.04350  [pdf, other]

    math.OC

    Decision-Dependent Uncertainty-Aware Distribution System Planning Under Wildfire Risk

    Authors: Felipe Piancó, Alexandre Moreira, Bruno Fanzeres, Ruiwei Jiang, Chaoyue Zhao, Miguel Heleno

    Abstract: The interaction between power systems and wildfires can be dangerous and costly. Damaged structures, load shedding, and high operational costs are potential consequences when the grid is unprepared. In fact, the operation of distribution grids can be liable for the outbreak of wildfires when extreme weather conditions arise. Within this context, investment planning should consider the impact of op…

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  37. arXiv:2405.03255  [pdf, other]

    cs.LG

    Multi-Modality Spatio-Temporal Forecasting via Self-Supervised Learning

    Authors: Jiewen Deng, Renhe Jiang, Jiaqi Zhang, Xuan Song

    Abstract: Multi-modality spatio-temporal (MoST) data extends spatio-temporal (ST) data by incorporating multiple modalities, which is prevalent in monitoring systems, encompassing diverse traffic demands and air quality assessments. Despite significant strides in ST modeling in recent years, there remains a need to emphasize harnessing the potential of information from different modalities. Robust MoST fore…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024 Main Track

  38. arXiv:2405.01350  [pdf, other]

    cs.LG cs.SI

    Community-Invariant Graph Contrastive Learning

    Authors: Shiyin Tan, Dongyuan Li, Renhe Jiang, Ying Zhang, Manabu Okumura

    Abstract: Graph augmentation has received great attention in recent years for graph contrastive learning (GCL) to learn well-generalized node/graph representations. However, mainstream GCL methods often favor randomly disrupting graphs for augmentation, which shows limited generalization and inevitably leads to the corruption of high-level graph information, i.e., the graph community. Moreover, current know…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by ICML-2024

  39. arXiv:2405.00713  [pdf, ps, other]

    math.AP math.CA

    Some inequalities related to Riesz transform on exterior Lipschitz domains

    Authors: Renjin Jiang, Sibei Yang

    Abstract: Let $n\ge2$ and $\mathcal{L}=-\mathrm{div}(A\nabla\cdot)$ be an elliptic operator on $\mathbb{R}^n$. Given an exterior Lipschitz domain $Ω$, let $\mathcal{L}_D$ and $\mathcal{L}_N$ be the elliptic operators $\mathcal{L}$ on $Ω$ subject to the Dirichlet and the Neumann boundary conditions, respectively. For the Neumann operator, we show that the reverse inequality…

    Submitted 25 April, 2024; originally announced May 2024.

    Comments: 24pp, comments are welcome

  40. arXiv:2405.00334  [pdf, other]

    cs.LG

    A Survey on Deep Active Learning: Recent Advances and New Frontiers

    Authors: Dongyuan Li, Zhen Wang, Yankai Chen, Renhe Jiang, Weiping Ding, Manabu Okumura

    Abstract: Active learning seeks to achieve strong performance with fewer training samples. It does this by iteratively asking an oracle to label new selected samples in a human-in-the-loop manner. This technique has gained increasing popularity due to its broad applicability, yet its survey papers, especially for deep learning-based active learning (DAL), remain scarce. Therefore, we conduct an advanced and…

    Submitted 15 July, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by IEEE Transactions on Neural Networks and Learning Systems

  41. arXiv:2404.16731  [pdf, ps, other]

    math.OC

    Non-asymptotic Global Convergence Analysis of BFGS with the Armijo-Wolfe Line Search

    Authors: Qiujiang Jin, Ruichen Jiang, Aryan Mokhtari

    Abstract: In this paper, we establish the first explicit and non-asymptotic global convergence analysis of the BFGS method when deployed with an inexact line search scheme that satisfies the Armijo-Wolfe conditions. We show that BFGS achieves a global convergence rate of $(1-\frac{1}κ)^k$ for $μ$-strongly convex functions with $L$-Lipschitz gradients, where $κ=\frac{L}μ$ denotes the condition number. Furthe…

    Submitted 25 April, 2024; originally announced April 2024.

  42. arXiv:2404.15597  [pdf, other]

    cs.NE cs.AI cs.LG cs.MA

    GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL

    Authors: Lang Qin, Ziming Wang, Runhao Jiang, Rui Yan, Huajin Tang

    Abstract: Spiking neural networks (SNNs) are widely applied in various fields due to their energy-efficient and fast-inference capabilities. Applying SNNs to reinforcement learning (RL) can significantly reduce the computational resource requirements for agents and improve the algorithm's performance under resource-constrained conditions. However, in current spiking reinforcement learning (SRL) algorithms,…

    Submitted 23 April, 2024; originally announced April 2024.

  43. arXiv:2404.12184  [pdf, other]

    quant-ph

    Boolean Matching Reversible Circuits: Algorithm and Complexity

    Authors: Tian-Fu Chen, Jie-Hong R. Jiang

    Abstract: Boolean matching is an important problem in logic synthesis and verification. Despite being well-studied for conventional Boolean circuits, its treatment for reversible logic circuits remains largely, if not completely, missing. This work provides the first such study. Given two (black-box) reversible logic circuits that are promised to be matchable, we check their equivalences under various input…

    Submitted 18 April, 2024; originally announced April 2024.

  44. arXiv:2404.10947  [pdf, other]

    cs.CV

    Residual Connections Harm Abstract Feature Learning in Masked Autoencoders

    Authors: Xiao Zhang, Ruoxi Jiang, William Gao, Rebecca Willett, Michael Maire

    Abstract: We demonstrate that adding a weighting factor to decay the strength of identity shortcuts within residual networks substantially improves semantic feature learning in the state-of-the-art self-supervised masked autoencoding (MAE) paradigm. Our modification to the identity shortcuts within a VIT-B/16 backbone of an MAE boosts linear probing accuracy on ImageNet from 67.8% to 72.7%. This significant…

    Submitted 20 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  45. arXiv:2404.09679  [pdf, other]

    cs.DC cs.LG

    AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

    Authors: Youshao Xiao, Lin Ju, Zhenglei Zhou, Siyuan Li, Zhaoxin Huan, Dalong Zhang, Rujie Jiang, Lin Wang, Xiaolu Zhang, Lei Liang, Jun Zhou

    Abstract: Many distributed training techniques like Parameter Server and AllReduce have been proposed to take advantage of the increasingly large data and rich features. However, stragglers frequently occur in distributed training due to resource contention and hardware heterogeneity, which significantly hampers the training efficiency. Previous works only address part of the stragglers and could not adapti…

    Submitted 15 April, 2024; originally announced April 2024.

  46. arXiv:2404.02613  [pdf, other]

    hep-ex hep-ph

    Searches for multi-Z boson productions and anomalous gauge boson couplings at a muon collider

    Authors: Ruobing Jiang, Chuqiao Jiang, Alim Ruzi, Tianyi Yang, Yong Ban, Qiang Li

    Abstract: Multi-boson productions can be exploited as novel probes either for standard model precision tests or new physics searches, and have become one of those popular topics in the ongoing LHC experiments, and in future collider studies, including those for electron-positron and muon-muon colliders. Here we focus on two examples, i.e., ZZZ direct productions through $μ^{+}μ^{-}$ annihilation at a 1 TeV…

    Submitted 28 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This paper has been submitted to Chinese Physics C

  47. arXiv:2404.01267  [pdf, other]

    math.OC

    Non-asymptotic Global Convergence Rates of BFGS with Exact Line Search

    Authors: Qiujiang Jin, Ruichen Jiang, Aryan Mokhtari

    Abstract: In this paper, we explore the non-asymptotic global convergence rates of the Broyden-Fletcher-Goldfarb-Shanno (BFGS) method implemented with exact line search. Notably, due to Dixon's equivalence result, our findings are also applicable to other quasi-Newton methods in the convex Broyden class employing exact line search, such as the Davidon-Fletcher-Powell (DFP) method. Specifically, we focus on…

    Submitted 1 April, 2024; originally announced April 2024.

  48. arXiv:2403.19172  [pdf, ps, other]

    quant-ph

    Quantum circuit design for mixture and preparation of arbitrary pure and mixed quantum states

    Authors: Bo-Hung Chen, Dah-Wei Chiou, Jie-Hong Roland Jiang

    Abstract: This paper addresses the challenge of preparing arbitrary mixed quantum states, an area that has not been extensively studied compared to pure states. Two circuit design methods are presented: one via a mixture of pure states and the other via purification. A novel strategy utilizing the Cholesky decomposition is proposed to improve both computational efficiency during preprocessing and circuit ef…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 25 pages, 8 figures

  49. arXiv:2403.14769  [pdf, other]

    stat.AP

    Fractional Tackles: Leveraging Player Tracking Data for Within-Play Tackling Evaluation in American Football

    Authors: Quang Nguyen, Ruitong Jiang, Meg Ellingwood, Ronald Yurko

    Abstract: Tackling is a fundamental defensive move in American football, with the main purpose of stopping the forward motion of the ball-carrier. However, current tackling metrics are manually recorded outcomes that are inherently flawed due to their discrete and subjective nature. Using player tracking data, we present a novel framework for assessing tackling contribution in a continuous and objective man…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 16 pages, 6 figures, 2 tables

  50. arXiv:2403.12574  [pdf, other]

    cs.CV cs.AI cs.NE

    EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks

    Authors: Ziming Wang, Ziling Wang, Huaning Li, Lang Qin, Runhao Jiang, De Ma, Huajin Tang

    Abstract: Event cameras, with their high dynamic range and temporal resolution, are ideally suited for object detection, especially under scenarios with motion blur and challenging lighting conditions. However, while most existing approaches prioritize optimizing spatiotemporal representations with advanced detection backbones and early aggregation functions, the crucial issue of adaptive event sampling rem…

    Submitted 24 August, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by ECCV2024