Zum Hauptinhalt springen

Showing 1–50 of 248 results for author: Ye, T

.
  1. arXiv:2408.16293  [pdf, other

    cs.CL cs.AI cs.LG

    Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

    Authors: Tian Ye, Zicheng Xu, Yuanzhi Li, Zeyuan Allen-Zhu

    Abstract: Language models have demonstrated remarkable performance in solving reasoning tasks; however, even the strongest models still occasionally make reasoning mistakes. Recently, there has been active research aimed at improving reasoning accuracy, particularly by using pretrained language models to "self-correct" their mistakes via multi-round prompting. In this paper, we follow this line of work but… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: text overlap with arXiv:2407.20311

  2. arXiv:2408.12541  [pdf, other

    stat.ME

    Clarifying the Role of the Mantel-Haenszel Risk Difference Estimator in Randomized Clinical Trials

    Authors: Xiaoyu Qiu, Yuhan Qian, Jaehwan Yi, Jinqiu Wang, Yu Du, Yanyao Yi, Ting Ye

    Abstract: The Mantel-Haenszel (MH) risk difference estimator, commonly used in randomized clinical trials for binary outcomes, calculates a weighted average of stratum-specific risk difference estimators. Traditionally, this method requires the stringent assumption that risk differences are homogeneous across strata, also known as the common risk difference assumption. In our article, we relax this assumpti… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  3. Timeline and Boundary Guided Diffusion Network for Video Shadow Detection

    Authors: Haipeng Zhou, Honqiu Wang, Tian Ye, Zhaohu Xing, Jun Ma, Ping Li, Qiong Wang, Lei Zhu

    Abstract: Video Shadow Detection (VSD) aims to detect the shadow masks with frame sequence. Existing works suffer from inefficient temporal learning. Moreover, few works address the VSD problem by considering the characteristic (i.e., boundary) of shadow. Motivated by this, we propose a Timeline and Boundary Guided Diffusion (TBGDiff) network for VSD where we take account of the past-future temporal guidanc… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: ACM MM2024

  4. arXiv:2408.09204  [pdf, other

    astro-ph.HE

    A Series of (Net) Spin-down Glitches in PSR J1522-5735: Insights from the Vortex Creep and Vortex Bending Models

    Authors: S. Q. Zhou, W. T. Ye, M. Y. Ge, E. GügercinoğLu, S. J. Zheng, C. Yu, J. P. Yuan, J. Zhang

    Abstract: Through a detailed timing analysis of $\textit{Fermi}$-LAT data, the rotational behavior of the $γ$-ray pulsar PSR J1522$-$5735 was tracked from August 2008 (MJD 54692) to January 2024 (MJD 60320). During this 15.4-year period, two over-recovery glitches and four anti-glitches were identified, marking a rare occurrence in rotation-powered pulsars (RPPs). The magnitudes of these (net) spin-down gli… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 8 pages, 3 figures, 2 tables, submitted to ApJL. Comments Welcome!

  5. arXiv:2408.07032  [pdf

    quant-ph cs.CR

    QIris: Quantum Implementation of Rainbow Table Attacks

    Authors: Lee Jun Quan, Tan Jia Ye, Goh Geok Ling, Vivek Balachandran

    Abstract: This paper explores the use of Grover's Algorithm in the classical rainbow table, uncovering the potential of integrating quantum computing techniques with conventional cryptographic methods to develop a Quantum Rainbow Table Proof-of-Concept. This leverages on Quantum concepts and algorithms which includes the principle of qubit superposition, entanglement and teleportation, coupled with Grover's… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  6. arXiv:2407.20311  [pdf, other

    cs.AI cs.CL cs.LG

    Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

    Authors: Tian Ye, Zicheng Xu, Yuanzhi Li, Zeyuan Allen-Zhu

    Abstract: Recent advances in language models have demonstrated their capability to solve mathematical reasoning problems, achieving near-perfect accuracy on grade-school level math benchmarks like GSM8K. In this paper, we formally study how language models solve these problems. We design a series of controlled experiments to address several fundamental questions: (1) Can language models truly develop reason… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: video appeared in ICML 2024 tutorial

  7. arXiv:2407.18035  [pdf, other

    cs.CV cs.AI cs.CL

    RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models

    Authors: Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Sixiang Chen, Tian Ye, Renjing Pei, Kaiwen Zhou, Fenglong Song, Lei Zhu

    Abstract: Natural images captured by mobile devices often suffer from multiple types of degradation, such as noise, blur, and low light. Traditional image restoration methods require manual selection of specific tasks, algorithms, and execution sequences, which is time-consuming and may yield suboptimal results. All-in-one models, though capable of handling multiple tasks, typically support only a limited r… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  8. arXiv:2407.14900  [pdf, other

    cs.CV

    AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

    Authors: Yunlong Lin, Tian Ye, Sixiang Chen, Zhenqi Fu, Yingying Wang, Wenhao Chai, Zhaohu Xing, Lei Zhu, Xinghao Ding

    Abstract: Existing low-light image enhancement (LIE) methods have achieved noteworthy success in solving synthetic distortions, yet they often fall short in practical applications. The limitations arise from two inherent challenges in real-world LIE: 1) the collection of distorted/clean image pairs is often impractical and sometimes even unavailable, and 2) accurately modeling complex degradations presents… ▽ More

    Submitted 23 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

    Comments: 21 pages, 9 figures

  9. arXiv:2407.10923  [pdf, other

    cs.CV

    OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting

    Authors: Penglei Gao, Kai Yao, Tiandi Ye, Steven Wang, Yuan Yao, Xiaofeng Wang

    Abstract: In this paper, we tackle the recently popular topic of generating 360-degree images given the conventional narrow field of view (NFoV) images that could be taken from a single camera or cellphone. This task aims to predict the reasonable and consistent surroundings from the NFoV images. Existing methods for feature extraction and fusion, often built with transformer-based architectures, incur subs… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  10. arXiv:2407.09480  [pdf, other

    econ.GN cs.AI cs.CL

    Using Artificial Intelligence to Unlock Crowdfunding Success for Small Businesses

    Authors: Teng Ye, Jingnan Zheng, Junhui Jin, Jingyi Qiu, Wei Ai, Qiaozhu Mei

    Abstract: While small businesses are increasingly turning to online crowdfunding platforms for essential funding, over 40% of these campaigns may fail to raise any money, especially those from low socio-economic areas. We utilize the latest advancements in AI technology to identify crucial factors that influence the success of crowdfunding campaigns and to improve their fundraising outcomes by strategically… ▽ More

    Submitted 24 April, 2024; originally announced July 2024.

  11. arXiv:2407.01191  [pdf, other

    cs.RO cs.AI cs.CV

    MARS: Multimodal Active Robotic Sensing for Articulated Characterization

    Authors: Hongliang Zeng, Ping Zhang, Chengjiong Wu, Jiahua Wang, Tingyu Ye, Fang Li

    Abstract: Precise perception of articulated objects is vital for empowering service robots. Recent studies mainly focus on point cloud, a single-modal approach, often neglecting vital texture and lighting details and assuming ideal conditions like optimal viewpoints, unrepresentative of real-world scenarios. To address these limitations, we introduce MARS, a novel framework for articulated object characteri… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  12. arXiv:2406.17342  [pdf, other

    cs.CV cs.AI

    Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds

    Authors: Hongliang Zeng, Ping Zhang, Fang Li, Jiahua Wang, Tingyu Ye, Pengteng Guo

    Abstract: Representation and generative learning, as reconstruction-based methods, have demonstrated their potential for mutual reinforcement across various domains. In the field of point cloud processing, although existing studies have adopted training strategies from generative models to enhance representational capabilities, these methods are limited by their inability to genuinely generate 3D shapes. To… ▽ More

    Submitted 15 August, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  13. arXiv:2406.15478  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Impact of the Top SiO2 Interlayer Thickness on Memory Window of Si Channel FeFET with TiN/SiO2/Hf0.5Zr0.5O2/SiOx/Si (MIFIS) Gate Structure

    Authors: Tao Hu, Xianzhou Shao, Mingkai Bai, Xinpei Jia, Saifei Dai, Xiaoqing Sun, Runhao Han, Jia Yang, Xiaoyu Ke, Fengbin Tian, Shuai Yang, Junshuai Chai, Hao Xu, Xiaolei Wang, Wenwu Wang, Tianchun Ye

    Abstract: We study the impact of top SiO2 interlayer thickness on the memory window (MW) of Si channel ferroelectric field-effect transistor (FeFET) with TiN/SiO2/Hf0.5Zr0.5O2/SiOx/Si (MIFIS) gate structure. We find that the MW increases with the increasing thickness of the top SiO2 interlayer, and such an increase exhibits a two-stage linear dependence. The physical origin is the presence of the different… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 6 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2404.15825

  14. arXiv:2406.11935  [pdf, other

    cs.PL cs.AI cs.SE

    Iterative or Innovative? A Problem-Oriented Perspective for Code Optimization

    Authors: Tong Ye, Tengfei Ma, Lingfei Wu, Xuhong Zhang, Shouling Ji, Wenhai Wang

    Abstract: Large language models (LLMs) have demonstrated strong capabilities in solving a wide range of programming tasks. However, LLMs have rarely been explored for code optimization. In this paper, we explore code optimization with a focus on performance enhancement, specifically aiming to optimize code for minimal execution time. The recently proposed first PIE dataset for performance optimization const… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  15. arXiv:2406.11247  [pdf, other

    cs.CV

    STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft

    Authors: Zhonghan Zhao, Wenhao Chai, Xuan Wang, Ke Ma, Kewei Chen, Dongxu Guo, Tian Ye, Yanting Zhang, Hongwei Wang, Gaoang Wang

    Abstract: Building an embodied agent system with a large language model (LLM) as its core is a promising direction. Due to the significant costs and uncontrollable factors associated with deploying and training such agents in the real world, we have decided to begin our exploration within the Minecraft environment. Our STEVE Series agents can complete basic tasks in a virtual environment and more challengin… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Embodied AI Workshop

  16. arXiv:2406.07801  [pdf, other

    cs.CL cs.SD eess.AS

    PolySpeech: Exploring Unified Multitask Speech Models for Competitiveness with Single-task Models

    Authors: Runyan Yang, Huibao Yang, Xiqing Zhang, Tiantian Ye, Ying Liu, Yingying Gao, Shilei Zhang, Chao Deng, Junlan Feng

    Abstract: Recently, there have been attempts to integrate various speech processing tasks into a unified model. However, few previous works directly demonstrated that joint optimization of diverse tasks in multitask speech models has positive influence on the performance of individual tasks. In this paper we present a multitask speech model -- PolySpeech, which supports speech recognition, speech synthesis,… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures

  17. arXiv:2405.16133  [pdf, other

    cs.SE cs.AI

    Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting

    Authors: Tong Ye, Yangkai Du, Tengfei Ma, Lingfei Wu, Xuhong Zhang, Shouling Ji, Wenhai Wang

    Abstract: Large Language Models (LLMs) have exhibited remarkable proficiency in generating code. However, the misuse of LLM-generated (Synthetic) code has prompted concerns within both educational and industrial domains, highlighting the imperative need for the development of synthetic code detectors. Existing methods for detecting LLM-generated content are primarily tailored for general text and often stru… ▽ More

    Submitted 29 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: Previously submitted to EMNLP2023

  18. arXiv:2405.16046  [pdf, ps, other

    stat.ME stat.AP

    Sensitivity Analysis for Attributable Effects in Case$^2$ Studies

    Authors: Kan Chen, Ting Ye, Dylan S. Small

    Abstract: The case$^2$ study, also referred to as the case-case study design, is a valuable approach for conducting inference for treatment effects. Unlike traditional case-control studies, the case$^2$ design compares treatment in two types of cases with the same disease. A key quantity of interest is the attributable effect, which is the number of cases of disease among treated units which are caused by t… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 25 pages, 2 Figures, 4 Tables

  19. arXiv:2405.05811  [pdf, other

    cs.CV

    Parallel Cross Strip Attention Network for Single Image Dehazing

    Authors: Lihan Tong, Yun Liu, Tian Ye, Weijia Li, Liyuan Chen, Erkang Chen

    Abstract: The objective of single image dehazing is to restore hazy images and produce clear, high-quality visuals. Traditional convolutional models struggle with long-range dependencies due to their limited receptive field size. While Transformers excel at capturing such dependencies, their quadratic computational complexity in relation to feature map resolution makes them less suitable for pixel-to-pixel… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 10 pages , 4 figures, CTISC'24

    Report number: C052

  20. arXiv:2404.17747  [pdf, other

    cs.CV

    MMA-UNet: A Multi-Modal Asymmetric UNet Architecture for Infrared and Visible Image Fusion

    Authors: Jingxue Huang, Xilai Li, Tianshu Tan, Xiaosong Li, Tao Ye

    Abstract: Multi-modal image fusion (MMIF) maps useful information from various modalities into the same representation space, thereby producing an informative fused image. However, the existing fusion algorithms tend to symmetrically fuse the multi-modal images, causing the loss of shallow information or bias towards a single modality in certain regions of the fusion results. In this study, we analyzed the… ▽ More

    Submitted 11 July, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  21. arXiv:2404.17176  [pdf, other

    cs.CV

    MovieChat+: Question-aware Sparse Memory for Long Video Question Answering

    Authors: Enxin Song, Wenhao Chai, Tian Ye, Jenq-Neng Hwang, Xi Li, Gaoang Wang

    Abstract: Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific pre-defined vision tasks. Yet, existing methods either employ complex spatial-temporal modules or rely heavily on additional perception models to extract temporal features for video understanding, and they only perform well on short videos. For long… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  22. arXiv:2404.15825  [pdf

    physics.app-ph

    Impact of Top SiO2 interlayer Thickness on Memory Window of Si Channel FeFET with TiN/SiO2/Hf0.5Zr0.5O2/SiOx/Si (MIFIS) Gate Structure

    Authors: Tao Hu, Xianzhou Shao, Mingkai Bai, Xinpei Jia, Saifei Dai, Xiaoqing Sun, Runhao Han, Jia Yang, Xiaoyu Ke, Fengbin Tian, Shuai Yang, Junshuai Chai, Hao Xu, Xiaolei Wang, Wenwu Wang, Tianchun Ye

    Abstract: We study the impact of top SiO2 interlayer thickness on memory window of Si channel FeFET with TiN/SiO2/Hf0.5Zr0.5O2/SiOx/Si (MIFIS) gate structure. The memory window increases with thicker top SiO2. We realize the memory window of 6.3 V for 3.4 nm top SiO2. Moreover, we find that the endurance characteristic degrades with increasing the initial memory window.

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 4 page 7 figures

  23. arXiv:2404.14248  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

    Authors: Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin , et al. (87 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlig… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 Challenge Report

  24. arXiv:2404.03225  [pdf, other

    cs.CV cs.LG

    FACTUAL: A Novel Framework for Contrastive Learning Based Robust SAR Image Classification

    Authors: Xu Wang, Tian Ye, Rajgopal Kannan, Viktor Prasanna

    Abstract: Deep Learning (DL) Models for Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR), while delivering improved performance, have been shown to be quite vulnerable to adversarial attacks. Existing works improve robustness by training models on adversarial samples. However, by focusing mostly on attacks that manipulate images randomly, they neglect the real-world feasibility of such atta… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 2024 IEEE Radar Conference

  25. arXiv:2403.18493  [pdf, other

    cs.CV

    VersaT2I: Improving Text-to-Image Models with Versatile Reward

    Authors: Jianshu Guo, Wenhao Chai, Jie Deng, Hsiang-Wei Huang, Tian Ye, Yichen Xu, Jiawei Zhang, Jenq-Neng Hwang, Gaoang Wang

    Abstract: Recent text-to-image (T2I) models have benefited from large-scale and high-quality data, demonstrating impressive performance. However, these T2I models still struggle to produce images that are aesthetically pleasing, geometrically accurate, faithful to text, and of good low-level quality. We present VersaT2I, a versatile training framework that can boost the performance with multiple rewards of… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  26. arXiv:2403.18318  [pdf, other

    cs.CV

    Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks

    Authors: Tian Ye, Rajgopal Kannan, Viktor Prasanna, Carl Busart

    Abstract: Adversarial attacks have demonstrated the vulnerability of Machine Learning (ML) image classifiers in Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR) systems. An adversarial attack can deceive the classifier into making incorrect predictions by perturbing the input SAR images, for example, with a few scatterers attached to the on-ground objects. Therefore, it is critical to devel… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  27. arXiv:2403.13260  [pdf, other

    stat.ME

    A Bayesian Approach for Selecting Relevant External Data (BASE): Application to a study of Long-Term Outcomes in a Hemophilia Gene Therapy Trial

    Authors: Tianyu Pan, Xiang Zhang, Weining Shen, Ting Ye

    Abstract: Gene therapies aim to address the root causes of diseases, particularly those stemming from rare genetic defects that can be life-threatening or severely debilitating. While there has been notable progress in the development of gene therapies in recent years, understanding their long-term effectiveness remains challenging due to a lack of data on long-term outcomes, especially during the early sta… ▽ More

    Submitted 9 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  28. arXiv:2403.12056  [pdf, other

    cs.CV physics.optics

    Enhancing Digital Hologram Reconstruction Using Reverse-Attention Loss for Untrained Physics-Driven Deep Learning Models with Uncertain Distance

    Authors: Xiwen Chen, Hao Wang, Zhao Zhang, Zhenmin Li, Huayu Li, Tong Ye, Abolfazl Razi

    Abstract: Untrained Physics-based Deep Learning (DL) methods for digital holography have gained significant attention due to their benefits, such as not requiring an annotated training dataset, and providing interpretability since utilizing the governing laws of hologram formation. However, they are sensitive to the hard-to-obtain precise object distance from the imaging plane, posing the… ▽ More

    Submitted 10 January, 2024; originally announced March 2024.

  29. arXiv:2403.11822  [pdf

    cond-mat.mtrl-sci

    Discovery of self-assembled Ru/Si heterostructures with unique periodic nanostripe patterns boosting hydrogen evolution

    Authors: Weizheng Cai, Xinyi He, Tian-Nan Ye, Xinmeng Hu, Chuanlong Liu, Masato Sasase, Masaaki Kitano, Toshio Kamiya, Hideo Hosono, Jiazhen Wu

    Abstract: Two-dimensional (2D) heterostructuring is a versatile methodology for designing nanoarchitecture catalytic systems that allow for reconstruction and modulation of interfaces and electronic structures. However, catalysts with such structures are extremely scarce due to limited synthetic strategies. Here, we report a highly ordered 2D Ru/Si nano-heterostructures (RSHS) by acid etching of the LaRuSi… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

  30. arXiv:2403.10340  [pdf, other

    cs.CV cs.RO

    Thermal-NeRF: Neural Radiance Fields from an Infrared Camera

    Authors: Tianxiang Ye, Qi Wu, Junyuan Deng, Guoqing Liu, Liu Liu, Songpengcheng Xia, Liang Pang, Wenxian Yu, Ling Pei

    Abstract: In recent years, Neural Radiance Fields (NeRFs) have demonstrated significant potential in encoding highly-detailed 3D geometry and environmental appearance, positioning themselves as a promising alternative to traditional explicit representation for 3D scene reconstruction. However, the predominant reliance on RGB imaging presupposes ideal lighting conditions: a premise frequently unmet in roboti… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  31. arXiv:2403.08282  [pdf, other

    cs.CV

    Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation

    Authors: Zhonghan Zhao, Kewei Chen, Dongxu Guo, Wenhao Chai, Tian Ye, Yanting Zhang, Gaoang Wang

    Abstract: Due to the dynamic and unpredictable open-world setting, navigating complex environments in Minecraft poses significant challenges for multi-agent systems. Agents must interact with the environment and coordinate their actions with other agents to achieve common objectives. However, traditional approaches often struggle to efficiently manage inter-agent communication and task distribution, crucial… ▽ More

    Submitted 18 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 Workshop on LLM Agents

  32. arXiv:2403.06144  [pdf, other

    cs.CY

    Simulating Family Conversations using LLMs: Demonstration of Parenting Styles

    Authors: Frank Tian-fang Ye, Xiaozi Gao

    Abstract: This study presents a framework for conducting psychological and linguistic research through simulated conversations using large language models (LLMs). The proposed methodology offers significant advantages, particularly for simulating human interactions involving potential unethical language or behaviors that would be impermissible in traditional experiments with human participants. As a demonst… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  33. arXiv:2403.01198  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Organic solvent boosts charge storage and charging dynamics of conductive MOF supercapacitors

    Authors: Ming Chen, Taizheng Wu, Liang Niu, Ting Ye, Wenlei Dai, Liang Zeng, Alexei A. Kornyshev, Zhenxiang Wang, Zhou Liu, Guang Feng

    Abstract: Conductive metal-organic frameworks (c-MOFs) and ionic liquids (ILs) have emerged as auspicious combinations for high-performance supercapacitors. However, the nanoconfinement from c-MOFs and high viscosity of ILs slow down the charging process. This hindrance can, however, be resolved by adding solvent. Here, we performed constant-potential molecular simulations to scrutinize the solvent impact o… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  34. arXiv:2402.16043  [pdf, other

    cs.CR cs.SE

    LuaTaint: A Static Taint Analysis System for Web Interface Framework Vulnerability of IoT Devices

    Authors: Jiahui Xiang, Wenhai Wang, Tong Ye, Peiyu Liu

    Abstract: IoT devices are currently facing continuous malicious attacks due to their widespread use. Among these IoT devices, web vulnerabilities are also widely exploited because of their inherent characteristics, such as improper permission controls and insecure interfaces. Recently, the embedded system web interface framework has become highly diverse, and specific vulnerabilities can arise if developers… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  35. arXiv:2402.14226  [pdf, other

    astro-ph.HE hep-ph

    Broadband noise and quasi-periodic oscillation characteristics of the X-ray pulsar RX J0440.9+4431

    Authors: P. P. Li, L. Tao, R. C. Ma, M. Y. Ge, Q. C. Zhao, S. J. Zhao, L. Zhang, Q. C. Bu, L. D. Kong, Y. L. Tuo, L. Ji, S. Zhang, J. L. Qu, S. N. Zhang, Y. Huang, X. Ma, W. T. Ye, Q. C. Shui

    Abstract: We present a comprehensive timing analysis on the Be/X-ray binary pulsar RX J0440.9+4431 using observations from \textit{NICER} and \textit{Insight}-HXMT during the 2022--2023 outburst. The power density spectrum (PDS) of RX J0440.9+4431 exhibits typical aperiodic variability in X-ray flux across a wide frequency range. During a super-critical accretion state, we detect quasi-periodic oscillations… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 8 pages, 7 figures. Accepted in MNRAS

  36. arXiv:2402.00307  [pdf, other

    stat.ME

    Debiased Multivariable Mendelian Randomization

    Authors: Yinxiang Wu, Hyunseung Kang, Ting Ye

    Abstract: Multivariable Mendelian randomization (MVMR) uses genetic variants as instrumental variables to infer the direct effect of multiple exposures on an outcome. Compared to univariable Mendelian randomization, MVMR is less prone to horizontal pleiotropy and enables estimation of the direct effect of each exposure on the outcome. However, MVMR faces greater challenges with weak instruments -- genetic v… ▽ More

    Submitted 23 March, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  37. arXiv:2401.15992  [pdf, other

    astro-ph.HE

    Pulsed Iron line Emission from the First Galactic Ultraluminous X-ray Pulsar Swift J0243.6+6124

    Authors: Y. X. Xiao, Y. J. Xu, M. Y. Ge, F. J. Lu, S. N. Zhang, S. Zhang, L. Tao, J. L. Qu, P. J. Wang, L. D. Kong, Y. L. Tuo, Y. You, S. J. Zhao, J. Q. Peng, Y. F. Du, Y. H. Zhang, W. T. Ye

    Abstract: We report the phase-resolved spectral results of the first Galactic Pulsating Ultra-Luminous X-ray source (PULX) Swift J0243.6+6124, modeling at its 2017-2018 outburst peak using data collected by the Hard X-ray Modulation Telescope (Insight-HXMT). The broad energy coverage of Insight-HXMT allows us to obtain more accurate spectral continuum to reduce the coupling of broad iron line profiles with… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  38. arXiv:2401.13560  [pdf, other

    cs.CV

    SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

    Authors: Zhaohu Xing, Tian Ye, Yijun Yang, Guang Liu, Lei Zhu

    Abstract: The Transformer architecture has shown a remarkable ability in modeling global relationships. However, it poses a significant computational challenge when processing high-dimensional medical images. This hinders its development and widespread adoption in this task. Mamba, as a State Space Model (SSM), recently emerged as a notable manner for long-range dependencies in sequential modeling, excellin… ▽ More

    Submitted 25 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Code has released

  39. arXiv:2401.03692  [pdf, other

    math.OC cs.LG

    Boosting Column Generation with Graph Neural Networks for Joint Rider Trip Planning and Crew Shift Scheduling

    Authors: Jiawei Lu, Tinghan Ye, Wenbo Chen, Pascal Van Hentenryck

    Abstract: Optimizing service schedules is pivotal to the reliable, efficient, and inclusive on-demand mobility. This pressing challenge is further exacerbated by the increasing needs of an aging population, the over-subscription of existing services, and the lack of effective solution methods. This study addresses the intricacies of service scheduling, by jointly optimizing rider trip planning and crew sche… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  40. arXiv:2312.16777  [pdf, other

    nucl-th

    Multiphoton fusion of light nuclei in intense laser fields

    Authors: Binbing Wu, Zhengfeng Fan, Difa Ye, Tao Ye, Congzhang Gao, Chengxin Yu, Xuefeng Xu, Cunbo Zhang, Jie Liu

    Abstract: We investigate the fusion cross sections of light nuclei in the presence of linearly polarized intense laser fields. By combining the Coulomb-Volkov solutions with the complex spherical square-well nuclear potential, we derive an explicit formulation of the multiphoton cross section in a self-consistent manner. Our analysis is specifically focused on deuteron-triton (DT) and proton-boron (p… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  41. arXiv:2312.13470  [pdf, ps, other

    cs.MM cs.NI

    Coffee: Cost-Effective Edge Caching for 360 Degree Live Video Streaming

    Authors: Chen Li, Tingwei Ye, Tongyu Zong, Liyang Sun, Houwei Cao, Yong Liu

    Abstract: While live 360 degree video streaming delivers immersive viewing experience, it poses significant bandwidth and latency challenges for content delivery networks. Edge servers are expected to play an important role in facilitating live streaming of 360 degree videos. In this paper, we propose a novel predictive edge caching algorithm (Coffee) for live 360 degree video that employ collaborative FoV… ▽ More

    Submitted 27 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  42. arXiv:2312.08874  [pdf, other

    cs.CV

    Agent Attention: On the Integration of Softmax and Linear Attention

    Authors: Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang

    Abstract: The attention module is the key component in Transformers. While the global attention mechanism offers high expressiveness, its excessive computational cost restricts its applicability in various scenarios. In this paper, we propose a novel attention paradigm, Agent Attention, to strike a favorable balance between computational efficiency and representation power. Specifically, the Agent Attention… ▽ More

    Submitted 15 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ECCV 2024

  43. arXiv:2312.08606  [pdf, other

    cs.CV

    VQCNIR: Clearer Night Image Restoration with Vector-Quantized Codebook

    Authors: Wenbin Zou, Hongxia Gao, Tian Ye, Liang Chen, Weipeng Yang, Shasha Huang, Hongsheng Chen, Sixiang Chen

    Abstract: Night photography often struggles with challenges like low light and blurring, stemming from dark environments and prolonged exposures. Current methods either disregard priors and directly fitting end-to-end networks, leading to inconsistent illumination, or rely on unreliable handcrafted priors to constrain the network, thereby bringing the greater error to the final result. We believe in the str… ▽ More

    Submitted 16 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: This paper is accepted by AAAI2024

  44. Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition

    Authors: Jacob Fein-Ashley, Tian Ye, Rajgopal Kannan, Viktor Prasanna, Carl Busart

    Abstract: Synthetic Aperture Radar SAR Automatic Target Recognition ATR is a key technique of remote-sensing image recognition which can be supported by deep neural networks The existing works of SAR ATR mostly focus on improving the accuracy of the target recognition while ignoring the systems performance in terms of speed and storage which is critical to real-world applications of SAR ATR For decision-mak… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 6 Pages

  45. arXiv:2312.03775  [pdf, other

    cs.CV

    FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability

    Authors: Linze Li, Sunqi Fan, Hengjun Pu, Zhaodong Bing, Yao Tang, Tianzhu Ye, Tong Yang, Liangyu Chen, Jiajun Liang

    Abstract: Over recent years, diffusion models have facilitated significant advancements in video generation. Yet, the creation of face-related videos still confronts issues such as low facial fidelity, lack of frame consistency, limited editability and uncontrollable human poses. To address these challenges, we introduce a facial animation generation method that enhances both face identity fidelity and edit… ▽ More

    Submitted 20 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  46. arXiv:2312.02912  [pdf, other

    cs.CV

    Realistic Scatterer Based Adversarial Attacks on SAR Image Classifiers

    Authors: Tian Ye, Rajgopal Kannan, Viktor Prasanna, Carl Busart, Lance Kaplan

    Abstract: Adversarial attacks have highlighted the vulnerability of classifiers based on machine learning for Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR) tasks. An adversarial attack perturbs SAR images of on-ground targets such that the classifiers are misled into making incorrect predictions. However, many existing attacking techniques rely on arbitrary manipulation of SAR images whi… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  47. arXiv:2311.18173  [pdf

    eess.IV cs.CE cs.CV

    Quantification of cardiac capillarization in single-immunostained myocardial slices using weakly supervised instance segmentation

    Authors: Zhao Zhang, Xiwen Chen, William Richardson, Bruce Z. Gao, Abolfazl Razi, Tong Ye

    Abstract: Decreased myocardial capillary density has been reported as an important histopathological feature associated with various heart disorders. Quantitative assessment of cardiac capillarization typically involves double immunostaining of cardiomyocytes (CMs) and capillaries in myocardial slices. In contrast, single immunostaining of basement membrane components is a straightforward approach to simult… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  48. arXiv:2311.15209  [pdf, other

    cs.AI

    See and Think: Embodied Agent in Virtual Environment

    Authors: Zhonghan Zhao, Wenhao Chai, Xuan Wang, Li Boyi, Shengyu Hao, Shidong Cao, Tian Ye, Gaoang Wang

    Abstract: Large language models (LLMs) have achieved impressive pro-gress on several open-world tasks. Recently, using LLMs to build embodied agents has been a hotspot. This paper proposes STEVE, a comprehensive and visionary embodied agent in the Minecraft virtual environment. STEVE comprises three key components: vision perception, language instruction, and code action. Vision perception involves interpre… ▽ More

    Submitted 9 July, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: ECCV 2024. First three authors contribute equally to this work. Project Website https://rese1f.github.io/STEVE/

  49. arXiv:2311.12358  [pdf, other

    cs.LG cs.DC

    Federated Learning via Consensus Mechanism on Heterogeneous Data: A New Perspective on Convergence

    Authors: Shu Zheng, Tiandi Ye, Xiang Li, Ming Gao

    Abstract: Federated learning (FL) on heterogeneous data (non-IID data) has recently received great attention. Most existing methods focus on studying the convergence guarantees for the global objective. While these methods can guarantee the decrease of the global objective in each communication round, they fail to ensure risk decrease for each client. In this paper, to address the problem,we propose FedCOME… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  50. arXiv:2311.11638  [pdf, other

    cs.CV

    Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model

    Authors: Chunming He, Chengyu Fang, Yulun Zhang, Tian Ye, Kai Li, Longxiang Tang, Zhenhua Guo, Xiu Li, Sina Farsiu

    Abstract: Illumination degradation image restoration (IDIR) techniques aim to improve the visibility of degraded images and mitigate the adverse effects of deteriorated illumination. Among these algorithms, diffusion model (DM)-based methods have shown promising performance but are often burdened by heavy computational demands and pixel misalignment issues when predicting the image-level distribution. To ta… ▽ More

    Submitted 9 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 20 pages, 11 figures, 11 tables