
Showing 1–50 of 501 results for author: Xiong, J

  1. arXiv:2408.17030  [pdf, ps, other]

    math.OC

    Zero-sum stochastic linear-quadratic Stackelberg differential games of Markovian regime-switching system

    Authors: Fan Wu, Xun Li, Jie Xiong, Xin Zhang

    Abstract: This paper investigates a zero-sum stochastic linear-quadratic (SLQ, for short) Stackelberg differential game problem, where the coefficients of the state equation and the weighting matrices in the performance functional are regulated by a Markov chain. By utilizing the findings in \citet{Zhang.X.2021_ILQM}, we directly present the feedback representation to the rational reaction of the follower.…

    Submitted 30 August, 2024; originally announced August 2024.

    MSC Class: 91A15; 49N10; 93E20

  2. arXiv:2408.16350  [pdf, other]

    astro-ph.SR astro-ph.GA astro-ph.HE

    Adiabatic Mass Loss in Binary Stars. V. Effects of Metallicity and Nonconservative Mass Transfer -- Application in High Mass X-ray Binaries

    Authors: Hongwei Ge, Christopher Adam Tout, Xuefei Chen, Song Wang, Jianping Xiong, Lifu Zhang, Qingzhong Liu, Zhanwen Han

    Abstract: Binary stars are responsible for many unusual astrophysical phenomena, including some important explosive cosmic events. The stability criteria for rapid mass transfer and common-envelope evolution are fundamental to binary star evolution. They determine the mass, mass ratio, and orbital distribution of systems such as X-ray binaries and merging gravitational-wave sources. We use our adiabatic mas…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: Submitted to ApJ. Comments are welcome

  3. arXiv:2408.12818  [pdf, ps, other]

    math.OC

    Stochastic linear-quadratic differential game with Markovian jumps in an infinite horizon

    Authors: Fan Wu, Xun Li, Jie Xiong, Xin Zhang

    Abstract: This paper investigates a two-person non-homogeneous linear-quadratic stochastic differential game (LQ-SDG, for short) in an infinite horizon for a system regulated by a time-invariant Markov chain. Both non-zero-sum and zero-sum LQ-SDG problems are studied. It is shown that the zero-sum LQ-SDG problem can be considered a special non-zero-sum LQ-SDG problem. The open-loop Nash equilibrium point of…

    Submitted 22 August, 2024; originally announced August 2024.

    MSC Class: 93E03; 93E20

  4. arXiv:2408.10803  [pdf]

    astro-ph.SR astro-ph.IM

    Estimating the Atmospheric Parameters of Early-type Stars from the Chinese Space Station Telescope (CSST) Slitless Spectra Survey

    Authors: JiaRui Rao, HaiLiang Chen, JianPing Xiong, LuQian Wang, YanJun Guo, JiaJia Li, Chao Liu, ZhanWen Han, XueFei Chen

    Abstract: The measurement of atmospheric parameters is fundamental for scientific research using stellar spectra. The Chinese Space Station Telescope (CSST), scheduled to be launched in 2024, will provide researchers with hundreds of millions of slitless spectra for stars during a 10 yr survey. And machine learning has unparalleled efficiency in processing large amounts of data compared to manual processing…

    Submitted 20 August, 2024; originally announced August 2024.

    Journal ref: The Astronomical Journal, 168:20 (17pp), 2024 July

  5. arXiv:2408.07981  [pdf, other]

    cs.CV cs.AI

    LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learning

    Authors: Jiajie Li, Garrett Skinner, Gene Yang, Brian R Quaranto, Steven D Schwaitzberg, Peter C W Kim, Jinjun Xiong

    Abstract: Multimodal large language models (LLMs) have achieved notable success across various domains, while research in the medical field has largely focused on unimodal images. Meanwhile, current general-domain multimodal models for videos still lack the capabilities to understand and engage in conversations about surgical videos. One major contributing factor is the absence of datasets in the surgical f…

    Submitted 15 August, 2024; originally announced August 2024.

  6. arXiv:2408.06381  [pdf, other]

    eess.IV cs.AI cs.CV

    Assessment of Cell Nuclei AI Foundation Models in Kidney Pathology

    Authors: Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo

    Abstract: Cell nuclei instance segmentation is a crucial task in digital kidney pathology. Traditional automatic segmentation methods often lack generalizability when applied to unseen datasets. Recently, the success of foundation models (FMs) has provided a more generalizable solution, potentially enabling the segmentation of any cell type. In this study, we perform a large-scale evaluation of three widely…

    Submitted 9 August, 2024; originally announced August 2024.

  7. arXiv:2408.05878  [pdf]

    cond-mat.supr-con physics.optics

    Drone based superconducting single photon detection system with detection efficiency more than 90%

    Authors: Ruoyan Ma, Zhimin Guo, Dai Chen, Xiaojun Dai, You Xiao, ChengJun Zhang, Jiamin Xiong, Jia Huang, Xingyu Zhang, Xiaoyu Liu, Liangliang Rong, Hao Li, Xiaofu Zhang, Lixing You

    Abstract: Bounded by the size, weight, and power consumption (SWaP) of conventional superconducting single photon detectors (SSPD), applications of SSPDs were commonly confined in the laboratory. However, booming demands for high efficiency single photon detector incorporated with avionic platforms arise with the development of remote imaging and sensing or long-haul quantum communication without topographi…

    Submitted 11 August, 2024; originally announced August 2024.

  8. arXiv:2408.04988  [pdf, other]

    physics.bio-ph q-bio.MN

    Optimal Frequency in Second Messenger Signaling Quantifying cAMP Information Transmission in Bacteria

    Authors: Jiarui Xiong, Liang Wang, Jialun Lin, Lei Ni, Rongrong Zhang, Shuai Yang, Yajia Huang, Jun Chu, Fan Jin

    Abstract: Bacterial second messengers are crucial for transmitting environmental information to cellular responses. However, quantifying their information transmission capacity remains challenging. Here, we engineer an isolated cAMP signaling channel in Pseudomonas aeruginosa using targeted gene knockouts, optogenetics, and a fluorescent cAMP probe. This design allows precise optical control and real-time m…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 33 pages, 4 figures

    MSC Class: 92-05; 92-10

    ACM Class: J.2.4

  9. A Method of Rapidly Deriving Late-type Contact Binary Parameters and Its Application in the Catalina Sky Survey

    Authors: JinLiang Wang, Xu Ding, JiaJia Li, JianPing Xiong, Qiyuan Cheng, KaiFan Ji

    Abstract: With the continuous development of large optical surveys, a large number of light curves of late-type contact binary systems (CBs) have been released. Deriving parameters for CBs using the WD program and the PHOEBE program poses a challenge. Therefore, this study developed a method for rapidly deriving light curves based on the Neural Networks (NN) model combined with the Hamiltonian Monte Car…

    Submitted 9 August, 2024; originally announced August 2024.

    Journal ref: The Astrophysical Journal Supplement Series. Published 2024 July 31

  10. arXiv:2408.00145  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Energy-lowering symmetry breaking creates a flat-band insulator in paramagnetic Nb3Cl8

    Authors: Jia-Xin Xiong, Xiuwen Zhang, Alex Zunger

    Abstract: Ordinary band structure calculations of quantum materials often incorrectly predicted metallic, instead of insulating electronic structure, motivating Mott-Hubbard strong electron correlation as a gapping mechanism. More recently, allowing the formation of local structural symmetry breaking motifs in otherwise ordinary band theory was shown to lower the total energy while predicting insulating gap…

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: 3 figures

  11. arXiv:2407.19656  [pdf, other]

    physics.app-ph

    Exploring quantum sensing for fine-grained liquid recognition

    Authors: Yuechun Jiao, Jinlian Hu, Zitong Lan, Fusang Zhang, Jie Xiong, Jingxu Bai, Zhaoxin Chang, Yuqi Su, Beihong Jin, Daqing Zhang, Jianming Zhao, Suotang Jia

    Abstract: Recent years have witnessed the use of pervasive wireless signals (e.g., Wi-Fi, RFID, and mmWave) for sensing purposes. Due to its non-intrusive characteristic, wireless sensing plays an important role in various intelligent sensing applications. However, limited by the inherent thermal noise of RF transceivers, the sensing granularity of existing wireless sensing systems is still coarse, limitin…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 7 pages, 6 figures

  12. arXiv:2407.09102  [pdf, ps, other]

    math.PR

    Quantitative diffusion approximation for the Neutral $r$-Alleles Wright-Fisher Model with Mutations

    Authors: Peng Chen, Jie Xiong, Lihu Xu, Jiayu Zheng

    Abstract: We apply a Lindeberg principle under the Markov process setting to approximate the Wright-Fisher model with neutral $r$-alleles using a diffusion process, deriving an error rate based on a function class distance involving fourth-order bounded differentiable functions. This error rate consists of a linear combination of the maximum mutation rate and the reciprocal of the population size. Our resul…

    Submitted 12 July, 2024; originally announced July 2024.

  13. arXiv:2407.09025  [pdf, other]

    cs.AI

    SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

    Authors: Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang

    Abstract: Spreadsheets, with their extensive two-dimensional grids, various layouts, and diverse formatting options, present notable challenges for large language models (LLMs). In response, we introduce SpreadsheetLLM, pioneering an efficient encoding method designed to unleash and optimize LLMs' powerful understanding and reasoning capability on spreadsheets. Initially, we propose a vanilla serialization…

    Submitted 12 July, 2024; originally announced July 2024.

  14. arXiv:2407.08965  [pdf, other]

    cs.CV cs.LG

    Lite-SAM Is Actually What You Need for Segment Everything

    Authors: Jianhai Fu, Yuanjie Yu, Ningchuan Li, Yi Zhang, Qichao Chen, Jianping Xiong, Jun Yin, Zhiyu Xiang

    Abstract: This paper introduces Lite-SAM, an efficient end-to-end solution for the SegEvery task designed to reduce computational costs and redundancy. Lite-SAM is composed of four main components: a streamlined CNN-Transformer hybrid encoder (LiteViT), an automated prompt proposal network (AutoPPN), a traditional prompt encoder, and a mask decoder. All these components are integrated within the SAM framewo…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: ECCV 2024 Accepted

  15. arXiv:2407.06757  [pdf, ps, other]

    math.AP

    Extinction profiles for the Sobolev critical fast diffusion equation in bounded domains. I. One bubble dynamics

    Authors: Tianling Jin, Jingang Xiong

    Abstract: In this paper, we investigate the extinction behavior of nonnegative solutions to the Sobolev critical fast diffusion equation in bounded smooth domains with the Dirichlet zero boundary condition. Under the two-bubble energy threshold assumption on the initial data, we prove the dichotomy that every solution converges uniformly, in terms of relative error, to either a steady state or a blowing-up…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 40 pages, comments are welcome

  16. arXiv:2407.06662  [pdf, other]

    eess.SP

    Experimental Demonstration of 16D Voronoi Constellation with Two-Level Coding over 50km Four-Core Fiber

    Authors: Can Zhao, Bin Chen, Jiaqi Cai, Zhiwei Liang, Yi Lei, Junjie Xiong, Lin Ma, Daohui Hu, Lin Sun, Gangxiang Shen

    Abstract: A 16-dimensional Voronoi constellation concatenated with multilevel coding is experimentally demonstrated over a 50km four-core fiber transmission system. The proposed scheme reduces the required launch power by 6dB and provides a 17dB larger operating range than 16QAM with BICM at the outer HD-FEC BER threshold.

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 4 pages, 4 figures, accepted by 2024 European Conference on Optical Communication (ECOC)

  17. arXiv:2407.02404  [pdf, other]

    cs.NI

    Shared-Protected Backup Paths Assignment with Mode Group Division Multiplexing in Optical Networks

    Authors: Jiaheng Xiong, Qiaolun Zhang, Ruikun Wang, Alberto Gatto, Francesco Musumeci, Massimo Tornatore

    Abstract: We evaluate the resource efficiency of Mode Group Division Multiplexing (MGDM) with shared path protection (SPP) in optical networks. In our case studies, SPP with MGDM obtains significant savings in terms of both additional backup spectrum occupation and MIMO-computing resources compared to other few-mode-transmission scenarios.

    Submitted 2 July, 2024; originally announced July 2024.

  18. arXiv:2407.00596  [pdf, other]

    eess.IV cs.CV

    HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Juming Xiong, Shunxing Bao, Hao Li, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Panoramic image segmentation in computational pathology presents a remarkable challenge due to the morphologically complex and variably scaled anatomy. For instance, the intricate organization in kidney pathology spans multiple layers, from regions like the cortex and medulla to functional units such as glomeruli, tubules, and vessels, down to various cell types. In this paper, we propose a novel…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.19286

  19. arXiv:2406.19540  [pdf, other]

    cs.CV

    Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results

    Authors: Jialin Yue, Tianyuan Yao, Ruining Deng, Quan Liu, Juming Xiong, Haichun Yang, Yuankai Huo

    Abstract: Recently, the use of circle representation has emerged as a method to improve the identification of spherical objects (such as glomeruli, cells, and nuclei) in medical imaging studies. In traditional bounding box-based object detection, combining results from multiple models improves accuracy, especially when real-time processing isn't crucial. Unfortunately, this widely adopted strategy is not re…

    Submitted 27 June, 2024; originally announced June 2024.

  20. arXiv:2406.17926  [pdf, other]

    cs.CL cs.SD eess.AS

    FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data

    Authors: Dancheng Liu, Jinjun Xiong

    Abstract: Automatic Speech Recognition (ASR) for adults' speeches has made significant progress by employing deep neural network (DNN) models recently, but improvement in children's speech is still unsatisfactory due to children's speech's distinct characteristics. DNN models pre-trained on adult data often struggle in generalizing children's speeches with fine tuning because of the lack of high-quality ali…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 4 pages, 1 figure

  21. arXiv:2406.15673  [pdf, other]

    cs.CL cs.AI

    Large Language Models have Intrinsic Self-Correction Ability

    Authors: Dancheng Liu, Amir Nassereldine, Ziming Yang, Chenhui Xu, Yuting Hu, Jiajie Li, Utkarsh Kumar, Changjae Lee, Jinjun Xiong

    Abstract: Large language models (LLMs) have attracted significant attention for their remarkable abilities in various natural language processing tasks, but they suffer from hallucinations that will cause performance degradation. One promising solution to improve the LLMs' performance is to ask LLMs to revise their answer after generation, a technique known as self-correction. Among the two types of self-co…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: in submission

  22. arXiv:2406.15668  [pdf, other]

    cs.CL cs.SD eess.AS

    PI-Whisper: An Adaptive and Incremental ASR Framework for Diverse and Evolving Speaker Characteristics

    Authors: Amir Nassereldine, Dancheng Liu, Chenhui Xu, Jinjun Xiong

    Abstract: As edge-based automatic speech recognition (ASR) technologies become increasingly prevalent for the development of intelligent and personalized assistants, three important challenges must be addressed for these resource-constrained ASR models, i.e., adaptivity, incrementality, and inclusivity. We propose a novel ASR framework, PI-Whisper, in this work and show how it can improve an ASR's recogniti…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 11 pages, 3 figures

  23. arXiv:2406.14417  [pdf]

    cond-mat.mes-hall

    Electrical switching of Ising-superconducting nonreciprocity for quantum neuronal transistor

    Authors: Junlin Xiong, Jiao Xie, Bin Cheng, Yudi Dai, Xinyu Cui, Lizheng Wang, Zenglin Liu, Ji Zhou, Naizhou Wang, Xianghan Xu, Xianhui Chen, Sang-Wook Cheong, Shi-Jun Liang, Feng Miao

    Abstract: Nonreciprocal quantum transport effect is mainly governed by the symmetry breaking of the material systems and is gaining extensive attention in condensed matter physics. Realizing electrical switching of the polarity of the nonreciprocal transport without external magnetic field is essential to the development of nonreciprocal quantum devices. However, electrical switching of superconducting nonr…

    Submitted 20 June, 2024; originally announced June 2024.

    Journal ref: Nature Communications 15, 4953 (2024)

  24. arXiv:2406.13035  [pdf, other]

    cs.CL

    D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models

    Authors: Zhongwei Wan, Xinjian Wu, Yu Zhang, Yi Xin, Chaofan Tao, Zhihong Zhu, Xin Wang, Siqi Luo, Jing Xiong, Mi Zhang

    Abstract: Efficient inference in Large Language Models (LLMs) is impeded by the growing memory demands of key-value (KV) caching, especially for longer sequences. Traditional KV cache eviction strategies, which prioritize less critical KV-pairs based on attention scores, often degrade generation quality, leading to issues such as context loss or hallucinations. To address this, we introduce Dynamic Discrimi…

    Submitted 23 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  25. arXiv:2406.06905  [pdf, ps, other]

    math.PR

    Some quenched and annealed limit theorems of superprocesses in random environments

    Authors: Zeteng Fan, Jieliang Hong, Jie Xiong

    Abstract: Let $X=(X_t, t\geq 0)$ be a superprocess in a random environment described by a Gaussian noise $W=\{W(t,x), t\geq 0, x\in \mathbb{R}^d\}$ white in time and colored in space with correlation kernel $g(x,y)$. When $d\geq 3$, under the condition that the correlation function $g(x,y)$ is bounded above by some appropriate function $\bar{g}(x-y)$, we present the quenched and annealed Strong Law of Large…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 36 pages

    MSC Class: 60H15; 60G57; 60J80

  26. arXiv:2406.04621  [pdf, ps, other]

    math.OC

    Mean-field stochastic linear quadratic control problem with random coefficients

    Authors: Jie Xiong, Wen Xu

    Abstract: In this paper, we first prove that the mean-field stochastic linear quadratic (MFSLQ) control problem with random coefficients has a unique optimal control and derive a preliminary stochastic maximum principle to characterize this optimal control by an optimality system. However, because of the term of the form $\mathbb{E}[A(\cdot)X(\cdot)] $ in the adjoint equation, which cannot be represented in…

    Submitted 30 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  27. arXiv:2406.03777  [pdf, other]

    cs.LG cs.AI

    Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices

    Authors: Ruiyang Qin, Dancheng Liu, Zheyu Yan, Zhaoxuan Tan, Zixuan Pan, Zhenge Jia, Meng Jiang, Ahmed Abbasi, Jinjun Xiong, Yiyu Shi

    Abstract: The scaling laws have become the de facto guidelines for designing large language models (LLMs), but they were studied under the assumption of unlimited computing resources for both training and inference. As LLMs are increasingly used as personalized intelligent assistants, their customization (i.e., learning through fine-tuning) and deployment onto resource-constrained edge devices will become m…

    Submitted 13 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Benchmarking paper

  28. arXiv:2406.02630  [pdf, other]

    cs.CR cs.AI

    AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways

    Authors: Zehang Deng, Yongjian Guo, Changzhou Han, Wanlun Ma, Junwu Xiong, Sheng Wen, Yang Xiang

    Abstract: An Artificial Intelligence (AI) agent is a software entity that autonomously performs tasks or makes decisions based on pre-defined objectives and data inputs. AI agents, capable of perceiving user inputs, reasoning and planning tasks, and executing actions, have seen remarkable advancements in algorithm development and task performance. However, the security challenges they pose remain under-expl…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: ACM Computing Survey

  29. arXiv:2406.01572  [pdf, other]

    cs.LG

    Unlocking Guidance for Discrete State-Space Diffusion and Flow Models

    Authors: Hunter Nisonoff, Junhao Xiong, Stephan Allenspach, Jennifer Listgarten

    Abstract: Generative models on discrete state-spaces have a wide range of potential applications, particularly in the domain of natural sciences. In continuous state-spaces, controllable and flexible generation of samples with desired properties has been realized using guidance on diffusion and flow models. However, these guidance approaches are not readily amenable to discrete state-space models. Consequen…

    Submitted 31 July, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  30. arXiv:2406.01341  [pdf, other]

    cs.SI

    Important node identification for complex networks based on improved Electre Multi-Attribute fusion

    Authors: Qi Cao, Yurong Song, Min Li, Ruqi Li, Hongbo Qu, Guo-Ping Jiang, Jinye Xiong

    Abstract: Influence maximization problem involves selecting a subset of seed nodes within a social network to maximize information spread under a given diffusion model, so how to identify the important nodes is the problem to be considered in this paper. Due to the great differences in the reality of the network, a class of multi-attribute decision fusion methods is often used to solve this problem. Electre…

    Submitted 3 June, 2024; originally announced June 2024.

  31. The First Photometric Analysis of Two Low Mass Ratio Contact Binary Systems In TESS Survey

    Authors: Qiyuan Cheng, Jianping Xiong, Xu Ding, Kaifan Ji, Jiao Li, Chao Liu, Jiangdan Li, Jingxiao Luo, Xin Lyu, Zhanwen Han, Xuefei Chen

    Abstract: Low mass-ratio (q) contact binary systems are progenitors of stellar mergers such as blue stragglers (BS) or fast-rotating FK Com stars. In this study, we present the first light curve analysis of two newly identified low mass-ratio contact binary systems, TIC 55007847 and TIC 63597006, that are identified from TESS. Both stars are classified as A-subtype contact binaries. We obtained the precise o…

    Submitted 30 May, 2024; originally announced May 2024.

  32. arXiv:2405.16234  [pdf, other]

    cs.CV

    Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities

    Authors: Shiyu Xia, Junyu Xiong, Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Mengyu Zhou, Yeye He, Shi Han, Dongmei Zhang

    Abstract: This paper explores capabilities of Vision Language Models on spreadsheet comprehension. We propose three self-supervised challenges with corresponding evaluation metrics to comprehensively evaluate VLMs on Optical Character Recognition (OCR), spatial perception, and visual format recognition. Additionally, we utilize the spreadsheet table detection task to assess the overall performance of VLMs b…

    Submitted 8 August, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  33. arXiv:2405.14722  [pdf, other]

    cs.CL

    CAPE: Context-Adaptive Positional Encoding for Length Extrapolation

    Authors: Chuanyang Zheng, Yihang Gao, Han Shi, Minbin Huang, Jingyao Li, Jing Xiong, Xiaozhe Ren, Michael Ng, Xin Jiang, Zhenguo Li, Yu Li

    Abstract: Positional encoding plays a crucial role in transformers, significantly impacting model performance and length generalization. Prior research has introduced absolute positional encoding (APE) and relative positional encoding (RPE) to distinguish token positions in given sequences. However, both APE and RPE remain fixed after model training regardless of input data, limiting their adaptability and…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Technical Report

  34. arXiv:2405.13972  [pdf, other]

    cs.LG

    Infinite-Dimensional Feature Interaction

    Authors: Chenhui Xu, Fuxun Yu, Maoliang Li, Zihao Zheng, Zirui Xu, Jinjun Xiong, Xiang Chen

    Abstract: The past neural network design has largely focused on feature representation space dimension and its capacity scaling (e.g., width, depth), but overlooked the feature interaction space scaling. Recent advancements have shown shifted focus towards element-wise multiplication to facilitate higher-dimensional feature interaction space for better information transformation. Despite this progress, mu…

    Submitted 9 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  35. arXiv:2405.11647  [pdf, other]

    cs.AI cs.LG

    Hummer: Towards Limited Competitive Preference Dataset

    Authors: Li Jiang, Yusen Wu, Junwu Xiong, Jingqing Ruan, Yichuan Ding, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng

    Abstract: Preference datasets are essential for incorporating human preferences into pre-trained language models, playing a key role in the success of Reinforcement Learning from Human Feedback. However, these datasets often demonstrate conflicting alignment objectives, leading to increased vulnerability to jailbreak attacks and challenges in adapting downstream tasks to prioritize specific alignment object…

    Submitted 6 August, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Journal ref: COLM 2024

  36. arXiv:2405.09083  [pdf, other]

    cs.CV

    RSHazeDiff: A Unified Fourier-aware Diffusion Model for Remote Sensing Image Dehazing

    Authors: Jiamei Xiong, Xuefeng Yan, Yongzhen Wang, Wei Zhao, Xiao-Ping Zhang, Mingqiang Wei

    Abstract: Haze severely degrades the visual quality of remote sensing images and hampers the performance of automotive navigation, intelligent monitoring, and urban management. The emerging denoising diffusion probabilistic model (DDPM) exhibits the significant potential for dense haze removal with its strong generation ability. Since remote sensing images contain extensive small-scale texture structures, i…

    Submitted 15 May, 2024; originally announced May 2024.

  37. arXiv:2405.08748  [pdf, other]

    cs.CV

    Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

    Authors: Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu, et al. (20 additional authors not shown)

    Abstract: We present Hunyuan-DiT, a text-to-image diffusion transformer with fine-grained understanding of both English and Chinese. To construct Hunyuan-DiT, we carefully design the transformer structure, text encoder, and positional encoding. We also build from scratch a whole data pipeline to update and evaluate data for iterative model optimization. For fine-grained language understanding, we train a Mu…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Project Page: https://dit.hunyuan.tencent.com/

  38. arXiv:2405.08463  [pdf, other]

    cs.CV

    A Timely Survey on Vision Transformer for Deepfake Detection

    Authors: Zhikan Wang, Zhongyao Cheng, Jiajie Xiong, Xun Xu, Tianrui Li, Bharadwaj Veeravalli, Xulei Yang

    Abstract: In recent years, the rapid advancement of deepfake technology has revolutionized content creation, lowering forgery costs while elevating quality. However, this progress brings forth pressing concerns such as infringements on individual rights, national security threats, and risks to public safety. To counter these challenges, various detection methodologies have emerged, with Vision Transformer (…

    Submitted 14 May, 2024; originally announced May 2024.

  39. arXiv:2405.04700  [pdf, other]

    cs.LG cs.AI cs.DC cs.IR

    Robust Implementation of Retrieval-Augmented Generation on Edge-based Computing-in-Memory Architectures

    Authors: Ruiyang Qin, Zheyu Yan, Dewen Zeng, Zhenge Jia, Dancheng Liu, Jianbo Liu, Zhi Zheng, Ningyuan Cao, Kai Ni, Jinjun Xiong, Yiyu Shi

    Abstract: Large Language Models (LLMs) deployed on edge devices learn through fine-tuning and updating a certain portion of their parameters. Although such learning methods can be optimized to reduce resource utilization, the overall required resources remain a heavy burden on edge devices. Instead, Retrieval-Augmented Generation (RAG), a resource-efficient LLM learning method, can improve the quality of th…

    Submitted 7 May, 2024; originally announced May 2024.

  40. arXiv:2405.03192  [pdf, other]

    cs.LG cs.AI

    QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation

    Authors: Chenhui Xu, Xinyao Wang, Fuxun Yu, Jinjun Xiong, Xiang Chen

    Abstract: Machine learning is evolving towards high-order models that necessitate pre-training on extensive datasets, a process associated with significant overheads. Traditional models, despite having pre-trained weights, are becoming obsolete due to architectural differences that obstruct the effective transfer and initialization of these weights. To address these challenges, we introduce a novel framewor…

    Submitted 8 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  41. arXiv:2405.00750  [pdf, other]

    cs.HC cs.AI cs.CY

    From Keyboard to Chatbot: An AI-powered Integration Platform with Large-Language Models for Teaching Computational Thinking for Young Children

    Authors: Changjae Lee, Jinjun Xiong

    Abstract: Teaching programming in early childhood (4-9) to enhance computational thinking has gained popularity in the recent movement of computer science for all. However, current practices ignore some fundamental issues resulting from young children's developmental readiness, such as the sustained capability to keyboarding, the decomposition of complex tasks to small tasks, the need for intuitive mapping…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 26 pages, 11 figures

  42. arXiv:2404.18561  [pdf, ps, other]

    math.OC

    Social Optima of Linear Forward-Backward Stochastic System

    Authors: Guangchen Wang, Shujun Wang, Jie Xiong

    Abstract: A linear quadratic (LQ) stochastic optimization system involving a large population, which is driven by a forward-backward stochastic differential equation (FBSDE), is investigated in this paper. Agents cooperate with each other to minimize the so-called social objective, which is rather different from a mean field (MF) game. Employing the forward-backward person-by-person optimality principle, we derive an…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 30 pages

  43. arXiv:2404.10963  [pdf, other]

    physics.ins-det

    Dose rate dependence of TID damage to 65 nm CMOS transistors in X-ray irradiations of the ATLAS ITk Pixel ASIC (ITkPix)

    Authors: Daniela Bortoletto, Aleksandra Dimitrievska, Maurice Garcia-Sciveres, Timon Heim, Maria Mironova, Richard Plackett, Ian Shipsey, Junwen Xiong

    Abstract: The ATLAS Inner Tracker (ITk) upgrade for the High-Luminosity LHC (HL-LHC) requires a radiation-tolerant pixel readout chip, which must withstand a total ionising dose (TID) of up to 1 Grad. The readout ASIC for the ITk upgrade has been designed by the RD53 collaboration using 65 nm CMOS technology. In order to characterise the radiation tolerance of the chip digital logic, the RD53 ASICs include…

    Submitted 16 April, 2024; originally announced April 2024.

  44. arXiv:2404.10285  [pdf, other]

    math.OC

    Two system transformation data-driven algorithms for linear quadratic mean-field games

    Authors: Xun Li, Guangchen Wang, Yu Wang, Jie Xiong, Heng Zhang

    Abstract: This paper studies a class of continuous-time linear quadratic (LQ) mean-field game problems. We develop two system transformation data-driven algorithms to approximate the decentralized strategies of the LQ mean-field games. The main feature of the obtained data-driven algorithms is that they eliminate the requirement on all system matrices. First, we transform the original stochastic system into…

    Submitted 16 April, 2024; originally announced April 2024.

  45. arXiv:2404.04835  [pdf, other]

    astro-ph.SR astro-ph.HE

    A born ultramassive white dwarf-hot subdwarf super-Chandrasekhar candidate

    Authors: Changqing Luo, Jiao Li, Chuanjie Zheng, Dongdong Liu, Zhenwei Li, Yangping Luo, Peter Nemeth, Bo Zhang, Jianping Xiong, Bo Wang, Song Wang, Yu Bai, Qingzheng Li, Pei Wang, Zhanwen Han, Jifeng Liu, Yang Huang, Xuefei Chen, Chao Liu

    Abstract: Although a supernova is a well-known endpoint of an accreting white dwarf, alternative theoretical possibilities have been discussed broadly, such as the accretion-induced collapse (AIC) event as the endpoint of oxygen-neon (ONe) white dwarfs, either accreting up to or merging to exceed the Chandrasekhar limit (the maximum mass of a stable white dwarf). AIC is an important channel to form neutron s…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 25 pages, 14 figures

  46. Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation

    Authors: Hui Xiao, Yuting Hong, Li Dong, Diqun Yan, Jiayan Zhuang, Junjie Xiong, Dongtai Liang, Chengbin Peng

    Abstract: Semi-supervised semantic segmentation relieves the reliance on large-scale labeled data by leveraging unlabeled data. Recent semi-supervised semantic segmentation approaches mainly resort to pseudo-labeling methods to exploit unlabeled data. However, unreliable pseudo-labeling can undermine the semi-supervision processes. In this paper, we propose an algorithm called Multi-Level Label Correction (…

    Submitted 9 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 8 figures. IEEE Transactions on Multimedia, 2024

  47. arXiv:2403.19495  [pdf, other]

    cs.CV cs.GR

    CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians

    Authors: Avinash Paliwal, Wei Ye, Jinhui Xiong, Dmytro Kotovenko, Rakesh Ranjan, Vikas Chandra, Nima Khademi Kalantari

    Abstract: The field of 3D reconstruction from images has rapidly evolved in the past few years, first with the introduction of Neural Radiance Field (NeRF) and more recently with 3D Gaussian Splatting (3DGS). The latter provides a significant edge over NeRF in terms of the training and inference speed, as well as the reconstruction quality. Although 3DGS works well for dense input images, the unstructured p…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Project page: https://people.engr.tamu.edu/nimak/Papers/CoherentGS

  48. arXiv:2403.18189  [pdf]

    cond-mat.mes-hall

    Interfacial magnetic spin Hall effect in van der Waals Fe3GeTe2/MoTe2 heterostructure

    Authors: Yudi Dai, Junlin Xiong, Yanfeng Ge, Bin Cheng, Lizheng Wang, Pengfei Wang, Zenglin Liu, Shengnan Yan, Cuiwei Zhang, Xianghan Xu, Youguo Shi, Sang-Wook Cheong, Cong Xiao, Shengyuan A. Yang, Shi-Jun Liang, Feng Miao

    Abstract: The spin Hall effect (SHE) allows efficient generation of spin polarization or spin current through charge current and plays a crucial role in the development of spintronics. While SHE typically occurs in non-magnetic materials and is time-reversal even, exploring time-reversal-odd (T-odd) SHE, which couples SHE to magnetization in ferromagnetic materials, offers a new charge-spin conversion mecha…

    Submitted 26 March, 2024; originally announced March 2024.

    Journal ref: Nature Communications 15, 1129 (2024)

  49. arXiv:2403.16209  [pdf]

    cs.CV cs.AI

    Image Captioning in news report scenario

    Authors: Tianrui Liu, Qi Cai, Changxin Xu, Bo Hong, Jize Xiong, Yuxin Qiao, Tsungwei Yang

    Abstract: Image captioning strives to generate pertinent captions for specified images, situating itself at the crossroads of Computer Vision (CV) and Natural Language Processing (NLP). This endeavor is of paramount importance with far-reaching applications in recommendation systems, news outlets, social media, and beyond. Particularly within the realm of news reporting, captions are expected to encompass d…

    Submitted 1 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures

  50. arXiv:2403.16000  [pdf, ps, other]

    math.OC math.PR

    Stochastic maximum principle for weighted mean-field system with jump

    Authors: Yanyan Tang, Jie Xiong

    Abstract: In this article, we consider a weighted mean-field control problem with jump-diffusion as its state process. The main difficulty comes from the non-Lipschitz property of the coefficients. We overcome this difficulty by an $L_{p,q}$-estimate of the solution processes with suitably chosen $p$ and $q$. The convex perturbation method combined with the aforementioned $L_{p,q}$-estimation method is utilized…

    Submitted 24 March, 2024; originally announced March 2024.