Zum Hauptinhalt springen

Showing 101–150 of 1,129 results for author: Xie, J

.
  1. arXiv:2404.04568  [pdf, other

    math.CV math.AG math.DS

    The moduli space of a rational map is Carathéodory hyperbolic

    Authors: Zhuchao Ji, Junyi Xie

    Abstract: Let $f$ be a rational map of degree $d\geq 2$. The moduli space $\mathcal{M}_f$, introduced by McMullen and Sullivan, is a complex analytic space consisting all quasiconformal conjugacy classes of $f$. For $f$ that is not flexible Lattès, we show that there is a normal affine variety $X_f$ of dimension $2d-2$ and a holomorphic injection $i:\mathcal{M}_f\to X_f$ such that $i(\mathcal{M}_f)$ is prec… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 10 pages

  2. arXiv:2404.03302  [pdf, other

    cs.CL

    How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?

    Authors: Siye Wu, Jian Xie, Jiangjie Chen, Tinghui Zhu, Kai Zhang, Yanghua Xiao

    Abstract: By leveraging the retrieval of information from external knowledge databases, Large Language Models (LLMs) exhibit enhanced capabilities for accomplishing many knowledge-intensive tasks. However, due to the inherent flaws of current retrieval systems, there might exist irrelevant information within those retrieving top-ranked passages. In this work, we present a comprehensive investigation into th… ▽ More

    Submitted 24 July, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: COLM 2024

  3. arXiv:2404.02747  [pdf, other

    cs.CV

    Faster Diffusion via Temporal Attention Decomposition

    Authors: Haozhe Liu, Wentian Zhang, Jinheng Xie, Francesco Faccio, Mengmeng Xu, Tao Xiang, Mike Zheng Shou, Juan-Manuel Perez-Rua, Jürgen Schmidhuber

    Abstract: We explore the role of attention mechanism during inference in text-conditional diffusion models. Empirical observations suggest that cross-attention outputs converge to a fixed point after several inference steps. The convergence time naturally divides the entire inference process into two phases: an initial phase for planning text-oriented visual semantics, which are then translated into images… ▽ More

    Submitted 17 July, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2404.01448  [pdf

    physics.med-ph cs.LG

    Prior Frequency Guided Diffusion Model for Limited Angle (LA)-CBCT Reconstruction

    Authors: Jiacheng Xie, Hua-Chieh Shao, Yunxiang Li, You Zhang

    Abstract: Cone-beam computed tomography (CBCT) is widely used in image-guided radiotherapy. Reconstructing CBCTs from limited-angle acquisitions (LA-CBCT) is highly desired for improved imaging efficiency, dose reduction, and better mechanical clearance. LA-CBCT reconstruction, however, suffers from severe under-sampling artifacts, making it a highly ill-posed inverse problem. Diffusion models can generate… ▽ More

    Submitted 8 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 20 pages, 8 figures, submitted to Physics in Medicine & Biology

  5. arXiv:2404.00672  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    A General and Efficient Training for Transformer via Token Expansion

    Authors: Wenxuan Huang, Yunhang Shen, Jiao Xie, Baochang Zhang, Gaoqi He, Ke Li, Xing Sun, Shaohui Lin

    Abstract: The remarkable performance of Vision Transformers (ViTs) typically requires an extremely large training cost. Existing methods have attempted to accelerate the training of ViTs, yet typically disregard method universality with accuracy dropping. Meanwhile, they break the training consistency of the original transformers, including the consistency of hyper-parameters, architecture, and strategy, wh… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Code is available at https://github.com/Osilly/TokenExpansion

  6. arXiv:2404.00403  [pdf, other

    cs.CL

    UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause

    Authors: Guimin Hu, Zhihong Zhu, Daniel Hershcovich, Hasti Seifi, Jiayuan Xie

    Abstract: Multimodal emotion recognition in conversation (MERC) and multimodal emotion-cause pair extraction (MECPE) has recently garnered significant attention. Emotions are the expression of affect or feelings; responses to specific events, thoughts, or situations are known as emotion causes. Both are like two sides of a coin, collectively describing human behaviors and intents. However, most existing wor… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  7. arXiv:2403.19919  [pdf, other

    cs.CV

    Diff-Reg v1: Diffusion Matching Model for Registration Problem

    Authors: Qianliang Wu, Haobo Jiang, Lei Luo, Jun Li, Yaqing Ding, Jin Xie, Jian Yang

    Abstract: Establishing reliable correspondences is essential for registration tasks such as 3D and 2D3D registration. Existing methods commonly leverage geometric or semantic point features to generate potential correspondences. However, these features may face challenges such as large deformation, scale inconsistency, and ambiguous matching problems (e.g., symmetry). Additionally, many previous methods, wh… ▽ More

    Submitted 24 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.00436

  8. arXiv:2403.19710  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    STRUM-LLM: Attributed and Structured Contrastive Summarization

    Authors: Beliz Gunel, James B. Wendt, Jing Xie, Yichao Zhou, Nguyen Vo, Zachary Fisher, Sandeep Tata

    Abstract: Users often struggle with decision-making between two options (A vs B), as it usually requires time-consuming research across multiple web pages. We propose STRUM-LLM that addresses this challenge by generating attributed, structured, and helpful contrastive summaries that highlight key differences between the two options. STRUM-LLM identifies helpful contrast: the specific attributes along which… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  9. arXiv:2403.19627  [pdf, ps, other

    math.DG

    Four-dimensional gradient Ricci solitons with (half) nonnegative isotropic curvature

    Authors: Huai-Dong Cao, Junming Xie

    Abstract: This is a sequel to our paper [24], in which we investigated the geometry of 4-dimensional gradient shrinking Ricci solitons with half positive (nonnegative) isotropic curvature. In this paper, we mainly focus on 4-dimensional gradient steady Ricci solitons with nonnegative isotropic curvature (WPIC) or half nonnegative isotropic curvature (half WPIC). In particular, for $4$D complete {\it ancient… ▽ More

    Submitted 18 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 21 pages; v.2: added Remark 1.3 & Remark 6.2

  10. arXiv:2403.19521  [pdf, other

    cs.CL cs.AI cs.LG

    Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models

    Authors: Ang Lv, Yuhan Chen, Kaiyi Zhang, Yulong Wang, Lifeng Liu, Ji-Rong Wen, Jian Xie, Rui Yan

    Abstract: In this paper, we delve into several mechanisms employed by Transformer-based language models (LLMs) for factual recall tasks. We outline a pipeline consisting of three major steps: (1) Given a prompt ``The capital of France is,'' task-specific attention heads extract the topic token, such as ``France,'' from the context and pass it to subsequent MLPs. (2) As attention heads' outputs are aggregate… ▽ More

    Submitted 24 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  11. arXiv:2403.16107  [pdf, other

    cs.HC

    Designing Upper-Body Gesture Interaction with and for People with Spinal Muscular Atrophy in VR

    Authors: Jingze Tian, Yingna Wang, Keye Yu, Liyi Xu, Junan Xie, Franklin Mingzhe Li, Yafeng Niu, Mingming Fan

    Abstract: Recent research proposed gaze-assisted gestures to enhance interaction within virtual reality (VR), providing opportunities for people with motor impairments to experience VR. Compared to people with other motor impairments, those with Spinal Muscular Atrophy (SMA) exhibit enhanced distal limb mobility, providing them with more design space. However, it remains unknown what gaze-assisted upper-bod… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA

  12. arXiv:2403.16053  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Quantitatively predicting angle-resolved polarized Raman intensity of black phosphorus flakes

    Authors: Tao Liu, Jia-Liang Xie, Yu-Chen Leng, Heng Wu, Jiahong Wang, Yang Li, Xue-Feng Yu, Miao-Ling Lin, Ping-Heng Tan

    Abstract: In-plane anisotropic layered materials (ALMs), such as black phosphorus (BP), exhibit unique angle-resolved polarized Raman (ARPR) spectroscopy characteristics, as attributed to birefringence, linear dichroism and complex Raman tensor. Moreover, the ARPR intensity profiles of BP flakes deposited on multilayer dielectrics are notably sensitive to their thickness, owing to interference effects. The… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures

  13. arXiv:2403.14983  [pdf, other

    physics.soc-ph cs.SI

    Reconstructing the evolution history of networked complex systems

    Authors: Junya Wang, Yi-Jiao Zhang, Cong Xu, Jiaze Li, Jiachen Sun, Jiarong Xie, Ling Feng, Tianshou Zhou, Yanqing Hu

    Abstract: The evolution processes of complex systems carry key information in the systems' functional properties. Applying machine learning algorithms, we demonstrate that the historical formation process of various networked complex systems can be extracted, including protein-protein interaction, ecology, and social network systems. The recovered evolution process has demonstrations of immense scientific v… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  14. arXiv:2403.13660  [pdf

    cs.CV

    ProMamba: Prompt-Mamba for polyp segmentation

    Authors: Jianhao Xie, Ruofan Liao, Ziang Zhang, Sida Yi, Yuesheng Zhu, Guibo Luo

    Abstract: Detecting polyps through colonoscopy is an important task in medical image segmentation, which provides significant assistance and reference value for clinical surgery. However, accurate segmentation of polyps is a challenging task due to two main reasons. Firstly, polyps exhibit various shapes and colors. Secondly, the boundaries between polyps and their normal surroundings are often unclear. Add… ▽ More

    Submitted 26 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 10 pages, 2 figures,3 tabels

  15. Context-based Fast Recommendation Strategy for Long User Behavior Sequence in Meituan Waimai

    Authors: Zhichao Feng, Junjiie Xie, Kaiyuan Li, Yu Qin, Pengfei Wang, Qianzhong Li, Bin Yin, Xiang Li, Wei Lin, Shangguang Wang

    Abstract: In the recommender system of Meituan Waimai, we are dealing with ever-lengthening user behavior sequences, which pose an increasing challenge to modeling user preference effectively. Existing sequential recommendation models often fail to capture long-term dependencies or are too complex, complicating the fulfillment of Meituan Waimai's unique business needs. To better model user interests, we con… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 9 pages, accepted by WWW 2024 Industry Track

  16. arXiv:2403.12455  [pdf, other

    cs.CV

    CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation

    Authors: Wenqi Zhu, Jiale Cao, Jin Xie, Shuangming Yang, Yanwei Pang

    Abstract: Open-vocabulary video instance segmentation strives to segment and track instances belonging to an open set of categories in a video. The vision-language model Contrastive Language-Image Pre-training (CLIP) has shown robust zero-shot classification ability in image-level open-vocabulary task. In this paper, we propose a simple encoder-decoder network, called CLIP-VIS, to adapt CLIP for open-vocabu… ▽ More

    Submitted 7 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  17. arXiv:2403.11465  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Ultra-Long Homochiral Graphene Nanoribbons Grown Within h-BN Stacks for High-Performance Electronics

    Authors: Bosai Lyu, Jiajun Chen, Sen Wang, Shuo Lou, Peiyue Shen, Jingxu Xie, Lu Qiu, Izaac Mitchell, Can Li, Cheng Hu, Xianliang Zhou, Kenji Watanabe, Takashi Taniguchi, Xiaoqun Wang, Jinfeng Jia, Qi Liang, Guorui Chen, Tingxin Li, Shiyong Wang, Wengen Ouyang, Oded Hod, Feng Ding, Michael Urbakh, Zhiwen Shi

    Abstract: Van der Waals encapsulation of two-dimensional materials within hexagonal boron nitride (h-BN) stacks has proven to be a promising way to create ultrahigh-performance electronic devices. However, contemporary approaches for achieving van der Waals encapsulation, which involve artificial layer stacking using mechanical transfer techniques, are difficult to control, prone to contamination, and unsca… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  18. arXiv:2403.10732  [pdf, other

    cs.LG cs.AI

    Variance-Dependent Regret Bounds for Non-stationary Linear Bandits

    Authors: Zhiyong Wang, Jize Xie, Yi Chen, John C. S. Lui, Dongruo Zhou

    Abstract: We investigate the non-stationary stochastic linear bandit problem where the reward distribution evolves each round. Existing algorithms characterize the non-stationarity by the total variation budget $B_K$, which is the summation of the change of the consecutive feature vectors of the linear bandits over $K$ rounds. However, such a quantity only measures the non-stationarity with respect to the e… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 30 pages

  19. arXiv:2403.10574  [pdf, other

    cs.CV

    Autoregressive Queries for Adaptive Tracking with Spatio-TemporalTransformers

    Authors: Jinxia Xie, Bineng Zhong, Zhiyi Mo, Shengping Zhang, Liangtao Shi, Shuxiang Song, Rongrong Ji

    Abstract: The rich spatio-temporal information is crucial to capture the complicated target appearance variations in visual tracking. However, most top-performing tracking algorithms rely on many hand-crafted components for spatio-temporal information aggregation. Consequently, the spatio-temporal information is far away from being fully explored. To alleviate this issue, we propose an adaptive tracker with… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  20. arXiv:2403.09181  [pdf, ps, other

    math.DS math.AG

    On the dynamical Mordell-Lang conjecture in positive characteristic

    Authors: Junyi Xie, She Yang

    Abstract: We disprove the original version of the dynamical Mordell-Lang conjecture in positive characteristic and propose a improved version of this pDML conjecture. We prove that this new version holds for bounded-degree self-maps of projective varieties. Moreover, we propose a geometric version of this pDML conjecture.

    Submitted 13 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 35 pages, most of the article is rewritten

  21. arXiv:2403.08931  [pdf, ps, other

    eess.SY

    Unleashing the True Power of Age-of-Information: Service Aggregation in Connected and Autonomous Vehicles

    Authors: Anik Mallik, Dawei Chen, Kyungtae Han, Jiang Xie, Zhu Han

    Abstract: Connected and autonomous vehicles (CAVs) rely heavily upon time-sensitive information update services to ensure the safety of people and assets, and satisfactory entertainment applications. Therefore, the freshness of information is a crucial performance metric for CAV services. However, information from roadside sensors and nearby vehicles can get delayed in transmission due to the high mobility… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 6 pages, 8 figures, to appear in the Proceedings of IEEE International Conference on Communications (IEEE ICC, 9-13 June 2024, Denver, CO, USA)

  22. arXiv:2403.08154  [pdf, other

    cs.LG eess.SP

    The Effect of Different Optimization Strategies to Physics-Constrained Deep Learning for Soil Moisture Estimation

    Authors: Jianxin Xie, Bing Yao, Zheyu Jiang

    Abstract: Soil moisture is a key hydrological parameter that has significant importance to human society and the environment. Accurate modeling and monitoring of soil moisture in crop fields, especially in the root zone (top 100 cm of soil), is essential for improving agricultural production and crop yield with the help of precision irrigation and farming tools. Realizing the full sensor data potential depe… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  23. arXiv:2403.07228  [pdf, other

    eess.SP

    Physics-constrained Active Learning for Soil Moisture Estimation and Optimal Sensor Placement

    Authors: Jianxin Xie, Bing Yao, Zheyu Jiang

    Abstract: Soil moisture is a crucial hydrological state variable that has significant importance to the global environment and agriculture. Precise monitoring of soil moisture in crop fields is critical to reducing agricultural drought and improving crop yield. In-situ soil moisture sensors, which are buried at pre-determined depths and distributed across the field, are promising solutions for monitoring so… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  24. arXiv:2403.06396  [pdf, ps, other

    eess.IV cs.CV

    A Segmentation Foundation Model for Diverse-type Tumors

    Authors: Jianhao Xie, Ziang Zhang, Guibo Luo, Yuesheng Zhu

    Abstract: Large pre-trained models with their numerous model parameters and extensive training datasets have shown excellent performance in various tasks. Many publicly available medical image datasets do not have a sufficient amount of data so there are few large-scale models in medical imaging. We propose a large-scale Tumor Segmentation Foundation Model (TSFM) with 1.6 billion parameters using Resblock-b… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 10 pages, 2 figures.About Medical image segmentation and Foundation Model

    ACM Class: I.4.6

  25. arXiv:2403.01676  [pdf, ps, other

    hep-ph

    Production of $X_b$ via radiative transition of $Υ(10753)$

    Authors: Shi-Dong Liu, Hao-Dong Cai, Zu-Xin Cai, Hong-Shuo Gao, Gang Li, Fan Wang, Ju-Jun Xie

    Abstract: We studied the radiative transitions between the $Υ(10753)$, the $S$-$D$ mixed state of the $Υ(4S)$ and $Υ_1(3\,{}^3D_1)$, and the $X_b$, the heavy quark flavor symmetry counterpart of the $X(3782)$ in the bottomonium sector. The radiative transition was assumed to occur through the intermediate bottom mesons, including $P$-wave $B_1^{(\prime)}$ mesons as well as the $S$-wave $B^{(*)}$ ones. The c… ▽ More

    Submitted 9 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures, accepted by PRD(20240510)

  26. arXiv:2403.01456  [pdf

    cs.CL cs.AI cs.CY

    Controlling Cloze-test Question Item Difficulty with PLM-based Surrogate Models for IRT Assessment

    Authors: Jingshen Zhang, Jiajun Xie, Xinying Qiu

    Abstract: Item difficulty plays a crucial role in adaptive testing. However, few works have focused on generating questions of varying difficulty levels, especially for multiple-choice (MC) cloze tests. We propose training pre-trained language models (PLMs) as surrogate models to enable item response theory (IRT) assessment, avoiding the need for human test subjects. We also propose two strategies to contro… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  27. arXiv:2403.00331  [pdf, other

    cs.DC

    WindGP: Efficient Graph Partitioning on Heterogenous Machines

    Authors: Li Zeng, Haohan Huang, Binfan Zheng, Kang Yang, Shengcheng Shao, Jinhua Zhou, Jun Xie, Rongqian Zhao, Xin Chen

    Abstract: Graph Partitioning is widely used in many real-world applications such as fraud detection and social network analysis, in order to enable the distributed graph computing on large graphs. However, existing works fail to balance the computation cost and communication cost on machines with different power (including computing capability, network bandwidth and memory size), as they only consider repli… ▽ More

    Submitted 6 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 19 pages, 15 figures, 18 tables

  28. arXiv:2402.18035  [pdf, other

    astro-ph.HE astro-ph.GA

    A study of 10 Rotating Radio Transients using Parkes radio telescope

    Authors: Xinhui Ren, Jingbo Wang, Wenming Yan, Jintao Xie, Shuangqiang Wang, Yirong Wen, Yong Xia

    Abstract: Rotating Radio Transients (RRATs) are a relatively new subclass of pulsars that emit detectable radio bursts sporadically. We conducted an analysis of 10 RRATs observed using the Parkes telescope, with 8 of these observed via the Ultra-Wideband Receiver. We measured the burst rate and produced integrated profiles spanning multiple frequency bands for 3 RRATs. We also conducted a spectral analysis… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 16 pages, 8 figures, RAA accepted

  29. A Search for Radio Pulsars in Supernova Remnants Using FAST with One Pulsar Discovered

    Authors: Zhen Zhang, Wen-Ming Yan, Jian-Ping Yuan, Na Wang, Jun-Tao Bai, Zhi-Gang Wen, Bao-Da Li, Jin-Tao Xie, De Zhao, Yu-Bin Wang, Nan-Nan Zhai

    Abstract: We report on the results of a search for radio pulsars in five supernova remnants (SNRs) with FAST. The observations were made using the 19-beam receiver in the Snapshot mode. The integration time for each pointing is 10 min. We discovered a new pulsar PSR J1845$-$0306 which has a spin period of 983.6 ms and a dispersion measure of 444.6$\pm$2.0 cm$^{-3}$ pc in observations of SNR G29.6+0.1. To ju… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures, 2 tables published in CPL

    Journal ref: Chin. Phys. Lett. 2024, 41 (2): 029701 February 2024

  30. arXiv:2402.17179  [pdf, other

    cs.LG q-bio.BM

    Dual-Space Optimization: Improved Molecule Sequence Design by Latent Prompt Transformer

    Authors: Deqian Kong, Yuhao Huang, Jianwen Xie, Edouardo Honig, Ming Xu, Shuanghong Xue, Pei Lin, Sanping Zhou, Sheng Zhong, Nanning Zheng, Ying Nian Wu

    Abstract: Designing molecules with desirable properties, such as drug-likeliness and high binding affinities towards protein targets, is a challenging problem. In this paper, we propose the Dual-Space Optimization (DSO) method that integrates latent space sampling and data space selection to solve this problem. DSO iteratively updates a latent space generative model and a synthetic dataset in an optimizatio… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  31. arXiv:2402.15116  [pdf, other

    cs.CV cs.AI cs.CL

    Large Multimodal Agents: A Survey

    Authors: Junlin Xie, Zhihong Chen, Ruifei Zhang, Xiang Wan, Guanbin Li

    Abstract: Large language models (LLMs) have achieved superior performance in powering text-based AI agents, endowing them with decision-making and reasoning abilities akin to humans. Concurrently, there is an emerging research trend focused on extending these LLM-powered AI agents into the multimodal domain. This extension enables AI agents to interpret and respond to diverse multimodal user queries, thereb… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 15 pages, 4 figures

  32. arXiv:2402.15069  [pdf, other

    astro-ph.HE

    Investigation of profile shifting and subpulse movement in PSR J0344-0901 with FAST

    Authors: H. M. Tedila, R. Yuen, N. Wang, D. Li, Z. G. Wen, W. M. Yan, J. P. Yuan, X. H. Han, P. Wang, W. W. Zhu, S. J. Dang, S. Q. Wang, J. T. Xie, Q. D. Wu, Sh. Khasanov, FAST Collaboration

    Abstract: We report two phenomena detected in PSR J0344$-$0901 from two observations conducted at frequency centered at 1.25 GHz using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The first phenomenon manifests as shifting in the pulse emission to later longitudinal phases and then gradually returns to its original location. The event lasts for about 216 pulse periods, with an average s… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  33. arXiv:2402.14789  [pdf, other

    cs.LG cs.AI

    Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning

    Authors: Johnathan Xie, Yoonho Lee, Annie S. Chen, Chelsea Finn

    Abstract: Self-supervised learning excels in learning representations from large amounts of unlabeled data, demonstrating success across multiple data modalities. Yet, extending self-supervised learning to new modalities is non-trivial because the specifics of existing methods are tailored to each domain, such as domain-specific augmentations which reflect the invariances in the target task. While masked mo… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  34. arXiv:2402.13598  [pdf, other

    cs.CL cs.AI cs.LG

    User-LLM: Efficient LLM Contextualization with User Embeddings

    Authors: Lin Ning, Luyang Liu, Jiaxing Wu, Neo Wu, Devora Berlowitz, Sushant Prakash, Bradley Green, Shawn O'Banion, Jun Xie

    Abstract: Large language models (LLMs) have revolutionized natural language processing. However, effectively incorporating complex and potentially noisy user interaction data remains a challenge. To address this, we propose User-LLM, a novel framework that leverages user embeddings to contextualize LLMs. These embeddings, distilled from diverse user interactions using self-supervised pretraining, capture la… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  35. arXiv:2402.12908  [pdf, other

    cs.CV cs.AI cs.LG

    RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models

    Authors: Xinchen Zhang, Ling Yang, Yaqi Cai, Zhaochen Yu, Kai-Ni Wang, Jiake Xie, Ye Tian, Minkai Xu, Yong Tang, Yujiu Yang, Bin Cui

    Abstract: Diffusion models have achieved remarkable advancements in text-to-image generation. However, existing models still have many difficulties when faced with multiple-object compositional generation. In this paper, we propose RealCompo, a new training-free and transferred-friendly text-to-image generation framework, which aims to leverage the respective advantages of text-to-image models and spatial-a… ▽ More

    Submitted 24 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Project: https://github.com/YangLing0818/RealCompo

  36. arXiv:2402.12678  [pdf, ps, other

    math.DS math.AG

    Algebraic dynamics and recursive inequalities

    Authors: Junyi Xie

    Abstract: We get three basic results in algebraic dynamics: (1). We give the first algorithm to compute the dynamical degrees to arbitrary precision. (2). We prove that for a family of dominant rational self-maps, the dynamical degrees are lower semi-continuous with respect to the Zariski topology. This implies a conjecture of Call and Silverman. (3). We prove that the set of periodic points of a coho… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 45 pages

  37. arXiv:2402.11428  [pdf, other

    astro-ph.HE hep-th

    Modelling The Radial Distribution of Pulsars in the Galaxy

    Authors: J. T. Xie, J. B. Wang, N. Wang, R. Manchester, G. Hobbs

    Abstract: The Parkes 20 cm Multibeam pulsar surveys have discovered nearly half of the known pulsars and revealed many distant pulsars with high dispersion measures. Using a sample of 1,301 pulsars from these surveys, we have explored the spatial distribution and birth rate of normal pulsars. The pulsar distances used to calculate the pulsar surface density are estimated from the YMW16 electron-density mode… ▽ More

    Submitted 22 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  38. arXiv:2402.10045  [pdf

    cs.CV cs.LG

    Short-Form Videos and Mental Health: A Knowledge-Guided Neural Topic Model

    Authors: Jiaheng Xie, Ruicheng Liang, Yidong Chai, Yang Liu, Daniel Zeng

    Abstract: While short-form videos head to reshape the entire social media landscape, experts are exceedingly worried about their depressive impacts on viewers, as evidenced by medical studies. To prevent widespread consequences, platforms are eager to predict these videos' impact on viewers' mental health. Subsequently, they can take intervention measures, such as revising recommendation algorithms and disp… ▽ More

    Submitted 21 March, 2024; v1 submitted 10 January, 2024; originally announced February 2024.

  39. arXiv:2402.05607  [pdf, other

    eess.SY

    Internal Model Control design for systems learned by Control Affine Neural Nonlinear Autoregressive Exogenous Models

    Authors: Jing Xie, Fabio Bonassi, Riccardo Scattolini

    Abstract: This paper explores the use of Control Affine Neural Nonlinear AutoRegressive eXogenous (CA-NNARX) models for nonlinear system identification and model-based control design. The idea behind this architecture is to match the known control-affine structure of the system to achieve improved performance. Coherently with recent literature of neural networks for data-driven control, we first analyze the… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  40. arXiv:2402.05606  [pdf, other

    eess.SY

    A Learning-based Model Predictive Control Scheme with Application to Temperature Control Units

    Authors: Jing Xie, Léo Simpson, Jonas Asprion, Riccardo Scattolini

    Abstract: Temperature control is a complex task due to its often unknown dynamics and disturbances. This paper explores the use of Neural Nonlinear AutoRegressive eXogenous (NNARX) models for nonlinear system identification and model predictive control of a temperature control unit. First, the NNARX model is identified from input-output data collected from the real plant, and a state-space representation wi… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  41. arXiv:2402.04710  [pdf, other

    cs.LG

    Incorporating Retrieval-based Causal Learning with Information Bottlenecks for Interpretable Graph Neural Networks

    Authors: Jiahua Rao, Jiancong Xie, Hanjing Lin, Shuangjia Zheng, Zhen Wang, Yuedong Yang

    Abstract: Graph Neural Networks (GNNs) have gained considerable traction for their capability to effectively process topological data, yet their interpretability remains a critical concern. Current interpretation methods are dominated by post-hoc explanations to provide a transparent and intuitive understanding of GNNs. However, they have limited performance in interpreting complicated subgraphs and can't u… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  42. arXiv:2402.04647  [pdf, other

    cs.LG

    Latent Plan Transformer: Planning as Latent Variable Inference

    Authors: Deqian Kong, Dehong Xu, Minglu Zhao, Bo Pang, Jianwen Xie, Andrew Lizarraga, Yuhao Huang, Sirui Xie, Ying Nian Wu

    Abstract: In tasks aiming for long-term returns, planning becomes essential. We study generative modeling for planning with datasets repurposed from offline reinforcement learning. Specifically, we identify temporal consistency in the absence of step-wise rewards as one key technical challenge. We introduce the Latent Plan Transformer (LPT), a novel model that leverages a latent space to connect a Transform… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  43. arXiv:2402.04219  [pdf, ps, other

    math.CO

    A classification of nonzero skew immaculate functions

    Authors: Sarah Mason, Jack Xie

    Abstract: This article presents conditions under which the skewed version of immaculate noncommutative symmetric functions are nonzero. The work is motivated by the quest to determine when the matrix definition of a skew immaculate function aligns with the Hopf algberaic definition. We describe a necessary condition for a skew immaculate function to include a non-zero term, as well as a sufficient condition… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 20 pages, 3 figures

    MSC Class: 05E05; 05C70

  44. arXiv:2402.01622  [pdf, other

    cs.CL

    TravelPlanner: A Benchmark for Real-World Planning with Language Agents

    Authors: Jian Xie, Kai Zhang, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su

    Abstract: Planning has been part of the core pursuit for artificial intelligence since its conception, but earlier AI agents mostly focused on constrained settings because many of the cognitive substrates necessary for human-level planning have been lacking. Recently, language agents powered by large language models (LLMs) have shown interesting capabilities such as tool use and reasoning. Are these languag… ▽ More

    Submitted 23 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: ICML 2024 (Spotlight)

  45. arXiv:2402.00947  [pdf, other

    physics.ins-det

    Performance of a coarsely pixelated LAPPD photosensor for the SoLID gas Cherenkov detectors

    Authors: J. Xie, C. Peng, S. Joosten, Z. -E. Meziani, A. Camsonne, M. Jones, S. Malace, E. Kaczanowicz, M. Rehfuss, N. Sparveris, M. Paolone, M. Foley, M. Minot, M. Popecki, Z. W. Zhao

    Abstract: The SoLID spectrometer's gas Cherenkov counters require photosensors that operate in a high luminosity and high background environment. The reference design features arrays of 9 or 16 tiled multi-anode photomultipliers (MaPMTs), distributed across 32 sectors, to serve the light-gas and heavy-gas Cherenkov counters, respectively. To assess the viability of a pixelated INCOM Large Area Picosecond Ph… ▽ More

    Submitted 3 July, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 11 pages, 8 figures

  46. arXiv:2401.17686  [pdf, other

    cs.CL

    Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning

    Authors: Tinghui Zhu, Kai Zhang, Jian Xie, Yu Su

    Abstract: Recent advancements have significantly augmented the reasoning capabilities of Large Language Models (LLMs) through various methodologies, especially chain-of-thought (CoT) reasoning. However, previous methods fail to address reasoning errors in intermediate steps, leading to accumulative errors. In this paper, we propose Deductive Beam Search (DBS), which seamlessly integrates CoT and deductive r… ▽ More

    Submitted 4 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

  47. Unveiling the $a_0(1710)$ nature in the process $J/ψ\to {\bar{K}}^0K^+ρ^- $

    Authors: Yan Ding, En Wang, De-Min Li, Li-Sheng Geng, Ju-Jun Xie

    Abstract: We have investigated the process $J/ψ\to {\bar{K}}^0K^+ρ^-$ by taking into account the $S$-wave ${K^*\bar{K}^*}$, $ρω$, and $ρφ$ final-state interactions, where the scalar meson $a_0(1710)$ is generated. In addition, we also take into account the contributions from the scalar $a_0(980)(\to \bar{K}^0K^+)$ and the intermediate resonances $K_1(1270)^{-}(\to {\bar{K}}^0ρ^-) $ and… ▽ More

    Submitted 23 July, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 12 pages, 13 figures, the version for PRD. arXiv admin note: substantial text overlap with arXiv:2306.15964

    Journal ref: Phys. Rev. D 110, 014032 (2024)

  48. arXiv:2401.15902  [pdf, other

    cs.CV

    A Concise but High-performing Network for Image Guided Depth Completion in Autonomous Driving

    Authors: Moyun Liu, Bing Chen, Youping Chen, Jingming Xie, Lei Yao, Yang Zhang, Joey Tianyi Zhou

    Abstract: Depth completion is a crucial task in autonomous driving, aiming to convert a sparse depth map into a dense depth prediction. Due to its potentially rich semantic information, RGB image is commonly fused to enhance the completion effect. Image-guided depth completion involves three key challenges: 1) how to effectively fuse the two modalities; 2) how to better recover depth information; and 3) how… ▽ More

    Submitted 22 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  49. arXiv:2401.15372  [pdf, ps, other

    math.AP

    Infinitely many solutions for three quasilinear Laplacian systems on weighted graphs

    Authors: Yan Pang, Junping Xie, Xingyong Zhang

    Abstract: We investigate a generalized poly-Laplacian system with a parameter on weighted finite graph, a generalized poly-Laplacian system with a parameter and Dirichlet boundary value on weighted locally finite graphs, and a $(p,q)$-Laplacian system with a parameter on weighted locally finite graphs. We utilize a critical points theorem built by Bonanno and Bisci [Bonanno, Bisci, and Regan, Math. Comput.… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  50. Effects of Magnetic Helicity on 3D Equilibria and Self-Organized States in KTX Reversed Field Pinch

    Authors: Ke Liu, Guodong Yu, Yuhua Huang, Wenzhe Mao, Yidong Xie, Xianyi Nie, Hong Li, Tao Lan, Jinlin Xie, Weixing Ding, Wandong Liu, Ge Zhuang, Caoxiang Zhu

    Abstract: The RFP is a toroidal magnetic configuration in which plasmas can spontaneously transform into different self-organized states. Among various states, the QSH state has a dominant component for the magnetic field and significantly improves confinement. Many theoretical and experimental efforts have investigated the transitions among different states. This paper employs the MRxMHD model to study the… ▽ More

    Submitted 6 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.