Zum Hauptinhalt springen

Showing 1–50 of 1,085 results for author: Zhao, T

.
  1. arXiv:2408.15527  [pdf, ps, other

    math.NT

    $L^p$ maximal estimates for Weyl sums with $k\ge3$ on $\mathbb{T}$

    Authors: Xuezhi Chen, Changxing Miao, Jiye Yuan, Tengfei Zhao

    Abstract: In this paper, we study the $L^p$ maximal estimates for the Weyl sums $\sum_{n=1}^{N}e^{2πi(nx + n^{k}t)}$ with higher-order $k\ge3$ on $\mathbb{T}$, and obtain the positive and negative results. Especially for the case $k=3$, our result is sharp up to the endpoint. The main idea is to investigate the structure of the set where large values of Weyl sums are achieved by making use of the rational a… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 17 pages

    MSC Class: 42B25; 42B37; 35Q41

  2. arXiv:2408.11769  [pdf

    cs.CY

    Decoding Pedestrian Stress on Urban Streets using Electrodermal Activity Monitoring in Virtual Immersive Reality

    Authors: Mohsen Nazemi, Bara Rababah, Daniel Ramos, Tangxu Zhao, Bilal Farooq

    Abstract: The pedestrian stress level is shown to significantly influence human cognitive processes and, subsequently, decision-making, e.g., the decision to select a gap and cross a street. This paper systematically studies the stress experienced by a pedestrian when crossing a street under different experimental manipulations by monitoring the ElectroDermal Activity (EDA) using the Galvanic Skin Response… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  3. arXiv:2408.09945  [pdf, other

    cs.CL cs.AI

    Benchmarking LLMs for Translating Classical Chinese Poetry:Evaluating Adequacy, Fluency, and Elegance

    Authors: Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang

    Abstract: Large language models (LLMs) have shown remarkable performance in general translation tasks. However, the increasing demand for high-quality translations that are not only adequate but also fluent and elegant. To assess the extent to which current LLMs can meet these demands, we introduce a suitable benchmark for translating classical Chinese poetry into English. This task requires not only adequa… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Work in progress

  4. arXiv:2408.08883  [pdf

    eess.IV

    MR Optimized Reconstruction of Simultaneous Multi-Slice Imaging Using Diffusion Model

    Authors: Ting Zhao, Zhuoxu Cui, Sen Jia, Qingyong Zhu, Congcong Liu, Yihang Zhou, Yanjie Zhu, Dong Liang, Haifeng Wang

    Abstract: Diffusion model has been successfully applied to MRI reconstruction, including single and multi-coil acquisition of MRI data. Simultaneous multi-slice imaging (SMS), as a method for accelerating MR acquisition, can significantly reduce scanning time, but further optimization of reconstruction results is still possible. In order to optimize the reconstruction of SMS, we proposed a method to use dif… ▽ More

    Submitted 21 August, 2024; v1 submitted 4 August, 2024; originally announced August 2024.

    Comments: Accepted as ISMRM 2024 Digital Poster 4024

    Journal ref: ISMRM 2024 Digital poster 4024

  5. arXiv:2408.07301  [pdf

    physics.optics physics.class-ph

    Imaginary Poynting momentum driven particle rotation by cylindrically polarized Gaussian beams

    Authors: Xue Yun, Yansheng Liang, Linquan Guo, Minru He, Tianyu Zhao, Shaowei Wang, Ming Lei

    Abstract: Imaginary Poynting momentum (IPM) provides a new degree of freedom for particle manipulation. However, the application of IPM in experiments has been largely unexplored. Here, we demonstrate the IPM driven particle rotation by cylindrically polarized Gaussian beams with no spin or orbital angular momentum. Theoretical analysis and experimental measurements demonstrate that gold microparticles will… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 10 pages, 6 figures

    MSC Class: 78A10 Physical optics

  6. arXiv:2408.01369  [pdf, other

    quant-ph

    Fabrication and characterization of low-loss Al/Si/Al parallel plate capacitors for superconducting quantum information applications

    Authors: Anthony McFadden, Aranya Goswami, Tongyu Zhao, Teun van Schijndel, Trevyn F. Q. Larson, Sudhir Sahu, Stephen Gill, Florent Lecocq, Raymond Simmonds, Chris Palmstrøm

    Abstract: Increasing the density of superconducting circuits requires compact components, however, superconductor-based capacitors typically perform worse as dimensions are reduced due to loss at surfaces and interfaces. Here, parallel plate capacitors composed of aluminum-contacted, crystalline silicon fins are shown to be a promising technology for use in superconducting circuits by evaluating the perform… ▽ More

    Submitted 23 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

  7. arXiv:2407.19042  [pdf, other

    physics.chem-ph

    Prospects for rank-reduced CCSD(T) in the context of high-accuracy thermochemistry

    Authors: Tingting Zhao, James H. Thorpe, Devin A. Matthews

    Abstract: Obtaining sub-chemical accuracy (1 kJ mol${}^{-1}$) for reaction energies of medium-sized gas-phase molecules is a longstanding challenge in the field of thermochemical modeling. The perturbative triples correction to CCSD, CCSD(T), constitutes an important component of all high-accuracy composite model chemistries that obtain this accuracy, but can be a roadblock in the calculation of medium to l… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  8. HC-GST: Heterophily-aware Distribution Consistency based Graph Self-training

    Authors: Fali Wang, Tianxiang Zhao, Junjie Xu, Suhang Wang

    Abstract: Graph self-training (GST), which selects and assigns pseudo-labels to unlabeled nodes, is popular for tackling label sparsity in graphs. However, recent study on homophily graphs show that GST methods could introduce and amplify distribution shift between training and test nodes as they tend to assign pseudo-labels to nodes they are good at. As GNNs typically perform better on homophilic nodes, th… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: accepted by CIKM 2024

  9. arXiv:2407.15664  [pdf, ps, other

    math.CA math.NT

    Some new properties of the beta function and Ramanujan R-function

    Authors: Zhen-Hang Yang, Miao-Kun Wang, Tie-Hong Zhao

    Abstract: In this paper, the power series and hypergeometric series representations of the beta and Ramanujan functions \begin{equation*} \mathcal{B}\left( x\right) =\frac{Γ\left( x\right)^{2}}{Γ\left( 2x\right) }\text{ and }\mathcal{R}\left( x\right) =-2ψ\left( x\right) -2γ\end{equation*} are presented, which yield higher order monotonicity results related to $ \mathcal{B}(x)$ and $\mathcal{R}(x)$; the dec… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 18 pages

    MSC Class: 33B15; 33C05; 11M06; 30B10; 26A48

  10. arXiv:2407.13989  [pdf, other

    cs.LG cs.AI

    Enhancing Data-Limited Graph Neural Networks by Actively Distilling Knowledge from Large Language Models

    Authors: Quan Li, Tianxiang Zhao, Lingwei Chen, Junjie Xu, Suhang Wang

    Abstract: Graphs are pervasive in the real-world, such as social network analysis, bioinformatics, and knowledge graphs. Graph neural networks (GNNs) have great ability in node classification, a fundamental task on graphs. Unfortunately, conventional GNNs still face challenges in scenarios with few labeled nodes, despite the prevalence of few-shot node classification tasks in real-world applications. To add… ▽ More

    Submitted 28 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 10 pages, 3 Figures

  11. arXiv:2407.12998  [pdf, other

    cs.RO

    Surgical Robot Transformer (SRT): Imitation Learning for Surgical Tasks

    Authors: Ji Woong Kim, Tony Z. Zhao, Samuel Schmidgall, Anton Deguet, Marin Kobilarov, Chelsea Finn, Axel Krieger

    Abstract: We explore whether surgical manipulation tasks can be learned on the da Vinci robot via imitation learning. However, the da Vinci system presents unique challenges which hinder straight-forward implementation of imitation learning. Notably, its forward kinematics is inconsistent due to imprecise joint measurements, and naively training a policy using such approximate kinematics data often leads to… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 8 pages

  12. arXiv:2407.12793  [pdf, ps, other

    cs.DB cs.AI cs.LG

    Data Collection and Labeling Techniques for Machine Learning

    Authors: Qianyu Huang, Tongfang Zhao

    Abstract: Data collection and labeling are critical bottlenecks in the deployment of machine learning applications. With the increasing complexity and diversity of applications, the need for efficient and scalable data collection and labeling techniques has become paramount. This paper provides a review of the state-of-the-art methods in data collection, data labeling, and the improvement of existing data a… ▽ More

    Submitted 19 June, 2024; originally announced July 2024.

  13. arXiv:2407.09868  [pdf

    physics.med-ph

    Separation of Sodium Signals Between Mono- and Bi-Exponential T2 Decays via Multi-TE Single-Quantum Sodium (23Na) MRI

    Authors: Yongxian Qian, Ying-Chia Lin, Xingye Chen, Tiejun Zhao, Karthik Lakshmanan, Yulin Ge, Yvonne W. Lui, Fernando E. Boada

    Abstract: Purpose. It is a long standing pursuit in sodium (23Na) MRI to separate signals between mono and bi exponential T2 decays in the human brain, due to lack of clinically translational solutions under the restriction of intrinsically low signal to noise ratio (SNR). Here we propose a new technique called multi TE single quantum (MSQ) sodium MRI to address the challenge. Methods. We exploit an intrins… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 37 pages and 14 figures

  14. arXiv:2407.09315  [pdf, other

    physics.comp-ph math-ph

    RBMD: A molecular dynamics package enabling to simulate 10 million all-atom particles in a single graphics processing unit

    Authors: Weihang Gao, Teng Zhao, Yongfa Guo, Jiuyang Liang, Huan Liu, Maoying Luo, Zedong Luo, Wei Qin, Yichao Wang, Qi Zhou, Shi Jin, Zhenli Xu

    Abstract: This paper introduces a random-batch molecular dynamics (RBMD) package for fast simulations of particle systems at the nano/micro scale. Different from existing packages, the RBMD uses random batch methods for nonbonded interactions of particle systems. The long-range part of Coulomb interactions is calculated in Fourier space by the random batch Ewald algorithm, which achieves linear complexity a… ▽ More

    Submitted 22 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 26 pages, 8 figures

  15. arXiv:2407.04923  [pdf, other

    cs.CV cs.CL

    OmChat: A Recipe to Train Multimodal Language Models with Strong Long Context and Video Understanding

    Authors: Tiancheng Zhao, Qianqian Zhang, Kyusong Lee, Peng Liu, Lu Zhang, Chunxin Fang, Jiajia Liao, Kelei Jiang, Yibo Ma, Ruochen Xu

    Abstract: We introduce OmChat, a model designed to excel in handling long contexts and video understanding tasks. OmChat's new architecture standardizes how different visual inputs are processed, making it more efficient and adaptable. It uses a dynamic vision encoding process to effectively handle images of various resolutions, capturing fine details across a range of image qualities. OmChat utilizes an ac… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 14 pages

  16. arXiv:2407.02394  [pdf, other

    cs.CV

    Similarity Distance-Based Label Assignment for Tiny Object Detection

    Authors: Shuohao Shi, Qiang Fang, Tong Zhao, Xin Xu

    Abstract: Tiny object detection is becoming one of the most challenging tasks in computer vision because of the limited object size and lack of information. The label assignment strategy is a key factor affecting the accuracy of object detection. Although there are some effective label assignment strategies for tiny objects, most of them focus on reducing the sensitivity to the bounding boxes to increase th… ▽ More

    Submitted 26 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 8 pages, 4 figures, this paper has been accepted by IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  17. arXiv:2407.01007  [pdf, other

    cs.CV

    GMT: A Robust Global Association Model for Multi-Target Multi-Camera Tracking

    Authors: Huijie Fan, Tinghui Zhao, Qiang Wang, Baojie Fan, Yandong Tang, LianQing Liu

    Abstract: In the task of multi-target multi-camera (MTMC) tracking of pedestrians, the data association problem is a key issue and main challenge, especially with complications arising from camera movements, lighting variations, and obstructions. However, most MTMC models adopt two-step approaches, thus heavily depending on the results of the first-step tracking in practical applications. Moreover, the same… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  18. arXiv:2407.00038  [pdf, other

    cs.IR

    JungleGPT: Designing and Optimizing Compound AI Systems for E-Commerce

    Authors: Sherry Ruan, Tian Zhao

    Abstract: LLMs have significantly advanced the e-commerce industry by powering applications such as personalized recommendations and customer service. However, most current efforts focus solely on monolithic LLMs and fall short in addressing the complexity and scale of real-world e-commerce scenarios. In this work, we present JungleGPT, the first compound AI system tailored for real-world e-commerce applica… ▽ More

    Submitted 28 May, 2024; originally announced July 2024.

  19. arXiv:2406.18763  [pdf, other

    cs.LG cs.AI

    Conformalized Link Prediction on Graph Neural Networks

    Authors: Tianyi Zhao, Jian Kang, Lu Cheng

    Abstract: Graph Neural Networks (GNNs) excel in diverse tasks, yet their applications in high-stakes domains are often hampered by unreliable predictions. Although numerous uncertainty quantification methods have been proposed to address this limitation, they often lack \textit{rigorous} uncertainty estimates. This work makes the first attempt to introduce a distribution-free and model-agnostic uncertainty… ▽ More

    Submitted 18 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  20. arXiv:2406.16620  [pdf, other

    cs.CV cs.CL

    OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer

    Authors: Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee

    Abstract: Recent advancements in Large Language Models (LLMs) have expanded their capabilities to multimodal contexts, including comprehensive video understanding. However, processing extensive videos such as 24-hour CCTV footage or full-length films presents significant challenges due to the vast data and processing demands. Traditional methods, like extracting key frames or converting frames to text, ofte… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  21. arXiv:2406.16321  [pdf, other

    cs.LG cs.AI

    Multimodal Graph Benchmark

    Authors: Jing Zhu, Yuhang Zhou, Shengyi Qian, Zhongmou He, Tong Zhao, Neil Shah, Danai Koutra

    Abstract: Associating unstructured data with structured information is crucial for real-world tasks that require relevance search. However, existing graph learning benchmarks often overlook the rich semantic information associate with each node. To bridge such gap, we introduce the Multimodal Graph Benchmark (MM-GRAPH), the first comprehensive multi-modal graph benchmark that incorporates both textual and v… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: https://mm-graph-benchmark.github.io/

  22. arXiv:2406.15568  [pdf, other

    cs.LG

    Robust Reinforcement Learning from Corrupted Human Feedback

    Authors: Alexander Bukharin, Ilgee Hong, Haoming Jiang, Zichong Li, Qingru Zhang, Zixuan Zhang, Tuo Zhao

    Abstract: Reinforcement learning from human feedback (RLHF) provides a principled framework for aligning AI systems with human preference data. For various reasons, e.g., personal bias, context ambiguity, lack of training, etc, human annotators may give incorrect or inconsistent preference labels. To tackle this challenge, we propose a robust RLHF approach -- $R^3M$, which models the potentially corrupted p… ▽ More

    Submitted 9 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: 22 pages, 7 figures

  23. arXiv:2406.13558   

    cs.AI

    Enhancing Travel Choice Modeling with Large Language Models: A Prompt-Learning Approach

    Authors: Xuehao Zhai, Hanlin Tian, Lintong Li, Tianyu Zhao

    Abstract: Travel choice analysis is crucial for understanding individual travel behavior to develop appropriate transport policies and recommendation systems in Intelligent Transportation Systems (ITS). Despite extensive research, this domain faces two critical challenges: a) modeling with limited survey data, and b) simultaneously achieving high model explainability and accuracy. In this paper, we introduc… ▽ More

    Submitted 22 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: We currently do not have a replacement version available. We request withdrawal due to a significant methodological error affecting the paper's validity, specifically a miscalculation in data preprocessing. We are working on corrections, but this will take time. We believe an interim withdrawal is necessary to prevent the dissemination of incorrect information.

  24. arXiv:2406.12439  [pdf, other

    cs.LG

    A data-centric approach for assessing progress of Graph Neural Networks

    Authors: Tianqi Zhao, Ngan Thi Dong, Alan Hanjalic, Megha Khosla

    Abstract: Graph Neural Networks (GNNs) have achieved state-of-the-art results in node classification tasks. However, most improvements are in multi-class classification, with less focus on the cases where each node could have multiple labels. The first challenge in studying multi-label node classification is the scarcity of publicly available datasets. To address this, we collected and released three real-w… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: Published in Data-centric Machine Learning Research Worshop @ ICML 2024

  25. arXiv:2406.11354  [pdf, other

    cs.CL cs.AI cs.CV

    Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression

    Authors: Zilun Zhang, Yutao Sun, Tiancheng Zhao, Leigang Sha, Ruochen Xu, Kyusong Lee, Jianwei Yin

    Abstract: Humans can retain old knowledge while learning new information, but Large Language Models (LLMs) often suffer from catastrophic forgetting when post-pretrained or supervised fine-tuned (SFT) on domain-specific data. Moreover, for Multimodal Large Language Models (MLLMs) which are composed of the LLM base and visual projector (e.g. LLaVA), a significant decline in performance on language benchmarks… ▽ More

    Submitted 19 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  26. arXiv:2406.11195  [pdf, other

    cond-mat.str-el

    Resolving Geometric Excitations of Fractional Quantum Hall States

    Authors: Yang Liu, Tongzhou Zhao, T. Xiang

    Abstract: The quantum dynamics of the intrinsic metric profoundly influence the neutral excitations in the fractional quantum Hall system, as established by Haldane in 2011 \cite{Haldane2011}, and further evidenced by a recent two-photon experiment \cite{Liang2024}. Despite these advancements, a comprehensive understanding of the dynamic properties of these excitations, especially at long wavelengths, conti… ▽ More

    Submitted 1 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  27. arXiv:2406.11191  [pdf, other

    cs.CL

    A Survey on Human Preference Learning for Large Language Models

    Authors: Ruili Jiang, Kehai Chen, Xuefeng Bai, Zhixuan He, Juntao Li, Muyun Yang, Tiejun Zhao, Liqiang Nie, Min Zhang

    Abstract: The recent surge of versatile large language models (LLMs) largely depends on aligning increasingly capable foundation models with human intentions by preference learning, enhancing LLMs with excellent applicability and effectiveness in a wide range of contexts. Despite the numerous related studies conducted, a perspective on how human preferences are introduced into LLMs remains limited, which ma… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: IEEE copyright statement added (also applied to the former version)

  28. arXiv:2406.10797  [pdf, other

    cs.CV

    STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

    Authors: Xiaoxiao Ma, Mohan Zhou, Tao Liang, Yalong Bai, Tiejun Zhao, Huaian Chen, Yi Jin

    Abstract: We present STAR, a text-to-image model that employs scale-wise auto-regressive paradigm. Unlike VAR, which is limited to class-conditioned synthesis within a fixed set of predetermined categories, our STAR enables text-driven open-set generation through three key designs: To boost diversity and generalizability with unseen combinations of objects and concepts, we introduce a pre-trained text encod… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures

  29. arXiv:2406.10777  [pdf, other

    cs.CL

    RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning

    Authors: Haoyu Wang, Tianci Liu, Ruirui Li, Monica Cheng, Tuo Zhao, Jing Gao

    Abstract: Pre-trained language models, trained on large-scale corpora, demonstrate strong generalizability across various NLP tasks. Fine-tuning these models for specific tasks typically involves updating all parameters, which is resource-intensive. Parameter-efficient fine-tuning (PEFT) methods, such as the popular LoRA family, introduce low-rank matrices to learn only a few parameters efficiently. However… ▽ More

    Submitted 30 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  30. arXiv:2406.10593  [pdf, other

    cs.AI cs.DB cs.IR

    QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL

    Authors: Yinggang Sun, Ziming Guo, Haining Yu, Chuanyi Liu, Xiang Li, Bingxuan Wang, Xiangzhan Yu, Tiancheng Zhao

    Abstract: Fine-tuning large language models (LLMs) for specific domain tasks has achieved great success in Text-to-SQL tasks. However, these fine-tuned models often face challenges with multi-turn Text-to-SQL tasks caused by ambiguous or unanswerable questions. It is desired to enhance LLMs to handle multiple types of questions in multi-turn Text-to-SQL tasks. To address this, we propose a novel data augmen… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 13 pages, 7 figures

  31. Multi-source Unsupervised Domain Adaptation on Graphs with Transferability Modeling

    Authors: Tianxiang Zhao, Dongsheng Luo, Xiang Zhang, Suhang Wang

    Abstract: In this paper, we tackle a new problem of \textit{multi-source unsupervised domain adaptation (MSUDA) for graphs}, where models trained on annotated source domains need to be transferred to the unsupervised target graph for node classification. Due to the discrepancy in distribution across domains, the key challenge is how to select good source instances and how to adapt the model. Diverse graph s… ▽ More

    Submitted 22 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Journal ref: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24), August 25--29, 2024, Barcelona, Spain

  32. arXiv:2406.08552  [pdf, other

    cs.CV

    DiTFastAttn: Attention Compression for Diffusion Transformer Models

    Authors: Zhihang Yuan, Pu Lu, Hanling Zhang, Xuefei Ning, Linfeng Zhang, Tianchen Zhao, Shengen Yan, Guohao Dai, Yu Wang

    Abstract: Diffusion Transformers (DiT) excel at image and video generation but face computational challenges due to self-attention's quadratic complexity. We propose DiTFastAttn, a novel post-training compression method to alleviate DiT's computational bottleneck. We identify three key redundancies in the attention computation during DiT inference: 1. spatial redundancy, where many attention heads focus on… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  33. arXiv:2406.07232  [pdf, other

    cs.CL cs.AI

    DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms

    Authors: Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang

    Abstract: Recently, large language models (LLMs) enhanced by self-reflection have achieved promising performance on machine translation. The key idea is guiding LLMs to generate translation with human-like feedback. However, existing self-reflection methods lack effective feedback information, limiting the translation performance. To address this, we introduce a DUAL-REFLECT framework, leveraging the dual l… ▽ More

    Submitted 21 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main conference

  34. arXiv:2406.06600  [pdf, other

    cs.LG cs.AI cs.CL

    HORAE: A Domain-Agnostic Modeling Language for Automating Multimodal Service Regulation

    Authors: Yutao Sun, Mingshuai Chen, Tiancheng Zhao, Kangjia Zhao, He Li, Jintao Chen, Liqiang Lu, Xinkui Zhao, Shuiguang Deng, Jianwei Yin

    Abstract: Artificial intelligence is rapidly encroaching on the field of service regulation. This work presents the design principles behind HORAE, a unified specification language to model multimodal regulation rules across a diverse set of domains. We show how HORAE facilitates an intelligent service regulation pipeline by further exploiting a fine-tuned large language model named HORAE that automates the… ▽ More

    Submitted 18 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  35. arXiv:2406.06528  [pdf, other

    quant-ph

    Phase estimation via number-conserving operation inside the SU(1,1) interferometer

    Authors: Qingqian Kang, Zekun Zhao, Teng Zhao, Cunjin Liu, Liyun Hu

    Abstract: Utilizing nonlinear elements, SU(1,1) interferometers demonstrate superior phase sensitivity compared to passive interferometers. However, the precision is significantly impacted by photon losses, particularly internal losses. We propose a theoretical scheme to improve the precision of phase measurement using homodyne detection by implementing number-conserving operations (PA-then-PS and PS-then-P… ▽ More

    Submitted 20 July, 2024; v1 submitted 29 March, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.14612

  36. arXiv:2406.05891  [pdf, other

    eess.IV cs.CV cs.LG

    GCtx-UNet: Efficient Network for Medical Image Segmentation

    Authors: Khaled Alrfou, Tian Zhao

    Abstract: Medical image segmentation is crucial for disease diagnosis and monitoring. Though effective, the current segmentation networks such as UNet struggle with capturing long-range features. More accurate models such as TransUNet, Swin-UNet, and CS-UNet have higher computation complexity. To address this problem, we propose GCtx-UNet, a lightweight segmentation architecture that can capture global and… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 13 pages, 7 figures, 7 tables

  37. arXiv:2406.05595  [pdf, other

    physics.chem-ph

    Analytic gradients for equation-of-motion coupled cluster with single, double, and perturbative triple excitations

    Authors: Tingting Zhao, Devin A. Matthews

    Abstract: Understanding the process of molecular photoexcitation is crucial in various fields, including drug development, materials science, photovoltaics, and more. The electronic vertical excitation energy is a critical property, for example in determining the singlet-triplet gap of chromophores. However, a full understanding of excited-state processes requires additional explorations of the excited-stat… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  38. arXiv:2406.05378  [pdf, ps, other

    eess.SY

    Practical Explicit-time Stabilization of a Proportional Control System

    Authors: Wen Yan, Tao Zhao

    Abstract: Proportional control can be realized directly through the amplification of analog signals, and it also has the advantage of easy tuning parameters in digital signal control. However, it is difficult for the proportional control to preset the upper bound of settling time. To address this problem, a novel practical explicit-time control method is proposed. In bounded initial condition, this method m… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  39. arXiv:2406.05267  [pdf, other

    nucl-th astro-ph.SR

    Characterizing the nuclear models informed by PREX and CREX: a view from Bayesian inference

    Authors: Tianqi Zhao, Zidu Lin, Bharat Kumar, Andrew W. Steiner, Madappa Prakash

    Abstract: New measurements of the weak charge density distributions of $^{48}$Ca and $^{208}$Pb challenge existing nuclear models. In the post-PREX-CREX era, it is unclear if current models can simultaneously describe weak charge distributions along with accurate measurements of binding energy and charge radii. In this letter, we explore the parameter space of relativistic and non-relativistic models to stu… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures in the letter, 11 pages, 8 figures in the supplemental material

  40. arXiv:2406.03684  [pdf, other

    cs.CV cs.CR

    Principles of Designing Robust Remote Face Anti-Spoofing Systems

    Authors: Xiang Xu, Tianchen Zhao, Zheng Zhang, Zhihua Li, Jon Wu, Alessandro Achille, Mani Srivastava

    Abstract: Protecting digital identities of human face from various attack vectors is paramount, and face anti-spoofing plays a crucial role in this endeavor. Current approaches primarily focus on detecting spoofing attempts within individual frames to detect presentation attacks. However, the emergence of hyper-realistic generative models capable of real-time operation has heightened the risk of digitally g… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Under review

  41. arXiv:2406.02764  [pdf, other

    cs.LG cs.AI

    Adaptive Preference Scaling for Reinforcement Learning with Human Feedback

    Authors: Ilgee Hong, Zichong Li, Alexander Bukharin, Yixiao Li, Haoming Jiang, Tianbao Yang, Tuo Zhao

    Abstract: Reinforcement learning from human feedback (RLHF) is a prevalent approach to align AI systems with human values by learning rewards from human preference data. Due to various reasons, however, such data typically takes the form of rankings over pairs of trajectory segments, which fails to capture the varying strengths of preferences across different pairs. In this paper, we propose a novel adaptiv… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  42. arXiv:2406.02540  [pdf, other

    cs.CV

    ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

    Authors: Tianchen Zhao, Tongcheng Fang, Enshu Liu, Rui Wan, Widyadewi Soedarmadji, Shiyao Li, Zinan Lin, Guohao Dai, Shengen Yan, Huazhong Yang, Xuefei Ning, Yu Wang

    Abstract: Diffusion transformers (DiTs) have exhibited remarkable performance in visual generation tasks, such as generating realistic images or videos based on textual instructions. However, larger model sizes and multi-frame processing for video generation lead to increased computational and memory costs, posing challenges for practical deployment on edge devices. Post-Training Quantization (PTQ) is an ef… ▽ More

    Submitted 30 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Project Page: https://a-suozhang.xyz/viditq.github.io/

  43. arXiv:2406.01229  [pdf, other

    cs.LG

    AGALE: A Graph-Aware Continual Learning Evaluation Framework

    Authors: Tianqi Zhao, Alan Hanjalic, Megha Khosla

    Abstract: In recent years, continual learning (CL) techniques have made significant progress in learning from streaming data while preserving knowledge across sequential tasks, particularly in the realm of euclidean data. To foster fair evaluation and recognize challenges in CL settings, several evaluation frameworks have been proposed, focusing mainly on the single- and multi-label classification task on e… ▽ More

    Submitted 7 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  44. arXiv:2405.20624  [pdf, ps, other

    cs.CL cs.AI

    Leveraging Large Language Models for Entity Matching

    Authors: Qianyu Huang, Tongfang Zhao

    Abstract: Entity matching (EM) is a critical task in data integration, aiming to identify records across different datasets that refer to the same real-world entities. Traditional methods often rely on manually engineered features and rule-based systems, which struggle with diverse and unstructured data. The emergence of Large Language Models (LLMs) such as GPT-4 offers transformative potential for EM, leve… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  45. arXiv:2405.19651  [pdf, ps, other

    math.CA

    Absolutely monotonic functions related to the asymptotic formula for the complete elliptic integral of the first kind

    Authors: Tiehong Zhao, Zhen-Hang Yang

    Abstract: Let $\mathcal{K}\left( x\right) $ be the complete elliptic integral of the first kind and \begin{equation*} \mathcal{G}_{p}\left( x\right) =e^{\mathcal{K}\left( \sqrt{x} \right) }-\frac{p}{\sqrt{1-x}} \end{equation*} for $p\in \mathbb{R}$ and $x\in \left( 0,1\right) $. In this paper we find the necessary and sufficient conditions for the functions… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 14 pages

    MSC Class: Primary 33E05; 26A48 Secondary 40A05; 41A10

  46. arXiv:2405.19109  [pdf, other

    cs.CL

    PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering

    Authors: Fangzhi Xu, Qika Lin, Tianzhe Zhao, Jiawei Han, Jun Liu

    Abstract: Logical reasoning task has attracted great interest since it was proposed. Faced with such a task, current competitive models, even large language models (e.g., ChatGPT and PaLM 2), still perform badly. Previous promising LMs struggle in logical consistency modeling and logical structure perception. To this end, we model the logical reasoning task by transforming each logical sample into reasoning… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024

  47. arXiv:2405.17873  [pdf, other

    cs.CV cs.AI

    MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

    Authors: Tianchen Zhao, Xuefei Ning, Tongcheng Fang, Enshu Liu, Guyue Huang, Zinan Lin, Shengen Yan, Guohao Dai, Yu Wang

    Abstract: Diffusion models have achieved significant visual generation quality. However, their significant computational and memory costs pose challenge for their application on resource-constrained mobile devices or even desktop GPUs. Recent few-step diffusion models reduces the inference time by reducing the denoising steps. However, their memory consumptions are still excessive. The Post Training Quantiz… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Project Page: https://a-suozhang.xyz/mixdq.github.io/

  48. arXiv:2405.17792  [pdf, other

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  49. arXiv:2405.14506  [pdf, other

    cs.CV cs.AI

    SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification

    Authors: Zuoyong Li, Qinghua Lin, Haoyi Fan, Tiesong Zhao, David Zhang

    Abstract: Semi-supervised learning suffers from the imbalance of labeled and unlabeled training data in the video surveillance scenario. In this paper, we propose a new semi-supervised learning method called SIAVC for industrial accident video classification. Specifically, we design a video augmentation module called the Super Augmentation Block (SAB). SAB adds Gaussian noise and randomly masks video frames… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  50. arXiv:2405.12971  [pdf, other

    cs.CV

    BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

    Authors: Theodore Zhao, Yu Gu, Jianwei Yang, Naoto Usuyama, Ho Hin Lee, Tristan Naumann, Jianfeng Gao, Angela Crabtree, Jacob Abel, Christine Moung-Wen, Brian Piening, Carlo Bifulco, Mu Wei, Hoifung Poon, Sheng Wang

    Abstract: Biomedical image analysis is fundamental for biomedical discovery in cell biology, pathology, radiology, and many other biomedical domains. Holistic image analysis comprises interdependent subtasks such as segmentation, detection, and recognition of relevant objects. Here, we propose BiomedParse, a biomedical foundation model for imaging parsing that can jointly conduct segmentation, detection, an… ▽ More

    Submitted 4 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Project page: https://aka.ms/biomedparse-project