Zum Hauptinhalt springen

Showing 51–100 of 3,069 results for author: Chen, T

.
  1. arXiv:2407.19225  [pdf, other

    cs.CV cs.MM

    Magic3DSketch: Create Colorful 3D Models From Sketch-Based 3D Modeling Guided by Text and Language-Image Pre-Training

    Authors: Ying Zang, Yidong Han, Chaotao Ding, Jianqi Zhang, Tianrun Chen

    Abstract: The requirement for 3D content is growing as AR/VR application emerges. At the same time, 3D modelling is only available for skillful experts, because traditional methods like Computer-Aided Design (CAD) are often too labor-intensive and skill-demanding, making it challenging for novice users. Our proposed method, Magic3DSketch, employs a novel technique that encodes sketches to predict a 3D mesh,… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

  2. arXiv:2407.18544  [pdf, other

    cs.LG

    Utilising Explainable Techniques for Quality Prediction in a Complex Textiles Manufacturing Use Case

    Authors: Briony Forsberg, Dr Henry Williams, Prof Bruce MacDonald, Tracy Chen, Dr Reza Hamzeh, Dr Kirstine Hulse

    Abstract: This paper develops an approach to classify instances of product failure in a complex textiles manufacturing dataset using explainable techniques. The dataset used in this study was obtained from a New Zealand manufacturer of woollen carpets and rugs. In investigating the trade-off between accuracy and explainability, three different tree-based classification algorithms were evaluated: a Decision… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: Accepted at the 2024 IEEE 20th International Conference on Automation Science and Engineering (CASE 2024), awaiting publication Contains seven pages and five figures

  3. arXiv:2407.18479  [pdf, other

    cs.CL

    Multi-turn Response Selection with Commonsense-enhanced Language Models

    Authors: Yuandong Wang, Xuhui Ren, Tong Chen, Yuxiao Dong, Nguyen Quoc Viet Hung, Jie Tang

    Abstract: As a branch of advanced artificial intelligence, dialogue systems are prospering. Multi-turn response selection is a general research problem in dialogue systems. With the assistance of background information and pre-trained language models, the performance of state-of-the-art methods on this problem gains impressive improvement. However, existing studies neglect the importance of external commons… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  4. arXiv:2407.18450  [pdf, other

    cs.CV cs.LG

    Textile Anomaly Detection: Evaluation of the State-of-the-Art for Automated Quality Inspection of Carpet

    Authors: Briony Forsberg, Dr Henry Williams, Prof Bruce MacDonald, Tracy Chen, Dr Kirstine Hulse

    Abstract: In this study, state-of-the-art unsupervised detection models were evaluated for the purpose of automated anomaly inspection of wool carpets. A custom dataset of four unique types of carpet textures was created to thoroughly test the models and their robustness in detecting subtle anomalies in complex textures. Due to the requirements of an inline inspection system in a manufacturing use case, the… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted at the 2023 Australasian Conference on Robotics and Automation (ACRA 2023) Publication url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184380272&partnerID=40&md5=74fde263f4a24a1bff75d6560b423994 ISSN: 14482053 Contains 10 pages and three figures

  5. arXiv:2407.17983  [pdf, other

    eess.SP

    Explain EEG-based End-to-end Deep Learning Models in the Frequency Domain

    Authors: Hanqi Wang, Kun Yang, Jingyu Zhang, Tao Chen, Liang Song

    Abstract: The recent rise of EEG-based end-to-end deep learning models presents a significant challenge in elucidating how these models process raw EEG signals and generate predictions in the frequency domain. This challenge limits the transparency and credibility of EEG-based end-to-end models, hindering their application in security-sensitive areas. To address this issue, we propose a mask perturbation me… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  6. arXiv:2407.17857  [pdf, other

    cs.CV cs.AI

    Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network

    Authors: Sukwon Yun, Jie Peng, Alexandro E. Trevino, Chanyoung Park, Tianlong Chen

    Abstract: Recent advancements in graph-based approaches for multiplexed immunofluorescence (mIF) images have significantly propelled the field forward, offering deeper insights into patient-level phenotyping. However, current graph-based methodologies encounter two primary challenges: (1) Cellular Heterogeneity, where existing approaches fail to adequately address the inductive biases inherent in graphs, pa… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  7. arXiv:2407.17412  [pdf, other

    cs.CV cs.AI

    (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork

    Authors: Tianjin Huang, Fang Meng, Li Shen, Fan Liu, Yulong Pei, Mykola Pechenizkiy, Shiwei Liu, Tianlong Chen

    Abstract: Large-scale neural networks have demonstrated remarkable performance in different domains like vision and language processing, although at the cost of massive computation resources. As illustrated by compression literature, structural model pruning is a prominent algorithm to encourage model efficiency, thanks to its acceleration-friendly sparsity patterns. One of the key questions of structural p… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: Under review

  8. Unpaired Photo-realistic Image Deraining with Energy-informed Diffusion Model

    Authors: Yuanbo Wen, Tao Gao, Ting Chen

    Abstract: Existing unpaired image deraining approaches face challenges in accurately capture the distinguishing characteristics between the rainy and clean domains, resulting in residual degradation and color distortion within the reconstructed images. To this end, we propose an energy-informed diffusion model for unpaired photo-realistic image deraining (UPID-EDM). Initially, we delve into the intricate vi… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  9. arXiv:2407.17184  [pdf, other

    hep-ex

    Search for $η_{c}(2S)\to K^+ K^- η^{\prime}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times10^{9}$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII, we find an evidence of the $η_{c}(2S)\to K^+ K^- η^{\prime}$ decay with a statistical significance of 3.1$σ$. Its decay branching fraction is measured to be $(12.24\pm4.60(\mathrm{stat.})\pm2.37(\mathrm{syst.})\pm4.68(\mathrm{extr.}))\times 10^{-4}$, where the first uncertainty is stati… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  10. arXiv:2407.17126  [pdf

    cs.CL cs.AI

    SDoH-GPT: Using Large Language Models to Extract Social Determinants of Health (SDoH)

    Authors: Bernardo Consoli, Xizhi Wu, Song Wang, Xinyu Zhao, Yanshan Wang, Justin Rousseau, Tom Hartvigsen, Li Shen, Huanmei Wu, Yifan Peng, Qi Long, Tianlong Chen, Ying Ding

    Abstract: Extracting social determinants of health (SDoH) from unstructured medical notes depends heavily on labor-intensive annotations, which are typically task-specific, hampering reusability and limiting sharing. In this study we introduced SDoH-GPT, a simple and effective few-shot Large Language Model (LLM) method leveraging contrastive examples and concise instructions to extract SDoH without relying… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  11. arXiv:2407.15595  [pdf, other

    cs.LG cs.AI

    Discrete Flow Matching

    Authors: Itai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman

    Abstract: Despite Flow Matching and diffusion models having emerged as powerful generative paradigms for continuous variables such as images and videos, their application to high-dimensional discrete data, such as language, is still limited. In this work, we present Discrete Flow Matching, a novel discrete flow paradigm designed specifically for generating discrete data. Discrete Flow Matching offers severa… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  12. arXiv:2407.15411  [pdf, other

    cs.IR

    Scalable Dynamic Embedding Size Search for Streaming Recommendation

    Authors: Yunke Qu, Liang Qu, Tong Chen, Xiangyu Zhao, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: Recommender systems typically represent users and items by learning their embeddings, which are usually set to uniform dimensions and dominate the model parameters. However, real-world recommender systems often operate in streaming recommendation scenarios, where the number of users and items continues to grow, leading to substantial storage resource consumption for these embeddings. Although a fe… ▽ More

    Submitted 31 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: accepted to CIKM 2024

  13. arXiv:2407.15182  [pdf, other

    quant-ph

    Thermometry of Trapped Ions Based on Bichromatic Driving

    Authors: Xie-Qian Li, Yi Tao, Ting Chen, Wei Wu, Yi Xie, Chun-Wang Wu, Ping-Xing Chen

    Abstract: Accurate thermometry of laser-cooled ions is crucial for the performance of the trapped-ions quantum computing platform. However, most existing methods face a computational exponential bottleneck. Recently, a thermometry method based on bichromatic driving was theoretically proposed by Ivan Vybornyi et al. to overcome this obstacle, which allows the computational complexity to remain constant with… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  14. arXiv:2407.15128  [pdf, other

    math.RT math.NT

    A description of the integral depth-$r$ Bernstein center

    Authors: Tsao-Hsien Chen, Sarbartha Bhattacharya

    Abstract: In this paper we give a description of the depth-$r$ Bernstein center for non-negative integers $r$ of a reductive simply connected group $G$ over a non-archimedean local field as a limit of depth-$r$ standard parahoric Hecke algebras. Using the description, we construct maps from the algebra of stable functions on the $r$-th Moy-Prasad filtration quotient of hyperspecial parahorics to the depth-… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: 38 pages

  15. arXiv:2407.14903  [pdf, other

    cs.CV

    Automated Patient Positioning with Learned 3D Hand Gestures

    Authors: Zhongpai Gao, Abhishek Sharma, Meng Zheng, Benjamin Planche, Terrence Chen, Ziyan Wu

    Abstract: Positioning patients for scanning and interventional procedures is a critical task that requires high precision and accuracy. The conventional workflow involves manually adjusting the patient support to align the center of the target body part with the laser projector or other guiding devices. This process is not only time-consuming but also prone to inaccuracies. In this work, we propose an autom… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  16. arXiv:2407.14829  [pdf, other

    cs.CL

    Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

    Authors: Jiayu Lin, Guanrong Chen, Bojun Jin, Chenyang Li, Shutong Jia, Wancong Lin, Yang Sun, Yuhang He, Caihua Yang, Jianzhu Bao, Jipeng Wu, Wen Su, Jinglu Chen, Xinyi Li, Tianyu Chen, Mingjie Han, Shuaiwen Du, Zijian Wang, Jiyin Li, Fuzhong Suo, Hao Wang, Nuanchen Lin, Xuanjing Huang, Changjian Jiang, RuiFeng Xu , et al. (4 additional authors not shown)

    Abstract: In this paper we present the results of the AI-Debater 2023 Challenge held by the Chinese Conference on Affect Computing (CCAC 2023), and introduce the related datasets. We organize two tracks to handle the argumentative generation tasks in different scenarios, namely, Counter-Argument Generation (Track 1) and Claim-based Argument Generation (Track 2). Each track is equipped with its distinct data… ▽ More

    Submitted 24 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

  17. LinSATNet: The Positive Linear Satisfiability Neural Networks

    Authors: Runzhong Wang, Yunhao Zhang, Ziao Guo, Tianyi Chen, Xiaokang Yang, Junchi Yan

    Abstract: Encoding constraints into neural networks is attractive. This paper studies how to introduce the popular positive linear satisfiability to neural networks. We propose the first differentiable satisfiability layer based on an extension of the classic Sinkhorn algorithm for jointly encoding multiple sets of marginal distributions. We further theoretically characterize the convergence property of the… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: This is a revised version of our ICML'23 publication that fixes a minor issue in Eq (11). In Proceedings of the 40th International Conference on Machine Learning (ICML'23)

  18. arXiv:2407.13605  [pdf, other

    cs.LG

    Physics-guided Active Sample Reweighting for Urban Flow Prediction

    Authors: Wei Jiang, Tong Chen, Guanhua Ye, Wentao Zhang, Lizhen Cui, Zi Huang, Hongzhi Yin

    Abstract: Urban flow prediction is a spatio-temporal modeling task that estimates the throughput of transportation services like buses, taxis, and ride-sharing, where data-driven models have become the most popular solution in the past decade. Meanwhile, the implicitly learned mapping between historical observations to the prediction targets tend to over-simplify the dynamics of real-world urban flows, lead… ▽ More

    Submitted 6 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by Proceedings of the 33nd ACM International Conference on Information and Knowledge Management (CIKM '24)

  19. arXiv:2407.13322  [pdf, other

    cs.CV

    Fully Test-Time rPPG Estimation via Synthetic Signal-Guided Feature Learning

    Authors: Pei-Kai Huang, Tzu-Hsien Chen, Ya-Ting Chan, Kuan-Wen Chen, Chiou-Ting Hsu

    Abstract: Many remote photoplethysmography (rPPG) estimation models have achieved promising performance in the training domain but often fail to accurately estimate physiological signals or heart rates (HR) in the target domains. Domain generalization (DG) or domain adaptation (DA) techniques are therefore adopted during the offline training stage to adapt the model to either unobserved or observed target d… ▽ More

    Submitted 15 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  20. Identifying Smart Contract Security Issues in Code Snippets from Stack Overflow

    Authors: Jiachi Chen, Chong Chen, Jiang Hu, John Grundy, Yanlin Wang, Ting Chen, Zibin Zheng

    Abstract: Smart contract developers frequently seek solutions to developmental challenges on Q&A platforms such as Stack Overflow (SO). Although community responses often provide viable solutions, the embedded code snippets can also contain hidden vulnerabilities. Integrating such code directly into smart contracts may make them susceptible to malicious attacks. We conducted an online survey and received 74… ▽ More

    Submitted 23 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  21. arXiv:2407.13089  [pdf, other

    cs.AI cs.CL

    MetaSumPerceiver: Multimodal Multi-Document Evidence Summarization for Fact-Checking

    Authors: Ting-Chih Chen, Chia-Wei Tang, Chris Thomas

    Abstract: Fact-checking real-world claims often requires reviewing multiple multimodal documents to assess a claim's truthfulness, which is a highly laborious and time-consuming task. In this paper, we present a summarization model designed to generate claim-specific summaries useful for fact-checking from multimodal, multi-document datasets. The model takes inputs in the form of documents, images, and a cl… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 16 pages, 7 figures, The 62nd Annual Meeting of the Association for Computational Linguistics

  22. arXiv:2407.12270  [pdf, other

    hep-ex

    Observation of $Λ_c^+ \to Λa_0(980)^+$ and Evidence for $Σ(1380)^+$ in $Λ_c^+ \to Λπ^+ η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Based on $6.1~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at center-of-mass energies from 4.600~GeV to 4.843~GeV with the BESIII detector at the BEPCII collider, a partial wave analysis of $Λ_c^+\toΛπ^+η$ is performed, and branching fractions and decay asymmetry parameters of intermediate processes are determined. The process $Λ_c^+\toΛa_0(980)^+$ is observed for the first time, and… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 16 pages, 8 figures

  23. arXiv:2407.11727  [pdf, ps, other

    hep-ex hep-ph

    Measurement of the branching fraction of $D^+_s\to \ell^+ν_\ell$ via $e^+e^-\to D^{*+}_{s} D^{*-}_{s}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Based on $10.64~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data taken at center-of-mass energies between 4.237 and 4.699 GeV with the BESIII detector, we study the leptonic $D^+_s$ decays using the $e^+e^-\to D^{*+}_{s} D^{*-}_{s}$ process. The branching fractions of $D_s^+\to\ell^+ν_{\ell}\,(\ell=μ,τ)$ are measured to be $\mathcal{B}(D_s^+\toμ^+ν_μ)=(0.547\pm0.026_{\rm stat}\pm0.016_{\rm syst})\%$ a… ▽ More

    Submitted 18 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 27 pages, 13 figures

  24. arXiv:2407.11572  [pdf, other

    astro-ph.SR astro-ph.GA

    Discovery of an Extremely r-process-enhanced Thin-disk Star with [Eu/H] = +0.78

    Authors: Xiao-Jin Xie, Jianrong Shi, Hong-Liang Yan, Tian-Yi Chen, Carlos Allende Prieto, Timothy C. Beers, Shuai Liu, Chun-Qian Li, Ming-Yi Ding, Yao-Jia Tang, Ruizhi Zhang, Renjing Xie

    Abstract: Highly r-process-enhanced stars are rare and usually metal-poor ([Fe/H] < - 1.0), and mainly populate the Milky Way halo and dwarf galaxies. This study presents the discovery of a relatively bright (V = 12.72), highly r-process-enhanced (r-II) star ([Eu/Fe] = +1.32, [Ba/Eu] = - 0.95), LAMOST J020623.21 + 494127.9. This star was selected from the Large Sky Area Multi-Object Fiber Spectroscopic Tele… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 5 figures, 3 tables

    Journal ref: ApJL, 2024, Volume 970, Number 2, L30

  25. arXiv:2407.11277  [pdf, other

    cs.CL eess.AS

    Target conversation extraction: Source separation using turn-taking dynamics

    Authors: Tuochao Chen, Qirui Wang, Bohan Wu, Malek Itani, Sefik Emre Eskimez, Takuya Yoshioka, Shyamnath Gollakota

    Abstract: Extracting the speech of participants in a conversation amidst interfering speakers and noise presents a challenging problem. In this paper, we introduce the novel task of target conversation extraction, where the goal is to extract the audio of a target conversation based on the speaker embedding of one of its participants. To accomplish this, we propose leveraging temporal patterns inherent in h… ▽ More

    Submitted 29 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted by Interspeech 2024

  26. arXiv:2407.11055  [pdf, other

    cs.LG cs.SD eess.AS

    Knowledge boosting during low-latency inference

    Authors: Vidya Srinivas, Malek Itani, Tuochao Chen, Sefik Emre Eskimez, Takuya Yoshioka, Shyamnath Gollakota

    Abstract: Models for low-latency, streaming applications could benefit from the knowledge capacity of larger models, but edge devices cannot run these models due to resource constraints. A possible solution is to transfer hints during inference from a large model running remotely to a small model running on-device. However, this incurs a communication delay that breaks real-time requirements and does not gu… ▽ More

    Submitted 25 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted by Interspeech 2024

  27. arXiv:2407.11030  [pdf, other

    cs.LG cs.AI cs.CL

    DLO: Dynamic Layer Operation for Efficient Vertical Scaling of LLMs

    Authors: Zhen Tan, Daize Dong, Xinyu Zhao, Jie Peng, Yu Cheng, Tianlong Chen

    Abstract: In this paper, we introduce Dynamic Layer Operations (DLO), a novel approach for vertically scaling transformer-based Large Language Models (LLMs) by dynamically expanding, activating, or skipping layers using a sophisticated routing policy based on layerwise feature similarity. Unlike traditional Mixture-of-Experts (MoE) methods that focus on extending the model width, our approach targets model… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  28. arXiv:2407.10691  [pdf, other

    cs.IR cs.CL

    $\texttt{MixGR}$: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity

    Authors: Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl

    Abstract: Recent studies show the growing significance of document retrieval in the generation of LLMs, i.e., RAG, within the scientific domain by bridging their knowledge gap. However, dense retrievers often struggle with domain-specific retrieval and complex query-document relationships, particularly when query segments correspond to various parts of a document. To alleviate such prevalent challenges, thi… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  29. arXiv:2407.10613  [pdf, other

    physics.plasm-ph

    Global destabilization of drift-tearing mode with coupling to discretized electron drift-wave instability

    Authors: J. Bao, W. L. Zhang, Z. Lin, H. S. Cai, D. J. Liu, H. T. Chen, C. Dong, J. T. Cao, D. Li

    Abstract: The global linear behaviors of 2/1 DTM in the collisional regime are investigated based on a concisely resistive drift-MHD model. Besides DTM, extra normal modes including EDW and SAW are coupled together and destabilized in different parameter regimes by considering resistivity in this system. The EVP approach is applied for solving the eigenstate spectra with the distribution of all unstable sol… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 23 pages, 15 figues

  30. arXiv:2407.10275  [pdf, other

    cs.CL cs.AI

    Cross-Lingual Multi-Hop Knowledge Editing -- Benchmarks, Analysis and a Simple Contrastive Learning based Approach

    Authors: Aditi Khandelwal, Harman Singh, Hengrui Gu, Tianlong Chen, Kaixiong Zhou

    Abstract: Large language models are often expected to constantly adapt to new sources of knowledge and knowledge editing techniques aim to efficiently patch the outdated model knowledge, with minimal modification. Most prior works focus on monolingual knowledge editing in English, even though new information can emerge in any language from any part of the world. We propose the Cross-Lingual Multi-Hop Knowle… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Paper on Cross-Lingual Multi-Hop Knowledge Editing

  31. arXiv:2407.10267  [pdf, other

    cs.CV

    RS-NeRF: Neural Radiance Fields from Rolling Shutter Images

    Authors: Muyao Niu, Tong Chen, Yifan Zhan, Zhuoxiao Li, Xiang Ji, Yinqiang Zheng

    Abstract: Neural Radiance Fields (NeRFs) have become increasingly popular because of their impressive ability for novel view synthesis. However, their effectiveness is hindered by the Rolling Shutter (RS) effects commonly found in most camera systems. To solve this, we present RS-NeRF, a method designed to synthesize normal images from novel views using input with RS distortions. This involves a physical mo… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: ECCV 2024 ; Codes and data: https://github.com/MyNiuuu/RS-NeRF

  32. arXiv:2407.09860  [pdf, other

    quant-ph cond-mat.stat-mech

    Quantum Vicsek Model for Active Matter

    Authors: Hong Yuan, L. X. Cui, L. T. Chen, C. P. Sun

    Abstract: We propose a quantum analog of the Vicsek model, consisting of an ensemble of overdamped spin$-1/2$ particles with ferromagnetic couplings, driven by a uniformly polarized magnetic field. The spontaneous magnetization of the spin components breaks the $SO(3)$ (or $SO(2)$) symmetry, inducing an ordered phase of flocking. We derive the hydrodynamic equations, similar to those formulated by Toner and… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  33. arXiv:2407.09694  [pdf, other

    cs.CV

    Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images

    Authors: Tianyu Luan, Zhongpai Gao, Luyuan Xie, Abhishek Sharma, Hao Ding, Benjamin Planche, Meng Zheng, Ange Lou, Terrence Chen, Junsong Yuan, Ziyan Wu

    Abstract: We introduce a novel bottom-up approach for human body mesh reconstruction, specifically designed to address the challenges posed by partial visibility and occlusion in input images. Traditional top-down methods, relying on whole-body parametric models like SMPL, falter when only a small part of the human is visible, as they require visibility of most of the human body for accurate mesh reconstruc… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  34. arXiv:2407.09292  [pdf, other

    cs.CR

    Counterfactual Explainable Incremental Prompt Attack Analysis on Large Language Models

    Authors: Dong Shu, Mingyu Jin, Tianle Chen, Chong Zhang, Yongfeng Zhang

    Abstract: This study sheds light on the imperative need to bolster safety and privacy measures in large language models (LLMs), such as GPT-4 and LLaMA-2, by identifying and mitigating their vulnerabilities through explainable analysis of prompt attacks. We propose Counterfactual Explainable Incremental Prompt Attack (CEIPA), a novel technique where we guide prompts in a specific manner to quantitatively me… ▽ More

    Submitted 17 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 23 pages, 6 figures

  35. arXiv:2407.09274  [pdf, other

    cs.LG cs.AI q-bio.BM

    Unifying Sequences, Structures, and Descriptions for Any-to-Any Protein Generation with the Large Multimodal Model HelixProtX

    Authors: Zhiyuan Chen, Tianhao Chen, Chenggang Xie, Yang Xue, Xiaonan Zhang, Jingbo Zhou, Xiaomin Fang

    Abstract: Proteins are fundamental components of biological systems and can be represented through various modalities, including sequences, structures, and textual descriptions. Despite the advances in deep learning and scientific large language models (LLMs) for protein research, current methodologies predominantly focus on limited specialized tasks -- often predicting one protein modality from another. Th… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  36. arXiv:2407.08956  [pdf, other

    cs.CR cs.SE

    DeCE: Deceptive Cross-Entropy Loss Designed for Defending Backdoor Attacks

    Authors: Guang Yang, Yu Zhou, Xiang Chen, Xiangyu Zhang, Terry Yue Zhuo, David Lo, Taolue Chen

    Abstract: Code Language Models (CLMs), particularly those leveraging deep learning, have achieved significant success in code intelligence domain. However, the issue of security, particularly backdoor attacks, is often overlooked in this process. The previous research has focused on designing backdoor attacks for CLMs, but effective defenses have not been adequately addressed. In particular, existing defens… ▽ More

    Submitted 20 August, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Under Review; Waiting for updates

  37. arXiv:2407.08016  [pdf, other

    eess.AS cs.SD

    Source Tracing of Audio Deepfake Systems

    Authors: Nicholas Klein, Tianxiang Chen, Hemlata Tak, Ricardo Casal, Elie Khoury

    Abstract: Recent progress in generative AI technology has made audio deepfakes remarkably more realistic. While current research on anti-spoofing systems primarily focuses on assessing whether a given audio sample is fake or genuine, there has been limited attention on discerning the specific techniques to create the audio deepfakes. Algorithms commonly used in audio deepfake generation, like text-to-speech… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by INTERSPEECH 2024

  38. arXiv:2407.07884  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation

    Authors: Tao Chen, Eric Cousineau, Naveen Kuppuswamy, Pulkit Agrawal

    Abstract: Recent studies have made significant progress in addressing dexterous manipulation problems, particularly in in-hand object reorientation. However, there are few existing works that explore the potential utilization of developed dexterous manipulation controllers for downstream tasks. In this study, we focus on constrained dexterous manipulation for food peeling. Food peeling presents various cons… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  39. arXiv:2407.07651  [pdf, other

    hep-ex physics.data-an

    Study of the decay and production properties of $D_{s1}(2536)$ and $D_{s2}^*(2573)$

    Authors: M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (645 additional authors not shown)

    Abstract: The $e^+e^-\rightarrow D_s^+D_{s1}(2536)^-$ and $e^+e^-\rightarrow D_s^+D^*_{s2}(2573)^-$ processes are studied using data samples collected with the BESIII detector at center-of-mass energies from 4.530 to 4.946~GeV. The absolute branching fractions of $D_{s1}(2536)^- \rightarrow \bar{D}^{*0}K^-$ and $D_{s2}^*(2573)^- \rightarrow \bar{D}^0K^-$ are measured for the first time to be… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  40. arXiv:2407.07155  [pdf, other

    astro-ph.HE astro-ph.IM

    Across the soft gamma-ray regime: utilizing simultaneous detections in the Compton Spectrometer and Imager (COSI) and the Background and Transient Observer (BTO) to understand astrophysical transients

    Authors: Hannah C. Gulick, Eliza Neights, Samer Al Nussirat, Claire Tianyi Chen, Kaylie Ching, Cassandra Dove, Alyson Joens, Carolyn Kierans, Hubert Liu, Israel Martinez, Tomas Mician, Shunsaku Nagasawa, Shreya Nandyala, Isabel Schmidtke, Derek Shah, Andreas Zoglauer, Kazuhiro Nakasawa, Tadayuki Takahashi, Juan-Carlos Martinez Oliveros, John A. Tomsick

    Abstract: The Compton Spectrometer and Imager (COSI) is a NASA funded Small Explorer (SMEX) mission slated to launch in 2027. COSI will house a wide-field gamma-ray telescope designed to survey the entire sky in the 0.2--5 MeV range. Using germanium detectors, the instrument will provide imaging, spectroscopy, and polarimetry of astrophysical sources with excellent energy resolution and degree-scale localiz… ▽ More

    Submitted 28 August, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 12 pages of text with an additional 2 pages for acknowledgments and citations. 9 figures. 1 table

    Journal ref: SPIE, 2024

  41. arXiv:2407.07087  [pdf, other

    cs.CL cs.LG

    CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation

    Authors: Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh

    Abstract: Evaluating the degree of reproduction of copyright-protected content by language models (LMs) is of significant interest to the AI and legal communities. Although both literal and non-literal similarities are considered by courts when assessing the degree of reproduction, prior research has focused only on literal similarities. To bridge this gap, we introduce CopyBench, a benchmark designed to me… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  42. arXiv:2407.06844  [pdf, other

    cs.CV

    Dynamic Correlation Learning and Regularization for Multi-Label Confidence Calibration

    Authors: Tianshui Chen, Weihang Wang, Tao Pu, Jinghui Qin, Zhijing Yang, Jie Liu, Liang Lin

    Abstract: Modern visual recognition models often display overconfidence due to their reliance on complex deep neural networks and one-hot target supervision, resulting in unreliable confidence scores that necessitate calibration. While current confidence calibration techniques primarily address single-label scenarios, there is a lack of focus on more practical and generalizable multi-label contexts. This pa… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: submitted to TIP

  43. arXiv:2407.06503  [pdf, other

    cs.LG

    Preference-Guided Reinforcement Learning for Efficient Exploration

    Authors: Guojian Wang, Faguo Wu, Xiao Zhang, Tianyuan Chen, Xuyang Chen, Lin Zhao

    Abstract: In this paper, we investigate preference-based reinforcement learning (PbRL) that allows reinforcement learning (RL) agents to learn from human feedback. This is particularly valuable when defining a fine-grain reward function is not feasible. However, this approach is inefficient and impractical for promoting deep exploration in hard-exploration tasks with long horizons and sparse rewards. To tac… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 13 pages, 17 figures

  44. arXiv:2407.06483  [pdf, other

    cs.LG cs.CL

    Composable Interventions for Language Models

    Authors: Arinbjorn Kolbeinsson, Kyle O'Brien, Tianjin Huang, Shanghua Gao, Shiwei Liu, Jonathan Richard Schwarz, Anurag Vaidya, Faisal Mahmood, Marinka Zitnik, Tianlong Chen, Thomas Hartvigsen

    Abstract: Test-time interventions for language models can enhance factual accuracy, mitigate harmful outputs, and improve model efficiency without costly retraining. But despite a flood of new methods, different types of interventions are largely developing independently. In practice, multiple interventions must be applied sequentially to the same model, yet we lack standardized ways to study how interventi… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  45. arXiv:2407.06304  [pdf, other

    cs.CV cs.AI cs.CL

    VIMI: Grounding Video Generation through Multi-modal Instruction

    Authors: Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chien Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov

    Abstract: Existing text-to-video diffusion models rely solely on text-only encoders for their pretraining. This limitation stems from the absence of large-scale multimodal prompt video datasets, resulting in a lack of visual grounding and restricting their versatility and application in multimodal integration. To address this, we construct a large-scale multimodal prompt dataset by employing retrieval metho… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  46. arXiv:2407.06079  [pdf, other

    cs.CV cs.AI

    Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis

    Authors: Emaad Khwaja, Abdullah Rashwan, Ting Chen, Oliver Wang, Suraj Kothawade, Yeqing Li

    Abstract: We present a one-shot text-to-image diffusion model that can generate high-resolution images from natural language descriptions. Our model employs a layered U-Net architecture that simultaneously synthesizes images at multiple resolution scales. We show that this method outperforms the baseline of synthesizing images only at the target resolution, while reducing the computational cost per step. We… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  47. arXiv:2407.05160  [pdf, other

    physics.atom-ph physics.chem-ph

    Relativistic Exact Two-Component Coupled-Cluster Study of Molecular Sensitivity Factors for Nuclear Schiff Moments

    Authors: Tianxiang Chen, Chaoqun Zhang, Lan Cheng, Kia Boon Ng, Stephan Malbrunot-Ettenauer, Victor V. Flambaum, Zack Lasner, John M. Doyle, Phelan Yu, Chandler J. Conn, Chi Zhang, Nicholas R. Hutzler, Andrew M. Jayich, Benjamin Augenbraun, David Demille

    Abstract: Relativistic exact two-component coupled-cluster calculations of molecular sensitivity factors for nuclear Schiff moments (NSMs) are reported. We focus on molecules containing heavy nuclei, especially octupole-deformed nuclei. Analytic relativistic coupled-cluster gradient techniques are used and serve as useful tools for identifying candidate molecules that sensitively probe for physics beyond th… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  48. arXiv:2407.04686  [pdf, other

    cs.DS math.NA

    Near-optimal hierarchical matrix approximation from matrix-vector products

    Authors: Tyler Chen, Feyza Duman Keles, Diana Halikias, Cameron Musco, Christopher Musco, David Persson

    Abstract: We describe a randomized algorithm for producing a near-optimal hierarchical off-diagonal low-rank (HODLR) approximation to an $n\times n$ matrix $\mathbf{A}$, accessible only though matrix-vector products with $\mathbf{A}$ and $\mathbf{A}^{\mathsf{T}}$. We prove that, for the rank-$k$ HODLR approximation problem, our method achieves a $(1+β)^{\log(n)}$-optimal approximation in expected Frobenius… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  49. arXiv:2407.04338  [pdf, other

    quant-ph

    Entanglement distribution based on quantum walk in arbitrary quantum networks

    Authors: Tianen Chen, Yun Shang, Chitong Chen, Heng Fan

    Abstract: In large-scale quantum networks, distributing the multi-particle entangled state among selected nodes is crucial for realizing long-distance and complicated quantum communication. Quantum repeaters provides an efficient method to generate entanglement between distant nodes. However, it is difficult to extend quantum repeater protocols to high-dimensional quantum states in existing experiments. Her… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  50. arXiv:2407.04055  [pdf, other

    q-bio.QM cs.AI cs.LG

    Benchmark on Drug Target Interaction Modeling from a Structure Perspective

    Authors: Xinnan Zhang, Jialin Wu, Junyi Xie, Tianlong Chen, Kaixiong Zhou

    Abstract: The prediction modeling of drug-target interactions is crucial to drug discovery and design, which has seen rapid advancements owing to deep learning technologies. Recently developed methods, such as those based on graph neural networks (GNNs) and Transformers, demonstrate exceptional performance across various datasets by effectively extracting structural information. However, the benchmarking of… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Submitted to NIPS 2024 Dataset and Benchmark