Zum Hauptinhalt springen

Showing 251–300 of 622 results for author: Wen, Y

.
  1. arXiv:2306.13651  [pdf, other

    cs.CL cs.LG

    Bring Your Own Data! Self-Supervised Evaluation for Large Language Models

    Authors: Neel Jain, Khalid Saifullah, Yuxin Wen, John Kirchenbauer, Manli Shu, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein

    Abstract: With the rise of Large Language Models (LLMs) and their ubiquitous deployment in diverse domains, measuring language model behavior on realistic data is imperative. For example, a company deploying a client-facing chatbot must ensure that the model will not respond to client requests with profanity. Current evaluations approach this problem using small, domain-specific datasets with human-curated… ▽ More

    Submitted 29 June, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: Code is available at https://github.com/neelsjain/BYOD. First two authors contributed equally. 21 pages, 22 figures

  2. arXiv:2306.12457  [pdf, other

    cs.LG cs.AI q-bio.PE

    Deep Dynamic Epidemiological Modelling for COVID-19 Forecasting in Multi-level Districts

    Authors: Ruhan Liu, Jiajia Li, Yang Wen, Huating Li, Ping Zhang, Bin Sheng, David Dagan Feng

    Abstract: Objective: COVID-19 has spread worldwide and made a huge influence across the world. Modeling the infectious spread situation of COVID-19 is essential to understand the current condition and to formulate intervention measurements. Epidemiological equations based on the SEIR model simulate disease development. The traditional parameter estimation method to solve SEIR equations could not precisely f… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  3. arXiv:2306.11335  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Surfer: Progressive Reasoning with World Models for Robotic Manipulation

    Authors: Pengzhen Ren, Kaidong Zhang, Hetao Zheng, Zixuan Li, Yuhang Wen, Fengda Zhu, Mas Ma, Xiaodan Liang

    Abstract: Considering how to make the model accurately understand and follow natural language instructions and perform actions consistent with world knowledge is a key challenge in robot manipulation. This mainly includes human fuzzy instruction reasoning and the following of physical knowledge. Therefore, the embodied intelligence agent must have the ability to model world knowledge from training data. How… ▽ More

    Submitted 20 March, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  4. arXiv:2306.10255  [pdf, other

    astro-ph.HE astro-ph.EP astro-ph.IM

    The First GECAM Observation Results on Terrestrial Gamma-ray Flashes and Terrestrial Electron Beams

    Authors: Y. Zhao, J. C. Liu, S. L. Xiong, W. C. Xue, Q. B. Yi, G. P. Lu, W. Xu, F. C. Lyu, J. C. Sun, W. X. Peng, C. Zheng, Y. Q. Zhang, C. Cai, S. Xiao, S. L. Xie, C. W. Wang, W. J. Tan, Z. H. An, G. Chen, Y. Q. Du, Y. Huang, M. Gao, K. Gong, D. Y. Guo, J. J. He , et al. (37 additional authors not shown)

    Abstract: Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) is a space-borne instrument dedicated to monitoring high-energy transients, including Terrestrial Gamma-ray Flashes (TGFs) and Terrestrial Electron Beams (TEBs). We implemented a TGF/TEB search algorithm for GECAM, with which 147 bright TGFs, 2 typical TEBs and 2 special TEB-like events are identified during an effe… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: The paper was accepted by Geophysical Research Letters on June 16th, 2023

  5. Emotional Speech-Driven Animation with Content-Emotion Disentanglement

    Authors: Radek Daněček, Kiran Chhatre, Shashank Tripathi, Yandong Wen, Michael J. Black, Timo Bolkart

    Abstract: To be widely adopted, 3D facial avatars must be animated easily, realistically, and directly from speech signals. While the best recent methods generate 3D animations that are synchronized with the input audio, they largely ignore the impact of emotions on facial expressions. Realistic facial animation requires lip-sync together with the natural expression of emotion. To that end, we propose EMOTE… ▽ More

    Submitted 26 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: SIGGRAPH Asia 2023 Conference Paper

  6. arXiv:2306.05437  [pdf, other

    cs.LG cs.AI

    One-step Multi-view Clustering with Diverse Representation

    Authors: Xinhang Wan, Jiyuan Liu, Xinwang Liu, Siwei Wang, Yi Wen, Tianjiao Wan, Li Shen, En Zhu

    Abstract: Multi-view clustering has attracted broad attention due to its capacity to utilize consistent and complementary information among views. Although tremendous progress has been made recently, most existing methods undergo high complexity, preventing them from being applied to large-scale tasks. Multi-view clustering via matrix factorization is a representative to address this issue. However, most of… ▽ More

    Submitted 27 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  7. arXiv:2306.05429  [pdf

    physics.optics

    Spin photonics on chip based on a twinning crystal metamaterial

    Authors: Yan Li, Jingbo Sun, Yongzheng Wen, Xiaoyu Xiong, Ji Zhou

    Abstract: Two-dimensional photonic circuits with high capacity are essential for a wide range of applications in next-generation photonic information technology and optoelectronics. Here we demonstrate a multi-channel spin-dependent photonic device based on a twinning crystal metamaterial. The structural symmetry and material symmetry of the twinning crystal metamaterial enable a total of 4 channels carryin… ▽ More

    Submitted 19 May, 2023; originally announced June 2023.

  8. arXiv:2306.04634  [pdf, other

    cs.LG cs.CL cs.CR

    On the Reliability of Watermarks for Large Language Models

    Authors: John Kirchenbauer, Jonas Geiping, Yuxin Wen, Manli Shu, Khalid Saifullah, Kezhi Kong, Kasun Fernando, Aniruddha Saha, Micah Goldblum, Tom Goldstein

    Abstract: As LLMs become commonplace, machine-generated text has the potential to flood the internet with spam, social media bots, and valueless content. Watermarking is a simple and effective strategy for mitigating such harms by enabling the detection and documentation of LLM-generated text. Yet a crucial question remains: How reliable is watermarking in realistic settings in the wild? There, watermarked… ▽ More

    Submitted 1 May, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages in the main body. Published at ICLR 2024. Code is available at https://github.com/jwkirchenbauer/lm-watermarking

  9. arXiv:2306.03424  [pdf, other

    cs.CV

    GCD-DDPM: A Generative Change Detection Model Based on Difference-Feature Guided DDPM

    Authors: Yihan Wen, Xianping Ma, Xiaokang Zhang, Man-On Pun

    Abstract: Deep learning (DL)-based methods have recently shown great promise in bitemporal change detection (CD). Existing discriminative methods based on Convolutional Neural Networks (CNNs) and Transformers rely on discriminative representation learning for change recognition while struggling with exploring local and long-range contextual dependencies. As a result, it is still challenging to obtain fine-g… ▽ More

    Submitted 2 March, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

  10. arXiv:2306.03359  [pdf

    physics.optics physics.app-ph

    Optical vortices enabled by structural vortices

    Authors: Yuanfeng Liu, Le Zhou, Mengfan Guo, Zongqi Xu, Jing Ma, Yongzheng Wen, Natalia M. Litchinitser, Yang Shen, Jingbo Sun, Ji Zhou

    Abstract: The structural symmetry of solids plays an important role in defining their linear and nonlinear optical properties. The quest for versatile, cost-effective, large-scale, and defect-free approaches and materials platforms for tailoring structural and optical properties on demand has been underway for decades. We experimentally demonstrate a bottom-up self-assembly-based organic engineered material… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  11. arXiv:2306.03034  [pdf, other

    cs.AI cs.HC

    Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination

    Authors: Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan

    Abstract: Securing coordination between AI agent and teammates (human players or AI agents) in contexts involving unfamiliar humans continues to pose a significant challenge in Zero-Shot Coordination. The issue of cooperative incompatibility becomes particularly prominent when an AI agent is unsuccessful in synchronizing with certain previously unknown partners. Traditional algorithms have aimed to collabor… ▽ More

    Submitted 7 January, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 46 pages. arXiv admin note: substantial text overlap with arXiv:2302.04831

  12. arXiv:2305.20030  [pdf, other

    cs.LG cs.CR cs.CV

    Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust

    Authors: Yuxin Wen, John Kirchenbauer, Jonas Geiping, Tom Goldstein

    Abstract: Watermarking the outputs of generative models is a crucial technique for tracing copyright and preventing potential harm from AI-generated content. In this paper, we introduce a novel technique called Tree-Ring Watermarking that robustly fingerprints diffusion model outputs. Unlike existing methods that perform post-hoc modifications to images after sampling, Tree-Ring Watermarking subtly influenc… ▽ More

    Submitted 3 July, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 16 pages, 8 figures, code is available at https://github.com/YuxinWenRick/tree-ring-watermark, fixed the repo link

  13. Demonstration of the quantum principle of least action with single photons

    Authors: Yong-Li Wen, Yunfei Wang, Li-Man Tian, Shanchao Zhang, Jianfeng Li, Jing-Song Du, Hui Yan, Shi-Liang Zhu

    Abstract: The principle of least action is arguably the most fundamental principle in physics as it can be used to derive the equations of motion in various branches of physics. However, this principle has not been experimentally demonstrated at the quantum level because the propagators for Feymann's path integrals have never been observed. The propagator is a fundamental concept and contains various signif… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Published online: 22 May 2023, Nature Photonics

  14. arXiv:2305.18576  [pdf, other

    cs.CL

    TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding

    Authors: Zichen Liu, Xuyuan Liu, Yanlong Wen, Guoqing Zhao, Fen Xia, Xiaojie Yuan

    Abstract: ICD coding is designed to assign the disease codes to electronic health records (EHRs) upon discharge, which is crucial for billing and clinical statistics. In an attempt to improve the effectiveness and efficiency of manual coding, many methods have been proposed to automatically predict ICD codes from clinical notes. However, most previous works ignore the decisive information contained in struc… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    ACM Class: I.2.7

  15. arXiv:2305.18498  [pdf, other

    cs.PL cs.AI cs.CL cs.LG

    ANPL: Towards Natural Programming with Interactive Decomposition

    Authors: Di Huang, Ziyuan Nan, Xing Hu, Pengwei Jin, Shaohui Peng, Yuanbo Wen, Rui Zhang, Zidong Du, Qi Guo, Yewen Pu, Yunji Chen

    Abstract: Though LLMs are capable of generating plausible programs, it's challenging to interact with the LLMs further to revise the program, especially if the user's specific requirements are different from the initial proposal. In this paper, we introduce ANPL, an interactive programming system that ensures users can always refine the generated code towards their specific programmatic intents via structur… ▽ More

    Submitted 30 November, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  16. Tuning atom-field interaction via phase shaping

    Authors: Y. -T. Cheng, C. -H. Chien, K. -M. Hsieh, Y. -H. Huang, P. Y. Wen, W. -J. Lin, Y. Lu, F. Aziz, C. -P. Lee, K. -T. Lin, C. -Y. Chen, J. C. Chen, C. -S. Chuu, A. F. Kockum, G. -D. Lin, Y. -H. Lin, I. -C. Hoi

    Abstract: A coherent electromagnetic field can be described by its amplitude, frequency, and phase. All these properties can influence the interaction between the field and an atom. Here we demonstrate the phase shaping of microwaves that are scattered by a superconducting artificial atom coupled to the end of a semi-infinite 1D transmission line. In particular, we input a weak exponentially rising pulse wi… ▽ More

    Submitted 26 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Journal ref: Physical Review A 109, 023705 (2024)

  17. Separable Ball around any Full-Rank Multipartite Product State

    Authors: Robin Yunfei Wen, Achim Kempf

    Abstract: We show that around any $m$-partite product state $ρ_{\rm prod}=ρ_1\otimes...\otimesρ_m$ of full rank (that is ${\rm det}(ρ_{\rm prod})\neq 0)$, there exists a finite-sized closed ball of separable states centered around $ρ_{\rm prod}$ whose radius is $β:=2^{1-m/2}λ_{\rm min}(ρ_{\rm prod})$. Here, $λ_{\rm min}(ρ_{\rm prod})$ is the smallest eigenvalue of $ρ_{\rm prod}$. We are assuming that the to… ▽ More

    Submitted 9 June, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages. v2: new sections, added references

    Journal ref: 2023 J. Phys. A: Math. Theor. 56 335302

  18. First-principles Prediction of Potential Candidate Materials MCu$_3$X$_4$ (M = V, Nb, Ta; X = S, Se, Te) for Neuromorphic Computing

    Authors: Baoxing Zhai, Ruiqing Cheng, Tianxing Wang, Li Liu, Lei Yin, Yao Wen, Hao Wang, Sheng Chang, Jun He

    Abstract: Inspired by the neuro-synaptic frameworks in the human brain, neuromorphic computing is expected to overcome the bottleneck of traditional von-Neumann architecture and be used in artificial intelligence. Here, we predict a class of potential candidate materials, MCu$_3$X$_4$ (M = V, Nb, Ta; X = S, Se, Te), for neuromorphic computing applications through first-principles calculations based on densi… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: 28+8 pages, 18 figures

    Journal ref: Phys. Rev. Applied 19, 054045 (2023)

  19. arXiv:2304.13678  [pdf, other

    cs.CV

    A marker-less human motion analysis system for motion-based biomarker discovery in knee disorders

    Authors: Kai Armstrong, Lei Zhang, Yan Wen, Alexander P. Willmott, Paul Lee, Xujioing Ye

    Abstract: In recent years the NHS has been having increased difficulty seeing all low-risk patients, this includes but not limited to suspected osteoarthritis (OA) patients. To help address the increased waiting lists and shortages of staff, we propose a novel method of automated biomarker identification for diagnosis of knee disorders and the monitoring of treatment progression. The proposed method allows… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 11 pages, 5 figures

  20. arXiv:2304.11966  [pdf, other

    cs.CV

    ICDAR 2023 Competition on Reading the Seal Title

    Authors: Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai

    Abstract: Reading seal title text is a challenging task due to the variable shapes of seals, curved text, background noise, and overlapped text. However, this important element is commonly found in official and financial scenarios, and has not received the attention it deserves in the field of OCR technology. To promote research in this area, we organized ICDAR 2023 competition on reading the seal title (Re… ▽ More

    Submitted 5 June, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: ICDAR2023 Competition on ReST report (To be appear in ICDAR 2023)

  21. Sweeping Horndeski Canvas: New Growth-Rate Parameterization for Modified-Gravity Theories

    Authors: Yuewei Wen, Nhat-Minh Nguyen, Dragan Huterer

    Abstract: We propose and numerically validate a new fitting formula that is sufficiently accurate to model the growth of structure in Horndeski theories of modified gravity for upcoming Stage IV and V large-scale structure surveys. Based on an analysis of more than 18,000 Horndeski models and adopting the popular parameterization of the growth rate $f(z) = Ω_{M}(z)^γ$, we generalize the constant growth inde… ▽ More

    Submitted 6 September, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: 23 pages, 6 figures; prepared for JCAP submission; comments welcome! v2: Add missing footnote on author role on the first page; fix latex package rendering error in a section title. v3: Match the version accepted to JCAP; correct repeated references

    Report number: LCTP-23-05

    Journal ref: JCAP09(2023)028

  22. arXiv:2304.03217  [pdf, other

    cond-mat.mtrl-sci physics.app-ph physics.ins-det

    Exploring the Spin Dynamics of a Room-Temperature Diamond Maser using an Extended rate Equation Model

    Authors: Yongqiang Wen, Philip L. Diggle, Neil McN. Alford, Daan M. Arroo

    Abstract: Masers - the microwave analogue of lasers - are coherent microwave sources that can act as oscillators or quantum-limited amplifiers. Masers have historically required high vacuum and cryogenic temperatures to operate, but recently masers based on diamond have been demonstrated to operate at room temperature and pressure, opening a route to new applications as ultra-low noise microwave amplifiers.… ▽ More

    Submitted 7 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 23 pages, 8 figures

    Journal ref: J. Appl. Phys. 134, 194501 (2023)

  23. arXiv:2304.02860  [pdf, other

    cs.CV

    Towards an Effective and Efficient Transformer for Rain-by-snow Weather Removal

    Authors: Tao Gao, Yuanbo Wen, Kaihao Zhang, Peng Cheng, Ting Chen

    Abstract: Rain-by-snow weather removal is a specialized task in weather-degraded image restoration aiming to eliminate coexisting rain streaks and snow particles. In this paper, we propose RSFormer, an efficient and effective Transformer that addresses this challenge. Initially, we explore the proximity of convolution networks (ConvNets) and vision Transformers (ViTs) in hierarchical architectures and exper… ▽ More

    Submitted 27 October, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: code is available at \url{https://github.com/chdwyb/RSFormer}

  24. The H$α$ broadband photometric reverberation mapping of four Seyfert 1 galaxies

    Authors: Qinchun Ma, Xue-Bing Wu, Huapeng Gu, Yuhan Wen, Yuming Fu

    Abstract: Broadband photometric reverberation mapping (PRM) have been investigated for AGNs in recent years, but mostly on accretion disk continuum RM. Due to the small fraction of broad emission lines in the broadband, PRM for emission lines is very challenging. Here we present an ICCF-Cut method for broadband PRM to obtain the H$α$ broad line lag and apply it to four Seyfert 1 galaxies, MCG+08-11-011, NGC… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: 22 pages, 19 figures, accepted for publication in ApJ

  25. arXiv:2303.02489  [pdf, other

    cs.CV

    CapDet: Unifying Dense Captioning and Open-World Detection Pretraining

    Authors: Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang

    Abstract: Benefiting from large-scale vision-language pre-training on image-text pairs, open-world detection methods have shown superior generalization ability under the zero-shot or few-shot detection settings. However, a pre-defined category space is still required during the inference stage of existing methods and only the objects belonging to that space will be predicted. To introduce a "real" open-worl… ▽ More

    Submitted 15 March, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023

  26. arXiv:2303.01983  [pdf, ps, other

    cs.LG cs.AI cs.CV

    Auto-weighted Multi-view Clustering for Large-scale Data

    Authors: Xinhang Wan, Xinwang Liu, Jiyuan Liu, Siwei Wang, Yi Wen, Weixuan Liang, En Zhu, Zhe Liu, Lu Zhou

    Abstract: Multi-view clustering has gained broad attention owing to its capacity to exploit complementary information across multiple data views. Although existing methods demonstrate delightful clustering performance, most of them are of high time complexity and cannot handle large-scale data. Matrix factorization-based models are a representative of solving this problem. However, they assume that the view… ▽ More

    Submitted 20 January, 2023; originally announced March 2023.

  27. Improved Inner Approximation for Aggregating Power Flexibility in Active Distribution Networks and its Applications

    Authors: Yilin Wen, Zechun Hu, Jinhua He, Yi Guo

    Abstract: Concise and reliable modeling for aggregating power flexibility of distributed energy resources in active distribution networks (ADNs) is a crucial technique for coordinating transmission and distribution networks. Our recent research has successfully derived an explicit expression for the exact aggregation model (EAM) of power flexibility at the substation level under linearized distribution netw… ▽ More

    Submitted 24 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 10 pages

  28. arXiv:2303.01277  [pdf, other

    cs.DC cs.AI cs.LG

    Boosting Distributed Full-graph GNN Training with Asynchronous One-bit Communication

    Authors: Meng Zhang, Qinghao Hu, Peng Sun, Yonggang Wen, Tianwei Zhang

    Abstract: Training Graph Neural Networks (GNNs) on large graphs is challenging due to the conflict between the high memory demand and limited GPU memory. Recently, distributed full-graph GNN training has been widely adopted to tackle this problem. However, the substantial inter-GPU communication overhead can cause severe throughput degradation. Existing communication compression techniques mainly focus on t… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  29. arXiv:2303.00270  [pdf, ps, other

    math.DG math.AP

    Stability and energy identity for Yang-Mills-Higgs pairs

    Authors: Xiaoli Han, Xishen Jin, Yang Wen

    Abstract: In this paper, we study the properties of the critical points of Yang-Mills-Higgs functional, which are called Yang-Mills-Higgs pairs. We first consider the properties of weakly stable Yang-Mills-Higgs pairs on a vector bundle over S^n (n > 3). When n > 3, we prove that the norm of its Higgs field is 1 and the connection is actually Yang-Mills. More precisely, its curvature vanishes when n > 4. We… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 23 pages, 0 figures

    MSC Class: 53C07

    Journal ref: J. Math. Phys. 64, 021511 (2023)

  30. arXiv:2302.14511  [pdf, other

    cs.CV

    A Unified BEV Model for Joint Learning of 3D Local Features and Overlap Estimation

    Authors: Lin Li, Wendong Ding, Yongkun Wen, Yufei Liang, Yong Liu, Guowei Wan

    Abstract: Pairwise point cloud registration is a critical task for many applications, which heavily depends on finding correct correspondences from the two point clouds. However, the low overlap between input point clouds causes the registration to fail easily, leading to mistaken overlapping and mismatched correspondences, especially in scenes where non-overlapping regions contain similar structures. In th… ▽ More

    Submitted 14 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 8 pages. Accepted by ICRA-2023

  31. arXiv:2302.07450  [pdf, other

    cs.LG cs.CR

    FedABC: Targeting Fair Competition in Personalized Federated Learning

    Authors: Dui Wang, Li Shen, Yong Luo, Han Hu, Kehua Su, Yonggang Wen, Dacheng Tao

    Abstract: Federated learning aims to collaboratively train models without accessing their client's local private data. The data may be Non-IID for different clients and thus resulting in poor performance. Recently, personalized federated learning (PFL) has achieved great success in handling Non-IID data by enforcing regularization in local optimization or improving the model aggregation scheme on the server… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 9 pages,5 figures

    Journal ref: AAAI2023

  32. arXiv:2302.07442  [pdf, other

    quant-ph

    Microwave amplification via interfering multi-photon processes in a half-waveguide quantum electrodynamics system

    Authors: Fahad Aziz, Kuan Ting Lin, Ping Yi Wen, Samina, Yu Chen Lin, Emely Wiegand, Ching-Ping Lee, Yu-Ting Cheng, Ching-Yeh Chen, Chin-Hsun Chien, Kai-Min Hsieh, Yu-Huan Huang, Ian Hou, Jeng-Chung Chen, Yen-Hsiang Lin, Anton Frisk Kockum, Guin Dar Lin, Io-Chun Hoi

    Abstract: We investigate the amplification of a microwave probe signal by a superconducting artificial atom, a transmon, strongly coupled to the end of a one-dimensional semi-infinite transmission line. The end of the transmission line acts as a mirror for microwave fields. Due to the weak anharmonicity of the artificial atom, a strong pump field creates multi-photon excitations among the dressed states. Tr… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

  33. arXiv:2302.06205  [pdf, other

    cs.AI cs.GT cs.LG cs.MA

    Order Matters: Agent-by-agent Policy Optimization

    Authors: Xihuai Wang, Zheng Tian, Ziyu Wan, Ying Wen, Jun Wang, Weinan Zhang

    Abstract: While multi-agent trust region algorithms have achieved great success empirically in solving coordination tasks, most of them, however, suffer from a non-stationarity problem since agents update their policies simultaneously. In contrast, a sequential scheme that updates policies agent-by-agent provides another perspective and shows strong performance. However, sample inefficiency and lack of mono… ▽ More

    Submitted 26 February, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted by ICLR2023, https://openreview.net/forum?id=Q-neeWNVv1

  34. arXiv:2302.04831  [pdf, other

    cs.AI cs.LG

    Cooperative Open-ended Learning Framework for Zero-shot Coordination

    Authors: Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan

    Abstract: Zero-shot coordination in cooperative artificial intelligence (AI) remains a significant challenge, which means effectively coordinating with a wide range of unseen partners. Previous algorithms have attempted to address this challenge by optimizing fixed objectives within a population to improve strategy or behaviour diversity. However, these approaches can result in a loss of learning and an ina… ▽ More

    Submitted 28 February, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: 15 pages with 9 pages main body

  35. arXiv:2302.03668  [pdf, other

    cs.LG cs.CL

    Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

    Authors: Yuxin Wen, Neel Jain, John Kirchenbauer, Micah Goldblum, Jonas Geiping, Tom Goldstein

    Abstract: The strength of modern generative models lies in their ability to be controlled through text-based prompts. Typical "hard" prompts are made from interpretable words and tokens, and must be hand-crafted by humans. There are also "soft" prompts, which consist of continuous feature vectors. These can be discovered using powerful optimization methods, but they cannot be easily interpreted, re-used acr… ▽ More

    Submitted 1 June, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 15 pages, 12 figures, Code is available at https://github.com/YuxinWenRick/hard-prompts-made-easy

  36. arXiv:2302.02184  [pdf, other

    cs.CV

    Real-Time Image Demoireing on Mobile Devices

    Authors: Yuxin Zhang, Mingbao Lin, Xunchao Li, Han Liu, Guozhi Wang, Fei Chao, Shuai Ren, Yafei Wen, Xiaoxin Chen, Rongrong Ji

    Abstract: Moire patterns appear frequently when taking photos of digital screens, drastically degrading the image quality. Despite the advance of CNNs in image demoireing, existing networks are with heavy design, causing redundant computation burden for mobile devices. In this paper, we launch the first study on accelerating demoireing networks and propose a dynamic demoireing acceleration method (DDA) towa… ▽ More

    Submitted 4 February, 2023; originally announced February 2023.

    Comments: To appear in the eleventh International Conference on Learning Representations (ICLR 2023)

  37. Evidence for suppression of structure growth in the concordance cosmological model

    Authors: Nhat-Minh Nguyen, Dragan Huterer, Yuewei Wen

    Abstract: We present evidence for a suppressed growth rate of large-scale structure during the dark-energy dominated era. Modeling the growth rate of perturbations with the ``growth index'' $γ$, we find that current cosmological data strongly prefer a higher growth index than the value $γ=0.55$ predicted by general relativity in a flat $Λ$CDM cosmology. Both the cosmic microwave background data from Planck… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: 5 pages + references; 5 figures, 2 tables, 2900 words. Comments welcome! v2: a few more pages and figures, just enough number of words; PRL in press

    Report number: LCTP-23-03

    Journal ref: Phys. Rev. Lett. 131, 111001 (2023)

  38. arXiv:2301.10226  [pdf, other

    cs.LG cs.CL cs.CR

    A Watermark for Large Language Models

    Authors: John Kirchenbauer, Jonas Geiping, Yuxin Wen, Jonathan Katz, Ian Miers, Tom Goldstein

    Abstract: Potential harms of large language models can be mitigated by watermarking model output, i.e., embedding signals into generated text that are invisible to humans but algorithmically detectable from a short span of tokens. We propose a watermarking framework for proprietary language models. The watermark can be embedded with negligible impact on text quality, and can be detected using an efficient o… ▽ More

    Submitted 1 May, 2024; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: 13 pages in the main body. Published at ICML 2023. Code is available at github.com/jwkirchenbauer/lm-watermarking

  39. arXiv:2301.06244  [pdf, other

    cs.RO eess.SY

    Haptic Transparency and Interaction Force Control for a Lower-Limb Exoskeleton

    Authors: Emek Barış Küçüktabak, Yue Wen, Sangjoon J. Kim, Matthew Short, Daniel Ludvig, Levi Hargrove, Eric Perreault, Kevin Lynch, Jose Pons

    Abstract: Controlling the interaction forces between a human and an exoskeleton is crucial for providing transparency or adjusting assistance or resistance levels. However, it is an open problem to control the interaction forces of lower-limb exoskeletons designed for unrestricted overground walking. For these types of exoskeletons, it is challenging to implement force/torque sensors at every contact betwee… ▽ More

    Submitted 22 January, 2024; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 19 pages, 13 figures. Accepted for publication in the IEEE Transactions on Robotics (T-RO)

  40. Model-based Transfer Learning for Automatic Optical Inspection based on domain discrepancy

    Authors: Erik Isai Valle Salgado, Haoxin Yan, Yue Hong, Peiyuan Zhu, Shidong Zhu, Chengwei Liao, Yanxiang Wen, Xiu Li, Xiang Qian, Xiaohao Wang, Xinghui Li

    Abstract: Transfer learning is a promising method for AOI applications since it can significantly shorten sample collection time and improve efficiency in today's smart manufacturing. However, related research enhanced the network models by applying TL without considering the domain similarity among datasets, the data long-tailedness of a source dataset, and mainly used linear transformations to mitigate th… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: This is a fix of the published paper "Relational-based transfer learning for automatic optical inspection based on domain discrepancy"

    Journal ref: Proc. SPIE 12317, Optoelectronic Imaging and Multimedia Technology IXMultimedia Technology IX, 2023

  41. arXiv:2301.02445  [pdf, other

    cs.AI cs.LG

    IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling

    Authors: Yilin Wen, Biao Luo, Yuqian Zhao

    Abstract: Multimodal knowledge graph link prediction aims to improve the accuracy and efficiency of link prediction tasks for multimodal data. However, for complex multimodal information and sparse training data, it is usually difficult to achieve interpretability and high accuracy simultaneously for most methods. To address this difficulty, a new model is developed in this paper, namely Interpretable Multi… ▽ More

    Submitted 11 January, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 12pages,10 figures

  42. arXiv:2301.02403  [pdf, other

    cs.CV

    CyberLoc: Towards Accurate Long-term Visual Localization

    Authors: Liu Liu, Yukai Lin, Xiao Liang, Qichao Xu, Miao Jia, Yangdong Liu, Yuxiang Wen, Wei Luo, Jiangwei Li

    Abstract: This technical report introduces CyberLoc, an image-based visual localization pipeline for robust and accurate long-term pose estimation under challenging conditions. The proposed method comprises four modules connected in a sequence. First, a mapping module is applied to build accurate 3D maps of the scene, one map for each reference sequence if there exist multiple reference sequences under diff… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: MLAD-ECCV 2022

  43. arXiv:2301.02348  [pdf, other

    cs.RO eess.SY

    High-Speed High-Accuracy Spatial Curve Tracking Using Motion Primitives in Industrial Robots

    Authors: Honglu He, Chen-lung Lu, Yunshi Wen, Glenn Saunders, Pinghai Yang, Jeffrey Schoonover, Agung Julius, John T. Wen

    Abstract: Industrial robots are increasingly deployed in applications requiring an end effector tool to closely track a specified path, such as in spraying and welding. Performance and productivity present possibly conflicting objectives: tracking accuracy, path speed, and motion uniformity. Industrial robots are programmed through motion primitives consisting of waypoints connected by pre-defined motion se… ▽ More

    Submitted 5 January, 2023; originally announced January 2023.

  44. arXiv:2301.02280  [pdf, other

    cs.CV

    Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

    Authors: Filip Radenovic, Abhimanyu Dubey, Abhishek Kadian, Todor Mihaylov, Simon Vandenhende, Yash Patel, Yi Wen, Vignesh Ramanathan, Dhruv Mahajan

    Abstract: Vision-language models trained with contrastive learning on large-scale noisy data are becoming increasingly popular for zero-shot recognition problems. In this paper we improve the following three aspects of the contrastive pre-training pipeline: dataset noise, model initialization and the training objective. First, we propose a straightforward filtering strategy titled Complexity, Action, and Te… ▽ More

    Submitted 29 March, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: CVPR 2023

  45. arXiv:2301.01795  [pdf, other

    cs.CV

    PACO: Parts and Attributes of Common Objects

    Authors: Vignesh Ramanathan, Anmol Kalia, Vladan Petrovic, Yi Wen, Baixue Zheng, Baishan Guo, Rui Wang, Aaron Marquez, Rama Kovvuri, Abhishek Kadian, Amir Mousavi, Yiwen Song, Abhimanyu Dubey, Dhruv Mahajan

    Abstract: Object models are gradually progressing from predicting just category labels to providing detailed descriptions of object instances. This motivates the need for large datasets which go beyond traditional object masks and provide richer annotations such as part masks and attributes. Hence, we introduce PACO: Parts and Attributes of Common Objects. It spans 75 object categories, 456 object-part cate… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

  46. arXiv:2212.14249  [pdf

    cond-mat.soft physics.chem-ph physics.optics

    Momentum-Resolved Sum-Frequency Vibrational Spectroscopy of Bonded Interface Layer at Charged Water Interfaces

    Authors: Yao Hsiao, Ting-Han Chou, Animesh Patra, Yu-Chieh Wen

    Abstract: Interface-specific hydrogen- (H-)bonding network of water next to a substrate (including air) directly controls the energy transfer and chemical reaction pathway at many charged aqueous interfaces. Yet, experimental characterization of such bonded water layer structure is still a challenge due to the presence of the ion diffuse layer. We now develop a sum-frequency (SF) spectroscopic scheme with v… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

  47. Activity-assisted barrier-crossing of self-propelled colloids over parallel microgrooves

    Authors: Yan Wen, Zhihao Li, Haiqin Wang, Jing Zheng, Jinyao Tang, Pik-Yin Lai, Xinpeng Xu, Penger Tong

    Abstract: We report a systematic study of the dynamics of self-propelled particles (SPPs) over a one-dimensional periodic potential landscape, which is fabricated on a microgroove-patterned polydimethylsiloxane (PDMS) substrate. From the measured non-equilibrium probability density function of the SPPs, we find that the escape dynamics of the slow-rotating SPPs across the potential landscape can be describe… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

  48. arXiv:2212.12669  [pdf, other

    cs.AI cs.LG

    On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective

    Authors: Ying Wen, Ziyu Wan, Ming Zhou, Shufang Hou, Zhe Cao, Chenyang Le, Jingxiao Chen, Zheng Tian, Weinan Zhang, Jun Wang

    Abstract: The pervasive uncertainty and dynamic nature of real-world environments present significant challenges for the widespread implementation of machine-driven Intelligent Decision-Making (IDM) systems. Consequently, IDM should possess the ability to continuously acquire new skills and effectively generalize across a broad range of applications. The advancement of Artificial General Intelligence (AGI)… ▽ More

    Submitted 16 May, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    Comments: 26 pages, 4 figures

  49. arXiv:2212.09248  [pdf, other

    cs.CL cs.SE

    Natural Language to Code Generation in Interactive Data Science Notebooks

    Authors: Pengcheng Yin, Wen-Ding Li, Kefan Xiao, Abhishek Rao, Yeming Wen, Kensen Shi, Joshua Howland, Paige Bailey, Michele Catasta, Henryk Michalewski, Alex Polozov, Charles Sutton

    Abstract: Computational notebooks, such as Jupyter notebooks, are interactive computing environments that are ubiquitous among data scientists to perform data wrangling and analytic tasks. To measure the performance of AI pair programmers that automatically synthesize programs for those tasks given natural language (NL) intents from users, we build ARCADE, a benchmark of 1082 code generation problems using… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 46 pages. 32 figures

  50. arXiv:2212.08059  [pdf, other

    cs.CV cs.AI cs.LG

    Rethinking Vision Transformers for MobileNet Size and Speed

    Authors: Yanyu Li, Ju Hu, Yang Wen, Georgios Evangelidis, Kamyar Salahi, Yanzhi Wang, Sergey Tulyakov, Jian Ren

    Abstract: With the success of Vision Transformers (ViTs) in computer vision tasks, recent arts try to optimize the performance and complexity of ViTs to enable efficient deployment on mobile devices. Multiple approaches are proposed to accelerate attention mechanism, improve inefficient designs, or incorporate mobile-friendly lightweight convolutions to form hybrid architectures. However, ViT and its varian… ▽ More

    Submitted 4 September, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Code is available at: https://github.com/snap-research/EfficientFormer