Zum Hauptinhalt springen

Showing 101–150 of 508 results for author: Wen, Z

.
  1. arXiv:2308.04149  [pdf

    cond-mat.mtrl-sci

    Fully epitaxial fcc(111) magnetic tunnel junctions with a Co90Fe10/MgAlO/Co90Fe10 structure

    Authors: Jieyuan Song, Thomas Scheike, Cong He, Zhenchao Wen, Tadakatsu Ohkubo, Kazuhiro Hono, Hiroaki Sukegawa, Seiji Mitani

    Abstract: Magnetic tunnel junctions (MTJs) with bcc(001)-type structures such as Fe(001)/MgO(001)/Fe(001), have been widely used as the core of various spintronic devices such as magnetoresistive memories; however, the limited material selection of (001)-type MTJs hinders the further development of spintronic devices. Here, as an alternative to the (001)-type MTJs, an fcc(111)-type MTJ using a fully epitaxi… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 18 pages, 5 figures

  2. arXiv:2307.14024  [pdf, other

    cs.IR

    Multi-view Hypergraph Contrastive Policy Learning for Conversational Recommendation

    Authors: Sen Zhao, Wei Wei, Xian-Ling Mao, Shuai Zhu, Minghui Yang, Zujie Wen, Dangyang Chen, Feida Zhu

    Abstract: Conversational recommendation systems (CRS) aim to interactively acquire user preferences and accordingly recommend items to users. Accurately learning the dynamic user preferences is of crucial importance for CRS. Previous works learn the user preferences with pairwise relations from the interactive conversation and item knowledge, while largely ignoring the fact that factors for a relationship i… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  3. arXiv:2307.10230  [pdf, other

    cs.IR

    Prompt Tuning on Graph-augmented Low-resource Text Classification

    Authors: Zhihao Wen, Yuan Fang

    Abstract: Text classification is a fundamental problem in information retrieval with many real-world applications, such as predicting the topics of online articles and the categories of e-commerce product descriptions. However, low-resource text classification, with no or few labeled samples, presents a serious concern for supervised learning. Meanwhile, many text data are inherently grounded on a network s… ▽ More

    Submitted 19 August, 2024; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: 15 pages, accepted by TKDE (IEEE Transactions on Knowledge and Data Engineering). arXiv admin note: substantial text overlap with arXiv:2305.03324

  4. Quantivine: A Visualization Approach for Large-scale Quantum Circuit Representation and Analysis

    Authors: Zhen Wen, Yihan Liu, Siwei Tan, Jieyi Chen, Minfeng Zhu, Dongming Han, Jianwei Yin, Mingliang Xu, Wei Chen

    Abstract: Quantum computing is a rapidly evolving field that enables exponential speed-up over classical algorithms. At the heart of this revolutionary technology are quantum circuits, which serve as vital tools for implementing, analyzing, and optimizing quantum algorithms. Recent advancements in quantum computing and the increasing capability of quantum devices have led to the development of more complex… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE VIS 2023

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, 2023

  5. arXiv:2307.08929  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.app-ph physics.comp-ph

    Active learning of effective Hamiltonian for super-large-scale atomic structures

    Authors: Xingyue Ma, Hongying Chen, Ri He, Zhanbo Yu, Sergei Prokhorenko, Zheng Wen, Zhicheng Zhong, Jorge Iñiguez, L. Bellaiche, Di Wu, Yurong Yang

    Abstract: The first-principles-based effective Hamiltonian scheme provides one of the most accurate modeling technique for large-scale structures, especially for ferroelectrics. However, the parameterization of the effective Hamiltonian is complicated and can be difficult for some complex systems such as high-entropy perovskites. Here, we propose a general form of effective Hamiltonian and develop an active… ▽ More

    Submitted 14 May, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 11 pages, 4 figures

  6. arXiv:2307.08699  [pdf, other

    cs.CV cs.AI

    Pair then Relation: Pair-Net for Panoptic Scene Graph Generation

    Authors: Jinghao Wang, Zhengyu Wen, Xiangtai Li, Zujin Guo, Jingkang Yang, Ziwei Liu

    Abstract: Panoptic Scene Graph (PSG) is a challenging task in Scene Graph Generation (SGG) that aims to create a more comprehensive scene graph representation using panoptic segmentation instead of boxes. Compared to SGG, PSG has several challenging problems: pixel-level segment outputs and full relationship exploration (It also considers thing and stuff relation). Thus, current PSG methods have limited per… ▽ More

    Submitted 9 August, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: IEEE TPAMI 2024. 13 pages. Project Page: https://github.com/king159/Pair-Net

  7. arXiv:2307.05074  [pdf, other

    cs.IR cs.AI cs.DB

    Retrieval-augmented GPT-3.5-based Text-to-SQL Framework with Sample-aware Prompting and Dynamic Revision Chain

    Authors: Chunxi Guo, Zhiliang Tian, Jintao Tang, Shasha Li, Zhihua Wen, Kaixuan Wang, Ting Wang

    Abstract: Text-to-SQL aims at generating SQL queries for the given natural language questions and thus helping users to query databases. Prompt learning with large language models (LLMs) has emerged as a recent approach, which designs prompts to lead LLMs to understand the input question and generate the corresponding SQL. However, it faces challenges with strict SQL syntax requirements. Existing work promp… ▽ More

    Submitted 4 September, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

  8. arXiv:2307.02046  [pdf, other

    cs.IR cs.AI cs.CL

    Recommender Systems in the Era of Large Language Models (LLMs)

    Authors: Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li

    Abstract: With the prosperity of e-commerce and web applications, Recommender Systems (RecSys) have become an important component of our daily life, providing personalized suggestions that cater to user preferences. While Deep Neural Networks (DNNs) have made significant advancements in enhancing recommender systems by modeling user-item interactions and incorporating textual side information, DNN-based met… ▽ More

    Submitted 29 April, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE TKDE

  9. arXiv:2307.00783  [pdf, other

    math.OC cs.AI cs.LG

    Monte Carlo Policy Gradient Method for Binary Optimization

    Authors: Cheng Chen, Ruitao Chen, Tianyou Li, Ruichen Ao, Zaiwen Wen

    Abstract: Binary optimization has a wide range of applications in combinatorial optimization problems such as MaxCut, MIMO detection, and MaxSAT. However, these problems are typically NP-hard due to the binary constraints. We develop a novel probabilistic model to sample the binary solution according to a parameterized policy distribution. Specifically, minimizing the KL divergence between the parameterized… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    MSC Class: 90C09; 90C27; 90C59; 60J45; 60J20

  10. Reciprocating Magnetic Fields in the Pulsar Wind Observed from the Black Widow Pulsar J1720-0534

    Authors: Chen-Chen Miao, Victoria Blackmon, Wei-Wei Zhu, Dong-Zi Li, Mingyu Ge, Xiao-Peng You, Maura McLaughlin, Di Li, Na Wang, Pei Wang, Jia-Rui Niu, M. Cruces, Jian-Ping Yuan, Jun-Tao Bai, D. J. Champion, Yu-Tong Chen, Ming-Min Chi, P. C. C. Freire, Yi Feng, Zhen-Ye Gan, M. Kramer, Fei-Fei Kou, Yu-Xi Li, Xue-Li Miao, Ling-Qi Meng , et al. (19 additional authors not shown)

    Abstract: We report the radio observations of the eclipsing black widow pulsar J1720-0534, a 3.26 ms pulsar in orbit with a low mass companion of mass 0.029 to 0.034 M$_{\odot}$. We obtain the phase-connected timing ephemeris and polarization profile of this millisecond pulsar (MSP) using the Five-hundred-meter Aperture Spherical Radio Telescope (FAST), the Green Bank Telescope (GBT), and the Parkes Telesco… ▽ More

    Submitted 28 August, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: 15 pages, 8 figures, 1 table, accepted by RAA

  11. arXiv:2307.00358  [pdf, ps, other

    math.OC

    The Error in Multivariate Linear Extrapolation with Applications to Derivative-Free Optimization

    Authors: Liyuan Cao, Zaiwen Wen, Ya-xiang Yuan

    Abstract: We study in this paper the function approximation error of multivariate linear extrapolation. While the sharp error bound of linear interpolation already exists in the literature, linear extrapolation is used far more often in applications such as derivative-free optimization, and its error is not well-studied. A method to numerically compute the sharp error bound is introduced, and several analyt… ▽ More

    Submitted 5 July, 2024; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: 28 pages, 5 figures. arXiv admin note: text overlap with arXiv:2209.12606

  12. arXiv:2306.15401  [pdf, other

    cs.MM cs.HC

    Explainable Multimodal Emotion Recognition

    Authors: Zheng Lian, Haiyang Sun, Licai Sun, Hao Gu, Zhuofan Wen, Siyuan Zhang, Shun Chen, Mingyu Xu, Ke Xu, Kang Chen, Lan Chen, Shan Liang, Ya Li, Jiangyan Yi, Bin Liu, Jianhua Tao

    Abstract: Multimodal emotion recognition is an important research topic in artificial intelligence, whose main goal is to integrate multimodal clues to identify human emotional states. Current works generally assume accurate labels for benchmark datasets and focus on developing more effective architectures. However, emotion annotation relies on subjective judgment. To obtain more reliable labels, existing d… ▽ More

    Submitted 23 May, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

  13. arXiv:2306.14112  [pdf, other

    cs.IR

    Enhancing Dynamic Image Advertising with Vision-Language Pre-training

    Authors: Zhoufutu Wen, Xinyu Zhao, Zhipeng Jin, Yi Yang, Wei Jia, Xiaodong Chen, Shuanglong Li, Lin Liu

    Abstract: In the multimedia era, image is an effective medium in search advertising. Dynamic Image Advertising (DIA), a system that matches queries with ad images and generates multimodal ads, is introduced to improve user experience and ad revenue. The core of DIA is a query-image matching module performing ad image retrieval and relevance modeling. Current query-image matching suffers from limited and inc… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 6 pages, 3 figures, accepted to SIRIP 2023

  14. arXiv:2306.10508  [pdf, other

    cs.CV cs.RO

    QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction

    Authors: Zikang Zhou, Zihao Wen, Jianping Wang, Yung-Hui Li, Yu-Kai Huang

    Abstract: Estimating the joint distribution of on-road agents' future trajectories is essential for autonomous driving. In this technical report, we propose a next-generation framework for joint multi-agent trajectory prediction called QCNeXt. First, we adopt the query-centric encoding paradigm for the task of joint multi-agent trajectory prediction. Powered by this encoding scheme, our scene encoder is equ… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: Technical report for the 1st place solution of the Argoverse 2 Multi-Agent Motion Forecasting Competition at the CVPR 2023 Workshop on Autonomous Driving

  15. Controllable Multi-Objective Re-ranking with Policy Hypernetworks

    Authors: Sirui Chen, Yuan Wang, Zijing Wen, Zhiyu Li, Changshuo Zhang, Xiao Zhang, Quan Lin, Cheng Zhu, Jun Xu

    Abstract: Multi-stage ranking pipelines have become widely used strategies in modern recommender systems, where the final stage aims to return a ranked list of items that balances a number of requirements such as user preference, diversity, novelty etc. Linear scalarization is arguably the most widely used technique to merge multiple requirements into one optimization objective, by summing up the requiremen… ▽ More

    Submitted 17 July, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

  16. Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals

    Authors: Hongru Liang, Jia Liu, Weihong Du, Dingnan Jin, Wenqiang Lei, Zujie Wen, Jiancheng Lv

    Abstract: The machine reading comprehension (MRC) of user manuals has huge potential in customer service. However, current methods have trouble answering complex questions. Therefore, we introduce the Knowing-how & Knowing-that task that requires the model to answer factoid-style, procedure-style, and inconsistent questions about user manuals. We resolve this task by jointly representing the steps and facts… ▽ More

    Submitted 8 August, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023. (2023)

  17. arXiv:2306.04099  [pdf, other

    cs.LG

    NTKCPL: Active Learning on Top of Self-Supervised Model by Estimating True Coverage

    Authors: Ziting Wen, Oscar Pizarro, Stefan Williams

    Abstract: High annotation cost for training machine learning classifiers has driven extensive research in active learning and self-supervised learning. Recent research has shown that in the context of supervised learning different active learning strategies need to be applied at various stages of the training process to ensure improved performance over the random baseline. We refer to the point where the nu… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  18. arXiv:2305.20068  [pdf, other

    cs.RO cs.LG

    TOFG: A Unified and Fine-Grained Environment Representation in Autonomous Driving

    Authors: Zihao Wen, Yifan Zhang, Xinhong Chen, Jianping Wang

    Abstract: In autonomous driving, an accurate understanding of environment, e.g., the vehicle-to-vehicle and vehicle-to-lane interactions, plays a critical role in many driving tasks such as trajectory prediction and motion planning. Environment information comes from high-definition (HD) map and historical trajectories of vehicles. Due to the heterogeneity of the map data and trajectory data, many data-driv… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted by ICRA 2023

  19. arXiv:2305.13774  [pdf, other

    cs.SD eess.AS

    ADD 2023: the Second Audio Deepfake Detection Challenge

    Authors: Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li

    Abstract: Audio deepfake detection is an emerging topic in the artificial intelligence community. The second Audio Deepfake Detection Challenge (ADD 2023) aims to spur researchers around the world to build new innovative technologies that can further accelerate and foster research on detecting and analyzing deepfake speech utterances. Different from previous challenges (e.g. ADD 2022), ADD 2023 focuses on s… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  20. arXiv:2305.10011  [pdf

    physics.optics

    Super-Resolution Imaging via Angular Magnification

    Authors: Yi Zhou, Dingpeng Liao, Kun Zhang, Zijie Ma, Shikai Wu, Jun Ma, Xuemei Dai, Zhengguo Shang, Zhongquan Wen, Gang Chen

    Abstract: The far-field resolution of optical imaging systems is restricted by the Abbe diffraction limit, a direct result of the wave nature of light. One successful technological approach to circumventing this limit is to reduce the effective size of a point-spread-function. In the past decades, great endeavors have been made to engineer an effective point-spread-function by exploiting different mechanism… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  21. arXiv:2305.05250  [pdf, ps, other

    astro-ph.GA astro-ph.CO

    More relaxed intracluster gas than galaxies in clusters in quasi-equilibrium

    Authors: Z. S. Yuan, J. L. Han, H. Böhringer, Z. L. Wen, G. Chon

    Abstract: During cluster mergers, the intracluster gas and member galaxies undergo dynamic evolution, but at different timescales and reach different states. We collect 24 galaxy clusters in quasi-equilibrium state as indicated by the X-ray image, and calculate the cluster orientations and three kinds of dynamical parameters, i.e., the normalized centroid offset, the sphere index and the ellipticity, for th… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 9 pages, 5 figures, 1 table, accepted for publication in MNRAS

  22. Augmenting Low-Resource Text Classification with Graph-Grounded Pre-training and Prompting

    Authors: Zhihao Wen, Yuan Fang

    Abstract: Text classification is a fundamental problem in information retrieval with many real-world applications, such as predicting the topics of online articles and the categories of e-commerce product descriptions. However, low-resource text classification, with few or no labeled samples, poses a serious concern for supervised learning. Meanwhile, many text data are inherently grounded on a network stru… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 11 pages, accepted by SIGIR'23

  23. arXiv:2305.02774  [pdf, other

    eess.IV cs.CV physics.med-ph

    Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction

    Authors: Qi Wang, Zhijie Wen, Jun Shi, Qian Wang, Dinggang Shen, Shihui Ying

    Abstract: Multi-modal magnetic resonance imaging (MRI) plays a crucial role in comprehensive disease diagnosis in clinical medicine. However, acquiring certain modalities, such as T2-weighted images (T2WIs), is time-consuming and prone to be with motion artifacts. It negatively impacts subsequent multi-modal image analysis. To address this issue, we propose an end-to-end deep learning framework that utilize… ▽ More

    Submitted 21 May, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

  24. arXiv:2305.02575  [pdf, other

    cs.IR

    Towards Hierarchical Policy Learning for Conversational Recommendation with Hypergraph-based Reinforcement Learning

    Authors: Sen Zhao, Wei Wei, Yifan Liu, Ziyang Wang, Wendi Li, Xian-Ling Mao, Shuai Zhu, Minghui Yang, Zujie Wen

    Abstract: Conversational recommendation systems (CRS) aim to timely and proactively acquire user dynamic preferred attributes through conversations for item recommendation. In each turn of CRS, there naturally have two decision-making processes with different roles that influence each other: 1) director, which is to select the follow-up option (i.e., ask or recommend) that is more effective for reducing the… ▽ More

    Submitted 26 July, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Journal ref: THE 32nd INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI2023)

  25. arXiv:2304.13400  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Observation of Fluctuation Spin Hall Effect in Antiferromagnet

    Authors: Chi Fang, Caihua Wan, Xiaoyue Zhang, Satoshi Okamoto, Tianyi Ma, Jianying Qin, Xiao Wang, Chenyang Guo, Jing Dong, Guoqiang Yu, Zhenchao Wen, Ning Tang, Stuart S. P. Parkin, Naoto Nagaosa, Yuan Lu, Xiufeng Han

    Abstract: The spin Hall effect (SHE) can generate a pure spin current by an electric current, which is promisingly used to electrically control magnetization. To reduce power consumption of this control, a giant spin Hall angle (SHA) in the SHE is desired in low-resistivity systems for practical applications. Here, critical spin fluctuation near the antiferromagnetic (AFM) phase-transition is proved as an e… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 27 pages, 9 figures

  26. arXiv:2304.13301  [pdf, other

    cs.CL cs.AI

    Prompting GPT-3.5 for Text-to-SQL with De-semanticization and Skeleton Retrieval

    Authors: Chunxi Guo, Zhiliang Tian, Jintao Tang, Pancheng Wang, Zhihua Wen, Kang Yang, Ting Wang

    Abstract: Text-to-SQL is a task that converts a natural language question into a structured query language (SQL) to retrieve information from a database. Large language models (LLMs) work well in natural language generation tasks, but they are not specifically pre-trained to understand the syntax and semantics of SQL commands. In this paper, we propose an LLM-based framework for Text-to-SQL which retrieves… ▽ More

    Submitted 31 August, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

  27. Ultra-high-speed coherent anti-Stokes Raman spectroscopy with a hybrid dual-comb source

    Authors: Tianjian Lv, Bing Han, Ming Yan, Zhaoyang Wen, Kun Huang, Kangwen Yang, Heping Zeng

    Abstract: Coherent anti-Stokes Raman scattering (CARS) spectroscopy with time-delayed ultrashort pulses and a single-pixel photodetector has shown great potential for spectroscopic imaging and transient studies in chemistry and biological research. However, those systems rely on mechanical delay lines or two asynchronous optical combs with inflexible repetition frequencies, technically limiting their acquis… ▽ More

    Submitted 3 December, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 22 pages, 6 figures

  28. arXiv:2304.05007  [pdf, other

    cs.CR

    Privacy Amplification via Shuffling: Unified, Simplified, and Tightened

    Authors: Shaowei Wang, Yun Peng, Jin Li, Zikai Wen, Zhipeng Li, Shiyu Yu, Di Wang, Wei Yang

    Abstract: The shuffle model of differential privacy provides promising privacy-utility balances in decentralized, privacy-preserving data analysis. However, the current analyses of privacy amplification via shuffling lack both tightness and generality. To address this issue, we propose the \emph{variation-ratio reduction} as a comprehensive framework for privacy amplification in both single-message and mult… ▽ More

    Submitted 28 July, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: To appear in VLDB 2024. Code available at https://github.com/wangsw/PrivacyAmplification

  29. arXiv:2303.11369  [pdf, other

    cs.LG cs.AI

    Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale

    Authors: Botao Hao, Rahul Jain, Dengwang Tang, Zheng Wen

    Abstract: In this paper, we address the following problem: Given an offline demonstration dataset from an imperfect expert, what is the best way to leverage it to bootstrap online learning performance in MDPs. We first propose an Informed Posterior Sampling-based RL (iPSRL) algorithm that uses the offline dataset, and information about the expert's behavioral policy used to generate the offline dataset. Its… ▽ More

    Submitted 16 July, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Alphabetical order. Corresponding to Rahul Jain

  30. arXiv:2303.10599  [pdf, ps, other

    stat.ML math.OC

    Convergence Analysis of Stochastic Gradient Descent with MCMC Estimators

    Authors: Tianyou Li, Fan Chen, Huajie Chen, Zaiwen Wen

    Abstract: Understanding stochastic gradient descent (SGD) and its variants is essential for machine learning. However, most of the preceding analyses are conducted under amenable conditions such as unbiased gradient estimator and bounded objective functions, which does not encompass many sophisticated applications, such as variational Monte Carlo, entropy-regularized reinforcement learning and variational i… ▽ More

    Submitted 23 March, 2024; v1 submitted 19 March, 2023; originally announced March 2023.

  31. arXiv:2303.10320  [pdf, other

    math.MG math.DS

    Topology automaton and conformal dimension of post-critical-finite self-similar sets

    Authors: Hui Rao, Zhi-Ying Wen, Qihan Yuan, Yuan Zhang

    Abstract: In this paper, we use a class of finite state automata, called topology automaton, to study the metric classification of a special class of post-critically finite self-similar sets. As an application, we prove that the conformal dimension of post-critically finite self-similar dendrites and fractal gasket with connected component is 1.

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 38 pages, 11 figures, 27 references

  32. arXiv:2303.01211  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Learning From Yourself: A Self-Distillation Method for Fake Speech Detection

    Authors: Jun Xue, Cunhang Fan, Jiangyan Yi, Chenglong Wang, Zhengqi Wen, Dan Zhang, Zhao Lv

    Abstract: In this paper, we propose a novel self-distillation method for fake speech detection (FSD), which can significantly improve the performance of FSD without increasing the model complexity. For FSD, some fine-grained information is very important, such as spectrogram defects, mute segments, and so on, which are often perceived by shallow networks. However, shallow networks have much noise, which can… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  33. Efficient reference-less transmission matrix retrieval for a multimode fiber using fast Fourier transform

    Authors: Jingshan Zhong, Zhong Wen, Quanzhi Li, Qilin Deng, Qing Yang

    Abstract: Transmission matrix (TM) linearly maps the incident and transmitted complex fields, and has been used widely due to its ability to characterize scattering media. It is computationally demanding to reconstruct the TM from intensity images measured by a reference-less experimental setup. Removing reference beam for interference gains the advantage of simple experimental setup. However, the long comp… ▽ More

    Submitted 1 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  34. arXiv:2302.13087  [pdf, other

    math.OC cs.LG

    Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation

    Authors: Zhifa Ke, Junyu Zhang, Zaiwen Wen

    Abstract: In this paper, a Gauss-Newton Temporal Difference (GNTD) learning method is proposed to solve the Q-learning problem with nonlinear function approximation. In each iteration, our method takes one Gauss-Newton (GN) step to optimize a variant of Mean-Squared Bellman Error (MSBE), where target networks are adopted to avoid double sampling. Inexact GN steps are analyzed so that one can safely and effi… ▽ More

    Submitted 31 March, 2024; v1 submitted 25 February, 2023; originally announced February 2023.

  35. arXiv:2302.12400  [pdf, other

    cs.LG cs.CV

    Towards Stable Test-Time Adaptation in Dynamic Wild World

    Authors: Shuaicheng Niu, Jiaxiang Wu, Yifan Zhang, Zhiquan Wen, Yaofo Chen, Peilin Zhao, Mingkui Tan

    Abstract: Test-time adaptation (TTA) has shown to be effective at tackling distribution shifts between training and testing data by adapting a given model on test samples. However, the online model updating of TTA may be unstable and this is often a key obstacle preventing existing TTA methods from being deployed in the real world. Specifically, TTA may fail to improve or even harm the model performance whe… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: accepted by International Conference on Learning Representations (ICLR) 2023 as Notable-Top-5%; 27 pages, 10 figures, 18 tables

  36. Isogeometric analysis using G-spline surfaces with arbitrary unstructured quadrilateral layout

    Authors: Zuowei Wen, Md. Sadman Faruque, Xin Li, Xiaodong Wei, Hugo Casquero

    Abstract: G-splines are a generalization of B-splines that deals with extraordinary points by imposing G^1 constraints across their spoke edges, thus obtaining a continuous tangent plane throughout the surface. Using the isoparametric concept and the Bubnov-Galerkin method to solve partial differential equations with G-splines results in discretizations with global C^1 continuity in physical space. Extraord… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  37. arXiv:2302.12046  [pdf, ps, other

    physics.optics

    Observation of Q-switched and continuous wave regimes with mode-hopping in Er-doped fiber lasers incorporating a dynamic population grating

    Authors: Zengrun Wen, Xiulin Fan, Kaile Wang, Weiming Wang, Song Gao, Wenjing Hao, Yuanmei Gao, Yangjian Cai, Liren Zheng

    Abstract: Dynamic population gratings (DPGs) in rare-earth doped fibers are prevalent devices in fiber lasers for the production of single-longitudinal-mode emission, Q-switched pulses, and wavelength self-sweeping regimes. This study presents a transition from Q-switched state to continuous wave (CW) state, accompanying irregular mode-hopping, in an erbium-doped fiber laser with a heavily-doped DPG centere… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  38. arXiv:2302.09205  [pdf, other

    cs.LG cs.AI

    Approximate Thompson Sampling via Epistemic Neural Networks

    Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

    Abstract: Thompson sampling (TS) is a popular heuristic for action selection, but it requires sampling from a posterior distribution. Unfortunately, this can become computationally intractable in complex environments, such as those modeled using neural networks. Approximate posterior samples can produce effective actions, but only if they reasonably approximate joint predictive distributions of outputs acro… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  39. Constraints on dark energy from the CSST galaxy clusters

    Authors: Yufei Zhang, Mingjing Chen, Zhonglue Wen, Wenjuan Fang

    Abstract: We study the potential of the galaxy cluster sample expected from the China Space Station Telescope (CSST) survey to constrain dark energy properties. By modelling the distribution of observed cluster mass for a given true mass to be log-normal and adopting a selection threshold in the observed mass $M_{200m} \geq 0.836 \times 10^{14} h^{-1}M_{\odot}$, we find about $4.1 \times 10^{5}$ clusters in… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 19 pages, 5 figures, 4 tables. Accepted for publication in Research in Astronomy and Astrophysics

    Journal ref: Res. Astron. Astrophys. 23 045011 (2023)

  40. Generating a Structured Summary of Numerous Academic Papers: Dataset and Method

    Authors: Shuaiqi Liu, Jiannong Cao, Ruosong Yang, Zhiyuan Wen

    Abstract: Writing a survey paper on one research topic usually needs to cover the salient content from numerous related papers, which can be modeled as a multi-document summarization (MDS) task. Existing MDS datasets usually focus on producing the structureless summary covering a few input documents. Meanwhile, previous structured summary generation works focus on summarizing a single document into a multi-… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: IJCAI 2022

    ACM Class: I.2.7; I.7

  41. arXiv:2302.03815  [pdf, other

    cs.CL cs.AI

    Long Text and Multi-Table Summarization: Dataset and Method

    Authors: Shuaiqi Liu, Jiannong Cao, Ruosong Yang, Zhiyuan Wen

    Abstract: Automatic document summarization aims to produce a concise summary covering the input document's salient information. Within a report document, the salient information can be scattered in the textual and non-textual content. However, existing document summarization datasets and methods usually focus on the text and filter out the non-textual content. Missing tabular data can limit produced summari… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: EMNLP 2022 Findings

    ACM Class: I.2.7; I.7

  42. arXiv:2302.03773  [pdf, other

    cs.CL cs.LG

    What Matters In The Structured Pruning of Generative Language Models?

    Authors: Michael Santacroce, Zixin Wen, Yelong Shen, Yuanzhi Li

    Abstract: Auto-regressive large language models such as GPT-3 require enormous computational resources to use. Traditionally, structured pruning methods are employed to reduce resource usage. However, their application to and efficacy for generative language models is heavily under-explored. In this paper we conduct an comprehensive evaluation of common structured pruning methods, including magnitude, rando… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  43. arXiv:2302.03319  [pdf, ps, other

    cs.LG math.ST stat.ML

    Leveraging Demonstrations to Improve Online Learning: Quality Matters

    Authors: Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen

    Abstract: We investigate the extent to which offline demonstration data can improve online learning. It is natural to expect some improvement, but the question is how, and by how much? We show that the degree of improvement must depend on the quality of the demonstration data. To generate portable insights, we focus on Thompson sampling (TS) applied to a multi-armed bandit as a prototypical online learning… ▽ More

    Submitted 17 May, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: Accepted at ICML 2023

  44. Ultra-soft Thermal Diodes Enabled by Dual-Alkane-Based Phase Change Composites

    Authors: Yunsong Pang, Junhong Li, Zhibin Wen, Ting Liang, Shan Gao, Dezhao Huang, Rong Sun Jianbin Xu Tengfei Luo, Xiaoliang Zeng

    Abstract: Thermal diode, a type of device that allows heat to flow in one direction preferentially, can be employed in many thermal applications. However, if the mechanical compliance of the thermal diode is poor, which prevents its intimate contact with heat source or sink surfaces, the thermal rectification performance cannot be used to its full extent. In this work, we introduce a heterojunction thermal… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Journal ref: Materials Today Physics (2024): 101450

  45. arXiv:2301.06290  [pdf, ps, other

    math.CV

    All possible orders less than 1 of transcendental entire solutions of linear difference equations with polynomial coefficients

    Authors: Katsuya Ishizaki, Zhi-Tao Wen

    Abstract: In this paper, we study all possible orders which are less than 1 of transcendental entire solutions of linear difference equations \begin{equation} P_m(z)Δ^mf(z)+\cdots+P_1(z)Δf(z)+P_0(z)f(z)=0,\tag{+} \end{equation} where $P_j(z)$ are polynomials for $j=0,\ldots,m$. Firstly, we give the condition on existence of transcendental entire solutions of order less than 1 of difference equations (… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

  46. arXiv:2301.03801  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion

    Authors: Haogeng Liu, Tao Wang, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Jianhua Tao

    Abstract: Text-to-speech (TTS) and voice conversion (VC) are two different tasks both aiming at generating high quality speaking voice according to different input modality. Due to their similarity, this paper proposes UnifySpeech, which brings TTS and VC into a unified framework for the first time. The model is based on the assumption that speech can be decoupled into three independent components: content… ▽ More

    Submitted 10 January, 2023; originally announced January 2023.

  47. Three New Spiral Galaxies with Active Nuclei Producing Double Radio Lobes

    Authors: Xuyang Gao, Zhongsheng Yuan, Jinlin Han, Zhonglue Wen, Susu Shan

    Abstract: Double radio lobes are generally believed to be produced by active nuclei of elliptical galaxies. However, several double-lobed radio sources have been solidly found to be associated with spiral galaxies. By cross-matching $\sim9\times10^5$ spiral galaxies selected from the Sloan Digital Sky Survey DR8 data with the full 1.4-GHz radio source catalogs of NRAO VLA Sky Survey and Faint Images of Radi… ▽ More

    Submitted 16 February, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: typos corrected, accepted for publication in RAA

  48. arXiv:2212.10191  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Emotion Selectable End-to-End Text-based Speech Editing

    Authors: Tao Wang, Jiangyan Yi, Ruibo Fu, Jianhua Tao, Zhengqi Wen, Chu Yuan Zhang

    Abstract: Text-based speech editing allows users to edit speech by intuitively cutting, copying, and pasting text to speed up the process of editing speech. In the previous work, CampNet (context-aware mask prediction network) is proposed to realize text-based speech editing, significantly improving the quality of edited speech. This paper aims at a new task: adding emotional effect to the editing speech du… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Under review, 12 pages, 11 figures, demo page is available at https://hairuo55.github.io/Emo-CampNet/

  49. arXiv:2212.09970  [pdf, other

    cs.LG

    Data Augmentation on Graphs: A Technical Survey

    Authors: Jiajun Zhou, Chenxuan Xie, Shengbo Gong, Zhenyu Wen, Xiangyu Zhao, Qi Xuan, Xiaoniu Yang

    Abstract: In recent years, graph representation learning has achieved remarkable success while suffering from low-quality data problems. As a mature technology to improve data quality in computer vision, data augmentation has also attracted increasing attention in graph domain. To advance research in this emerging direction, this survey provides a comprehensive review and summary of existing graph data augm… ▽ More

    Submitted 21 June, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Version 2. Under review

  50. arXiv:2212.09450  [pdf, other

    q-bio.BM cs.CE cs.LG

    Accelerating Antimicrobial Peptide Discovery with Latent Structure

    Authors: Danqing Wang, Zeyu Wen, Fei Ye, Lei Li, Hao Zhou

    Abstract: Antimicrobial peptides (AMPs) are promising therapeutic approaches against drug-resistant pathogens. Recently, deep generative models are used to discover new AMPs. However, previous studies mainly focus on peptide sequence attributes and do not consider crucial structure information. In this paper, we propose a latent sequence-structure model for designing AMPs (LSSAMP). LSSAMP exploits multi-sca… ▽ More

    Submitted 20 August, 2023; v1 submitted 28 November, 2022; originally announced December 2022.

    Comments: KDD 2023