Zum Hauptinhalt springen

Showing 251–300 of 508 results for author: Wen, Z

.
  1. arXiv:2011.07217  [pdf

    cond-mat.mtrl-sci

    Voltage-controlled magnetic anisotropy under the electronic structure modulation in quantum wells

    Authors: Qingyi Xiang, Yoshio Miura, Muftah Al-Mahdawi, Thomas Scheike, Xiandong Xu, Yuya Sakuraba, Shinya Kasai, Zhenchao Wen, Hiroaki Sukegawa, Seiji Mitani, Kazuhiro Hono

    Abstract: Voltage-controlled magnetic anisotropy (VCMA) offers an emerging approach to realize energy-efficient magnetization switching in spintronic devices such as magnetic random access memories (MRAMs). Here, we show that manipulating the condensed states, i.e., introducing quantum well (QW) can significantly influence the VCMA in a Cr/Fe-QW/MgAl2O4 based magnetic tunnel junction (MTJ). Only for the MTJ… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: 26 pages, 8 figures

  2. arXiv:2011.05591  [pdf, other

    cs.SD cs.LG eess.AS

    Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning

    Authors: Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Leichao Song

    Abstract: Recurrent neural networks (RNNs) have shown significant improvements in recent years for speech enhancement. However, the model complexity and inference time cost of RNNs are much higher than deep feed-forward neural networks (DNNs). Therefore, these limit the applications of speech enhancement. This paper proposes a deep time delay neural network (TDNN) for speech enhancement with full data learn… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted by ISCSLP 2021

  3. The Mode Switching in Pulsar J1326$-$6700

    Authors: Z. G. Wen, W. M. Yan, J. P. Yuan, H. G. Wang, J. L. Chen, M. Mijit, R. Yuen, N. Wang, Z. Y. Tu, S. J. Dang

    Abstract: We report on a detailed study of the mode switching in pulsar J1326$-$6700 by analyzing the data acquired from the Parkes 64 m radio telescope at 1369 MHz. During the abnormal mode, the emission at the central and trailing components becomes extremely weak. Meanwhile, the leading emission shifts toward earlier longitude by almost 2°, and remains in this position for typically less than a minute. T… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: 10 pages, 8 figures

  4. arXiv:2011.04249  [pdf, other

    cs.SD cs.CL eess.AS

    Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition

    Authors: Cunhang Fan, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Bin Liu, Zhengqi Wen

    Abstract: The joint training framework for speech enhancement and recognition methods have obtained quite good performances for robust end-to-end automatic speech recognition (ASR). However, these methods only utilize the enhanced feature as the input of the speech recognition component, which are affected by the speech distortion problem. In order to address this problem, this paper proposes a gated recurr… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech, and Language Processing

  5. arXiv:2011.02120  [pdf, other

    cs.CV

    Learning Discriminative Representations for Fine-Grained Diabetic Retinopathy Grading

    Authors: Li Tian, Liyan Ma, Zhijie Wen, Shaorong Xie, Yupeng Xu

    Abstract: Diabetic retinopathy (DR) is one of the leading causes of blindness. However, no specific symptoms of early DR lead to a delayed diagnosis, which results in disease progression in patients. To determine the disease severity levels, ophthalmologists need to focus on the discriminative parts of the fundus images. In recent years, deep learning has achieved great success in medical image analysis. Ho… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 5 pages

  6. Diverse polarization angle swings from a repeating fast radio burst source

    Authors: R. Luo, B. J. Wang, Y. P. Men, C. F. Zhang, J. C. Jiang, H. Xu, W. Y. Wang, K. J. Lee, J. L. Han, B. Zhang, R. N. Caballero, M. Z. Chen, X. L. Chen, H. Q. Gan, Y. J. Guo, L. F. Hao, Y. X. Huang, P. Jiang, H. Li, J. Li, Z. X. Li, J. T. Luo, J. Pan, X. Pei, L. Qian , et al. (12 additional authors not shown)

    Abstract: Fast radio bursts (FRBs) are millisecond-duration radio transients of unknown origin. Two possible mechanisms that could generate extremely coherent emission from FRBs invoke neutron star magnetospheres or relativistic shocks far from the central energy source. Detailed polarization observations may help us to understand the emission mechanism. However, the available FRB polarization data have bee… ▽ More

    Submitted 30 October, 2020; originally announced November 2020.

    Comments: Published online in Nature on 29 Oct, 2020

    Journal ref: Nature, Volume 586, Pages 693--696 (2020)

  7. Self-sweeping ytterbium-doped fiber laser based on a fiber saturable absorber

    Authors: Zengrun Wen, Kaile Wang, Baole Lu, Haowei Chen, Jintao Bai

    Abstract: Generally speaking, the self-sweeping effect relies on the dynamical grating formed in a gain fiber. Here, the normal self-sweeping was generated in a pump-free ytterbium-doped fiber which serves as a fiber saturable absorber and is introduced to the laser cavity by a circulator in this experiment. The sweeping rate and the sweeping range alter as usual, both of which can be controlled by the pump… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  8. arXiv:2010.14798  [pdf, other

    cs.SD cs.CL eess.AS

    Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition

    Authors: Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi wen

    Abstract: Despite the recent significant advances witnessed in end-to-end (E2E) ASR system for code-switching, hunger for audio-text paired data limits the further improvement of the models' performance. In this paper, we propose a decoupled transformer model to use monolingual paired data and unpaired text data to alleviate the problem of code-switching data shortage. The model is decoupled into two parts:… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 5 pages, 1 figures

  9. arXiv:2010.14791  [pdf, other

    eess.AS

    One In A Hundred: Select The Best Predicted Sequence from Numerous Candidates for Streaming Speech Recognition

    Authors: Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen

    Abstract: The RNN-Transducers and improved attention-based encoder-decoder models are widely applied to streaming speech recognition. Compared with these two end-to-end models, the CTC model is more efficient in training and inference. However, it cannot capture the linguistic dependencies between the output tokens. Inspired by the success of two-pass end-to-end models, we introduce a transformer decoder an… ▽ More

    Submitted 3 April, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

  10. arXiv:2010.12356  [pdf, ps, other

    math.CV

    Meromorphic functions of finite $\varphi$-order and linear $q$-difference equations

    Authors: Janne Heittokangas, Jun Wang, Zhi-Tao Wen, Hui Yu

    Abstract: The $\varphi$-order was introduced in 2009 for meromorphic functions in the unit disc, and was used as a growth indicator for solutions of linear differential equations. In this paper, the properties of meromorphic functions in the complex plane are investigated in terms of the $\varphi$-order, which measures the growth of functions between the classical order and the logarithmic order. Several re… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 28 pages

    MSC Class: 39A13

  11. Photometric redshifts for galaxies in the Subaru Hyper Suprime-Cam and unWISE and a catalogue of identified clusters of galaxies

    Authors: Z. L. Wen, J. L. Han

    Abstract: We first present a catalogue of photometric redshifts for 14.68 million galaxies derived from the 7-band photometric data of Hyper Suprime-Cam Subaru Strategic Program and the Wide-field Infrared Survey Explorer using the nearest-neighbour algorithm. The redshift uncertainty is about 0.024 for galaxies of z<0.7, and steadily increases with redshift to about 0.11 at z~2. From such a large data set,… ▽ More

    Submitted 19 November, 2020; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 16 pages, 18 figures, 2 tables, updated version after proof checking, online data available

  12. arXiv:2010.02330  [pdf, other

    cs.CV

    A Benchmark and Baseline for Language-Driven Image Editing

    Authors: Jing Shi, Ning Xu, Trung Bui, Franck Dernoncourt, Zheng Wen, Chenliang Xu

    Abstract: Language-driven image editing can significantly save the laborious image editing work and be friendly to the photography novice. However, most similar work can only deal with a specific image domain or can only do global retouching. To solve this new task, we first present a new language-driven image editing dataset that supports both local and global editing with editing operation and mask annota… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted by ACCV 2020

  13. arXiv:2009.03140  [pdf, ps, other

    eess.SP cs.LG

    Edge Learning with Unmanned Ground Vehicle: Joint Path, Energy and Sample Size Planning

    Authors: Dan Liu, Shuai Wang, Zhigang Wen, Lei Cheng, Miaowen Wen, Yik-Chung Wu

    Abstract: Edge learning (EL), which uses edge computing as a platform to execute machine learning algorithms, is able to fully exploit the massive sensing data generated by Internet of Things (IoT). However, due to the limited transmit power at IoT devices, collecting the sensing data in EL systems is a challenging task. To address this challenge, this paper proposes to integrate unmanned ground vehicle (UG… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 16 pages, 6 figures, to appear in IEEE Internet of Things Journal

  14. arXiv:2008.09976  [pdf, other

    cs.SE cs.CL

    Emerging App Issue Identification via Online Joint Sentiment-Topic Tracing

    Authors: Cuiyun Gao, Jichuan Zeng, Zhiyuan Wen, David Lo, Xin Xia, Irwin King, Michael R. Lyu

    Abstract: Millions of mobile apps are available in app stores, such as Apple's App Store and Google Play. For a mobile app, it would be increasingly challenging to stand out from the enormous competitors and become prevalent among users. Good user experience and well-designed functionalities are the keys to a successful app. To achieve this, popular apps usually schedule their updates frequently. If we can… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

  15. arXiv:2008.07353  [pdf, ps, other

    cs.LG cs.AI cs.DS stat.ML

    On the Sample Complexity of Reinforcement Learning with Policy Space Generalization

    Authors: Wenlong Mou, Zheng Wen, Xi Chen

    Abstract: We study the optimal sample complexity in large-scale Reinforcement Learning (RL) problems with policy space generalization, i.e. the agent has a prior knowledge that the optimal policy lies in a known policy space. Existing results show that without a generalization model, the sample complexity of an RL algorithm will inevitably depend on the cardinalities of state space and action space, which a… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

  16. arXiv:2008.03942  [pdf, other

    eess.SP

    Joint Bandwidth Allocation and Path Selection in WANs with Path Cardinality Constraints

    Authors: Jinxin Wang, Fan Zhang, Zhonglin Xie, Gong Zhang, Zaiwen Wen

    Abstract: In this paper, we study a joint bandwidth allocation and path selection problem via solving a multi-objective minimization problem under the path cardinality constraints, namely MOPC. Our problem formulation captures various types of objectives including the proportional fairness, the total completion time, as well as the worst-case link utilization ratio. Such an optimization problem is very chal… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: Submitted to IEEE TSP and being under review

  17. arXiv:2007.15788  [pdf, other

    stat.ML cs.LG

    Stochastic Low-rank Tensor Bandits for Multi-dimensional Online Decision Making

    Authors: Jie Zhou, Botao Hao, Zheng Wen, Jingfei Zhang, Will Wei Sun

    Abstract: Multi-dimensional online decision making plays a crucial role in many real applications such as online recommendation and digital marketing. In these problems, a decision at each time is a combination of choices from different types of entities. To solve it, we introduce stochastic low-rank tensor bandits, a class of bandits whose mean rewards can be represented as a low-rank tensor. We consider t… ▽ More

    Submitted 13 February, 2024; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Accepted by Journal of the American Statistical Association

  18. Narrow bandwidth Q-switched Erbium-doped fiber laser based on dynamic saturable absorption filtering effect

    Authors: Zengrun Wen, Kaile Wang, Shuangcheng Chen, Xinyuan Qi, Baole Lu, Jintao Bai

    Abstract: We proposed a narrow spectral bandwidth Erbium-doped fiber (EDF) laser Q-switched by a homemade saturable dynamic induced grating (SDIG) which is introduced via reforming the structure of a fiber saturable absorbers FSA with a piece of EDF and a fiber Bragg grating. The SDIG integrates both saturable absorption and spectral filtering effect simultaneously, which was confirmed through theoretical a… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  19. arXiv:2007.06202  [pdf, ps, other

    cs.AI math.OC

    Structured Policy Iteration for Linear Quadratic Regulator

    Authors: Youngsuk Park, Ryan A. Rossi, Zheng Wen, Gang Wu, Handong Zhao

    Abstract: Linear quadratic regulator (LQR) is one of the most popular frameworks to tackle continuous Markov decision process tasks. With its fundamental theory and tractable optimal policy, LQR has been revisited and analyzed in recent years, in terms of reinforcement learning scenarios such as the model-free or model-based setting. In this paper, we introduce the \textit{Structured Policy Iteration} (S-PI… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

  20. arXiv:2007.04915  [pdf, other

    cs.LG stat.ML

    Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems

    Authors: Tong Yu, Branislav Kveton, Zheng Wen, Ruiyi Zhang, Ole J. Mengshoel

    Abstract: We propose a novel framework for structured bandits, which we call an influence diagram bandit. Our framework captures complex statistical dependencies between actions, latent variables, and observations; and thus unifies and extends many existing models, such as combinatorial semi-bandits, cascading bandits, and low-rank bandits. We develop novel online learning algorithms that learn to act effic… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

  21. Giant micropulse emission in the Vela pulsar at C band

    Authors: J. L. Chen, Z. G. Wen, L. F. Hao, J. P. Yuan, J. Li, H. G. Wang, W. M. Yan, K. J. Lee, N. Wang, Y. H. Xu, Z. X. Li, Y. X. Huang, R. Yuen, M. Mijit

    Abstract: We present here the analysis of giant micropulses from the Vela pulsar. A total of 4187 giant micropulses with peak flux density $>$2.5 Jy were detected during almost 4 hours of observations carried out with the Yunnan 40-m radio telescope at 6800 MHz. Nine of the giant micropulses arrived approximately 3 to 4 ms earlier than the peak of average pulse profile, longer than that at lower frequencies… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 9 pages, 8 figures

  22. arXiv:2007.03861  [pdf, other

    math.OC

    On the Analysis of Model-free Methods for the Linear Quadratic Regulator

    Authors: Zeyu Jin, Johann Michael Schmitt, Zaiwen Wen

    Abstract: Many reinforcement learning methods achieve great success in practice but lack theoretical foundation. In this paper, we study the convergence analysis on the problem of the Linear Quadratic Regulator (LQR). The global linear convergence properties and sample complexities are established for several popular algorithms such as the policy gradient algorithm, TD-learning and the actor-critic (AC) alg… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

  23. arXiv:2006.09606  [pdf, other

    math.OC stat.ML

    Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods

    Authors: Minghan Yang, Dong Xu, Hongyu Chen, Zaiwen Wen, Mengyun Chen

    Abstract: In this paper, we consider stochastic second-order methods for minimizing a finite summation of nonconvex functions. One important key is to find an ingenious but cheap scheme to incorporate local curvature information. Since the true Hessian matrix is often a combination of a cheap part and an expensive part, we propose a structured stochastic quasi-Newton method by using partial Hessian informat… ▽ More

    Submitted 25 March, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  24. arXiv:2006.07464  [pdf, other

    cs.LG math.OC stat.ML

    Hypermodels for Exploration

    Authors: Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy

    Abstract: We study the use of hypermodels to represent epistemic uncertainty and guide exploration. This generalizes and extends the use of ensembles to approximate Thompson sampling. The computational cost of training an ensemble grows with its size, and as such, prior work has typically been limited to ensembles with tens of elements. We show that alternative hypermodels can enjoy dramatic efficiency gain… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: Published as a conference paper at ICLR 2020

  25. arXiv:2006.05924  [pdf, ps, other

    math.OC stat.ML

    Sketchy Empirical Natural Gradient Methods for Deep Learning

    Authors: Minghan Yang, Dong Xu, Zaiwen Wen, Mengyun Chen, Pengxiang Xu

    Abstract: In this paper, we develop an efficient sketchy empirical natural gradient method (SENG) for large-scale deep learning problems. The empirical Fisher information matrix is usually low-rank since the sampling is only practical on a small amount of data at each iteration. Although the corresponding natural gradient direction lies in a small subspace, both the computational cost and memory requirement… ▽ More

    Submitted 25 March, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

  26. arXiv:2006.03857  [pdf, other

    cs.AI cs.CY

    EPARS: Early Prediction of At-risk Students with Online and Offline Learning Behaviors

    Authors: Yu Yang, Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Hongzhi Yin, Xiaofang Zhou

    Abstract: Early prediction of students at risk (STAR) is an effective and significant means to provide timely intervention for dropout and suicide. Existing works mostly rely on either online or offline learning behaviors which are not comprehensive enough to capture the whole learning processes and lead to unsatisfying prediction performance. We propose a novel algorithm (EPARS) that could early predict ST… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: To be published in DASFAA 2020

  27. arXiv:2005.08529  [pdf, other

    physics.optics

    Discretized optical dynamics in one-dimensionally synthetic photonic lattice

    Authors: Zengrun Wen, Kaile Wang, Baole Lu, Xinyuan Qi, Haowei Chen, Jintao Bai

    Abstract: Synthetic photonic lattice with temporally controlled potentials is a versatile platform for realizing wave dynamics associated with physical areas of optics and quantum physics. Here, discrete optics in one-dimensionally synthetic photonic lattice is investigated systematically, in which the light behavior is highly similar to those in evanescently coupled one-dimensional discrete waveguides. Suc… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

  28. arXiv:2005.07903  [pdf, other

    eess.AS cs.CL cs.SD

    Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition

    Authors: Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen

    Abstract: Non-autoregressive transformer models have achieved extremely fast inference speed and comparable performance with autoregressive sequence-to-sequence models in neural machine translation. Most of the non-autoregressive transformers decode the target sequence from a predefined-length mask sequence. If the predefined length is too long, it will cause a lot of redundant calculations. If the predefin… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: 5 pages

  29. arXiv:2005.07326  [pdf

    physics.flu-dyn physics.comp-ph

    How does boiling occur in lattice Boltzmann simulations?

    Authors: Qing Li, Y. Yu, Z. X. Wen

    Abstract: In recent years, the lattice Boltzmann (LB) method has been widely employed to simulate boiling phenomena [A. Márkus and G. Házi, Phys. Rev. E 83, 046705 (2011); Biferale et al., Phys. Rev. Lett. 108, 104502 (2012); Li et al., Phys. Rev. E 96, 063303 (2017); Wu et al., Int. J. Heat Mass Transfer 126, 773 (2018)]. However, a very important issue still remains open, i.e., how does boiling occur in t… ▽ More

    Submitted 20 May, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: 10 figures

    Journal ref: Physics of Fluids 32, 093306 (2020)

  30. arXiv:2005.05602  [pdf, ps, other

    physics.optics

    Synthetic topological insulator with periodically modulated effective gauge fields

    Authors: Zengrun Wen, Baole Lu, Kaiwen Ji, Kaile Wang, Haowei Chen, Xinyuan Qi, Jintao Bai

    Abstract: We study both theoretically and numerically the topological edge states in synthetic photonic lattice with finitely periodic gauge potentials. The effective gauge fields are implemented by tailoring the phase alternatively and periodically, which finally results in symmetric total reflection at two boundaries of the one-dimensional synthetic lattice. Further tuning the nearest-neighbor coupling an… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

  31. arXiv:2005.04862  [pdf, other

    eess.AS cs.CL

    Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition

    Authors: Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang

    Abstract: Although attention based end-to-end models have achieved promising performance in speech recognition, the multi-pass forward computation in beam-search increases inference time cost, which limits their practical applications. To address this issue, we propose a non-autoregressive end-to-end speech recognition system called LASO (listen attentively, and spell once). Because of the non-autoregressiv… ▽ More

    Submitted 5 August, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: accepted by INTERSPEECH2020

  32. arXiv:2005.01279  [pdf, other

    cs.CL cs.LG

    Improving Adversarial Text Generation by Modeling the Distant Future

    Authors: Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen, Lawrence Carin

    Abstract: Auto-regressive text generation models usually focus on local fluency, and may cause inconsistent semantic meaning in long text generation. Further, automatically generating words with similar semantics is challenging, and hand-crafted linguistic rules are difficult to apply. We consider a text planning scheme and present a model-based imitation-learning approach to alleviate the aforementioned is… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: ACL 2020. arXiv admin note: substantial text overlap with arXiv:1811.00696

  33. arXiv:2004.13826  [pdf, other

    cs.CL

    Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks

    Authors: Yufeng Zhang, Xueli Yu, Zeyu Cui, Shu Wu, Zhongzhen Wen, Liang Wang

    Abstract: Text classification is fundamental in natural language processing (NLP), and Graph Neural Networks (GNN) are recently applied in this task. However, the existing graph-based works can neither capture the contextual word relationships within each document nor fulfil the inductive learning of new words. In this work, to overcome such problems, we propose TextING for inductive text classification via… ▽ More

    Submitted 12 May, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: To appear at ACL 2020

  34. arXiv:2004.02420  [pdf, other

    eess.AS cs.LG cs.SD

    Simultaneous Denoising and Dereverberation Using Deep Embedding Features

    Authors: Cunhang Fan, Jianhua Tao, Bin Liu, Jiangyan Yi, Zhengqi Wen

    Abstract: Monaural speech dereverberation is a very challenging task because no spatial cues can be used. When the additive noises exist, this task becomes more challenging. In this paper, we propose a joint training method for simultaneous speech denoising and dereverberation using deep embedding features, which is based on the deep clustering (DC). DC is a state-of-the-art method for speech separation tha… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  35. GIANT: Scalable Creation of a Web-scale Ontology

    Authors: Bang Liu, Weidong Guo, Di Niu, Jinwen Luo, Chaoyue Wang, Zhen Wen, Yu Xu

    Abstract: Understanding what online users may pay attention to is key to content recommendation and search services. These services will benefit from a highly structured and web-scale ontology of entities, concepts, events, topics and categories. While existing knowledge bases and taxonomies embody a large volume of entities and categories, we argue that they fail to discover properly grained concepts, even… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: Accepted as full paper by SIGMOD 2020

  36. Discovery of delayed spin-up behavior following two large glitches in the Crab pulsar, and the statistics of such processes

    Authors: M. Y. Ge, S. N. Zhang, F. J. Lu, T. P. Li, J. P. Yuan, X. P. Zheng, Y. Huang, S. J. Zheng, Y. P. Chen, Z. Chang, Y. L. Tuo, Q. Cheng, C. Güngör, L. M. Song, Y. P. Xu, X. L. Cao, Y. Chen, C. Z. Liu, S. Zhang, J. L. Qu, Q. C. Bu, C. Cai, G. Chen, L. Chen, M. Z. Chen , et al. (111 additional authors not shown)

    Abstract: Glitches correspond to sudden jumps of rotation frequency ($ν$) and its derivative ($\dotν$) of pulsars, the origin of which remains not well understood yet, partly because the jump processes of most glitches are not well time-resolved. There are three large glitches of the Crab pulsar, detected in 1989, 1996 and 2017, which were found to have delayed spin-up processes before the normal recovery p… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: 25 pages, 8 figures

  37. A modular phantom and software to characterize 3D geometric distortion in MRI

    Authors: Jordan M. Slagowski, Yao Ding, Manik Aima, Zhifei Wen, Clifton D. Fuller, Caroline Chung, J. Matthew Debnam, Ken-Pin Hwang, Mo Kadbi, Janio Szklaruk, Jihong Wang

    Abstract: MRI offers outstanding soft tissue contrast that may reduce uncertainties in target and organ-at-risk delineation and enable online adaptive image-guided treatment. Spatial distortions resulting from non-linearities in the gradient fields and non-uniformity in the main magnetic field must be accounted for across the imaging field-of-view to prevent systematic errors during treatment delivery. This… ▽ More

    Submitted 6 April, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: 25 pages

  38. arXiv:2003.07544  [pdf, other

    eess.AS cs.MM cs.SD

    Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method

    Authors: Cunhang Fan, Jianhua Tao, Bin Liu, Jiangyan Yi, Zhengqi Wen, Xuefei Liu

    Abstract: In this paper, we propose an end-to-end post-filter method with deep attention fusion features for monaural speaker-independent speech separation. At first, a time-frequency domain speech separation method is applied as the pre-separation stage. The aim of pre-separation stage is to separate the mixture preliminarily. Although this stage can separate the mixture, it still contains the residual int… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: ACCEPTED by IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)

  39. arXiv:2003.00739  [pdf, other

    cs.CV cs.CL

    Long Short-Term Sample Distillation

    Authors: Liang Jiang, Zujie Wen, Zhongping Liang, Yafang Wang, Gerard de Melo, Zhe Li, Liangzhuang Ma, Jiaxing Zhang, Xiaolong Li, Yuan Qi

    Abstract: In the past decade, there has been substantial progress at training increasingly deep neural networks. Recent advances within the teacher--student training paradigm have established that information about past training updates show promise as a source of guidance during subsequent training steps. Based on this notion, in this paper, we propose Long Short-Term Sample Distillation, a novel training… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: published as a conference paper at AAAI 2020

  40. arXiv:2002.08513  [pdf, ps, other

    math.OC

    A Trust-Region Method For Nonsmooth Nonconvex Optimization

    Authors: Ziang Chen, Andre Milzarek, Zaiwen Wen

    Abstract: We propose a trust-region type method for a class of nonsmooth nonconvex optimization problems where the objective function is a summation of a (probably nonconvex) smooth function and a (probably nonsmooth) convex function. The model function of our trust-region subproblem is always quadratic and the linear term of the model is generated using abstract descent directions. Therefore, the trust-reg… ▽ More

    Submitted 23 October, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

  41. arXiv:2002.06979  [pdf, ps, other

    cs.LG stat.ML

    Convergence of End-to-End Training in Deep Unsupervised Contrastive Learning

    Authors: Zixin Wen

    Abstract: Unsupervised contrastive learning has gained increasing attention in the latest research and has proven to be a powerful method for learning representations from unlabeled data. However, little theoretical analysis was known for this framework. In this paper, we study the optimization of deep unsupervised contrastive learning. We prove that, by applying end-to-end training that simultaneously upda… ▽ More

    Submitted 30 May, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

  42. arXiv:2002.04398  [pdf, other

    quant-ph hep-th math-ph

    PT-symmetric potentials having continuous spectra

    Authors: Zichao Wen, Carl M. Bender

    Abstract: One-dimensional PT-symmetric quantum-mechanical Hamiltonians having continuous spectra are studied. The Hamiltonians considered have the form $H=p^2+V(x)$, where $V(x)$ is odd in $x$, pure imaginary, and vanishes as $|x|\to\infty$. Five PT-symmetric potentials are studied: the Scarf-II potential $V_1(x)=iA_1\,{\rm sech}(x)\tanh(x)$, which decays exponentially for large $|x|$; the rational potentia… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: 15 pages, 9 figures

  43. arXiv:2002.01626  [pdf, other

    eess.AS cs.LG cs.SD

    Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features

    Authors: Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen

    Abstract: Multi-channel deep clustering (MDC) has acquired a good performance for speech separation. However, MDC only applies the spatial features as the additional information. So it is difficult to learn mutual relationship between spatial and spectral features. Besides, the training objective of MDC is defined at embedding vectors, rather than real separated sources, which may damage the separation perf… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

  44. arXiv:2001.09624  [pdf, other

    cs.CR

    SecEL: Privacy-Preserving, Verifiable and Fault-Tolerant Edge Learning for Autonomous Vehicles

    Authors: Jiasi Weng, Jian Weng, Yue Zhang, Ming Li, Zhaodi Wen

    Abstract: Mobile edge computing (MEC) is an emerging technology to transform the cloud-based computing services into the edge-based ones. Autonomous vehicular network (AVNET), as one of the most promising applications of MEC, can feature edge learning and communication techniques, improving the safety for autonomous vehicles (AVs). This paper focuses on the edge learning in AVNET, where AVs at the edge of t… ▽ More

    Submitted 16 February, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

  45. arXiv:2001.06944  [pdf, other

    cs.CL cs.LG

    Nested-Wasserstein Self-Imitation Learning for Sequence Generation

    Authors: Ruiyi Zhang, Changyou Chen, Zhe Gan, Zheng Wen, Wenlin Wang, Lawrence Carin

    Abstract: Reinforcement learning (RL) has been widely studied for improving sequence-generation models. However, the conventional rewards used for RL training typically cannot capture sufficient semantic information and therefore render model bias. Further, the sparse and delayed rewards make RL exploration inefficient. To alleviate these issues, we propose the concept of nested-Wasserstein distance for dis… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: Accepted by AISTATS2020

  46. arXiv:2001.01594  [pdf

    physics.flu-dyn physics.app-ph

    Enhancement of nucleate boiling by combining the effects of surface structure and mixed wettability: A lattice Boltzmann study

    Authors: W. X. Li, Q. Li, Y. Yu, Z. X. Wen

    Abstract: The combination of microstructures and mixed wettability for enhancing nucleate boiling has attracted much attention in recent years. However, in the existing experimental and numerical studies, the tops of microstructures are entirely subjected to wettability modification, which makes the influences of mixed wettability dependant on the characteristic length of microstructures. In order to disclo… ▽ More

    Submitted 3 May, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: 15 figures

    Journal ref: Applied Thermal Engineering 180 (2020) 115849

  47. arXiv:1912.04843  [pdf

    eess.SP

    A novel generative reverse net assisted evolution algorithm for expensive-computational optimizations

    Authors: Yu Li, Hu Wang, Ziming Wen, Xin Wang

    Abstract: Simulation-based optimization is a useful method for practical design problems. However, it is difficult for complicated problems due to expensive-computational costs. A popular way to overcome this issue is to use a surrogate model to save the cost. Nevertheless, limited design parameters those are input to traditional surrogate models can difficultly represent the whole design problem, which mig… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

  48. arXiv:1912.02958  [pdf, other

    eess.AS cs.CL cs.LG

    Synchronous Transformers for End-to-End Speech Recognition

    Authors: Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen

    Abstract: For most of the attention-based sequence-to-sequence models, the decoder predicts the output sequence conditioned on the entire input sequence processed by the encoder. The asynchronous problem between the encoding and decoding makes these models difficult to be applied for online speech recognition. In this paper, we propose a model named synchronous transformer to address this problem, which can… ▽ More

    Submitted 23 February, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Accepted by ICASSP 2020

  49. arXiv:1912.01777  [pdf, other

    eess.AS cs.CL cs.SD

    Integrating Knowledge into End-to-End Speech Recognition from External Text-Only Data

    Authors: Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Shuai Zhang

    Abstract: Attention-based encoder-decoder (AED) models have achieved promising performance in speech recognition. However, because of the end-to-end training, an AED model is usually trained with speech-text paired data. It is challenging to incorporate external text-only data into AED models. Another issue of the AED model is that it does not use the right context of a text token while predicting the token… ▽ More

    Submitted 15 March, 2021; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Submitted TASLP

  50. Periodic mode changing in PSR J1048-5832

    Authors: W. M. Yan, R. N. Manchester, N. Wang, Z. G. Wen, J. P. Yuan, K. J. Lee, J. L. Chen

    Abstract: By analysing the data acquired from the Parkes 64-m radio telescope at 1369 MHz, we report on the phase-stationary non-drift amplitude modulation observed in PSR J1048-5832. The high-sensitivity observations revealed that the central and trailing components of the pulse profile of this pulsar switch between a strong mode and a weak mode periodically. However, the leading component remains unchange… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: 8 pages, 8 figures, 3 tables, accepted in MNRAS for publication