Showing 201–250 of 551 results for author: Yan, M

  1. arXiv:2204.02372  [pdf, other]

    cs.LG

    Jump-Start Reinforcement Learning

    Authors: Ikechukwu Uchendu, Ted Xiao, Yao Lu, Banghua Zhu, Mengyuan Yan, Joséphine Simon, Matthew Bennice, Chuyuan Fu, Cong Ma, Jiantao Jiao, Sergey Levine, Karol Hausman

    Abstract: Reinforcement learning (RL) provides a theoretical framework for continuously improving an agent's behavior via trial and error. However, efficiently learning policies from scratch can be very difficult, particularly for tasks with exploration challenges. In such settings, it might be desirable to initialize RL with an existing policy, offline data, or demonstrations. However, naively performing s…

    Submitted 7 July, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: 20 pages, 10 figures

  2. arXiv:2204.01691  [pdf, other]

    cs.RO cs.CL cs.LG

    Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

    Authors: Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee, et al. (20 additional authors not shown)

    Abstract: Large language models can encode a wealth of semantic knowledge about the world. Such knowledge could be extremely useful to robots aiming to act upon high-level, temporally extended instructions expressed in natural language. However, a significant weakness of language models is that they lack real-world experience, which makes it difficult to leverage them for decision making within a given embo…

    Submitted 16 August, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: See website at https://say-can.github.io/ V1. Initial Upload. V2. Added PaLM results. Added study about new capabilities (drawer manipulation, chain of thought prompting, multilingual instructions). Added an ablation study of language model size. Added an open-source version of \algname on a simulated tabletop environment. Improved readability

  3. arXiv:2203.15442  [pdf, other]

    cs.CV cs.MM

    Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding

    Authors: Jiabo Ye, Junfeng Tian, Ming Yan, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin

    Abstract: Visual grounding focuses on establishing fine-grained alignment between vision and natural language, which has essential applications in multimodal reasoning systems. Existing methods use pre-trained query-agnostic visual backbones to extract visual feature maps independently without considering the query information. We argue that the visual features extracted from the visual backbones and the fe…

    Submitted 29 March, 2022; originally announced March 2022.

  4. arXiv:2203.14600  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Observation of quadruple Weyl point in hybrid-Weyl phononic crystals

    Authors: Li Luo, Weiyin Deng, Yating Yang, Mou Yan, Jiuyang Lu, Xueqin Huang, Zhengyou Liu

    Abstract: The discovery of Weyl semimetals opens the door for searching topological semimetals in physical science. The Weyl points are generally recognized as conventional, quadratic, spin-1, and those of high topological charges. Here we report the observation of the quadruple Weyl point of charge 4, the highest topological charge a twofold degenerate node can carry. Besides the quadruple Weyl point, the…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 15 pages, 3 figures, and 1 table

  5. arXiv:2202.11343  [pdf, other]

    cs.AR cs.DC

    Alleviating Datapath Conflicts and Design Centralization in Graph Analytics Acceleration

    Authors: Haiyang Lin, Mingyu Yan, Duo Wang, Mo Zou, Fengbin Tu, Xiaochun Ye, Dongrui Fan, Yuan Xie

    Abstract: Previous graph analytics accelerators have achieved great improvement on throughput by alleviating irregular off-chip memory accesses. However, on-chip side datapath conflicts and design centralization have become the critical issues hindering further throughput improvement. In this paper, a general solution, Multiple-stage Decentralized Propagation network (MDP-network), is proposed to address th…

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: To Appear in 59th Design Automation Conference (DAC 2022)

  6. arXiv:2202.04822  [pdf, other]

    cs.LG cs.AI

    Survey on Graph Neural Network Acceleration: An Algorithmic Perspective

    Authors: Xin Liu, Mingyu Yan, Lei Deng, Guoqi Li, Xiaochun Ye, Dongrui Fan, Shirui Pan, Yuan Xie

    Abstract: Graph neural networks (GNNs) have been a hot spot of recent research and are widely utilized in diverse applications. However, with the use of huger data and deeper models, an urgent demand is unsurprisingly made to accelerate GNNs for more efficient execution. In this paper, we provide a comprehensive survey on acceleration methods for GNNs from an algorithmic perspective. We first present a new…

    Submitted 24 April, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Accepted by International Joint Conference on Artificial Intelligence (IJCAI-22)

  7. arXiv:2202.01382  [pdf, other]

    physics.flu-dyn physics.geo-ph

    Asymptotic behaviour of rotating convection-driven dynamos in the plane layer geometry

    Authors: Ming Yan, Michael A. Calkins

    Abstract: Dynamos driven by rotating convection in the plane layer geometry are investigated numerically for a range of Ekman number ($E$), magnetic Prandtl number ($Pm$) and Rayleigh number ($Ra$). The primary purpose of the investigation is to compare results of the simulations with previously developed asymptotic theory that is applicable in the limit of rapid rotation. We find that all of the simulation…

    Submitted 2 August, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 38 pages, 13 figures

  8. arXiv:2201.12667  [pdf, other]

    cs.DC cs.LG

    Distributed SLIDE: Enabling Training Large Neural Networks on Low Bandwidth and Simple CPU-Clusters via Model Parallelism and Sparsity

    Authors: Minghao Yan, Nicholas Meisburger, Tharun Medini, Anshumali Shrivastava

    Abstract: More than 70% of cloud computing is paid for but sits idle. A large fraction of these idle compute are cheap CPUs with few cores that are not utilized during the less busy hours. This paper aims to enable those CPU cycles to train heavyweight AI models. Our goal is against mainstream frameworks, which focus on leveraging expensive specialized ultra-high bandwidth interconnect to address the commun…

    Submitted 29 January, 2022; originally announced January 2022.

  9. arXiv:2201.11313  [pdf, other]

    cs.CL cs.IR

    Learning Deep Semantic Model for Code Search using CodeSearchNet Corpus

    Authors: Chen Wu, Ming Yan

    Abstract: Semantic code search is the task of retrieving relevant code snippet given a natural language query. Different from typical information retrieval tasks, code search requires to bridge the semantic gap between the programming language and natural language, for better describing intrinsic concepts and semantics. Recently, deep neural network for code search has been a hot research topic. Typical met…

    Submitted 26 January, 2022; originally announced January 2022.

  10. arXiv:2201.00139  [pdf, other]

    math.OC math.NA

    On the improved conditions for some primal-dual algorithms

    Authors: Yao Li, Ming Yan

    Abstract: The convex minimization of $f(\mathbf{x})+g(\mathbf{x})+h(\mathbf{A}\mathbf{x})$ over $\mathbb{R}^n$ with differentiable $f$ and linear operator $\mathbf{A}: \mathbb{R}^n\rightarrow \mathbb{R}^m$, has been well-studied in the literature. By considering the primal-dual optimality of the problem, many algorithms are proposed from different perspectives such as monotone operator scheme and fixed poin…

    Submitted 1 January, 2022; originally announced January 2022.

  11. Correlations in the Electronic Structure of van der Waals NiPS$_3$ Crystals: An X-Ray Absorption and Resonant Photoelectron Spectroscopy Study

    Authors: Mouhui Yan, Yichen Jin, Zhicheng Wu, Arshak Tsaturyan, Anna Makarova, Dmitry Smirnov, Elena Voloshina, Yuriy Dedkov

    Abstract: The electronic structure of high-quality van der Waals NiPS$_3$ crystals was studied using near-edge x-ray absorption spectroscopy (NEXAFS) and resonant photoelectron spectroscopy (ResPES) in combination with density functional theory (DFT) approach. The experimental spectroscopic methods, being element specific, allow to discriminate between atomic contributions in the valence and conduction band…

    Submitted 22 December, 2021; originally announced December 2021.

    Journal ref: J. Phys. Chem. Lett. 12, 2400 (2021)

  12. arXiv:2112.12529  [pdf, other]

    cond-mat.mtrl-sci

    Mott-Hubbard Insulating State for the Layered van der Waals FePX$_3$ (X:S, Se) As Revealed by NEXAFS and Resonant Photoelectron Spectroscopy

    Authors: Yichen Jin, Mouhui Yan, Tomislav Kremer, Elena Voloshina, Yuriy Dedkov

    Abstract: A broad family of the nowadays studied low-dimensional systems, including 2D materials, demonstrate many fascinating properties, which however depend on the atomic composition as well as on the system dimensionality. Therefore, the studies of the electronic correlation effects in the new 2D materials is of paramount importance for the understanding of their transport, optical and catalytic propert…

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in Sci. Rep. (22.12.2021)

  13. Topological Quasi-2D Semimetal Co$_3$Sn$_2$S$_2$: Insights To Electronic Structure From NEXAFS and Resonant Photoelectron Spectroscopy

    Authors: Mouhui Yan, Yichen Jin, Xiaofei Hou, Yanfeng Guo, Arshak Tsaturyan, Anna Makarova, Dmitry Smirnov, Yuriy Dedkov, Elena Voloshina

    Abstract: The electronic structure of the natural topological semimetal Co$_3$Sn$_2$S$_2$ crystals was studied using near-edge x-ray absorption spectroscopy (NEXAFS) and resonant photoelectron spectroscopy (ResPES). Although, the significant increase of the Co\,$3d$ valence band emission is observed at the Co\,$2p$ absorption edge in the ResPES experiments, the spectral weight at these photon energies is do…

    Submitted 22 December, 2021; originally announced December 2021.

    Journal ref: J. Phys. Chem. Lett. 12, 9807 (2021)

  14. To the synthesis and characterization of layered metal phosphorus triselenides proposed for electrochemical sensing and energy applications

    Authors: Yuriy Dedkov, Mouhui Yan, Elena Voloshina

    Abstract: Recent studies reported on the synthesis and characterization of several bulk crystals of layered metal triselenophosphites MPSe$_3$ (M = transition metals). In these works characterization was performed via a combination of different bulk- and surface-sensitive experimental methods accompanied by DFT calculations. However, the critical examination of the available experimental and theoretical dat…

    Submitted 22 December, 2021; originally announced December 2021.

    Journal ref: Chem. Phys. Lett. 754, 137627 (2020)

  15. Domain Adaptation on Point Clouds via Geometry-Aware Implicits

    Authors: Yuefan Shen, Yanchao Yang, Mi Yan, He Wang, Youyi Zheng, Leonidas Guibas

    Abstract: As a popular geometric representation, point clouds have attracted much attention in 3D vision, leading to many applications in autonomous driving and robotics. One important yet unsolved issue for learning on point cloud is that point clouds of the same object can have significant geometric variations if generated using different procedures or captured using different sensors. These inconsistenci…

    Submitted 17 December, 2021; originally announced December 2021.

  16. arXiv:2111.08896  [pdf, other]

    cs.CL cs.CV

    Achieving Human Parity on Visual Question Answering

    Authors: Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Weihua Chen, Xianzhe Xu, Fan Wang, Zheng Cao, Zhicheng Zhang, Qiyu Zhang, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin

    Abstract: The Visual Question Answering (VQA) task utilizes both visual image and language analysis to answer a textual question with respect to an image. It has been a popular research topic with an increasing number of real-world applications in the last decade. This paper describes our recent research of AliceMind-MMU (ALIbaba's Collection of Encoder-decoders from Machine IntelligeNce lab of Damo academy…

    Submitted 19 November, 2021; v1 submitted 16 November, 2021; originally announced November 2021.

  17. arXiv:2111.07549  [pdf, other]

    cs.CL cs.SD eess.AS

    Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data

    Authors: Zhu Li, Yuqing Zhang, Mengxi Nie, Ming Yan, Mengnan He, Ruixiong Zhang, Caixia Gong

    Abstract: Recent advancements in end-to-end speech synthesis have made it possible to generate highly natural speech. However, training these models typically requires a large amount of high-fidelity speech data, and for unseen texts, the prosody of synthesized speech is relatively unnatural. To address these issues, we propose to combine a fine-tuned BERT-based front-end with a pre-trained FastSpeech2-base…

    Submitted 15 November, 2021; originally announced November 2021.

  18. arXiv:2111.05424  [pdf, other]

    cs.RO

    AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale

    Authors: Yao Lu, Karol Hausman, Yevgen Chebotar, Mengyuan Yan, Eric Jang, Alexander Herzog, Ted Xiao, Alex Irpan, Mohi Khansari, Dmitry Kalashnikov, Sergey Levine

    Abstract: Robotic skills can be learned via imitation learning (IL) using user-provided demonstrations, or via reinforcement learning (RL) using large amounts of autonomously collected experience. Both methods have complementary strengths and weaknesses: RL can reach a high level of performance, but requires exploration, which can be very time consuming and unsafe; IL does not require exploration, but only learn…

    Submitted 11 November, 2021; v1 submitted 9 November, 2021; originally announced November 2021.

  19. arXiv:2110.14721  [pdf, other]

    physics.flu-dyn

    Quasi-static magnetoconvection with a tilted magnetic field

    Authors: Justin A. Nicoski, Ming Yan, Michael A. Calkins

    Abstract: A numerical study of convection with stress-free boundary conditions in the presence of an imposed magnetic field that is tilted with respect to the direction of gravity is carried out in the limit of small magnetic Reynolds number. The dynamics are investigated over a range of Rayleigh number $Ra$ and Chandrasekhar numbers up to $Q = 2\times10^6$, with the tilt angle between the gravity vector an…

    Submitted 2 November, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 30 pages, 16 figures

  20. arXiv:2110.12478  [pdf, other]

    cs.CV cs.IR

    Deep Asymmetric Hashing with Dual Semantic Regression and Class Structure Quantization

    Authors: Jianglin Lu, Hailing Wang, Jie Zhou, Mengfan Yan, Jiajun Wen

    Abstract: Recently, deep hashing methods have been widely used in image retrieval task. Most existing deep hashing approaches adopt one-to-one quantization to reduce information loss. However, such class-unrelated quantization cannot give discriminative feedback for network training. In addition, these methods only utilize single label to integrate supervision information of data for hashing function learni…

    Submitted 23 December, 2021; v1 submitted 24 October, 2021; originally announced October 2021.

  21. arXiv:2110.11495  [pdf, other]

    hep-ph

    CTEQ-TEA group updates: Photon PDF and Impact from heavy flavors in the CT18 global analysis

    Authors: Marco Guzzi, Keping Xie, Tie-Jiun Hou, Pavel Nadolsky, Carl Schmidt, Mengshi Yan, C. -P. Yuan

    Abstract: We discuss recent CTEQ-TEA group activities after the publication of the CT18 global analysis of parton distribution functions (PDFs) in the proton. In particular, we discuss a new calculation for the photon content in the proton, termed as CT18lux and CT18qed PDFs, and the impact of novel charm- and bottom-quark production cross section measurements at HERA on the CT18 global analysis.

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 6 pages, 4 figures, EPS-HEP2021 Conference Proceedings. Contribution to the European Physical Society Conference on High Energy Physics 2021

  22. Nulling and subpulse drifting in PSR J1727-2739

    Authors: Rukiye Rejep, N. Wang, W. M. Yan, Z. G. Wen

    Abstract: In this paper, we investigate the emission properties of PSR J1727-2739, whose mean pulse profile has two main components, by analysing five single-pulse observations made using the Parkes 64-m radio telescope with a central frequency of 1369 MHz between 2014 April and October. The total observation time is about 6.1 hours which contains 16718 pulses after removal of radio frequency interference (…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 10 pages, 15 figures

  23. arXiv:2110.07058  [pdf, other]

    cs.CV cs.AI

    Ego4D: Around the World in 3,000 Hours of Egocentric Video

    Authors: Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, et al. (60 additional authors not shown)

    Abstract: We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with cons…

    Submitted 11 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: To appear in the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. This version updates the baseline result numbers for the Hands and Objects benchmark (appendix)

  24. arXiv:2110.05282  [pdf, ps, other]

    math.OC

    Optimal Gradient Tracking for Decentralized Optimization

    Authors: Zhuoqing Song, Lei Shi, Shi Pu, Ming Yan

    Abstract: In this paper, we focus on solving the decentralized optimization problem of minimizing the sum of $n$ objective functions over a multi-agent network. The agents are embedded in an undirected graph where they can only send/receive information directly to/from their immediate neighbors. Assuming smooth and strongly convex objective functions, we propose an Optimal Gradient Tracking (OGT) method tha…

    Submitted 20 April, 2024; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Mathematical Programming, in press

  25. arXiv:2110.03086  [pdf, other]

    physics.flu-dyn physics.geo-ph

    Strong large scale magnetic fields in rotating convection-driven dynamos: the important role of magnetic diffusion

    Authors: Ming Yan, Michael A. Calkins

    Abstract: Natural dynamos such as planets and stars generate global scale magnetic field despite the inferred presence of small scale turbulence. Such systems are known as large scale dynamos and are typically driven by convection and influenced by rotation. Previous numerical studies of rotating dynamos generally find that the large scale magnetic field becomes weaker as the flow becomes more turbulent. Th…

    Submitted 9 February, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: 6 pages, 4 figures

  26. arXiv:2109.00879  [pdf]

    physics.optics

    Observation of Square-Root Higher-Order Topological States in Photonic Waveguide Arrays

    Authors: Juan Kang, Tao Liu, Mou Yan, Dandan Yang, Xiongjian Huang, Ruishan Wei, Jianrong Qiu, Guoping Dong, Zhongmin Yang, Franco Nori

    Abstract: Recently, high-order topological insulators (HOTIs), accompanied by topologically nontrivial boundary states with codimension larger than one, have been extensively explored because of unconventional bulk-boundary correspondences. As a novel type of HOTIs, very recent works have explored the square-root HOTIs, where the topological nontrivial nature of bulk bands stems from the square of the Hamil…

    Submitted 2 September, 2021; originally announced September 2021.

  27. arXiv:2108.11571  [pdf, other]

    cs.LG

    GNNSampler: Bridging the Gap between Sampling Algorithms of GNN and Hardware

    Authors: Xin Liu, Mingyu Yan, Shuhan Song, Zhengyang Lv, Wenming Li, Guangyu Sun, Xiaochun Ye, Dongrui Fan

    Abstract: Sampling is a critical operation in Graph Neural Network (GNN) training that helps reduce the cost. Previous literature has explored improving sampling algorithms via mathematical and statistical methods. However, there is a gap between sampling algorithms and hardware. Without consideration of hardware, algorithm designers merely optimize sampling at the algorithm level, missing the great potenti…

    Submitted 24 June, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted by ECML-PKDD 2022

  28. arXiv:2108.09479  [pdf, other]

    cs.MM cs.CL cs.CV

    Grid-VLP: Revisiting Grid Features for Vision-Language Pre-training

    Authors: Ming Yan, Haiyang Xu, Chenliang Li, Bin Bi, Junfeng Tian, Min Gui, Wei Wang

    Abstract: Existing approaches to vision-language pre-training (VLP) heavily rely on an object detector based on bounding boxes (regions), where salient objects are first detected from images and then a Transformer-based model is used for cross-modal fusion. Despite their superior performance, these approaches are bounded by the capability of the object detector in terms of both effectiveness and efficiency.…

    Submitted 21 August, 2021; originally announced August 2021.

  29. arXiv:2108.06768  [pdf, other]

    hep-ph

    Connected and Disconnected Sea Partons from CT18 Parametrization of PDFs

    Authors: Tie-Jiun Hou, Jian Liang, Keh-Fei Liu, Mengshi Yan, C. -P. Yuan

    Abstract: The separation of the connected and disconnected sea partons, which were uncovered in the Euclidean path-integral formulation of the hadronic tensor, is accommodated with the CT18 parametrization of the global analysis of the parton distribution functions (PDFs). This is achieved with the help of the distinct small $x$ behaviors of these two sea parton components and the constraint from the lattic…

    Submitted 15 August, 2021; originally announced August 2021.

    Report number: MSUHEP-21-017

  30. arXiv:2108.06596  [pdf, other]

    hep-ph

    NNLO constraints on proton PDFs from the SeaQuest and STAR experiments and other developments in the CTEQ-TEA global analysis

    Authors: Marco Guzzi, T. J. Hobbs, Tie-Jiun Hou, Xiaoxian Jing, Keping Xie, Aurore Courtoy, Sayipjamal Dulat, Jun Gao, Joey Huston, Pavel M. Nadolsky, Carl Schmidt, Ibrahim Sitiwaldi, Mengshi Yan, C. -P. Yuan

    Abstract: We review progress in the global QCD analysis by the CTEQ-TEA group since the publication of CT18 parton distribution functions (PDFs) in the proton. Specifically, we discuss comparisons of CT18 NNLO predictions with the LHC 13 TeV measurements as well as with the FNAL SeaQuest and BNL STAR data on lepton pair production. The specialized CT18X PDFs approximating saturation effects are compared wit…

    Submitted 11 February, 2022; v1 submitted 14 August, 2021; originally announced August 2021.

    Comments: 16 pages, 7 figures

    Report number: FERMILAB-CONF-21-361-QIS-SCD-T, MSUHEP-21-023, PITT-PACC-2117, SMU-HEP-21-09

  31. arXiv:2108.05306  [pdf, ps, other]

    hep-ph hep-ex nucl-th

    Interpretations of the new LHCb $P_c(4337)^+$ pentaquark state

    Authors: Mao-Jun Yan, Fang-Zheng Peng, Mario Sánchez Sánchez, Manuel Pavon Valderrama

    Abstract: Recently the LHCb collaboration has observed a new pentaquark state, the $P_c(4337)^+$. Owing to its proximity to the $χ_{c0}(1S) p$, $\bar{D}^* Λ_c$, $\bar{D} Σ_c$ and $\bar{D} Σ_c^*$ thresholds, this new pentaquark might very well be a meson-baryon bound state. However its spin and parity have not been determined yet and none of the previous possibilities can be ruled out. We briefly explore a f…

    Submitted 4 July, 2022; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: 19 pages, 5 tables, 1 figure, corresponds with published version

    Journal ref: EPJC 82, 574 (2022)

  32. arXiv:2108.04785  [pdf, ps, other]

    hep-ph hep-ex nucl-th

    Subleading contributions to the decay width of the $T_{cc}^+$ tetraquark

    Authors: Mao-Jun Yan, Manuel Pavon Valderrama

    Abstract: Recently the LHCb collaboration has announced the discovery of the $T_{cc}^+$ tetraquark. Being merely a few hundred ${\rm keV}$ below the $D^{*+} D^0$ threshold, the $T_{cc}^+$ is expected to have a molecular component, for which there is a good separation of scales that can be exploited to make reasonably accurate theoretical predictions about this tetraquark. Independently of its nature, the mo…

    Submitted 8 January, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: 13 pages, 1 figure; corresponds with the published version

    Journal ref: Phys. Rev. D 105, 014007 (2022)

  33. arXiv:2108.04448  [pdf, other]

    cs.LG cs.DC math.OC

    Decentralized Composite Optimization with Compression

    Authors: Yao Li, Xiaorui Liu, Jiliang Tang, Ming Yan, Kun Yuan

    Abstract: Decentralized optimization and communication compression have exhibited their great potential in accelerating distributed machine learning by mitigating the communication bottleneck in practice. While existing decentralized algorithms with communication compression mostly focus on the problems with only smooth components, we study the decentralized stochastic composite optimization problem with a…

    Submitted 12 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

  34. 3D hinge transport in acoustic higher-order topological insulators

    Authors: Qiang Wei, Xuewei Zhang, Weiyin Deng, Jiuyang Lu, Xueqin Huang, Mou Yan, Gang Chen, Zhengyou Liu, Suotang Jia

    Abstract: The discovery of topologically protected boundary states in topological insulators opens a new avenue toward exploring novel transport phenomena. The one-way feature of boundary states against disorders and impurities prospects great potential in applications of electronic and classical wave devices. Particularly, for the 3D higher-order topological insulators, it can host hinge states, which allo…

    Submitted 4 August, 2021; originally announced August 2021.

  35. arXiv:2108.02102  [pdf, other]

    cs.DC

    ErrorCompensatedX: error compensation for variance reduced algorithms

    Authors: Hanlin Tang, Yao Li, Ji Liu, Ming Yan

    Abstract: Communication cost is one major bottleneck for the scalability for distributed learning. One approach to reduce the communication cost is to compress the gradient during communication. However, directly compressing the gradient decelerates the convergence speed, and the resulting algorithm may diverge for biased compression. Recent work addressed this problem for stochastic gradient descent by add…

    Submitted 4 August, 2021; originally announced August 2021.

  36. arXiv:2107.13580  [pdf, other]

    hep-ph hep-ex nucl-th

    The photon content of the proton in the CT18 global analysis

    Authors: Keping Xie, T. J. Hobbs, Tie-Jiun Hou, Carl Schmidt, Mengshi Yan, C. -P. Yuan

    Abstract: Recently, two photon PDF sets based on implementations of the LUX ansatz into the CT18 global analysis were released. In CT18lux, the photon PDF is calculated directly using the LUX master formula for all scales, $μ$. In an alternative realization, CT18qed, the photon PDF is initialized at the starting scale, $μ_0$, using the LUX formulation and evolved to higher scales $μ(>μ_0)$ with a combined Q…

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: Submission to SciPost

    Report number: MSUHEP-21-015, PITT-PACC-2116, SMU-HEP-21-11

  37. arXiv:2107.12065  [pdf, other]

    math.OC cs.DC cs.LG eess.SP eess.SY

    Provably Accelerated Decentralized Gradient Method Over Unbalanced Directed Graphs

    Authors: Zhuoqing Song, Lei Shi, Shi Pu, Ming Yan

    Abstract: We consider the decentralized optimization problem, where a network of $n$ agents aims to collaboratively minimize the average of their individual smooth and convex objective functions through peer-to-peer communication in a directed graph. To tackle this problem, we propose two accelerated gradient tracking methods, namely APD and APD-SC, for non-strongly convex and strongly convex objective func…

    Submitted 6 December, 2023; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: SIAM Journal on Optimization, in press

  38. arXiv:2107.06996  [pdf, other]

    cs.LG cs.AI cs.NE

    Elastic Graph Neural Networks

    Authors: Xiaorui Liu, Wei Jin, Yao Ma, Yaxin Li, Hua Liu, Yiqi Wang, Ming Yan, Jiliang Tang

    Abstract: While many existing graph neural networks (GNNs) have been proven to perform $\ell_2$-based graph smoothing that enforces smoothness globally, in this work we aim to further enhance the local smoothness adaptivity of GNNs via $\ell_1$-based graph smoothing. As a result, we introduce a family of GNNs (Elastic GNNs) based on $\ell_1$ and $\ell_2$-based graph smoothing. In particular, we propose a no…

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: ICML 2021 (International Conference on Machine Learning)

  39. arXiv:2106.13477  [pdf, ps, other]

    math.OC math.NA

    Hessian informed mirror descent

    Authors: Li Wang, Ming Yan

    Abstract: Inspired by the recent paper (L. Ying, Mirror descent algorithms for minimizing interacting free energy, Journal of Scientific Computing, 84 (2020), pp. 1-14), we explore the relationship between the mirror descent and the variable metric method. When the metric in the mirror descent is induced by a convex function, whose Hessian is close to the Hessian of the objective function, this method enjoys…

    Submitted 25 June, 2021; originally announced June 2021.

  40. arXiv:2106.10299  [pdf, other]

    hep-ph hep-ex nucl-th

    The photon PDF within the CT18 global analysis

    Authors: Keping Xie, T. J. Hobbs, Tie-Jiun Hou, Carl Schmidt, Mengshi Yan, C. -P. Yuan

    Abstract: Building upon the most recent CT18 global fit, we present a new calculation of the photon content of the proton based on an application of the LUX formalism. In this work, we explore two principal variations of the LUX ansatz. In one approach, which we designate "CT18lux," the photon PDF is calculated directly using the LUX formula for all scales, $μ$. In an alternative realization, "CT18qed," we…

    Submitted 12 February, 2023; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 45 pages, 27 figures, and 5 tables

    Report number: MSUHEP-21-013, PITT-PACC-2112, SMU-HEP-21-06, FERMILAB-PUB-21-370-QIS-SCD-T

  41. arXiv:2106.08235  [pdf, other]

    cs.LG cs.CL

    PairConnect: A Compute-Efficient MLP Alternative to Attention

    Authors: Zhaozhuo Xu, Minghao Yan, Junyan Zhang, Anshumali Shrivastava

    Abstract: Transformer models have demonstrated superior performance in natural language processing. The dot product self-attention in Transformer allows us to model interactions between words. However, this modeling comes with significant computational overhead. In this work, we revisit the memory-compute trade-off associated with Transformer, particularly multi-head attention, and show a memory-heavy but s…

    Submitted 15 June, 2021; originally announced June 2021.

  42. arXiv:2106.07243  [pdf, ps, other]

    math.OC cs.DC cs.LG cs.MA eess.SP

    Compressed Gradient Tracking for Decentralized Optimization Over General Directed Networks

    Authors: Zhuoqing Song, Lei Shi, Shi Pu, Ming Yan

    Abstract: In this paper, we propose two communication efficient decentralized optimization algorithms over a general directed multi-agent network. The first algorithm, termed Compressed Push-Pull (CPP), combines the gradient tracking Push-Pull method with communication compression. We show that CPP is applicable to a general class of unbiased compression operators and achieves linear convergence rate for st…

    Submitted 9 April, 2024; v1 submitted 14 June, 2021; originally announced June 2021.

    Journal ref: IEEE Transactions on Signal Processing, 70(2022), 1775-1787

  43. arXiv:2106.01804  [pdf, other]

    cs.CV cs.AI cs.CL

    E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

    Authors: Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao, Fei Huang

    Abstract: Vision-language pre-training (VLP) on large-scale image-text pairs has achieved huge success for the cross-modal downstream tasks. Most existing pre-training methods adopt a two-step training procedure, which first employs a pre-trained object detector to extract region-based visual features, then concatenates the image representation and text embedding as the input of Transformer to…

    Submitted 4 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: ACL2021 main conference

  44. arXiv:2105.11210  [pdf, other]

    cs.CL

    StructuralLM: Structural Pre-training for Form Understanding

    Authors: Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si

    Abstract: Large pre-trained language models achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, they almost exclusively focus on text-only representation, while neglecting cell-level layout information that is important for form image understanding. In this paper, we propose a new pre-training approach, StructuralLM, to jointly leverage cell and layout information from scanned…

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: Accepted by ACL2021 main conference

  45. arXiv:2104.01768  [pdf, other]

    cs.SE

    Predicting Crash Fault Residence via Simplified Deep Forest Based on A Reduced Feature Set

    Authors: Kunsong Zhao, Jin Liu, Zhou Xu, Li Li, Meng Yan, Jiaojiao Yu, Yuxuan Zhou

    Abstract: Software inevitably crashes, and developers then spend a large amount of effort finding the fault that caused the crash (the crashing fault, for short). Developing automatic methods to identify the residence of the crashing fault is a crucial activity for software quality assurance. Researchers have proposed methods to predict whether the crashing fault resides in the stack trace based on…

    Submitted 5 April, 2021; originally announced April 2021.

  46. arXiv:2104.01032  [pdf, other]

    cs.SE cs.AI cs.CV

    Plot2API: Recommending Graphic API from Plot via Semantic Parsing Guided Neural Network

    Authors: Zeyu Wang, Sheng Huang, Zhongxin Liu, Meng Yan, Xin Xia, Bei Wang, Dan Yang

    Abstract: Plot-based Graphic API recommendation (Plot2API) is an unstudied but meaningful issue, which has several important applications in the context of software engineering and data visualization, such as plotting guidance for beginners, graphic API correlation analysis, and code conversion for plotting. Plot2API is a very challenging task, since each plot is often associated with multiple APIs an…

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted by SANER2021

  47. arXiv:2103.14493  [pdf, other]

    cs.LG cs.NE

    RCT: Resource Constrained Training for Edge AI

    Authors: Tian Huang, Tao Luo, Ming Yan, Joey Tianyi Zhou, Rick Goh

    Abstract: Neural network training on edge terminals is essential for edge AI computing, which needs to adapt to evolving environments. Quantised models can efficiently run on edge devices, but existing training methods for these compact models are designed to run on powerful servers with abundant memory and energy budget. For example, quantisation-aware training (QAT) method involves two copies of mod…

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 14 pages

    MSC Class: 68T07 (Primary) 68T05 (Secondary); ACM Class: I.5.1; I.2.6

  48. arXiv:2103.12393  [pdf, other]

    cs.AR

    RISC-NN: Use RISC, NOT CISC as Neural Network Hardware Infrastructure

    Authors: Taoran Xiang, Lunkai Zhang, Shuqian An, Xiaochun Ye, Mingzhe Zhang, Yanhuan Liu, Mingyu Yan, Da Wang, Hao Zhang, Wenming Li, Ninghui Sun, Dongrui Fan

    Abstract: Neural Networks (NN) have been proven to be powerful tools to analyze Big Data. However, traditional CPUs cannot achieve the desired performance and/or energy efficiency for NN applications. Therefore, numerous NN accelerators have been used or designed to meet these goals. These accelerators all fall into three categories: GPGPUs, ASIC NN Accelerators and CISC NN Accelerators. Though CISC NN Acce…

    Submitted 23 March, 2021; originally announced March 2021.

  49. Determining the helicity structure of the nucleon at the Electron Ion Collider in China

    Authors: Daniele Paolo Anderle, Tie-Jiun Hou, Hongxi Xing, Mengshi Yan, C. -P. Yuan, Yuxiang Zhao

    Abstract: Understanding how sea quarks behave inside a nucleon is one of the most important physics goals of the proposed Electron-Ion Collider in China (EicC), which is designed to have a 3.5 GeV polarized electron beam (80% polarization) colliding with a 20 GeV polarized proton beam (70% polarization) at an instantaneous luminosity of $2 \times 10^{33} {\rm cm}^{-2} {\rm s}^{-1}$. A specific topic at EicC is to…

    Submitted 13 July, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: 40 pages, 12 figures

    Report number: JHEP08(2021)034

    Journal ref: https://doi.org/10.1007/JHEP08(2021)034

  50. arXiv:2103.07829  [pdf, other]

    cs.CL

    SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels

    Authors: Chenliang Li, Ming Yan, Haiyang Xu, Fuli Luo, Wei Wang, Bin Bi, Songfang Huang

    Abstract: Vision-language pre-training (VLP) on large-scale image-text pairs has recently witnessed rapid progress for learning cross-modal representations. Existing pre-training methods either directly concatenate image representation and text representation at a feature level as input to a single-stream Transformer, or use a two-stream cross-modal Transformer to align the image-text representation at a hi…

    Submitted 13 March, 2021; originally announced March 2021.

    Comments: 10 pages, 4 figures