Zum Hauptinhalt springen

Showing 51–100 of 330 results for author: Yao, C

.
  1. arXiv:2311.17215  [pdf, other

    math.NT math.NA

    Applications of Moments of Dirichlet Coefficients in Elliptic Curve Families

    Authors: Zoë Batterman, Aditya Jambhale, Steven J. Miller, Akash L. Narayanan, Kishan Sharma, Andrew Yang, Chris Yao

    Abstract: The moments of the coefficients of elliptic curve L-functions are related to numerous arithmetic problems. Rosen and Silverman proved a conjecture of Nagao relating the first moment of one-parameter families satisfying Tate's conjecture to the rank of the corresponding elliptic surface over Q(T); one can also construct families of moderate rank by finding families with large first moments. Michel… ▽ More

    Submitted 17 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    MSC Class: 11G05; 11G40

  2. arXiv:2311.14956  [pdf, other

    physics.plasm-ph

    Anomalous hot electron generation from two-plasmon decay instability driven by broadband laser pulses with intensity modulations

    Authors: C. Yao, J. Li, L. Hao, R. Yan, C. Wang, A. Lei, Y-K. Ding, J. Zheng

    Abstract: We investigate the hot electrons generated from two-plasmon decay (TPD) instability driven by laser pulses with intensity modulated by a frequency $Δω_m$. Our primary focus lies on scenarios where $Δω_m$ is on the same order of the TPD growth rate $ γ_0$ ( $Δω_m \sim γ_0$), corresponding to moderate laser frequency bandwidths for TPD mitigation. With $Δω_m$ conveniently modeled by a basic two-colo… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  3. arXiv:2311.11482  [pdf, other

    cs.AI cs.CL

    Meta Prompting for AI Systems

    Authors: Yifan Zhang, Yang Yuan, Andrew Chi-Chih Yao

    Abstract: In this work, we present a comprehensive study of Meta Prompting (MP), an innovative technique reshaping the utilization of language models (LMs) and AI systems in problem-solving and data interaction. Grounded in type theory and category theory, Meta Prompting emphasizes the structure and syntax of information over traditional content-centric methods. The paper explores the formal definitions of… ▽ More

    Submitted 15 June, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  4. arXiv:2310.18090  [pdf, ps, other

    eess.SP

    Probabilistic Constellation Shaping for OFDM-Based ISAC Signaling

    Authors: Zhen Du, Fan Liu, Yifeng Xiong, Tony Xiao Han, Weijie Yuan, Yuanhao Cui, Changhua Yao, Yonina C. Eldar

    Abstract: Integrated Sensing and Communications (ISAC) has garnered significant attention as a promising technology for the upcoming sixth-generation wireless communication systems (6G). In pursuit of this goal, a common strategy is that a unified waveform, such as Orthogonal Frequency Division Multiplexing (OFDM), should serve dual-functional roles by enabling simultaneous sensing and communications (S&C)… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  5. arXiv:2310.16070  [pdf, other

    cs.LG

    Spatial-Temporal Hypergraph Neural Network for Traffic Forecasting

    Authors: Chengzhi Yao, Zhi Li, Junbo Wang

    Abstract: Traffic forecasting, which benefits from mobile Internet development and position technologies, plays a critical role in Intelligent Transportation Systems. It helps to implement rich and varied transportation applications and bring convenient transportation services to people based on collected traffic data. Most existing methods usually leverage graph-based deep learning networks to model the co… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  6. arXiv:2310.12430  [pdf, other

    cs.CV cs.CL

    DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond

    Authors: Cong Yao

    Abstract: In this report, we introduce DocXChain, a powerful open-source toolchain for document parsing, which is designed and developed to automatically convert the rich information embodied in unstructured documents, such as text, tables and charts, into structured representations that are readable and manipulable by machines. Specifically, basic capabilities, including text detection, text recognition, t… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 4 pages, 4 figures, 2 tables

  7. arXiv:2310.10362  [pdf, other

    cs.LG cs.AI

    Self-Pro: A Self-Prompt and Tuning Framework for Graph Neural Networks

    Authors: Chenghua Gong, Xiang Li, Jianxiang Yu, Cheng Yao, Jiaqi Tan, Chengcheng Yu

    Abstract: Graphs have become an important modeling tool for web applications, and Graph Neural Networks (GNNs) have achieved great success in graph representation learning. However, the performance of traditional GNNs heavily relies on a large amount of supervision. Recently, ``pre-train, fine-tune'' has become the paradigm to address the issues of label dependency and poor generalization. However, the pre-… ▽ More

    Submitted 4 June, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at ECML-PKDD 2024

  8. arXiv:2310.08064  [pdf

    cs.CV

    Age Estimation Based on Graph Convolutional Networks and Multi-head Attention Mechanisms

    Authors: Miaomiao Yang, Changwei Yao, Shijin Yan

    Abstract: Age estimation technology is a part of facial recognition and has been applied to identity authentication. This technology achieves the development and application of a juvenile anti-addiction system by authenticating users in the game. Convolutional Neural Network (CNN) and Transformer algorithms are widely used in this application scenario. However, these two models cannot flexibly extract and m… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  9. arXiv:2310.04975  [pdf, ps, other

    cs.CR cs.DC

    A Trustworthy and Consistent Blockchain Oracle Scheme for Industrial Internet of Things

    Authors: Peng Liu, Youquan Xian, Chuanjian Yao, Peng Wang, Li-e Wang, Xianxian Li

    Abstract: Blockchain provides decentralization and trustlessness features for the Industrial Internet of Things (IIoT), which expands the application scenarios of IIoT. To address the problem that the blockchain cannot actively obtain off-chain data, the blockchain oracle is proposed as a bridge between the blockchain and external data. However, the existing oracle schemes are difficult to solve the problem… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: Rejected after the third round of review of IEEE Internet of Things Journal

  10. arXiv:2310.00890  [pdf

    cond-mat.mtrl-sci

    Femtosecond electron diffraction reveals local disorder and local anharmonicity in thermoelectric SnSe

    Authors: Jingjun Li, Yingpeng Qi, Qing Yang, Luye Yue, Changyuan Yao, Zijing Chen, Sheng Meng, Dao Xiang, Jianming Cao

    Abstract: The microscopic arrangement of atoms and molecules is the determining factor in how materials behave and perform. Beyond the long-range periodicity, the local disorder with local structures deviating from the average lattice structure plays a vital role in determining the physical properties of the phonon, electron and spin subsystems in crystalline functional materials. Experimentally characteriz… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Report number: 2313742

    Journal ref: Adv. Mater. 2313742 (2024)

  11. IBVC: Interpolation-driven B-frame Video Compression

    Authors: Chenming Xu, Meiqin Liu, Chao Yao, Weisi Lin, Yao Zhao

    Abstract: Learned B-frame video compression aims to adopt bi-directional motion estimation and motion compensation (MEMC) coding for middle frame reconstruction. However, previous learned approaches often directly extend neural P-frame codecs to B-frame relying on bi-directional optical-flow estimation or video frame interpolation. They suffer from inaccurate quantized motions and inefficient motion compens… ▽ More

    Submitted 14 March, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Submitted to Pattern Recognition

  12. arXiv:2309.13596  [pdf, other

    cs.CV

    Advancements in 3D Lane Detection Using LiDAR Point Clouds: From Data Collection to Model Development

    Authors: Runkai Zhao, Yuwen Heng, Heng Wang, Yuanda Gao, Shilei Liu, Changhao Yao, Jiawen Chen, Weidong Cai

    Abstract: Advanced Driver-Assistance Systems (ADAS) have successfully integrated learning-based techniques into vehicle perception and decision-making. However, their application in 3D lane detection for effective driving environment perception is hindered by the lack of comprehensive LiDAR datasets. The sparse nature of LiDAR point cloud data prevents an efficient manual annotation process. To solve this p… ▽ More

    Submitted 15 March, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted by ICRA2024

  13. arXiv:2309.12748  [pdf, other

    math.CO math.NT

    The Reversed Zeckendorf Game

    Authors: Zoë X. Batterman, Aditya Jambhale, Steven J. Miller, Akash L. Narayanan, Kishan Sharma, Andrew K. Yang, Chris Yao

    Abstract: Zeckendorf proved that every natural number $n$ can be expressed uniquely as a sum of non-consecutive Fibonacci numbers, called its Zeckendorf decomposition. Baird-Smith, Epstein, Flint, and Miller created the Zeckendorf game, a two-player game played on partitions of $n$ into Fibonacci numbers which always terminates at a Zeckendorf decomposition, and proved that Player 2 has a winning strategy f… ▽ More

    Submitted 4 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 25 Pages, 4 figures

    MSC Class: 11B39 (Primary); 05C57; 65Q30; 91A05; 91A46 (Secondary)

  14. arXiv:2308.14978  [pdf, other

    cs.CV

    Vision Grid Transformer for Document Layout Analysis

    Authors: Cheng Da, Chuwei Luo, Qi Zheng, Cong Yao

    Abstract: Document pre-trained models and grid-based models have proven to be very effective on various tasks in Document AI. However, for the document layout analysis (DLA) task, existing document pre-trained models, even those pre-trained in a multi-modal fashion, usually rely on either textual features or visual features. Grid-based models for DLA are multi-modality but largely neglect the effect of pre-… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  15. arXiv:2308.12774  [pdf, other

    cs.CV

    LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

    Authors: Changxu Cheng, Peng Wang, Cheng Da, Qi Zheng, Cong Yao

    Abstract: The diversity in length constitutes a significant characteristic of text. Due to the long-tail distribution of text lengths, most existing methods for scene text recognition (STR) only work well on short or seen-length text, lacking the capability of recognizing longer text or performing length extrapolation. This is a crucial issue, since the lengths of the text to be recognized are usually not g… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  16. arXiv:2308.09365  [pdf, ps, other

    math.DG

    The dissolving limit and large volume limit of Einstein-Bogomol'nyi metrics

    Authors: Chengjian Yao

    Abstract: We study the limits of Einstein-Bogomol'nyi metrics on $\mathbf{P}^1$, which is the solution to a dimensional reduction of Einstein-Maxwell-Higgs system in dimension four, in two regimes. In one regime called the "dissolving limit" where the volume of the metrics is approaching the admissible lower bound, it exhibits a pattern that all the vortices are dissolving similar to the Bradlow limit in th… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 26 pages, 2 figures

    MSC Class: 53C07; 53C25 (Primary) 83C22; 83C50 (Secondary)

  17. arXiv:2308.04371  [pdf, other

    cs.AI

    Cumulative Reasoning with Large Language Models

    Authors: Yifan Zhang, Jingqin Yang, Yang Yuan, Andrew Chi-Chih Yao

    Abstract: Despite the recent advancements in language models (LMs), their ability to solve complex problems remains limited. This paper introduces Cumulative Reasoning (CR), a novel approach that utilizes LMs cumulatively and iteratively, mirroring human thought processes for problem-solving. CR decomposes tasks into smaller, manageable components and leverages previous propositions for effective compositio… ▽ More

    Submitted 1 April, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

  18. arXiv:2307.15272  [pdf

    eess.SP

    Direct Power Flow Controller with Continuous Full Regulation Range

    Authors: Chong Yao, Youjun Zhang

    Abstract: For enhancing power flow control in power transmission, a simplified new structure of direct power flow controller with continuous full regulation range (F-DPFC) was proposed. It has only one-stage power conversion and comprises of a three-phase transformer in parallel and a three-phase trans-former in series with grid, three single-phase full-bridge ac units, and a three-phase filter. Compared wi… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 9 pages,20 figures

  19. Influence of cation vacancy concentrations on ultra-low thermal conductivity in $(1-x)$BiVO$_4$-$x$Bi$_{2/3}$MoO$_4$ scheelite solid solutions

    Authors: Guillaume F. Nataf, Hicham Ait Laasri, Damien Brault, Tatiana Chartier, Chalit Ya, Fabian Delorme, Isabelle Monot-Laffez, Fabien Giovannelli

    Abstract: Bismuth vanadate - bismuth molybdate solid-solution was prepared to elaborate ceramics with different amounts of cation vacancies. Dense ceramics with similar microstructures were obtained and the evolution of their melting point, specific heat, thermal diffusivity, and conductivity as a function of the amount of vacancy was evaluated. At room temperature, the thermal conductivity decreases from 1… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 17 pages, 5 figures

    Journal ref: Open Ceramics 15, 100406 (2023)

  20. arXiv:2307.13244  [pdf, other

    cs.CV

    Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition

    Authors: Cheng Da, Peng Wang, Cong Yao

    Abstract: Due to the enormous technical challenges and wide range of applications, scene text recognition (STR) has been an active research topic in computer vision for years. To tackle this tough problem, numerous innovative methods have been successively proposed, and incorporating linguistic knowledge into STR models has recently become a prominent trend. In this work, we first draw inspiration from the… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: submitted to TPAMI; an extension to our previous ECCV 2022 paper arXiv:2209.03592

  21. arXiv:2307.08563  [pdf, other

    hep-ph

    Hilbert series for ALP EFTs

    Authors: Christophe Grojean, Jonathan Kley, Chang-Yuan Yao

    Abstract: Axions and axion-like particles (ALPs) are ubiquitous in popular attempts to solve supercalifragilisticexpialidocious puzzles of Nature. A widespread and vivid experimental programme spanning a vast range of mass scales and decades of couplings strives to find evidence for these elusive but theoretically well-motivated particles. In the absence of clear guiding principle, effective field theories… ▽ More

    Submitted 30 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 33 pages + appendices, 2 figures, 13 tables, added discussion about CP, updated the ancillary file

    Report number: DESY-23-098, HU-EP-23/39

  22. arXiv:2307.04420  [pdf, ps, other

    cs.DC cs.AI

    FedDCT: A Dynamic Cross-Tier Federated Learning Scheme in Wireless Communication Networks

    Authors: Peng Liu, Youquan Xian, Chuanjian Yao, Xiaoyun Gan, Lianghaojie Zhou, Jianyong Jiang, Dongcheng Li

    Abstract: With the rapid proliferation of Internet of Things (IoT) devices and the growing concern for data privacy among the public, Federated Learning (FL) has gained significant attention as a privacy-preserving machine learning paradigm. FL enables the training of a global model among clients without exposing local data. However, when a federated learning system runs on wireless communication networks,… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  23. arXiv:2307.02828  [pdf, other

    cs.CV cs.CR cs.LG

    Sampling-based Fast Gradient Rescaling Method for Highly Transferable Adversarial Attacks

    Authors: Xu Han, Anmin Liu, Chenxuan Yao, Yanbo Fan, Kun He

    Abstract: Deep neural networks are known to be vulnerable to adversarial examples crafted by adding human-imperceptible perturbations to the benign input. After achieving nearly 100% attack success rates in white-box setting, more focus is shifted to black-box attacks, of which the transferability of adversarial examples has gained significant attention. In either case, the common gradient-based methods gen… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 10 pages, 6 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2204.02887

  24. arXiv:2306.17368  [pdf, other

    hep-ph

    Searching for heavy neutral lepton and lepton number violation through VBS at high-energy muon colliders

    Authors: Tong Li, Chang-Yuan Yao, Man Yuan

    Abstract: High-energy muon collider can play as an emitter of electroweak gauge bosons and thus leads to substantial vector boson scattering (VBS) processes. In this work, we investigate the production of heavy neutral lepton (HNL) $N$ and lepton number violation (LNV) signature through VBS at high-energy muon colliders. VBS induces LNV processes… ▽ More

    Submitted 3 September, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 24 pages, 8 figures, 2 tables. Accepted for publication in JHEP

    Report number: DESY-23-092

  25. arXiv:2306.12052  [pdf, ps, other

    math.RA math.CV math.GR

    Some invariants of $U(1,1;\mathbb{H})$ and diagonalization

    Authors: Cailing Yao, Bingzhe Hou, Xiaoqi Feng

    Abstract: Denote by $\mathbb{H}$ the set of all quaternions. We are interested in the group $U(1,1;\mathbb{H})$, which is a subgroup of $2\times 2$ quaternionic matrix group and is sometimes called $Sp(1,1)$. As well known, $U(1,1;\mathbb{H})$ corresponds to the quaternionic Möbius transformations on the unit ball in $\mathbb{H}$. In this article, some similar invariants on $U(1,1;\mathbb{H})$ are discussed… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 17 pages

    MSC Class: 15A20; 15B33; 30G35 (Primary); 15A18; 16R30 (Secondary)

  26. arXiv:2306.10804  [pdf, other

    cs.CV

    Conditional Text Image Generation with Diffusion Models

    Authors: Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao

    Abstract: Current text recognition systems, including those for handwritten scripts and scene text, have relied heavily on image synthesis and augmentation, since it is difficult to realize real-world complexity and diversity through collecting and annotating enough real text images. In this paper, we explore the problem of text image generation, by taking advantage of the powerful abilities of Diffusion Mo… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  27. arXiv:2306.05729  [pdf

    physics.optics physics.app-ph

    Redesigning spectroscopic sensors with programmable photonic circuits

    Authors: Chunhui Yao, Kangning Xu, Wanlu Zhang, Minjia Chen, Qixiang Cheng, Richard Penty

    Abstract: Optical spectroscopic sensors are a powerful tool to reveal light-matter interactions in many fields, such as physics, biology, chemistry, and astronomy. Miniaturizing the currently bulky spectrometers has become imperative for the wide range of applications that demand in situ or even in vitro characterization systems, a field that is growing rapidly. Benchtop spectrometers are capable of offerin… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  28. arXiv:2306.04619  [pdf, other

    cs.CV

    ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections

    Authors: Chun-Han Yao, Amit Raj, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani

    Abstract: Estimating 3D articulated shapes like animal bodies from monocular images is inherently challenging due to the ambiguities of camera viewpoint, pose, texture, lighting, etc. We propose ARTIC3D, a self-supervised framework to reconstruct per-instance 3D shapes from a sparse image collection in-the-wild. Specifically, ARTIC3D is built upon a skeleton-based surface representation and is further guide… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Project page: https://chhankyao.github.io/artic3d/

  29. arXiv:2306.02001  [pdf, ps, other

    math.OC

    A globally convergent difference-of-convex algorithmic framework and application to log-determinant optimization problems

    Authors: Chaorui Yao, Xin Jiang

    Abstract: The difference-of-convex algorithm (DCA) is a conceptually simple method for the minimization of (possibly) nonconvex functions that are expressed as the difference of two convex functions. At each iteration, DCA constructs a global overestimator of the objective and solves the resulting convex subproblem. Despite its conceptual simplicity, the theoretical understanding and algorithmic framework o… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  30. arXiv:2305.18548  [pdf

    cs.ET physics.optics

    I/O-efficient iterative matrix inversion with photonic integrated circuits

    Authors: Minjia Chen, Yizhi Wang, Chunhui Yao, Adrian Wonfor, Shuai Yang, Richard Penty, Qixiang Cheng

    Abstract: Photonic integrated circuits have been extensively explored for optical processing with the aim of breaking the speed bottleneck of digital electronics. However, the input/output (IO) bottleneck remains one of the key barriers. Here we report a novel photonic iterative processor (PIP) for matrix-inversion-intensive applications. The direct reuse of inputted data in the optical domain unlocks the p… ▽ More

    Submitted 22 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  31. arXiv:2305.18442  [pdf, other

    cs.LG math.OC

    Improved Projection-free Online Continuous Submodular Maximization

    Authors: Yucheng Liao, Yuanyu Wan, Chang Yao, Mingli Song

    Abstract: We investigate the problem of online learning with monotone and continuous DR-submodular reward functions, which has received great attention recently. To efficiently handle this problem, especially in the case with complicated decision sets, previous studies have proposed an efficient projection-free algorithm called Mono-Frank-Wolfe (Mono-FW) using $O(T)$ gradient evaluations and linear optimiza… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  32. arXiv:2305.15940  [pdf, other

    cs.CV

    Mask Attack Detection Using Vascular-weighted Motion-robust rPPG Signals

    Authors: Chenglin Yao, Jianfeng Ren, Ruibin Bai, Heshan Du, Jiang Liu, Xudong Jiang

    Abstract: Detecting 3D mask attacks to a face recognition system is challenging. Although genuine faces and 3D face masks show significantly different remote photoplethysmography (rPPG) signals, rPPG-based face anti-spoofing methods often suffer from performance degradation due to unstable face alignment in the video sequence and weak rPPG signals. To enhance the rPPG signal in a motion-robust way, a landma… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  33. arXiv:2305.12131  [pdf, other

    cs.LG

    Non-stationary Online Convex Optimization with Arbitrary Delays

    Authors: Yuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang

    Abstract: Online convex optimization (OCO) with arbitrary delays, in which gradients or other information of functions could be arbitrarily delayed, has received increasing attention recently. Different from previous studies that focus on stationary environments, this paper investigates the delayed OCO in non-stationary environments, and aims to minimize the dynamic regret with respect to any sequence of co… ▽ More

    Submitted 23 June, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Camera-ready Version for ICML2024

  34. arXiv:2305.08325  [pdf, other

    cs.CV eess.IV

    Screentone-Aware Manga Super-Resolution Using DeepLearning

    Authors: Chih-Yuan Yao, Husan-Ting Chou, Yu-Sheng Lin, Kuo-wei Chen

    Abstract: Manga, as a widely beloved form of entertainment around the world, have shifted from paper to electronic screens with the proliferation of handheld devices. However, as the demand for image quality increases with screen development, high-quality images can hinder transmission and affect the viewing experience. Traditional vectorization methods require a significant amount of manual parameter adjus… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

  35. arXiv:2305.06737  [pdf, other

    cs.IT

    A Diagonal Splitting Algorithm for Adaptive Group Testing

    Authors: Chaorui Yao, Pavlos Nikolopoulos, Christina Fragouli

    Abstract: Group testing enables to identify infected individuals in a population using a smaller number of tests than individual testing. To achieve this, group testing algorithms commonly assume knowledge of the number of infected individuals; nonadaptive and several adaptive algorithms fall in this category. Some adaptive algorithms, like binary splitting, operate without this assumption, but require a nu… ▽ More

    Submitted 14 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  36. arXiv:2304.14761  [pdf, other

    math.AP math.CV

    Topological regularity for solutions to the generalised Hopf equation

    Authors: Gaven Martin, Cong Yao

    Abstract: The generalised Hopf equation is the first order nonlinear equation with data $Φ$ a holomorphic functions and $η\geq 1$ a positive weight, \[ h_w\,\overline{h_\wbar}\,η(w) = Φ.\] The Hopf equation is the special case $η(w)=\tildeη(h(w))$ and reflects that $h$ is harmonic with respect to the conformal metric $\sqrt{\tildeη(z)}|dz|$. This article obtains conditions on the data to ensure that a solut… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

  37. Quantum transport theory of hybrid superconducting systems

    Authors: Chuan-Zhe Yao, Hon-Lam Lai, Wei-Min Zhang

    Abstract: We present a quantum transport theory for hybrid superconducting systems based on our exact master equation approach. The total transient transport current is decomposed into components that describe coherent transports through different paths of particle and hole channels. We show that the coherent transports are resultant interferences of numerous repeated tunneling processes and cannot be rende… ▽ More

    Submitted 2 November, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

    Journal ref: Phys. Rev. B 108, 195402 (2023)

  38. arXiv:2304.10759  [pdf, other

    cs.CV cs.CL

    GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

    Authors: Chuwei Luo, Changxu Cheng, Qi Zheng, Cong Yao

    Abstract: Visual information extraction (VIE) plays an important role in Document Intelligence. Generally, it is divided into two tasks: semantic entity recognition (SER) and relation extraction (RE). Recently, pre-trained models for documents have achieved substantial progress in VIE, particularly in SER. However, most of the existing models learn the geometric representation in an implicit way, which has… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Highlight

  39. arXiv:2303.13095  [pdf, other

    cs.CV

    Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

    Authors: Zhibo Yang, Rujiao Long, Pengfei Wang, Sibo Song, Humen Zhong, Wenqing Cheng, Xiang Bai, Cong Yao

    Abstract: Recently, Visual Information Extraction (VIE) has been becoming increasingly important in both the academia and industry, due to the wide range of real-world applications. Previously, numerous works have been proposed to tackle this problem. However, the benchmarks used to assess these methods are relatively plain, i.e., scenarios with real-world complexity are not fully represented in these bench… ▽ More

    Submitted 28 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  40. arXiv:2303.09112  [pdf, other

    eess.IV cs.AI cs.LG cs.MM

    SigVIC: Spatial Importance Guided Variable-Rate Image Compression

    Authors: Jiaming Liang, Meiqin Liu, Chao Yao, Chunyu Lin, Yao Zhao

    Abstract: Variable-rate mechanism has improved the flexibility and efficiency of learning-based image compression that trains multiple models for different rate-distortion tradeoffs. One of the most common approaches for variable-rate is to channel-wisely or spatial-uniformly scale the internal features. However, the diversity of spatial importance is instructive for bit allocation of image compression. In… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE ICASSP2023 (Camera Ready)

  41. Limit of the Wulff crystal when approaching criticality for isoperimetry in 2D percolation

    Authors: Chang-Long Yao

    Abstract: We consider isoperimetric sets, i.e., sets with minimal vertex boundary for a prescribed volume, of the infinite cluster of supercritical site percolation on the triangular lattice. Let $p$ be the percolation parameter and let $p_c$ be the critical point. By adapting the proof of Biskup, Louidor, Procaccia and Rosenthal [6] for isoperimetry in bond percolation on the square lattice, we show that t… ▽ More

    Submitted 22 November, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 20 pages, 6 figures. To appear in Electronic Journal of Probability

    MSC Class: 60K35; 82B43

    Journal ref: Electronic Journal of Probability 28, no. 165, 1-20 (2023)

  42. arXiv:2303.03730  [pdf, other

    cs.CV

    LORE: Logical Location Regression Network for Table Structure Recognition

    Authors: Hangdi Xing, Feiyu Gao, Rujiao Long, Jiajun Bu, Qi Zheng, Liangcheng Li, Cong Yao, Zhi Yu

    Abstract: Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats. Recent methods solve this problem by predicting the adjacency relations of detected cell boxes, or learning to generate the corresponding markup sequences from the table images. However, they either count on additional heuristic rules to recover the table structures, or require a huge amount… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  43. arXiv:2303.03131  [pdf, other

    cs.CV cs.AI cs.MM

    Video Question Answering Using CLIP-Guided Visual-Text Attention

    Authors: Shuhong Ye, Weikai Kong, Chenglin Yao, Jianfeng Ren, Xudong Jiang

    Abstract: Cross-modal learning of video and text plays a key role in Video Question Answering (VideoQA). In this paper, we propose a visual-text attention mechanism to utilize the Contrastive Language-Image Pre-training (CLIP) trained on lots of general domain language-image pairs to guide the cross-modal learning for VideoQA. Specifically, we first extract video features using a TimeSformer and text featur… ▽ More

    Submitted 8 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Submitted to the 2023 IEEE International Conference on Image Processing (ICIP 2023)

    ACM Class: I.2.10

  44. arXiv:2303.03105  [pdf, other

    cs.MM

    Confidence-based Event-centric Online Video Question Answering on a Newly Constructed ATBS Dataset

    Authors: Weikai Kong, Shuhong Ye, Chenglin Yao, Jianfeng Ren

    Abstract: Deep neural networks facilitate video question answering (VideoQA), but the real-world applications on video streams such as CCTV and live cast place higher demands on the solver. To address the challenges of VideoQA on long videos of unknown length, we define a new set of problems called Online Open-ended Video Question Answering (O^2VQA). It requires an online state-updating mechanism for the so… ▽ More

    Submitted 7 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

  45. arXiv:2302.09528  [pdf

    cs.CV

    A Comprehensive Evaluation Study on Risk Level Classification of Melanoma by Computer Vision on ISIC 2016-2020 Datasets

    Authors: Chengdong Yao

    Abstract: Skin cancer is the most common type of cancer. Specifically, melanoma is the cause of 75% of skin cancer deaths, although it is the least common skin cancer. Better detection of melanoma could have a positive impact on millions of people. The ISIC archive contains the largest publicly available collection of dermatoscopic images of skin lesions. In this research, we investigate the efficacy of app… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: 9 pages, 12 figures, 11 tables

  46. arXiv:2302.09144  [pdf

    cs.RO

    Designing a Wayfinding Robot for People with Visual Impairments

    Authors: Shuijing Liu, Aamir Hasan, Kaiwen Hong, Chun-Kai Yao, Justin Lin, Weihang Liang, Megan A. Bayles, Wendy A. Rogers, Katherine Driggs-Campbell

    Abstract: People with visual impairments (PwVI) often have difficulties navigating through unfamiliar indoor environments. However, current wayfinding tools are fairly limited. In this short paper, we present our in-progress work on a wayfinding robot for PwVI. The robot takes an audio command from the user that specifies the intended destination. Then, the robot autonomously plans a path to navigate to the… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Presented at ICRA 2022 Workshop on Intelligent Control Methods and Machine Learning Algorithms for Human-Robot Interaction and Assistive Robotics

  47. Revealing the origin of neutrino masses through the Type II Seesaw mechanism at high-energy muon colliders

    Authors: Tong Li, Chang-Yuan Yao, Man Yuan

    Abstract: The future muon collider can play as an ideal machine to search for new physics at high energies. In this work, we study the search potential of the heavy Higgs triplet in the Type II Seesaw mechanism at muon colliders with high collision energy and high luminosity. The latest neutrino oscillation data are taken into account for realizing the leptonic decay modes of the charged Higgs bosons… ▽ More

    Submitted 27 February, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 33 pages, 18 figures, 4 tables. Version accepted for publication in JHEP

    Report number: DESY-23-006

  48. arXiv:2301.06523  [pdf

    cond-mat.str-el cond-mat.supr-con

    Giant Nernst effect in the crossover between Fermi liquid and strange metal

    Authors: Yusen Yang, Qian Tao, Yuqiang Fang, Guoxiong Tang, Chao Yao, Xiaoxian Yan, Chenxi Jiang, Xiangfan Xu, Fuqiang Huang, Wenxin Ding, Yu Wang, Zhiqiang Mao, Hui Xing, Zhu-An Xu

    Abstract: The strange-metal state is a crucial problem in condensed matter physics highlighted by its ubiquity in almost all major correlated systems[1-7]. Its understanding could provide important insight into high-Tc superconductivity[2] and quantum criticality[8]. However, with the Fermi liquid theory failing in strange metals, understanding the highly unconventional behaviors has been a long-standing ch… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Journal ref: Further revised version published in Nature Physics 2023

  49. Systematic study of one-loop realizations of $d=7$ long-range $0νββ$ decay operators

    Authors: Ping-Tao Chen, Gui-Jun Ding, Chang-Yuan Yao

    Abstract: We study the systematical one-loop decomposition of the dimension-7 long-range $0νββ$ decay operators. We find that there are 3 genuine one-loop topologies and 8 diagrams. The procedure to determine the SM quantum number assignments for both internal and external fields is presented. The Majorana neutrino mass in long-range $0νββ$ models is discussed. We also present a one-loop $0νββ$ decay model… ▽ More

    Submitted 23 March, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 42 pages, 19 figures

    Report number: DESY-22-211

  50. arXiv:2212.11042  [pdf, other

    cs.CV

    Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image Ensemble

    Authors: Chun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang, Varun Jampani

    Abstract: Automatically estimating 3D skeleton, shape, camera viewpoints, and part articulation from sparse in-the-wild image ensembles is a severely under-constrained and challenging problem. Most prior methods rely on large-scale image datasets, dense temporal correspondence, or human annotations like camera pose, 2D keypoints, and shape templates. We propose Hi-LASSIE, which performs 3D articulated recon… ▽ More

    Submitted 25 March, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: Project page: https://chhankyao.github.io/hi-lassie/