Skip to main content

Showing 1–50 of 59 results for author: Yao, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04067  [pdf, other

    cs.CL

    Semantic Graphs for Syntactic Simplification: A Revisit from the Age of LLM

    Authors: Peiran Yao, Kostyantyn Guzhva, Denilson Barbosa

    Abstract: Symbolic sentence meaning representations, such as AMR (Abstract Meaning Representation) provide expressive and structured semantic graphs that act as intermediates that simplify downstream NLP tasks. However, the instruction-following capability of large language models (LLMs) offers a shortcut to effectively solve NLP tasks, questioning the utility of semantic graphs. Meanwhile, recent work has… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted at TextGraphs-17 @ ACL 2024

  2. arXiv:2406.14629  [pdf, other

    cs.CL cs.AI

    Can LLMs Learn by Teaching? A Preliminary Study

    Authors: Xuefei Ning, Zifu Wang, Shiyao Li, Zinan Lin, Peiran Yao, Tianyu Fu, Matthew B. Blaschko, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Teaching to improve student models (e.g., knowledge distillation) is an extensively studied methodology in LLMs. However, for humans, teaching not only improves students but also improves teachers. We ask: Can LLMs also learn by teaching (LbT)? If yes, we can potentially unlock the possibility of continuously advancing the models without solely relying on human-produced data or stronger models. In… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  3. arXiv:2405.16702  [pdf, other

    cs.CL

    Accurate and Nuanced Open-QA Evaluation Through Textual Entailment

    Authors: Peiran Yao, Denilson Barbosa

    Abstract: Open-domain question answering (Open-QA) is a common task for evaluating large language models (LLMs). However, current Open-QA evaluations are criticized for the ambiguity in questions and the lack of semantic understanding in evaluators. Complex evaluators, powered by foundation models or LLMs and pertaining to semantic equivalence, still deviate from human judgments by a large margin. We propos… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: To appear at ACL 2024 (Findings)

  4. arXiv:2404.18771  [pdf, other

    cs.SE

    KBX: Verified Model Synchronization via Formal Bidirectional Transformation

    Authors: Jianhong Zhao, Yongwang Zhao, Peisen Yao, Fanlang Zeng, Bohua Zhan, Kui Ren

    Abstract: Complex safety-critical systems require multiple models for a comprehensive description, resulting in error-prone development and laborious verification. Bidirectional transformation (BX) is an approach to automatically synchronizing these models. However, existing BX frameworks lack formal verification to enforce these models' consistency rigorously. This paper introduces KBX, a formal bidirectio… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2401.00454  [pdf, other

    cs.CC quant-ph

    Quantum and Classical Communication Complexity of Permutation-Invariant Functions

    Authors: Ziyi Guan, Yunqi Huang, Penghui Yao, Zekun Ye

    Abstract: This paper gives a nearly tight characterization of the quantum communication complexity of the permutation-invariant Boolean functions. With such a characterization, we show that the quantum and randomized communication complexity of the permutation-invariant Boolean functions are quadratically equivalent (up to a logarithmic factor). Our results extend a recent line of research regarding query c… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: accepted in STACS 2024

  6. arXiv:2312.04360  [pdf, other

    quant-ph cs.CC

    The Computational Advantage of MIP* Vanishes in the Presence of Noise

    Authors: Yangjing Dong, Honghao Fu, Anand Natarajan, Minglong Qin, Haochen Xu, Penghui Yao

    Abstract: Quantum multiprover interactive proof systems with entanglement MIP* are much more powerful than their classical counterpart MIP (Babai et al. '91, Ji et al. '20): while MIP = NEXP, the quantum class MIP* is equal to RE, a class including the halting problem. This is because the provers in MIP* can share unbounded quantum entanglement. However, recent works of Qin and Yao '21 and '23 have shown th… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Comments are welcome!

  7. arXiv:2311.04723  [pdf, other

    quant-ph cs.IT

    Communication Complexity of Common Randomness Generation with Isotropic States

    Authors: Yangjing Dong, Penghui Yao

    Abstract: This paper addresses the problem of generating a common random string with min-entropy k using an unlimited supply of noisy EPR pairs or quantum isotropic states, with minimal communication between Alice and Bob. The paper considers two communication models -- one-way classical communication and one-way quantum communication, and derives upper bounds on the optimal common randomness rate for both… ▽ More

    Submitted 24 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 20 pages, 2 figures. Update funding information

  8. arXiv:2310.14464  [pdf, ps, other

    quant-ph cs.CR

    A Cryptographic Perspective on the Verifiability of Quantum Advantage

    Authors: Nai-Hui Chia, Honghao Fu, Fang Song, Penghui Yao

    Abstract: In recent years, achieving verifiable quantum advantage on a NISQ device has emerged as an important open problem in quantum information. The sampling-based quantum advantages are not known to have efficient verification methods. This paper investigates the verification of quantum advantage from a cryptographic perspective. We establish a strong connection between the verifiability of quantum adva… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 21 pages, 2 figures

  9. arXiv:2310.11478  [pdf, other

    cs.LG cs.AI cs.CV

    ASP: Automatic Selection of Proxy dataset for efficient AutoML

    Authors: Peng Yao, Chao Liao, Jiyuan Jia, Jianchao Tan, Bin Chen, Chengru Song, Di Zhang

    Abstract: Deep neural networks have gained great success due to the increasing amounts of data, and diverse effective neural network designs. However, it also brings a heavy computing burden as the amount of training data is proportional to the training time. In addition, a well-behaved model requires repeated trials of different structure designs and hyper-parameters, which may take a large amount of time… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: This paper was actually finished in 2021

  10. arXiv:2310.11117  [pdf, other

    cs.CV cs.AI

    USDC: Unified Static and Dynamic Compression for Visual Transformer

    Authors: Huan Yuan, Chao Liao, Jianchao Tan, Peng Yao, Jiyuan Jia, Bin Chen, Chengru Song, Di Zhang

    Abstract: Visual Transformers have achieved great success in almost all vision tasks, such as classification, detection, and so on. However, the model complexity and the inference speed of the visual transformers hinder their deployments in industrial products. Various model compression techniques focus on directly compressing the visual transformers into a smaller one while maintaining the model performanc… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: This paper was actually finished in 2021

  11. arXiv:2309.11279  [pdf, other

    cs.CC quant-ph

    On the Fine-Grained Query Complexity of Symmetric Functions

    Authors: Supartha Podder, Penghui Yao, Zekun Ye

    Abstract: This paper explores a fine-grained version of the Watrous conjecture, including the randomized and quantum algorithms with success probabilities arbitrarily close to $1/2$. Our contributions include the following: i) An analysis of the optimal success probability of quantum and randomized query algorithms of two fundamental partial symmetric Boolean functions given a fixed number of queries. We… ▽ More

    Submitted 21 October, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: accepted in ISAAC 2023

  12. arXiv:2309.08941  [pdf, ps, other

    quant-ph cs.CC cs.CR

    Quantum Pseudorandom Scramblers

    Authors: Chuhan Lu, Minglong Qin, Fang Song, Penghui Yao, Mingnan Zhao

    Abstract: Quantum pseudorandom state generators (PRSGs) have stimulated exciting developments in recent years. A PRSG, on a fixed initial (e.g., all-zero) state, produces an output state that is computationally indistinguishable from a Haar random state. However, pseudorandomness of the output state is not guaranteed on other initial states. In fact, known PRSG constructions provably fail on some initial st… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

  13. arXiv:2309.05683  [pdf, other

    cs.LG cs.AI cs.RO

    EANet: Expert Attention Network for Online Trajectory Prediction

    Authors: Pengfei Yao, Tianlu Mao, Min Shi, Jingkai Sun, Zhaoqi Wang

    Abstract: Trajectory prediction plays a crucial role in autonomous driving. Existing mainstream research and continuoual learning-based methods all require training on complete datasets, leading to poor prediction accuracy when sudden changes in scenarios occur and failing to promptly respond and update the model. Whether these methods can make a prediction in real-time and use data instances to update the… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  14. arXiv:2309.05264  [pdf, other

    cs.SE cs.AI

    Enabling Runtime Verification of Causal Discovery Algorithms with Automated Conditional Independence Reasoning (Extended Version)

    Authors: Pingchuan Ma, Zhenlan Ji, Peisen Yao, Shuai Wang, Kui Ren

    Abstract: Causal discovery is a powerful technique for identifying causal relationships among variables in data. It has been widely used in various applications in software engineering. Causal discovery extensively involves conditional independence (CI) tests. Hence, its output quality highly depends on the performance of CI tests, which can often be unreliable in practice. Moreover, privacy concerns arise… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  15. arXiv:2308.15987  [pdf, other

    cs.CL cs.AI cs.LG

    FPTQ: Fine-grained Post-Training Quantization for Large Language Models

    Authors: Qingyuan Li, Yifan Zhang, Liang Li, Peng Yao, Bo Zhang, Xiangxiang Chu, Yerui Sun, Li Du, Yuchen Xie

    Abstract: In the era of large-scale language models, the substantial parameter size poses significant challenges for deployment. Being a prevalent compression technique, quantization has emerged as the mainstream practice to tackle this issue, which is mainly centered on two recipes W8A8 and W4A16 (i.e. weights and activations in such bit widths). In this study, we propose a novel W4A8 post-training quantiz… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  16. arXiv:2307.14588  [pdf

    eess.IV cs.CV cs.LG

    MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation

    Authors: Liang Xu, Mingxiao Chen, Yi Cheng, Pengfei Shao, Shuwei Shen, Peng Yao, Ronald X. Xu

    Abstract: The UNet architecture, based on Convolutional Neural Networks (CNN), has demonstrated its remarkable performance in medical image analysis. However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome t… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  17. arXiv:2307.07928  [pdf

    cs.CV

    Reinforced Disentanglement for Face Swapping without Skip Connection

    Authors: Xiaohang Ren, Xingyu Chen, Pengfei Yao, Heung-Yeung Shum, Baoyuan Wang

    Abstract: The SOTA face swap models still suffer the problem of either target identity (i.e., shape) being leaked or the target non-identity attributes (i.e., background, hair) failing to be fully preserved in the final results. We show that this insufficient disentanglement is caused by two flawed designs that were commonly adopted in prior models: (1) counting on only one compressed encoder to represent b… ▽ More

    Submitted 3 August, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023

  18. arXiv:2305.12097  [pdf, ps, other

    quant-ph cs.CC

    On Testing and Learning Quantum Junta Channels

    Authors: Zongbo Bao, Penghui Yao

    Abstract: We consider the problems of testing and learning quantum $k$-junta channels, which are $n$-qubit to $n$-qubit quantum channels acting non-trivially on at most $k$ out of $n$ qubits and leaving the rest of qubits unchanged. We show the following. 1. An $O\left(k\right)$-query algorithm to distinguish whether the given channel is $k$-junta channel or is far from any $k$-junta channels, and a lower… ▽ More

    Submitted 19 December, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  19. arXiv:2305.04316  [pdf, other

    cs.PL cs.SE

    Synthesizing Conjunctive Queries for Code Search

    Authors: Chengpeng Wang, Peisen Yao, Wensheng Tang, Gang Fan, Charles Zhang

    Abstract: This paper presents Squid, a new conjunctive query synthesis algorithm for searching code with target patterns. Given positive and negative examples along with a natural language description, Squid analyzes the relations derived from the examples by a Datalog-based program analyzer and synthesizes a conjunctive query expressing the search intent. The synthesized query can be further used to search… ▽ More

    Submitted 11 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 32 pages, 7 figures, and 1 table. Accepted by ECOOP 2023

  20. arXiv:2304.12690  [pdf, ps, other

    quant-ph cs.IT

    The Generations of Classical Correlations via Quantum Schemes

    Authors: Zhenyu Chen, Lijinzhi Lin, Xiaodie Lin, Zhaohui Wei, Penghui Yao

    Abstract: Suppose two separated parties, Alice and Bob, share a bipartite quantum state or a classical correlation called a \emph{seed}, and they try to generate a target classical correlation by performing local quantum or classical operations on the seed, i.e., any communications are not allowed. We consider the following fundamental problem about this setting: whether Alice and Bob can use a given seed t… ▽ More

    Submitted 13 May, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 18 pages, no figures. To appear in IEEE Transactions on Information Theory. Comments are welcome

  21. NLP Workbench: Efficient and Extensible Integration of State-of-the-art Text Mining Tools

    Authors: Peiran Yao, Matej Kosmajac, Abeer Waheed, Kostyantyn Guzhva, Natalie Hervieux, Denilson Barbosa

    Abstract: NLP Workbench is a web-based platform for text mining that allows non-expert users to obtain semantic understanding of large-scale corpora using state-of-the-art text mining models. The platform is built upon latest pre-trained models and open source systems from academia that provide semantic analysis functionalities, including but not limited to entity linking, sentiment analysis, semantic parsi… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Camera-ready version for EACL 2023: System Demonstrations

  22. arXiv:2301.11011  [pdf, other

    cs.PL cs.SE

    Verifying Data Constraint Equivalence in FinTech Systems

    Authors: Chengpeng Wang, Gang Fan, Peisen Yao, Fuxiong Pan, Charles Zhang

    Abstract: Data constraints are widely used in FinTech systems for monitoring data consistency and diagnosing anomalous data manipulations. However, many equivalent data constraints are created redundantly during the development cycle, slowing down the FinTech systems and causing unnecessary alerts. We present EqDAC, an efficient decision procedure to determine the data constraint equivalence. We first propo… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 14 pages, 11 figures, accepted by ICSE 2023

  23. Quantum Complexity of Weighted Diameter and Radius in CONGEST Networks

    Authors: Xudong Wu, Penghui Yao

    Abstract: This paper studies the round complexity of computing the weighted diameter and radius of a graph in the quantum CONGEST model. We present a quantum algorithm that $(1+o(1))$-approximates the diameter and radius with round complexity $\widetilde O\left(\min\left\{n^{9/10}D^{3/10},n\right\}\right)$, where $D$ denotes the unweighted diameter. This exhibits the advantages of quantum communication over… ▽ More

    Submitted 26 September, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 24 pages. accepted by PODC 2022

    Journal ref: Proceedings of the 2022 ACM Symposium on Principles of Distributed Computing (PODC 2022), pp. 120-130, 2022

  24. Complexity of Eccentricities and All-Pairs Shortest Paths in the Quantum CONGEST Model

    Authors: ChengSheng Wang, Xudong Wu, Penghui Yao

    Abstract: Computing the distance parameters of a network, including the diameter, radius, eccentricities and the all-pairs shortest paths (APSP) is a central problem in distributed computing. This paper investigates he dtistance parameters in the quantum CONGEST models and establishes almost linear lower bounds on eccentricities and APSP, which match the classical upper bounds. Our results imply that there… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 16 pages. invited paper on SPIN, Special Issue on Quantum Algorithms and Software

  25. arXiv:2205.12117  [pdf, other

    cs.LG

    Phased Progressive Learning with Coupling-Regulation-Imbalance Loss for Imbalanced Data Classification

    Authors: Liang Xu, Yi Cheng, Fan Zhang, Bingxuan Wu, Pengfei Shao, Peng Liu, Shuwei Shen, Peng Yao, Ronald X. Xu

    Abstract: Deep convolutional neural networks often perform poorly when faced with datasets that suffer from quantity imbalances and classification difficulties. Despite advances in the field, existing two-stage approaches still exhibit dataset bias or domain shift. To counter this, a phased progressive learning schedule has been proposed that gradually shifts the emphasis from representation learning to tra… ▽ More

    Submitted 15 March, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

  26. arXiv:2201.12772  [pdf, ps, other

    cs.DS quant-ph

    Polynomial-Time Approximation of Zero-Free Partition Functions

    Authors: Penghui Yao, Yitong Yin, Xinyuan Zhang

    Abstract: Zero-free based algorithm is a major technique for deterministic approximate counting. In Barvinok's original framework[Bar17], by calculating truncated Taylor expansions, a quasi-polynomial time algorithm was given for estimating zero-free partition functions. Patel and Regts[PR17] later gave a refinement of Barvinok's framework, which gave a polynomial-time algorithm for a class of zero-free gra… ▽ More

    Submitted 30 January, 2022; originally announced January 2022.

  27. arXiv:2110.10423  [pdf, other

    cs.LG

    ProxyBO: Accelerating Neural Architecture Search via Bayesian Optimization with Zero-cost Proxies

    Authors: Yu Shen, Yang Li, Jian Zheng, Wentao Zhang, Peng Yao, Jixiang Li, Sen Yang, Ji Liu, Bin Cui

    Abstract: Designing neural architectures requires immense manual efforts. This has promoted the development of neural architecture search (NAS) to automate the design. While previous NAS methods achieve promising results but run slowly, zero-cost proxies run extremely fast but are less promising. Therefore, it is of great potential to accelerate NAS via those zero-cost proxies. The existing method has two l… ▽ More

    Submitted 13 March, 2023; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: Accepted by AAAI 2023

  28. arXiv:2109.07923  [pdf, other

    cs.PL cs.SE

    Efficient Path-Sensitive Data-Dependence Analysis

    Authors: Peisen Yao, Jinguo Zhou, Xiao Xiao, Qingkai Shi, Rongxin Wu, Charles Zhang

    Abstract: This paper presents a scalable path- and context-sensitive data-dependence analysis. The key is to address the aliasing-path-explosion problem via a sparse, demand-driven, and fused approach that piggybacks the computation of pointer information with the resolution of data dependence. Specifically, our approach decomposes the computational efforts of disjunctive reasoning into 1) a context- and se… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  29. arXiv:2107.13200  [pdf

    eess.IV cs.CV cs.LG

    An explainable two-dimensional single model deep learning approach for Alzheimer's disease diagnosis and brain atrophy localization

    Authors: Fan Zhang, Bo Pan, Pengfei Shao, Peng Liu, Shuwei Shen, Peng Yao, Ronald X. Xu

    Abstract: Early and accurate diagnosis of Alzheimer's disease (AD) and its prodromal period mild cognitive impairment (MCI) is essential for the delayed disease progression and the improved quality of patients'life. The emerging computer-aided diagnostic methods that combine deep learning with structural magnetic resonance imaging (sMRI) have achieved encouraging results, but some of them are limit of issue… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  30. arXiv:2107.03660  [pdf, other

    cs.SE cs.DB cs.PL

    Duplicate-sensitivity Guided Transformation Synthesis for DBMS Correctness Bug Detection

    Authors: Yushan Zhang, Peisen Yao, Rongxin Wu, Charles Zhang

    Abstract: Database Management System (DBMS) plays a core role in modern software from mobile apps to online banking. It is critical that DBMS should provide correct data to all applications. When the DBMS returns incorrect data, a correctness bug is triggered. Current production-level DBMSs still suffer from insufficient testing due to the limited hand-written test cases. Recently several works proposed to… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 11 pages, 6 figures, 7 tables

  31. arXiv:2106.14300  [pdf, other

    cs.LG cs.AI cs.CR

    ASK: Adversarial Soft k-Nearest Neighbor Attack and Defense

    Authors: Ren Wang, Tianqi Chen, Philip Yao, Sijia Liu, Indika Rajapakse, Alfred Hero

    Abstract: K-Nearest Neighbor (kNN)-based deep learning methods have been applied to many applications due to their simplicity and geometric interpretability. However, the robustness of kNN-based classification models has not been thoroughly explored and kNN attack strategies are underdeveloped. In this paper, we propose an Adversarial Soft kNN (ASK) loss to both design more effective kNN attack strategies a… ▽ More

    Submitted 26 September, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

  32. arXiv:2105.05956  [pdf

    cs.ET cond-mat.dis-nn cond-mat.mtrl-sci

    2022 Roadmap on Neuromorphic Computing and Engineering

    Authors: Dennis V. Christensen, Regina Dittmann, Bernabé Linares-Barranco, Abu Sebastian, Manuel Le Gallo, Andrea Redaelli, Stefan Slesazeck, Thomas Mikolajick, Sabina Spiga, Stephan Menzel, Ilia Valov, Gianluca Milano, Carlo Ricciardi, Shi-Jun Liang, Feng Miao, Mario Lanza, Tyler J. Quill, Scott T. Keene, Alberto Salleo, Julie Grollier, Danijela Marković, Alice Mizrahi, Peng Yao, J. Joshua Yang, Giacomo Indiveri , et al. (34 additional authors not shown)

    Abstract: Modern computation based on the von Neumann architecture is today a mature cutting-edge science. In the Von Neumann architecture, processing and memory units are implemented as separate blocks interchanging data intensively and continuously. This data transfer is responsible for a large part of the power consumption. The next generation computer technology is expected to solve problems at the exas… ▽ More

    Submitted 13 January, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Journal ref: Neuromorph. Comput. Eng. 2 022501 (2022)

  33. arXiv:2102.02307  [pdf, other

    cs.DB cs.LG

    Typing Errors in Factual Knowledge Graphs: Severity and Possible Ways Out

    Authors: Peiran Yao, Denilson Barbosa

    Abstract: Factual knowledge graphs (KGs) such as DBpedia and Wikidata have served as part of various downstream tasks and are also widely adopted by artificial intelligence research communities as benchmark datasets. However, we found these KGs to be surprisingly noisy. In this study, we question the quality of these KGs, where the typing error rate is estimated to be 27% for coarse-grained types on average… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 9 pages, 3 figures Camera-ready for WWW2021

  34. Single Model Deep Learning on Imbalanced Small Datasets for Skin Lesion Classification

    Authors: Peng Yao, Shuwei Shen, Mengjuan Xu, Peng Liu, Fan Zhang, Jinyu Xing, Pengfei Shao, Benjamin Kaffenberger, Ronald X. Xu

    Abstract: Deep convolutional neural network (DCNN) models have been widely explored for skin disease diagnosis and some of them have achieved the diagnostic outcomes comparable or even superior to those of dermatologists. However, broad implementation of DCNN in skin disease detection is hindered by small size and data imbalance of the publically accessible skin lesion datasets. This paper proposes a novel… ▽ More

    Submitted 11 February, 2022; v1 submitted 1 February, 2021; originally announced February 2021.

    Journal ref: IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021

  35. arXiv:2101.08141  [pdf, ps, other

    cs.CC math.CO

    Positive spectrahedra: Invariance principles and Pseudorandom generators

    Authors: Srinivasan Arunachalam, Penghui Yao

    Abstract: In a recent work, O'Donnell, Servedio and Tan (STOC 2019) gave explicit pseudorandom generators (PRGs) for arbitrary $m$-facet polytopes in $n$ variables with seed length poly-logarithmic in $m,n$, concluding a sequence of works in the last decade, that was started by Diakonikolas, Gopalan, Jaiswal, Servedio, Viola (SICOMP 2010) and Meka, Zuckerman (SICOMP 2013) for fooling linear and polynomial t… ▽ More

    Submitted 1 June, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

    Comments: 63 pages. v2: Minor revisions and Improvements in presentation

  36. arXiv:2101.02353  [pdf

    cs.CV cs.AI

    Low-cost and high-performance data augmentation for deep-learning-based skin lesion classification

    Authors: Shuwei Shen, Mengjuan Xu, Fan Zhang, Pengfei Shao, Honghong Liu, Liang Xu, Chi Zhang, Peng Liu, Zhihong Zhang, Peng Yao, Ronald X. Xu

    Abstract: Although deep convolutional neural networks (DCNNs) have achieved significant accuracy in skin lesion classification comparable or even superior to those of dermatologists, practical implementation of these models for skin cancer screening in low resource settings is hindered by their limitations in computational cost and training dataset. To overcome these limitations, we propose a low-cost and h… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: 8 pages, 5 figures

  37. arXiv:2012.07101  [pdf, other

    cs.CV

    Learning Heatmap-Style Jigsaw Puzzles Provides Good Pretraining for 2D Human Pose Estimation

    Authors: Kun Zhang, Rui Wu, Ping Yao, Kai Deng, Ding Li, Renbiao Liu, Chuanguang Yang, Ge Chen, Min Du, Tianyao Zheng

    Abstract: The target of 2D human pose estimation is to locate the keypoints of body parts from input 2D images. State-of-the-art methods for pose estimation usually construct pixel-wise heatmaps from keypoints as labels for learning convolution neural networks, which are usually initialized randomly or using classification models on ImageNet as their backbones. We note that 2D pose estimation task is highly… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

  38. arXiv:2007.10673  [pdf, ps, other

    quant-ph cs.CC cs.IT

    Quantum and Classical Hybrid Generations for Classical Correlations

    Authors: Xiaodie Lin, Zhaohui Wei, Penghui Yao

    Abstract: We consider two-stage hybrid protocols that combine quantum resource and classical resource to generate classical correlations shared by two separated players. Our motivation is twofold. First, in the near future the scale of quantum information processing is quite limited, and when quantum resource available is not sufficient for certain tasks, a possible way to strengthen the capability of quant… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

    Comments: 13 pages

  39. arXiv:2003.08897  [pdf, other

    cs.CV cs.CL cs.MM

    Normalized and Geometry-Aware Self-Attention Network for Image Captioning

    Authors: Longteng Guo, Jing Liu, Xinxin Zhu, Peng Yao, Shichen Lu, Hanqing Lu

    Abstract: Self-attention (SA) network has shown profound value in image captioning. In this paper, we improve SA from two aspects to promote the performance of image captioning. First, we propose Normalized Self-Attention (NSA), a reparameterization of SA that brings the benefits of normalization inside SA. While normalization is previously only applied outside SA, we introduce a novel normalization method… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: Accepted by CVPR 2020

  40. arXiv:2001.02818  [pdf, other

    quant-ph cs.CC cs.DS cs.IT

    Capacity Approaching Coding for Low Noise Interactive Quantum Communication, Part I: Large Alphabets

    Authors: Debbie Leung, Ashwin Nayak, Ala Shayeghi, Dave Touchette, Penghui Yao, Nengkun Yu

    Abstract: We consider the problem of implementing two-party interactive quantum communication over noisy channels, a necessary endeavor if we wish to fully reap quantum advantages for communication. For an arbitrary protocol with $n$ messages, designed for a noiseless qudit channel over a $\mathrm{poly}(n)$ size alphabet, our main result is a simulation method that fails with probability less than… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.

    Comments: 94 pages, 7 figures

  41. arXiv:1910.11102  [pdf, other

    cs.CV cs.MM

    Vatex Video Captioning Challenge 2020: Multi-View Features and Hybrid Reward Strategies for Video Captioning

    Authors: Xinxin Zhu, Longteng Guo, Peng Yao, Shichen Lu, Wei Liu, Jing Liu

    Abstract: This report describes our solution for the VATEX Captioning Challenge 2020, which requires generating descriptions for the videos in both English and Chinese languages. We identified three crucial factors that improve the performance, namely: multi-view features, hybrid reward, and diverse ensemble. Based on our method of VATEX 2019 challenge, we achieved significant improvements this year with mo… ▽ More

    Submitted 23 June, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: 4 pages,2 figure

  42. Learning Enhanced Resolution-wise features for Human Pose Estimation

    Authors: Kun Zhang, Peng He, Ping Yao, Ge Chen, Rui Wu, Min Du, Huimin Li, Li Fu, Tianyao Zheng

    Abstract: Recently, multi-resolution networks (such as Hourglass, CPN, HRNet, etc.) have achieved significant performance on pose estimation by combining feature maps of various resolutions. In this paper, we propose a Resolution-wise Attention Module (RAM) and Gradual Pyramid Refinement (GPR), to learn enhanced resolution-wise feature maps for precise pose estimation. Specifically, RAM learns a group of we… ▽ More

    Submitted 13 December, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: Published on ICIP 2020

  43. arXiv:1904.08832  [pdf, other

    quant-ph cs.CC cs.DS

    A doubly exponential upper bound on noisy EPR states for binary games

    Authors: Penghui Yao

    Abstract: This paper initiates the study of a class of entangled games, mono-state games, denoted by $(G,ψ)$, where $G$ is a two-player one-round game and $ψ$ is a bipartite state independent of the game $G$. In the mono-state game $(G,ψ)$, the players are only allowed to share arbitrary copies of $ψ$. This paper provides a doubly exponential upper bound on the copies of $ψ$ for the players to approximate t… ▽ More

    Submitted 15 September, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: The proof of Lemma C.9 is corrected. The presentation is improved. Some typos are corrected

  44. arXiv:1903.10153  [pdf, other

    cs.CV

    DenseBody: Directly Regressing Dense 3D Human Pose and Shape From a Single Color Image

    Authors: Pengfei Yao, Zheng Fang, Fan Wu, Yao Feng, Jiwei Li

    Abstract: Recovering 3D human body shape and pose from 2D images is a challenging task due to high complexity and flexibility of human body, and relatively less 3D labeled data. Previous methods addressing these issues typically rely on predicting intermediate results such as body part segmentation, 2D/3D joints, silhouette mask to decompose the problem into multiple sub-tasks in order to utilize more 2D la… ▽ More

    Submitted 28 March, 2019; v1 submitted 25 March, 2019; originally announced March 2019.

    Comments: 10 pages, 6 figures

  45. arXiv:1901.00984  [pdf, other

    quant-ph cs.DS

    Quantum Insertion-Deletion Channels

    Authors: Janet Leahy, Dave Touchette, Penghui Yao

    Abstract: We introduce a model of quantum insertion-deletion (insdel) channels. Insdel channels are meant to represent, for example, synchronization errors arising in data transmission. In the classical setting, they represent a strict generalization of the better-understood corruption error channels, and until recently, had mostly resisted effort toward a similar understanding as their corruption counterpa… ▽ More

    Submitted 4 January, 2019; originally announced January 2019.

  46. On the compression of messages in the multi-party setting

    Authors: Anurag Anshu, Penghui Yao

    Abstract: We consider the following communication task in the multi-party setting, which involves a joint random variable $XYZMN$ with the property that $M$ is independent of $YZN$ conditioned on $X$ and $N$ is independent of $XZM$ conditioned on $Y$. Three parties Alice, Bob and Charlie, respectively, observe samples $x,y$ and $z$ from $XYZ$. Alice and Bob communicate messages to Charlie with the goal that… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: version 1, 34 pages, 2 figures

    Journal ref: IEEE Transactions on Information Theory ( Volume: 66 , Issue: 4 , April 2020 )

  47. arXiv:1806.00751  [pdf, ps, other

    cs.DC cs.AR

    An Efficient Graph Accelerator with Parallel Data Conflict Management

    Authors: Pengcheng Yao

    Abstract: Graph-specific computing with the support of dedicated accelerator has greatly boosted the graph processing in both efficiency and energy. Nevertheless, their data conflict management is still sequential in essential when some vertex needs a large number of conflicting updates at the same time, leading to prohibitive performance degradation. This is particularly true for processing natural graphs.… ▽ More

    Submitted 3 June, 2018; originally announced June 2018.

  48. arXiv:1611.08946  [pdf, other

    quant-ph cs.CC cs.IT

    Exponential Separation of Quantum Communication and Classical Information

    Authors: Anurag Anshu, Dave Touchette, Penghui Yao, Nengkun Yu

    Abstract: We exhibit a Boolean function for which the quantum communication complexity is exponentially larger than the classical information complexity. An exponential separation in the other direction was already known from the work of Kerenidis et. al. [SICOMP 44, pp. 1550-1572], hence our work implies that these two complexity measures are incomparable. As classical information complexity is an upper bo… ▽ More

    Submitted 27 November, 2016; originally announced November 2016.

    Comments: v1, 36 pages, 3 figures

    Journal ref: Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017

  49. arXiv:1506.02936  [pdf, other

    cs.CC

    Parity Decision Tree Complexity and 4-Party Communication Complexity of XOR-functions Are Polynomially Equivalent

    Authors: Penghui Yao

    Abstract: In this note, we study the relation between the parity decision tree complexity of a boolean function $f$, denoted by $\mathrm{D}_{\oplus}(f)$, and the $k$-party number-in-hand multiparty communication complexity of the XOR functions $F(x_1,\ldots, x_k)= f(x_1\oplus\cdots\oplus x_k)$, denoted by $\mathrm{CC}^{(k)}(F)$. It is known that $\mathrm{CC}^{(k)}(F)\leq k\cdot\mathrm{D}_{\oplus}(f)$ becaus… ▽ More

    Submitted 28 June, 2015; v1 submitted 9 June, 2015; originally announced June 2015.

  50. arXiv:1405.6015  [pdf, ps, other

    quant-ph cs.CC

    Multipartite Quantum Correlation and Communication Complexities

    Authors: Rahul Jain, Zhaohui Wei, Penghui Yao, Shengyu Zhang

    Abstract: The concepts of quantum correlation complexity and quantum communication complexity were recently proposed to quantify the minimum amount of resources needed in generating bipartite classical or quantum states in the single-shot setting. The former is the minimum size of the initially shared state $σ$ on which local operations by the two parties (without communication) can generate the target stat… ▽ More

    Submitted 17 July, 2014; v1 submitted 23 May, 2014; originally announced May 2014.

    Comments: 19 pages; some typos are corrected