Zum Hauptinhalt springen

Showing 1–50 of 72 results for author: Yuan, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.13987  [pdf, other

    cs.CL cs.AI

    Focused Large Language Models are Stable Many-Shot Learners

    Authors: Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li

    Abstract: In-Context Learning (ICL) enables large language models (LLMs) to achieve rapid task adaptation by learning from demonstrations. With the increase in available context length of LLMs, recent experiments have shown that the performance of ICL does not necessarily scale well in many-shot (demonstration) settings. We theoretically and experimentally confirm that the reason lies in more demonstrations… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: 15 pages

  2. arXiv:2408.13738  [pdf, other

    cs.CL

    Poor-Supervised Evaluation for SuperLLM via Mutual Consistency

    Authors: Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Boyuan Pan, Heda Wang, Yao Hu, Kan Li

    Abstract: The guidance from capability evaluations has greatly propelled the progress of both human society and Artificial Intelligence. However, as LLMs evolve, it becomes challenging to construct evaluation benchmarks for them with accurate labels on hard tasks that approach the boundaries of human capabilities. To credibly conduct evaluation without accurate labels (denoted as poor-supervised evaluation)… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: ACL findings

  3. arXiv:2408.13457  [pdf, other

    cs.CL cs.AI

    Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning

    Authors: Xinglin Wang, Shaoxiong Feng, Yiwei Li, Peiwen Yuan, Yueqi Zhang, Boyuan Pan, Heda Wang, Yao Hu, Kan Li

    Abstract: Self-consistency (SC), a widely used decoding strategy for chain-of-thought reasoning, shows significant gains across various multi-step reasoning tasks but comes with a high cost due to multiple sampling with the preset size. Its variants, Adaptive self-consistency (ASC) and Early-stopping self-consistency (ESC), dynamically adjust the number of samples based on the posterior distribution of a se… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: Preprint

  4. arXiv:2408.09150  [pdf, other

    cs.CL cs.AI

    CogLM: Tracking Cognitive Development of Large Language Models

    Authors: Xinglin Wang, Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Boyuan Pan, Heda Wang, Yao Hu, Kan Li

    Abstract: Piaget's Theory of Cognitive Development (PTC) posits that the development of cognitive levels forms the foundation for human learning across various abilities. As Large Language Models (LLMs) have recently shown remarkable abilities across a wide variety of tasks, we are curious about the cognitive levels of current LLMs: to what extent they have developed and how this development has been achiev… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: under review

  5. arXiv:2408.04845  [pdf

    cs.LG

    MDS-GNN: A Mutual Dual-Stream Graph Neural Network on Graphs with Incomplete Features and Structure

    Authors: Peng Yuan, Peng Tang

    Abstract: Graph Neural Networks (GNNs) have emerged as powerful tools for analyzing and learning representations from graph-structured data. A crucial prerequisite for the outstanding performance of GNNs is the availability of complete graph information, i.e., node features and graph structure, which is frequently unmet in real-world scenarios since graphs are often incomplete due to various uncontrollable… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  6. arXiv:2408.04299  [pdf, other

    cs.CV

    Respiratory Subtraction for Pulmonary Microwave Ablation Evaluation

    Authors: Wan Li, Xinyun Zhong, Wei Li, Song Zhang, Moheng Rong, Yan Xi, Peng Yuan, Zechen Wang, Xiaolei Jiang, Rongxi Yi, Hui Tang, Yang Chen, Chaohui Tong, Zhan Wu, Feng Wang

    Abstract: Currently, lung cancer is a leading cause of global cancer mortality, often necessitating minimally invasive interventions. Microwave ablation (MWA) is extensively utilized for both primary and secondary lung tumors. Although numerous clinical guidelines and standards for MWA have been established, the clinical evaluation of ablation surgery remains challenging and requires long-term patient follo… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  7. arXiv:2408.04218  [pdf, other

    cs.IT

    On many-to-one mappings over finite fields

    Authors: Yanbin Zheng, Yanjin Ding, Meiying Zhang, Pingzhi Yuan, Qiang Wang

    Abstract: The definition of many-to-one mapping, or $m$-to-$1$ mapping for short, between two finite sets is introduced in this paper, which unifies and generalizes the definitions of $2$-to-$1$ mappings and $n$-to-$1$ mappings. A generalized local criterion is given, which is an abstract criterion for a mapping to be $m$-to-$1$. By employing the generalized local criterion, three constructions of $m$-to-… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  8. arXiv:2407.16137  [pdf

    cs.CV

    3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images

    Authors: Jie Zhao, Jianing Li, Weihan Chen, Wentong Wang, Pengfei Yuan, Xu Zhang, Deshu Peng

    Abstract: Human pose estimation remains a multifaceted challenge in computer vision, pivotal across diverse domains such as behavior recognition, human-computer interaction, and pedestrian tracking. This paper proposes an improved method based on the spatial-temporal graph convolution net-work (UGCN) to address the issue of missing human posture skeleton sequences in single-view videos. We present the impro… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Proceedings of IEEE AICON2024

  9. arXiv:2407.02056  [pdf, other

    cs.CL cs.AI

    Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation

    Authors: Xinglin Wang, Yiwei Li, Shaoxiong Feng, Peiwen Yuan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li

    Abstract: Self-consistency (SC), leveraging multiple samples from LLMs, shows significant gains on various reasoning tasks but struggles with free-form generation due to the difficulty of aggregating answers. Its variants, UCS and USC, rely on sample selection or voting mechanisms to improve output quality. These methods, however, face limitations due to their inability to fully utilize the nuanced consensu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to ACL2024 Main Conference

  10. arXiv:2406.18984  [pdf, other

    cs.IR

    Amplify Graph Learning for Recommendation via Sparsity Completion

    Authors: Peng Yuan, Haojie Li, Minying Fang, Xu Yu, Yongjing Hao, Junwei Du

    Abstract: Graph learning models have been widely deployed in collaborative filtering (CF) based recommendation systems. Due to the issue of data sparsity, the graph structure of the original input lacks potential positive preference edges, which significantly reduces the performance of recommendations. In this paper, we study how to enhance the graph structure for CF more effectively, thereby optimizing the… ▽ More

    Submitted 1 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  11. arXiv:2406.11782  [pdf, ps, other

    cs.IT

    Soft-output Guessing Codeword Decoding

    Authors: Ken R. Duffy, Peihong Yuan, Joseph Griffin, Muriel Medard

    Abstract: We establish that it is possible to extract accurate blockwise and bitwise soft output from Guessing Codeword Decoding with minimal additional computational complexity by considering it as a variant of Guessing Random Additive Noise Decoding. Blockwise soft output can be used to control decoding misdetection rate while bitwise soft output results in a soft-input soft-output decoder that can be use… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  12. arXiv:2406.08087  [pdf, ps, other

    eess.SP cs.IT

    A Unified Pilot Design for Integrated Sensing and Communications

    Authors: Pu Yuan

    Abstract: This paper investigates a unified pilot signal design in an orthogonal frequency division modulation (OFDM)-based integrated sensing and communications (ISAC) system. The novel designed two-dimensional (2D) pilot signal is generated on the delay-Doppler (DD) plane for sensing, while its time-frequency (TF) plane transformation acts as the demodulation reference signal (DMRS) for the OFDM data. The… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: ICC 2024 Workshop. arXiv admin note: text overlap with arXiv:2307.12595

  13. arXiv:2404.11105  [pdf, other

    cs.DB cs.DC

    XMiner: Efficient Directed Subgraph Matching with Pattern Reduction

    Authors: Pingpeng Yuan, Yujiang Wang, Tianyu Ma, Siyuan He, Ling Liu

    Abstract: Graph pattern matching, one of the fundamental graph mining problems, aims to extract structural patterns of interest from an input graph. The state-of-the-art graph matching algorithms and systems are mainly designed for undirected graphs. Directed graph matching is more complex than undirected graph matching because the edge direction must be taken into account before the exploration of each dir… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  14. arXiv:2404.05168  [pdf, other

    cs.LG

    Adapting to Covariate Shift in Real-time by Encoding Trees with Motion Equations

    Authors: Tham Yik Foong, Heng Zhang, Mao Po Yuan, Danilo Vasconcellos Vargas

    Abstract: Input distribution shift presents a significant problem in many real-world systems. Here we present Xenovert, an adaptive algorithm that can dynamically adapt to changes in input distribution. It is a perfect binary tree that adaptively divides a continuous input space into several intervals of uniform density while receiving a continuous stream of input. This process indirectly maps the source di… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 7 figures, 2 tables

  15. arXiv:2403.07564  [pdf, other

    cs.CV

    RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model

    Authors: Mingze Wang, Lili Su, Cilin Yan, Sheng Xu, Pengcheng Yuan, Xiaolong Jiang, Baochang Zhang

    Abstract: The intelligent interpretation of buildings plays a significant role in urban planning and management, macroeconomic analysis, population dynamics, etc. Remote sensing image building interpretation primarily encompasses building extraction and change detection. However, current methodologies often treat these two tasks as separate entities, thereby failing to leverage shared knowledge. Moreover, t… ▽ More

    Submitted 14 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  16. arXiv:2402.05004  [pdf, ps, other

    cs.IT

    Near-Optimal Generalized Decoding of Polar-like Codes

    Authors: Peihong Yuan, Ken R. Duffy, Muriel Médard

    Abstract: We present a framework that can exploit the tradeoff between the undetected error rate (UER) and block error rate (BLER) of polar-like codes. It is compatible with all successive cancellation (SC)-based decoding methods and relies on a novel approximation that we call codebook probability. This approximation is based on an auxiliary distribution that mimics the dynamics of decoding algorithms foll… ▽ More

    Submitted 2 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: being published at IEEE ISIT 2024

  17. arXiv:2401.10487  [pdf, other

    cs.IR cs.CL

    Generative Dense Retrieval: Memory Can Be a Burden

    Authors: Peiwen Yuan, Xinglin Wang, Shaoxiong Feng, Boyuan Pan, Yiwei Li, Heda Wang, Xupeng Miao, Kan Li

    Abstract: Generative Retrieval (GR), autoregressively decoding relevant document identifiers given a query, has been shown to perform well under the setting of small-scale corpora. By memorizing the document corpus with model parameters, GR implicitly achieves deep interaction between query and document. However, such a memorizing mechanism faces three drawbacks: (1) Poor memory accuracy for fine-grained fe… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: EACL 2024 main

    Journal ref: EACL 2024 main

  18. arXiv:2401.10480  [pdf, other

    cs.CL cs.AI

    Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning

    Authors: Yiwei Li, Peiwen Yuan, Shaoxiong Feng, Boyuan Pan, Xinglin Wang, Bin Sun, Heda Wang, Kan Li

    Abstract: Self-consistency (SC) has been a widely used decoding strategy for chain-of-thought reasoning. Despite bringing significant performance improvements across a variety of multi-step reasoning tasks, it is a high-cost method that requires multiple sampling with the preset size. In this paper, we propose a simple and scalable sampling process, \textbf{E}arly-Stopping \textbf{S}elf-\textbf{C}onsistency… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: ICLR 2024

  19. arXiv:2401.00437  [pdf, other

    cs.CL

    BatchEval: Towards Human-like Text Evaluation

    Authors: Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Boyuan Pan, Heda Wang, Kan Li

    Abstract: Significant progress has been made in automatic text evaluation with the introduction of large language models (LLMs) as evaluators. However, current sample-wise evaluation paradigm suffers from the following issues: (1) Sensitive to prompt design; (2) Poor resistance to noise; (3) Inferior ensemble performance with static reference. Inspired by the fact that humans treat both criterion definition… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: 19 pages, 9 figures

  20. arXiv:2312.12832  [pdf, other

    cs.CL cs.AI

    Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data

    Authors: Yiwei Li, Peiwen Yuan, Shaoxiong Feng, Boyuan Pan, Bin Sun, Xinglin Wang, Heda Wang, Kan Li

    Abstract: Large Language Models (LLMs) have performed well on various reasoning tasks, but their inaccessibility and numerous parameters hinder wide application in practice. One promising way is distilling the reasoning ability from LLMs to small models by the generated chain-of-thought reasoning paths. In some cases, however, LLMs may produce incorrect reasoning chains, especially when facing complex mathe… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  21. arXiv:2311.07091  [pdf, ps, other

    cs.IT

    Code-Aided Channel Estimation in LDPC-Coded MIMO Systems

    Authors: Binghui Shi, Yongpeng Wu, Peihong Yuan, Derrick Wing Kwan Ng, Xiang-Gen Xia, Wenjun Zhang

    Abstract: For a multiple-input multiple-output (MIMO) system with unknown channel state information (CSI), a novel low-density parity check (LDPC)-coded transmission (LCT) scheme with joint pilot and data channel estimation is proposed. To fine-tune the CSI, a method based on the constraints introduced by the coded data from an LDPC code is designed such that the MIMO detector exploits the fine-tuned CSI. F… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted by IEEE Wireless Communications Letters

  22. arXiv:2310.10737  [pdf, ps, other

    cs.IT

    Soft-output (SO) GRAND and Iterative Decoding to Outperform LDPCs

    Authors: Peihong Yuan, Muriel Medard, Kevin Galligan, Ken R. Duffy

    Abstract: We establish that a large, flexible class of long, high redundancy error correcting codes can be efficiently and accurately decoded with guessing random additive noise decoding (GRAND). Performance evaluation demonstrates that it is possible to construct simple concatenated codes that outperform low-density parity-check (LDPC) codes found in the 5G New Radio standard in both additive white Gaussia… ▽ More

    Submitted 17 June, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.05777

  23. arXiv:2307.12595  [pdf, ps, other

    cs.IT eess.SP

    Underlaid Sensing Pilot for Integrated Sensing and Communications

    Authors: Pu Yuan, Hao Liu, Junjie Tan, Dajie Jiang, Lei Yan

    Abstract: This paper investigates a novel underlaid sensing pilot signal design for integrated sensing and communications (ISAC) in an OFDM-based communication system. The proposed two-dimensional (2D) pilot signal is first generated on the delay-Doppler (DD) plane and then converted to the time-frequency (TF) plane for multiplexing with the OFDM data symbols. The sensing signal underlays the OFDM data, all… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 13 pages, 6 figures

  24. arXiv:2305.05777  [pdf, ps, other

    cs.IT

    Upgrade error detection to prediction with GRAND

    Authors: Kevin Galligan, Peihong Yuan, Muriel Médard, Ken R. Duffy

    Abstract: Guessing Random Additive Noise Decoding (GRAND) is a family of hard- and soft-detection error correction decoding algorithms that provide accurate decoding of any moderate redundancy code of any length. Here we establish a method through which any soft-input GRAND algorithm can provide soft output in the form of an accurate a posteriori estimate of the likelihood that a decoding is correct or, in… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Journal ref: 2023 IEEE Global Communications Conference (Globecom)

  25. arXiv:2302.11120  [pdf

    cs.RO eess.SY

    Soft Pneumatic Actuator Capable of Generating Various Bending and Extension Motions Inspired by an Elephant Trunk

    Authors: Peizheng Yuan, Hideyuki Tsukagoshi

    Abstract: Inspired by the dexterous handling ability of an elephant's trunk, we propose a pneumatic actuator that generates diverse bending and extension motions in a flexible arm. The actuator consists of two flexible tubes. Each flexible tube is restrained by a single string with variable length and tilt angle. Even if a single tube can perform only three simple types of motions (bending, extension, and h… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 8 pages, 11 figures, submitted to the IEEE Robotics and Automation Letters (RA-L)

  26. arXiv:2302.08740  [pdf, other

    cs.DS

    Query-Centered Temporal Community Search via Time-Constrained Personalized PageRank

    Authors: Longlong Lin, Pingpeng Yuan, Rong-Hua Li, Chunxue Zhu, Hongchao Qin, Hai Jin, Tao Jia

    Abstract: Existing temporal community search suffers from two defects: (i) they ignore the temporal proximity between the query vertex $q$ and other vertices but simply require the result to include $q$. Thus, they find many temporal irrelevant vertices (these vertices are called \emph{query-drifted vertices}) to $q$ for satisfying their cohesiveness, resulting in $q$ being marginalized; (ii) their methods… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  27. arXiv:2206.06350  [pdf, other

    cs.SI

    Significant Engagement Community Search on Temporal Networks: Concepts and Algorithms

    Authors: Yifei Zhang, Longlong Lin, Pingpeng Yuan, Hai Jin

    Abstract: Community search, retrieving the cohesive subgraph which contains the query vertex, has been widely touched over the past decades. The existing studies on community search mainly focus on static networks. However, real-world networks usually are temporal networks where each edge is associated with timestamps. The previous methods do not work when handling temporal networks. We study the problem of… ▽ More

    Submitted 14 June, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 22 pages, 26 figures

  28. arXiv:2206.01894  [pdf, other

    cs.IR

    Soft Retargeting Network for Click Through Rate Prediction

    Authors: Xiaochen Li, Xin Song, Pengjia Yuan, Xialong Liu, Yu Zhang

    Abstract: The study of user interest models has received a great deal of attention in click through rate (CTR) prediction recently. These models aim at capturing user interest from different perspectives, including user interest evolution, session interest, multiple interests, etc. In this paper, we focus on a new type of user interest, i.e., user retargeting interest. User retargeting interest is defined a… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: 5 pages

    ACM Class: H.3.3

  29. arXiv:2203.13552  [pdf, ps, other

    cs.IT

    On the Role of Quantization of Soft Information in GRAND

    Authors: Peihong Yuan, Ken R. Duffy, Evan P. Gabhart, Muriel Médard

    Abstract: In this work, we investigate guessing random additive noise decoding (GRAND) with quantized soft input. First, we analyze the achievable rate of ordered reliability bits GRAND (ORBGRAND), which uses the rank order of the reliability as quantized soft information. We show that multi-line ORBGRAND can approach capacity for any signal-to-noise ratio (SNR). We then introduce discretized soft GRAND (DS… ▽ More

    Submitted 24 November, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  30. arXiv:2203.00279  [pdf, ps, other

    cs.IT

    Compositional Inverses of AGW-PPs

    Authors: Pingzhi Yuan

    Abstract: In this paper, we present two methods to obtain the compositional inverses of AGW-PPs. We improve some known results in this topic.

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2004.12552 by other authors

  31. arXiv:2112.10735  [pdf, ps, other

    cs.IT

    Successive Cancellation Ordered Search Decoding of Modified $\boldsymbol{G}_N$-Coset Codes

    Authors: Peihong Yuan, Mustafa Cemil Coşkun

    Abstract: A tree search algorithm called successive cancellation ordered search (SCOS) is proposed for $\boldsymbol{G}_N$-coset codes that implements maximum-likelihood (ML) decoding with adaptive complexity for transmission over binary-input AWGN channels. Unlike bit-flip decoders, no outer code is needed to terminate decoding; therefore, SCOS also applies to $\boldsymbol{G}_N$-coset codes modified with dy… ▽ More

    Submitted 6 February, 2024; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: 13 pages, 9 figures, 4 tables. To appear in IEEE TCOM. arXiv admin note: text overlap with arXiv:2105.04048

  32. Multimodal Breast Lesion Classification Using Cross-Attention Deep Networks

    Authors: Hung Q. Vo, Pengyu Yuan, Tiancheng He, Stephen T. C. Wong, Hien V. Nguyen

    Abstract: Accurate breast lesion risk estimation can significantly reduce unnecessary biopsies and help doctors decide optimal treatment plans. Most existing computer-aided systems rely solely on mammogram features to classify breast lesions. While this approach is convenient, it does not fully exploit useful information in clinical reports to achieve the optimal performance. Would clinical features signifi… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

  33. arXiv:2105.04048  [pdf, ps, other

    cs.IT

    Complexity-Adaptive Maximum-Likelihood Decoding of Modified $\boldsymbol{G}_N$-Coset Codes

    Authors: Peihong Yuan, Mustafa Cemil Coşkun

    Abstract: A complexity-adaptive tree search algorithm is proposed for $\boldsymbol{G}_N$-coset codes that implements maximum-likelihood (ML) decoding by using a successive decoding schedule. The average complexity is close to that of the successive cancellation (SC) decoding for practical error rates when applied to polar codes and short Reed-Muller (RM) codes, e.g., block lengths up to $N=128$. By modifyin… ▽ More

    Submitted 2 September, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    Comments: Accepted for a presentation at ITW2021

  34. arXiv:2104.06960  [pdf, other

    cs.CL

    K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce

    Authors: Song Xu, Haoran Li, Peng Yuan, Yujia Wang, Youzheng Wu, Xiaodong He, Ying Liu, Bowen Zhou

    Abstract: Existing pre-trained language models (PLMs) have demonstrated the effectiveness of self-supervised learning for a broad range of natural language processing (NLP) tasks. However, most of them are not explicitly aware of domain-specific knowledge, which is essential for downstream tasks in many domains, such as tasks in e-commerce scenarios. In this paper, we propose K-PLUG, a knowledge-injected pr… ▽ More

    Submitted 27 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted by Findings of EMNLP 2021

  35. First arrival picking using U-net with Lovasz loss and nearest point picking method

    Authors: Pengyu Yuan, Wenyi Hu, Xuqing Wu, Jiefu Chen, Hien Van Nguyen

    Abstract: We proposed a robust segmentation and picking workflow to solve the first arrival picking problem for seismic signal processing. Unlike traditional classification algorithm, image segmentation method can utilize the location information by outputting a prediction map which has the same size of the input image. A parameter-free nearest point picking algorithm is proposed to further improve the accu… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  36. arXiv:2103.15289  [pdf

    cs.CR

    Dynamic Binary Translation for SGX Enclaves

    Authors: Jinhua Cui, Shweta Shinde, Satyaki Sen, Prateek Saxena, Pinghai Yuan

    Abstract: Enclaves, such as those enabled by Intel SGX, offer a hardware primitive for shielding user-level applications from the OS. While enclaves are a useful starting point, code running in the enclave requires additional checks whenever control or data is transferred to/from the untrusted OS. The enclave-OS interface on SGX, however, can be extremely large if we wish to run existing unmodified binaries… ▽ More

    Submitted 28 March, 2021; originally announced March 2021.

    Comments: 24 pages, 11 figures, 10 tables. arXiv admin note: substantial text overlap with arXiv:2009.01144

  37. arXiv:2102.10719  [pdf, other

    cs.IT

    Polar-Coded Non-Coherent Communication

    Authors: Peihong Yuan, Mustafa Cemil Coşkun, Gerhard Kramer

    Abstract: A polar-coded transmission (PCT) scheme with joint channel estimation and decoding is proposed for channels with unknown channel state information (CSI). The CSI is estimated via successive cancellation (SC) decoding and the constraints imposed by the frozen bits. SC list decoding with an outer code improves performance, including resolving a phase ambiguity when using quadrature phase-shift keyin… ▽ More

    Submitted 21 February, 2021; originally announced February 2021.

    Comments: Accepted for publication in IEEE Communications Letters

  38. arXiv:2012.05400  [pdf, other

    cs.CV

    A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data

    Authors: Xianfeng Li, Weijie Chen, Di Xie, Shicai Yang, Peng Yuan, Shiliang Pu, Yueting Zhuang

    Abstract: Unsupervised domain adaptation (UDA) assumes that source and target domain data are freely available and usually trained together to reduce the domain gap. However, considering the data privacy and the inefficiency of data transmission, it is impractical in real scenarios. Hence, it draws our eyes to optimize the network in the target domain without accessing labeled source data. To explore this d… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: accepted by AAAI2021

  39. arXiv:2010.07621  [pdf, other

    cs.CV

    HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network

    Authors: Pengcheng Yuan, Shufei Lin, Cheng Cui, Yuning Du, Ruoyu Guo, Dongliang He, Errui Ding, Shumin Han

    Abstract: This paper addresses representational block named Hierarchical-Split Block, which can be taken as a plug-and-play block to upgrade existing convolutional neural networks, improves model performance significantly in a network. Hierarchical-Split Block contains many hierarchical split and concatenate connections within one single residual block. We find multi-scale features is of great importance fo… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  40. arXiv:2009.01144  [pdf, other

    cs.CR

    Binary Compatibility For SGX Enclaves

    Authors: Shweta Shinde, Jinhua Cui, Satyaki Sen, Pinghai Yuan, Prateek Saxena

    Abstract: Enclaves, such as those enabled by Intel SGX, offer a powerful hardware isolation primitive for application partitioning. To become universally usable on future commodity OSes, enclave designs should offer compatibility with existing software. In this paper, we draw attention to 5 design decisions in SGX that create incompatibility with existing software. These represent concrete starting points,… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

  41. arXiv:2007.05343  [pdf, other

    cs.CV

    DECAPS: Detail-Oriented Capsule Networks

    Authors: Aryan Mobiny, Pengyu Yuan, Pietro Antonio Cicalese, Hien Van Nguyen

    Abstract: Capsule Networks (CapsNets) have demonstrated to be a promising alternative to Convolutional Neural Networks (CNNs). However, they often fall short of state-of-the-art accuracies on large-scale high-dimensional datasets. We propose a Detail-Oriented Capsule Network (DECAPS) that combines the strength of CapsNets with several novel techniques to boost its classification accuracies. First, DECAPS us… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: text overlap with arXiv:2004.07407

  42. arXiv:2007.05009  [pdf, other

    cs.CV

    Few Is Enough: Task-Augmented Active Meta-Learning for Brain Cell Classification

    Authors: Pengyu Yuan, Aryan Mobiny, Jahandar Jahanipour, Xiaoyang Li, Pietro Antonio Cicalese, Badrinath Roysam, Vishal Patel, Maric Dragan, Hien Van Nguyen

    Abstract: Deep Neural Networks (or DNNs) must constantly cope with distribution changes in the input data when the task of interest or the data collection protocol changes. Retraining a network from scratch to combat this issue poses a significant cost. Meta-learning aims to deliver an adaptive model that is sensitive to these underlying distribution changes, but requires many tasks during the meta-training… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

  43. arXiv:2007.05008  [pdf, other

    cs.CV eess.IV

    StyPath: Style-Transfer Data Augmentation For Robust Histology Image Classification

    Authors: Pietro Antonio Cicalese, Aryan Mobiny, Pengyu Yuan, Jan Becker, Chandra Mohan, Hien Van Nguyen

    Abstract: The classification of Antibody Mediated Rejection (AMR) in kidney transplant remains challenging even for experienced nephropathologists; this is partly because histological tissue stain analysis is often characterized by low inter-observer agreement and poor reproducibility. One of the implicated causes for inter-observer disagreement is the variability of tissue stain quality between (and within… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

  44. The Role of the Hercules Autonomous Vehicle During the COVID-19 Pandemic: An Autonomous Logistic Vehicle for Contactless Goods Transportation

    Authors: Tianyu Liu, Qinghai Liao, Lu Gan, Fulong Ma, Jie Cheng, Xupeng Xie, Zhe Wang, Yingbing Chen, Yilong Zhu, Shuyang Zhang, Zhengyong Chen, Yang Liu, Meng Xie, Yang Yu, Zitong Guo, Guang Li, Peidong Yuan, Dong Han, Yuying Chen, Haoyang Ye, Jianhao Jiao, Peng Yun, Zhenhua Xu, Hengli Wang, Huaiyang Huang , et al. (6 additional authors not shown)

    Abstract: Since early 2020, the coronavirus disease 2019 (COVID-19) has spread rapidly across the world. As at the date of writing this article, the disease has been globally reported in 223 countries and regions, infected over 108 million people and caused over 2.4 million deaths (https://covid19.who.int/, accessed on Feb. 17, 2021). Avoiding person-to-person transmission is an effective approach to contro… ▽ More

    Submitted 16 February, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Journal ref: IEEE Robotics and Automation Magazine, 2021

  45. arXiv:2004.07407  [pdf, other

    eess.IV cs.CV cs.LG

    Radiologist-Level COVID-19 Detection Using CT Scans with Detail-Oriented Capsule Networks

    Authors: Aryan Mobiny, Pietro Antonio Cicalese, Samira Zare, Pengyu Yuan, Mohammadsajad Abavisani, Carol C. Wu, Jitesh Ahuja, Patricia M. de Groot, Hien Van Nguyen

    Abstract: Radiographic images offer an alternative method for the rapid screening and monitoring of Coronavirus Disease 2019 (COVID-19) patients. This approach is limited by the shortage of radiology experts who can provide a timely interpretation of these images. Motivated by this challenge, our paper proposes a novel learning architecture, called Detail-Oriented Capsule Networks (DECAPS), for the automati… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

  46. arXiv:2002.03109  [pdf, other

    cs.DC cs.PF

    Performance Modeling and Analysis of a Hyperledger-based System Using GSPN

    Authors: Pu Yuan, Kan Zheng, Xiong Xiong, Kuan Zhang, Lei Lei

    Abstract: As a highly scalable permissioned blockchain platform, Hyperledger Fabric supports a wide range of industry use cases ranging from governance to finance. In this paper, we propose a model to analyze the performance of a Hyperledgerbased system by using Generalised Stochastic Petri Nets (GSPN). This model decomposes a transaction flow into multiple phases and provides a simulation-based approach to… ▽ More

    Submitted 8 February, 2020; originally announced February 2020.

  47. arXiv:1911.01102  [pdf, other

    cs.CL cs.NE cs.SD eess.AS

    What does a network layer hear? Analyzing hidden representations of end-to-end ASR through speech synthesis

    Authors: Chung-Yi Li, Pei-Chieh Yuan, Hung-Yi Lee

    Abstract: End-to-end speech recognition systems have achieved competitive results compared to traditional systems. However, the complex transformations involved between layers given highly variable acoustic signals are hard to analyze. In this paper, we present our ASR probing model, which synthesizes speech from hidden representations of end-to-end ASR to examine the information maintain after each layer c… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

    Comments: submitted to ICASSP 2020

  48. arXiv:1907.08468  [pdf, ps, other

    cs.IT

    Shaped On-Off Keying Using Polar Codes

    Authors: Thomas Wiegart, Fabian Steiner, Patrick Schulte, Peihong Yuan

    Abstract: The probabilistic shaping scheme from Honda and Yamamoto (2013) for polar codes is used to enable power-efficient signaling for on-off keying (OOK). As OOK has a non-symmetric optimal input distribution, shaping approaches that are based on the concatenation of a distribution matcher followed by systematic encoding do not result in optimal signaling. Instead, these approaches represent a time shar… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: accepted for publication in IEEE Communications Letters

  49. arXiv:1907.05568  [pdf, other

    cs.DS

    A Quantum-inspired Classical Algorithm for Separable Non-negative Matrix Factorization

    Authors: Zhihuai Chen, Yinan Li, Xiaoming Sun, Pei Yuan, Jialin Zhang

    Abstract: Non-negative Matrix Factorization (NMF) asks to decompose a (entry-wise) non-negative matrix into the product of two smaller-sized nonnegative matrices, which has been shown intractable in general. In order to overcome this issue, the separability assumption is introduced which assumes all data points are in a conical hull. This assumption makes NMF tractable and is widely used in text analysis an… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

  50. arXiv:1905.04153  [pdf, other

    cs.CV cs.CG cs.GR

    DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

    Authors: Weixin Lu, Guowei Wan, Yao Zhou, Xiangyu Fu, Pengfei Yuan, Shiyu Song

    Abstract: We present DeepICP - a novel end-to-end learning-based 3D point cloud registration framework that achieves comparable registration accuracy to prior state-of-the-art geometric methods. Different from other keypoint based methods where a RANSAC procedure is usually needed, we implement the use of various deep neural network structures to establish an end-to-end trainable network. Our keypoint detec… ▽ More

    Submitted 16 September, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: 10 pages, 6 figures, 3 tables, typos corrected, experimental results updated, accepted by ICCV 2019

    Journal ref: The IEEE International Conference on Computer Vision (ICCV), 2019, pp. 12-21