Zum Hauptinhalt springen

Showing 101–150 of 233 results for author: Qin, B

.
  1. arXiv:2212.08322  [pdf, other

    cs.AI cs.CL

    ReCo: Reliable Causal Chain Reasoning via Structural Causal Recurrent Neural Networks

    Authors: Kai Xiong, Xiao Ding, Zhongyang Li, Li Du, Bing Qin, Yi Zheng, Baoxing Huai

    Abstract: Causal chain reasoning (CCR) is an essential ability for many decision-making AI systems, which requires the model to build reliable causal chains by connecting causal pairs. However, CCR suffers from two main transitive problems: threshold effect and scene drift. In other words, the causal pairs to be spliced may have a conflicting threshold boundary or scenario. To address these issues, we propo… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: Accepted by EMNLP 2022

  2. arXiv:2212.08307  [pdf, other

    cs.CL

    Controllable Text Generation via Probability Density Estimation in the Latent Space

    Authors: Yuxuan Gu, Xiaocheng Feng, Sicheng Ma, Lingyuan Zhang, Heng Gong, Weihong Zhong, Bing Qin

    Abstract: Previous work on controllable text generation has explored the idea of control from the latent space, such as optimizing a representation with attribute-related classifiers or sampling a representation from relevant discrete samples. However, they are not effective enough in modeling both the latent space and the control, leaving controlled text with low quality and diversity. In this work, we pro… ▽ More

    Submitted 24 May, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: 25 pages, 9 figures, Accepted to ACL2023

  3. arXiv:2212.02995  [pdf, other

    cs.CL

    Knowledge-Bridged Causal Interaction Network for Causal Emotion Entailment

    Authors: Weixiang Zhao, Yanyan Zhao, Zhuojun Li, Bing Qin

    Abstract: Causal Emotion Entailment aims to identify causal utterances that are responsible for the target utterance with a non-neutral emotion in conversations. Previous works are limited in thorough understanding of the conversational context and accurate reasoning of the emotion cause. To this end, we propose Knowledge-Bridged Causal Interaction Network (KBCIN) with commonsense knowledge (CSK) leveraged… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Accepted by AAAI 2023

  4. arXiv:2212.01543  [pdf, other

    cs.CL

    The RoyalFlush System for the WMT 2022 Efficiency Task

    Authors: Bo Qin, Aixin Jia, Qiang Wang, Jianning Lu, Shuqin Pan, Haibo Wang, Ming Chen

    Abstract: This paper describes the submission of the RoyalFlush neural machine translation system for the WMT 2022 translation efficiency task. Unlike the commonly used autoregressive translation system, we adopted a two-stage translation paradigm called Hybrid Regression Translation (HRT) to combine the advantages of autoregressive and non-autoregressive translation. Specifically, HRT first autoregressivel… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted by WMT 2022. arXiv admin note: text overlap with arXiv:2210.10416

  5. arXiv:2211.16368  [pdf, other

    cs.LG

    DBA: Efficient Transformer with Dynamic Bilinear Low-Rank Attention

    Authors: Bosheng Qin, Juncheng Li, Siliang Tang, Yueting Zhuang

    Abstract: Many studies have been conducted to improve the efficiency of Transformer from quadric to linear. Among them, the low-rank-based methods aim to learn the projection matrices to compress the sequence length. However, the projection matrices are fixed once they have been learned, which compress sequence length with dedicated coefficients for tokens in the same position. Adopting such input-invariant… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 19 pages, 4 figures

  6. BDTS: Blockchain-based Data Trading System

    Authors: Erya Jiang, Bo Qin, Qin Wang, Qianhong Wu, Sanxi Li, Wenchang Shi, Yingxin Bi, Wenyi Tang

    Abstract: Trading data through blockchain platforms is hard to achieve \textit{fair exchange}. Reasons come from two folds: Firstly, guaranteeing fairness between sellers and consumers is a challenging task as the deception of any participating parties is risk-free. This leads to the second issue where judging the behavior of data executors (such as cloud service providers) among distrustful parties is impr… ▽ More

    Submitted 31 October, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: ICICS 2023 (Best Paper Award)

    Journal ref: International Conference on Information and Communications Security, pp. 645-664. Singapore: Springer Nature Singapore, 2023

  7. arXiv:2211.03612  [pdf

    cs.AI

    BigCilin: An Automatic Chinese Open-domain Knowledge Graph with Fine-grained Hypernym-Hyponym Relations

    Authors: Ming Liu, Yaojia LV, Jingrun Zhang, Ruiji Fu, Bing Qin

    Abstract: This paper presents BigCilin, the first Chinese open-domain knowledge graph with fine-grained hypernym-hyponym re-lations which are extracted automatically from multiple sources for Chinese named entities. With the fine-grained hypernym-hyponym relations, BigCilin owns flexible semantic hierarchical structure. Since the hypernym-hyponym paths are automati-cally generated and one entity may have se… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 5 pages, 3 figures

  8. arXiv:2211.00732  [pdf, other

    cs.IR cs.AI cs.CL

    Kuaipedia: a Large-scale Multi-modal Short-video Encyclopedia

    Authors: Haojie Pan, Zepeng Zhai, Yuzhou Zhang, Ruiji Fu, Ming Liu, Yangqiu Song, Zhongyuan Wang, Bing Qin

    Abstract: Online encyclopedias, such as Wikipedia, have been well-developed and researched in the last two decades. One can find any attributes or other information of a wiki item on a wiki page edited by a community of volunteers. However, the traditional text, images and tables can hardly express some aspects of an wiki item. For example, when we talk about ``Shiba Inu'', one may care more about ``How to… ▽ More

    Submitted 11 August, 2023; v1 submitted 28 October, 2022; originally announced November 2022.

  9. arXiv:2210.03884  [pdf, other

    cs.CL

    Don't Lose Yourself! Empathetic Response Generation via Explicit Self-Other Awareness

    Authors: Weixiang Zhao, Yanyan Zhao, Xin Lu, Bing Qin

    Abstract: As a critical step to achieve human-like chatbots, empathetic response generation has attained increasing interests. Previous attempts are incomplete and not sufficient enough to elicit empathy because they only focus on the initial aspect of empathy to automatically mimic the feelings and thoughts of the user via other-awareness. However, they ignore to maintain and take the own views of the syst… ▽ More

    Submitted 5 May, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of ACL 2023

  10. arXiv:2210.02889  [pdf, other

    cs.CL

    A Distributional Lens for Multi-Aspect Controllable Text Generation

    Authors: Yuxuan Gu, Xiaocheng Feng, Sicheng Ma, Lingyuan Zhang, Heng Gong, Bing Qin

    Abstract: Multi-aspect controllable text generation is a more challenging and practical task than single-aspect control. Existing methods achieve complex multi-aspect control by fusing multiple controllers learned from single-aspect, but suffer from attribute degeneration caused by the mutual interference of these controllers. To address this, we provide observations on attribute fusion from a distributiona… ▽ More

    Submitted 19 October, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: 21pages, 21figures, EMNLP2022

  11. arXiv:2209.09768  [pdf, other

    cs.CL

    An Efficient End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition

    Authors: Yang Wu, Pai Peng, Zhenyu Zhang, Yanyan Zhao, Bing Qin

    Abstract: Recent works on multi-modal emotion recognition move towards end-to-end models, which can extract the task-specific features supervised by the target task compared with the two-phase pipeline. However, previous methods only model the feature interactions between the textual and either acoustic and visual modalities, ignoring capturing the feature interactions between the acoustic and visual modali… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  12. Mixing in chaotic flows with swimming bacteria

    Authors: Ranjiangshang Ran, Quentin Brosseau, Brendan C. Blackwell, Boyang Qin, Rebecca L. Winter, Paulo E. Arratia

    Abstract: This is a manuscript accepted for publication on Physical Review Fluids, Gallery of Fluid Motion special issue. The manuscript is associated with a poster winner of the 39th Annual Gallery of Fluid Motion Award, for work presented at the 74th Annual Meeting of the American Physical Society's Division of Fluid Dynamics (Phoenix, AZ, USA 2021).

    Submitted 24 August, 2022; originally announced September 2022.

    Comments: This is a manuscript accepted for publication on Physical Review Fluids, Gallery of Fluid Motion special issue

    Journal ref: Phys.Rev.Fluids 7 (2022) 110511

  13. arXiv:2209.06453  [pdf, other

    cs.CL

    Prompt Combines Paraphrase: Teaching Pre-trained Models to Understand Rare Biomedical Words

    Authors: Haochun Wang, Chi Liu, Nuwa Xi, Sendong Zhao, Meizhi Ju, Shiwei Zhang, Ziheng Zhang, Yefeng Zheng, Bing Qin, Ting Liu

    Abstract: Prompt-based fine-tuning for pre-trained models has proven effective for many natural language processing tasks under few-shot settings in general domain. However, tuning with prompt in biomedical domain has not been investigated thoroughly. Biomedical words are often rare in general domain, but quite ubiquitous in biomedical contexts, which dramatically deteriorates the performance of pre-trained… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022

  14. arXiv:2209.06442  [pdf, other

    cs.CL

    SUN: Exploring Intrinsic Uncertainties in Text-to-SQL Parsers

    Authors: Bowen Qin, Lihan Wang, Binyuan Hui, Bowen Li, Xiangpeng Wei, Binhua Li, Fei Huang, Luo Si, Min Yang, Yongbin Li

    Abstract: This paper aims to improve the performance of text-to-SQL parsing by exploring the intrinsic uncertainties in the neural network based approaches (called SUN). From the data uncertainty perspective, it is indisputable that a single SQL can be learned from multiple semantically-equivalent questions.Different from previous methods that are limited to one-to-one mapping, we propose a data uncertainty… ▽ More

    Submitted 28 October, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted at COLING 2022

  15. arXiv:2209.02276  [pdf, other

    cs.CL cs.AI

    Zero-shot Aspect-level Sentiment Classification via Explicit Utilization of Aspect-to-Document Sentiment Composition

    Authors: Pengfei Deng, Jianhua Yuan, Yanyan Zhao, Bing Qin

    Abstract: As aspect-level sentiment labels are expensive and labor-intensive to acquire, zero-shot aspect-level sentiment classification is proposed to learn classifiers applicable to new domains without using any annotated aspect-level data. In contrast, document-level sentiment data with ratings are more easily accessible. In this work, we achieve zero-shot aspect-level sentiment classification by only us… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  16. arXiv:2208.13629  [pdf, other

    cs.CL

    A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions

    Authors: Bowen Qin, Binyuan Hui, Lihan Wang, Min Yang, Jinyang Li, Binhua Li, Ruiying Geng, Rongyu Cao, Jian Sun, Luo Si, Fei Huang, Yongbin Li

    Abstract: Text-to-SQL parsing is an essential and challenging task. The goal of text-to-SQL parsing is to convert a natural language (NL) question to its corresponding structured query language (SQL) based on the evidences provided by relational databases. Early text-to-SQL parsing systems from the database community achieved a noticeable progress with the cost of heavy human engineering and user interactio… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  17. arXiv:2208.09884  [pdf, other

    cs.LG cs.AI cs.CV

    DiscrimLoss: A Universal Loss for Hard Samples and Incorrect Samples Discrimination

    Authors: Tingting Wu, Xiao Ding, Hao Zhang, Jinglong Gao, Li Du, Bing Qin, Ting Liu

    Abstract: Given data with label noise (i.e., incorrect data), deep neural networks would gradually memorize the label noise and impair model performance. To relieve this issue, curriculum learning is proposed to improve model performance and generalization by ordering training samples in a meaningful (e.g., easy to hard) sequence. Previous work takes incorrect samples as generic hard ones without discrimina… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

  18. arXiv:2207.13005  [pdf, other

    cs.CL

    Hansel: A Chinese Few-Shot and Zero-Shot Entity Linking Benchmark

    Authors: Zhenran Xu, Zifei Shan, Yuxin Li, Baotian Hu, Bing Qin

    Abstract: Modern Entity Linking (EL) systems entrench a popularity bias, yet there is no dataset focusing on tail and emerging entities in languages other than English. We present Hansel, a new benchmark in Chinese that fills the vacancy of non-English few-shot and zero-shot EL challenges. The test set of Hansel is human annotated and reviewed, created with a novel method for collecting zero-shot EL dataset… ▽ More

    Submitted 29 October, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: WSDM 2023

  19. arXiv:2207.01528  [pdf, other

    cs.CL

    VEM$^2$L: A Plug-and-play Framework for Fusing Text and Structure Knowledge on Sparse Knowledge Graph Completion

    Authors: Tao He, Ming Liu, Yixin Cao, Tianwen Jiang, Zihao Zheng, Jingrun Zhang, Sendong Zhao, Bing Qin

    Abstract: Knowledge Graph Completion (KGC) aims to reason over known facts and infer missing links but achieves weak performances on those sparse Knowledge Graphs (KGs). Recent works introduce text information as auxiliary features or apply graph densification to alleviate this challenge, but suffer from problems of ineffectively incorporating structure features and injecting noisy triples. In this paper, w… ▽ More

    Submitted 15 August, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: 12 pages, 5 figures

  20. arXiv:2206.14017  [pdf, other

    cs.CL

    Proton: Probing Schema Linking Information from Pre-trained Language Models for Text-to-SQL Parsing

    Authors: Lihan Wang, Bowen Qin, Binyuan Hui, Bowen Li, Min Yang, Bailin Wang, Binhua Li, Fei Huang, Luo Si, Yongbin Li

    Abstract: The importance of building text-to-SQL parsers which can be applied to new databases has long been acknowledged, and a critical step to achieve this goal is schema linking, i.e., properly recognizing mentions of unseen columns or tables when generating SQLs. In this work, we propose a novel framework to elicit relational structures from large-scale pre-trained language models (PLMs) via a probing… ▽ More

    Submitted 6 August, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: Accepted at KDD 2022

  21. arXiv:2206.13969  [pdf, other

    cs.CL

    MACSA: A Multimodal Aspect-Category Sentiment Analysis Dataset with Multimodal Fine-grained Aligned Annotations

    Authors: Hao Yang, Yanyan Zhao, Jianwei Liu, Yang Wu, Bing Qin

    Abstract: Multimodal fine-grained sentiment analysis has recently attracted increasing attention due to its broad applications. However, the existing multimodal fine-grained sentiment datasets most focus on annotating the fine-grained elements in text but ignore those in images, which leads to the fine-grained elements in visual content not receiving the full attention they deserve. In this paper, we propos… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  22. arXiv:2206.12742  [pdf, other

    eess.SY

    A Planning-free Longitudinal Controller Design for Vehicles in Dynamic Traffic Environments

    Authors: Wubing B. Qin

    Abstract: This paper investigates the longitudinal control problem in a dynamic traffic environment where driving scenarios change between free-driving scenarios and car-following scenarios. A comprehensive longitudinal controller is proposed to ensure reasonable transient response and steady-state response in scenarios changes, which is independent of planning algorithms. This design takes into account pas… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: 11 pages, 8 figures, 1 table

  23. arXiv:2206.04592  [pdf, other

    eess.SY

    Representing Lanes as Arc-length-based Parametric Curves to Facilitate Estimation in Vehicle Control

    Authors: Wubing B. Qin

    Abstract: This paper revisits the fundamental mathematics of Taylor series to approximate curves with function representation and arc-length-based parametric representation. Parametric representation is shown to preserve its form in coordinate transformation and parameter shifting. These preservations can significantly facilitate lane estimation in vehicle control since lanes perceived by cameras are typica… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 14 pages, 8 figures, currently submitted and under review

  24. arXiv:2205.12593  [pdf, other

    cs.CL

    Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

    Authors: Yanrui Du, Jing Yan, Yan Chen, Jing Liu, Sendong Zhao, Qiaoqiao She, Hua Wu, Haifeng Wang, Bing Qin

    Abstract: Recent research has revealed that deep neural networks often take dataset biases as a shortcut to make decisions rather than understand tasks, leading to failures in real-world applications. In this study, we focus on the spurious correlation between word features and labels that models learn from the biased data distribution of training data. In particular, we define the word highly co-occurring… ▽ More

    Submitted 22 June, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  25. arXiv:2205.10822  [pdf, other

    cs.CL cs.AI

    A Graph Enhanced BERT Model for Event Prediction

    Authors: Li Du, Xiao Ding, Yue Zhang, Kai Xiong, Ting Liu, Bing Qin

    Abstract: Predicting the subsequent event for an existing event context is an important but challenging task, as it requires understanding the underlying relationship between events. Previous methods propose to retrieve relational features from event graph to enhance the modeling of event correlation. However, the sparsity of event graph may restrict the acquisition of relevant graph information, and hence… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

  26. arXiv:2205.07762  [pdf, other

    eess.SY

    A Nonlinear Lateral Controller Design for Vehicle Path-following with an Arbitrary Sensor Location

    Authors: Wubing B. Qin, Zhaojian Li

    Abstract: This paper investigates the lateral control problem in vehicular path-following when the feedback sensor(s) are mounted at an arbitrary location in the longitudinal symmetric axis. We point out that some existing literature has abused the kinematic bicycle model describing the motion of rear axle center for other locations, which may lead to poor performance in practical implementations. A new non… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 11 pages, 9 figures, 1 table, submitted to IEEE Transactions on Intelligent Vehicles

  27. arXiv:2205.05849  [pdf, other

    cs.AI cs.CL

    e-CARE: a New Dataset for Exploring Explainable Causal Reasoning

    Authors: Li Du, Xiao Ding, Kai Xiong, Ting Liu, Bing Qin

    Abstract: Understanding causality has vital importance for various Natural Language Processing (NLP) applications. Beyond the labeled instances, conceptual explanations of the causality can provide deep understanding of the causal facts to facilitate the causal reasoning process. However, such explanation information still remains absent in existing causal reasoning resources. In this paper, we fill this ga… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  28. A Nonlinear Car-following Controller Design Inspired By Human-driving Behaviors to Increase Comfort and Enhance Safety

    Authors: Wubing B. Qin

    Abstract: This paper investigates the car-following problem and proposes a nonlinear controller that considers driving comfort, safety concerns, steady-state response and transient response. This controller is designed based on the demands of lower cost, faster response, increased comfort, enhanced safety and elevated extendability from the automotive industry. Design insights and intuitions are provided in… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 13 pages, 10 figures, submitted to IEEE Transactions on Vehicular Technology

    Journal ref: TVT.2022.3175746

  29. arXiv:2205.01620  [pdf, other

    cs.CL

    Unifying the Convergences in Multilingual Neural Machine Translation

    Authors: Yichong Huang, Xiaocheng Feng, Xinwei Geng, Bing Qin

    Abstract: Although all-in-one-model multilingual neural machine translation (multilingual NMT) has achieved remarkable progress, the convergence inconsistency in the joint training is ignored, i.e., different language pairs reaching convergence in different epochs. This leads to the trained MNMT model over-fitting low-resource language translations while under-fitting high-resource ones. In this paper, we p… ▽ More

    Submitted 19 October, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: EMNLP2022

  30. arXiv:2204.10105  [pdf, other

    cs.CV cs.AI cs.LG physics.med-ph

    Working memory inspired hierarchical video decomposition with transformative representations

    Authors: Binjie Qin, Haohao Mao, Ruipeng Zhang, Yueqi Zhu, Song Ding, Xu Chen

    Abstract: Video decomposition is very important to extract moving foreground objects from complex backgrounds in computer vision, machine learning, and medical imaging, e.g., extracting moving contrast-filled vessels from the complex and noisy backgrounds of X-ray coronary angiography (XCA). However, the challenges caused by dynamic backgrounds, overlapping heterogeneous environments and complex noises stil… ▽ More

    Submitted 5 May, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

  31. arXiv:2204.08466  [pdf, other

    eess.IV cs.AI cs.CV physics.med-ph

    Robust PCA Unrolling Network for Super-resolution Vessel Extraction in X-ray Coronary Angiography

    Authors: Binjie Qin, Haohao Mao, Yiming Liu, Jun Zhao, Yisong Lv, Yueqi Zhu, Song Ding, Xu Chen

    Abstract: Although robust PCA has been increasingly adopted to extract vessels from X-ray coronary angiography (XCA) images, challenging problems such as inefficient vessel-sparsity modelling, noisy and dynamic background artefacts, and high computational cost still remain unsolved. Therefore, we propose a novel robust PCA unrolling network with sparse feature selection for super-resolution XCA vessel imagi… ▽ More

    Submitted 23 April, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

  32. arXiv:2203.16373  [pdf, other

    cs.LG

    Slow-varying Dynamics Assisted Temporal Capsule Network for Machinery Remaining Useful Life Estimation

    Authors: Yan Qin, Chau Yuen, Yimin Shao, Bo Qin, Xiaoli Li

    Abstract: Capsule network (CapsNet) acts as a promising alternative to the typical convolutional neural network, which is the dominant network to develop the remaining useful life (RUL) estimation models for mechanical equipment. Although CapsNet comes with an impressive ability to represent the entities' hierarchical relationships through a high-dimensional vector embedding, it fails to capture the long-te… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: This paper has been accepted by IEEE Transactions on Cybernetics

  33. arXiv:2203.06958  [pdf, other

    cs.CL

    S$^2$SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers

    Authors: Binyuan Hui, Ruiying Geng, Lihan Wang, Bowen Qin, Bowen Li, Jian Sun, Yongbin Li

    Abstract: The task of converting a natural language question into an executable SQL query, known as text-to-SQL, is an important branch of semantic parsing. The state-of-the-art graph-based encoder has been successfully used in this task but does not model the question syntax well. In this paper, we propose S$^2$SQL, injecting Syntax to question-Schema graph encoder for Text-to-SQL parsers, which effectivel… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Findings

  34. arXiv:2203.00257  [pdf, other

    cs.CL

    Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors

    Authors: Yang Wu, Yanyan Zhao, Hao Yang, Song Chen, Bing Qin, Xiaohuan Cao, Wenting Zhao

    Abstract: Multimodal sentiment analysis has attracted increasing attention and lots of models have been proposed. However, the performance of the state-of-the-art models decreases sharply when they are deployed in the real world. We find that the main reason is that real-world applications can only access the text outputs by the automatic speech recognition (ASR) models, which may be with errors because of… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: Findings of ACL 2022

  35. arXiv:2202.12142  [pdf, other

    cs.CL

    Pretraining without Wordpieces: Learning Over a Vocabulary of Millions of Words

    Authors: Zhangyin Feng, Duyu Tang, Cong Zhou, Junwei Liao, Shuangzhi Wu, Xiaocheng Feng, Bing Qin, Yunbo Cao, Shuming Shi

    Abstract: The standard BERT adopts subword-based tokenization, which may break a word into two or more wordpieces (e.g., converting "lossless" to "loss" and "less"). This will bring inconvenience in following situations: (1) what is the best way to obtain the contextual vector of a word that is divided into multiple wordpieces? (2) how to predict a word via cloze test without knowing the number of wordpiece… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  36. arXiv:2202.08576  [pdf, other

    cs.CR

    Local Differential Privacy for Belief Functions

    Authors: Qiyu Li, Chunlai Zhou, Biao Qin, Zhiqiang Xu

    Abstract: In this paper, we propose two new definitions of local differential privacy for belief functions. One is based on Shafer's semantics of randomly coded messages and the other from the perspective of imprecise probabilities. We show that such basic properties as composition and post-processing also hold for our new definitions. Moreover, we provide a hypothesis testing framework for these definition… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  37. arXiv:2112.08723  [pdf, other

    cs.CL cs.CV

    Distilled Dual-Encoder Model for Vision-Language Understanding

    Authors: Zekun Wang, Wenhui Wang, Haichao Zhu, Ming Liu, Bing Qin, Furu Wei

    Abstract: We propose a cross-modal attention distillation framework to train a dual-encoder model for vision-language understanding tasks, such as visual reasoning and visual question answering. Dual-encoder models have a faster inference speed than fusion-encoder models and enable the pre-computation of images and text during inference. However, the shallow interaction module used in dual-encoder models is… ▽ More

    Submitted 17 October, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  38. arXiv:2112.03603  [pdf, other

    cs.CV cs.AI cs.LG

    Handwritten Mathematical Expression Recognition via Attention Aggregation based Bi-directional Mutual Learning

    Authors: Xiaohang Bian, Bo Qin, Xiaozhe Xin, Jianwu Li, Xuefeng Su, Yanfeng Wang

    Abstract: Handwritten mathematical expression recognition aims to automatically generate LaTeX sequences from given images. Currently, attention-based encoder-decoder models are widely used in this task. They typically generate target sequences in a left-to-right (L2R) manner, leaving the right-to-left (R2L) contexts unexploited. In this paper, we propose an Attention aggregation based Bi-directional Mutual… ▽ More

    Submitted 23 February, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: 9 pages,5 figures, have been accepted in AAAI 2022 Oral

    Journal ref: AAAI 2022

  39. arXiv:2111.10946  [pdf, other

    cs.RO

    A General Framework for Lifelong Localization and Mapping in Changing Environment

    Authors: Min Zhao, Xin Guo, Le Song, Baoxing Qin, Xuesong Shi, Gim Hee Lee, Guanghui Sun

    Abstract: The environment of most real-world scenarios such as malls and supermarkets changes at all times. A pre-built map that does not account for these changes becomes out-of-date easily. Therefore, it is necessary to have an up-to-date model of the environment to facilitate long-term operation of a robot. To this end, this paper presents a general lifelong simultaneous localization and mapping (SLAM) f… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

  40. arXiv:2111.09486  [pdf, other

    cs.CL

    Linking-Enhanced Pre-Training for Table Semantic Parsing

    Authors: Bowen Qin, Lihan Wang, Binyuan Hui, Ruiying Geng, Zheng Cao, Min Yang, Jian Sun, Yongbin Li

    Abstract: Recently pre-training models have significantly improved the performance of various NLP tasks by leveraging large-scale text corpora to improve the contextual representation ability of the neural network. The large pre-training language model has also been applied in the area of table semantic parsing. However, existing pre-training approaches have not carefully explored explicit interaction relat… ▽ More

    Submitted 14 February, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

  41. GEDIT: Geographic-Enhanced and Dependency-Guided Tagging for Joint POI and Accessibility Extraction at Baidu Maps

    Authors: Yibo Sun, Jizhou Huang, Chunyuan Yuan, Miao Fan, Haifeng Wang, Ming Liu, Bing Qin

    Abstract: Providing timely accessibility reminders of a point-of-interest (POI) plays a vital role in improving user satisfaction of finding places and making visiting decisions. However, it is difficult to keep the POI database in sync with the real-world counterparts due to the dynamic nature of business changes. To alleviate this problem, we formulate and present a practical solution that jointly extract… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: Accepted by CIKM'21

  42. Nonholonomic dynamics and control of road vehicles: moving toward automation

    Authors: Wubing B. Qin, Yiming Zhang, Dénes Takács, Gábor Stépán, Gábor Orosz

    Abstract: Nonholonomic models of automobiles are developed by utilizing tools of analytical mechanics, in particular the Appellian approach that allows one to describe the vehicle dynamics with minimum number of time-dependent state variables. The models are categorized based on how they represent the wheel-ground contact, whether they incorporate the longitudinal dynamics, and whether they consider the ste… ▽ More

    Submitted 25 June, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 42 pages, 25 figures, 5 tables, accepted for inclusion in a future issue in Nonlinear Dynamics, Springer

    Journal ref: Nonlinear Dynamics (2022)

  43. arXiv:2108.01049  [pdf, other

    physics.flu-dyn cond-mat.soft physics.bio-ph

    Bacteria hinder large-scale transport and enhance small-scale mixing in time-periodic flows

    Authors: Ranjiangshang Ran, Quentin Brosseau, Brendan C. Blackwell, Boyang Qin, Rebecca Winter, Paulo E. Arratia

    Abstract: Understanding mixing and transport of passive scalars in active fluids is important to many natural (e.g. algal blooms) and industrial (e.g. biofuel, vaccine production) processes. Here, we study the mixing of a passive scalar (dye) in dilute suspensions of swimming Escherichia coli in experiments using a two-dimensional (2D) time-periodic flow and in a simple simulation. Results show that the pre… ▽ More

    Submitted 22 August, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Supplementary Information added

  44. arXiv:2107.09852  [pdf, other

    cs.CL

    CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision

    Authors: Zhongyang Li, Xiao Ding, Kuo Liao, Bing Qin, Ting Liu

    Abstract: Recent work has shown success in incorporating pre-trained models like BERT to improve NLP systems. However, existing pre-trained models lack of causal knowledge which prevents today's NLP systems from thinking like humans. In this paper, we investigate the problem of injecting causal knowledge into pre-trained models. There are two fundamental problems: 1) how to collect various granularities of… ▽ More

    Submitted 7 August, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

  45. arXiv:2107.03175  [pdf, ps, other

    cs.CL

    A Survey on Dialogue Summarization: Recent Advances and New Frontiers

    Authors: Xiachong Feng, Xiaocheng Feng, Bing Qin

    Abstract: Dialogue summarization aims to condense the original dialogue into a shorter version covering salient information, which is a crucial way to reduce dialogue data overload. Recently, the promising achievements in both dialogue systems and natural language generation techniques drastically lead this task to a new landscape, which results in significant research attentions. However, there still remai… ▽ More

    Submitted 27 April, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: IJCAI 2022 Survey Track

  46. arXiv:2106.09895  [pdf, other

    cs.CL

    PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction

    Authors: Hengyi Zheng, Rui Wen, Xi Chen, Yifan Yang, Yunyan Zhang, Ziheng Zhang, Ningyu Zhang, Bin Qin, Ming Xu, Yefeng Zheng

    Abstract: Joint extraction of entities and relations from unstructured texts is a crucial task in information extraction. Recent methods achieve considerable performance but still suffer from some inherent limitations, such as redundancy of relation prediction, poor generalization of span-based extraction and inefficiency. In this paper, we decompose this task into three subtasks, Relation Judgement, Entity… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL 2021

  47. arXiv:2105.12544  [pdf, other

    cs.CL

    Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization

    Authors: Xiachong Feng, Xiaocheng Feng, Libo Qin, Bing Qin, Ting Liu

    Abstract: Current dialogue summarization systems usually encode the text with a number of general semantic features (e.g., keywords and topics) to gain more powerful dialogue modeling capabilities. However, these features are obtained via open-domain toolkits that are dialog-agnostic or heavily relied on human annotations. In this paper, we show how DialoGPT, a pre-trained model for conversational response… ▽ More

    Submitted 27 May, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  48. arXiv:2104.14839  [pdf, other

    cs.CL

    The Factual Inconsistency Problem in Abstractive Text Summarization: A Survey

    Authors: Yichong Huang, Xiachong Feng, Xiaocheng Feng, Bing Qin

    Abstract: Recently, various neural encoder-decoder models pioneered by Seq2Seq framework have been proposed to achieve the goal of generating more abstractive summaries by learning to map input text to output text. At a high level, such neural models can freely generate summaries without any constraint on the words or phrases used. Moreover, their format is closer to human-edited summaries and output is mor… ▽ More

    Submitted 10 April, 2023; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: 9 pages, 5 figures

  49. arXiv:2104.12377  [pdf, other

    cs.CL

    DADgraph: A Discourse-aware Dialogue Graph Neural Network for Multiparty Dialogue Machine Reading Comprehension

    Authors: Jiaqi Li, Ming Liu, Zihao Zheng, Heng Zhang, Bing Qin, Min-Yen Kan, Ting Liu

    Abstract: Multiparty Dialogue Machine Reading Comprehension (MRC) differs from traditional MRC as models must handle the complex dialogue discourse structure, previously unconsidered in traditional MRC. To fully exploit such discourse structure in multiparty dialogue, we present a discourse-aware dialogue graph neural network, DADgraph, which explicitly constructs the dialogue graph using discourse dependen… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: Accepted by IJCNN 2021

  50. arXiv:2104.08480  [pdf, other

    cs.CL

    Learning to Share by Masking the Non-shared for Multi-domain Sentiment Classification

    Authors: Jianhua Yuan, Yanyan Zhao, Bing Qin, Ting Liu

    Abstract: Multi-domain sentiment classification deals with the scenario where labeled data exists for multiple domains but insufficient for training effective sentiment classifiers that work across domains. Thus, fully exploiting sentiment knowledge shared across domains is crucial for real world applications. While many existing works try to extract domain-invariant features in high-dimensional space, such… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 11 pages