Showing 1–50 of 70 results for author: Gu, W

Searching in archive cs.
  1. arXiv:2407.11677  [pdf, other]

    cs.CV

    Video-Language Alignment Pre-training via Spatio-Temporal Graph Transformer

    Authors: Shi-Xue Zhang, Hongfa Wang, Xiaobin Zhu, Weibo Gu, Tianjin Zhang, Chun Yang, Wei Liu, Xu-Cheng Yin

    Abstract: Video-language alignment is a crucial multi-modal task that benefits various downstream applications, e.g., video-text retrieval and video question answering. Existing methods either utilize multi-modal information in video-text pairs or apply global and local alignment techniques to promote alignment precision. However, these methods often fail to fully explore the spatio-temporal relationships a…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: under review

  2. arXiv:2407.01553  [pdf, other]

    cs.AI

    Fish-bone diagram of research issue: Gain a bird's-eye view on a specific research topic

    Authors: JingHong Li, Huy Phan, Wen Gu, Koichi Ota, Shinobu Hasegawa

    Abstract: Novice researchers often face difficulties in understanding a multitude of academic papers and grasping the fundamentals of a new research field. To solve such problems, the knowledge graph supporting research survey is gradually being developed. Existing keyword-based knowledge graphs make it difficult for researchers to deeply understand abstract concepts. Meanwhile, novice researchers may find…

    Submitted 10 July, 2024; v1 submitted 30 April, 2024; originally announced July 2024.

    Comments: This paper has been accepted by IEEE SMC 2024

  3. arXiv:2406.11904  [pdf, other]

    cs.SI

    Pay Attention to Weak Ties: A Heterogeneous Multiplex Representation Learning Framework for Link Prediction

    Authors: Weiwei Gu, Linbi Lv, Gang Lu, Ruiqi Li

    Abstract: Graph neural networks (GNNs) can learn effective node representations that significantly improve link prediction accuracy. However, most GNN-based link prediction algorithms are poor at predicting weak ties connecting different communities. Most link prediction algorithms are designed for networks with only one type of relation between nodes but neglect the fact that many complex systems, incl…

    Submitted 15 June, 2024; originally announced June 2024.

  4. S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles

    Authors: Xiao Wang, Ke Tang, Xingyuan Dai, Jintao Xu, Quancheng Du, Rui Ai, Yuxiao Wang, Weihao Gu

    Abstract: On public roads, autonomous vehicles (AVs) face the challenge of frequent interactions with human-driven vehicles (HDVs), which exhibit uncertain driving behavior due to varying social characteristics among humans. To effectively assess the risks prevailing in the vicinity of AVs in social interactive traffic scenarios and achieve safe autonomous driving, this article proposes a social-suitable and…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures, published in IEEE Transactions on Intelligent Vehicles

  5. arXiv:2404.00231  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    Attention-based Shape-Deformation Networks for Artifact-Free Geometry Reconstruction of Lumbar Spine from MR Images

    Authors: Linchen Qian, Jiasong Chen, Linhai Ma, Timur Urakov, Weiyong Gu, Liang Liang

    Abstract: Lumbar disc degeneration, a progressive structural wear and tear of the lumbar intervertebral disc, is regarded as playing an essential role in low back pain, a significant global health concern. Automated lumbar spine geometry reconstruction from MR images will enable fast measurement of medical parameters to evaluate the lumbar status, in order to determine a suitable treatment. Existing image segmentation-…

    Submitted 30 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

  6. arXiv:2403.19615  [pdf, other]

    cs.CV

    SA-GS: Scale-Adaptive Gaussian Splatting for Training-Free Anti-Aliasing

    Authors: Xiaowei Song, Jv Zheng, Shiran Yuan, Huan-ang Gao, Jingwei Zhao, Xiang He, Weihao Gu, Hao Zhao

    Abstract: In this paper, we present a Scale-adaptive method for Anti-aliasing Gaussian Splatting (SA-GS). While the state-of-the-art method Mip-Splatting requires modifying the training procedure of Gaussian splatting, our method functions at test-time and is training-free. Specifically, SA-GS can be applied to any pretrained Gaussian splatting field as a plugin to significantly improve the field's anti-alisin…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Project page: https://kevinsong729.github.io/project-pages/SA-GS/ Code: https://github.com/zsy1987/SA-GS

  7. arXiv:2403.18926  [pdf, other]

    cs.LG cs.CL

    XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection

    Authors: Yuanhang Yang, Shiyi Qi, Wenchao Gu, Chaozheng Wang, Cuiyun Gao, Zenglin Xu

    Abstract: Sparse models, including sparse Mixture-of-Experts (MoE) models, have emerged as an effective approach for scaling Transformer models. However, they often suffer from computational inefficiency since a significant number of parameters are unnecessarily involved in computations via multiplying values by zero or low activation values. To address this issue, we present XMoE, a novel MoE designed to…

    Submitted 24 May, 2024; v1 submitted 27 February, 2024; originally announced March 2024.

    Comments: ACL2024 Findings

  8. arXiv:2403.18762  [pdf, other]

    cs.CV cs.AI cs.RO

    ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

    Authors: Weidong Xie, Lun Luo, Nanfei Ye, Yi Ren, Shaoyi Du, Minhang Wang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen

    Abstract: Place recognition is an important task for robots and autonomous cars to localize themselves and close loops in pre-built maps. While single-modal sensor-based methods have shown satisfactory performance, cross-modal place recognition that retrieves images from a point-cloud database remains a challenging problem. Current cross-modal methods transform images into 3D points using depth estimation…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 8 pages, 11 figures, conference

  9. arXiv:2403.07566  [pdf, other]

    cs.AI

    An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning

    Authors: Weiwei Gu, Senquan Wang

    Abstract: Blood Glucose (BG) control, which involves keeping an individual's BG within a healthy range through extracorporeal insulin injections, is an important task for people with type 1 diabetes. However, traditional patient self-management is cumbersome and risky. Recent research has been devoted to exploring individualized and automated BG control approaches, among which Deep Reinforcement Learning (DRL) shows…

    Submitted 15 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  10. Improving link prediction accuracy of network embedding algorithms via rich node attribute information

    Authors: Weiwei Gu, Jinqiang Hou, Weiyi Gu

    Abstract: Complex networks are widely used to represent an abundance of real-world relations ranging from social networks to brain networks. Inferring missing links or predicting future ones based on the currently observed network is known as the link prediction task. Recent network embedding based link prediction algorithms have demonstrated ground-breaking performance on link prediction accuracy. Those alg…

    Submitted 7 March, 2024; originally announced March 2024.

    Journal ref: Journal of Social Computing, 2023, 4(4): 326-336

  11. arXiv:2402.04854  [pdf, other]

    cs.DL cs.CL cs.LG

    Hierarchical Tree-structured Knowledge Graph For Academic Insight Survey

    Authors: Jinghong Li, Huy Phan, Wen Gu, Koichi Ota, Shinobu Hasegawa

    Abstract: Research surveys have always posed a challenge for beginner researchers who lack research training. These researchers struggle to understand the directions within their research topic and to discover new research findings within a short time. One way to provide intuitive assistance to beginner researchers is by offering relevant knowledge graphs (KG) and recommending related academic paper…

    Submitted 4 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted by 'The 18TH International Conference on INnovations in Intelligent SysTems and Applications (INISTA 2024)'

  12. arXiv:2402.01723  [pdf, other]

    cs.CL cs.AI

    An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios

    Authors: Zongjie Li, Wenying Qiu, Pingchuan Ma, Yichen Li, You Li, Sijia He, Baozheng Jiang, Shuai Wang, Weixi Gu

    Abstract: Recent years have witnessed the rapid development of large language models (LLMs) in various domains. To better serve the large number of Chinese users, many commercial vendors in China have adopted localization strategies, training and providing local LLMs specifically customized for Chinese users. Furthermore, looking ahead, one of the key future applications of LLMs will be practical deployment…

    Submitted 26 January, 2024; originally announced February 2024.

  13. arXiv:2401.14385  [pdf, other]

    quant-ph cs.IT math-ph math.PR

    Entropic Quantum Central Limit Theorem and Quantum Inverse Sumset Theorem

    Authors: Kaifeng Bu, Weichen Gu, Arthur Jaffe

    Abstract: We establish an entropic, quantum central limit theorem and quantum inverse sumset theorem in discrete-variable quantum systems describing qudits or qubits. Both results are enabled by using our recently-discovered quantum convolution. We show that the exponential rate of convergence of the entropic central limit theorem is bounded by the magic gap. We also establish a "quantum, entropic inverse…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 23 pages

  14. arXiv:2401.09627  [pdf]

    eess.IV cs.CV cs.LG

    SymTC: A Symbiotic Transformer-CNN Net for Instance Segmentation of Lumbar Spine MRI

    Authors: Jiasong Chen, Linchen Qian, Linhai Ma, Timur Urakov, Weiyong Gu, Liang Liang

    Abstract: Intervertebral disc disease, a prevalent ailment, frequently leads to intermittent or persistent low back pain, and diagnosing and assessing this disease rely on accurate measurement of vertebral bone and intervertebral disc geometries from lumbar MR images. Deep neural network (DNN) models may assist clinicians with more efficient image segmentation of individual instances (disks and vertebrae…

    Submitted 1 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  15. arXiv:2401.06175  [pdf, other]

    cs.SE cs.AI cs.LG

    MTAD: Tools and Benchmarks for Multivariate Time Series Anomaly Detection

    Authors: Jinyang Liu, Wenwei Gu, Zhuangbin Chen, Yichen Li, Yuxin Su, Michael R. Lyu

    Abstract: Key Performance Indicators (KPIs) are essential time-series metrics for ensuring the reliability and stability of many software systems. They faithfully record runtime states to facilitate the understanding of anomalous system behaviors and provide informative clues for engineers to pinpoint the root causes. The unprecedented scale and complexity of modern software systems, however, make the volum…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: The code and datasets are available at https://github.com/OpsPAI/MTAD

  16. arXiv:2401.01918  [pdf, other]

    cs.CV

    Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection

    Authors: Haowen Zheng, Dong Cao, Jintao Xu, Rui Ai, Weihao Gu, Yang Yang, Yanyan Liang

    Abstract: Striking a balance between precision and efficiency presents a prominent challenge in bird's-eye-view (BEV) 3D object detection. Although previous camera-based BEV methods achieved remarkable performance by incorporating long-term temporal information, most of them still face the problem of low efficiency. One potential solution is knowledge distillation. Existing distillation methods only foc…

    Submitted 8 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

  17. arXiv:2312.13219  [pdf, other]

    cs.RO cs.CL cs.CV

    Interactive Visual Task Learning for Robots

    Authors: Weiwei Gu, Anant Sah, Nakul Gopalan

    Abstract: We present a framework for robots to learn novel visual concepts and tasks via in-situ linguistic interactions with human users. Previous approaches have either used large pre-trained visual models to infer novel objects zero-shot, or added novel concepts along with their attributes and representations to a concept hierarchy. We extend the approaches that focus on learning visual concept hierarchi…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: In Proceedings of The 38th Annual AAAI Conference on Artificial Intelligence

  18. arXiv:2312.09038  [pdf, other]

    cs.CV cs.DL cs.LG

    Object Recognition from Scientific Document based on Compartment Refinement Framework

    Authors: Jinghong Li, Wen Gu, Koichi Ota, Shinobu Hasegawa

    Abstract: With the rapid development of the internet in the past decade, it has become increasingly important to extract valuable information from vast resources efficiently, which is crucial for establishing a comprehensive digital ecosystem, particularly in the context of research surveys and comprehension. The foundation of these tasks focuses on accurate extraction and deep mining of data from scientifi…

    Submitted 4 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.17401

  19. arXiv:2311.17663  [pdf, other]

    cs.CV

    Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications

    Authors: Junyi Ma, Xieyuanli Chen, Jiawei Huang, Jingyi Xu, Zhen Luo, Jintao Xu, Weihao Gu, Rui Ai, Hesheng Wang

    Abstract: Understanding how the surrounding environment changes is crucial for performing downstream tasks safely and reliably in autonomous driving applications. Recent occupancy estimation techniques using only camera images as input can provide dense occupancy representations of large-scale scenes based on the current observation. However, they are mostly limited to representing the current 3D space and…

    Submitted 7 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  20. arXiv:2311.14514  [pdf, other]

    cs.CR cs.AI

    FRAD: Front-Running Attacks Detection on Ethereum using Ternary Classification Model

    Authors: Yuheng Zhang, Pin Liu, Guojun Wang, Peiqiang Li, Wanyi Gu, Houji Chen, Xuelei Liu, Jinyao Zhu

    Abstract: With the evolution of blockchain technology, the issue of transaction security, particularly on platforms like Ethereum, has become increasingly critical. Front-running attacks, a unique form of security threat, pose significant challenges to the integrity of blockchain transactions. In these attack scenarios, malicious actors monitor other users' transaction activities, then strategically submit…

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 14 pages, 8 figures

  21. arXiv:2310.02609  [pdf, other]

    cs.CR

    RLTrace: Synthesizing High-Quality System Call Traces for OS Fuzz Testing

    Authors: Wei Chen, Huaijin Wang, Weixi Gu, Shuai Wang

    Abstract: Securing the operating system (OS) kernel is a central challenge in today's cyber security landscape. The cutting-edge testing technique for OS kernels is software fuzz testing. By mutating the program inputs with random variations for iterations, fuzz testing aims to trigger program crashes and hangs caused by potential bugs that can be abused by the inputs. To achieve high OS code coverage, the de f…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Information Security Conference 2023

  22. arXiv:2308.00624  [pdf, other]

    cs.CL cs.AI

    JIANG: Chinese Open Foundation Language Model

    Authors: Qinhua Duan, Wenchao Gu, Yujia Chen, Wenxin Mao, Zewen Tian, Hui Cao

    Abstract: With recent advancements, large language model technology has showcased capabilities that come close to those of human beings across various tasks. This achievement has garnered significant interest from companies and scientific research institutions, leading to substantial investments in the research and development of these models. While numerous large models have emerged during this period,…

    Submitted 1 August, 2023; originally announced August 2023.

  23. arXiv:2307.14041  [pdf, other]

    cs.CR cs.DL

    GovernR: Provenance and Confidentiality Guarantees In Research Data Repositories

    Authors: Anwitaman Datta, Chua Chiah Soon, Wangfan Gu

    Abstract: We propose cryptographic protocols to incorporate time provenance guarantees while meeting confidentiality and controlled sharing needs for research data. We demonstrate the efficacy of these mechanisms by developing and benchmarking a practical tool, GovernR, which furthermore takes into account usability issues and is compatible with a popular open-sourced research data storage platform, Dataverse. In d…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 2 Figures, 3 Tables

  24. arXiv:2307.10869  [pdf, other]

    cs.LG cs.SE

    Performance Issue Identification in Cloud Systems with Relational-Temporal Anomaly Detection

    Authors: Wenwei Gu, Jinyang Liu, Zhuangbin Chen, Jianping Zhang, Yuxin Su, Jiazhen Gu, Cong Feng, Zengyin Yang, Michael Lyu

    Abstract: Performance issues permeate large-scale cloud service systems, which can lead to huge revenue losses. To ensure reliable performance, it's essential to accurately identify and localize these issues using service monitoring metrics. Given the complexity and scale of modern cloud systems, this task can be challenging and may require extensive expertise and resources beyond the capacity of individual…

    Submitted 1 August, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  25. arXiv:2306.17797  [pdf, other]

    cs.CV eess.IV

    HIDFlowNet: A Flow-Based Deep Network for Hyperspectral Image Denoising

    Authors: Li Pang, Weizhen Gu, Xiangyong Cao, Xiangyu Rui, Jiangjun Peng, Shuang Xu, Gang Yang, Deyu Meng

    Abstract: Hyperspectral image (HSI) denoising is essentially ill-posed since a noisy HSI can be degraded from multiple clean HSIs. However, current deep learning-based approaches ignore this fact and restore the clean image with deterministic mapping (i.e., the network receives a noisy HSI and outputs a clean HSI). To alleviate this issue, this paper proposes a flow-based HSI denoising network (HIDFlowNet)…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 10 pages, 8 figures

  26. arXiv:2306.15369  [pdf, other]

    cs.SE cs.LG

    A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction

    Authors: Ch Muhammad Awais, Wei Gu, Gcinizwe Dlamini, Zamira Kholmatova, Giancarlo Succi

    Abstract: Is there a statistical difference between Naive Bayes and Random Forest in terms of recall, f-measure, and precision for predicting software defects? By utilizing systematic literature review and meta-analysis, we are answering this question. We conducted a systematic literature review by establishing criteria to search and choose papers, resulting in five studies. After that, using the meta-data…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 11 pages, 8 figures, Conference Paper

    Journal ref: Intelligent Systems Design and Applications. ISDA 2022. Lecture Notes in Networks and Systems, vol 716

  27. arXiv:2306.10200  [pdf, other]

    cs.CR

    Privacy-Enhancing Technologies for Financial Data Sharing

    Authors: Panagiotis Chatzigiannis, Wanyun Catherine Gu, Srinivasan Raghuraman, Peter Rindal, Mahdi Zamani

    Abstract: Today, financial institutions (FIs) store and share consumers' financial data for various reasons such as offering loans, processing payments, and protecting against fraud and financial crime. Such sharing of sensitive data has been subject to data breaches in the past decade. While some regulations (e.g., GDPR, FCRA, and CCPA) help to prevent institutions from freely sharing clients' sensitive…

    Submitted 16 June, 2023; originally announced June 2023.

  28. arXiv:2306.09292  [pdf, other]

    quant-ph cs.CC math-ph

    Stabilizer Testing and Magic Entropy

    Authors: Kaifeng Bu, Weichen Gu, Arthur Jaffe

    Abstract: We introduce systematic protocols to perform stabilizer testing for quantum states and gates. These protocols are based on quantum convolutions and swap-tests, realized by quantum circuits that implement the quantum convolution for both qubit and qudit systems. We also introduce "magic entropy" to quantify magic in quantum states and gates, in a way which may be measurable experimentally.

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 35 pages

  29. arXiv:2305.18743  [pdf, other]

    cs.CV

    Decomposed Human Motion Prior for Video Pose Estimation via Adversarial Training

    Authors: Wenshuo Chen, Xiang Zhou, Zhengdi Yu, Weixi Gu, Kai Zhang

    Abstract: Estimating human pose from video is a task that receives considerable attention due to its applicability in numerous 3D fields. The complexity of prior knowledge of human body movements poses a challenge to neural network models in the task of regressing keypoints. In this paper, we address this problem by incorporating motion prior in an adversarial way. Different from previous methods, we propos…

    Submitted 24 September, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  30. A Framework For Refining Text Classification and Object Recognition from Academic Articles

    Authors: Jinghong Li, Koichi Ota, Wen Gu, Shinobu Hasegawa

    Abstract: With the widespread use of the internet, it has become increasingly crucial to extract specific information from vast amounts of academic articles efficiently. Data mining techniques are generally employed to solve this issue. However, data mining for academic articles is challenging since it requires automatically extracting specific patterns in complex and unstructured layout documents. Current…

    Submitted 2 July, 2024; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted at 'The International Symposium on Innovations in Intelligent Systems and Applications 2023 (INISTA 2023)'

  31. arXiv:2303.15587  [pdf]

    cs.CL

    Linguistically Informed ChatGPT Prompts to Enhance Japanese-Chinese Machine Translation: A Case Study on Attributive Clauses

    Authors: Wenshi Gu

    Abstract: In the field of Japanese-Chinese translation linguistics, the issue of correctly translating attributive clauses has persistently proven to be challenging. Present-day machine translation tools often fail to accurately translate attributive clauses from Japanese to Chinese. In light of this, this paper investigates the linguistic problem underlying such difficulties, namely how does the semantic r…

    Submitted 27 March, 2023; originally announced March 2023.

  32. arXiv:2303.01043  [pdf, other]

    cs.CV

    I2P-Rec: Recognizing Images on Large-scale Point Cloud Maps through Bird's Eye View Projections

    Authors: Shuhang Zheng, Yixuan Li, Zhu Yu, Beinan Yu, Si-Yuan Cao, Minhang Wang, Jintao Xu, Rui Ai, Weihao Gu, Lun Luo, Hui-Liang Shen

    Abstract: Place recognition is an important technique for autonomous cars to achieve full autonomy since it can provide an initial guess to online localization algorithms. Although current methods based on images or point clouds have achieved satisfactory performance, localizing the images on a large-scale point cloud map remains a fairly unexplored problem. This cross-modal matching task is challenging due…

    Submitted 14 August, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted by IROS 2023

  33. arXiv:2212.02029  [pdf, other]

    math.DG cs.AI

    The LG Fibration

    Authors: Daniel Livschitz, Weiqing Gu

    Abstract: Deep Learning has significantly impacted the application of data-to-decision throughout research and industry; however, it lacks a rigorous mathematical foundation, which creates situations where algorithmic results fail to be practically invertible. In this paper we present a nearly invertible mapping between $\mathbb{R}^{2^n}$ and $\mathbb{R}^{n+1}$ via a topological connection between…

    Submitted 5 December, 2022; originally announced December 2022.

  34. arXiv:2211.15656  [pdf, other]

    cs.CV cs.RO

    SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation

    Authors: Hao Dong, Xianjing Zhang, Jintao Xu, Rui Ai, Weihao Gu, Huimin Lu, Juho Kannala, Xieyuanli Chen

    Abstract: High-definition (HD) semantic map generation of the environment is an essential component of autonomous driving. Existing methods have achieved good performance in this task by fusing different sensor modalities, such as LiDAR and camera. However, current works are based on raw data or network feature-level fusion and only consider short-range HD map generation, limiting their deployment to realis…

    Submitted 16 March, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

  35. arXiv:2210.06824  [pdf, other]

    cs.CL

    An Empirical Study on Finding Spans

    Authors: Weiwei Gu, Boyuan Zheng, Yunmo Chen, Tongfei Chen, Benjamin Van Durme

    Abstract: We present an empirical study on methods for span finding, the selection of consecutive tokens in text for some downstream tasks. We focus on approaches that can be employed in training end-to-end information extraction systems, and find there is no definitive solution without considering task properties, and provide our observations to help with future design choices: 1) a tagging approach often…

    Submitted 13 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  36. arXiv:2210.06600  [pdf, other]

    cs.CL

    Iterative Document-level Information Extraction via Imitation Learning

    Authors: Yunmo Chen, William Gantt, Weiwei Gu, Tongfei Chen, Aaron Steven White, Benjamin Van Durme

    Abstract: We present a novel iterative extraction model, IterX, for extracting complex relations, or templates (i.e., N-tuples representing a mapping from named slots to spans of text) within a document. Documents may feature zero or more instances of a template of any given type, and the task of template extraction entails identifying the templates in a document and extracting each template's slot values.…

    Submitted 1 May, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to EACL 2023

  37. arXiv:2209.08055  [pdf, other]

    cs.SE

    A Transformer-Based Approach for Improving App Review Response Generation

    Authors: Weizhe Zhang, Wenchao Gu, Cuiyun Gao, Michael R. Lyu

    Abstract: Mobile apps are becoming an integral part of people's daily life by providing various functionalities, such as messaging and gaming. App developers try their best to ensure user experience during app development and maintenance to improve the rating of their apps on app platforms and attract more user downloads. Previous studies indicated that responding to users' reviews tends to change their att…

    Submitted 25 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted to Software: Practice and Experience (SPE)

  38. arXiv:2209.00465  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    On Grounded Planning for Embodied Tasks with Language Models

    Authors: Bill Yuchen Lin, Chengsong Huang, Qian Liu, Wenda Gu, Sam Sommerer, Xiang Ren

    Abstract: Language models (LMs) have demonstrated their capability in possessing commonsense knowledge of the physical world, a crucial aspect of performing tasks in everyday life. However, it remains unclear whether LMs have the capacity to generate grounded, executable plans for embodied tasks. This is a challenging task as LMs lack the ability to perceive the environment through vision and feedback f…

    Submitted 15 July, 2023; v1 submitted 29 August, 2022; originally announced September 2022.

    Comments: Accepted to AAAI 2023 Project website: https://yuchenlin.xyz/g-planet/

  39. arXiv:2208.11908  [pdf, other]

    cs.CV

    Adaptive Perception Transformer for Temporal Action Localization

    Authors: Yizheng Ouyang, Tianjin Zhang, Weibo Gu, Hongfa Wang

    Abstract: Temporal action localization aims to predict the boundary and category of each action instance in untrimmed long videos. Most previous methods based on anchors or proposals neglect the global-local context interaction in entire video sequences. Besides, their multi-stage designs cannot generate action boundaries and categories straightforwardly. To address the above issues, this paper proposes…

    Submitted 15 September, 2022; v1 submitted 25 August, 2022; originally announced August 2022.

  40. arXiv:2208.05144  [pdf]

    cs.LG stat.ML

    Machine Learning-based EEG Applications and Markets

    Authors: Weiqing Gu, Bohan Yang, Ryan Chang

    Abstract: This paper addresses both the various EEG applications and the current EEG market ecosystem propelled by machine learning. Increasingly available open medical and health datasets using EEG encourage data-driven research with a promise of improving neurology for patient care through knowledge discovery and machine learning data science algorithm development. This effort leads to various kinds of EE…

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 35 pages

  41. arXiv:2207.07278  [pdf, other]

    cs.CV cs.CL cs.MM

    Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization

    Authors: Mengyin Liu, Chao Zhu, Hongyu Gao, Weibo Gu, Hongfa Wang, Wei Liu, Xu-cheng Yin

    Abstract: With the prosperity of the e-commerce industry, various modalities, e.g., vision and language, are utilized to describe product items. It is an enormous challenge to understand such diversified data, especially via extracting the attribute-value pairs in text sequences with the aid of helpful image regions. Although a series of previous works have been dedicated to this task, there remain seldomly inv…

    Submitted 6 April, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

  42. arXiv:2207.02201  [pdf, other]

    cs.CV cs.RO

    Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation

    Authors: Jiadai Sun, Yuchao Dai, Xianjing Zhang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen

    Abstract: Accurate moving object segmentation is an essential task for autonomous driving. It can provide effective information for many downstream tasks, such as collision avoidance, path planning, and static map construction. How to effectively exploit the spatial-temporal information is a critical question for 3D LiDAR moving object segmentation (LiDAR-MOS). In this work, we propose a novel deep neural n…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted by IROS2022. Code: https://github.com/haomo-ai/MotionSeg3D

  43. arXiv:2205.14413  [pdf]

    cs.GT math.OC

    Discrimination-Based Double Auction for Maximizing Social Welfare in the Electricity and Heating Market Considering Privacy Preservation

    Authors: Lu Wang, Wei Gu, Shuai Lu, Haifeng Qiu, Zhi Wu

    Abstract: This paper proposes a double-sided auction mechanism with price discrimination for social welfare (SW) maximization in the electricity and heating market. In this mechanism, energy service providers (ESPs) submit offers and load aggregators (LAs) submit bids to an energy trading center (ETC) to maximize their utility; in turn, the selfless ETC as an auctioneer leverages discriminatory price weig…

    Submitted 28 May, 2022; originally announced May 2022.

  44. arXiv:2204.03293  [pdf, other]

    cs.SE cs.AI cs.LG

    CoCoSoDa: Effective Contrastive Learning for Code Search

    Authors: Ensheng Shi, Yanlin Wang, Wenchao Gu, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang, Hongbin Sun

    Abstract: Code search aims to retrieve semantically relevant code snippets for a given natural language query. Recently, many approaches employing contrastive learning have shown promising results on code representation learning and greatly improved the performance of code search. However, there is still a lot of room for improvement in using contrastive learning for code search. In this paper, we propose C…

    Submitted 12 February, 2023; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Accepted by ICSE 2023 (The 45th International Conference on Software Engineering)

  45. arXiv:2203.15287  [pdf, other]

    cs.SE cs.AI

    Accelerating Code Search with Deep Hashing and Code Classification

    Authors: Wenchao Gu, Yanlin Wang, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang, Michael R. Lyu

    Abstract: Code search retrieves reusable code snippets from a source code corpus based on natural language queries. Deep learning-based methods of code search have shown promising results. However, previous methods focus on retrieval accuracy but pay little attention to the efficiency of the retrieval process. We propose a novel method CoSHC to accelerate code search with deep hashing and code classification,…

    Submitted 30 March, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022)

  46. OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition

    Authors: Junyi Ma, Jun Zhang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen

    Abstract: Place recognition is an important capability for autonomously navigating vehicles operating in complex environments and under changing conditions. It is a key component for tasks such as loop closing in SLAM or global localization. In this paper, we address the problem of place recognition based on 3D LiDAR scans recorded by an autonomous vehicle. We propose a novel lightweight neural network expl…

    Submitted 19 April, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Accepted by RAL/IROS 2022

  47. arXiv:2202.13686  [pdf, other]

    cs.AI

    Points-of-Interest Relationship Inference with Spatial-enriched Graph Neural Networks

    Authors: Yile Chen, Xiucheng Li, Gao Cong, Cheng Long, Zhifeng Bao, Shang Liu, Wanli Gu, Fuzheng Zhang

    Abstract: As a fundamental component in location-based services, inferring the relationship between points of interest (POIs) is critical for service providers to offer good user experience to business owners and customers. Most of the existing methods for relationship inference are not targeted at POIs, thus failing to capture unique spatial characteristics that have huge effects on POI relationships.…

    Submitted 28 February, 2022; originally announced February 2022.

  48. arXiv:2202.06521  [pdf, other]

    cs.CL cs.SE

    Source Code Summarization with Structural Relative Position Guided Transformer

    Authors: Zi Gong, Cuiyun Gao, Yasheng Wang, Wenchao Gu, Yun Peng, Zenglin Xu

    Abstract: Source code summarization aims at generating concise and clear natural language descriptions for programming languages. Well-written code summaries are beneficial for programmers to participate in the software development and maintenance process. To learn the semantic representations of source code, recent efforts focus on incorporating the syntax structure of code into neural networks such as Tra…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 12 pages, SANER 2022

  49. arXiv:2201.08054  [pdf, other]

    cs.CL

    VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation

    Authors: Yihang Li, Shuichiro Shimizu, Weiqi Gu, Chenhui Chu, Sadao Kurohashi

    Abstract: Existing multimodal machine translation (MMT) datasets consist of images and video captions or general subtitles, which rarely contain linguistic ambiguity, making visual information less effective for generating appropriate translations. We introduce VISA, a new dataset that consists of 40k Japanese-English parallel sentence pairs and corresponding video clips with the following key features: (1)…

    Submitted 26 May, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted by LREC2022

  50. arXiv:2112.12653  [pdf, other]

    cs.SE

    Revisiting, Benchmarking and Exploring API Recommendation: How Far Are We?

    Authors: Yun Peng, Shuqing Li, Wenwei Gu, Yichen Li, Wenxuan Wang, Cuiyun Gao, Michael Lyu

    Abstract: Application Programming Interfaces (APIs), which encapsulate the implementation of specific functions as interfaces, greatly improve the efficiency of modern software development. As the number of APIs keeps growing, developers can hardly be familiar with all of them and usually need to search for appropriate APIs to use. Much effort has therefore been devoted to improving the API recommendat…

    Submitted 23 December, 2021; originally announced December 2021.