Showing 1–50 of 106 results for author: Ji, D

  1. arXiv:2408.10508  [pdf, ps, other]

    math.CO

    Non-Stabilizing Parallel Chip-Firing Games

    Authors: David Ji, Michael Li, Daniel Wang

    Abstract: In 2010, Kominers and Kominers proved that any parallel chip-firing game on $G(V,\,E)$ with $|σ|\geq 4|E|-|V|$ chips stabilizes. Recently, Bu, Choi, and Xu made the bound exact: all games with $|σ|< |E|$ chips or $|σ|> 3|E|-|V|$ chips stabilize. Meanwhile, Levine found a "devil's staircase" pattern in the plot of the activity of parallel chip-firing games against their density of chips. The stabi…

    Submitted 24 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: 11 pages

    MSC Class: 05C57 68R10

  2. arXiv:2408.09506  [pdf, other]

    cs.DB

    The Story Behind the Lines: Line Charts as a Gateway to Dataset Discovery

    Authors: Daomin Ji, Hui Luo, Zhifeng Bao, J. Shane Culpepper

    Abstract: Line charts are a valuable tool for data analysis and exploration, distilling essential insights from a dataset. However, access to the underlying dataset behind a line chart is rarely readily available. In this paper, we explore a novel dataset discovery problem, dataset discovery via line charts, focusing on the use of line charts as queries to discover datasets within a large data repository th…

    Submitted 18 August, 2024; originally announced August 2024.

  3. arXiv:2408.04579  [pdf, other]

    cs.CV

    SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More

    Authors: Tianrun Chen, Ankang Lu, Lanyun Zhu, Chaotao Ding, Chunan Yu, Deyi Ji, Zejian Li, Lingyun Sun, Papa Mao, Ying Zang

    Abstract: The advent of large models, also known as foundation models, has significantly transformed the AI research landscape, with models like Segment Anything (SAM) achieving notable success in diverse image segmentation scenarios. Despite its advancements, SAM encountered limitations in handling some complex low-level segmentation tasks like camouflaged object and medical imaging. In response, in 2023,…

    Submitted 10 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: text overlap with arXiv:2304.09148

  4. arXiv:2407.21560  [pdf, ps, other]

    cs.CL cs.AI

    Generative Sentiment Analysis via Latent Category Distribution and Constrained Decoding

    Authors: Jun Zhou, Dongyang Yu, Kamran Aziz, Fangfang Su, Qing Zhang, Fei Li, Donghong Ji

    Abstract: Fine-grained sentiment analysis involves extracting and organizing sentiment elements from textual data. However, existing approaches often overlook issues of category semantic inclusion and overlap, as well as inherent structural patterns within the target sequence. This study introduces a generative sentiment analysis model. To address the challenges related to category semantic inclusion and ov…

    Submitted 31 July, 2024; originally announced July 2024.

  5. arXiv:2407.15889  [pdf, ps, other]

    math.CO

    Parallel chip-firing games on directed graphs

    Authors: David Ji, Michael Li, Daniel Wang

    Abstract: In 1992, Bitar and Goles introduced the parallel chip-firing game on undirected graphs. Two years later, Prisner extended the game to directed graphs. While the properties of parallel chip-firing games on undirected graphs have been extensively studied, their analogs for parallel chip-firing games on directed graphs have been sporadic. In this paper, we prove the outstanding analogs of the core re…

    Submitted 21 July, 2024; originally announced July 2024.

  6. arXiv:2407.04801  [pdf, other]

    cs.CL cs.AI

    Revisiting Structured Sentiment Analysis as Latent Dependency Graph Parsing

    Authors: Chengjie Zhou, Bobo Li, Hao Fei, Fei Li, Chong Teng, Donghong Ji

    Abstract: Structured Sentiment Analysis (SSA) was cast as a problem of bi-lexical dependency graph parsing by prior studies. Multiple formulations have been proposed to construct the graph, which share several intrinsic drawbacks: (1) The internal structures of spans are neglected, thus only the boundary tokens of spans are used for relation prediction and span recognition, thus hindering the model's expres…

    Submitted 5 July, 2024; originally announced July 2024.

  7. arXiv:2407.01530  [pdf, other]

    eess.IV cs.CV

    xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart

    Authors: Tianrun Chen, Chaotao Ding, Lanyun Zhu, Tao Xu, Deyi Ji, Yan Wang, Ying Zang, Zejian Li

    Abstract: Convolutional Neural Networks (CNNs) and Vision Transformers (ViT) have been pivotal in biomedical image segmentation, yet their ability to manage long-range dependencies remains constrained by inherent locality and computational overhead. To overcome these challenges, in this technical report, we first propose xLSTM-UNet, a UNet structured deep learning neural network that leverages Vision-LSTM (…

    Submitted 2 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  8. arXiv:2406.19632  [pdf, other]

    cs.CV

    PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation

    Authors: Deyi Ji, Wenwei Jin, Hongtao Lu, Feng Zhao

    Abstract: The ascension of Unmanned Aerial Vehicles (UAVs) in various fields necessitates effective UAV image segmentation, which faces challenges due to the dynamic perspectives of UAV-captured images. Traditional segmentation algorithms falter as they cannot accurately mimic the complexity of UAV perspectives, and the cost of obtaining multi-perspective labeled datasets is prohibitive. To address these is…

    Submitted 11 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: IJCAI 2024

  9. arXiv:2406.16021  [pdf, other]

    cs.CL cs.AI

    Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction Paradigm

    Authors: Qiang Gao, Zixiang Meng, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji

    Abstract: Document-level event extraction aims to extract structured event information from unstructured text. However, a single document often contains limited event information and the roles of different event arguments may be biased due to the influence of the information source. This paper addresses the limitations of traditional document-level event extraction by proposing the task of cross-document ev…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: ACL2024(Findings)

    Report number: 2024.findings-acl.114

    Journal ref: https://aclanthology.org/2024.findings-acl.114

  10. arXiv:2406.15990  [pdf, other]

    cs.CL cs.AI

    Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information

    Authors: Qiang Gao, Bobo Li, Zixiang Meng, Yunlong Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji

    Abstract: Existing cross-document event coreference resolution models, which either compute mention similarity directly or enhance mention representation by extracting event arguments (such as location, time, agent, and patient), lack the ability to utilize document-level information. As a result, they struggle to capture long-distance dependencies. This shortcoming leads to their underwhelming performan…

    Submitted 22 June, 2024; originally announced June 2024.

    Report number: https://aclanthology.org/2024.lrec-main.523/

    Journal ref: LREC|COLING, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024, 5907-5921

  11. arXiv:2406.10475  [pdf, other]

    cs.CV

    Discrete Latent Perspective Learning for Segmentation and Detection

    Authors: Deyi Ji, Feng Zhao, Lanyun Zhu, Wenwei Jin, Hongtao Lu, Jieping Ye

    Abstract: In this paper, we address the challenge of Perspective-Invariant Learning in machine learning and computer vision, which involves enabling a network to understand images from varying perspectives to achieve consistent semantic interpretation. While standard approaches rely on the labor-intensive collection of multi-view images or limited data augmentation techniques, we propose a novel framework,…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: ICML 2024 Spotlight

  12. arXiv:2406.03871  [pdf]

    physics.acc-ph

    Development of high-level applications for High Energy Photon Source booster

    Authors: Yuemei Peng, Daheng Ji, Hongfei Ji, Nan Li, Xiaohan Lu, Saike Tian, Yuanyuan Wei, Haisheng Xu, Yaliang Zhao, Yi Jiao, Jingyi Li

    Abstract: The High Energy Photon Source (HEPS) is the first fourth-generation storage ring light source being built in the suburb of Beijing, China. The storage ring was designed with an emittance lower than 60 pm·rad, a circumference of 1.36 km, and a beam energy of 6 GeV. Its injector contains a 500 MeV S-band Linac and a 454 m booster which was designed as an accumulator at the extraction energy. In t…

    Submitted 6 June, 2024; originally announced June 2024.

  13. arXiv:2405.19326  [pdf, other]

    cs.CV cs.GR cs.HC

    Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models

    Authors: Tianrun Chen, Chunan Yu, Jing Li, Jianqi Zhang, Lanyun Zhu, Deyi Ji, Yong Zhang, Ying Zang, Zejian Li, Lingyun Sun

    Abstract: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that transcends limitations for previous category-specific 3D semantic segmentation, 3D instance segmentation, and open-vocabulary 3D segmentation. We design a simple baseline method, Reasoning3D, with the capability to understand…

    Submitted 29 May, 2024; originally announced May 2024.

  14. arXiv:2405.08816  [pdf, other]

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang, et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c…

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  15. arXiv:2405.04434  [pdf, other]

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference…

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  16. arXiv:2404.14728  [pdf]

    cs.LG cs.CY

    Novel Topological Machine Learning Methodology for Stream-of-Quality Modeling in Smart Manufacturing

    Authors: Jay Lee, Dai-Yan Ji, Yuan-Ming Hsu

    Abstract: This paper presents a topological analytics approach within the 5-level Cyber-Physical Systems (CPS) architecture for the Stream-of-Quality assessment in smart manufacturing. The proposed methodology not only enables real-time quality monitoring and predictive analytics but also discovers the hidden relationships between quality features and process parameters across different manufacturing proces…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: The paper has been submitted to Manufacturing Letters (Under Review)

  17. arXiv:2404.13851  [pdf, other]

    q-bio.NC

    Theta oscillons in behaving rats

    Authors: M. S. Zobaer, N. Lotfi, C. M. Domenico, C. Hoffman, L. Perotti, D. Ji, Y. Dabaghian

    Abstract: Recently discovered constituents of the brain waves -- the oscillons -- provide high-resolution representation of the extracellular field dynamics. Here we study the most robust, highest-amplitude oscillons that manifest in actively behaving rats and generally correspond to the traditional theta-waves. We show that the resemblances between theta-oscillons and the conventional theta-waves apply to…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  18. arXiv:2403.15926  [pdf, other]

    q-bio.NC

    Altered patterning of neural activity in a tauopathy mouse model

    Authors: C. Hoffman, J. Cheng, R. Morales, D. Ji, Y. Dabaghian

    Abstract: Alzheimer's disease (AD) is a complex neurodegenerative condition that manifests at multiple levels and involves a spectrum of abnormalities ranging from the cellular to cognitive. Here, we investigate the impact of AD-related tau-pathology on hippocampal circuits in mice engaged in spatial navigation, and study changes of neuronal firing and dynamics of extracellular fields. While most studies ar…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 17 pages, plus supplementary material

  19. arXiv:2403.15776  [pdf, other]

    cs.CL cs.AI

    Modeling Unified Semantic Discourse Structure for High-quality Headline Generation

    Authors: Minghui Xu, Hao Fei, Fei Li, Shengqiong Wu, Rui Sun, Chong Teng, Donghong Ji

    Abstract: Headline generation aims to summarize a long document with a short, catchy title that reflects the main idea. This requires accurately capturing the core document semantics, which is challenging due to the lengthy and background information-rich nature of the texts. In this work, we propose using a unified semantic discourse structure (S3) to represent document semantics, achieved by combining do…

    Submitted 23 March, 2024; originally announced March 2024.

  20. arXiv:2403.10830  [pdf, other]

    cs.CV

    View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV

    Authors: Deyi Ji, Siqi Gao, Lanyun Zhu, Qi Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhao, Jieping Ye

    Abstract: In this paper, we address the challenge of multi-object tracking (MOT) in moving Unmanned Aerial Vehicle (UAV) scenarios, where irregular flight trajectories, such as hovering, turning left/right, and moving up/down, lead to significantly greater complexity compared to fixed-camera MOT. Specifically, changes in the scene background not only render traditional frame-to-frame object IOU association…

    Submitted 14 May, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  21. arXiv:2403.03721  [pdf, other]

    cs.CV

    CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection

    Authors: Gyusam Chang, Wonseok Roh, Sujin Jang, Dongwook Lee, Daehyun Ji, Gyeongrok Oh, Jinsun Park, Jinkyu Kim, Sangpil Kim

    Abstract: Recent LiDAR-based 3D Object Detection (3DOD) methods show promising results, but they often do not generalize well to target domains outside the source (or training) data distribution. To reduce such domain gaps and thus to make 3DOD models more generalizable, we introduce a novel unsupervised domain adaptation (UDA) method, called CMDA, which (i) leverages visual semantic cues from an image moda…

    Submitted 6 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI 2024

  22. arXiv:2402.18476  [pdf, other]

    cs.CV

    IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding

    Authors: Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu

    Abstract: Despite achieving rapid developments and with widespread applications, Large Vision-Language Models (LVLMs) confront a serious challenge of being prone to generating hallucinations. An over-reliance on linguistic priors has been identified as a key factor leading to these hallucinations. In this paper, we propose to alleviate this problem by introducing a novel image-biased decoding (IBD) techniqu…

    Submitted 28 February, 2024; originally announced February 2024.

  23. arXiv:2402.13693  [pdf, other]

    cs.CL

    CMNER: A Chinese Multimodal NER Dataset based on Social Media

    Authors: Yuanze Ji, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji

    Abstract: Multimodal Named Entity Recognition (MNER) is a pivotal task designed to extract named entities from text with the support of pertinent images. Nonetheless, a notable paucity of data for Chinese MNER has considerably impeded the progress of this natural language processing task within the Chinese domain. Consequently, in this study, we compile a Chinese Multimodal NER dataset (CMNER) utilizing dat…

    Submitted 1 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  24. arXiv:2402.07218  [pdf, other]

    cs.RO

    Sensor Misalignment-tolerant AUV Navigation with Passive DoA and Doppler Measurements

    Authors: Bingbing Zhang, Shuo Liu, Shanmin Zhou, Daxiong Ji, Tao Wang, Tian Xia, Wen Xu

    Abstract: We present a sensor misalignment-tolerant AUV navigation method that leverages measurements from an acoustic array and dead reckoned information. Recent studies have demonstrated the potential use of passive acoustic Direction of Arrival (DoA) measurements for AUV navigation without requiring ranging measurements. However, the sensor misalignment between the acoustic array and the attitude sensor…

    Submitted 11 February, 2024; originally announced February 2024.

  25. arXiv:2401.07716  [pdf, other]

    quant-ph

    Disentanglement Provides a Unified Estimation for Quantum Entropies and Distance Measures

    Authors: Myeongjin Shin, Seungwoo Lee, Junseo Lee, Mingyu Lee, Donghwa Ji, Hyeonjun Yeo, Kabgyun Jeong

    Abstract: The estimation of quantum entropies and distance measures, such as von Neumann entropy, Rényi entropy, Tsallis entropy, trace distance, and fidelity-induced distances like Bures distance, has been a key area of research. This paper introduces a unified approach using Disentangling Quantum Neural Networks (DEQNN) for estimating these quantities, leveraging continuity bounds and disentanglement in t…

    Submitted 29 July, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 12 pages, 3 figures

  26. arXiv:2312.17428  [pdf, other]

    cs.CV

    ChangeNet: Multi-Temporal Asymmetric Change Detection Dataset

    Authors: Deyi Ji, Siqi Gao, Mingyuan Tao, Hongtao Lu, Feng Zhao

    Abstract: Change Detection (CD) has been attracting extensive interest with the availability of bi-temporal datasets. However, due to the huge cost of multi-temporal image acquisition and labeling, existing change detection datasets are small in quantity, short in temporal span, and low in practicability. Therefore, a large-scale practical-oriented dataset covering wide temporal phases is urgently needed to fa…

    Submitted 11 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024 Oral/Lecture

  27. arXiv:2312.15291  [pdf, other]

    cs.CL

    Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought

    Authors: Li Zheng, Hao Fei, Fei Li, Bobo Li, Lizi Liao, Donghong Ji, Chong Teng

    Abstract: With the proliferation of dialogic data across the Internet, the Dialogue Commonsense Multi-choice Question Answering (DC-MCQ) task has emerged as a response to the challenge of comprehending user queries and intentions. Although prevailing methodologies exhibit effectiveness in addressing single-choice questions, they encounter difficulties in handling multi-choice queries due to the heightened i…

    Submitted 26 December, 2023; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: This paper has been accepted by the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'24, FEBRUARY 20-27, 2024, VANCOUVER, CANADA)

  28. arXiv:2312.11276  [pdf, other]

    cs.CL

    Compositional Generalization for Multi-label Text Classification: A Data-Augmentation Approach

    Authors: Yuyang Chai, Zhuang Li, Jiahui Liu, Lei Chen, Fei Li, Donghong Ji, Chong Teng

    Abstract: Despite significant advancements in multi-label text classification, the ability of existing models to generalize to novel and seldom-encountered complex concepts, which are compositions of elementary ones, remains underexplored. This research addresses this gap. By creating unique data splits across three benchmarks, we assess the compositional generalization ability of existing multi-label text…

    Submitted 20 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI'24

  29. arXiv:2311.16926  [pdf, other]

    cs.CV

    LLaFS: When Large Language Models Meet Few-Shot Segmentation

    Authors: Lanyun Zhu, Tianrun Chen, Deyi Ji, Jieping Ye, Jun Liu

    Abstract: This paper proposes LLaFS, the first attempt to leverage large language models (LLMs) in few-shot segmentation. In contrast to the conventional few-shot segmentation methods that only rely on the limited and biased information from the annotated support images, LLaFS leverages the vast prior knowledge gained by LLM as an effective supplement and directly uses the LLM to segment images in a few-sho…

    Submitted 3 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted to CVPR2024

  30. arXiv:2310.02031  [pdf, other]

    cs.CL cs.AI cs.CE cs.LG cs.RO

    OceanGPT: A Large Language Model for Ocean Science Tasks

    Authors: Zhen Bi, Ningyu Zhang, Yida Xue, Yixin Ou, Daxiong Ji, Guozhou Zheng, Huajun Chen

    Abstract: Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed the paradigm in science. Despite the success in other domains, current LLMs often fall short in catering to the needs of domain experts like oceanographers, an…

    Submitted 3 September, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ACL2024. Project Website: http://oceangpt.zjukg.cn/

  31. arXiv:2308.04502  [pdf, other]

    cs.CL

    Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition

    Authors: Bobo Li, Hao Fei, Lizi Liao, Yu Zhao, Chong Teng, Tat-Seng Chua, Donghong Ji, Fei Li

    Abstract: It has been a hot research topic to enable machines to understand human emotions in multimodal contexts under dialogue scenarios, which is tasked with multimodal emotion analysis in conversation (MM-ERC). MM-ERC has received consistent attention in recent years, where a diverse range of methods has been proposed for securing better task performance. Most existing works treat MM-ERC as a standard m…

    Submitted 12 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  32. arXiv:2308.04498  [pdf, other]

    cs.CL

    DialogRE^C+: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in Dialogs

    Authors: Yiyun Xiong, Mengwei Dai, Fei Li, Hao Fei, Bobo Li, Shengqiong Wu, Donghong Ji, Chong Teng

    Abstract: Dialogue relation extraction (DRE) that identifies the relations between argument pairs in dialogue text, suffers much from the frequent occurrence of personal pronouns, or entity and speaker coreference. This work introduces a new benchmark dataset DialogRE^C+, introducing coreference resolution into the DRE scenario. With the aid of high-quality coreference knowledge, the reasoning of argument r…

    Submitted 12 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by NLPCC 2023

  33. arXiv:2308.04424  [pdf, other]

    cs.CL

    A Bi-directional Multi-hop Inference Model for Joint Dialog Sentiment Classification and Act Recognition

    Authors: Li Zheng, Fei Li, Yuyang Chai, Chong Teng, Donghong Ji

    Abstract: The joint task of Dialog Sentiment Classification (DSC) and Act Recognition (DAR) aims to predict the sentiment label and act label for each utterance in a dialog simultaneously. However, current methods encode the dialog context in only one direction, which limits their ability to thoroughly comprehend the context. Moreover, these methods overlook the explicit correlations between sentiment and a…

    Submitted 12 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by NLPCC 2023

  34. arXiv:2307.00711  [pdf, other]

    cs.CV

    Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation

    Authors: Deyi Ji, Feng Zhao, Hongtao Lu

    Abstract: Most existing ultra-high resolution (UHR) segmentation methods always struggle in the dilemma of balancing memory cost and local characterization accuracy, which are both taken into account in our proposed Guided Patch-Grouping Wavelet Transformer (GPWFormer) that achieves impressive performances. In this work, GPWFormer is a Transformer ($\mathcal{T}$)-CNN ($\mathcal{C}$) mutual learning framework…

    Submitted 5 July, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: Accepted to IJCAI 2023

  35. arXiv:2306.03975  [pdf, other]

    cs.CL

    Revisiting Conversation Discourse for Dialogue Disentanglement

    Authors: Bobo Li, Hao Fei, Fei Li, Shengqiong Wu, Lizi Liao, Yinwei Wei, Tat-Seng Chua, Donghong Ji

    Abstract: Dialogue disentanglement aims to detach the chronologically ordered utterances into several independent sessions. Conversation utterances are essentially organized and described by the underlying discourse, and thus dialogue disentanglement requires the full understanding and harnessing of the intrinsic discourse attribute. In this paper, we propose enhancing dialogue disentanglement by taking ful…

    Submitted 10 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: under review

  36. arXiv:2306.03974  [pdf, other]

    cs.CL

    TKDP: Threefold Knowledge-enriched Deep Prompt Tuning for Few-shot Named Entity Recognition

    Authors: Jiang Liu, Hao Fei, Fei Li, Jingye Li, Bobo Li, Liang Zhao, Chong Teng, Donghong Ji

    Abstract: Few-shot named entity recognition (NER) exploits limited annotated instances to identify named mentions. Effectively transferring the internal or external resources thus becomes the key to few-shot NER. While the existing prompt tuning methods have shown remarkable few-shot performances, they still fail to make full use of knowledge. In this work, we investigate the integration of rich knowledge t…

    Submitted 10 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: under review

  37. arXiv:2306.03969  [pdf, other]

    cs.CL

    ECQED: Emotion-Cause Quadruple Extraction in Dialogs

    Authors: Li Zheng, Donghong Ji, Fei Li, Hao Fei, Shengqiong Wu, Jingye Li, Bobo Li, Chong Teng

    Abstract: The existing emotion-cause pair extraction (ECPE) task, unfortunately, ignores extracting the emotion type and cause type, while this fine-grained meta-information can be practically useful in real-world applications, e.g., chat robots and empathic dialog generation. Also, the current ECPE is limited to the scenario of a single text piece, while neglecting the studies at dialog level that should hav…

    Submitted 10 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: under review

  38. arXiv:2305.17497  [pdf, other]

    cs.CL

    FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing

    Authors: Zhuang Li, Yuyang Chai, Terry Yue Zhuo, Lizhen Qu, Gholamreza Haffari, Fei Li, Donghong Ji, Quan Hung Tran

    Abstract: Textual scene graph parsing has become increasingly important in various vision-language applications, including image caption evaluation and image retrieval. However, existing scene graph parsers that convert image captions into scene graphs often suffer from two types of errors. First, the generated scene graphs fail to capture the true semantics of the captions or the corresponding images, resu…

    Submitted 1 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 9 pages, ACL 2023 (findings)

  39. arXiv:2305.10899  [pdf, other]

    cs.CV

    Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark

    Authors: Deyi Ji, Feng Zhao, Hongtao Lu, Mingyuan Tao, Jieping Ye

    Abstract: With the increasing interest and rapid development of methods for Ultra-High Resolution (UHR) segmentation, a large-scale benchmark covering a wide range of scenes with full fine-grained dense annotations is urgently needed to facilitate the field. To this end, the URUR dataset is introduced, in the meaning of Ultra-High Resolution dataset with Ultra-Rich Context. As the name suggests, URUR contai…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2023

  40. arXiv:2305.03944  [pdf, other]

    cs.CV

    Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation

    Authors: Deyi Ji, Haoran Wang, Mingyuan Tao, Jianqiang Huang, Xian-Sheng Hua, Hongtao Lu

    Abstract: Existing knowledge distillation works for semantic segmentation mainly focus on transferring high-level contextual knowledge from teacher to student. However, low-level texture knowledge is also of vital importance for characterizing the local structural pattern and global statistical property, such as boundary, smoothness, regularity and color contrast, which may not be well addressed by high-lev…

    Submitted 5 July, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2022

  41. On the Robustness of Aspect-based Sentiment Analysis: Rethinking Model, Data, and Training

    Authors: Hao Fei, Tat-Seng Chua, Chenliang Li, Donghong Ji, Meishan Zhang, Yafeng Ren

    Abstract: Aspect-based sentiment analysis (ABSA) aims at automatically inferring the specific sentiment polarities toward certain aspects of products or services behind the social media texts or reviews, which has been a fundamental application to the real-world society. Since the early 2010s, ABSA has achieved extraordinarily high accuracy with various deep neural models. However, existing ABSA models with…

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Accepted in ACM Transactions on Information Systems

    Journal ref: ACM Transactions on Information Systems, 2022, 41(2): 1-32

  42. arXiv:2304.09222  [pdf, other]

    astro-ph.GA astro-ph.CO

    BUFFALO/Flashlights: Constraints on the abundance of lensed supergiant stars in the Spock galaxy at redshift 1

    Authors: Jose M. Diego, Sung Kei Li, Ashish K. Meena, Anna Niemiec, Ana Acebron, Mathilde Jauzac, Mitchell F. Struble, Alfred Amruth, Tom J. Broadhurst, Catherine Cerny, Harald Ebeling, Alexei V. Filippenko, Eric Jullo, Patrick Kelly, Anton M. Koekemoer, David Lagatutta, Jeremy Lim, Marceau Limousin, Guillaume Mahler, Nency Patel, Juan Remolina, Johan Richard, Keren Sharon, Charles Steinhardt, Keichii Umetsu, et al. (5 additional authors not shown)

    Abstract: We present a constraint on the abundance of supergiant (SG) stars at redshift z approx. 1, based on recent observations of a strongly lensed arc at this redshift. First we derive a free-form model of MACS J0416.1-2403 using data from the BUFFALO program. The new lens model is based on 72 multiply lensed galaxies that produce 214 multiple images, making it the largest sample of spectroscopically co…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 24 pages & 18 figures

  43. arXiv:2302.13200  [pdf, other]

    physics.flu-dyn

    Interactions between two adjacent convection rolls in turbulent Rayleigh-Benard convection

    Authors: Eric Brown, Dandan Ji

    Abstract: Rayleigh-Bénard convection experiments were done with two adjacent cubic cells with a partial wall in between to force the generation of two interacting convection rolls. Observed stable states include both counter-rotating and co-rotating states. The stability of each of these states and their dynamics were modeled by stochastic ordinary differential equations of motion in terms of the orientat…

    Submitted 12 June, 2023; v1 submitted 25 February, 2023; originally announced February 2023.

    Comments: 28 pages, 17 figures

    Journal ref: Phys. Rev. Fluids 8, 064608 (2023)

  44. Booster Free From Spin Resonance For Future 100 km-scale Circular $e^{+}e^{-}$ Colliders

    Authors: Tao Chen, Zhe Duan, Daheng Ji, Dou Wang

    Abstract: Acceleration of polarized electron (positron) beams in a booster synchrotron may suffer from depolarization due to crossings of many spin depolarization resonances, which could limit its applications. We have studied the spin depolarization resonance structure of a 100 km scale booster lattice of the Circular Electron Positron Collider (CEPC). The lattice has 8 arc regions with hundreds of FODO ce…

    Submitted 6 June, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: 19 pages, 13 figures

  45. arXiv:2211.05705  [pdf, other]

    cs.CL

    DiaASQ: A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis

    Authors: Bobo Li, Hao Fei, Fei Li, Yuhan Wu, Jinsong Zhang, Shengqiong Wu, Jingye Li, Yijiang Liu, Lizi Liao, Tat-Seng Chua, Donghong Ji

    Abstract: The rapid development of aspect-based sentiment analysis (ABSA) within recent decades shows great potential for real-world society. The current ABSA works, however, are mostly limited to the scenario of a single text piece, leaving the study in dialogue contexts unexplored. To bridge the gap between fine-grained sentiment analysis and conversational opinion mining, in this work, we introduce a nov…

    Submitted 22 May, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted to Findings of ACL 2023

  46. Umehara algebra and complex submanifolds of indefinite complex space forms

    Authors: Xu Zhang, Donghai Ji

    Abstract: The Umehara algebra is studied with motivation from the problem of the non-existence of common complex submanifolds. In this paper, we prove some new results on the Umehara algebra and obtain some applications. In particular, if a complex manifold admits a holomorphic polynomial isometric immersion to one indefinite complex space form, then it cannot admit a holomorphic isometric immersion to another…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:1601.05907 by other authors

    Journal ref: Annals of Global Analysis and Geometry (2023)

  47. arXiv:2211.00684  [pdf, other]

    cs.CL cs.AI

    TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding Tag/Word Relations and More Fine-Grained Tags

    Authors: Jiang Liu, Donghong Ji, Jingye Li, Dongdong Xie, Chong Teng, Liang Zhao, Fei Li

    Abstract: So far, discontinuous named entity recognition (NER) has received increasing research attention and many related methods have surged such as hypergraph-based methods, span-based methods, and sequence-to-sequence (Seq2Seq) methods, etc. However, these methods more or less suffer from some problems such as decoding ambiguity and efficiency, which limit their performance. Recently, grid-tagging metho…

    Submitted 1 November, 2022; originally announced November 2022.

  48. arXiv:2210.16541  [pdf, other]

    cs.CL

    Entity-centered Cross-document Relation Extraction

    Authors: Fengqi Wang, Fei Li, Hao Fei, Jingye Li, Shengqiong Wu, Fangfang Su, Wenxuan Shi, Donghong Ji, Bo Cai

    Abstract: Relation Extraction (RE) is a fundamental task of information extraction, which has attracted a large amount of research attention. Previous studies focus on extracting the relations within a sentence or document, while currently researchers begin to explore cross-document RE. However, current cross-document RE methods directly utilize text snippets surrounding target entities in multiple given do…

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  49. arXiv:2210.07506  [pdf, other]

    cs.CV

    Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation

    Authors: Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan

    Abstract: We address a practical yet challenging problem of training robot agents to navigate in an environment following a path described by some language instructions. The instructions often contain descriptions of objects in the environment. To achieve accurate and efficient navigation, it is critical to build a map that accurately represents both spatial location and the semantic information of the envi…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  50. arXiv:2210.07505  [pdf, other]

    cs.CV cs.RO

    Learning Active Camera for Multi-Object Navigation

    Authors: Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan

    Abstract: Getting robots to navigate to multiple objects autonomously is essential yet difficult in robot applications. One of the key challenges is how to explore environments efficiently with camera sensors only. Existing navigation methods mainly focus on fixed cameras and few attempts have been made to navigate with active cameras. As a result, the agent may take a very long time to perceive the environ…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022