Zum Hauptinhalt springen

Showing 201–250 of 572 results for author: Ji, H

.
  1. arXiv:2210.13715  [pdf, other

    cs.CL cs.AI

    PALT: Parameter-Lite Transfer of Language Models for Knowledge Graph Completion

    Authors: Jianhao Shen, Chenguang Wang, Ye Yuan, Jiawei Han, Heng Ji, Koushik Sen, Ming Zhang, Dawn Song

    Abstract: This paper presents a parameter-lite transfer learning approach of pretrained language models (LM) for knowledge graph (KG) completion. Instead of finetuning, which modifies all LM parameters, we only tune a few new parameters while keeping the original LM parameters fixed. We establish this via reformulating KG completion as a "fill-in-the-blank" task, and introducing a parameter-lite encoder on… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  2. arXiv:2210.12810  [pdf, other

    cs.CL

    Code4Struct: Code Generation for Few-Shot Event Structure Prediction

    Authors: Xingyao Wang, Sha Li, Heng Ji

    Abstract: Large Language Model (LLM) trained on a mixture of text and code has demonstrated impressive capability in translating natural language (NL) into structured code. We observe that semantic structures can be conveniently translated into code and propose Code4Struct to leverage such text-to-structure translation capability to tackle structured prediction tasks. As a case study, we formulate Event Arg… ▽ More

    Submitted 24 May, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: ACL 2023

  3. arXiv:2210.12582  [pdf, other

    cs.CL cs.AI

    Language Model Pre-Training with Sparse Latent Typing

    Authors: Liliang Ren, Zixuan Zhang, Han Wang, Clare R. Voss, Chengxiang Zhai, Heng Ji

    Abstract: Modern large-scale Pre-trained Language Models (PLMs) have achieved tremendous success on a wide range of downstream tasks. However, most of the LM pre-training objectives only focus on text reconstruction, but have not sought to learn latent-level interpretable representations of sentences. In this paper, we manage to push the language models to obtain a deeper understanding of sentences by propo… ▽ More

    Submitted 26 October, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022 (Oral)

  4. arXiv:2210.12444  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    Weakly-Supervised Temporal Article Grounding

    Authors: Long Chen, Yulei Niu, Brian Chen, Xudong Lin, Guangxing Han, Christopher Thomas, Hammad Ayyubi, Heng Ji, Shih-Fu Chang

    Abstract: Given a long untrimmed video and natural language queries, video grounding (VG) aims to temporally localize the semantically-aligned video segments. Almost all existing VG work holds two simple but unrealistic assumptions: 1) All query sentences can be grounded in the corresponding video. 2) All query sentences for the same video are always at the same semantic scale. Unfortunately, both assumptio… ▽ More

    Submitted 23 February, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022, https://github.com/zjuchenlong/WSAG

  5. arXiv:2210.11768  [pdf, other

    cs.CL cs.AI

    Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation

    Authors: Ziqi Wang, Yuexin Wu, Frederick Liu, Daogao Liu, Le Hou, Hongkun Yu, Jing Li, Heng Ji

    Abstract: Knowledge distillation is one of the primary methods of transferring knowledge from large to small models. However, it requires massive task-specific data, which may not be plausible in many real-world applications. Data augmentation methods such as representation interpolation, token replacement, or augmentation with models are applied to tackle this problem. However, these data augmentation meth… ▽ More

    Submitted 10 March, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: 20 pages, 5 figures. Accepted by ICLR 2023

  6. arXiv:2210.10402  [pdf

    astro-ph.SR astro-ph.IM physics.space-ph

    Solar Ring Mission: Building a Panorama of the Sun and Inner-heliosphere

    Authors: Yuming Wang, Xianyong Bai, Changyong Chen, Linjie Chen, Xin Cheng, Lei Deng, Linhua Deng, Yuanyong Deng, Li Feng, Tingyu Gou, Jingnan Guo, Yang Guo, Xinjun Hao, Jiansen He, Junfeng Hou, Huang Jiangjiang, Zhenghua Huang, Haisheng Ji, Chaowei Jiang, Jie Jiang, Chunlan Jin, Xiaolei Li, Yiren Li, Jiajia Liu, Kai Liu , et al. (29 additional authors not shown)

    Abstract: Solar Ring (SOR) is a proposed space science mission to monitor and study the Sun and inner heliosphere from a full 360° perspective in the ecliptic plane. It will deploy three 120°-separated spacecraft on the 1-AU orbit. The first spacecraft, S1, locates 30° upstream of the Earth, the second, S2, 90° downstream, and the third, S3, completes the configuration. This design with necessary science in… ▽ More

    Submitted 23 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 41 pages, 6 figures, 1 table, to be published in Advances in Space Research

  7. arXiv:2210.08604  [pdf, other

    cs.CL cs.AI

    NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly

    Authors: Yi R. Fung, Tuhin Chakraborty, Hao Guo, Owen Rambow, Smaranda Muresan, Heng Ji

    Abstract: Norm discovery is important for understanding and reasoning about the acceptable behaviors and potential violations in human communication and interactions. We introduce NormSage, a framework for addressing the novel task of conversation-grounded multi-lingual, multi-cultural norm discovery, based on language model prompting and self-verification. NormSAGE leverages the expressiveness and implicit… ▽ More

    Submitted 13 January, 2024; v1 submitted 16 October, 2022; originally announced October 2022.

  8. arXiv:2210.07197  [pdf, other

    cs.CL

    Towards a Unified Multi-Dimensional Evaluator for Text Generation

    Authors: Ming Zhong, Yang Liu, Da Yin, Yuning Mao, Yizhu Jiao, Pengfei Liu, Chenguang Zhu, Heng Ji, Jiawei Han

    Abstract: Multi-dimensional evaluation is the dominant paradigm for human evaluation in Natural Language Generation (NLG), i.e., evaluating the generated text from multiple explainable dimensions, such as coherence and fluency. However, automatic evaluation in NLG is still dominated by similarity-based metrics, and we lack a reliable framework for a more comprehensive evaluation of advanced models. In this… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  9. arXiv:2210.06533  [pdf, other

    physics.plasm-ph astro-ph.HE

    Super-Fermi Acceleration in Multiscale MHD Reconnection

    Authors: Stephen Majeski, Hantao Ji

    Abstract: We investigate the Fermi acceleration of charged particles in 2D MHD anti-parallel plasmoid reconnection, finding a drastic enhancement in energization rate $\dot{\varepsilon}$ over a standard Fermi model of $\dot{\varepsilon} \sim \varepsilon$. The shrinking particle orbit width around a magnetic island due to $\vec{E}\times\vec{B}$ drift produces a… ▽ More

    Submitted 30 March, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: 7 pages, 7 figures

  10. arXiv:2210.05919  [pdf, other

    astro-ph.SR

    Multiwavelength observations of a partial filament eruption on 13 June 2011

    Authors: Yanjie Zhang, Qingmin Zhang, Jun Dai, Dong Li, Haisheng Ji

    Abstract: In this paper, we report the multiwavelength observations of the partial filament eruption associated with a C1.2 class flare in NOAA active region 11236 on 13 June 2011. The event occurred at the eastern limb in the field of view (FOV) of Atmospheric Imaging Assembly (AIA) on board the Solar Dynamics Observatory (SDO) spacecraft and was close to the disk center in the FOV of Extreme-UltraViolet I… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 18 pages, 7 figures, accepted by Solar Physics (SoPh)

  11. arXiv:2210.04405  [pdf, other

    math.AP

    Finite-time self-similar rupture in a generalized elastohydrodynamic lubrication model

    Authors: William Chang, Hanjie Ji

    Abstract: Thin film rupture is a type of nonlinear instability that causes the solution to touch down to zero at finite time. We investigate the finite-time rupture behavior of a generalized elastohydrodynamic lubrication model. This model features the interplay between destabilizing disjoining pressure and stabilizing elastic bending pressure and surface tension. The governing equation is a sixth-order non… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

    MSC Class: 76A20

  12. arXiv:2210.04287  [pdf, other

    cs.CV

    Learning to Decompose Visual Features with Latent Textual Prompts

    Authors: Feng Wang, Manling Li, Xudong Lin, Hairong Lv, Alexander G. Schwing, Heng Ji

    Abstract: Recent advances in pre-training vision-language models like CLIP have shown great potential in learning transferable visual representations. Nonetheless, for downstream inference, CLIP-like models suffer from either 1) degraded accuracy and robustness in the case of inaccurate text descriptions during retrieval-based inference (the challenge for zero-shot protocol); or 2) breaking the well-establi… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  13. arXiv:2210.00185  [pdf, other

    cs.CL

    Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks

    Authors: Zhenhailong Wang, Xiaoman Pan, Dian Yu, Dong Yu, Jianshu Chen, Heng Ji

    Abstract: Although large language models have achieved impressive zero-shot ability, the huge model size generally incurs high cost. Recently, semi-parametric language models, which augment a smaller language model with an external retriever, have demonstrated promising language modeling capabilities. However, it remains unclear whether such semi-parametric language models can perform competitively well as… ▽ More

    Submitted 22 May, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: Accepted as a conference paper at Findings of ACL 2023

  14. arXiv:2209.15354  [pdf

    physics.optics

    Design of Partially Etched GaP-OI Microresonators for Two-Color Kerr Soliton Generation at NIR and MIR

    Authors: Houling Ji, Zhaoting Geng, Weiren Cheng, Zhuoyu Yu, Pengzhuo Wu, Yi Li, Qiancheng Zhao

    Abstract: We present and theoretically investigate a dispersion engineered GaP-OI microresonator containing a partially-etched gap of 250 nm x 410 nm in a 600 nm x 2990 nm waveguide. This gap enables a 3.25 μm wide anomalous dispersion spectral span covering both the near-infrared and the mid-infrared spectra. This anomalous dispersion is manifested by two mechanisms, being the hybridization of the fundamen… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

  15. Ion and Electron Acoustic Bursts during Anti-Parallel Magnetic Reconnection Driven by Lasers

    Authors: Shu Zhang, Abraham Chien, Lan Gao, Hantao Ji, Eric G. Blackman, Russ Follett, Dustin H. Froula, Joseph Katz, Chikang Li, Andrew Birkel, Richard Petrasso, John Moody, Hui Chen

    Abstract: Magnetic reconnection converts magnetic energy into thermal and kinetic energy in plasma. Among numerous candidate mechanisms, ion acoustic instabilities driven by the relative drift between ions and electrons, or equivalently electric current, have been suggested to play a critical role in dissipating magnetic energy in collisionless plasmas. However, their existence and effectiveness during reco… ▽ More

    Submitted 29 March, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

  16. arXiv:2209.09696  [pdf

    eess.IV q-bio.NC

    Synthesis of realistic fetal MRI with conditional Generative Adversarial Networks

    Authors: Marina Fernandez Garcia, Rodrigo Gonzalez Laiz, Hui Ji, Kelly Payette, Andras Jakab

    Abstract: Fetal brain magnetic resonance imaging serves as an emerging modality for prenatal counseling and diagnosis in disorders affecting the brain. Machine learning based segmentation plays an important role in the quantification of brain development. However, a limiting factor is the lack of sufficiently large, labeled training data. Our study explored the application of SPADE, a conditional general ad… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  17. arXiv:2209.09104  [pdf, other

    cs.CV cs.AI cs.LG

    VS-CAM: Vertex Semantic Class Activation Mapping to Interpret Vision Graph Neural Network

    Authors: Zhenpeng Feng, Xiyang Cui, Hongbing Ji, Mingzhe Zhu, Ljubisa Stankovic

    Abstract: Graph convolutional neural network (GCN) has drawn increasing attention and attained good performance in various computer vision tasks, however, there lacks a clear interpretation of GCN's inner mechanism. For standard convolutional neural networks (CNNs), class activation mapping (CAM) methods are commonly used to visualize the connection between CNN's decision and image region by generating a he… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 10 pages, 10 figures

  18. arXiv:2209.08679  [pdf, other

    cs.CL

    Dynamic Global Memory for Document-level Argument Extraction

    Authors: Xinya Du, Sha Li, Heng Ji

    Abstract: Extracting informative arguments of events from news articles is a challenging problem in information extraction, which requires a global contextual understanding of each document. While recent work on document-level extraction has gone beyond single-sentence and increased the cross-sentence inference capability of end-to-end models, they are still restricted by certain input sequence length const… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: ACL 2022 main conference (12 pages)

  19. arXiv:2209.08457  [pdf, other

    astro-ph.IM astro-ph.HE physics.flu-dyn

    Observation of axisymmetric standard magnetorotational instability in the laboratory

    Authors: Yin Wang, Erik P. Gilson, Fatima Ebrahimi, Jeremy Goodman, Hantao Ji

    Abstract: We report the first direct evidence for the axisymmetric standard magnetorotational instability (SMRI) from a combined experimental and numerical study of a magnetized liquid-metal shear flow in a Taylor-Couette cell with independently rotating and electrically conducting end caps. When a uniform vertical magnetic field $B_i$ is applied along the rotation axis, the measured radial magnetic field… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 10 pages; 11 figures

    Journal ref: Physical Review Letters 129, 115001 (2022)

  20. arXiv:2209.08410  [pdf, other

    physics.plasm-ph astro-ph.HE astro-ph.IM physics.flu-dyn

    Identification of a non-axisymmetric mode in laboratory experiments searching for standard magnetorotational instability

    Authors: Yin Wang, Erik P. Gilson, Fatima Ebrahimi, Jeremy Goodman, Kyle J. Caspary, Himawan W. Winarto, Hantao Ji

    Abstract: The standard magnetorotational instability (SMRI) is a promising mechanism for turbulence and rapid accretion in astrophysical disks. It is a magnetohydrodynamic (MHD) instability that destabilizes otherwise hydrodynamically stable disk flow. Due to its microscopic nature at astronomical distances and stringent requirements in laboratory experiments, SMRI has remained unconfirmed since its proposa… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 15 pages, 16 figures

    Journal ref: Nature Communications 13, 4679 (2022)

  21. arXiv:2209.03611  [pdf, other

    astro-ph.IM astro-ph.SR physics.plasm-ph physics.space-ph

    Advancing Theory and Modeling Efforts in Heliophysics

    Authors: Fan Guo, Spiro Antiochos, Paul Cassak, Bin Chen, Xiaohang Chen, Chuanfei Dong, Cooper Downs, Joe Giacalone, Colby C. Haggerty, Hantao Ji, Judith Karpen, James Klimchuk, Wen Li, Xiaocan Li, Mitsuo Oka, Katharine K. Reeves, Marc Swisdak, Weichao Tu

    Abstract: Heliophysics theory and modeling build understanding from fundamental principles to motivate, interpret, and predict observations. Together with observational analysis, they constitute a comprehensive scientific program in heliophysics. As observations and data analysis become increasingly detailed, it is critical that theory and modeling develop more quantitative predictions and iterate with obse… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: White paper submitted to Heliophysics 2024 Decadal Survey

  22. arXiv:2209.02071  [pdf, other

    cs.CL

    CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval

    Authors: Kung-Hsiang Huang, ChengXiang Zhai, Heng Ji

    Abstract: Fact-checking has gained increasing attention due to the widespread of falsified information. Most fact-checking approaches focus on claims made in English only due to the data scarcity issue in other languages. The lack of fact-checking datasets in low-resource languages calls for an effective cross-lingual transfer technique for fact-checking. Additionally, trustworthy information in different l… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: Accepted by COLING 2022

  23. arXiv:2209.01988  [pdf, other

    cs.CV

    A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-Rays

    Authors: Haoqin Ji, Haozhe Liu, Yuexiang Li, Jinheng Xie, Nanjun He, Yawen Huang, Dong Wei, Xinrong Chen, Linlin Shen, Yefeng Zheng

    Abstract: Accurate abnormality localization in chest X-rays (CXR) can benefit the clinical diagnosis of various thoracic diseases. However, the lesion-level annotation can only be performed by experienced radiologists, and it is tedious and time-consuming, thus difficult to acquire. Such a situation results in a difficulty to develop a fully-supervised abnormality localization system for CXR. In this regard… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: Accepted by MICCAI-2022

  24. arXiv:2209.00068  [pdf, other

    cs.CL cs.LG

    Incorporating Task-specific Concept Knowledge into Script Learning

    Authors: Chenkai Sun, Tie Xu, ChengXiang Zhai, Heng Ji

    Abstract: In this paper, we present Tetris, a new task of Goal-Oriented Script Completion. Unlike previous work, it considers a more realistic and general setting, where the input includes not only the goal but also additional user context, including preferences and history. To address this problem, we propose a novel approach, which uses two techniques to improve performance: (1) concept prompting, and (2)… ▽ More

    Submitted 23 April, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

  25. arXiv:2208.12306  [pdf, other

    cs.CL cs.AI cs.CV

    Multimedia Generative Script Learning for Task Planning

    Authors: Qingyun Wang, Manling Li, Hou Pong Chan, Lifu Huang, Julia Hockenmaier, Girish Chowdhary, Heng Ji

    Abstract: Goal-oriented generative script learning aims to generate subsequent steps to reach a particular goal, which is an essential task to assist robots or humans in performing stereotypical activities. An important aspect of this process is the ability to capture historical states visually, which provides detailed information that is not covered by text and will guide subsequent steps. Therefore, we pr… ▽ More

    Submitted 10 July, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: 21 pages, Accepted by Findings of the Association for Computational Linguistics: ACL 2023, Code and Resources at https://github.com/EagleW/Multimedia-Generative-Script-Learning

  26. arXiv:2208.05035  [pdf, ps, other

    eess.SP cs.LG cs.NI

    Adaptive Target-Condition Neural Network: DNN-Aided Load Balancing for Hybrid LiFi and WiFi Networks

    Authors: Han Ji, Qiang Wang, Stephen J. Redmond, Iman Tavakkolnia, Xiping Wu

    Abstract: Load balancing (LB) is a challenging issue in the hybrid light fidelity (LiFi) and wireless fidelity (WiFi) networks (HLWNets), due to the nature of heterogeneous access points (APs). Machine learning has the potential to provide a complexity-friendly LB solution with near-optimal network performance, at the cost of a training process. The state-of-the-art (SOTA) learning-aided LB methods, however… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: 13 pages, 9 figures, and 4 tables, submitted to IEEE JSAC SI-BeyondShannon

  27. arXiv:2207.08808  [pdf, other

    cs.CV

    Global-Local Stepwise Generative Network for Ultra High-Resolution Image Restoration

    Authors: Xin Feng, Haobo Ji, Wenjie Pei, Fanglin Chen, Guangming Lu

    Abstract: While the research on image background restoration from regular size of degraded images has achieved remarkable progress, restoring ultra high-resolution (e.g., 4K) images remains an extremely challenging task due to the explosion of computational complexity and memory usage, as well as the deficiency of annotated data. In this paper we present a novel model for ultra high-resolution image restora… ▽ More

    Submitted 17 May, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  28. arXiv:2206.12489  [pdf, other

    eess.AS cs.SD

    Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models

    Authors: Hang Ji, Tanvina Patel, Odette Scharenborg

    Abstract: In this work, we analyzed and compared speech representations extracted from different frozen self-supervised learning (SSL) speech pre-trained models on their ability to capture articulatory features (AF) information and their subsequent prediction of phone recognition performance for within and across language scenarios. Specifically, we compared CPC, wav2vec 2.0, and HuBert. First, frame-level… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: Submitted to INTERSPEECH 2022

  29. Sunspot shearing and sudden retraction motion associated with the 2013 August 17 M3.3 Flare

    Authors: Yanjie Zhang, Zhe Xu, Qingmin Zhang, Jun Dai, Haisheng Ji

    Abstract: In this Letter, we give a detailed analysis to the M3.3 class flare that occurred on August 17, 2013 (SOL2013-08-17T18:16). It presents a clear picture of mutual magnetic interaction initially from the photosphere to the corona via the abrupt rapid shearing motion of a small sunspot before the flare, and then suddenly from the corona back to the photosphere via the sudden retraction motion of the… ▽ More

    Submitted 18 June, 2022; originally announced June 2022.

  30. arXiv:2206.07296  [pdf, other

    cs.CL

    Enhanced Knowledge Selection for Grounded Dialogues via Document Semantic Graphs

    Authors: Sha Li, Mahdi Namazifar, Di Jin, Mohit Bansal, Heng Ji, Yang Liu, Dilek Hakkani-Tur

    Abstract: Providing conversation models with background knowledge has been shown to make open-domain dialogues more informative and engaging. Existing models treat knowledge selection as a sentence ranking or classification problem where each sentence is handled individually, ignoring the internal semantic connection among sentences in the background document. In this work, we propose to automatically conve… ▽ More

    Submitted 30 June, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: NAACL 2022. Please refer to https://www.amazon.science/publications/enhanced-knowledge-selection-for-grounded-dialogues-via-document-semantic-graphs for code and resources

  31. arXiv:2206.02921  [pdf, other

    cs.LG cs.AI cs.CL

    Schema-Guided Event Graph Completion

    Authors: Hongwei Wang, Zixuan Zhang, Sha Li, Jiawei Han, Yizhou Sun, Hanghang Tong, Joseph P. Olive, Heng Ji

    Abstract: We tackle a new task, event graph completion, which aims to predict missing event nodes for event graphs. Existing link prediction or graph completion methods have difficulty dealing with event graphs because they are usually designed for a single large graph such as a social network or a knowledge graph, rather than multiple small dynamic event graphs. Moreover, they can only predict missing edge… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  32. arXiv:2206.02712  [pdf, other

    cs.CL

    Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation

    Authors: Pei Ke, Haozhe Ji, Zhenyu Yang, Yi Huang, Junlan Feng, Xiaoyan Zhu, Minlie Huang

    Abstract: Despite the success of text-to-text pre-trained models in various natural language generation (NLG) tasks, the generation performance is largely restricted by the number of labeled data in downstream tasks, particularly in data-to-text generation tasks. Existing works mostly utilize abundant unlabeled structured data to conduct unsupervised pre-training for task adaption, which fail to model the c… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted by IJCAI 2022

  33. arXiv:2206.02082  [pdf, other

    cs.CV

    Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval

    Authors: Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu Chang

    Abstract: Multi-channel video-language retrieval require models to understand information from different channels (e.g. video$+$question, video$+$speech) to correctly link a video with a textual response or query. Fortunately, contrastive multimodal models are shown to be highly effective at aligning entities in images/videos and text, e.g., CLIP; text contrastive models are extensively studied recently for… ▽ More

    Submitted 10 April, 2023; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: To appear in CVPR 2023; The code will be released at https://github.com/XudongLinthu/upgradable-multimodal-intelligence

  34. arXiv:2205.14847  [pdf, other

    cs.CL

    EA$^2$E: Improving Consistency with Event Awareness for Document-Level Argument Extraction

    Authors: Qi Zeng, Qiusi Zhan, Heng Ji

    Abstract: Events are inter-related in documents. Motivated by the one-sense-per-discourse theory, we hypothesize that a participant tends to play consistent roles across multiple events in the same document. However recent work on document-level event argument extraction models each individual event in isolation and therefore causes inconsistency among extracted arguments across events, which will further c… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: NAACL 2022 Findings

  35. arXiv:2205.13294  [pdf, other

    cs.CV eess.IV eess.SP

    Analytical Interpretation of Latent Codes in InfoGAN with SAR Images

    Authors: Zhenpeng Feng, Milos Dakovic, Hongbing Ji, Mingzhe Zhu, Ljubisa Stankovic

    Abstract: Generative Adversarial Networks (GANs) can synthesize abundant photo-realistic synthetic aperture radar (SAR) images. Some recent GANs (e.g., InfoGAN), are even able to edit specific properties of the synthesized images by introducing latent codes. It is crucial for SAR image synthesis since the targets in real SAR images are with different properties due to the imaging mechanism. Despite the succ… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: 13 pages, 14 figures

  36. arXiv:2205.11602  [pdf, other

    cs.CL

    Seeded Hierarchical Clustering for Expert-Crafted Taxonomies

    Authors: Anish Saha, Amith Ananthram, Emily Allaway, Heng Ji, Kathleen McKeown

    Abstract: Practitioners from many disciplines (e.g., political science) use expert-crafted taxonomies to make sense of large, unlabeled corpora. In this work, we study Seeded Hierarchical Clustering (SHC): the task of automatically fitting unlabeled data to such taxonomies using only a small set of labeled examples. We propose HierSeed, a novel weakly supervised algorithm for this task that uses only a smal… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  37. arXiv:2205.10977  [pdf, other

    cs.CL cs.HC

    What should I Ask: A Knowledge-driven Approach for Follow-up Questions Generation in Conversational Surveys

    Authors: Yubin Ge, Ziang Xiao, Jana Diesner, Heng Ji, Karrie Karahalios, Hari Sundaram

    Abstract: Generating follow-up questions on the fly could significantly improve conversational survey quality and user experiences by enabling a more dynamic and personalized survey structure. In this paper, we proposed a novel task for knowledge-driven follow-up question generation in conversational surveys. We constructed a new human-annotated dataset of human-written follow-up questions with dialogue his… ▽ More

    Submitted 13 October, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

  38. arXiv:2205.10747  [pdf, other

    cs.CV cs.AI

    Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

    Authors: Zhenhailong Wang, Manling Li, Ruochen Xu, Luowei Zhou, Jie Lei, Xudong Lin, Shuohang Wang, Ziyi Yang, Chenguang Zhu, Derek Hoiem, Shih-Fu Chang, Mohit Bansal, Heng Ji

    Abstract: The goal of this work is to build flexible video-language models that can generalize to various video-to-text tasks from few examples, such as domain-specific captioning, question answering, and future event prediction. Existing few-shot video-language learners focus exclusively on the encoder, resulting in the absence of a video-to-text decoder to handle generative tasks. Video captioners have be… ▽ More

    Submitted 13 October, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

  39. arXiv:2205.07466  [pdf, other

    cs.CV cs.AI

    Robust Representation via Dynamic Feature Aggregation

    Authors: Haozhe Liu, Haoqin Ji, Yuexiang Li, Nanjun He, Haoqian Wu, Feng Liu, Linlin Shen, Yefeng Zheng

    Abstract: Deep convolutional neural network (CNN) based models are vulnerable to the adversarial attacks. One of the possible reasons is that the embedding space of CNN based model is sparse, resulting in a large space for the generation of adversarial samples. In this study, we propose a method, denoted as Dynamic Feature Aggregation, to compress the embedding space with a novel regularization. Particularl… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

  40. arXiv:2205.00463  [pdf, other

    eess.IV cs.CV math.NA

    A Dataset-free Deep learning Method for Low-Dose CT Image Reconstruction

    Authors: Qiaoqiao Ding, Hui Ji, Yuhui Quan, Xiaoqun Zhang

    Abstract: Low-dose CT (LDCT) imaging attracted a considerable interest for the reduction of the object's exposure to X-ray radiation. In recent years, supervised deep learning (DL) has been extensively studied for LDCT image reconstruction, which trains a network over a dataset containing many pairs of normal-dose and low-dose images. However, the challenge on collecting many such pairs in the clinical setu… ▽ More

    Submitted 5 October, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

  41. arXiv:2204.11817  [pdf, other

    cs.CL cs.AI

    Translation between Molecules and Natural Language

    Authors: Carl Edwards, Tuan Lai, Kevin Ros, Garrett Honke, Kyunghyun Cho, Heng Ji

    Abstract: We present $\textbf{MolT5}$ $-$ a self-supervised learning framework for pretraining models on a vast amount of unlabeled natural language text and molecule strings. $\textbf{MolT5}$ allows for new, useful, and challenging analogs of traditional vision-language tasks, such as molecule captioning and text-based de novo molecule generation (altogether: translation between molecules and language), wh… ▽ More

    Submitted 3 November, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted at EMNLP 2022. Data and code can be found on [Github](https://github.com/blender-nlp/MolT5)

  42. Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information Retrieval

    Authors: Revanth Gangi Reddy, Md Arafat Sultan, Martin Franz, Avirup Sil, Heng Ji

    Abstract: We show that supervised neural information retrieval (IR) models are prone to learning sparse attention patterns over passage tokens, which can result in key phrases including named entities receiving low attention weights, eventually leading to model under-performance. Using a novel targeted synthetic data generation method that identifies poorly attended entities and conditions the generation ep… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: Published at SIGIR 2022

  43. arXiv:2204.10502  [pdf, other

    cs.SE

    LiDetector: License Incompatibility Detection for Open Source Software

    Authors: Sihan Xu, Ya Gao, Lingling Fan, Zheli Liu, Yang Liu, Hua Ji

    Abstract: Open-source software (OSS) licenses dictate the conditions which should be followed to reuse, distribute, and modify the software. Apart from widely-used licenses such as the MIT License, developers are also allowed to customize their own licenses (called custom licenses), whose descriptions are more flexible. The presence of such various licenses imposes challenges to understanding licenses and t… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

  44. Fetal Brain Tissue Annotation and Segmentation Challenge Results

    Authors: Kelly Payette, Hongwei Li, Priscille de Dumast, Roxane Licandro, Hui Ji, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Hao Liu, Yuchen Pei, Lisheng Wang, Ying Peng, Juanying Xie, Huiquan Zhang, Guiming Dong, Hao Fu, Guotai Wang, ZunHyan Rieu, Donghyeon Kim, Hyun Gi Kim, Davood Karimi, Ali Gholipour, Helena R. Torres, Bruno Oliveira, João L. Vilaça , et al. (33 additional authors not shown)

    Abstract: In-utero fetal MRI is emerging as an important tool in the diagnosis and analysis of the developing human brain. Automatic segmentation of the developing fetal brain is a vital step in the quantitative analysis of prenatal neurodevelopment both in the research and clinical context. However, manual segmentation of cerebral structures is time-consuming and prone to error and inter-observer variabili… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: Results from FeTA Challenge 2021, held at MICCAI; Manuscript submitted

  45. arXiv:2204.07700  [pdf, other

    physics.plasm-ph

    Two-dimensional plasma density evolution local to the inversion layer during sawtooth crash events using Beam Emission Spectroscopy

    Authors: Sayak Bose, William Fox, Dingyun Liu, Zheng Yan, George McKee, Aaron Goodman, Hantao Ji

    Abstract: We present methods for analyzing Beam Emission Spectroscopy (BES) data to obtain the plasma density evolution associated with rapid sawtooth crash events at the DIII-D tokamak. BES allows coverage over a 2-D spatial plane, inherently local measurements, with fast time responses, and therefore provides a valuable new channel for data during sawtooth events. A method is developed to remove sawtooth-… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

  46. arXiv:2204.07341  [pdf, other

    cs.CL

    LaMemo: Language Modeling with Look-Ahead Memory

    Authors: Haozhe Ji, Rongsheng Zhang, Zhenyu Yang, Zhipeng Hu, Minlie Huang

    Abstract: Although Transformers with fully connected self-attentions are powerful to model long-term dependencies, they are struggling to scale to long texts with thousands of words in language modeling. One of the solutions is to equip the model with a recurrence memory. However, existing approaches directly reuse hidden states from the previous segment that encodes contexts in a uni-directional way. As a… ▽ More

    Submitted 26 April, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted by NAACL 2022

  47. Enhancing Digital Health Services: A Machine Learning Approach to Personalized Exercise Goal Setting

    Authors: Ji Fang, Vincent CS Lee, Hao Ji, Haiyan Wang

    Abstract: The utilization of digital health has increased recently, and these services provide extensive guidance to encourage users to exercise frequently by setting daily exercise goals to promote a healthy lifestyle. These comprehensive guides evolved from the consideration of various personalized behavioral factors. Nevertheless, existing approaches frequently neglect the users dynamic behavior and the… ▽ More

    Submitted 4 March, 2024; v1 submitted 2 April, 2022; originally announced April 2022.

  48. Statistical analysis of circular-ribbon flares

    Authors: Yanjie Zhang, Qingmin Zhang, Dechao Song, Shuting Li, Jun Dai, Zhe Xu, Haisheng Ji

    Abstract: Circular-ribbon flares (CFs) are a special type of solar flares owing to their particular magnetic topology. In this paper, we conducted a comprehensive statistical analysis of 134 CFs from 2011 September to 2017 June, including four B-class, 82 C-class, 40 M-class, and eight X-class flares, respectively. The flares were observed by the Atmospheric Imaging Assembly (AIA) on board the Solar Dynamic… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: 17 pages, 22 figures, accepted for publication in The Astrophysical Journal Supplement Series, comments are welcome

  49. arXiv:2203.06879  [pdf, ps, other

    cond-mat.supr-con

    Planckian Dissipation and non-Ginzburg-Landau Type Upper Critical Field in Bi2201

    Authors: Qihao Zang, Zhengyan Zhu, Zuyu Xu, Shichao Qi, Haoran Ji, Yiwen Li, Jian Wang, Huiqian Luo, Hua-Bing Wang, Hai-Hu Wen

    Abstract: Resistivity and Hall effect measurements have been carried out on a micro-fabricated bridge of Bi2201 single crystal at low temperatures down to 0.4 K under high magnetic fields. When superconductivity is crashed by a high magnetic field, the recovered "normal state" resistivity still shows a linear temperature dependence in low temperature region. Combining with the effective mass and the charge… ▽ More

    Submitted 22 February, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: 8 pages, 4 figures

    Journal ref: Sci. China-Phys. Mech. Astron. 66, 237412 (2023)

  50. arXiv:2203.05967  [pdf, other

    cs.SI cs.CL

    A Weibo Dataset for the 2022 Russo-Ukrainian Crisis

    Authors: Yi R. Fung, Heng Ji

    Abstract: Online social networks such as Twitter and Weibo play an important role in how people stay informed and exchange reactions. Each crisis encompasses a new opportunity to study the portability of models for various tasks (e.g., information extraction, complex event understanding, misinformation detection, etc.), due to differences in domain, entities, and event types. We present the Russia-Ukraine C… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Russia-Ukraine Crisis, Weibo Dataset