
Showing 1–50 of 250 results for author: Ni, J

  1. arXiv:2408.14145  [pdf, ps, other]

    math.AP

    Global well-posedness and decay rates of strong solutions to the incompressible Vlasov-MHD system

    Authors: Fucai Li, Jinkai Ni, Man Wu

    Abstract: In this paper, we study the global well-posedness and decay rates of strong solutions to an incompressible Vlasov-MHD model arising in magnetized plasmas. This model is consist of the Vlasov equation and the incompressible magnetohydrodynamic equations which interacts together via the Lorentz forces. It is readily to verify that it has two equilibria $(\bar f,\bar u,\bar B)=(0,0,0)$ and…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 34 pages

  2. arXiv:2408.14121  [pdf, ps, other]

    math.AP

    Global existence and time decay of strong solutions to a fluid-particle coupled model with energy exchanges

    Authors: Fucai Li, Jinkai Ni, Man Wu

    Abstract: In this paper, we investigate a three-dimensional fluid-particle coupled model. This model combines the full compressible Navier-Stokes equations with the Vlasov-Fokker-Planck equation via the momentum and energy exchanges. We obtain the global existence and optimal time decay rates of strong solutions to the model in whole space $\mathbb{R}^3$ when the initial dat…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 45 pages

    MSC Class: 35Q83; 76N10; 35B40

  3. arXiv:2408.11799  [pdf, other]

    cs.CL

    Practical token pruning for foundation models in few-shot conversational virtual assistant systems

    Authors: Haode Qi, Cheng Qian, Jian Ni, Pratyush Singh, Reza Fazeli, Gengyu Wang, Zhongzheng Shu, Eric Wayne, Juergen Bross

    Abstract: In an enterprise Virtual Assistant (VA) system, intent classification is the crucial component that determines how a user input is handled based on what the user wants. The VA system is expected to be a cost-efficient SaaS service with low training and inference time while achieving high accuracy even with a small number of training samples. We pretrain a transformer-based sentence embedding model…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 6 pages, 3 figures

  4. arXiv:2408.02035  [pdf, other]

    cs.CR

    Robustness of Watermarking on Text-to-Image Diffusion Models

    Authors: Xiaodong Wu, Xiangman Li, Jianbing Ni

    Abstract: Watermarking has become one of promising techniques to not only aid in identifying AI-generated images but also serve as a deterrent against the unethical use of these models. However, the robustness of watermarking techniques has not been extensively studied recently. In this paper, we investigate the robustness of generative watermarking, which is created from the integration of watermarking emb…

    Submitted 4 August, 2024; originally announced August 2024.

  5. arXiv:2407.20108  [pdf, other]

    eess.IV cs.AI cs.CV

    Classification, Regression and Segmentation directly from k-Space in Cardiac MRI

    Authors: Ruochen Li, Jiazhen Pan, Youxiang Zhu, Juncheng Ni, Daniel Rueckert

    Abstract: Cardiac Magnetic Resonance Imaging (CMR) is the gold standard for diagnosing cardiovascular diseases. Clinical diagnoses predominantly rely on magnitude-only Digital Imaging and Communications in Medicine (DICOM) images, omitting crucial phase information that might provide additional diagnostic benefits. In contrast, k-space is complex-valued and encompasses both magnitude and phase information,…

    Submitted 29 July, 2024; originally announced July 2024.

  6. arXiv:2407.16933  [pdf, other]

    eess.SY cs.LG

    Deep Koopman-based Control of Quality Variation in Multistage Manufacturing Systems

    Authors: Zhiyi Chen, Harshal Maske, Devesh Upadhyay, Huanyi Shui, Xun Huan, Jun Ni

    Abstract: This paper presents a modeling-control synthesis to address the quality control challenges in multistage manufacturing systems (MMSs). A new feedforward control scheme is developed to minimize the quality variations caused by process disturbances in MMSs. Notably, the control framework leverages a stochastic deep Koopman (SDK) model to capture the quality propagation mechanism in the MMSs, highlig…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: The paper was in the proceeding of 2024 American Control Conference. This submitted version addresses a minor correction to one equation (Eq. 14), while the results and conclusions remain the same

  7. arXiv:2406.17621  [pdf]

    cond-mat.soft cond-mat.mes-hall

    Quasiphase transition of a single-file water chain influenced by atomic charges in water model using orientational-biased replica exchange Monte Carlo simulations

    Authors: Liang Zhao, Junqing Ni, Zhi Zhu, Yusong Tu, Chunlei Wang

    Abstract: The recently observed temperature-dependent quasiphase transition of the single-file water chain confined within a carbon nanotube in experiments has been validated by simple lattice theory and molecular dynamic simulations. It has been pointed out that atomic charges in water model is an important issue, yet how the values will affect the structural details and thermodynamic properties of the qua…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 14 pages and 7 figures in Main text, 4 figures in Appendix

  8. arXiv:2406.16715  [pdf, other]

    cs.LG

    GC-Bench: A Benchmark Framework for Graph Condensation with New Insights

    Authors: Shengbo Gong, Juntong Ni, Noveen Sachdeva, Carl Yang, Wei Jin

    Abstract: Graph condensation (GC) is an emerging technique designed to learn a significantly smaller graph that retains the essential information of the original graph. This condensed graph has shown promise in accelerating graph neural networks while preserving performance comparable to those achieved with the original, larger graphs. Additionally, this technique facilitates downstream applications such as…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 9 pages

  9. arXiv:2406.15658  [pdf, other]

    cs.CV cs.AI

    TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning

    Authors: Nemin Wu, Qian Cao, Zhangyu Wang, Zeping Liu, Yanlin Qi, Jielu Zhang, Joshua Ni, Xiaobai Yao, Hongxu Ma, Lan Mu, Stefano Ermon, Tanuja Ganu, Akshay Nambi, Ni Lao, Gengchen Mai

    Abstract: Spatial representation learning (SRL) aims at learning general-purpose neural network representations from various types of spatial data (e.g., points, polylines, polygons, networks, images, etc.) in their native formats. Learning good spatial representations is a fundamental problem for various downstream applications such as species distribution modeling, weather forecasting, trajectory generati…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures. Submitted to NeurIPS 2024 Datasets and Benchmarks Track. Under review

  10. arXiv:2406.14162  [pdf, other]

    cs.IR cs.AI cs.CL

    DIRAS: Efficient LLM-Assisted Annotation of Document Relevance in Retrieval Augmented Generation

    Authors: Jingwei Ni, Tobias Schimanski, Meihong Lin, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: Retrieval Augmented Generation (RAG) is widely employed to ground responses to queries on domain-specific documents. But do RAG implementations leave out important information or excessively include irrelevant information? To allay these concerns, it is necessary to annotate domain-specific benchmarks to evaluate information retrieval (IR) performance, as relevance definitions vary across queries…

    Submitted 20 June, 2024; originally announced June 2024.

  11. arXiv:2406.09818  [pdf, other]

    cs.IR

    ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures

    Authors: Tobias Schimanski, Jingwei Ni, Roberto Spacey, Nicola Ranger, Markus Leippold

    Abstract: To handle the vast amounts of qualitative data produced in corporate climate communication, stakeholders increasingly rely on Retrieval Augmented Generation (RAG) systems. However, a significant gap remains in evaluating domain-specific information retrieval - the basis for answer generation. To address this challenge, this work simulates the typical tasks of a sustainability analyst by examining…

    Submitted 17 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  12. arXiv:2406.08380  [pdf, other]

    cs.CL cs.SD eess.AS

    Towards Unsupervised Speech Recognition Without Pronunciation Models

    Authors: Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo

    Abstract: Recent advancements in supervised automatic speech recognition (ASR) have achieved remarkable performance, largely due to the growing availability of large transcribed speech corpora. However, most languages lack sufficient paired speech and text data to effectively train these systems. In this article, we tackle the challenge of developing ASR systems without paired speech and text corpora by pro…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  13. arXiv:2406.06565  [pdf, other]

    cs.CL cs.AI cs.LG

    MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

    Authors: Jinjie Ni, Fuzhao Xue, Xiang Yue, Yuntian Deng, Mahir Shah, Kabir Jain, Graham Neubig, Yang You

    Abstract: Evaluating large language models (LLMs) is challenging. Traditional ground-truth-based benchmarks fail to capture the comprehensiveness and nuance of real-world queries, while LLM-as-judge benchmarks suffer from grading biases and limited query quantity. Both of them may also become contaminated over time. User-facing evaluation, such as Chatbot Arena, provides reliable signals but is costly and s…

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2405.15929  [pdf, other]

    econ.GN cs.HC

    Product Design Using Generative Adversarial Network: Incorporating Consumer Preference and External Data

    Authors: Hui Li, Jian Ni, Fangzhu Yang

    Abstract: The development of generative artificial intelligence (AI) enables large-scale product design automation. However, this automated process usually does not incorporate consumer preference information from the internal dataset of a company. Furthermore, external sources such as social media and user-generated content (UGC) websites often contain rich product design and consumer preference informatio…

    Submitted 2 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 46 pages, 26 figures, 5 tables

    ACM Class: I.2.6; I.5.1; I.5.4; H.2.8; J.4

  15. arXiv:2405.04434  [pdf, other]

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference…

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  16. arXiv:2404.16666  [pdf, other]

    cs.CV

    PhyRecon: Physically Plausible Neural Scene Reconstruction

    Authors: Junfeng Ni, Yixin Chen, Bohan Jing, Nan Jiang, Bin Wang, Bo Dai, Puhao Li, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

    Abstract: Neural implicit representations have gained popularity in multi-view 3D reconstruction. However, most previous work struggles to yield physically plausible results, limiting their utility in domains requiring rigorous physical accuracy, such as embodied AI and robotics. This lack of plausibility stems from the absence of physics modeling in existing methods and their inability to recover intricate…

    Submitted 2 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: project page: https://phyrecon.github.io/. arXiv admin note: text overlap with arXiv:2303.08605 by other authors

  17. arXiv:2404.15349  [pdf, other]

    eess.SP cs.LG cs.MM

    A Survey on Multimodal Wearable Sensor-based Human Action Recognition

    Authors: Jianyuan Ni, Hao Tang, Syed Tousiful Haque, Yan Yan, Anne H. H. Ngu

    Abstract: The combination of increased life expectancy and falling birth rates is resulting in an aging population. Wearable Sensor-based Human Activity Recognition (WSHAR) emerges as a promising assistive technology to support the daily lives of older individuals, unlocking vast potential for human-centric applications. However, recent surveys in WSHAR have been limited, focusing either solely on deep lear…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Multimodal Survey for Wearable Sensor-based Human Action Recognition

  18. Discovering Quirks through Timing at FASER and Future Forward Experiments at the LHC

    Authors: Jonathan L. Feng, Jinmian Li, Xufei Liao, Jian Ni, Junle Pei

    Abstract: Quirks are generic predictions of strongly-coupled dark sectors. For weak-scale masses and a broad range of confining scales in the dark sector, quirks can be discovered only at the energy frontier, but quirk--anti-quirk pairs are produced with unusual signatures at low $p_T$, making them difficult to detect at the large LHC detectors. We determine the prospects for discovering quirks using timing…

    Submitted 20 June, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: 29 pages, 11 figures, version to appear in JHEP

  19. arXiv:2403.16476  [pdf]

    eess.IV

    A Method for Target Detection Based on Mmw Radar and Vision Fusion

    Authors: Ming Zong, Jiaying Wu, Zhanyu Zhu, Jingen Ni

    Abstract: An efficient and accurate traffic monitoring system often takes advantages of multi-sensor detection to ensure the safety of urban traffic, promoting the accuracy and robustness of target detection and tracking. A method for target detection using Radar-Vision Fusion Path Aggregation Fully Convolutional One-Stage Network (RV-PAFCOS) is proposed in this paper, which is extended from Fully Convoluti…

    Submitted 25 March, 2024; originally announced March 2024.

  20. arXiv:2403.11391  [pdf, other]

    cs.LG cs.CV

    Investigating the Benefits of Projection Head for Representation Learning

    Authors: Yihao Xue, Eric Gan, Jiayi Ni, Siddharth Joshi, Baharan Mirzasoleiman

    Abstract: An effective technique for obtaining high-quality representations is adding a projection head on top of the encoder during training, then discarding it and using the pre-projection representations. Despite its proven practical effectiveness, the reason behind the success of this technique is poorly understood. The pre-projection representations are not directly optimized by the loss function, rais…

    Submitted 17 March, 2024; originally announced March 2024.

    Journal ref: ICLR 2024

  21. arXiv:2402.11615  [pdf, ps, other]

    math.FA

    Littlewood-type theorems for Hardy spaces in infinitely many variables

    Authors: Jiaqi Ni

    Abstract: Littlewood's theorem is one of the pioneering results in random analytic functions over the open unit disk. In this paper, we prove some analogues of this theorem for Hardy spaces in infinitely many variables. Our results not only cover finite-variable setting, but also apply in cases of Dirichlet series.

    Submitted 18 February, 2024; originally announced February 2024.

    MSC Class: 46E50 (Primary) 30B50; 32A35 (Secondary)

  22. arXiv:2402.11073  [pdf, other]

    cs.CL cs.AI

    AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

    Authors: Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: With the rise of generative AI, automated fact-checking methods to combat misinformation are becoming more and more important. However, factual claim detection, the first step in a fact-checking pipeline, suffers from two key issues that limit its scalability and generalizability: (1) inconsistency in definitions of the task and what a claim is, and (2) the high cost of manual annotation. To addre…

    Submitted 2 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL2024 Main Conference

  23. arXiv:2402.09668  [pdf, other]

    cs.LG cs.AI cs.CL

    How to Train Data-Efficient LLMs

    Authors: Noveen Sachdeva, Benjamin Coleman, Wang-Cheng Kang, Jianmo Ni, Lichan Hong, Ed H. Chi, James Caverlee, Julian McAuley, Derek Zhiyuan Cheng

    Abstract: The training of large language models (LLMs) is expensive. In this paper, we study data-efficient approaches for pre-training LLMs, i.e., techniques that aim to optimize the Pareto frontier of model quality and training resource/data consumption. We seek to understand the tradeoffs associated with data selection routines based on (i) expensive-to-compute data-quality estimates, and (ii) maximizati…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Under review. 44 pages, 30 figures

  24. arXiv:2402.08277  [pdf, other]

    cs.CL cs.LG

    Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering

    Authors: Tobias Schimanski, Jingwei Ni, Mathias Kraus, Elliott Ash, Markus Leippold

    Abstract: Advances towards more faithful and traceable answers of Large Language Models (LLMs) are crucial for various research and practical endeavors. One avenue in reaching this goal is basing the answers on reliable sources. However, this Evidence-Based QA has proven to work insufficiently with LLMs in terms of citing the correct sources (source quality) and truthfully representing the information withi…

    Submitted 3 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  25. arXiv:2402.03358  [pdf, other]

    cs.SI cs.AI cs.DS cs.LG

    A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation

    Authors: Mohammad Hashemi, Shengbo Gong, Juntong Ni, Wenqi Fan, B. Aditya Prakash, Wei Jin

    Abstract: Many real-world datasets can be naturally represented as graphs, spanning a wide range of domains. However, the increasing complexity and size of graph datasets present significant challenges for analysis and computation. In response, graph reduction, or graph summarization, has gained prominence for simplifying large graphs while preserving essential properties. In this survey, we aim to provide…

    Submitted 29 June, 2024; v1 submitted 28 January, 2024; originally announced February 2024.

    Comments: Accepted by IJCAI 2024 (This ArXiv version is a long version of our IJCAI paper)

  26. arXiv:2402.02036  [pdf, other]

    cs.LG

    Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks

    Authors: Zhuomin Chen, Jiaxing Zhang, Jingchao Ni, Xiaoting Li, Yuchen Bian, Md Mezbahul Islam, Ananda Mohan Mondal, Hua Wei, Dongsheng Luo

    Abstract: Graph Neural Networks (GNNs) have become a building block in graph data processing, with wide applications in critical domains. The growing needs to deploy GNNs in high-stakes applications necessitate explainability for users in the decision-making processes. A popular paradigm for the explainability of GNNs is to identify explainable subgraphs by comparing their labels with the ones of original g…

    Submitted 29 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted to International Conference on Machine Learning (ICML 2024)

  27. arXiv:2402.01739  [pdf, other]

    cs.CL cs.AI cs.DC cs.LG

    OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models

    Authors: Fuzhao Xue, Zian Zheng, Yao Fu, Jinjie Ni, Zangwei Zheng, Wangchunshu Zhou, Yang You

    Abstract: To help the open-source community have a better understanding of Mixture-of-Experts (MoE) based large language models (LLMs), we train and release OpenMoE, a series of fully open-sourced and reproducible decoder-only MoE LLMs, ranging from 650M to 34B parameters and trained on up to over 1T tokens. Our investigation confirms that MoE-based LLMs can offer a more favorable cost-effectiveness trade-o…

    Submitted 27 March, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

  28. arXiv:2401.17865  [pdf, other]

    cs.LG cs.AI

    Manipulating Predictions over Discrete Inputs in Machine Teaching

    Authors: Xiaodong Wu, Yufei Han, Hayssam Dahrouj, Jianbing Ni, Zhenwen Liang, Xiangliang Zhang

    Abstract: Machine teaching often involves the creation of an optimal (typically minimal) dataset to help a model (referred to as the `student') achieve specific goals given by a teacher. While abundant in the continuous domain, the studies on the effectiveness of machine teaching in the discrete domain are relatively limited. This paper focuses on machine teaching in the discrete domain, specifically on man…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 8 pages, 2 figures

    ACM Class: I.2.6

  29. arXiv:2401.12566  [pdf, other]

    cs.CL

    Automated Fact-Checking of Climate Change Claims with Large Language Models

    Authors: Markus Leippold, Saeid Ashraf Vaghefi, Dominik Stammbach, Veruska Muccione, Julia Bingler, Jingwei Ni, Chiara Colesanti-Senni, Tobias Wekhof, Tobias Schimanski, Glen Gostlow, Tingyu Yu, Juerg Luterbacher, Christian Huggel

    Abstract: This paper presents Climinator, a novel AI-based tool designed to automate the fact-checking of climate change claims. Utilizing an array of Large Language Models (LLMs) informed by authoritative sources like the IPCC reports and peer-reviewed scientific literature, Climinator employs an innovative Mediator-Advocate framework. This design allows Climinator to effectively synthesize varying scienti…

    Submitted 23 January, 2024; originally announced January 2024.

  30. arXiv:2401.10338  [pdf, ps, other]

    cs.LG

    MELODY: Robust Semi-Supervised Hybrid Model for Entity-Level Online Anomaly Detection with Multivariate Time Series

    Authors: Jingchao Ni, Gauthier Guinet, Peihong Jiang, Laurent Callot, Andrey Kan

    Abstract: In large IT systems, software deployment is a crucial process in online services as their code is regularly updated. However, a faulty code change may degrade the target service's performance and cause cascading outages in downstream services. Thus, software deployments should be comprehensively monitored, and their anomalies should be detected timely. In this paper, we study the problem of anomal…

    Submitted 6 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  31. arXiv:2401.07237  [pdf, other]

    cs.CL cs.AI

    Distilling Event Sequence Knowledge From Large Language Models

    Authors: Somin Wadhwa, Oktie Hassanzadeh, Debarun Bhattacharjya, Ken Barker, Jian Ni

    Abstract: Event sequence models have been found to be highly effective in the analysis and prediction of events. Building such models requires availability of abundant high-quality event sequence data. In certain applications, however, clean structured event sequences are not available, and automated sequence extraction results in data that is too noisy and incomplete. In this work, we explore the use of La…

    Submitted 1 July, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: In Proceedings of 23rd International Semantic Web Conference (ISWC), 2024

  32. arXiv:2401.01203  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Origin of zigzag antiferromagnetic orders in XPS3 (X= Fe, Ni) monolayers

    Authors: Ping Li, Xueyang Li, Junsheng Feng, Jinyang Ni, Zhi-Xin Guo, Hongjun Xiang

    Abstract: Recently, two monolayer magnetic materials, i.e., FePS3 and NiPS3, have been successfully fabricated. Despite that they have the same atomic structure, the two monolayers exhibit distinct magnetic properties. FePS3 holds an out-of-plane zigzag antiferromagnetic (AFM-ZZ) structure, while NiPS3 exhibits an in-plane AFM-ZZ structure. However, there is no theoretical model which can properly describe…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 7 pages, 4 figures

  33. arXiv:2312.17337  [pdf, other]

    cs.CL econ.GN

    Exploring Nature: Datasets and Models for Analyzing Nature-Related Disclosures

    Authors: Tobias Schimanski, Chiara Colesanti Senni, Glen Gostlow, Jingwei Ni, Tingyu Yu, Markus Leippold

    Abstract: Nature is an amorphous concept. Yet, it is essential for the planet's well-being to understand how the economy interacts with it. To address the growing demand for information on corporate nature disclosure, we provide datasets and classifiers to detect nature communication by companies. We ground our approach in the guidelines of the Taskforce on Nature-related Financial Disclosures (TNFD). Parti…

    Submitted 28 December, 2023; originally announced December 2023.

  34. arXiv:2312.06904  [pdf, other]

    eess.SY

    Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks

    Authors: Hongyue Fan, Jingjie Ni, Fangfei Li

    Abstract: In this paper, we investigate the problem of controlling probabilistic Boolean control networks (PBCNs) to achieve reachability with maximum probability in the finite time horizon. We address three questions: 1) finding control policies that achieve reachability with maximum probability under fixed, and particularly, varied finite time horizon, 2) leveraging prior knowledge to solve question 1) wi…

    Submitted 11 December, 2023; originally announced December 2023.

  35. arXiv:2311.15486  [pdf, other]

    hep-ph

    Detection prospects of long-lived quirk pairs at the LHC far detectors

    Authors: Jinmian Li, Xufei Liao, Jian Ni, Junle Pei

    Abstract: We examine the sensitivity reaches of several LHC far detectors, such as FASER2, MATHUSLA, ANUBIS, SND@LHC, and FACET, to five simplified quirk scenarios. We include the next-to-leading order QCD corrections in our simulation of quirk events, which enhance the total production rate and increase the fraction of events in the forward direction for most cases. We calculate the time scales for the qui…

    Submitted 29 April, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 21 pages, 11 figures

  36. arXiv:2311.10255  [pdf, other]

    cs.LG q-bio.PE

    FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems

    Authors: Shiyuan Luo, Juntong Ni, Shengyu Chen, Runlong Yu, Yiqun Xie, Licheng Liu, Zhenong Jin, Huaxiu Yao, Xiaowei Jia

    Abstract: Modeling environmental ecosystems is critical for the sustainability of our planet, but is extremely challenging due to the complex underlying processes driven by interactions amongst a large number of physical variables. As many variables are difficult to measure at large scales, existing works often utilize a combination of observable features and locally available measurements or modeled values…

    Submitted 19 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  37. arXiv:2311.09114  [pdf, other]

    cs.CL cs.AI cs.LG

    Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification

    Authors: Haoqiang Kang, Juntong Ni, Huaxiu Yao

    Abstract: Large Language Models (LLMs) have demonstrated remarkable proficiency in generating fluent text. However, they often encounter the challenge of generating inaccurate or hallucinated content. This issue is common in both non-retrieval-based generation and retrieval-augmented generation approaches, and existing post-hoc rectification methods may not address the accumulated hallucination errors that…

    Submitted 24 February, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  38. arXiv:2311.07912  [pdf, other]

    cs.CV eess.SP

    Detection of Small Targets in Sea Clutter Based on RepVGG and Continuous Wavelet Transform

    Authors: Jingchen Ni, Haoru Li, Lilin Xu, Jing Liang

    Abstract: Constructing a high-performance target detector under the background of sea clutter is always necessary and important. In this work, we propose a RepVGGA0-CWT detector, where RepVGG is a residual network that gains a high detection accuracy. Different from traditional residual networks, RepVGG keeps an acceptable calculation speed. Giving consideration to both accuracy and speed, the RepVGGA0 is s…

    Submitted 14 November, 2023; originally announced November 2023.

  39. arXiv:2311.05800  [pdf, other]

    cs.IR cs.AI cs.CL

    Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval

    Authors: Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer

    Abstract: There has been limited success for dense retrieval models in multilingual retrieval, due to uneven and scarce training data available across multiple languages. Synthetic training data generation is promising (e.g., InPars or Promptagator), but has been investigated only for English. Therefore, to study model capabilities across both cross-lingual and monolingual retrieval tasks, we develop SWIM-I…

    Submitted 15 April, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted at NAACL 2024. Data released at https://github.com/google-research-datasets/swim-ir

  40. arXiv:2311.00457  [pdf, other]

    cs.CV

    Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture

    Authors: Yixin Chen, Junfeng Ni, Nan Jiang, Yaowei Zhang, Yixin Zhu, Siyuan Huang

    Abstract: Reconstructing detailed 3D scenes from single-view images remains a challenging task due to limitations in existing approaches, which primarily focus on geometric shape recovery, overlooking object appearances and fine shape details. To address these challenges, we propose a novel framework for simultaneous high-fidelity recovery of object shapes and textures from single-view images. Our approach…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 3DV 2024, project page: https://dali-jack.github.io/SSR/

  41. arXiv:2310.18345  [pdf, other]

    cs.CL cs.AI

    A Survey on Semantic Processing Techniques

    Authors: Rui Mao, Kai He, Xulang Zhang, Guanyi Chen, Jinjie Ni, Zonglin Yang, Erik Cambria

    Abstract: Semantic processing is a fundamental research domain in computational linguistics. In the era of powerful pre-trained language models and large language models, the advancement of research in this domain appears to be decelerating. However, the study of semantics is multi-dimensional in linguistics. The research depth and breadth of computational semantic processing can be largely improved with ne…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Published in Information Fusion, Volume 101, 2024, 101988, ISSN 1566-2535. The equal contribution mark is missing from the published version due to the publication policies. Please contact Prof. Erik Cambria for details

  42. arXiv:2310.09983  [pdf, other]

    cs.LG cs.AI cs.CL cs.IR

    Farzi Data: Autoregressive Data Distillation

    Authors: Noveen Sachdeva, Zexue He, Wang-Cheng Kang, Jianmo Ni, Derek Zhiyuan Cheng, Julian McAuley

    Abstract: We study data distillation for auto-regressive machine learning tasks, where the input and output have a strict left-to-right causal structure. More specifically, we propose Farzi, which summarizes an event sequence dataset into a small number of synthetic sequences -- Farzi Data -- which are optimized to maintain (if not improve) model performance compared to training on the full dataset. Under t…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Under review. 23 pages, 9 figures

  43. arXiv:2310.00402  [pdf, other]

    cs.IR cs.DB

    DiskANN++: Efficient Page-based Search over Isomorphic Mapped Graph Index using Query-sensitivity Entry Vertex

    Authors: Jiongkang Ni, Xiaoliang Xu, Yuxiang Wang, Can Li, Jiajie Yao, Shihai Xiao, Xuecang Zhang

    Abstract: Given a vector dataset $\mathcal{X}$ and a query vector $\vec{x}_q$, graph-based Approximate Nearest Neighbor Search (ANNS) aims to build a graph index $G$ and approximately return vectors with minimum distances to $\vec{x}_q$ by searching over $G$. The main drawback of graph-based ANNS is that a graph index would be too large to fit into the memory especially for a large-scale $\mathcal{X}$. To s…

    Submitted 30 November, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: 15 pages including references

  44. arXiv:2309.13604  [pdf, other]

    cs.CV cs.AI

    Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation

    Authors: Jiayi Ni, Senqiao Yang, Ran Xu, Jiaming Liu, Xiaoqi Li, Wenyu Jiao, Zehui Chen, Yi Liu, Shanghang Zhang

    Abstract: Since autonomous driving systems usually face dynamic and ever-changing environments, continual test-time adaptation (CTTA) has been proposed as a strategy for transferring deployed models to continually changing target domains. However, the pursuit of long-term adaptation often introduces catastrophic forgetting and error accumulation problems, which impede the practical implementation of CTTA in…

    Submitted 29 March, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

  45. Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

    Authors: Zhiyi Chen, Harshal Maske, Huanyi Shui, Devesh Upadhyay, Michael Hopka, Joseph Cohen, Xingjian Lai, Xun Huan, Jun Ni

    Abstract: The modeling of multistage manufacturing systems (MMSs) has attracted increased attention from both academia and industry. Recent advancements in deep learning methods provide an opportunity to accomplish this task with reduced cost and expertise. This study introduces a stochastic deep Koopman (SDK) framework to model the complex behavior of MMSs. Specifically, we present a novel application of K…

    Submitted 18 September, 2023; originally announced September 2023.

    Journal ref: Journal of Manufacturing Systems 71 (2023) 609-619

  46. arXiv:2308.15027  [pdf, ps, other]

    cs.IR cs.CL

    Improving Neural Ranking Models with Traditional IR Methods

    Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

    Abstract: Neural ranking methods based on large transformer models have recently gained significant attention in the information retrieval community, and have been adopted by major commercial solutions. Nevertheless, they are computationally expensive to create, and require a great deal of labeled data for specialized corpora. In this paper, we explore a low resource alternative which is a bag-of-embedding…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Short paper, 4 pages

  47. arXiv:2308.13666  [pdf, other]

    astro-ph.HE

    A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run

    Authors: C. Fletcher, J. Wood, R. Hamburg, P. Veres, C. M. Hui, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. M. Giles, A. Goldstein, B. A. Hristov, D. Kocevski, S. Lesage, B. Mailyan, C. Malacaria, S. Poolakkil, A. von Kienlin, C. A. Wilson-Hodge, The Fermi Gamma-ray Burst Monitor Team, M. Crnogorčević, J. DeLaunay, A. Tohuvavohu, R. Caputo, S. B. Cenko, et al. (1674 additional authors not shown)

    Abstract: We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses,…

    Submitted 25 August, 2023; originally announced August 2023.

  48. arXiv:2308.03891  [pdf, other]

    cs.CL

    A Cross-Domain Evaluation of Approaches for Causal Knowledge Extraction

    Authors: Anik Saha, Oktie Hassanzadeh, Alex Gittens, Jian Ni, Kavitha Srinivas, Bulent Yener

    Abstract: Causal knowledge extraction is the task of extracting relevant causes and effects from text by detecting the causal relation. Although this task is important for language understanding and knowledge discovery, recent works in this domain have largely focused on binary classification of a text segment as causal or non-causal. In this regard, we perform a thorough analysis of three sequence tagging…

    Submitted 7 August, 2023; originally announced August 2023.

  49. arXiv:2307.15770  [pdf, other]

    cs.CL cs.AI

    CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools

    Authors: Jingwei Ni, Julia Bingler, Chiara Colesanti-Senni, Mathias Kraus, Glen Gostlow, Tobias Schimanski, Dominik Stammbach, Saeid Ashraf Vaghefi, Qian Wang, Nicolas Webersinke, Tobias Wekhof, Tingyu Yu, Markus Leippold

    Abstract: In the face of climate change, are companies really taking substantial steps toward more sustainable operations? A comprehensive answer lies in the dense, information-rich landscape of corporate sustainability reports. However, the sheer volume and complexity of these reports make human analysis very costly. Therefore, only a few entities worldwide have the resources to analyze these reports at sc…

    Submitted 11 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: 6 pages. arXiv admin note: text overlap with arXiv:2306.15518

  50. arXiv:2307.14192  [pdf, other]

    cs.CR cs.AI

    Unveiling Security, Privacy, and Ethical Concerns of ChatGPT

    Authors: Xiaodong Wu, Ran Duan, Jianbing Ni

    Abstract: This paper delves into the realm of ChatGPT, an AI-powered chatbot that utilizes topic modeling and reinforcement learning to generate natural responses. Although ChatGPT holds immense promise across various industries, such as customer service, education, mental health treatment, personal productivity, and content creation, it is essential to address its security, privacy, and ethical implication…

    Submitted 26 July, 2023; originally announced July 2023.