Zum Hauptinhalt springen

Showing 1–50 of 217 results for author: Zhong, M

.
  1. arXiv:2408.10895  [pdf, ps, other

    cs.AI

    Analytical and Empirical Study of Herding Effects in Recommendation Systems

    Authors: Hong Xie, Mingze Zhong, Defu Lian, Zhen Wang, Enhong Chen

    Abstract: Online rating systems are often used in numerous web or mobile applications, e.g., Amazon and TripAdvisor, to assess the ground-truth quality of products. Due to herding effects, the aggregation of historical ratings (or historical collective opinion) can significantly influence subsequent ratings, leading to misleading and erroneous assessments. We study how to manage product ratings via rating a… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 29 pages

  2. arXiv:2408.09439  [pdf, other

    cs.IR cs.AI

    Towards Boosting LLMs-driven Relevance Modeling with Progressive Retrieved Behavior-augmented Prompting

    Authors: Zeyuan Chen, Haiyan Wu, Kaixin Wu, Wei Chen, Mingjie Zhong, Jia Xu, Zhongyi Liu, Wei Zhang

    Abstract: Relevance modeling is a critical component for enhancing user experience in search engines, with the primary objective of identifying items that align with users' queries. Traditional models only rely on the semantic congruence between queries and items to ascertain relevance. However, this approach represents merely one aspect of the relevance judgement, and is insufficient in isolation. Even pow… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  3. arXiv:2408.08978  [pdf, other

    cs.CL

    See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses

    Authors: Yulong Chen, Yang Liu, Jianhao Yan, Xuefeng Bai, Ming Zhong, Yinghao Yang, Ziyi Yang, Chenguang Zhu, Yue Zhang

    Abstract: The impressive performance of Large Language Models (LLMs) has consistently surpassed numerous human-designed benchmarks, presenting new challenges in assessing the shortcomings of LLMs. Designing tasks and finding LLMs' limitations are becoming increasingly important. In this paper, we investigate the question of whether an LLM can discover its own limitations from the errors it makes. To this en… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: COLM 2024

  4. arXiv:2408.05457  [pdf, other

    cs.CL cs.AI

    Investigating Instruction Tuning Large Language Models on Graphs

    Authors: Kerui Zhu, Bo-Wei Huang, Bowen Jin, Yizhu Jiao, Ming Zhong, Kevin Chang, Shou-De Lin, Jiawei Han

    Abstract: Inspired by the recent advancements of Large Language Models (LLMs) in NLP tasks, there's growing interest in applying LLMs to graph-related tasks. This study delves into the capabilities of instruction-following LLMs for engaging with real-world graphs, aiming to offer empirical insights into how LLMs can effectively interact with graphs and generalize across graph tasks. We begin by constructing… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: COLM 2024

  5. arXiv:2408.02877  [pdf, other

    nucl-ex astro-ph.SR hep-ex physics.ins-det

    First Measurement of Solar $^8$B Neutrinos via Coherent Elastic Neutrino-Nucleus Scattering with XENONnT

    Authors: E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, D. Antón Martin, F. Arneodo, L. Baudis, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, K. Boese, A. Brown, G. Bruno, R. Budnik, C. Cai, C. Capelli, J. M. R. Cardoso, A. P. Cimental Chávez, A. P. Colijn, J. Conrad, J. J. Cuenca-García , et al. (142 additional authors not shown)

    Abstract: We present the first measurement of nuclear recoils from solar $^8$B neutrinos via coherent elastic neutrino-nucleus scattering with the XENONnT dark matter experiment. The central detector of XENONnT is a low-background, two-phase time projection chamber with a 5.9\,t sensitive liquid xenon target. A blind analysis with an exposure of 3.51\,t$\times$y resulted in 37 observed events above 0.5\,keV… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  6. arXiv:2407.12441  [pdf, ps, other

    nlin.PS math-ph physics.comp-ph physics.optics quant-ph

    Dynamics of discrete solitons in the fractional discrete nonlinear Schrödinger equation with the quasi-Riesz derivative

    Authors: Ming Zhong, Boris A. Malomed, Zhenya Yan

    Abstract: We elaborate a fractional discrete nonlinear Schrödinger (FDNLS) equation based on an appropriately modified definition of the Riesz fractional derivative, which is characterized by its Lévy index (LI). This FDNLS equation represents a novel discrete system, in which the nearest-neighbor coupling is combined with long-range interactions, that decay as the inverse square of the separation between l… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 15 pages, 8 figures (to be published in Phys. Rev. E, 2024)

  7. arXiv:2407.02811  [pdf, other

    cs.LG cs.IT

    SPLITZ: Certifiable Robustness via Split Lipschitz Randomized Smoothing

    Authors: Meiyu Zhong, Ravi Tandon

    Abstract: Certifiable robustness gives the guarantee that small perturbations around an input to a classifier will not change the prediction. There are two approaches to provide certifiable robustness to adversarial examples: a) explicitly training classifiers with small Lipschitz constants, and b) Randomized smoothing, which adds random noise to the input to create a smooth classifier. We propose \textit{S… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  8. arXiv:2406.13638  [pdf, other

    physics.data-an astro-ph.IM hep-ex physics.ins-det

    XENONnT WIMP Search: Signal & Background Modeling and Statistical Inference

    Authors: XENON Collaboration, E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, D. Antón Martin, F. Arneodo, L. Baudis, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, K. Boese, A. Brown, G. Bruno, R. Budnik, J. M. R. Cardoso, A. P. Cimental Chávez, A. P. Colijn, J. Conrad, J. J. Cuenca-García, V. D'Andrea , et al. (139 additional authors not shown)

    Abstract: The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 20 pages, 10 figures

  9. arXiv:2406.13282  [pdf, other

    cs.CL

    Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective

    Authors: Meizhi Zhong, Chen Zhang, Yikun Lei, Xikai Liu, Yan Gao, Yao Hu, Kehai Chen, Min Zhang

    Abstract: Enabling LLMs to handle lengthy context is currently a research hotspot. Most LLMs are built upon rotary position embedding (RoPE), a popular position encoding method. Therefore, a prominent path is to extrapolate the RoPE trained on comparably short texts to far longer texts. A heavy bunch of efforts have been dedicated to boosting the extrapolation via extending the formulations of the RoPE, how… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  10. arXiv:2406.08394  [pdf, other

    cs.CV

    VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

    Authors: Jiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu, Wenhai Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo, Yu Qiao, Jifeng Dai

    Abstract: We present VisionLLM v2, an end-to-end generalist multimodal large model (MLLM) that unifies visual perception, understanding, and generation within a single framework. Unlike traditional MLLMs limited to text output, VisionLLM v2 significantly broadens its application scope. It excels not only in conventional visual question answering (VQA) but also in open-ended, cross-domain vision tasks such a… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 43 pages

  11. arXiv:2406.08335  [pdf, other

    cs.LG cs.AI cs.DB stat.CO

    A Survey of Pipeline Tools for Data Engineering

    Authors: Anthony Mbata, Yaji Sripada, Mingjun Zhong

    Abstract: Currently, a variety of pipeline tools are available for use in data engineering. Data scientists can use these tools to resolve data wrangling issues associated with data and accomplish some data engineering tasks from data ingestion through data preparation to utilization as input for machine learning (ML). Some of these tools have essential built-in components or can be combined with other tool… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  12. arXiv:2406.07239  [pdf, other

    cs.CL

    On the Hallucination in Simultaneous Machine Translation

    Authors: Meizhi Zhong, Kehai Chen, Zhengshan Xue, Lemao Liu, Mingming Yang, Min Zhang

    Abstract: It is widely known that hallucination is a critical issue in Simultaneous Machine Translation (SiMT) due to the absence of source-side information. While many efforts have been made to enhance performance for SiMT, few of them attempt to understand and analyze hallucination in SiMT. Therefore, we conduct a comprehensive analysis of hallucination in SiMT from two perspectives: understanding the dis… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  13. arXiv:2405.14386  [pdf, other

    cs.CV

    Capsule Network Projectors are Equivariant and Invariant Learners

    Authors: Miles Everett, Aiden Durrant, Mingjun Zhong, Georgios Leontidis

    Abstract: Learning invariant representations has been the longstanding approach to self-supervised learning. However, recently progress has been made in preserving equivariant properties in representations, yet do so with highly prescribed architectures. In this work, we propose an invariant-equivariant self-supervised architecture that employs Capsule Networks (CapsNets) which have been shown to capture eq… ▽ More

    Submitted 6 August, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 17 pages, 7 figures, 10 Tables; code to be released at: https://github.com/AberdeenML/CapsIE V2: corrected typos, added a new Table 3 and additional results in Table 1 and Table 2

  14. arXiv:2405.07393  [pdf, other

    cs.LG cs.AI cs.IT

    Intrinsic Fairness-Accuracy Tradeoffs under Equalized Odds

    Authors: Meiyu Zhong, Ravi Tandon

    Abstract: With the growing adoption of machine learning (ML) systems in areas like law enforcement, criminal justice, finance, hiring, and admissions, it is increasingly critical to guarantee the fairness of decisions assisted by ML. In this paper, we study the tradeoff between fairness and accuracy under the statistical notion of equalized odds. We present a new upper bound on the accuracy (that holds for… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  15. arXiv:2405.06280  [pdf, ps, other

    math.AP

    Green's Function and Pointwise Space-time Behaviors of the Three-Dimensional Relativistic Boltzmann Equation

    Authors: Yanchao Li, Mingying Zhong

    Abstract: The pointwise space-time behavior of the Green's function of the three-dimensional relativistic Boltzmann equation is studied in this paper. It is shown that the Green's function has a decomposition of the macroscopic diffusive waves and Huygens waves with the speed $\sqrt{a^2+b^2}$ at low-frequency, the singular kinetic wave and the remainder term decaying exponentially in space and time. In addi… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2303.13021

    MSC Class: 76P05; 82C40; 82D05

  16. arXiv:2404.18389  [pdf, ps, other

    math.AP

    Diffusion Limit with Optimal Convergence Rate of Classical Solutions to the Vlasov-Maxwell-Boltzmann System

    Authors: Tong Yang, Mingying Zhong

    Abstract: We study the diffusion limit of the strong solution to the Vlasov-Maxwell-Boltzmann (VMB) system with initial data near a global Maxwellian. By introducing a new decomposition of the solution to identify the essential components for generating the initial layer, we prove the convergence and establish the opitmal convergence rate of the classical solution to the VMB system to the solution of the Na… ▽ More

    Submitted 9 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    MSC Class: 76P05; 82C40; 82D05

  17. arXiv:2404.16189  [pdf, other

    math.NA

    Structure Preserving PINN for Solving Time Dependent PDEs with Periodic Boundary

    Authors: Baoli Hao, Ulisses Braga-Neto, Chun Liu, Lifan Wang, Ming Zhong

    Abstract: We present a structure preserving PINN for solving a series of time dependent PDEs with periodic boundary. Our method can incorporate the periodic boundary condition as the natural output of any deep neural net, hence significantly improving the training accuracy of baseline PINN. Together with mini-batching and other PINN variants (SA-PINN, RBA-PINN, etc.), our structure preserving PINN can even… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  18. arXiv:2404.07458  [pdf, other

    physics.plasm-ph

    I-mode Plasma Confinement Improvement by Real-time Lithium Injection and its Classification on EAST Tokamak

    Authors: X. M. Zhong, X. L. Zou, A. D. Liu, Y. T. Song, G. Zhuang, H. Q. Liu, L. Q. Xu, E. Z. Li, B. Zhang, G. Z. Zuo, Z. Wang, C. Zhou, J. Zhang, W. X. Shi, L. T. Gao, S. F. Wang, W. Gao, T. Q. Jia, Q. Zang, H. L. Zhao, M. Wang, H. D. Xu, X. J. Wang, X. Gao, X. D. Lin , et al. (3 additional authors not shown)

    Abstract: I-mode is a promising regime for future fusion reactors due to the high energy confinement and the moderate particle confinement. However, the effect of lithium, which has been widely applied for particle recycling and impurity control, on I-mode plasma is still unclear. Recently, experiments of real-time lithium powder injection on I-mode plasma have been carried out in EAST Tokamak. It was found… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  19. arXiv:2404.05817  [pdf, other

    cs.LG

    Label Propagation Training Schemes for Physics-Informed Neural Networks and Gaussian Processes

    Authors: Ming Zhong, Dehao Liu, Raymundo Arroyave, Ulisses Braga-Neto

    Abstract: This paper proposes a semi-supervised methodology for training physics-informed machine learning methods. This includes self-training of physics-informed neural networks and physics-informed Gaussian processes in isolation, and the integration of the two via co-training. We demonstrate via extensive numerical experiments how these methods can ameliorate the issue of propagating information forward… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  20. arXiv:2403.14878  [pdf, other

    hep-ex physics.ins-det

    Offline tagging of radon-induced backgrounds in XENON1T and applicability to other liquid xenon detectors

    Authors: E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, G. Bruno, R. Budnik, T. K. Bui, J. M. R. Cardoso, A. P. Cimental Chavez, A. P. Colijn, J. Conrad , et al. (142 additional authors not shown)

    Abstract: This paper details the first application of a software tagging algorithm to reduce radon-induced backgrounds in liquid noble element time projection chambers, such as XENON1T and XENONnT. The convection velocity field in XENON1T was mapped out using $^{222}\text{Rn}$ and $^{218}\text{Po}$ events, and the root-mean-square convection speed was measured to be $0.30 \pm 0.01$ cm/s. Given this velocity… ▽ More

    Submitted 19 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 17 pages, 19 figures

  21. arXiv:2403.06813  [pdf, other

    cs.CV

    LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations

    Authors: Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong

    Abstract: Contrastive instance discrimination approaches outperform supervised learning in downstream tasks like image classification and object detection. However, these approaches heavily rely on data augmentation during representation learning, which may result in inferior results if not properly implemented. Random cropping followed by resizing is a common form of data augmentation used in contrastive l… ▽ More

    Submitted 18 July, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 14 pages, 5 figures, 7 tables

  22. arXiv:2403.04724  [pdf, other

    cs.CV

    Masked Capsule Autoencoders

    Authors: Miles Everett, Mingjun Zhong, Georgios Leontidis

    Abstract: We propose Masked Capsule Autoencoders (MCAE), the first Capsule Network that utilises pretraining in a self-supervised manner. Capsule Networks have emerged as a powerful alternative to Convolutional Neural Networks (CNNs), and have shown favourable properties when compared to Vision Transformers (ViT), but have struggled to effectively learn when presented with more complex data, leading to Caps… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 14 pages, 6 figures, 4 tables

  23. arXiv:2403.02595  [pdf, other

    math.NA

    Learning Stochastic Dynamics from Data

    Authors: Ziheng Guo, Igor Cialenco, Ming Zhong

    Abstract: We present a noise guided trajectory based system identification method for inferring the dynamical structure from observation generated by stochastic differential equations. Our method can handle various kinds of noise, including the case when the the components of the noise is correlated. Our method can also learn both the noise level and drift term together from trajectory. We present various n… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  24. arXiv:2402.16843  [pdf, other

    cs.CV cs.AI cs.CL cs.GR cs.LG

    Multi-LoRA Composition for Image Generation

    Authors: Ming Zhong, Yelong Shen, Shuohang Wang, Yadong Lu, Yizhu Jiao, Siru Ouyang, Donghan Yu, Jiawei Han, Weizhu Chen

    Abstract: Low-Rank Adaptation (LoRA) is extensively utilized in text-to-image models for the accurate rendition of specific elements like distinct characters or unique styles in generated images. Nonetheless, existing methods face challenges in effectively composing multiple LoRAs, especially as the number of LoRAs to be integrated grows, thus hindering the creation of complex imagery. In this paper, we stu… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Project Website: https://maszhongming.github.io/Multi-LoRA-Composition/

  25. arXiv:2402.11499  [pdf, other

    math.NA

    Acousto-electric tomography by the convergence of Kaczamrz two-point gradient-$Θ$ method

    Authors: Kai Zhu, Jijun Liu, Min Zhong

    Abstract: We study the numerical reconstruction problem in acousto-electric tomography (AET) of recovering the conductivity distribution in a bounded domain from multiple interior power density data. The Two-Point-Gradient-$Θ$ (TPG-$Θ$) in Kaczmarz type is proposed, with a general convex penalty term $Θ$, the algorithm can be utilized in AET problem for recovering sparse and discontinuous conductivity distr… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  26. arXiv:2402.10446  [pdf, other

    physics.ins-det astro-ph.IM hep-ex

    The XENONnT Dark Matter Experiment

    Authors: XENON Collaboration, E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, M. Balata, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui , et al. (170 additional authors not shown)

    Abstract: The multi-staged XENON program at INFN Laboratori Nazionali del Gran Sasso aims to detect dark matter with two-phase liquid xenon time projection chambers of increasing size and sensitivity. The XENONnT experiment is the latest detector in the program, planned to be an upgrade of its predecessor XENON1T. It features an active target of 5.9 tonnes of cryogenic liquid xenon (8.5 tonnes total mass in… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 32 pages, 19 figures

  27. arXiv:2401.06059  [pdf, other

    cs.CL cs.AI cs.LG

    Investigating Data Contamination for Pre-training Language Models

    Authors: Minhao Jiang, Ken Ziyu Liu, Ming Zhong, Rylan Schaeffer, Siru Ouyang, Jiawei Han, Sanmi Koyejo

    Abstract: Language models pre-trained on web-scale corpora demonstrate impressive capabilities on diverse downstream tasks. However, there is increasing concern whether such capabilities might arise from evaluation datasets being included in the pre-training corpus -- a phenomenon known as \textit{data contamination} -- in a manner that artificially increases performance. There has been little understanding… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 16 pages, 5 figures

  28. arXiv:2401.01031  [pdf, other

    cond-mat.stat-mech

    Quantum phase transitions in the alternating XY chain with three-site interactions

    Authors: Kaiyuan Cao, Hao Fu, Xue Liu, Ming Zhong, Peiqing Tong

    Abstract: We investigate the quantum phase transition in the alternating XY chain with the XZX+YZY type of three-spin interactions. We present the exact solution derived by means of the Jordan-Wigner transformation and study the average magnetization, spin correlations, and von Neumann entropy to establish the phase diagram. The phase diagram consists of the ferromagnetic phases, the paramagnetic phases, an… ▽ More

    Submitted 4 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: 10 pages,12 figures

  29. arXiv:2312.14238  [pdf, other

    cs.CV

    InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

    Authors: Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai

    Abstract: The exponential growth of large language models (LLMs) has opened up numerous possibilities for multimodal AGI systems. However, the progress in vision and vision-language foundation models, which are also critical elements of multi-modal AGI, has not kept pace with LLMs. In this work, we design a large-scale vision-language foundation model (InternVL), which scales up the vision foundation model… ▽ More

    Submitted 15 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 25 pages, 5 figures, 28 tables

  30. arXiv:2312.01150  [pdf, other

    cs.NE

    Pointer Networks Trained Better via Evolutionary Algorithms

    Authors: Muyao Zhong, Shengcai Liu, Bingdong Li, Haobo Fu, Ke Tang, Peng Yang

    Abstract: Pointer Network (PtrNet) is a specific neural network for solving Combinatorial Optimization Problems (COPs). While PtrNets offer real-time feed-forward inference for complex COPs instances, its quality of the results tends to be less satisfactory. One possible reason is that such issue suffers from the lack of global search ability of the gradient descent, which is frequently employed in traditio… ▽ More

    Submitted 11 March, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: None

    MSC Class: 68T07

  31. arXiv:2311.12947  [pdf, other

    cs.AI eess.SY

    PINNs-Based Uncertainty Quantification for Transient Stability Analysis

    Authors: Ren Wang, Ming Zhong, Kaidi Xu, Lola Giráldez Sánchez-Cortés, Ignacio de Cominges Guerra

    Abstract: This paper addresses the challenge of transient stability in power systems with missing parameters and uncertainty propagation in swing equations. We introduce a novel application of Physics-Informed Neural Networks (PINNs), specifically an Ensemble of PINNs (E-PINNs), to estimate critical parameters like rotor angle and inertia coefficient with enhanced accuracy and reduced computational load. E-… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  32. arXiv:2311.07066  [pdf, other

    cs.CL

    Context Consistency between Training and Testing in Simultaneous Machine Translation

    Authors: Meizhi Zhong, Lemao Liu, Kehai Chen, Mingming Yang, Min Zhang

    Abstract: Simultaneous Machine Translation (SiMT) aims to yield a real-time partial translation with a monotonically growing the source-side context. However, there is a counterintuitive phenomenon about the context usage between training and testing: e.g., the wait-k testing model consistently trained with wait-k is much worse than that model inconsistently trained with wait-k' (k' is not equal to k) in te… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  33. arXiv:2311.00875  [pdf, other

    cs.LG cs.MA math.DS

    Learning Collective Behaviors from Observation

    Authors: Jinchao Feng, Ming Zhong

    Abstract: We present a comprehensive examination of learning methodologies employed for the structural identification of dynamical systems. These techniques are designed to elucidate emergent phenomena within intricate systems of interacting agents. Our approach not only ensures theoretical convergence guarantees but also exhibits computational efficiency when handling high-dimensional observational data. T… ▽ More

    Submitted 4 April, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  34. arXiv:2310.16040  [pdf, other

    cs.CL cs.AI

    Instruct and Extract: Instruction Tuning for On-Demand Information Extraction

    Authors: Yizhu Jiao, Ming Zhong, Sha Li, Ruining Zhao, Siru Ouyang, Heng Ji, Jiawei Han

    Abstract: Large language models with instruction-following capabilities open the door to a wider group of users. However, when it comes to information extraction - a classic task in natural language processing - most task-specific systems cannot align well with long-tail ad hoc extraction use cases for non-expert users. To address this, we propose a novel paradigm, termed On-Demand Information Extraction, t… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  35. arXiv:2310.12418  [pdf, other

    cs.CL

    The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions

    Authors: Siru Ouyang, Shuohang Wang, Yang Liu, Ming Zhong, Yizhu Jiao, Dan Iter, Reid Pryzant, Chenguang Zhu, Heng Ji, Jiawei Han

    Abstract: Recent progress in Large Language Models (LLMs) has produced models that exhibit remarkable performance across a variety of NLP tasks. However, it remains unclear whether the existing focus of NLP research accurately captures the genuine requirements of human users. This paper provides a comprehensive analysis of the divergence between current NLP research and the needs of real-world NLP applicati… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  36. arXiv:2310.11451  [pdf, other

    cs.CL cs.AI cs.LG

    Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

    Authors: Ming Zhong, Chenxin An, Weizhu Chen, Jiawei Han, Pengcheng He

    Abstract: Large Language Models (LLMs) inherently encode a wealth of knowledge within their parameters through pre-training on extensive corpora. While prior research has delved into operations on these parameters to manipulate the underlying implicit knowledge (encompassing detection, editing, and merging), there remains an ambiguous understanding regarding their transferability across models with varying… ▽ More

    Submitted 8 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  37. arXiv:2310.06314  [pdf, ps, other

    math.CO math.NT

    Power-partible Reduction and Congruences for Schröder Polynomials

    Authors: Chen-Bo Jia, Rong-Hua Wang, Michael X. X. Zhong

    Abstract: In this note, we apply the power-partible reduction to show the following arithmetic properties of large Schröder polynomials $S_n(z)$ and little Schröder polynomials $s_n(z)$: for any odd prime $p$, nonnegative integer $r\in\mathbb{N}$, $\varepsilon\in\{-1,1\}$ and $z\in\mathbb{Z}$ with $\gcd(p,z(z+1))=1$, we have \[ \sum_{k=0}^{p-1}(2k+1)^{2r+1}\varepsilon^k S_k(z)\equiv 1\pmod {p}\quad \text{an… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 11

    MSC Class: 05A19

  38. arXiv:2309.11996  [pdf, other

    hep-ex physics.ins-det

    Design and performance of the field cage for the XENONnT experiment

    Authors: E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso, D. Cichon , et al. (139 additional authors not shown)

    Abstract: The precision in reconstructing events detected in a dual-phase time projection chamber depends on an homogeneous and well understood electric field within the liquid target. In the XENONnT TPC the field homogeneity is achieved through a double-array field cage, consisting of two nested arrays of field shaping rings connected by an easily accessible resistor chain. Rather than being connected to t… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Journal ref: Eur. Phys. J. C 84, 138 (2024)

  39. Attractive and repulsive interactions in the one-dimensional swarmalator model

    Authors: Baoli Hao, Ming Zhong, Kevin O'Keeffe

    Abstract: We study a population of swarmalators, mobile variants of phase oscillators, which run on a ring and have both attractive and repulsive interactions. This one-dimensional (1D) swarmalator model produces several of collective states: the standard sync and async states as well as a splaylike "polarized" state and several unsteady states such as active bands or swirling. The model's simplicity allows… ▽ More

    Submitted 4 January, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

  40. arXiv:2309.00361  [pdf, ps, other

    cs.DB cs.DS

    A Unified and Scalable Algorithm Framework of User-Defined Temporal $(k,\mathcal{X})$-Core Query

    Authors: Ming Zhong, Junyong Yang, Yuanyuan Zhu, Tieyun Qian, Mengchi Liu, Jeffrey Xu Yu

    Abstract: Querying cohesive subgraphs on temporal graphs (e.g., social network, finance network, etc.) with various conditions has attracted intensive research interests recently. In this paper, we study a novel Temporal $(k,\mathcal{X})$-Core Query (TXCQ) that extends a fundamental Temporal $k$-Core Query (TCQ) proposed in our conference paper by optimizing or constraining an arbitrary metric… ▽ More

    Submitted 21 December, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2301.03770

  41. arXiv:2307.11088  [pdf, other

    cs.CL

    L-Eval: Instituting Standardized Evaluation for Long Context Language Models

    Authors: Chenxin An, Shansan Gong, Ming Zhong, Xingjian Zhao, Mukai Li, Jun Zhang, Lingpeng Kong, Xipeng Qiu

    Abstract: Recently, there has been growing interest in extending the context length of large language models (LLMs), aiming to effectively process long inputs of one turn or conversations with more extensive histories. While proprietary models such as GPT-4 and Claude can largely preserve the reasoning ability in an extended context, open-source models are still progressing through the early stages of devel… ▽ More

    Submitted 4 October, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  42. arXiv:2307.09944  [pdf, other

    cs.CV

    ProtoCaps: A Fast and Non-Iterative Capsule Network Routing Method

    Authors: Miles Everett, Mingjun Zhong, Georgios Leontidis

    Abstract: Capsule Networks have emerged as a powerful class of deep learning architectures, known for robust performance with relatively few parameters compared to Convolutional Neural Networks (CNNs). However, their inherent efficiency is often overshadowed by their slow, iterative routing mechanisms which establish connections between Capsule layers, posing computational challenges resulting in an inabili… ▽ More

    Submitted 8 March, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 13 pages, 5 figures, 5 tables

    Journal ref: TMLR December 2023 (https://openreview.net/pdf?id=Id10mlBjcx)

  43. arXiv:2307.09696  [pdf, other

    cs.CV

    Towards Saner Deep Image Registration

    Authors: Bin Duan, Ming Zhong, Yan Yan

    Abstract: With recent advances in computing hardware and surges of deep-learning architectures, learning-based deep image registration methods have surpassed their traditional counterparts, in terms of metric performance and inference time. However, these methods focus on improving performance measurements such as Dice, resulting in less attention given to model behaviors that are equally desirable for regi… ▽ More

    Submitted 12 March, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: ICCV 2023, fix typos

  44. arXiv:2307.04018  [pdf, other

    cs.CL

    Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation

    Authors: Yulong Chen, Huajian Zhang, Yijie Zhou, Xuefeng Bai, Yueguan Wang, Ming Zhong, Jianhao Yan, Yafu Li, Judy Li, Michael Zhu, Yue Zhang

    Abstract: Most existing cross-lingual summarization (CLS) work constructs CLS corpora by simply and directly translating pre-annotated summaries from one language to another, which can contain errors from both summarization and translation processes. To address this issue, we propose ConvSumX, a cross-lingual conversation summarization benchmark, through a new annotation schema that explicitly considers sou… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: ACL2023

  45. arXiv:2307.01448  [pdf, other

    cs.CL

    ReactIE: Enhancing Chemical Reaction Extraction with Weak Supervision

    Authors: Ming Zhong, Siru Ouyang, Minhao Jiang, Vivian Hu, Yizhu Jiao, Xuan Wang, Jiawei Han

    Abstract: Structured chemical reaction information plays a vital role for chemists engaged in laboratory work and advanced endeavors such as computer-aided drug design. Despite the importance of extracting structured reactions from scientific literature, data annotation for this purpose is cost-prohibitive due to the significant labor required from domain experts. Consequently, the scarcity of sufficient tr… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: Findings of ACL 2023, Short Paper

  46. arXiv:2306.16552  [pdf, other

    cs.LG cs.AI cs.CY cs.IT

    Learning Fair Classifiers via Min-Max F-divergence Regularization

    Authors: Meiyu Zhong, Ravi Tandon

    Abstract: As machine learning (ML) based systems are adopted in domains such as law enforcement, criminal justice, finance, hiring and admissions, ensuring the fairness of ML aided decision-making is becoming increasingly important. In this paper, we focus on the problem of fair classification, and introduce a novel min-max F-divergence regularization framework for learning fair classification models while… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  47. arXiv:2306.16122  [pdf, other

    cs.CV cs.LG

    Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination methods

    Authors: Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong

    Abstract: Self-supervised learning algorithms (SSL) based on instance discrimination have shown promising results, performing competitively or even outperforming supervised learning counterparts in some downstream tasks. Such approaches employ data augmentation to create two views of the same instance (i.e., positive pairs) and encourage the model to learn good representations by attracting these views clos… ▽ More

    Submitted 25 April, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 17 pages, 6 figures, 12 tables

    Journal ref: TMLR 2024 (https://openreview.net/pdf?id=z5AXLMBWdU)

  48. arXiv:2306.11871  [pdf, other

    hep-ex physics.ins-det

    Search for events in XENON1T associated with Gravitational Waves

    Authors: XENON Collaboration, E. Aprile, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antoń Martin, F. Arneodo, L. Baudis, A. L. Baxter, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, E. J. Brookes, A. Brown, S. Bruenner, G. Bruno, R. Budnik, T. K. Bui, C. Cai, J. M. R. Cardoso , et al. (138 additional authors not shown)

    Abstract: We perform a blind search for particle signals in the XENON1T dark matter detector that occur close in time to gravitational wave signals in the LIGO and Virgo observatories. No particle signal is observed in the nuclear recoil, electronic recoil, CE$ν$NS, and S2-only channels within $\pm$ 500 seconds of observations of the gravitational wave signals GW170104, GW170729, GW170817, GW170818, and GW1… ▽ More

    Submitted 27 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  49. arXiv:2306.08416  [pdf, other

    physics.plasm-ph

    Characteristics of the edge temperature ring oscillation during stationary improved confnement mode in EAST

    Authors: A. D. Liu, X. L. Zou, X. M. Zhong, Y. T. Song, M. K. Han, Y. M. Duan, H. Q. Liu, T. B. Wang, E. Z. Li, L. Zhang, X. Feng, G. Zhuang, EAST I-mode working group

    Abstract: I-mode is a natural ELMy-free regime with H-mode like improved energy confnement and L-mode like particle confnement, making it an attractive scenario for future tokamak based fusion reactors. A kind of low frequency oscillation was widely found and appeared to be unique in I-mode, with the frequency between stationary zonal flow and geodesic-acoustic mode (GAM) zonal flow. In EAST, 90 percent I-m… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 23 pages, 16 figures

  50. arXiv:2306.06601  [pdf, other

    cs.CL

    Mimicking the Thinking Process for Emotion Recognition in Conversation with Prompts and Paraphrasing

    Authors: Ting Zhang, Zhuang Chen, Ming Zhong, Tieyun Qian

    Abstract: Emotion recognition in conversation, which aims to predict the emotion for all utterances, has attracted considerable research attention in recent years. It is a challenging task since the recognition of the emotion in one utterance involves many complex factors, such as the conversational context, the speaker's background, and the subtle difference between emotion labels. In this paper, we propos… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted to IJCAI 2023, AI and Social Good track