Showing 201–250 of 2,032 results for author: Ding, Y

  1. arXiv:2403.10065  [pdf, other]

    cs.CL

    Triple GNNs: Introducing Syntactic and Semantic Information for Conversational Aspect-Based Quadruple Sentiment Analysis

    Authors: Binbin Li, Yuqing Li, Siyu Jia, Bingnan Ma, Yu Ding, Zisen Qi, Xingbang Tan, Menghan Guo, Shenghui Liu

    Abstract: Conversational Aspect-Based Sentiment Analysis (DiaASQ) aims to detect quadruples \{target, aspect, opinion, sentiment polarity\} from given dialogues. In DiaASQ, elements constituting these quadruples are not necessarily confined to individual sentences but may span across multiple utterances within a dialogue. This necessitates a dual focus on both the syntactic information of individual utteran…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by CSCWD2024

  2. arXiv:2403.09284  [pdf, other]

    cs.LG cs.DC

    DA-PFL: Dynamic Affinity Aggregation for Personalized Federated Learning

    Authors: Xu Yang, Jiyuan Feng, Songyue Guo, Ye Wang, Ye Ding, Binxing Fang, Qing Liao

    Abstract: Personalized federated learning has become a hot research topic that can learn a personalized learning model for each client. Existing personalized federated learning models prefer to aggregate similar clients with similar data distribution to improve the performance of learning models. However, similarity-based personalized federated learning methods may exacerbate the class imbalanced problem. In th…

    Submitted 14 March, 2024; originally announced March 2024.

  3. arXiv:2403.08642  [pdf, other]

    cond-mat.stat-mech cond-mat.str-el quant-ph

    Reweight-annealing method for calculating the value of partition function via quantum Monte Carlo

    Authors: Yi-Ming Ding, Jun-Song Sun, Nvsen Ma, Gaopei Pan, Chen Cheng, Zheng Yan

    Abstract: An efficient and accurate algorithm for partition function, free energy and thermal entropy calculations is of great significance in statistical physics and quantum many-body physics. Here we present an unbiased but low-technical-barrier algorithm within the quantum Monte Carlo framework, which has exceptionally high accuracy and no systemic error. Compared with the conventional specific heat integra…

    Submitted 1 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 10 pages, 7 figures

  4. arXiv:2403.08580  [pdf, other]

    cs.CV cs.MM eess.IV

    Leveraging Compressed Frame Sizes For Ultra-Fast Video Classification

    Authors: Yuxing Han, Yunan Ding, Chen Ye Gan, Jiangtao Wen

    Abstract: Classifying videos into distinct categories, such as Sport and Music Video, is crucial for multimedia understanding and retrieval, especially when an immense volume of video content is being constantly generated. Traditional methods require video decompression to extract pixel-level features like color, texture, and motion, thereby increasing computational and storage demands. Moreover, these meth…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 5 pages, 5 figures, 1 table. arXiv admin note: substantial text overlap with arXiv:2309.07361

  5. arXiv:2403.07964  [pdf, other]

    cs.AI

    Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services

    Authors: Maqsood Hussain Shah, Yue Ding, Shaoshu Zhu, Yingqi Gu, Mingming Liu

    Abstract: With the rising concern over transportation emissions and pollution on a global scale, shared electric mobility services like E-cars, E-bikes, and E-scooters have emerged as promising solutions to mitigate these pressing challenges. However, existing shared E-mobility services exhibit critical design deficiencies, including insufficient service integration, imprecise energy consumption forecasting…

    Submitted 1 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 15 pages, 5 figures

  6. arXiv:2403.07207  [pdf, other]

    stat.ML cs.LG

    Tracking Dynamic Gaussian Density with a Theoretically Optimal Sliding Window Approach

    Authors: Yinsong Wang, Yu Ding, Shahin Shahrampour

    Abstract: Dynamic density estimation is ubiquitous in many applications, including computer vision and signal processing. One popular method to tackle this problem is the "sliding window" kernel density estimator. There exist various implementations of this method that use heuristically defined weight sequences for the observed data. The weight sequence, however, is a key aspect of the estimator affecting t…

    Submitted 11 March, 2024; originally announced March 2024.

  7. Thought Graph: Generating Thought Process for Biological Reasoning

    Authors: Chi-Yang Hsu, Kyle Cox, Jiawei Xu, Zhen Tan, Tianhua Zhai, Mengzhou Hu, Dexter Pratt, Tianlong Chen, Ziniu Hu, Ying Ding

    Abstract: We present the Thought Graph as a novel framework to support complex reasoning and use gene set analysis as an example to uncover semantic relationships between biological processes. Our framework stands out for its ability to provide a deeper understanding of gene sets, significantly surpassing GSEA by 40.28% and LLM baselines by 5.38% based on cosine similarity to human annotations. Our analysis…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 4 pages. Accepted by Web Conf 2024

  8. arXiv:2403.06898  [pdf, other]

    cs.DB cs.DC

    SFVInt: Simple, Fast and Generic Variable-Length Integer Decoding using Bit Manipulation Instructions

    Authors: Gang Liao, Ye Liu, Yonghua Ding, Le Cai, Jianjun Chen

    Abstract: The ubiquity of variable-length integers in data storage and communication necessitates efficient decoding techniques. In this paper, we present SFVInt, a simple and fast approach to decode the prevalent Little Endian Base-128 (LEB128) varints. Our approach effectively utilizes the Bit Manipulation Instruction Set 2 (BMI2) in modern Intel and AMD processors, achieving significant performance impro…

    Submitted 7 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: DaMoN 2024

  9. arXiv:2403.06766  [pdf, other]

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be…

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  10. arXiv:2403.06660  [pdf, other]

    cs.MM cs.AI cs.MA

    FashionReGen: LLM-Empowered Fashion Report Generation

    Authors: Yujuan Ding, Yunshan Ma, Wenqi Fan, Yige Yao, Tat-Seng Chua, Qing Li

    Abstract: Fashion analysis refers to the process of examining and evaluating trends, styles, and elements within the fashion industry to understand and interpret its current state, generating fashion reports. It is traditionally performed by fashion professionals based on their expertise and experience, which requires high labour cost and may also produce biased results for relying heavily on a small group…

    Submitted 11 March, 2024; originally announced March 2024.

  11. arXiv:2403.06363  [pdf, other]

    cs.CV

    Say Anything with Any Style

    Authors: Shuai Tan, Bin Ji, Yu Ding, Ye Pan

    Abstract: Generating stylized talking head with diverse head motions is crucial for achieving natural-looking videos but still remains challenging. Previous works either adopt a regressive method to capture the speaking style, resulting in a coarse style that is averaged across all training data, or employ a universal network to synthesize videos with different styles which causes suboptimal performance. To…

    Submitted 12 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures, conference

  12. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  13. arXiv:2403.05329  [pdf, other]

    cs.CV

    OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction

    Authors: Ji Zhang, Yiran Ding, Zixin Liu

    Abstract: 3D occupancy prediction based on multi-sensor fusion, crucial for a reliable autonomous driving system, enables fine-grained understanding of 3D scenes. Previous fusion-based 3D occupancy predictions relied on depth estimation for processing 2D image features. However, depth estimation is an ill-posed problem, hindering the accuracy and robustness of these methods. Furthermore, fine-grained occupan…

    Submitted 10 July, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  14. arXiv:2403.03500  [pdf, other]

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  15. arXiv:2403.01829  [pdf, other]

    quant-ph

    OnePerc: A Randomness-aware Compiler for Photonic Quantum Computing

    Authors: Hezi Zhang, Jixuan Ruan, Hassan Shapourian, Ramana Rao Kompella, Yufei Ding

    Abstract: The photonic platform holds great promise for quantum computing. Nevertheless, the intrinsic probabilistic characteristics of its native fusion operations introduce substantial randomness into the computing process, posing significant challenges to achieving scalability and efficiency in program execution. In this paper, we introduce a randomness-aware compilation framework designed to concurrent…

    Submitted 7 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  16. arXiv:2403.01761  [pdf, other]

    hep-ex

    Observation of $ψ(3686)\to 3φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (645 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times 10^9$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of $ψ(3686)\to 3φ$ decay with a significance larger than 10$σ$. The branching fraction of this decay is determined to be $(1.46\pm0.05\pm0.17)\times10^{-5}$, where the first uncertainty is statistical and the second is systematic. No significant str…

    Submitted 4 March, 2024; originally announced March 2024.

  17. arXiv:2403.00327  [pdf, other]

    cs.CV

    Task Indicating Transformer for Task-conditional Dense Predictions

    Authors: Yuxiang Lu, Shalayiding Sirejiding, Bayram Bayramli, Suizhi Huang, Yue Ding, Hongtao Lu

    Abstract: The task-conditional model is a distinctive stream for efficient multi-task learning. Existing works encounter a critical limitation in learning task-agnostic and task-specific representations, primarily due to shortcomings in global context modeling arising from CNN-based architectures, as well as a deficiency in multi-scale feature interaction within the decoder. In this paper, we introduce a no…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted by ICASSP 2024

  18. arXiv:2403.00277  [pdf, other]

    cs.CL

    Gender Bias in Large Language Models across Multiple Languages

    Authors: Jinman Zhao, Yitian Ding, Chen Jia, Yining Wang, Zifan Qian

    Abstract: With the growing deployment of large language models (LLMs) across various applications, assessing the influence of gender biases embedded in LLMs becomes crucial. The topic of gender bias within the realm of natural language processing (NLP) has gained considerable focus, particularly in the context of English. Nonetheless, the investigation of gender bias in languages other than English is still…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 20 pages, 27 tables, 7 figures, submitted to ACL2024

  19. arXiv:2403.00245  [pdf, other]

    cs.CV

    YOLO-MED : Multi-Task Interaction Network for Biomedical Images

    Authors: Suizhi Huang, Shalayiding Sirejiding, Yuxiang Lu, Yue Ding, Leheng Liu, Hui Zhou, Hongtao Lu

    Abstract: Object detection and semantic segmentation are pivotal components in biomedical image analysis. Current single-task networks exhibit promising outcomes in both detection and segmentation tasks. Multi-task networks have gained prominence due to their capability to simultaneously tackle segmentation and detection tasks, while also accelerating the segmentation inference. Nevertheless, recent multi-t…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted by ICASSP 2024

  20. arXiv:2402.18140  [pdf, other]

    cs.CV

    OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction

    Authors: Jian Liu, Sipeng Zhang, Chuixin Kong, Wenyuan Zhang, Yuhang Wu, Yikang Ding, Borun Xu, Ruibo Ming, Donglai Wei, Xianming Liu

    Abstract: This technical report presents our solution, "occTransformer" for the 3D occupancy prediction track in the autonomous driving challenge at CVPR 2023. Our method builds upon the strong baseline BEVFormer and improves its performance through several simple yet effective techniques. Firstly, we employed data augmentation to increase the diversity of the training data and improve the model's generaliz…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Innovation Award in the 3D Occupancy Prediction Challenge (CVPR23)

  21. arXiv:2402.17983  [pdf, other]

    cs.CL cs.CV

    3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding

    Authors: Yihao Ding, Lorenzo Vaiani, Caren Han, Jean Lee, Paolo Garza, Josiah Poon, Luca Cagliero

    Abstract: This paper presents a groundbreaking multimodal, multi-task, multi-teacher joint-grained knowledge distillation model for visually-rich form document understanding. The model is designed to leverage insights from both fine-grained and coarse-grained levels by facilitating a nuanced correlation between token and entity representations, addressing the complexities inherent in form documents. Additio…

    Submitted 26 July, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted at Findings of ACL 2024

  22. arXiv:2402.17315  [pdf]

    cond-mat.supr-con

    Superconducting-transition-temperature dependence of superfluid density and conductivity in pressurized cuprate superconductors

    Authors: Jinyu Zhao, Shu Cai, Yiwen Chen, Genda Gu, Hongtao Yan, Jing Guo, Jinyu Han, Pengyu Wang, Yazhou Zhou, Yanchun Li, Xiaodong Li, Zhian Ren, Qi Wu, Xingjiang Zhou, Yang Ding, Tao Xiang, Ho-kwang Mao, Liling Sun

    Abstract: What factors fundamentally determine the value of superconducting transition temperature (Tc) in high temperature superconductors has been the subject of intense debate. Following the establishment of an empirical law known as Homes' law, there is a growing consensus in the community that the Tc value of the cuprate superconductors is closely linked to its superfluid density and conductivity. Howev…

    Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 11 pages, 3 figures

    Journal ref: Chinese Phys. Lett. 41(2024)047401

  23. arXiv:2402.16602  [pdf, other]

    cs.CL

    Rethinking Negative Instances for Generative Named Entity Recognition

    Authors: Yuyang Ding, Juntao Li, Pinzheng Wang, Zecheng Tang, Bowen Yan, Min Zhang

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities for generalizing in unseen tasks. In the Named Entity Recognition (NER) task, recent advancements have seen the remarkable improvement of LLMs in a broad range of entity domains via instruction tuning, by adopting entity-centric schema. In this work, we explore the potential enhancement of the existing methods by incorporating…

    Submitted 18 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Findings

  24. arXiv:2402.16264  [pdf]

    cond-mat.mes-hall

    Intrinsic supercurrent diode effect in NbSe2 nanobridge

    Authors: Yiwen Zhang, Jiliang Cai, Peng Dong, Jiadian He, Yifan Ding, Jinghui Wang, Xiang Zhou, Kecheng Cao, Yueshen Wu, Jun Li

    Abstract: The significance of the superconducting diode effect lies in its potential application as a fundamental component in the development of next-generation superconducting circuit technology. The stringent operating conditions at low temperatures have posed challenges for the conventional semiconductor diode, primarily due to its exceptionally high resistivity. In response to this limitation, various…

    Submitted 25 February, 2024; originally announced February 2024.

  25. arXiv:2402.16257  [pdf]

    cond-mat.str-el

    A novel method for determining the resistivity of compressed superconducting materials

    Authors: Liling Sun, Qi Wu, Shu Cai, Yang Ding, Ho-kwang Mao

    Abstract: The resistivity of a superconductor in its normal state plays a critical role in determining its superconducting ground state. However, measuring the resistivity of a material under high pressure has long presented a significant technical challenge due to pressure-induced changes in the crystallographic directions, especially for samples with anisotropic layered structures like high-Tc superconduc…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 8 pages, 1 figure

    Journal ref: Matter Radiat. Extremes 9 (2024) 043001

  26. arXiv:2402.15231  [pdf, other]

    cs.LG cs.CV

    Which Model to Transfer? A Survey on Transferability Estimation

    Authors: Yuhe Ding, Bo Jiang, Aijing Yu, Aihua Zheng, Jian Liang

    Abstract: Transfer learning methods endeavor to leverage relevant knowledge from existing source pre-trained models or datasets to solve downstream target tasks. With the increase in the scale and quantity of available pre-trained models nowadays, it becomes critical to assess in advance whether they are suitable for a specific target task. Model transferability estimation is an emerging and growing area of…

    Submitted 23 February, 2024; originally announced February 2024.

  27. arXiv:2402.14858  [pdf, other]

    cs.CL cs.AI

    ChatEL: Entity Linking with Chatbots

    Authors: Yifan Ding, Qingkai Zeng, Tim Weninger

    Abstract: Entity Linking (EL) is an essential and challenging task in natural language processing that seeks to link some text representing an entity within a document or sentence with its corresponding entry in a dictionary or knowledge base. Most existing approaches focus on creating elaborate contextual models that look for clues in the words surrounding the entity-text to help solve the linking problem. Al…

    Submitted 20 February, 2024; originally announced February 2024.

  28. arXiv:2402.13753  [pdf, other]

    cs.CL

    LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

    Authors: Yiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang, Mao Yang

    Abstract: Large context window is a desirable feature in large language models (LLMs). However, due to high fine-tuning costs, scarcity of long texts, and catastrophic values introduced by new token positions, current extended context windows are limited to around 128k tokens. This paper introduces LongRoPE that, for the first time, extends the context window of pre-trained LLMs to an impressive 2048k token…

    Submitted 21 February, 2024; originally announced February 2024.

  29. arXiv:2402.13587  [pdf, other]

    cs.CL cs.CV

    A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation

    Authors: Yunxin Li, Baotian Hu, Wenhan Luo, Lin Ma, Yuxin Ding, Min Zhang

    Abstract: In this paper, we propose a new setting for generating product descriptions from images, augmented by marketing keywords. It leverages the combined power of visual and textual information to create descriptions that are more tailored to the unique features of products. For this setting, previous methods utilize visual and textual encoders to encode the image and keywords and employ a language mode…

    Submitted 7 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024

  30. arXiv:2402.12876  [pdf, other]

    cs.LG cs.CR cs.DC

    Federated Multi-Task Learning on Non-IID Data Silos: An Experimental Study

    Authors: Yuwen Yang, Yuxiang Lu, Suizhi Huang, Shalayiding Sirejiding, Hongtao Lu, Yue Ding

    Abstract: The innovative Federated Multi-Task Learning (FMTL) approach consolidates the benefits of Federated Learning (FL) and Multi-Task Learning (MTL), enabling collaborative model training on multi-task learning datasets. However, a comprehensive evaluation method, integrating the unique features of both FL and MTL, is currently absent in the field. This paper fills this void by introducing a novel fram…

    Submitted 15 April, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted by ICMR'24

  31. arXiv:2402.11960  [pdf, other]

    cs.LG cs.AI cs.CL

    DB-LLM: Accurate Dual-Binarization for Efficient LLMs

    Authors: Hong Chen, Chengtao Lv, Liang Ding, Haotong Qin, Xiabin Zhou, Yifu Ding, Xuebo Liu, Min Zhang, Jinyang Guo, Xianglong Liu, Dacheng Tao

    Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing, while the expensive memory and computation consumption impede their practical deployment. Quantization emerges as one of the most effective methods for improving the computational efficiency of LLMs. However, existing ultra-low-bit quantization always causes severe accuracy drops. In this paper, we e…

    Submitted 19 February, 2024; originally announced February 2024.

  32. arXiv:2402.11566  [pdf, other]

    cs.CV

    Boosting Semi-Supervised 2D Human Pose Estimation by Revisiting Data Augmentation and Consistency Training

    Authors: Huayi Zhou, Mukun Luo, Fei Jiang, Yue Ding, Hongtao Lu

    Abstract: The 2D human pose estimation (HPE) is a basic visual problem. However, its supervised learning requires massive keypoint labels, which is labor-intensive to collect. Thus, we aim at boosting a pose estimator by excavating extra unlabeled data with semi-supervised learning (SSL). Most previous SSHPE methods are consistency-based and strive to maintain consistent outputs for differently augmented in…

    Submitted 7 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 14 pages. Semi-Supervised 2D Human Pose Estimation

  33. arXiv:2402.11550  [pdf, other]

    cs.CL cs.AI

    LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

    Authors: Jun Zhao, Can Zu, Hao Xu, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Large language models (LLMs) have demonstrated impressive performance in understanding language and executing complex reasoning tasks. However, LLMs with long context windows have been notorious for their expensive training costs and high inference latency. Even the most advanced models such as GPT-4 and Claude2 often make mistakes when processing inputs of over $100k$ tokens, a phenomenon also kn…

    Submitted 13 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  34. arXiv:2402.11207  [pdf, ps, other]

    hep-ex

    Search for the production of deuterons and antideuterons in e^+e^- annihilation at center-of-mass energies between 4.13 and 4.70 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (593 additional authors not shown)

    Abstract: Using a data sample of $e^+e^-$ collision data corresponding to an integrated luminosity of 19 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, we search for the production of deuterons and antideuterons via $e^+e^-\to ppπ^-\bar{d}+c.c.$ for the first time at center-of-mass energies between 4.13 and 4.70 GeV. No significant signal is observed and the upper limit of the…

    Submitted 17 February, 2024; originally announced February 2024.

  35. arXiv:2402.10628  [pdf, other]

    cs.IR

    FairSync: Ensuring Amortized Group Exposure in Distributed Recommendation Retrieval

    Authors: Chen Xu, Jun Xu, Yiming Ding, Xiao Zhang, Qi Qi

    Abstract: In pursuit of fairness and balanced development, recommender systems (RS) often prioritize group fairness, ensuring that specific groups maintain a minimum level of exposure over a given period. For example, RS platforms aim to ensure adequate exposure for new providers or specific categories of items according to their needs. Modern industry RS usually adopts a two-stage pipeline: stage-1 (retrie…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted in WWW'24

  36. arXiv:2402.09829  [pdf, ps, other]

    math.NT

    On a conjecture on shifted primes with large prime factors, II

    Authors: Yuchen Ding

    Abstract: Let $\mathcal{P}$ be the set of primes and $π(x)$ the number of primes not exceeding $x$. Let also $P^+(n)$ be the largest prime factor of $n$ with convention $P^+(1)=1$ and $$ T_c(x)=\#\left\{p\le x:p\in \mathcal{P},P^+(p-1)\ge p^c\right\}. $$ Motivated by a 2017 conjecture of Chen and Chen, we show that for any $8/9\le c<1$ $$ \limsup_{x\rightarrow\infty}T_c(x)/π(x)\le 8(1/c-1), $$ which clearly…

    Submitted 27 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: The exposition on the history of this topic is updated in the new version

  37. arXiv:2402.09675  [pdf, other]

    eess.SY

    Repurposing Coal Power Plants into Thermal Energy Storage for Supporting Zero-carbon Data Centers

    Authors: Yifu Ding, Serena Patel, Dharik Mallapragada, Robert James Stoner

    Abstract: Coal power plants will need to be phased out and face stranded asset risks under the net-zero energy system transition. Repurposing coal power plants could recoup profits and reduce carbon emissions using the existing infrastructure and grid connections. This paper investigates a retrofitting strategy that turns coal power plants into thermal energy storage (TES) and zero-carbon data centers (DCs)…

    Submitted 14 February, 2024; originally announced February 2024.

  38. arXiv:2402.08432  [pdf, other]

    physics.optics

    Rhythmic soliton interactions for integrated dual-microcomb spectroscopy

    Authors: Zihao Wang, Yifei Wang, Baoqi Shi, Chen Shen, Wei Sun, Yulei Ding, Changxi Yang, Junqiu Liu, Chengying Bao

    Abstract: Rotation symmetry of microresonators supports the generation of phase-locked counter-propagating (CP) solitons that can potentially miniaturize dual-comb systems. Realization of these dual-comb compatible solitons in photonic integrated circuits remains a challenge. Here, we synthesized such CP solitons in an integrated silicon nitride microresonator and observed forced soliton oscillation due to…

    Submitted 13 February, 2024; originally announced February 2024.

  39. An Analysis of the Recovery Path of the Consumer Sector in the Post-Pandemic Era

    Authors: Wenbo Lyu, Jiayi Zhu, Yunan Ding, Keming Zhang

    Abstract: This paper proposes a referenceable pattern of the recovery of the consumption sector, a new dimension to observe and evaluate the intrinsic value of the consumption sector, and proposes the concept of sensory-based consumption and the ranking of the weights of different categories; creates the concept of digital consumption index, coupled with digital RMB index and China-style digital economy index…

    Submitted 11 February, 2024; originally announced February 2024.

  40. arXiv:2402.06761  [pdf, other]

    cs.LG

    Embedding Compression for Teacher-to-Student Knowledge Transfer

    Authors: Yiwei Ding, Alexander Lerch

    Abstract: Common knowledge distillation methods require the teacher model and the student model to be trained on the same task. However, the usage of embeddings as teachers has also been proposed for different source tasks and target tasks. Prior work that uses embeddings as teachers ignores the fact that the teacher embeddings are likely to contain irrelevant knowledge for the target task. To address this…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 5+1 pages. In ICASSP 2024 Satellite Workshop Deep Neural Network Model Compression

  41. arXiv:2402.06738  [pdf, other]

    cs.CL

    EntGPT: Linking Generative Large Language Models with Knowledge Bases

    Authors: Yifan Ding, Amrit Poudel, Qingkai Zeng, Tim Weninger, Balaji Veeramani, Sanmitra Bhattacharya

    Abstract: The ability of Large Language Models (LLMs) to generate factually correct output remains relatively unexplored due to the lack of fact-checking and knowledge grounding during training and inference. In this work, we aim to address this challenge through the Entity Disambiguation (ED) task. We first consider prompt engineering, and design a three-step hard-prompting method to probe LLMs' ED perform…

    Submitted 9 February, 2024; originally announced February 2024.

  42. arXiv:2402.05808  [pdf, other]

    cs.AI cs.CL cs.LG

    Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

    Authors: Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: In this paper, we propose R$^3$: Learning Reasoning through Reverse Curriculum Reinforcement Learning (RL), a novel method that employs only outcome supervision to achieve the benefits of process supervision for large language models. The core challenge in applying RL to complex reasoning is to identify a sequence of actions that result in positive rewards and provide appropriate supervision for o…

    Submitted 17 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Preprint. Codes released: https://github.com/WooooDyy/LLM-Reverse-Curriculum-RL

  43. arXiv:2402.05383  [pdf, other]

    nucl-ex hep-ex

    First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

    Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, et al. (177 additional authors not shown)

    Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546…

    Submitted 7 February, 2024; originally announced February 2024.

  44. arXiv:2402.04631  [pdf, other]

    cs.CL

    The Future of Cognitive Strategy-enhanced Persuasive Dialogue Agents: New Perspectives and Trends

    Authors: Mengqi Chen, Bin Guo, Hao Wang, Haoyu Li, Qian Zhao, Jingqi Liu, Yasan Ding, Yan Pan, Zhiwen Yu

    Abstract: Persuasion, as one of the crucial abilities in human communication, has garnered extensive attention from researchers within the field of intelligent dialogue systems. We humans tend to persuade others to change their viewpoints, attitudes or behaviors through conversations in various scenarios (e.g., persuasion for social good, arguing in online platforms). Developing dialogue agents that can per…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 36 pages, 6 figures

  45. Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ at $\sqrt{s} = 3.80-4.95$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (604 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. Many clear peaks in the line shape of…

    Submitted 22 August, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 3 figures, 1 Supplemental Material, consistent with the publication in Phys. Rev. Lett. 133 (2024) 081901

    Journal ref: Phys. Rev. Lett. 133 (2024) 081901

  46. arXiv:2402.03817   

    eess.SY

    Improvement of Frequency Source Phase Noise Reduction Design under Vibration Condition

    Authors: Liwei Yin, Yongjiang Shu, Heng Zhang, Yuefei Dai, Xiaopeng Lu, Yunlong Lian, Zhonghua Wang, Yong Ding

    Abstract: Reasonable vibration reduction design is an important way to achieve low phase noise index of airborne frequency source output signal. Aiming at the problem of phase noise deterioration of an airborne frequency source under random condition, this paper proposes to improve the vibration reduction mode crystal oscillator and reduce the distance between the barycenter of frequency source and crystal…

    Submitted 16 July, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: There are many errors. 1. Fig. 2 Block Diagram of Frequency Source Circuit is not correct. 2. C-band C1 signal 6000MHz continuous wave signal is error. 3. Fig. 4 Steady State Phase Noise and Spectrum of 2400MHz before Improvement is error. 4. Table 1 Steady State Phase Noise at each Frequency Point of the Output of the Frequency Source before Improvement is error. 5. Frequency range is error

    MSC Class: D.3.2 ACM Class: B.6.2

  47. arXiv:2402.03771  [pdf, other]

    cs.LG

    Reinforcement Learning from Bagged Reward

    Authors: Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama

    Abstract: In Reinforcement Learning (RL), it is commonly assumed that an immediate reward signal is generated for each action taken by the agent, helping the agent maximize cumulative rewards to obtain the optimal policy. However, in many real-world scenarios, immediate reward signals are not obtainable; instead, agents receive a single reward that is contingent upon a partial sequence or a complete traject…

    Submitted 27 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  48. arXiv:2402.02075  [pdf, other]

    physics.comp-ph

    A Compact Gas-Kinetic Scheme with Scalable Geometric Multigrid Acceleration for Steady-State Computation on 3D Unstructured Meshes

    Authors: Hongyu Liu, Xing Ji, Yunpeng Mao, Yuan Ding, Kun Xu

    Abstract: In this paper, we present an advanced high-order compact gas-kinetic scheme (CGKS) for 3D unstructured mixed-element meshes, augmented with a geometric multigrid technique to accelerate steady-state convergence. The scheme evolves cell-averaged flow variables and their gradients on the original mesh. Mesh coarsening employs a two-step parallel agglomeration algorithm using a random hash for cell i…

    Submitted 3 February, 2024; originally announced February 2024.

  49. arXiv:2402.01993  [pdf, other]

    hep-ex

    Measurement of the Electromagnetic Transition Form-factors in the decays $η'\rightarrowπ^+π^-l^+l^-$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (618 additional authors not shown)

    Abstract: With a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η'\rightarrowπ^+π^-l^+l^-(l=e,$ $μ)$ via the process $J/ψ\rightarrowγη'$. The branching fractions are measured to be $\mathcal{B}(η'\rightarrowπ^+π^-e^+e^-)=(2.45\pm0.02(\rm{stat.})\pm0.08(\rm{syst.})) \times10^{-3}$ and…

    Submitted 2 February, 2024; originally announced February 2024.

  50. arXiv:2402.01807  [pdf, other]

    cs.CR

    AOC-IDS: Autonomous Online Framework with Contrastive Learning for Intrusion Detection

    Authors: Xinchen Zhang, Running Zhao, Zhihan Jiang, Zhicong Sun, Yulong Ding, Edith C. H. Ngai, Shuang-Hua Yang

    Abstract: The rapid expansion of the Internet of Things (IoT) has raised increasing concern about targeted cyber attacks. Previous research primarily focused on static Intrusion Detection Systems (IDSs), which employ offline training to safeguard IoT systems. However, such static IDSs struggle with real-world scenarios where IoT system behaviors and attack strategies can undergo rapid evolution, necessitati…

    Submitted 2 February, 2024; originally announced February 2024.