Showing 1–20 of 20 results for author: Zhan, M

Searching in archive cs.
  1. arXiv:2407.00782  [pdf, other]

    cs.CL

    Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning

    Authors: Zimu Lu, Aojun Zhou, Ke Wang, Houxing Ren, Weikang Shi, Junting Pan, Mingjie Zhan, Hongsheng Li

    Abstract: Direct Preference Optimization (DPO) has proven effective at improving the performance of large language models (LLMs) on downstream tasks such as reasoning and alignment. In this work, we propose Step-Controlled DPO (SCDPO), a method for automatically providing stepwise error supervision by creating negative samples of mathematical reasoning rationales that start making errors at a specified step…

    Submitted 14 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2405.17103  [pdf, other]

    cs.CL cs.AI

    Empowering Character-level Text Infilling by Eliminating Sub-Tokens

    Authors: Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Hongsheng Li

    Abstract: In infilling tasks, sub-tokens, representing instances where a complete token is segmented into two parts, often emerge at the boundaries of prefixes, middles, and suffixes. Traditional methods focused on training models at the token level, leading to sub-optimal performance in character-level infilling tasks during the inference stage. Alternately, some approaches considered character-level infil…

    Submitted 14 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024 (main conference)

  3. arXiv:2405.17057  [pdf, other]

    cs.CL cs.AI

    ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation

    Authors: Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Aojun Zhou, Junting Pan, Hongsheng Li

    Abstract: Code generation plays a crucial role in various tasks, such as code auto-completion and mathematical reasoning. Previous work has proposed numerous methods to enhance code generation performance, including integrating feedback from the compiler. Inspired by this, we present ReflectionCoder, a novel approach that effectively leverages reflection sequences constructed by integrating compiler feedbac…

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2402.16352  [pdf, other]

    cs.CL cs.AI

    MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs

    Authors: Zimu Lu, Aojun Zhou, Houxing Ren, Ke Wang, Weikang Shi, Junting Pan, Mingjie Zhan, Hongsheng Li

    Abstract: Large language models (LLMs) have exhibited great potential in mathematical reasoning. However, there remains a performance gap in this area between existing open-source models and closed-source models such as GPT-4. In this paper, we introduce MathGenie, a novel method for generating diverse and reliable math problems from a small-scale problem-solution dataset (denoted as seed data). We augment…

    Submitted 26 February, 2024; originally announced February 2024.

  5. arXiv:2402.14804  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG math.HO

    Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset

    Authors: Ke Wang, Junting Pan, Weikang Shi, Zimu Lu, Mingjie Zhan, Hongsheng Li

    Abstract: Recent advancements in Large Multimodal Models (LMMs) have shown promising results in mathematical reasoning within visual contexts, with models approaching human-level performance on existing benchmarks such as MathVista. However, we observe significant limitations in the diversity of questions and breadth of subjects covered by these benchmarks. To address this issue, we present the MATH-Vision…

    Submitted 22 February, 2024; originally announced February 2024.

  6. arXiv:2401.13870  [pdf, other]

    cs.IR

    Integrating Large Language Models into Recommendation via Mutual Augmentation and Adaptive Aggregation

    Authors: Sichun Luo, Yuxuan Yao, Bowei He, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Mingjie Zhan, Linqi Song

    Abstract: Conventional recommendation methods have achieved notable advancements by harnessing collaborative or sequential information from user behavior. Recently, large language models (LLMs) have gained prominence for their capabilities in understanding and reasoning over textual semantics, and have found utility in various domains, including recommendation. Conventional recommendation methods and LLMs e…

    Submitted 24 January, 2024; originally announced January 2024.

  7. arXiv:2312.16018  [pdf, other]

    cs.IR

    RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation

    Authors: Sichun Luo, Bowei He, Haohan Zhao, Wei Shao, Yanlin Qi, Yinya Huang, Aojun Zhou, Yuxuan Yao, Zongpeng Li, Yuanzhang Xiao, Mingjie Zhan, Linqi Song

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities and have been extensively deployed across various domains, including recommender systems. Prior research has employed specialized prompts to leverage the in-context learning capabilities of LLMs for recommendation purposes. More recent studies have utilized instruction tuning techniques to align LLMs with human prefere…

    Submitted 31 March, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  8. arXiv:2312.00823  [pdf, other]

    cs.LG cs.AI cs.CV

    Adaptive Multi-Modality Prompt Learning

    Authors: Zongqian Wu, Yujing Liu, Mengmeng Zhan, Jialie Shen, Ping Hu, Xiaofeng Zhu

    Abstract: Although current prompt learning methods have successfully been designed to effectively reuse the large pre-trained models without fine-tuning their large number of parameters, they still have limitations to be addressed, i.e., without considering the adverse impact of meaningless patches in every image and without simultaneously considering in-sample generalization and out-of-sample generalizatio…

    Submitted 30 November, 2023; originally announced December 2023.

  9. arXiv:2310.19019  [pdf, other]

    cs.CL cs.AI

    TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

    Authors: Nan He, Hanyu Lai, Chenyang Zhao, Zirui Cheng, Junting Pan, Ruoyu Qin, Ruofan Lu, Rui Lu, Yunchen Zhang, Gangming Zhao, Zhaohui Hou, Zhiyuan Huang, Shaoqing Lu, Ding Liang, Mingjie Zhan

    Abstract: Large Language Models (LLMs) exhibit impressive reasoning and data augmentation capabilities in various NLP tasks. However, what about small models? In this work, we propose TeacherLM-7.1B, capable of annotating relevant fundamentals, chain of thought, and common mistakes for most NLP samples, which makes annotation more than just an answer, thus allowing other models to learn "why" instead of jus…

    Submitted 15 July, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 5 figures, 15 pages

  10. arXiv:2310.03731  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

    Authors: Ke Wang, Houxing Ren, Aojun Zhou, Zimu Lu, Sichun Luo, Weikang Shi, Renrui Zhang, Linqi Song, Mingjie Zhan, Hongsheng Li

    Abstract: The recently released GPT-4 Code Interpreter has demonstrated remarkable proficiency in solving challenging math problems, primarily attributed to its ability to seamlessly reason with natural language, generate code, execute code, and continue reasoning based on the execution output. In this paper, we present a method to fine-tune open-source language models, enabling them to use code for modelin…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: The state-of-the-art open-source language models for mathematical reasoning

  11. arXiv:2308.07921  [pdf, other]

    cs.CL cs.AI cs.CV

    Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

    Authors: Aojun Zhou, Ke Wang, Zimu Lu, Weikang Shi, Sichun Luo, Zipeng Qin, Shaoqing Lu, Anya Jia, Linqi Song, Mingjie Zhan, Hongsheng Li

    Abstract: Recent progress in large language models (LLMs) like GPT-4 and PaLM-2 has brought significant advancements in addressing math reasoning problems. In particular, OpenAI's latest version of GPT-4, known as GPT-4 Code Interpreter, shows remarkable performance on challenging math datasets. In this paper, we explore the effect of code on enhancing LLMs' reasoning capability by introducing different con…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

  12. arXiv:2305.14374  [pdf, other]

    cs.LG eess.SY

    Inferring Attracting Basins of Power System with Machine Learning

    Authors: Yao Du, Qing Li, Huawei Fan, Meng Zhan, Jinghua Xiao, Xingang Wang

    Abstract: Power systems dominated by renewable energy encounter frequently large, random disturbances, and a critical challenge faced in power-system management is how to anticipate accurately whether the perturbed systems will return to the functional state after the transient or collapse. Whereas model-based studies show that the key to addressing the challenge lies in the attracting basins of the functio…

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: 13 pages, 7 figures

  13. arXiv:2305.07988  [pdf, other]

    cs.CL

    Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts

    Authors: Haochen Tan, Han Wu, Wei Shao, Xinyun Zhang, Mingjie Zhan, Zhaohui Hou, Ding Liang, Linqi Song

    Abstract: Meetings typically involve multiple participants and lengthy conversations, resulting in redundant and trivial content. To overcome these challenges, we propose a two-step framework, Reconstruct before Summarize (RbS), for effective and efficient meeting summarization. RbS first leverages a self-supervised paradigm to annotate essential contents by reconstructing the meeting transcripts. Secondly,…

    Submitted 22 October, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 main conference

  14. arXiv:2305.05280  [pdf, other]

    cs.CL cs.AI

    VCSUM: A Versatile Chinese Meeting Summarization Dataset

    Authors: Han Wu, Mingjie Zhan, Haochen Tan, Zhaohui Hou, Ding Liang, Linqi Song

    Abstract: Compared to news and chat summarization, the development of meeting summarization is hugely decelerated by the limited data. To this end, we introduce a versatile Chinese meeting summarization dataset, dubbed VCSum, consisting of 239 real-life meetings, with a total duration of over 230 hours. We claim our dataset is versatile because we provide the annotations of topic segmentation, headlines, se…

    Submitted 15 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023 (long paper). GitHub: https://github.com/hahahawu/VCSum

  15. arXiv:2205.14583  [pdf, other]

    cs.CL cs.IR cs.LG

    Learning Locality and Isotropy in Dialogue Modeling

    Authors: Han Wu, Haochen Tan, Mingjie Zhan, Gangming Zhao, Shaoqing Lu, Ding Liang, Linqi Song

    Abstract: Existing dialogue modeling methods have achieved promising performance on various dialogue tasks with the aid of Transformer and the large-scale pre-trained language models. However, some recent studies revealed that the context representations produced by these methods suffer the problem of anisotropy. In this paper, we find that the generated representations are also not conversational, losing t…

    Submitted 29 January, 2023; v1 submitted 29 May, 2022; originally announced May 2022.

    Comments: To appear in ICLR 2023

  16. arXiv:2203.00770  [pdf]

    cs.IT eess.SP

    Short-Packet Interleaver against Impulse Interference in Practical Industrial Environments

    Authors: Ming Zhan, Zhibo Pang, Dacfey Dzung, Kan Yu, Ming Xiao

    Abstract: The most common cause of transmission failure in Wireless High Performance (WirelessHP) target industry environments is impulse interference. As interleavers are commonly used to improve the reliability on the Orthogonal Frequency Division Multiplexing (OFDM) symbol level for long packet transmission, this paper considers the feasibility of applying short-packet bit interleaving to enhance the imp…

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: 14 pages, 12 figures, submitted to IEEE Transactions on Wireless Communications

  17. arXiv:2111.00695  [pdf]

    cs.IT

    Noise Error Pattern Generation Based on Successive Addition-Subtraction for Guessing Decoding

    Authors: Ming Zhan, Zhibo Pang, Kan Yu, Jing Xu, Fang Wu

    Abstract: Guessing random additive noise decoding (GRAND) algorithm has emerged as an excellent decoding strategy that can meet both the high reliability and low latency constraints. This paper proposes a successive addition-subtraction algorithm to generate noise error permutations. A noise error patterns generation scheme is presented by embedding the "1" and "0" bursts alternately. Then detailed procedur…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 6 pages, 7 figures, submitted to IEEE Communications Letters

  18. arXiv:2105.04650  [pdf, other]

    cs.CL cs.AI

    GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding

    Authors: Zilong Wang, Mingjie Zhan, Houxing Ren, Zhaohui Hou, Yuwei Wu, Xingyan Zhang, Ding Liang

    Abstract: Forms are a common type of document in real life and carry rich information through textual contents and the organizational structure. To realize automatic processing of forms, word grouping and relation extraction are two fundamental and crucial steps after preliminary processing of optical character reader (OCR). Word grouping is to aggregate words that belong to the same semantic entity, and re…

    Submitted 10 May, 2021; originally announced May 2021.

  19. arXiv:2010.11685  [pdf, other]

    cs.CV cs.AI

    DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding

    Authors: Zilong Wang, Mingjie Zhan, Xuebo Liu, Ding Liang

    Abstract: Form understanding depends on both textual contents and organizational structure. Although modern OCR performs well, it is still challenging to realize general form understanding because forms are commonly used and of various formats. The table detection and handcrafted features in previous works cannot apply to all forms because of their requirements on formats. Therefore, we concentrate on the m…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted to EMNLP 2020 Findings

  20. arXiv:1905.07321  [pdf, other]

    cs.CL

    Availability-Based Production Predicts Speakers' Real-time Choices of Mandarin Classifiers

    Authors: Meilin Zhan, Roger Levy

    Abstract: Speakers often face choices as to how to structure their intended message into an utterance. Here we investigate the influence of contextual predictability on the encoding of linguistic content manifested by speaker choice in a classifier language. In English, a numeral modifies a noun directly (e.g., three computers). In classifier languages such as Mandarin Chinese, it is obligatory to use a cla…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: To appear in proceedings of CogSci 2019