Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Zhang-Li, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19226  [pdf, other

    cs.CL cs.HC

    Simulating Classroom Education with LLM-Empowered Agents

    Authors: Zheyuan Zhang, Daniel Zhang-Li, Jifan Yu, Linlu Gong, Jinchang Zhou, Zhiyuan Liu, Lei Hou, Juanzi Li

    Abstract: Large language models (LLMs) have been employed in various intelligent educational tasks to assist teaching. While preliminary explorations have focused on independent LLM-empowered agents for specific educational tasks, the potential for LLMs within a multi-agent collaborative framework to simulate a classroom with real user participation remains unexplored. In this work, we propose SimClass, a m… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2403.19318  [pdf, other

    cs.CL

    TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

    Authors: Xiaokang Zhang, Jing Zhang, Zeyao Ma, Yang Li, Bohan Zhang, Guanlin Li, Zijun Yao, Kangli Xu, Jinchang Zhou, Daniel Zhang-Li, Jifan Yu, Shu Zhao, Juanzi Li, Jie Tang

    Abstract: We introduce TableLLM, a robust large language model (LLM) with 13 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to real-world office scenarios. We propose a distant supervision method for training, which comprises a reasoning process extension strategy, aiding in training LLMs to un… ▽ More

    Submitted 1 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: https://tablellm.github.io/

  3. arXiv:2403.05845  [pdf, other

    cs.CL cs.AI

    Reverse That Number! Decoding Order Matters in Arithmetic Learning

    Authors: Daniel Zhang-Li, Nianyi Lin, Jifan Yu, Zheyuan Zhang, Zijun Yao, Xiaokang Zhang, Lei Hou, Jing Zhang, Juanzi Li

    Abstract: Recent advancements in pretraining have demonstrated that modern Large Language Models (LLMs) possess the capability to effectively learn arithmetic operations. However, despite acknowledging the significance of digit order in arithmetic computation, current methodologies predominantly rely on sequential, step-by-step approaches for teaching LLMs arithmetic, resulting in a conclusion where obtaini… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  4. arXiv:2306.09296  [pdf, other

    cs.CL

    KoLA: Carefully Benchmarking World Knowledge of Large Language Models

    Authors: Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-Li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi , et al. (10 additional authors not shown)

    Abstract: The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations. Rather than merely exploring the breadth of LLM abilities, we believe meticulous and thoughtful designs are essential to thorough, unbiased, and applicable evaluations. Given the importance of world knowledge to LLMs, we construct a Knowledge-oriented LLM Assessment benchmark (KoLA), in which we… ▽ More

    Submitted 30 June, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted by ICLR 2024

  5. arXiv:2302.14401  [pdf, other

    cs.CL cs.AI

    GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation

    Authors: Jing Zhang, Xiaokang Zhang, Daniel Zhang-Li, Jifan Yu, Zijun Yao, Zeyao Ma, Yiqi Xu, Haohua Wang, Xiaohan Zhang, Nianyi Lin, Sunrui Lu, Juanzi Li, Jie Tang

    Abstract: We present GLM-Dialog, a large-scale language model (LLM) with 10B parameters capable of knowledge-grounded conversation in Chinese using a search engine to access the Internet knowledge. GLM-Dialog offers a series of applicable techniques for exploiting various external knowledge including both helpful and noisy knowledge, enabling the creation of robust knowledge-grounded dialogue LLMs with limi… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.