Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Jiayang, C

.
  1. arXiv:2408.08067  [pdf, other

    cs.CL cs.AI

    RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

    Authors: Dongyu Ru, Lin Qiu, Xiangkun Hu, Tianhang Zhang, Peng Shi, Shuaichen Chang, Cheng Jiayang, Cunxiang Wang, Shichao Sun, Huanyu Li, Zizhao Zhang, Binjie Wang, Jiarong Jiang, Tong He, Zhiguo Wang, Pengfei Liu, Yue Zhang, Zheng Zhang

    Abstract: Despite Retrieval-Augmented Generation (RAG) showing promising capability in leveraging external knowledge, a comprehensive evaluation of RAG systems is still challenging due to the modular nature of RAG, evaluation of long-form responses and reliability of measurements. In this paper, we propose a fine-grained evaluation framework, RAGChecker, that incorporates a suite of diagnostic metrics for b… ▽ More

    Submitted 16 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: Under Review. Github Repo: https://github.com/amazon-science/RAGChecker

  2. arXiv:2408.06941  [pdf, other

    cs.IR

    OpenResearcher: Unleashing AI for Accelerated Scientific Research

    Authors: Yuxiang Zheng, Shichao Sun, Lin Qiu, Dongyu Ru, Cheng Jiayang, Xuefeng Li, Jifan Lin, Binjie Wang, Yun Luo, Renjie Pan, Yang Xu, Qingkai Min, Zizhao Zhang, Yiwen Wang, Wenjie Li, Pengfei Liu

    Abstract: The rapid growth of scientific literature imposes significant challenges for researchers endeavoring to stay updated with the latest advancements in their fields and delve into new areas. We introduce OpenResearcher, an innovative platform that leverages Artificial Intelligence (AI) techniques to accelerate the research process by answering diverse questions from researchers. OpenResearcher is bui… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  3. arXiv:2406.11375  [pdf, other

    cs.CL cs.AI

    Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models?

    Authors: Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang

    Abstract: Analogical reasoning plays a critical role in human cognition, enabling us to understand new concepts by associating them with familiar ones. Previous research in the AI community has mainly focused on identifying and generating analogies and then examining their quality under human evaluation, which overlooks the practical application of these analogies in real-world settings. Inspired by the hum… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2404.13627  [pdf, other

    cs.CL cs.AI

    NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding

    Authors: Chunkit Chan, Cheng Jiayang, Yauwai Yim, Zheye Deng, Wei Fan, Haoran Li, Xin Liu, Hongming Zhang, Weiqi Wang, Yangqiu Song

    Abstract: Large Language Models (LLMs) have sparked substantial interest and debate concerning their potential emergence of Theory of Mind (ToM) ability. Theory of mind evaluations currently focuses on testing models using machine-generated data or game settings prone to shortcuts and spurious correlations, which lacks evaluation of machine ToM ability in real-world human interaction scenarios. This poses a… ▽ More

    Submitted 4 July, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: Dataset: https://github.com/HKUST-KnowComp/NegotiationToM

  5. arXiv:2404.00209  [pdf, other

    cs.CL

    EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs

    Authors: Cheng Jiayang, Lin Qiu, Chunkit Chan, Xin Liu, Yangqiu Song, Zheng Zhang

    Abstract: Narrative reasoning relies on the understanding of eventualities in story contexts, which requires a wealth of background world knowledge. To help machines leverage such knowledge, existing solutions can be categorized into two groups. Some focus on implicitly modeling eventuality knowledge by pretraining language models (LMs) with eventuality-aware objectives. However, this approach breaks down k… ▽ More

    Submitted 7 July, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

  6. arXiv:2310.12874  [pdf, other

    cs.CL

    StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding

    Authors: Cheng Jiayang, Lin Qiu, Tsz Ho Chan, Tianqing Fang, Weiqi Wang, Chunkit Chan, Dongyu Ru, Qipeng Guo, Hongming Zhang, Yangqiu Song, Yue Zhang, Zheng Zhang

    Abstract: Analogy-making between narratives is crucial for human reasoning. In this paper, we evaluate the ability to identify and generate analogies by constructing a first-of-its-kind large-scale story-level analogy corpus, \textsc{StoryAnalogy}, which contains 24K story pairs from diverse domains with human annotations on two similarities from the extended Structure-Mapping Theory. We design a set of tes… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 main conference

  7. arXiv:2310.07521  [pdf, other

    cs.CL

    Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

    Authors: Cunxiang Wang, Xiaoze Liu, Yuanhao Yue, Xiangru Tang, Tianhang Zhang, Cheng Jiayang, Yunzhi Yao, Wenyang Gao, Xuming Hu, Zehan Qi, Yidong Wang, Linyi Yang, Jindong Wang, Xing Xie, Zheng Zhang, Yue Zhang

    Abstract: This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As LLMs find applications across diverse domains, the reliability and accuracy of their outputs become vital. We define the Factuality Issue as the probability of LLMs to produce content inconsistent with established facts. We first delve into the implications of these inaccuracies, highlighting the potential co… ▽ More

    Submitted 16 December, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 62 pages; 300+ references