Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Ruan, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16546  [pdf, other

    cs.IR cs.CL

    Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration

    Authors: Sunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen

    Abstract: The proliferation of Large Language Models (LLMs) has led to an influx of AI-generated content (AIGC) on the internet, transforming the corpus of Information Retrieval (IR) systems from solely human-written to a coexistence with LLM-generated content. The impact of this surge in AIGC on IR systems remains an open question, with the primary challenge being the lack of a dedicated benchmark for rese… ▽ More

    Submitted 2 July, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted by Findings of ACL 2024; Datasets Link: https://huggingface.co/IR-Cocktail

  2. arXiv:2405.11430  [pdf, other

    cs.CL

    MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation

    Authors: Jianbo Dai, Jianqiao Lu, Yunlong Feng, Rongju Ruan, Ming Cheng, Haochen Tan, Zhijiang Guo

    Abstract: Recent advancements in large language models (LLMs) have greatly improved code generation, specifically at the function level. For instance, GPT-4 has achieved an 88.4% pass rate on HumanEval. However, this draws into question the adequacy of existing benchmarks in thoroughly assessing function-level code generation capabilities. Our study analyzed two common benchmarks, HumanEval and MBPP, and fo… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 39 pages, dataset and code are available at https://github.com/SparksofAGI/MHPP

  3. arXiv:2306.05987  [pdf, other

    q-fin.ST cs.LG stat.ML

    Liquidity takers behavior representation through a contrastive learning approach

    Authors: Ruihua Ruan, Emmanuel Bacry, Jean-François Muzy

    Abstract: Thanks to the access to the labeled orders on the CAC40 data from Euronext, we are able to analyze agents' behaviors in the market based on their placed orders. In this study, we construct a self-supervised learning model using triplet loss to effectively learn the representation of agent market orders. By acquiring this learned representation, various downstream tasks become feasible. In this wor… ▽ More

    Submitted 10 July, 2023; v1 submitted 9 June, 2023; originally announced June 2023.