Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Yu, Z Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.06176  [pdf

    cs.CL cs.AI cs.LG

    Fine-tuning Language Models with Generative Adversarial Reward Modelling

    Authors: Zhang Ze Yu, Lau Jia Jaw, Zhang Hui, Bryan Kian Hsiang Low

    Abstract: Reinforcement Learning with Human Feedback (RLHF) has been demonstrated to significantly enhance the performance of large language models (LLMs) by aligning their outputs with desired human values through instruction tuning. However, RLHF is constrained by the expertise and productivity limitations of human evaluators. A response to this downside is to fall back to supervised fine-tuning (SFT) wit… ▽ More

    Submitted 5 March, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 22 pages, 9 figures, 12 tables

  2. arXiv:2305.05953  [pdf

    quant-ph cs.DS cs.ET

    Novel Quantum Information Processing Methods and Investigation

    Authors: Zhang Ze Yu

    Abstract: Quantum information processing and its subfield, quantum image processing, are rapidly growing fields as a result of advancements in the practicality of quantum mechanics. In this paper, we propose a quantum algorithm for processing information, such as one-dimensional time series and two-dimensional images, in the frequency domain. The information of interest is encoded into the magnitude of prob… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 12 pages, 53 figures