Showing 1–5 of 5 results for author: Tam, K Y

  1. arXiv:2403.14341  [pdf, other]

    cs.CL

    Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives

    Authors: Jiaxin Liu, Yi Yang, Kar Yan Tam

    Abstract: In this paper, we introduce the Financial-STS task, a financial domain-specific NLP task designed to measure the nuanced semantic similarity between pairs of financial narratives. These narratives originate from the financial statements of the same company but correspond to different periods, such as year-over-year comparisons. Measuring the subtle semantic differences between these paired narrati…

    Submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2402.09733  [pdf, other]

    cs.CL

    Do LLMs Know about Hallucination? An Empirical Investigation of LLM's Hidden States

    Authors: Hanyu Duan, Yi Yang, Kar Yan Tam

    Abstract: Large Language Models (LLMs) can make up answers that are not real, and this is known as hallucination. This research aims to see if, how, and to what extent LLMs are aware of hallucination. More specifically, we check whether and how an LLM reacts differently in its hidden states when it answers a question right versus when it hallucinates. To do this, we introduce an experimental framework which…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 9 pages, 8 figures, 2 tables (13 pages, 12 figures, 13 tables including references and appendices)

  3. arXiv:2311.10395  [pdf, other]

    cs.CL

    Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads

    Authors: Yi Yang, Hanyu Duan, Ahmed Abbasi, John P. Lalor, Kar Yan Tam

    Abstract: Transformer-based pretrained large language models (PLM) such as BERT and GPT have achieved remarkable success in NLP tasks. However, PLMs are prone to encoding stereotypical biases. Although a burgeoning literature has emerged on stereotypical bias mitigation in PLMs, such as work on debiasing gender and racial stereotyping, how such biases manifest and behave internally within PLMs remains large…

    Submitted 15 June, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: 14 pages, 7 figures, 3 tables including references and appendices

  4. arXiv:2311.10367  [pdf, other]

    cs.CL

    Exploring the Relationship between In-Context Learning and Instruction Tuning

    Authors: Hanyu Duan, Yixuan Tang, Yi Yang, Ahmed Abbasi, Kar Yan Tam

    Abstract: In-Context Learning (ICL) and Instruction Tuning (IT) are two primary paradigms of adopting Large Language Models (LLMs) to downstream applications. However, they are significantly different. In ICL, a set of demonstrations are provided at inference time but the LLM's parameters are not updated. In IT, a set of demonstrations are used to tune LLM's parameters in training time but no demonstrations…

    Submitted 17 November, 2023; originally announced November 2023.

  5. arXiv:2309.13064  [pdf, other]

    q-fin.GN cs.AI cs.CL cs.LG

    InvestLM: A Large Language Model for Investment using Financial Domain Instruction Tuning

    Authors: Yi Yang, Yixuan Tang, Kar Yan Tam

    Abstract: We present a new financial domain large language model, InvestLM, tuned on LLaMA-65B (Touvron et al., 2023), using a carefully curated instruction dataset related to financial investment. Inspired by less-is-more-for-alignment (Zhou et al., 2023), we manually curate a small yet diverse instruction dataset, covering a wide range of financial related topics, from Chartered Financial Analyst (CFA) ex…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: https://github.com/AbaciNLP/InvestLM