Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Finlayson, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08001  [pdf, other

    cs.CL cs.AI cs.IR

    Automated Neural Patent Landscaping in the Small Data Regime

    Authors: Tisa Islam Erana, Mark A. Finlayson

    Abstract: Patent landscaping is the process of identifying all patents related to a particular technological area, and is important for assessing various aspects of the intellectual property context. Traditionally, constructing patent landscapes is intensely laborious and expensive, and the rapid expansion of patenting activity in recent decades has driven an increasing need for efficient and effective auto… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 11 pages, 4 figures

  2. arXiv:2406.16838  [pdf, other

    cs.CL cs.LG

    From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

    Authors: Sean Welleck, Amanda Bertsch, Matthew Finlayson, Hailey Schoelkopf, Alex Xie, Graham Neubig, Ilia Kulikov, Zaid Harchaoui

    Abstract: One of the most striking findings in modern research on large language models (LLMs) is that scaling up compute during training leads to better results. However, less attention has been given to the benefits of scaling compute during inference. This survey focuses on these inference-time approaches. We explore three areas under a unified mathematical formalism: token-level generation algorithms, m… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.05265  [pdf, other

    cs.CL cs.AI cs.IR

    TLEX: An Efficient Method for Extracting Exact Timelines from TimeML Temporal Graphs

    Authors: Mustafa Ocal, Ning Xie, Mark Finlayson

    Abstract: A timeline provides a total ordering of events and times, and is useful for a number of natural language understanding tasks. However, qualitative temporal graphs that can be derived directly from text -- such as TimeML annotations -- usually explicitly reveal only partial orderings of events and times. In this work, we apply prior work on solving point algebra problems to the task of extracting t… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 25 pages, 9 figures

  4. arXiv:2403.09539  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Logits of API-Protected LLMs Leak Proprietary Information

    Authors: Matthew Finlayson, Xiang Ren, Swabha Swayamdipta

    Abstract: The commercialization of large language models (LLMs) has led to the common practice of high-level API-only access to proprietary models. In this work, we show that even with a conservative assumption about the model architecture, it is possible to learn a surprisingly large amount of non-public information about an API-protected LLM from a relatively small number of API queries (e.g., costing und… ▽ More

    Submitted 14 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  5. arXiv:2312.02810  [pdf

    cs.SI physics.soc-ph

    Using the SP!CE Framework to Code Influence Campaign Activity on Social Media: Case Study on the 2022 Brazilian Presidential Election

    Authors: Alexander Gocso, Claudia Perez Brito, Bryan Ruesca, Allen Mendes, Mark A. Finlayson

    Abstract: We describe a case study in the use of the Structured Process for Information Campaign Enhancement (SP!CE, version 2.1) to evaluate influence campaigns present in the 2nd round of the Brazilian presidential election in 2022 October. SP!CE is a US-military focused framework for describing both friendly and adversary actions in influence campaigns, and is inter-operable with the Disinformation Analy… ▽ More

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 23 pages, 34 figures, 1 table

  6. arXiv:2310.01693  [pdf, other

    cs.CL

    Closing the Curious Case of Neural Text Degeneration

    Authors: Matthew Finlayson, John Hewitt, Alexander Koller, Swabha Swayamdipta, Ashish Sabharwal

    Abstract: Despite their ubiquity in language generation, it remains unknown why truncation sampling heuristics like nucleus sampling are so effective. We provide a theoretical explanation for the effectiveness of the truncation sampling by proving that truncation methods that discard tokens below some probability threshold (the most common type of truncation) can guarantee that all sampled tokens have nonze… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    MSC Class: 68T50 ACM Class: I.2.7

  7. arXiv:2305.14596  [pdf, other

    cs.CL cs.LG

    Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy

    Authors: Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal

    Abstract: When pretrained language models (LMs) are applied to discriminative tasks such as multiple-choice questions, they place probability mass on vocabulary tokens that aren't among the given answer choices. Spreading probability mass across multiple surface forms with identical meaning (such as "bath" and "bathtub") is thought to cause an underestimation of a model's true performance, referred to as th… ▽ More

    Submitted 31 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  8. arXiv:2210.17517  [pdf, other

    cs.CL cs.AI

    Lila: A Unified Benchmark for Mathematical Reasoning

    Authors: Swaroop Mishra, Matthew Finlayson, Pan Lu, Leonard Tang, Sean Welleck, Chitta Baral, Tanmay Rajpurohit, Oyvind Tafjord, Ashish Sabharwal, Peter Clark, Ashwin Kalyan

    Abstract: Mathematical reasoning skills are essential for general-purpose intelligent systems to perform tasks from grocery shopping to climate modeling. Towards evaluating and improving AI systems in this domain, we propose LILA, a unified mathematical reasoning benchmark consisting of 23 diverse tasks along four dimensions: (i) mathematical abilities e.g., arithmetic, calculus (ii) language format e.g., q… ▽ More

    Submitted 8 March, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

    MSC Class: 68T50 ACM Class: I.2.7

  9. arXiv:2210.02406  [pdf, other

    cs.CL

    Decomposed Prompting: A Modular Approach for Solving Complex Tasks

    Authors: Tushar Khot, Harsh Trivedi, Matthew Finlayson, Yao Fu, Kyle Richardson, Peter Clark, Ashish Sabharwal

    Abstract: Few-shot prompting is a surprisingly powerful way to use Large Language Models (LLMs) to solve various tasks. However, this approach struggles as the task complexity increases or when the individual reasoning steps of the task themselves are hard to learn, especially when embedded in more complex tasks. To address this, we propose Decomposed Prompting, a new approach to solve complex tasks by deco… ▽ More

    Submitted 11 April, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: ICLR'23 Camera Ready

  10. arXiv:2204.09148  [pdf, other

    cs.CL cs.AI

    What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

    Authors: Matthew Finlayson, Kyle Richardson, Ashish Sabharwal, Peter Clark

    Abstract: The instruction learning paradigm -- where a model learns to perform new tasks from task descriptions alone -- has become popular in general-purpose model research. The capabilities of large transformer models as instruction learners, however, remain poorly understood. We use a controlled synthetic environment to characterize such capabilities. Specifically, we use the task of deciding whether a g… ▽ More

    Submitted 24 May, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Typos corrected, rewordings

    MSC Class: 68T50 ACM Class: I.2.7

  11. arXiv:2204.06085  [pdf, other

    cs.AI cs.CL

    Finding Trolls Under Bridges: Preliminary Work on a Motif Detector

    Authors: W. Victor H. Yarlott, Armando Ochoa, Anurag Acharya, Laurel Bobrow, Diego Castro Estrada, Diana Gomez, Joan Zheng, David McDonald, Chris Miller, Mark A. Finlayson

    Abstract: Motifs are distinctive recurring elements found in folklore that have significance as communicative devices in news, literature, press releases, and propaganda. Motifs concisely imply a large constellation of culturally-relevant information, and their broad usage suggests their cognitive importance as touchstones of cultural knowledge, making their detection a worthy step toward culturally-aware n… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 13 pages, 2 figures, Presented at The Ninth Advances in Cognitive Systems (ACS) Conference 2021 (arXiv:2201.06134)

    Report number: ACS2021/23

  12. arXiv:2201.10618  [pdf, other

    cs.CL

    The ABBE Corpus: Animate Beings Being Emotional

    Authors: Samira Zad, Joshuan Jimenez, Mark A. Finlayson

    Abstract: Emotion detection is an established NLP task of demonstrated utility for text understanding. However, basic emotion detection leaves out key information, namely, who is experiencing the emotion in question. For example, it may be the author, the narrator, or a character; or the emotion may correspond to something the audience is supposed to feel, or even be unattributable to a specific being, e.g.… ▽ More

    Submitted 15 February, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: 9 pages, 1 figure

  13. arXiv:2106.06087  [pdf, other

    cs.CL

    Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models

    Authors: Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart Shieber, Tal Linzen, Yonatan Belinkov

    Abstract: Targeted syntactic evaluations have demonstrated the ability of language models to perform subject-verb agreement given difficult contexts. To elucidate the mechanisms by which the models accomplish this behavior, this study applies causal mediation analysis to pre-trained neural language models. We investigate the magnitude of models' preferences for grammatical inflections, as well as whether ne… ▽ More

    Submitted 22 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL-IJCNLP 2021

    MSC Class: 68T50 ACM Class: I.2.7

  14. arXiv:2009.05664  [pdf, other

    cs.AI cs.CL

    Towards an Atlas of Cultural Commonsense for Machine Reasoning

    Authors: Anurag Acharya, Kartik Talamadupula, Mark A Finlayson

    Abstract: Existing commonsense reasoning datasets for AI and NLP tasks fail to address an important aspect of human life: cultural differences. We introduce an approach that extends prior work on crowdsourcing commonsense knowledge by incorporating differences in knowledge that are attributable to cultural or national groups. We demonstrate the technique by collecting commonsense knowledge that surrounds si… ▽ More

    Submitted 18 December, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: 9 pages, 9 figures

    ACM Class: I.2.6; I.2.7

  15. arXiv:1602.05753  [pdf

    cs.CL cs.HC

    Overview of Annotation Creation: Processes & Tools

    Authors: Mark A. Finlayson, Tomaž Erjavec

    Abstract: Creating linguistic annotations requires more than just a reliable annotation scheme. Annotation can be a complex endeavour potentially involving many people, stages, and tools. This chapter outlines the process of creating end-to-end linguistic annotations, identifying specific tasks that researchers often perform. Because tool support is so central to achieving high quality, reusable annotations… ▽ More

    Submitted 18 February, 2016; originally announced February 2016.

    Comments: To appear in: James Pustejovsky and Nancy Ide (eds.) "Handbook of Linguistic Annotation." 2016. New York: Springer