Skip to main content

Showing 1–10 of 10 results for author: Fulda, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03621  [pdf, other

    cs.CL

    The Mysterious Case of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics of Meta's Llama 2 Model

    Authors: Brenden Smith, Dallin Baker, Clayton Chase, Myles Barney, Kaden Parker, Makenna Allred, Peter Hu, Alex Evans, Nancy Fulda

    Abstract: Large Language Models (LLMs) have an unrivaled and invaluable ability to "align" their output to a diverse range of human preferences, by mirroring them in the text they generate. The internal characteristics of such models, however, remain largely opaque. This work presents the Injectable Realignment Model (IRM) as a novel approach to language model interpretability and explainability. Inspired b… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 21 pages, 17 figures

  2. arXiv:2309.15197  [pdf, other

    cs.HC cs.CY cs.SI

    A Tale of Two Cultures: Comparing Interpersonal Information Disclosure Norms on Twitter

    Authors: Mainack Mondal, Anju Punuru, Tyng-Wen Scott Cheng, Kenneth Vargas, Chaz Gundry, Nathan S Driggs, Noah Schill, Nathaniel Carlson, Josh Bedwell, Jaden Q Lorenc, Isha Ghosh, Yao Li, Nancy Fulda, Xinru Page

    Abstract: We present an exploration of cultural norms surrounding online disclosure of information about one's interpersonal relationships (such as information about family members, colleagues, friends, or lovers) on Twitter. The literature identifies the cultural dimension of individualism versus collectivism as being a major determinant of offline communication differences in terms of emotion, topic, and… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: This work will be presented at the 26th ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2023). This paper will also be published in The Proceedings of the ACM on Human Computer Interaction

  3. arXiv:2306.02177  [pdf, other

    cs.AI

    Towards Coding Social Science Datasets with Language Models

    Authors: Christopher Michael Rytting, Taylor Sorensen, Lisa Argyle, Ethan Busby, Nancy Fulda, Joshua Gubler, David Wingate

    Abstract: Researchers often rely on humans to code (label, annotate, etc.) large sets of texts. This kind of human coding forms an important part of social science research, yet the coding process is both resource intensive and highly variable from application to application. In some cases, efforts to automate this process have achieved human-level accuracies, but to achieve this, these attempts frequently… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  4. Out of One, Many: Using Language Models to Simulate Human Samples

    Authors: Lisa P. Argyle, Ethan C. Busby, Nancy Fulda, Joshua Gubler, Christopher Rytting, David Wingate

    Abstract: We propose and explore the possibility that language models can be studied as effective proxies for specific human sub-populations in social science research. Practical and research applications of artificial intelligence tools have sometimes been limited by problematic biases (such as racism or sexism), which are often treated as uniform properties of the models. We show that the "algorithmic bia… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  5. arXiv:2209.06295  [pdf, other

    cs.CL

    Data-adaptive Transfer Learning for Translation: A Case Study in Haitian and Jamaican

    Authors: Nathaniel R. Robinson, Cameron J. Hogan, Nancy Fulda, David R. Mortensen

    Abstract: Multilingual transfer techniques often improve low-resource machine translation (MT). Many of these techniques are applied without considering data characteristics. We show in the context of Haitian-to-English translation that transfer effectiveness is correlated with amount of training data and relationships between knowledge-sharing languages. Our experiments suggest that for some languages beyo… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

  6. An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels

    Authors: Taylor Sorensen, Joshua Robinson, Christopher Michael Rytting, Alexander Glenn Shaw, Kyle Jeffrey Rogers, Alexia Pauline Delorey, Mahmoud Khalil, Nancy Fulda, David Wingate

    Abstract: Pre-trained language models derive substantial linguistic and factual knowledge from the massive corpora on which they are trained, and prompt engineering seeks to align these models to specific tasks. Unfortunately, existing prompt engineering methods require significant amounts of labeled data, access to model parameters, or both. We introduce a new method for selecting prompt templates \textit{… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  7. arXiv:2012.05983  [pdf, other

    cs.CL cs.AI

    Towards Neural Programming Interfaces

    Authors: Zachary C. Brown, Nathaniel Robinson, David Wingate, Nancy Fulda

    Abstract: It is notoriously difficult to control the behavior of artificial neural networks such as generative neural language models. We recast the problem of controlling natural language generation as that of learning to interface with a pretrained language model, just as Application Programming Interfaces (APIs) control the behavior of programs by altering hyperparameters. In this new paradigm, a special… ▽ More

    Submitted 17 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 24 pages total (13 for main paper and references, 11 for Appendix 1), accepted for publication in Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

    Journal ref: Neural Information Processing Systems 33 (2020) 17416-17428

  8. arXiv:2007.06763  [pdf

    cs.CR cs.LG

    Machine Learning for Offensive Security: Sandbox Classification Using Decision Trees and Artificial Neural Networks

    Authors: Will Pearce, Nick Landers, Nancy Fulda

    Abstract: The merits of machine learning in information security have primarily focused on bolstering defenses. However, machine learning (ML) techniques are not reserved for organizations with deep pockets and massive data repositories; the democratization of ML has lead to a rise in the number of security teams using ML to support offensive operations. The research presented here will explore two models t… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: SAI Conference on Computing

  9. arXiv:1808.04891  [pdf, other

    cs.CL

    Embedding Grammars

    Authors: David Wingate, William Myers, Nancy Fulda, Tyler Etchart

    Abstract: Classic grammars and regular expressions can be used for a variety of purposes, including parsing, intent detection, and matching. However, the comparisons are performed at a structural level, with constituent elements (words or characters) matched exactly. Recent advances in word embeddings show that semantically related words share common features in a vector-space representation, suggesting the… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

  10. arXiv:1703.03429  [pdf, other

    cs.AI cs.CL

    What can you do with a rock? Affordance extraction via word embeddings

    Authors: Nancy Fulda, Daniel Ricks, Ben Murdoch, David Wingate

    Abstract: Autonomous agents must often detect affordances: the set of behaviors enabled by a situation. Affordance detection is particularly helpful in domains with large action spaces, allowing the agent to prune its search space by avoiding futile behaviors. This paper presents a method for affordance extraction via word embeddings trained on a Wikipedia corpus. The resulting word vectors are treated as a… ▽ More

    Submitted 9 March, 2017; originally announced March 2017.

    Comments: 7 pages, 7 figures, 2 algorithms, data runs were performed using the Autoplay learning environment for interactive fiction

    Journal ref: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI), Pages 1039-1045, 2017