Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Vafa, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17972  [pdf, other

    cs.LG cs.CL econ.EM

    LABOR-LLM: Language-Based Occupational Representations with Large Language Models

    Authors: Tianyu Du, Ayush Kanodia, Herman Brunborg, Keyon Vafa, Susan Athey

    Abstract: Many empirical studies of labor market questions rely on estimating relatively simple predictive models using small, carefully constructed longitudinal survey datasets based on hand-engineered features. Large Language Models (LLMs), trained on massive datasets, encode vast quantities of world knowledge and can be used for the next job prediction problem. However, while an off-the-shelf LLM produce… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.03689  [pdf, other

    cs.CL cs.AI

    Evaluating the World Model Implicit in a Generative Model

    Authors: Keyon Vafa, Justin Y. Chen, Jon Kleinberg, Sendhil Mullainathan, Ashesh Rambachan

    Abstract: Recent work suggests that large language models may implicitly learn world models. How should we assess this possibility? We formalize this question for the case where the underlying reality is governed by a deterministic finite automaton. This includes problems as diverse as simple logical reasoning, geographic navigation, game-playing, and chemistry. We propose new evaluation metrics for world m… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2406.01382  [pdf, other

    cs.CL cs.AI

    Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function

    Authors: Keyon Vafa, Ashesh Rambachan, Sendhil Mullainathan

    Abstract: What makes large language models (LLMs) impressive is also what makes them hard to evaluate: their diversity of uses. To evaluate these models, we must understand the purposes they will be used for. We consider a setting where these deployment decisions are made by people, and in particular, people's beliefs about where an LLM will perform well. We model such beliefs as the consequence of a human… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  4. arXiv:2312.14804  [pdf, other

    cs.CY

    Use large language models to promote equity

    Authors: Emma Pierson, Divya Shanmugam, Rajiv Movva, Jon Kleinberg, Monica Agrawal, Mark Dredze, Kadija Ferryman, Judy Wawira Gichoya, Dan Jurafsky, Pang Wei Koh, Karen Levy, Sendhil Mullainathan, Ziad Obermeyer, Harini Suresh, Keyon Vafa

    Abstract: Advances in large language models (LLMs) have driven an explosion of interest about their societal impacts. Much of the discourse around how they will impact social equity has been cautionary or negative, focusing on questions like "how might LLMs be biased and how would we mitigate those biases?" This is a vital discussion: the ways in which AI generally, and LLMs specifically, can entrench biase… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  5. arXiv:2312.02331  [pdf, ps, other

    cs.CL cs.LG

    Revisiting Topic-Guided Language Models

    Authors: Carolina Zheng, Keyon Vafa, David M. Blei

    Abstract: A recent line of work in natural language processing has aimed to combine language models and topic models. These topic-guided language models augment neural language models with topic models, unsupervised learning methods that can discover document-level patterns of word use. This paper compares the effectiveness of these methods in a standardized setting. We study four topic-guided language mode… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR) (12/2023)

  6. arXiv:2306.00198  [pdf, other

    cs.CL cs.LG

    An Invariant Learning Characterization of Controlled Text Generation

    Authors: Carolina Zheng, Claudia Shi, Keyon Vafa, Amir Feder, David M. Blei

    Abstract: Controlled generation refers to the problem of creating text that contains stylistic or semantic attributes of interest. Many approaches reduce this problem to training a predictor of the desired attribute. For example, researchers hoping to deploy a large language model to produce non-toxic content may use a toxicity classifier to filter generated text. In practice, the generated text to classify… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: To appear in the 2023 Conference of the Association for Computational Linguistics (ACL 2023)

  7. arXiv:2202.08370  [pdf, other

    cs.LG econ.EM

    CAREER: A Foundation Model for Labor Sequence Data

    Authors: Keyon Vafa, Emil Palikot, Tianyu Du, Ayush Kanodia, Susan Athey, David M. Blei

    Abstract: Labor economists regularly analyze employment data by fitting predictive models to small, carefully constructed longitudinal survey datasets. Although machine learning methods offer promise for such problems, these survey datasets are too small to take advantage of them. In recent years large datasets of online resumes have also become available, providing data about the career trajectories of mil… ▽ More

    Submitted 29 February, 2024; v1 submitted 16 February, 2022; originally announced February 2022.

  8. arXiv:2109.06387  [pdf, other

    cs.CL cs.LG

    Rationales for Sequential Predictions

    Authors: Keyon Vafa, Yuntian Deng, David M. Blei, Alexander M. Rush

    Abstract: Sequence models are a critical component of modern NLP systems, but their predictions are difficult to explain. We consider model explanations though rationales, subsets of context that can explain individual model predictions. We find sequential rationales by solving a combinatorial optimization: the best rationale is the smallest subset of input tokens that would predict the same output as the f… ▽ More

    Submitted 17 November, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Appeared in the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

  9. arXiv:2005.04232  [pdf, other

    cs.CL cs.LG stat.ML

    Text-Based Ideal Points

    Authors: Keyon Vafa, Suresh Naidu, David M. Blei

    Abstract: Ideal point models analyze lawmakers' votes to quantify their political positions, or ideal points. But votes are not the only way to express a political position. Lawmakers also give speeches, release press statements, and post tweets. In this paper, we introduce the text-based ideal point model (TBIP), an unsupervised probabilistic topic model that analyzes texts to quantify the political positi… ▽ More

    Submitted 21 July, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: Appeared in Proceedings of the 2020 Conference of the Association for Computational Linguistics (ACL 2020)

  10. arXiv:1905.10347  [pdf, other

    cs.LG stat.ML

    Discrete Flows: Invertible Generative Models of Discrete Data

    Authors: Dustin Tran, Keyon Vafa, Kumar Krishna Agrawal, Laurent Dinh, Ben Poole

    Abstract: While normalizing flows have led to significant advances in modeling high-dimensional continuous distributions, their applicability to discrete distributions remains unknown. In this paper, we show that flows can in fact be extended to discrete events---and under a simple change-of-variables formula not requiring log-determinant-Jacobian computations. Discrete flows have numerous applications. We… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.