Showing 1–17 of 17 results for author: O'Donnell, T J

Searching in archive cs.
  1. arXiv:2406.11977  [pdf, other]

    cs.CL

    Reframing linguistic bootstrapping as joint inference using visually-grounded grammar induction models

    Authors: Eva Portelance, Siva Reddy, Timothy J. O'Donnell

    Abstract: Semantic and syntactic bootstrapping posit that children use their prior knowledge of one linguistic domain, say syntactic relations, to help later acquire another, such as the meanings of new words. Empirical results supporting both theories may tempt us to believe that these are different learning strategies, where one may precede the other. Here, we argue that they are instead both contingent o…

    Submitted 17 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; I.2.10; I.2.6; F.4.2

  2. arXiv:2406.05186  [pdf, other]

    cs.CL

    Correlation Does Not Imply Compensation: Complexity and Irregularity in the Lexicon

    Authors: Amanda Doucette, Ryan Cotterell, Morgan Sonderegger, Timothy J. O'Donnell

    Abstract: It has been claimed that within a language, morphologically irregular words are more likely to be phonotactically simple and morphologically regular words are more likely to be phonotactically complex. This inverse correlation has been demonstrated in English for a small sample of words, but has yet to be shown for a larger sample of languages. Furthermore, frequency and word length are known to i…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: To appear in Proceedings of the Society for Computation in Linguistics 2024

  3. arXiv:2403.01187  [pdf, ps, other]

    cs.CL

    A Compositional Typed Semantics for Universal Dependencies

    Authors: Laurestine Bradford, Timothy John O'Donnell, Siva Reddy

    Abstract: Languages may encode similar meanings using different sentence structures. This makes it a challenge to provide a single set of formal rules that can derive meanings from sentences in many languages at once. To overcome the challenge, we can take advantage of language-general connections between meaning and syntax, and build on cross-linguistically parallel syntactic structures. We introduce UD Ty…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 10 pages, 6 figures, 1 table. For related code, see https://github.com/McGill-NLP/ud-to-meaning

  4. arXiv:2302.06784  [pdf, other]

    cs.CL

    The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

    Authors: Kushal Arora, Timothy J. O'Donnell, Doina Precup, Jason Weston, Jackie C. K. Cheung

    Abstract: State-of-the-art language generation models can degenerate when applied to open-ended generation problems such as text completion, story generation, or dialog modeling. This degeneration usually shows up in the form of incoherence, lack of vocabulary diversity, and self-repetition or copying from the context. In this paper, we postulate that "human-like" generations usually lie in a narrow and n…

    Submitted 13 February, 2023; originally announced February 2023.

  5. arXiv:2203.12788  [pdf, other]

    cs.CL

    Evaluating Distributional Distortion in Neural Language Modeling

    Authors: Benjamin LeBrun, Alessandro Sordoni, Timothy J. O'Donnell

    Abstract: A fundamental characteristic of natural language is the high rate at which speakers produce novel expressions. Because of this novelty, a heavy-tail of rare events accounts for a significant amount of the total probability mass of distributions in language (Baayen, 2001). Standard language modeling metrics such as perplexity quantify the performance of language models (LM) in aggregate. As a resul…

    Submitted 23 March, 2022; originally announced March 2022.

    Journal ref: International Conference on Learning Representations. 2022

  6. arXiv:2112.00578  [pdf, other]

    cs.CL cs.LG

    Systematic Generalization with Edge Transformers

    Authors: Leon Bergen, Timothy J. O'Donnell, Dzmitry Bahdanau

    Abstract: Recent research suggests that systematic generalization in natural language understanding remains a challenge for state-of-the-art neural models such as Transformers and Graph Neural Networks. To tackle this challenge, we propose Edge Transformer, a new model that combines inspiration from Transformers and rule-based symbolic AI. The first key idea in Edge Transformers is to associate vector state…

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: Accepted as a conference paper at NeurIPS 2021

  7. arXiv:2110.06843  [pdf, other]

    cs.CL

    Compositional Generalization in Dependency Parsing

    Authors: Emily Goodwin, Siva Reddy, Timothy J. O'Donnell, Dzmitry Bahdanau

    Abstract: Compositionality -- the ability to combine familiar units like words into novel phrases and sentences -- has been the focus of intense interest in artificial intelligence in recent years. To test compositional generalization in semantic parsing, Keysers et al. (2020) introduced Compositional Freebase Queries (CFQ). This dataset maximizes the similarity between the test and train distributions over…

    Submitted 15 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: 12 pages, 7 figures

  8. Linguistic Dependencies and Statistical Dependence

    Authors: Jacob Louis Hoover, Alessandro Sordoni, Wenyu Du, Timothy J. O'Donnell

    Abstract: Are pairs of words that tend to occur together also likely to stand in a linguistic dependency? This empirical question is motivated by a long history of literature in cognitive science, psycholinguistics, and NLP. In this work we contribute an extensive analysis of the relationship between linguistic dependencies and statistical dependence between words. Improving on previous work, we introduce t…

    Submitted 29 April, 2022; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: EMNLP2021 camera-ready version. 9 pages, plus references and appendices

    Report number: 2021.emnlp-main.234

    Journal ref: Proceedings EMNLP (2021), 2941--2963

  9. arXiv:2104.08664  [pdf, other]

    cs.CL

    Characterizing Idioms: Conventionality and Contingency

    Authors: Michaela Socolof, Jackie Chi Kit Cheung, Michael Wagner, Timothy J. O'Donnell

    Abstract: Idioms are unlike most phrases in two important ways. First, the words in an idiom have non-canonical meanings. Second, the non-canonical meanings of words in an idiom are contingent on the presence of other words in the idiom. Linguistic theories differ on whether these properties depend on one another, as well as whether special theoretical machinery is needed to accommodate idioms. We define tw…

    Submitted 14 September, 2022; v1 submitted 17 April, 2021; originally announced April 2021.

  10. arXiv:2104.06645  [pdf, other]

    cs.CL

    Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention

    Authors: Leon Bergen, Dzmitry Bahdanau, Timothy J. O'Donnell

    Abstract: We present a model that jointly learns the denotations of words together with their groundings using a truth-conditional semantics. Our model builds on the neurosymbolic approach of Mao et al. (2019), learning to ground objects in the CLEVR dataset (Johnson et al., 2017) using a novel parallel attention mechanism. The model achieves state of the art performance on visual question answering, learni…

    Submitted 14 April, 2021; originally announced April 2021.

  11. arXiv:2010.04704  [pdf, other]

    cs.CL cs.LG

    Recursive Top-Down Production for Sentence Generation with Latent Trees

    Authors: Shawn Tan, Yikang Shen, Timothy J. O'Donnell, Alessandro Sordoni, Aaron Courville

    Abstract: We model the recursive production property of context-free grammars for natural and synthetic languages. To this end, we present a dynamic programming algorithm that marginalises over latent binary tree structures with $N$ leaves, allowing us to compute the likelihood of a sequence of $N$ tokens under a latent tree model, which we maximise to train a recursive neural function. We demonstrate perfo…

    Submitted 9 October, 2020; originally announced October 2020.

  12. arXiv:2005.05864  [pdf]

    cs.CL

    Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach

    Authors: Wenyu Du, Zhouhan Lin, Yikang Shen, Timothy J. O'Donnell, Yoshua Bengio, Yue Zhang

    Abstract: It is commonly believed that knowledge of syntactic structure should improve language modeling. However, effectively and computationally efficiently incorporating syntactic structure into neural language models has been a challenging topic. In this paper, we make use of a multi-task objective, i.e., the models simultaneously predict words as well as ground truth parse trees in a form called "synta…

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: ACL20

  13. arXiv:2005.04315  [pdf, other]

    cs.CL

    Probing Linguistic Systematicity

    Authors: Emily Goodwin, Koustuv Sinha, Timothy J. O'Donnell

    Abstract: Recently, there has been much interest in the question of whether deep natural language understanding models exhibit systematicity; generalizing such that units like words make consistent contributions to the meaning of the sentences in which they appear. There is accumulating evidence that neural models often generalize non-systematically. We examined the notion of systematicity from a linguistic…

    Submitted 25 August, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: To appear at ACL2020, 9 pages, 2 figures

  14. arXiv:1912.05783  [pdf, other]

    cs.AI cs.LG

    CLOSURE: Assessing Systematic Generalization of CLEVR Models

    Authors: Dzmitry Bahdanau, Harm de Vries, Timothy J. O'Donnell, Shikhar Murty, Philippe Beaudoin, Yoshua Bengio, Aaron Courville

    Abstract: The CLEVR dataset of natural-looking questions about 3D-rendered scenes has recently received much attention from the research community. A number of models have been proposed for this task, many of which achieved very high accuracies of around 97-99%. In this work, we study how systematic the generalization of such models is, that is to which extent they are capable of handling novel combinations…

    Submitted 17 October, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: Technical report

  15. arXiv:1906.11483  [pdf, other]

    cs.CL

    Morphological Irregularity Correlates with Frequency

    Authors: Shijie Wu, Ryan Cotterell, Timothy J. O'Donnell

    Abstract: We present a study of morphological irregularity. Following recent work, we define an information-theoretic measure of irregularity based on the predictability of forms in a language. Using a neural transduction model, we estimate this quantity for the forms in 28 languages. We first present several validatory and exploratory analyses of irregularity. We then show that our analyses provide evidenc…

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  16. arXiv:1710.11350  [pdf, other]

    cs.CL

    Grammar Induction for Minimalist Grammars using Variational Bayesian Inference: A Technical Report

    Authors: Eva Portelance, Amelia Bruno, Daniel Harasim, Leon Bergen, Timothy J. O'Donnell

    Abstract: The following technical report presents a formal approach to probabilistic minimalist grammar parameter estimation. We describe a formalization of a minimalist grammar. We then present an algorithm for the application of variational Bayesian inference to this formalization.

    Submitted 28 August, 2019; v1 submitted 31 October, 2017; originally announced October 2017.

  17. arXiv:1710.11301  [pdf, other]

    cs.CL cs.FL

    A generalized parsing framework for Abstract Grammars

    Authors: Daniel Harasim, Chris Bruno, Eva Portelance, Martin Rohrmeier, Timothy J. O'Donnell

    Abstract: This technical report presents a general framework for parsing a variety of grammar formalisms. We develop a grammar formalism, called an Abstract Grammar, which is general enough to represent grammars at many levels of the hierarchy, including Context Free Grammars, Minimalist Grammars, and Generalized Context-free Grammars. We then develop a single parsing framework which is capable of parsing g…

    Submitted 19 January, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: Technical Report [v2: added Martin Rohrmeier as author.] [v3: fixed error stating that AGs are equivalent to GCFGs; this is in fact not yet known. Other minor typos fixed.]