Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Nam, A J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.16183  [pdf, other

    cs.LG cs.AI cs.CL

    Passive learning of active causal strategies in agents and language models

    Authors: Andrew Kyle Lampinen, Stephanie C Y Chan, Ishita Dasgupta, Andrew J Nam, Jane X Wang

    Abstract: What can be learned about causality and experimentation from passive data? This question is salient given recent successes of passively-trained language models in interactive domains such as tool use. Passive learning is inherently limited. However, we show that purely passive learning can in fact allow an agent to learn generalizable strategies for determining and using causal structures, as long… ▽ More

    Submitted 2 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2023). 10 pages main text

  2. arXiv:2210.03275  [pdf, other

    cs.LG

    Achieving and Understanding Out-of-Distribution Generalization in Systematic Reasoning in Small-Scale Transformers

    Authors: Andrew J. Nam, Mustafa Abdool, Trevor Maxfield, James L. McClelland

    Abstract: Out-of-distribution generalization (OODG) is a longstanding challenge for neural networks. This challenge is quite apparent in tasks with well-defined variables and rules, where explicit use of the rules could solve problems independently of the particular values of the variables, but networks tend to be tied to the range of values sampled in their training data. Large transformer-based language m… ▽ More

    Submitted 13 December, 2022; v1 submitted 6 October, 2022; originally announced October 2022.

  3. arXiv:2210.02615  [pdf, other

    cs.LG cs.AI cs.CL

    Learning to Reason With Relational Abstractions

    Authors: Andrew J. Nam, Mengye Ren, Chelsea Finn, James L. McClelland

    Abstract: Large language models have recently shown promising progress in mathematical reasoning when fine-tuned with human-generated sequences walking through a sequence of solution steps. However, the solution sequences are not formally structured and the resulting model-generated sequences may not reflect the kind of systematic reasoning we might expect an expert human to produce. In this paper, we study… ▽ More

    Submitted 5 December, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

  4. arXiv:2107.06994  [pdf, other

    cs.LG cs.AI cs.SC

    Systematic human learning and generalization from a brief tutorial with explanatory feedback

    Authors: Andrew J. Nam, James L. McClelland

    Abstract: Neural networks have long been used to model human intelligence, capturing elements of behavior and cognition, and their neural basis. Recent advancements in deep learning have enabled neural network models to reach and even surpass human levels of intelligence in many respects, yet unlike humans, their ability to learn new tasks quickly remains a challenge. People can reason not only in familiar… ▽ More

    Submitted 28 March, 2023; v1 submitted 9 July, 2021; originally announced July 2021.

    Comments: 27 pages, 108 references, 8 Figures, and one Table, plus Supplementary Materials

  5. arXiv:1811.07974  [pdf, other

    cs.CY

    A Map of Knowledge

    Authors: Zachary A. Pardos, Andrew Joo Hun Nam

    Abstract: Knowledge representation has gained in relevance as data from the ubiquitous digitization of behaviors amass and academia and industry seek methods to understand and reason about the information they encode. Success in this pursuit has emerged with data from natural language, where skip-grams and other linear connectionist models of distributed representation have surfaced scrutable relational str… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.