Showing 1–38 of 38 results for author: Tarlow, D

Searching in archive cs.

  1. AI-Assisted Assessment of Coding Practices in Modern Code Review

    Authors: Manushree Vijayvergiya, Małgorzata Salawa, Ivan Budiselić, Dan Zheng, Pascal Lamblin, Marko Ivanković, Juanjo Carin, Mateusz Lewko, Jovan Andonov, Goran Petrović, Daniel Tarlow, Petros Maniatis, René Just

    Abstract: Modern code review is a process in which an incremental code contribution made by a code author is reviewed by one or more peers before it is committed to the version control system. An important element of modern code review is verifying that code contributions adhere to best practices. While some of these best practices can be automatically verified, verifying others is commonly left to human re…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: To appear at the ACM International Conference on AI-Powered Software (AIware '24)

  2. arXiv:2402.08733  [pdf, other]

    cs.LG

    Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs

    Authors: Daniel D. Johnson, Daniel Tarlow, David Duvenaud, Chris J. Maddison

    Abstract: Identifying how much a model $\widehat{p}_\theta(Y|X)$ knows about the stochastic real-world process $p(Y|X)$ it was trained on is important to ensure it avoids producing incorrect or "hallucinated" answers or taking unsafe actions. But this is difficult for generative models because probabilistic predictions do not distinguish between per-response noise (aleatoric uncertainty) and lack of knowledge…

    Submitted 27 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024. 9 pages, 6 figures

  3. arXiv:2303.00732  [pdf, other]

    cs.LG cs.SE

    R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents

    Authors: Daniel D. Johnson, Daniel Tarlow, Christian Walder

    Abstract: Large language models show impressive results at predicting structured text such as code, but also commonly introduce errors and hallucinations in their output. When used to assist software developers, these models may make mistakes that users must go back and fix, or worse, introduce subtle bugs that users may miss entirely. We propose Randomized Utility-driven Synthesis of Uncertain REgions (R-U…

    Submitted 28 April, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: To appear at ICML 2023. 9 pages, 6 figures

  4. arXiv:2208.07461  [pdf, other]

    cs.LG cs.PL cs.SE

    A Library for Representing Python Programs as Graphs for Machine Learning

    Authors: David Bieber, Kensen Shi, Petros Maniatis, Charles Sutton, Vincent Hellendoorn, Daniel Johnson, Daniel Tarlow

    Abstract: Graph representations of programs are commonly a central element of machine learning for code research. We introduce an open source Python library python_graphs that applies static analysis to construct graph representations of Python programs suitable for training machine learning models. Our library admits the construction of control-flow graphs, data-flow graphs, and composite ``program graphs'…

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 21 pages, 14 figures

  5. arXiv:2208.05297  [pdf, other]

    cs.SE cs.LG

    Learning to Improve Code Efficiency

    Authors: Binghong Chen, Daniel Tarlow, Kevin Swersky, Martin Maas, Pablo Heiber, Ashish Naik, Milad Hashemi, Parthasarathy Ranganathan

    Abstract: Improvements in the performance of computing systems, driven by Moore's Law, have transformed society. As such hardware-driven gains slow down, it becomes even more important for software developers to focus on performance and efficiency during development. While several studies have demonstrated the potential from such improved code efficiency (e.g., 2x better generational improvements compared t…

    Submitted 8 August, 2022; originally announced August 2022.

  6. arXiv:2206.12839  [pdf, other]

    cs.LG cs.AI cs.PL cs.SE

    Repository-Level Prompt Generation for Large Language Models of Code

    Authors: Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

    Abstract: With the success of large language models (LLMs) of code and their use as code assistants (e.g. Codex used in GitHub Copilot), techniques for introducing domain-specific knowledge in the prompt design process become important. In this work, we propose a framework called Repo-Level Prompt Generator that learns to generate example-specific prompts using prompt proposals. The prompt proposals take co…

    Submitted 5 June, 2023; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: ICML 2023 (Camera-Ready version)

    Journal ref: ICML, 2023

  7. arXiv:2203.03771  [pdf, other]

    cs.LG cs.PL

    Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions

    Authors: David Bieber, Rishab Goel, Daniel Zheng, Hugo Larochelle, Daniel Tarlow

    Abstract: The execution behavior of a program often depends on external resources, such as program inputs or file contents, and so cannot be run in isolation. Nevertheless, software developers benefit from fast iteration loops where automated tools identify errors as early as possible, even before programs can be compiled and run. This presents an interesting machine learning challenge: can we predict runti…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 20 pages, 7 figures

  8. arXiv:2111.06888  [pdf, other]

    cs.LG stat.CO stat.ML

    Learning Generalized Gumbel-max Causal Mechanisms

    Authors: Guy Lorberbom, Daniel D. Johnson, Chris J. Maddison, Daniel Tarlow, Tamir Hazan

    Abstract: To perform counterfactual reasoning in Structural Causal Models (SCMs), one needs to know the causal mechanisms, which provide factorizations of conditional distributions into noise sources and deterministic functions mapping realizations of noise to samples. Unfortunately, the causal mechanism is not uniquely identified by data that can be gathered by observing and interacting with the world, so…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: Accepted to NeurIPS 2021 (Spotlight)

  9. arXiv:2107.07675  [pdf, other]

    cs.LG

    Beyond In-Place Corruption: Insertion and Deletion In Denoising Probabilistic Models

    Authors: Daniel D. Johnson, Jacob Austin, Rianne van den Berg, Daniel Tarlow

    Abstract: Denoising diffusion probabilistic models (DDPMs) have shown impressive results on sequence generation by iteratively corrupting each example and then learning to map corrupted versions back to the original. However, previous work has largely focused on in-place corruption, adding noise to each pixel or token individually while keeping their locations the same. In this work, we consider a broader c…

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted at the ICML 2021 Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models (poster)

  10. arXiv:2107.03006  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Structured Denoising Diffusion Models in Discrete State-Spaces

    Authors: Jacob Austin, Daniel D. Johnson, Jonathan Ho, Daniel Tarlow, Rianne van den Berg

    Abstract: Denoising diffusion probabilistic models (DDPMs) (Ho et al. 2020) have shown impressive results on image and waveform generation in continuous state spaces. Here, we introduce Discrete Denoising Diffusion Probabilistic Models (D3PMs), diffusion-like generative models for discrete data that generalize the multinomial diffusion model of Hoogeboom et al. 2021, by going beyond corruption processes wit…

    Submitted 22 February, 2023; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: 10 pages plus references and appendices. First two authors contributed equally

  11. arXiv:2106.07175  [pdf, other]

    cs.LG cs.AI cs.PL cs.SE

    Learning to Combine Per-Example Solutions for Neural Program Synthesis

    Authors: Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

    Abstract: The goal of program synthesis from examples is to find a computer program that is consistent with a given set of input-output examples. Most learning-based approaches try to find a program that satisfies all examples at once. Our work, by contrast, considers an approach that breaks the problem into two stages: (a) find programs that satisfy only one example, and (b) leverage these per-example solu…

    Submitted 1 November, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 (camera-ready version)

  12. arXiv:2105.14038  [pdf, other]

    cs.LG cs.SE

    Learning to Extend Program Graphs to Work-in-Progress Code

    Authors: Xuechen Li, Chris J. Maddison, Daniel Tarlow

    Abstract: Source code spends most of its time in a broken or incomplete state during software development. This presents a challenge to machine learning for code, since high-performing models typically rely on graph structured representations of programs derived from traditional program analyses. Such analyses may be undefined for broken or incomplete code. We extend the notion of program graphs to work-in-…

    Submitted 28 May, 2021; originally announced May 2021.

  13. arXiv:2010.12621  [pdf, other]

    cs.LG

    Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks

    Authors: David Bieber, Charles Sutton, Hugo Larochelle, Daniel Tarlow

    Abstract: Graph neural networks (GNNs) have emerged as a powerful tool for learning software engineering tasks including code completion, bug finding, and program repair. They benefit from leveraging program structure like control flow graphs, but they are not well-suited to tasks like program execution that require far more sequential reasoning steps than number of GNN propagation steps. Recurrent neural n…

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted at NeurIPS 2020

  14. arXiv:2007.04929  [pdf, other]

    cs.LG stat.ML

    Learning Graph Structure With A Finite-State Automaton Layer

    Authors: Daniel D. Johnson, Hugo Larochelle, Daniel Tarlow

    Abstract: Graph-based neural network models are producing strong results in a number of domains, in part because graphs provide flexibility to encode domain knowledge in the form of relational structure (edges) between nodes in the graph. In practice, edges are used both to represent intrinsic structure (e.g., abstract syntax trees of programs) and more abstract relations that aid reasoning for a downstream…

    Submitted 6 November, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted at NeurIPS 2020 (spotlight)

  15. arXiv:2007.01231  [pdf, other]

    cs.LG cs.SE stat.ML

    Software Engineering Event Modeling using Relative Time in Temporal Knowledge Graphs

    Authors: Kian Ahrabian, Daniel Tarlow, Hehuimin Cheng, Jin L. C. Guo

    Abstract: We present a multi-relational temporal Knowledge Graph based on the daily interactions between artifacts in GitHub, one of the largest social coding platforms. Such representation enables posing many user-activity and project management questions as link prediction and time queries over the knowledge graph. In particular, we introduce two new datasets for i) interpolated time-conditioned link pred…

    Submitted 12 July, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: 11 pages, 1 figure. 37th International Conference on Machine Learning (ICML 2020) - Workshop on Graph Representation Learning and Beyond

  16. arXiv:2006.08063  [pdf, other]

    stat.ML cs.LG

    Gradient Estimation with Stochastic Softmax Tricks

    Authors: Max B. Paulus, Dami Choi, Daniel Tarlow, Andreas Krause, Chris J. Maddison

    Abstract: The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding. Working within the perturbation model framework, we introduce stochastic softmax tricks, which generalize the Gumbel-Softmax trick to combinatorial spaces. Our framewor…

    Submitted 28 February, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020, final copy

  17. arXiv:2003.11768   

    cs.LG cs.AI cs.SE stat.ML

    On-the-Fly Adaptation of Source Code Models using Meta-Learning

    Authors: Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

    Abstract: The ability to adapt to unseen, local contexts is an important challenge that successful models of source code must overcome. One of the most popular approaches for the adaptation of such models is dynamic evaluation. With dynamic evaluation, when running a model on an unseen file, the model is updated immediately after having observed each token in that file. In this work, we propose instead to f…

    Submitted 19 September, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: This paper has been withdrawn because we found a bug in the FOMAML implementation that invalidates some of the key claims in the paper

  18. arXiv:1911.01205  [pdf, other]

    cs.LG cs.AI cs.SE stat.ML

    Learning to Fix Build Errors with Graph2Diff Neural Networks

    Authors: Daniel Tarlow, Subhodeep Moitra, Andrew Rice, Zimin Chen, Pierre-Antoine Manzagol, Charles Sutton, Edward Aftandilian

    Abstract: Professional software developers spend a significant amount of time fixing builds, but this has received little attention as a problem in automatic program repair. We present a new deep learning architecture, called Graph2Diff, for automatically localizing and fixing build errors. We represent source code, build configuration files, and compiler diagnostic messages as a graph, and then use a Graph…

    Submitted 4 November, 2019; originally announced November 2019.

    Comments: Submitted for review on Aug 23, 2019

  19. arXiv:1906.11786  [pdf, other]

    stat.ML cs.LG

    Fast Training of Sparse Graph Neural Networks on Dense Hardware

    Authors: Matej Balog, Bart van Merriënboer, Subhodeep Moitra, Yujia Li, Daniel Tarlow

    Abstract: Graph neural networks have become increasingly popular in recent years due to their ability to naturally encode relational input data and their ability to scale to large graphs by operating on a sparse representation of graph adjacency matrices. As we look to scale up these models using custom hardware, a natural assumption would be that we need hardware tailored to sparse operations and/or dynami…

    Submitted 27 June, 2019; originally announced June 2019.

  20. arXiv:1906.07181  [pdf, other]

    cs.LG cs.AI cs.PL stat.ML

    Learning Execution through Neural Code Fusion

    Authors: Zhan Shi, Kevin Swersky, Daniel Tarlow, Parthasarathy Ranganathan, Milad Hashemi

    Abstract: As the performance of computer systems stagnates due to the end of Moore's Law, there is a need for new models that can understand and optimize the execution of general purpose code. While there is a growing body of work on using Graph Neural Networks (GNNs) to learn representations of source code, these representations do not understand how code dynamically executes. In this work, we propose a ne…

    Submitted 10 March, 2020; v1 submitted 17 June, 2019; originally announced June 2019.

    Comments: 14 pages, 7 figures

  21. arXiv:1906.06062  [pdf, other]

    cs.LG cs.AI stat.ML

    Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces

    Authors: Guy Lorberbom, Chris J. Maddison, Nicolas Heess, Tamir Hazan, Daniel Tarlow

    Abstract: Direct optimization is an appealing framework that replaces integration with optimization of a random objective for approximating gradients in models with discrete random variables. A$^\star$ sampling is a framework for optimizing such random objectives over large spaces. We show how to combine these techniques to yield a reinforcement learning algorithm that approximates a policy gradient by find…

    Submitted 23 October, 2020; v1 submitted 14 June, 2019; originally announced June 2019.

  22. arXiv:1904.02818  [pdf, other]

    cs.LG cs.CL cs.SE stat.ML

    Neural Networks for Modeling Source Code Edits

    Authors: Rui Zhao, David Bieber, Kevin Swersky, Daniel Tarlow

    Abstract: Programming languages are emerging as a challenging and interesting domain for machine learning. A core task, which has received significant attention in recent years, is building generative models of source code. However, to our knowledge, previous generative models have always been framed in terms of generating static snapshots of code. In this work, we instead treat source code as a dynamic obj…

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: Deanonymized version of ICLR 2019 submission

  23. arXiv:1803.06272  [pdf, other]

    cs.LG stat.ML

    Graph Partition Neural Networks for Semi-Supervised Classification

    Authors: Renjie Liao, Marc Brockschmidt, Daniel Tarlow, Alexander L. Gaunt, Raquel Urtasun, Richard Zemel

    Abstract: We present graph partition neural networks (GPNN), an extension of graph neural networks (GNNs) able to handle extremely large graphs. GPNNs alternate between locally propagating information between nodes in small subgraphs and globally propagating information between the subgraphs. To efficiently partition graphs, we experiment with several partitioning algorithms and also propose a novel variant…

    Submitted 16 March, 2018; originally announced March 2018.

  24. arXiv:1705.09786  [pdf, other]

    cs.LG cs.AI cs.DC stat.ML

    AMPNet: Asynchronous Model-Parallel Training for Dynamic Neural Networks

    Authors: Alexander L. Gaunt, Matthew A. Johnson, Maik Riechert, Daniel Tarlow, Ryota Tomioka, Dimitrios Vytiniotis, Sam Webster

    Abstract: New types of machine learning hardware in development and entering the market hold the promise of revolutionizing deep learning in a manner as profound as GPUs. However, existing software frameworks and training algorithms for deep learning have yet to evolve to fully leverage the capability of the new wave of silicon. We already see the limitations of existing algorithms for models that exploit s…

    Submitted 22 June, 2017; v1 submitted 27 May, 2017; originally announced May 2017.

    Comments: 17 pages, 13 figures

  25. arXiv:1702.03334  [pdf, other]

    stat.ML cs.LG

    Batch Policy Gradient Methods for Improving Neural Conversation Models

    Authors: Kirthevasan Kandasamy, Yoram Bachrach, Ryota Tomioka, Daniel Tarlow, David Carter

    Abstract: We study reinforcement learning of chatbots with recurrent neural network architectures when the rewards are noisy and expensive to obtain. For instance, a chatbot used in automated customer service support can be scored by quality assurance agents, but this process can be expensive, time consuming and noisy. Previous reinforcement learning work for natural language processing uses on-policy updat…

    Submitted 10 February, 2017; originally announced February 2017.

    Comments: International Conference on Learning Representations (ICLR) 2017

  26. arXiv:1612.00817  [pdf, other]

    cs.LG cs.AI cs.NE

    Summary - TerpreT: A Probabilistic Programming Language for Program Induction

    Authors: Alexander L. Gaunt, Marc Brockschmidt, Rishabh Singh, Nate Kushman, Pushmeet Kohli, Jonathan Taylor, Daniel Tarlow

    Abstract: We study machine learning formulations of inductive program synthesis; that is, given input-output examples, synthesize source code that maps inputs to corresponding outputs. Our key contribution is TerpreT, a domain-specific language for expressing program synthesis problems. A TerpreT model is composed of a specification of a program representation and an interpreter that describes how programs…

    Submitted 2 December, 2016; originally announced December 2016.

    Comments: 7 pages, 2 figures, 4 tables. In the 1st Workshop on Neural Abstract Machines & Program Induction (NAMPI) at NIPS 2016

  27. arXiv:1611.02109  [pdf, other]

    cs.LG

    Differentiable Programs with Neural Libraries

    Authors: Alexander L. Gaunt, Marc Brockschmidt, Nate Kushman, Daniel Tarlow

    Abstract: We develop a framework for combining differentiable programming languages with neural networks. Using this framework we create end-to-end trainable systems that learn to write interpretable algorithms with perceptual components. We explore the benefits of inductive biases for strong generalization and modularity that come from the program-like structure of our models. In particular, modularity all…

    Submitted 2 March, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

  28. arXiv:1611.01989  [pdf, other]

    cs.LG

    DeepCoder: Learning to Write Programs

    Authors: Matej Balog, Alexander L. Gaunt, Marc Brockschmidt, Sebastian Nowozin, Daniel Tarlow

    Abstract: We develop a first line of attack for solving programming competition-style problems from input-output examples using deep learning. The approach is to train a neural network to predict properties of the program that generated the outputs from the inputs. We use the neural network's predictions to augment search techniques from the programming languages community, including enumerative search and…

    Submitted 8 March, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: Submitted to ICLR 2017

  29. arXiv:1611.01988  [pdf, ps, other]

    cs.PL cs.LG

    Differentiable Functional Program Interpreters

    Authors: John K. Feser, Marc Brockschmidt, Alexander L. Gaunt, Daniel Tarlow

    Abstract: Programming by Example (PBE) is the task of inducing computer programs from input-output examples. It can be seen as a type of machine learning where the hypothesis space is the set of legal programs in some programming language. Recent work on differentiable interpreters relaxes the discrete space of programs into a continuous space so that search over programs can be performed using gradient-bas…

    Submitted 2 March, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

  30. arXiv:1608.04428  [pdf, other]

    cs.LG cs.AI cs.NE

    TerpreT: A Probabilistic Programming Language for Program Induction

    Authors: Alexander L. Gaunt, Marc Brockschmidt, Rishabh Singh, Nate Kushman, Pushmeet Kohli, Jonathan Taylor, Daniel Tarlow

    Abstract: We study machine learning formulations of inductive program synthesis; given input-output examples, we try to synthesize source code that maps inputs to corresponding outputs. Our aims are to develop new machine learning approaches based on neural networks and graphical models, and to understand the capabilities of machine learning techniques relative to traditional alternatives, such as those bas…

    Submitted 15 August, 2016; originally announced August 2016.

    Comments: 50 pages, 20 figures, 4 tables

  31. arXiv:1511.05493  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Gated Graph Sequence Neural Networks

    Authors: Yujia Li, Daniel Tarlow, Marc Brockschmidt, Richard Zemel

    Abstract: Graph-structured data appears frequently in domains including chemistry, natural language semantics, social networks, and knowledge bases. In this work, we study feature learning techniques for graph-structured inputs. Our starting point is previous work on Graph Neural Networks (Scarselli et al., 2009), which we modify to use gated recurrent units and modern optimization techniques and then exten…

    Submitted 22 September, 2017; v1 submitted 17 November, 2015; originally announced November 2015.

    Comments: Published as a conference paper in ICLR 2016. Fixed a typo

  32. arXiv:1412.3369  [pdf, other]

    cs.CV

    Candidate Constrained CRFs for Loss-Aware Structured Prediction

    Authors: Faruk Ahmed, Daniel Tarlow, Dhruv Batra

    Abstract: When evaluating computer vision systems, we are often concerned with performance on a task-specific evaluation measure such as the Intersection-Over-Union score used in the PASCAL VOC image segmentation challenge. Ideally, our systems would be tuned specifically to these evaluation measures. However, despite much work on loss-aware structured prediction, top performing systems do not use these tec…

    Submitted 10 December, 2014; originally announced December 2014.

    Comments: 20 pages including Supplement

  33. arXiv:1410.7452  [pdf, other]

    cs.CV cs.AI cs.LG

    Consensus Message Passing for Layered Graphical Models

    Authors: Varun Jampani, S. M. Ali Eslami, Daniel Tarlow, Pushmeet Kohli, John Winn

    Abstract: Generative models provide a powerful framework for probabilistic reasoning. However, in many domains their use has been hampered by the practical difficulties of inference. This is particularly the case in computer vision, where models of the imaging process tend to be large, loopy and layered. For this reason bottom-up conditional models have traditionally dominated in such domains. We find that…

    Submitted 26 January, 2015; v1 submitted 27 October, 2014; originally announced October 2014.

    Comments: Appearing in Proceedings of the 18th International Conference on Artificial Intelligence and Statistics (AISTATS) 2015

  34. arXiv:1401.0514  [pdf, other]

    cs.PL cs.LG stat.ML

    Structured Generative Models of Natural Source Code

    Authors: Chris J. Maddison, Daniel Tarlow

    Abstract: We study the problem of building generative models of natural source code (NSC); that is, source code written and understood by humans. Our primary contribution is to describe a family of generative models for NSC that have three key properties: First, they incorporate both sequential and hierarchical structure. Second, we learn a distributed representation of source code elements. Finally, they i…

    Submitted 20 June, 2014; v1 submitted 2 January, 2014; originally announced January 2014.

  35. arXiv:1309.6848  [pdf]

    cs.AI

    Tighter Linear Program Relaxations for High Order Graphical Models

    Authors: Elad Mezuman, Daniel Tarlow, Amir Globerson, Yair Weiss

    Abstract: Graphical models with High Order Potentials (HOPs) have received considerable interest in recent years. While there are a variety of approaches to inference in these models, nearly all of them amount to solving a linear program (LP) relaxation with unary consistency constraints between the HOP and the individual variables. In many cases, the resulting relaxations are loose, and in these cases the…

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-421-430

  36. arXiv:1210.4899  [pdf]

    cs.LG stat.ML

    Fast Exact Inference for Recursive Cardinality Models

    Authors: Daniel Tarlow, Kevin Swersky, Richard S. Zemel, Ryan Prescott Adams, Brendan J. Frey

    Abstract: Cardinality potentials are a generally useful class of high order potential that affect probabilities based on how many of $D$ binary variables are active. Maximum a posteriori (MAP) inference for cardinality potential models is well-understood, with efficient computations taking $O(D \log D)$ time. Yet efficient marginalization and sampling have not been addressed as thoroughly in the machine learning c…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-825-834

  37. arXiv:1206.3294  [pdf]

    cs.LG stat.ML

    Flexible Priors for Exemplar-based Clustering

    Authors: Daniel Tarlow, Richard S. Zemel, Brendan J. Frey

    Abstract: Exemplar-based clustering methods have been shown to produce state-of-the-art results on a number of synthetic and real-world clustering problems. They are appealing because they offer computational benefits over latent-mean models and can handle arbitrary pairwise similarity measures between data points. However, when trying to recover underlying structure in clustering problems, tailored similar…

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-537-545

  38. arXiv:1105.1178  [pdf, other]

    cs.LG cs.DS stat.ML

    Interpreting Graph Cuts as a Max-Product Algorithm

    Authors: Daniel Tarlow, Inmar E. Givoni, Richard S. Zemel, Brendan J. Frey

    Abstract: The maximum a posteriori (MAP) configuration of binary variable models with submodular graph-structured energy functions can be found efficiently and exactly by graph cuts. Max-product belief propagation (MP) has been shown to be suboptimal on this class of energy functions by a canonical counterexample where MP converges to a suboptimal fixed point (Kulesza & Pereira, 2008). In this work, we sh…

    Submitted 5 May, 2011; originally announced May 2011.