Showing 1–48 of 48 results for author: Clark, S

  1. arXiv:2406.17583  [pdf, other]

    cs.AI cs.LG cs.LO math.CT

    Towards Compositional Interpretability for XAI

    Authors: Sean Tull, Robin Lorenz, Stephen Clark, Ilyas Khan, Bob Coecke

    Abstract: Artificial intelligence (AI) is currently based largely on black-box machine learning models which lack interpretability. The field of eXplainable AI (XAI) strives to address this major concern, being critical in high-stakes areas such as the finance, legal and health sectors. We present an approach to defining AI models and their interpretability based on category theory. For this we employ the…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2402.11288  [pdf]

    cs.CV

    Enhancing Surgical Performance in Cardiothoracic Surgery with Innovations from Computer Vision and Artificial Intelligence: A Narrative Review

    Authors: Merryn D. Constable, Hubert P. H. Shum, Stephen Clark

    Abstract: When technical requirements are high, and patient outcomes are critical, opportunities for monitoring and improving surgical skills via objective motion analysis feedback may be particularly beneficial. This narrative review synthesises work on technical and non-technical surgical skills, collaborative task performance, and pose estimation to illustrate new opportunities to advance cardiothoracic…

    Submitted 17 February, 2024; originally announced February 2024.

  3. arXiv:2401.08585  [pdf, other]

    q-bio.NC cs.AI quant-ph

    From Conceptual Spaces to Quantum Concepts: Formalising and Learning Structured Conceptual Models

    Authors: Sean Tull, Razin A. Shaikh, Sara Sabrina Zemljic, Stephen Clark

    Abstract: In this article we present a new modelling framework for structured concepts using a category-theoretic generalisation of conceptual spaces, and show how the conceptual representations can be learned automatically from data, using two very different instantiations: one classical and one quantum. A contribution of the work is a thorough category-theoretic formalisation of our framework. We claim th…

    Submitted 6 November, 2023; originally announced January 2024.

    Comments: This article consolidates our previous reports on concept formalisation and learning: arXiv:2302.14822 and arXiv:2203.11216

  4. arXiv:2311.15696  [pdf, other]

    quant-ph cs.AI cs.LG

    Peptide Binding Classification on Quantum Computers

    Authors: Charles London, Douglas Brown, Wenduan Xu, Sezen Vatansever, Christopher James Langmead, Dimitri Kartsaklis, Stephen Clark, Konstantinos Meichanetzidis

    Abstract: We conduct an extensive study on using near-term quantum computers for a task in the domain of computational biology. By constructing quantum models based on parameterised quantum circuits we perform sequence classification on a task relevant to the design of therapeutic proteins, and find competitive performance with classical baselines of similar scale. To study the effect of noise, we run some…

    Submitted 27 November, 2023; originally announced November 2023.

  5. arXiv:2310.02074  [pdf, other]

    physics.ao-ph cs.LG

    ACE: A fast, skillful learned global atmospheric model for climate prediction

    Authors: Oliver Watt-Meyer, Gideon Dresdner, Jeremy McGibbon, Spencer K. Clark, Brian Henn, James Duncan, Noah D. Brenowitz, Karthik Kashinath, Michael S. Pritchard, Boris Bonev, Matthew E. Peters, Christopher S. Bretherton

    Abstract: Existing ML-based atmospheric models are not suitable for climate prediction, which requires long-term stability and physical consistency. We present ACE (AI2 Climate Emulator), a 200M-parameter, autoregressive machine learning emulator of an existing comprehensive 100-km resolution global atmospheric model. The formulation of ACE allows evaluation of physical laws such as the conservation of mass…

    Submitted 6 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted at Tackling Climate Change with Machine Learning: workshop at NeurIPS 2023

  6. arXiv:2302.14822  [pdf, other]

    q-bio.NC cs.AI quant-ph

    Formalising and Learning a Quantum Model of Concepts

    Authors: Sean Tull, Razin A. Shaikh, Sara Sabrina Zemljic, Stephen Clark

    Abstract: In this report we present a new modelling framework for concepts based on quantum theory, and demonstrate how the conceptual representations can be learned automatically from data. A contribution of the work is a thorough category-theoretic formalisation of our framework. We claim that the use of category theory, and in particular the use of string diagrams to describe quantum processes, helps elu…

    Submitted 7 February, 2023; originally announced February 2023.

  7. arXiv:2212.03106  [pdf, other]

    cs.RO

    Scale-Invariant Specifications for Human-Swarm Systems

    Authors: Joel Meyer, Ahalya Prabhakar, Allison Pinosky, Ian Abraham, Annalisa Taylor, Millicent Schlafly, Katarina Popovic, Giovani Diniz, Brendan Teich, Borislava Simidchieva, Shane Clark, Todd Murphey

    Abstract: We present a method for controlling a swarm using its spectral decomposition -- that is, by describing the set of trajectories of a swarm in terms of a spatial distribution throughout the operational domain -- guaranteeing scale invariance with respect to the number of agents both for computation and for the operator tasked with controlling the swarm. We use ergodic control, decentralized across t…

    Submitted 12 December, 2022; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: Journal of Field Robotics, Accepted for Publication. 25 pages

  8. arXiv:2211.11820  [pdf, other]

    physics.ao-ph cs.LG

    Machine-learned climate model corrections from a global storm-resolving model

    Authors: Anna Kwa, Spencer K. Clark, Brian Henn, Noah D. Brenowitz, Jeremy McGibbon, W. Andre Perkins, Oliver Watt-Meyer, Lucas Harris, Christopher S. Bretherton

    Abstract: Due to computational constraints, running global climate models (GCMs) for many years requires a lower spatial grid resolution (${\gtrsim}50$ km) than is optimal for accurately resolving important physical processes. Such processes are approximated in GCMs via subgrid parameterizations, which contribute significantly to the uncertainty in GCM predictions. One approach to improving the accuracy of…

    Submitted 21 November, 2022; originally announced November 2022.

  9. arXiv:2205.14507  [pdf, other]

    cs.DC

    HPC Extensions to the OpenKIM Processing Pipeline

    Authors: Daniel S. Karls, Steven M. Clark, Brendon A. Waters, Ryan S. Elliott, Ellad B. Tadmor

    Abstract: The Open Knowledgebase of Interatomic Models (OpenKIM) is an NSF Science Gateway that archives fully functional computer implementations of interatomic models (potentials and force fields) and simulation codes that use them to compute material properties. Interatomic models are coupled with compatible simulation codes and executed in a fully automated manner by the OpenKIM processing pipeline, a c…

    Submitted 28 May, 2022; originally announced May 2022.

  10. arXiv:2203.11216  [pdf, other]

    cs.LG cs.AI

    The Conceptual VAE

    Authors: Razin A. Shaikh, Sara Sabrina Zemljic, Sean Tull, Stephen Clark

    Abstract: In this report we present a new model of concepts, based on the framework of variational autoencoders, which is designed to have attractive properties such as factored conceptual domains, and at the same time be learnable from data. The model is inspired by, and closely related to, the Beta-VAE model of concepts, but is designed to be more closely connected with language, so that the names of conc…

    Submitted 21 March, 2022; originally announced March 2022.

  11. arXiv:2112.06872  [pdf, other]

    cs.CR cs.LG

    Efficient Differentially Private Secure Aggregation for Federated Learning via Hardness of Learning with Errors

    Authors: Timothy Stevens, Christian Skalka, Christelle Vincent, John Ring, Samuel Clark, Joseph Near

    Abstract: Federated machine learning leverages edge computing to develop models from network user data, but privacy in federated learning remains a major challenge. Techniques using differential privacy have been proposed to address this, but bring their own challenges -- many require a trusted third party or else add too much noise to produce useful models. Recent advances in \emph{secure aggregation} usin…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 16 pages, 4 figures

  12. Sim2Ls: FAIR simulation workflows and data

    Authors: Martin Hunt, Steven Clark, Daniel Mejia, Saaketh Desai, Alejandro Strachan

    Abstract: Just like the scientific data they generate, simulation workflows for research should be findable, accessible, interoperable, and reusable (FAIR). However, while significant progress has been made towards FAIR data, the majority of science and engineering workflows used in research remain poorly documented and often unavailable, involving ad hoc scripts and manual steps, hindering reproducibility…

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: 23 pages, 5 figures

  13. arXiv:2110.04236  [pdf, other]

    cs.CL cs.AI quant-ph

    lambeq: An Efficient High-Level Python Library for Quantum NLP

    Authors: Dimitri Kartsaklis, Ian Fan, Richie Yeung, Anna Pearson, Robin Lorenz, Alexis Toumi, Giovanni de Felice, Konstantinos Meichanetzidis, Stephen Clark, Bob Coecke

    Abstract: We present lambeq, the first high-level Python library for Quantum Natural Language Processing (QNLP). The open-source toolkit offers a detailed hierarchy of modules and classes implementing all stages of a pipeline for converting sentences to string diagrams, tensor networks, and quantum circuits ready to be used on a quantum computer. lambeq supports syntactic parsing, rewriting and simplificati…

    Submitted 8 October, 2021; originally announced October 2021.

  14. arXiv:2109.10044  [pdf, other]

    cs.CL

    Something Old, Something New: Grammar-based CCG Parsing with Transformer Models

    Authors: Stephen Clark

    Abstract: This report describes the parsing problem for Combinatory Categorial Grammar (CCG), showing how a combination of Transformer-based neural models and a symbolic CCG grammar can lead to substantial gains over existing approaches. The report also documents a 20-year research program, showing how NLP methods have evolved over this time. The staggering accuracy improvements provided by neural models fo…

    Submitted 28 September, 2021; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: Added to the description of the formal properties of CCG; added more description of how maxent and neural taggers differ; added a ref to some very recent CCG parsing work; fixed a bug in one of the figures; added a note and ref to the conclusions; added to the acknowledgements

  15. arXiv:2108.09416  [pdf]

    cs.SI cs.CL

    2020 U.S. presidential election in swing states: Gender differences in Twitter conversations

    Authors: Amir Karami, Spring B. Clark, Anderson Mackenzie, Dorathea Lee, Michael Zhu, Hannah R. Boyajieff, Bailey Goldschmidt

    Abstract: Social media is commonly used by the public during election campaigns to express their opinions regarding different issues. Among various social media channels, Twitter provides an efficient platform for researchers and politicians to explore public opinion regarding a wide range of topics such as the economy and foreign policy. Current literature mainly focuses on analyzing the content of tweets…

    Submitted 13 July, 2022; v1 submitted 20 August, 2021; originally announced August 2021.

  16. arXiv:2101.05125  [pdf, other]

    cs.AI

    Formalising Concepts as Grounded Abstractions

    Authors: Stephen Clark, Alexander Lerchner, Tamara von Glehn, Olivier Tieleman, Richard Tanburn, Misha Dashevskiy, Matko Bosnjak

    Abstract: The notion of concept has been studied for centuries, by philosophers, linguists, cognitive scientists, and researchers in artificial intelligence (Margolis & Laurence, 1999). There is a large literature on formal, mathematical models of concepts, including a whole sub-field of AI -- Formal Concept Analysis -- devoted to this topic (Ganter & Obiedkov, 2016). Recently, researchers in machine learni…

    Submitted 13 January, 2021; originally announced January 2021.

  17. arXiv:2012.05672  [pdf, other]

    cs.LG cs.AI cs.MA

    Imitating Interactive Intelligence

    Authors: Josh Abramson, Arun Ahuja, Iain Barr, Arthur Brussee, Federico Carnevale, Mary Cassin, Rachita Chhaparia, Stephen Clark, Bogdan Damoc, Andrew Dudzik, Petko Georgiev, Aurelia Guy, Tim Harley, Felix Hill, Alden Hung, Zachary Kenton, Jessica Landon, Timothy Lillicrap, Kory Mathewson, Soňa Mokrá, Alistair Muldal, Adam Santoro, Nikolay Savinov, Vikrant Varma, Greg Wayne, et al. (4 additional authors not shown)

    Abstract: A common vision from science fiction is that robots will one day inhabit our physical spaces, sense the world as we do, assist our physical labours, and communicate with us through natural language. Here we study how to design artificial agents that can interact naturally with humans using the simplification of a virtual environment. This setting nevertheless integrates a number of the central cha…

    Submitted 20 January, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

  18. arXiv:2011.04838  [pdf, other]

    cs.DB

    Answer Graph: Factorization Matters in Large Graphs

    Authors: Zahid Abul-Basher, Nikolay Yakovets, Parke Godfrey, Stanley Clark, Mark Chignell

    Abstract: Our answer-graph method to evaluate SPARQL conjunctive queries (CQs) finds a factorized answer set first, an answer graph, and then finds the embedding tuples from this. This approach can reduce greatly the cost to evaluate CQs. This affords a second advantage: we can construct a cost-based planner. We present the answer-graph approach, and overview our prototype system, Wireframe. We then offer p…

    Submitted 9 November, 2020; originally announced November 2020.

  19. arXiv:2010.09892  [pdf, other]

    cs.LG cs.IR cs.SI

    Understanding YouTube Communities via Subscription-based Channel Embeddings

    Authors: Sam Clark, Anna Zaitsev

    Abstract: YouTube is an important source of news and entertainment worldwide, but the scale makes it challenging to study the ideas and topics being discussed on the platform. This paper presents new methods to discover and classify YouTube channels which enable the analysis of communities and categories on the platform using orders of magnitude more channels than have been used in previous studies. Instead…

    Submitted 19 October, 2020; originally announced October 2020.

  20. Learning to Personalize for Web Search Sessions

    Authors: Saad Aloteibi, Stephen Clark

    Abstract: The task of session search focuses on using interaction data to improve relevance for the user's next query at the session level. In this paper, we formulate session search as a personalization task under the framework of learning to rank. Personalization approaches re-rank results to match a user model. Such user models are usually accumulated over time based on the user's browsing behaviour. We…

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: 10 pages; Preprint of the full paper accepted at CIKM 2020

    ACM Class: H.3

  21. arXiv:2009.01719  [pdf, other]

    cs.CL cs.AI

    Grounded Language Learning Fast and Slow

    Authors: Felix Hill, Olivier Tieleman, Tamara von Glehn, Nathaniel Wong, Hamza Merzic, Stephen Clark

    Abstract: Recent work has shown that large text-based neural language models, trained with conventional supervised learning objectives, acquire a surprising propensity for few- and one-shot learning. Here, we show that an embodied agent situated in a simulated 3D world, and endowed with a novel dual-coding external memory, can exhibit similar one-shot word learning when trained with conventional reinforceme…

    Submitted 14 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

  22. arXiv:2006.06081  [pdf, other]

    cs.RO

    Ergodic Specifications for Flexible Swarm Control: From User Commands to Persistent Adaptation

    Authors: Ahalya Prabhakar, Ian Abraham, Annalisa Taylor, Millicent Schlafly, Katarina Popovic, Giovani Diniz, Brendan Teich, Borislava Simidchieva, Shane Clark, Todd Murphey

    Abstract: This paper presents a formulation for swarm control and high-level task planning that is dynamically responsive to user commands and adaptable to environmental changes. We design an end-to-end pipeline from a tactile tablet interface for user commands to onboard control of robotic agents based on decentralized ergodic coverage. Our approach demonstrates reliable and dynamic control of a swarm coll…

    Submitted 10 June, 2020; originally announced June 2020.

    Journal ref: Robotics: Science and Systems (RSS), 2020

  23. arXiv:2006.01016  [pdf, other]

    cs.AI cs.CL cs.LG

    Probing Emergent Semantics in Predictive Agents via Question Answering

    Authors: Abhishek Das, Federico Carnevale, Hamza Merzic, Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Gregory Wayne, Felix Hill

    Abstract: Recent work has shown how predictive modeling can endow agents with rich knowledge of their surroundings, improving their ability to act in complex environments. We propose question-answering as a general paradigm to decode and understand the representations that such agents develop, applying our method to two recent approaches to predictive modeling -action-conditional CPC (Guo et al., 2018) and…

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: ICML 2020

  24. arXiv:2005.03684  [pdf, other]

    cs.CL cs.CV

    Learning to Segment Actions from Observation and Narration

    Authors: Daniel Fried, Jean-Baptiste Alayrac, Phil Blunsom, Chris Dyer, Stephen Clark, Aida Nematzadeh

    Abstract: We apply a generative segmental model of task structure, guided by narration, to action segmentation in video. We focus on unsupervised and weakly-supervised settings where no action labels are known during training. Despite its simplicity, our model performs competitively with previous work on a dataset of naturalistic instructional videos. Our model allows us to vary the sources of supervision u…

    Submitted 11 August, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: ACL 2020

  25. arXiv:1912.06686  [pdf, other]

    q-bio.NC cs.CV eess.IV

    Systematic Misestimation of Machine Learning Performance in Neuroimaging Studies of Depression

    Authors: Claas Flint, Micah Cearns, Nils Opel, Ronny Redlich, David M. A. Mehler, Daniel Emden, Nils R. Winter, Ramona Leenings, Simon B. Eickhoff, Tilo Kircher, Axel Krug, Igor Nenadic, Volker Arolt, Scott Clark, Bernhard T. Baune, Xiaoyi Jiang, Udo Dannlowski, Tim Hahn

    Abstract: We currently observe a disconcerting phenomenon in machine learning studies in psychiatry: While we would expect larger samples to yield better results due to the availability of more data, larger machine learning studies consistently show much weaker performance than the numerous small-scale studies. Here, we systematically investigated this effect focusing on one of the most heavily studied ques…

    Submitted 3 May, 2021; v1 submitted 13 December, 2019; originally announced December 2019.

    Journal ref: Neuropsychopharmacology 46 (2021) 1510-1517

  26. arXiv:1910.00571  [pdf, other]

    cs.AI

    Environmental drivers of systematicity and generalization in a situated agent

    Authors: Felix Hill, Andrew Lampinen, Rosalia Schneider, Stephen Clark, Matthew Botvinick, James L. McClelland, Adam Santoro

    Abstract: The question of whether deep neural networks are good at generalising beyond their immediate training experience is of critical importance for learning-based approaches to AI. Here, we consider tests of out-of-sample generalisation that require an agent to respond to never-seen-before instructions by manipulating and positioning objects in a 3D Unity simulated room. We first describe a comparative…

    Submitted 19 February, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

  27. arXiv:1909.11049  [pdf, ps, other]

    cs.CL

    Neural Generative Rhetorical Structure Parsing

    Authors: Amandla Mabona, Laura Rimell, Stephen Clark, Andreas Vlachos

    Abstract: Rhetorical structure trees have been shown to be useful for several document-level tasks including summarization and document classification. Previous approaches to RST parsing have used discriminative models; however, these are less sample efficient than generative models, and RST parsing datasets are typically small. In this paper, we present the first generative model for RST parsing. Our model…

    Submitted 24 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  28. arXiv:1906.06438  [pdf, other]

    cs.CL cs.LG

    Scalable Syntax-Aware Language Models Using Knowledge Distillation

    Authors: Adhiguna Kuncoro, Chris Dyer, Laura Rimell, Stephen Clark, Phil Blunsom

    Abstract: Prior work has shown that, on small amounts of training data, syntactic neural language models learn structurally sensitive generalisations more successfully than sequential language models. However, their computational complexity renders scaling difficult, and it remains an open question whether structural biases are still necessary when sequential models have access to ever larger amounts of tra…

    Submitted 14 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  29. Latent Tree Learning with Differentiable Parsers: Shift-Reduce Parsing and Chart Parsing

    Authors: Jean Maillard, Stephen Clark

    Abstract: Latent tree learning models represent sentences by composing their words according to an induced parse tree, all based on a downstream task. These models often outperform baselines which use (externally provided) syntax trees to drive the composition order. This work contributes (a) a new latent tree learning model based on shift-reduce parsing, with competitive downstream performance and non-triv…

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: ACL 2018 workshop on Relevance of Linguistic Structure in Neural Architectures for NLP

    Journal ref: Proceedings of the Workshop on the Relevance of Linguistic Structure in Neural Architectures for NLP, ACL 2018

  30. arXiv:1805.07051  [pdf, other]

    stat.ML cs.LG

    Bayesian Joint Spike-and-Slab Graphical Lasso

    Authors: Zehang Richard Li, Tyler H. McCormick, Samuel J. Clark

    Abstract: In this article, we propose a new class of priors for Bayesian inference with multiple Gaussian graphical models. We introduce fully Bayesian treatments of two popular procedures, the group graphical lasso and the fused graphical lasso, and extend them to a continuous spike-and-slab framework to allow self-adaptive shrinkage and model selection simultaneously. We develop an EM algorithm that perfo…

    Submitted 9 May, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

  31. arXiv:1804.07707  [pdf, ps, other]

    cs.CL

    Factorising AMR generation through syntax

    Authors: Kris Cao, Stephen Clark

    Abstract: Generating from Abstract Meaning Representation (AMR) is an underspecified problem, as many syntactic decisions are not constrained by the semantic graph. To explicitly account for this underspecification, we break down generating from AMR into two steps: first generate a syntactic structure, and then generate the surface form. We show that decomposing the generation process this way leads to stat…

    Submitted 3 April, 2019; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: Camera ready; accepted at NAACL-HLT 2019

  32. arXiv:1804.03984  [pdf, other]

    cs.AI cs.CL cs.LG cs.MA

    Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input

    Authors: Angeliki Lazaridou, Karl Moritz Hermann, Karl Tuyls, Stephen Clark

    Abstract: The ability of algorithms to evolve or learn (compositional) communication protocols has traditionally been studied in the language evolution literature through the use of emergent communication tasks. Here we scale up this research by using contemporary deep learning methods and by training reinforcement-learning neural network agents on referential communication games. We extend previous work, i…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: To appear at ICLR 2018

  33. arXiv:1804.03980  [pdf, other]

    cs.AI cs.CL cs.LG cs.MA

    Emergent Communication through Negotiation

    Authors: Kris Cao, Angeliki Lazaridou, Marc Lanctot, Joel Z Leibo, Karl Tuyls, Stephen Clark

    Abstract: Multi-agent reinforcement learning offers a way to study how communication could emerge in communities of agents needing to solve specific problems. In this paper, we study the emergence of communication in the negotiation environment, a semi-cooperative model of agent interaction. We introduce two communication protocols -- one grounded in the semantics of the game, and one which is \textit{a pri…

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Published as a conference paper at ICLR 2018

  34. arXiv:1710.09867  [pdf, other]

    cs.CL cs.AI cs.NE

    Understanding Early Word Learning in Situated Artificial Agents

    Authors: Felix Hill, Stephen Clark, Karl Moritz Hermann, Phil Blunsom

    Abstract: Neural network-based systems can now learn to locate the referents of words and phrases in images, answer questions about visual scenes, and execute symbolic instructions as first-person actors in partially-observable worlds. To achieve this so-called grounded language learning, models must overcome challenges that infants face when learning their first words. While it is notable that models with…

    Submitted 1 October, 2019; v1 submitted 26 October, 2017; originally announced October 2017.

  35. Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs

    Authors: Jean Maillard, Stephen Clark, Dani Yogatama

    Abstract: We introduce a neural network that represents sentences by composing their words according to induced binary parse trees. We use Tree-LSTM as our composition function, applied along a tree structure found by a fully differentiable natural language chart parser. Our model simultaneously optimises both the composition function and the parser, thus eliminating the need for externally-provided parse t…

    Submitted 25 May, 2017; originally announced May 2017.

    Journal ref: Natural Language Engineering 25, no. 4 (2019): 433-49

  36. arXiv:1702.05962  [pdf, other]

    cs.CL

    Latent Variable Dialogue Models and their Diversity

    Authors: Kris Cao, Stephen Clark

    Abstract: We present a dialogue generation model that directly captures the variability in possible responses to a given input, which reduces the `boring output' issue of deterministic dialogue models. Experiments show that our model generates more diverse outputs than baseline models, and also generates more consistently acceptable output than sampling from a deterministic encoder-decoder model.

    Submitted 20 February, 2017; originally announced February 2017.

    Comments: Accepted at EACL 2017

  37. arXiv:1612.04858  [pdf, other]

    cs.LG

    Bayesian Optimization for Machine Learning : A Practical Guidebook

    Authors: Ian Dewancker, Michael McCourt, Scott Clark

    Abstract: The engineering of machine learning systems is still a nascent field; relying on a seemingly daunting collection of quickly evolving tools and best practices. It is our hope that this guidebook will serve as a useful resource for machine learning practitioners looking to take advantage of Bayesian optimization techniques. We outline four example machine learning problems that can be solved using o…

    Submitted 14 December, 2016; originally announced December 2016.

  38. arXiv:1610.07432  [pdf, ps, other]

    cs.AI cs.CL cs.CV

    Virtual Embodiment: A Scalable Long-Term Strategy for Artificial Intelligence Research

    Authors: Douwe Kiela, Luana Bulat, Anita L. Vero, Stephen Clark

    Abstract: Meaning has been called the "holy grail" of a variety of scientific disciplines, ranging from linguistics to philosophy, psychology and the neurosciences. The field of Artificial Intelligence (AI) is very much a part of that list: the development of sophisticated natural language semantics is a sine qua non for achieving a level of intelligence comparable to humans. Embodiment theories in cognitive…

    Submitted 24 October, 2016; originally announced October 2016.

    MSC Class: 68T01

    ACM Class: I.2.6

  39. arXiv:1605.06170  [pdf, other]

    cs.LG

    Evaluation System for a Bayesian Optimization Service

    Authors: Ian Dewancker, Michael McCourt, Scott Clark, Patrick Hayes, Alexandra Johnson, George Ke

    Abstract: Bayesian optimization is an elegant solution to the hyperparameter optimization problem in machine learning. Building a reliable and robust Bayesian optimization service requires careful testing methodology and sound statistical analysis. In this talk we will outline our development of an evaluation framework to rigorously test and measure the impact of changes to the SigOpt optimization service.…

    Submitted 19 May, 2016; originally announced May 2016.

  40. arXiv:1603.09441  [pdf, other]

    cs.LG stat.ML

    A Stratified Analysis of Bayesian Optimization Methods

    Authors: Ian Dewancker, Michael McCourt, Scott Clark, Patrick Hayes, Alexandra Johnson, George Ke

    Abstract: Empirical analysis serves as an important complement to theoretical analysis for studying practical Bayesian optimization. Often empirical insights expose strengths and weaknesses inaccessible to theoretical analysis. We define two metrics for comparing the performance of Bayesian optimization methods and propose a ranking mechanism for summarizing performance within various genres or strata of te…

    Submitted 30 March, 2016; originally announced March 2016.

  41. arXiv:1411.7942  [pdf, other]

    cs.CL

    Using Sentence Plausibility to Learn the Semantics of Transitive Verbs

    Authors: Tamara Polajnar, Laura Rimell, Stephen Clark

    Abstract: The functional approach to compositional distributional semantics considers transitive verbs to be linear maps that transform the distributional vectors representing nouns into a vector representing a sentence. We conduct an initial investigation that uses a matrix consisting of the parameters of a logistic regression classifier trained on a plausibility task as a transitive verb function. We comp…

    Submitted 12 December, 2014; v1 submitted 28 November, 2014; originally announced November 2014.

    Comments: Full updated paper for NIPS learning semantics workshop, with some minor errata fixed

  42. The Frobenius anatomy of word meanings II: possessive relative pronouns

    Authors: Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke

    Abstract: Within the categorical compositional distributional model of meaning, we provide semantic interpretations for the subject and object roles of the possessive relative pronoun `whose'. This is done in terms of Frobenius algebras over compact closed categories. These algebras and their diagrammatic language expose how meanings of words in relative clauses interact with each other. We show how our int…

    Submitted 18 June, 2014; originally announced June 2014.

    Comments: 40 pages, Journal of Logic and Computation, Essays dedicated to Roy Dyckhoff on the occasion of his retirement, S. Graham-Lengrand and D. Galmiche (eds.), 2014

    MSC Class: 18Dxx; 18Axx ACM Class: I.2.7; F.4.1

  43. The Frobenius anatomy of word meanings I: subject and object relative pronouns

    Authors: Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke

    Abstract: This paper develops a compositional vector-based semantics of subject and object relative pronouns within a categorical framework. Frobenius algebras are used to formalise the operations required to model the semantics of relative pronouns, including passing information between the relative clause and the modified noun phrase, as well as copying, combining, and discarding parts of the relative cla…

    Submitted 21 April, 2014; originally announced April 2014.

    Comments: 31 pages

    Journal ref: Journal of Logic and Computation, Special Issue: The Incomputable, an Isaac Newton Institute Workshop, 23(6), pp.1293-1317, 2013

  44. arXiv:1312.5985  [pdf, other]

    cs.CL cs.LG

    Learning Type-Driven Tensor-Based Meaning Representations

    Authors: Tamara Polajnar, Luana Fagarasan, Stephen Clark

    Abstract: This paper investigates the learning of 3rd-order tensors representing the semantics of transitive verbs. The meaning representations are part of a type-driven tensor-based semantic framework, from the newly emerging field of compositional distributional semantics. Standard techniques from the neural networks literature are used to learn the tensors, which are tested on a selectional preference-st…

    Submitted 18 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: Submitted as part of the open review process for ICLR'14. The paper contains 10 pages, 3 figures, 4 tables

    ACM Class: H.3.1

  45. arXiv:1305.0556  [pdf, other]

    cs.CL quant-ph

    A quantum teleportation inspired algorithm produces sentence meaning from word meaning and grammatical structure

    Authors: Stephen Clark, Bob Coecke, Edward Grefenstette, Stephen Pulman, Mehrnoosh Sadrzadeh

    Abstract: We discuss an algorithm which produces the meaning of a sentence given meanings of its words, and its resemblance to quantum teleportation. In fact, this protocol was the main source of inspiration for this algorithm which has many applications in the area of Natural Language Processing.

    Submitted 11 October, 2013; v1 submitted 2 May, 2013; originally announced May 2013.

    Comments: 10 pages, many pictures

    MSC Class: 68T50 ACM Class: I.2.7

  46. arXiv:1101.0309  [pdf, ps, other]

    cs.CL cs.AI cs.IR

    Concrete Sentence Spaces for Compositional Distributional Models of Meaning

    Authors: Edward Grefenstette, Mehrnoosh Sadrzadeh, Stephen Clark, Bob Coecke, Stephen Pulman

    Abstract: Coecke, Sadrzadeh, and Clark (arXiv:1003.4394v1 [cs.CL]) developed a compositional model of meaning for distributional semantics, in which each word in a sentence has a meaning vector and the distributional meaning of the sentence is a function of the tensor products of the word vectors. Abstractly speaking, this function is the morphism corresponding to the grammatical structure of the sentence i…

    Submitted 31 December, 2010; originally announced January 2011.

    Comments: 10 pages, presented at the International Conference on Computational Semantics 2011 (IWCS'11), to be published in proceedings

    MSC Class: 68T50 ACM Class: G.1.3; H.3.1; H.3.3

    Journal ref: Proceedings of the 9th International Conference on Computational Semantics (2011)

  47. arXiv:1012.0531  [pdf, ps, other]

    quant-ph cond-mat.other cs.CC cs.LO math-ph

    Categorical Tensor Network States

    Authors: Jacob D. Biamonte, Stephen R. Clark, Dieter Jaksch

    Abstract: We examine the use of string diagrams and the mathematics of category theory in the description of quantum states by tensor networks. This approach led to a unification of several ideas, as well as several results and methods that have not previously appeared in either side of the literature. Our approach enabled the development of a tensor network framework allowing a solution to the quantum dec…

    Submitted 17 December, 2011; v1 submitted 2 December, 2010; originally announced December 2010.

    Comments: 39 pages, 31 figures, published version

    Journal ref: AIP Advances 1(4), 042172 (2011)

  48. arXiv:1003.4394  [pdf, other]

    cs.CL cs.LO math.CT

    Mathematical Foundations for a Compositional Distributional Model of Meaning

    Authors: Bob Coecke, Mehrnoosh Sadrzadeh, Stephen Clark

    Abstract: We propose a mathematical framework for a unification of the distributional theory of meaning in terms of vector space models, and a compositional theory for grammatical types, for which we rely on the algebra of Pregroups, introduced by Lambek. This mathematical framework enables us to compute the meaning of a well-typed sentence from the meanings of its constituents. Concretely, the type reduct…

    Submitted 23 March, 2010; originally announced March 2010.

    Comments: to appear

    Journal ref: Lambek Festschrift, special issue of Linguistic Analysis, 2010.