
Showing 1–50 of 52 results for author: John, S

Searching in archive cs.
  1. arXiv:2406.12881  [pdf, other]

    physics.acc-ph cs.CL

    Towards Unlocking Insights from Logbooks Using AI

    Authors: Antonin Sulc, Alex Bien, Annika Eichler, Daniel Ratner, Florian Rehm, Frank Mayet, Gregor Hartmann, Hayden Hoschouer, Henrik Tuennermann, Jan Kaiser, Jason St. John, Jennefer Maldonado, Kyle Hazelwood, Raimund Kammering, Thorsten Hellert, Tim Wilksen, Verena Kain, Wan-Lin Hu

    Abstract: Electronic logbooks contain valuable information about activities and events concerning their associated particle accelerator facilities. However, the highly technical nature of logbook entries can hinder their usability and automation. As natural language processing (NLP) continues advancing, it offers opportunities to address various challenges that logbooks present. This work explores jointly t…

    Submitted 25 May, 2024; originally announced June 2024.

    Comments: 5 pages, 1 figure, 15th International Particle Accelerator Conference

  2. arXiv:2406.03337  [pdf, other]

    cs.LG stat.ML

    Identifying latent state transition in non-linear dynamical systems

    Authors: Çağlar Hızlı, Çağatay Yıldız, Matthias Bethge, ST John, Pekka Marttinen

    Abstract: This work aims to improve generalization and interpretability of dynamical systems by recovering the underlying lower-dimensional latent states and their time evolutions. Previous work on disentangled representation learning within the realm of dynamical systems focused on the latent states, possibly with linear transition approximations. As such, they cannot identify nonlinear transition dynamics…

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2404.04965  [pdf]

    cs.HC q-bio.NC

    Towards Developing Brain-Computer Interfaces for People with Multiple Sclerosis

    Authors: John S. Russo, Tim Mahoney, Kirill Kokorin, Ashley Reynolds, Chin-Hsuan Sophie Lin, Sam E. John, David B. Grayden

    Abstract: Multiple Sclerosis (MS) is a severely disabling condition that leads to various neurological symptoms. A Brain-Computer Interface (BCI) may substitute some lost function; however, there is a lack of BCI research in people with MS. To progress this research area effectively and efficiently, we aimed to evaluate user needs and assess the feasibility and user-centric requirements of a BCI for people…

    Submitted 8 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: 18 pages, 9 figures, 1 table. For supplementary material, please contact the corresponding author; corrected ordering of figures 6 and 7

  4. arXiv:2312.17372  [pdf, other]

    cs.LG cs.AI physics.acc-ph

    Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e

    Authors: Chenwei Xu, Jerry Yao-Chieh Hu, Aakaash Narayanan, Mattson Thieme, Vladimir Nagaslaev, Mark Austin, Jeremy Arnold, Jose Berlioz, Pierrick Hanlet, Aisha Ibrahim, Dennis Nicklaus, Jovan Mitrevski, Jason Michael St. John, Gauri Pradhan, Andrea Saewert, Kiyomi Seiya, Brian Schupbach, Randy Thurman-Keup, Nhan Tran, Rui Shi, Seda Ogrenci, Alexis Maya-Isabelle Shuping, Kyle Hazelwood, Han Liu

    Abstract: We introduce a novel Proximal Policy Optimization (PPO) algorithm aimed at addressing the challenge of maintaining a uniform proton beam intensity delivery in the Muon to Electron Conversion Experiment (Mu2e) at Fermi National Accelerator Laboratory (Fermilab). Our primary objective is to regulate the spill process to ensure a consistent intensity profile, with the ultimate goal of creating an aut…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 10 pages, accepted at NeurIPS 2023 ML4Phy Workshop

  5. arXiv:2311.03129  [pdf, other]

    stat.ML cs.LG

    Nonparametric modeling of the composite effect of multiple nutrients on blood glucose dynamics

    Authors: Arina Odnoblyudova, Çağlar Hizli, ST John, Andrea Cognolato, Anne Juuti, Simo Särkkä, Kirsi Pietiläinen, Pekka Marttinen

    Abstract: In biomedical applications it is often necessary to estimate a physiological response to a treatment consisting of multiple components, and learn the separate effects of the components in addition to the joint effect. Here, we extend existing probabilistic nonparametric approaches to explicitly address this problem. We also develop a new convolution-based model for composite treatment-response cur…

    Submitted 6 November, 2023; originally announced November 2023.

  6. arXiv:2310.11527  [pdf, other]

    stat.ML cs.LG

    Thin and Deep Gaussian Processes

    Authors: Daniel Augusto de Souza, Alexander Nikitin, ST John, Magnus Ross, Mauricio A. Álvarez, Marc Peter Deisenroth, João P. P. Gomes, Diego Mesquita, César Lincoln C. Mattos

    Abstract: Gaussian processes (GPs) can provide a principled approach to uncertainty quantification with easy-to-interpret kernel hyperparameters, such as the lengthscale, which controls the correlation distance of function values. However, selecting an appropriate kernel can be challenging. Deep GPs avoid manual kernel engineering by successively parameterizing kernels with GP layers, allowing them to learn…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted at the Conference on Neural Information Processing Systems (NeurIPS) 2023

  7. arXiv:2307.03093  [pdf, other]

    cs.LG stat.ML

    Beyond Intuition, a Framework for Applying GPs to Real-World Data

    Authors: Kenza Tazi, Jihao Andreas Lin, Ross Viljoen, Alex Gardner, ST John, Hong Ge, Richard E. Turner

    Abstract: Gaussian Processes (GPs) offer an attractive method for regression over small, structured and correlated datasets. However, their deployment is hindered by computational costs and limited guidelines on how to apply GPs beyond simple low-dimensional datasets. We propose a framework to identify the suitability of GPs to a given problem and how to set up a robust and well-specified GP model. The guid…

    Submitted 17 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at the ICML Workshop on Structured Probabilistic Inference and Generative Modelling (2023)

  8. arXiv:2306.10915  [pdf, other]

    stat.ML cs.LG

    Practical Equivariances via Relational Conditional Neural Processes

    Authors: Daolang Huang, Manuel Haussmann, Ulpu Remes, ST John, Grégoire Clarté, Kevin Sebastian Luck, Samuel Kaski, Luigi Acerbi

    Abstract: Conditional Neural Processes (CNPs) are a class of metalearning models popular for combining the runtime efficiency of amortized inference with reliable uncertainty quantification. Many relevant machine learning tasks, such as in spatio-temporal modeling, Bayesian Optimization and continuous control, inherently contain equivariances -- for example to translation -- which the model can exploit for…

    Submitted 5 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 38 pages, 8 figures. Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  9. arXiv:2306.09656  [pdf, other]

    cs.LG stat.ME

    Temporal Causal Mediation through a Point Process: Direct and Indirect Effects of Healthcare Interventions

    Authors: Çağlar Hızlı, ST John, Anne Juuti, Tuure Saarinen, Kirsi Pietiläinen, Pekka Marttinen

    Abstract: Deciding on an appropriate intervention requires a causal model of a treatment, the outcome, and potential mediators. Causal mediation analysis lets us distinguish between direct and indirect effects of the intervention, but has mostly been studied in a static setting. In healthcare, data come in the form of complex, irregularly sampled time-series, with dynamic interdependencies between a treatme…

    Submitted 16 June, 2023; originally announced June 2023.

  10. arXiv:2306.04201  [pdf, other]

    cs.LG stat.ML

    Improving Hyperparameter Learning under Approximate Inference in Gaussian Process Models

    Authors: Rui Li, ST John, Arno Solin

    Abstract: Approximate inference in Gaussian process (GP) models with non-conjugate likelihoods gets entangled with the learning of the model hyperparameters. We improve hyperparameter learning in GP models and focus on the interplay between variational inference (VI) and the learning target. While VI's lower bound to the marginal likelihood is a suitable objective for inferring the approximate posterior, we…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  11. arXiv:2306.03566  [pdf, other]

    cs.LG stat.ML

    Memory-Based Dual Gaussian Processes for Sequential Learning

    Authors: Paul E. Chang, Prakhar Verma, S. T. John, Arno Solin, Mohammad Emtiyaz Khan

    Abstract: Sequential learning with Gaussian processes (GPs) is challenging when access to past data is limited, for example, in continual and active learning. In such cases, errors can accumulate over time due to inaccuracies in the posterior, hyperparameters, and inducing points, making accurate learning challenging. Here, we present a method to keep all such errors in check using the recently proposed dua…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  12. arXiv:2305.14120  [pdf, other]

    cs.LG stat.ML

    Learning Relevant Contextual Variables Within Bayesian Optimization

    Authors: Julien Martinelli, Ayush Bharti, Armi Tiihonen, S. T. John, Louis Filstroff, Sabina J. Sloman, Patrick Rinke, Samuel Kaski

    Abstract: Contextual Bayesian Optimization (CBO) efficiently optimizes black-box functions with respect to design variables, while simultaneously integrating contextual information regarding the environment, such as experimental conditions. However, the relevance of contextual variables is not necessarily known beforehand. Moreover, contextual variables can sometimes be optimized themselves at an additional…

    Submitted 24 May, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  13. Queer In AI: A Case Study in Community-Led Participatory AI

    Authors: Organizers Of QueerInAI: Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubička, Hang Yuan, Hetvi J, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, A Pranav, et al. (26 additional authors not shown)

    Abstract: We present Queer in AI as a case study for community-led participatory design in AI. We examine how participatory design and intersectional tenets started and shaped this community's programs over the years. We discuss different challenges that emerged in the process, look at ways this organization has fallen short of operationalizing participatory and intersectional principles, and then assess th…

    Submitted 8 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: To appear at FAccT 2023

    Journal ref: 2023 ACM Conference on Fairness, Accountability, and Transparency

  14. arXiv:2303.01661  [pdf]

    eess.IV cs.LG physics.app-ph physics.ins-det physics.optics

    Longwave infrared multispectral image sensor system using aluminum-germanium plasmonic filter arrays

    Authors: Noor E Karishma Shaik, Bryce Widdicombe, Dechuan Sun, Sam E John, Dongryeol Ryu, Ampalavanapillai Nirmalathas, Ranjith R Unnithan

    Abstract: A multispectral camera records image data in various wavelengths across the electromagnetic spectrum to acquire additional information that a conventional camera fails to capture. With the advent of high-resolution image sensors and colour filter technologies, multispectral imagers in the visible wavelengths have become popular with increasing commercial viability in the last decade. However, mult…

    Submitted 2 March, 2023; originally announced March 2023.

  15. arXiv:2302.10786  [pdf, other]

    cs.CL cs.CY cs.HC cs.IR

    Real-World Deployment and Evaluation of Kwame for Science, An AI Teaching Assistant for Science Education in West Africa

    Authors: George Boateng, Samuel John, Samuel Boateng, Philemon Badu, Patrick Agyeman-Budu, Victor Kumbol

    Abstract: Africa has a high student-to-teacher ratio which limits students' access to teachers for learning support such as educational question answering. In this work, we extended Kwame, a bilingual AI teaching assistant for coding education, adapted it for science education, and deployed it as a web app. Kwame for Science provides passages from well-curated knowledge sources and related past national exa…

    Submitted 26 April, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 14 pages, Accepted for publication at the 25th International Conference on Artificial Intelligence in Education (AIED 2024)

  16. Plug & Play Directed Evolution of Proteins with Gradient-based Discrete MCMC

    Authors: Patrick Emami, Aidan Perreault, Jeffrey Law, David Biagioni, Peter C. St. John

    Abstract: A long-standing goal of machine-learning-based protein engineering is to accelerate the discovery of novel mutations that improve the function of a known protein. We introduce a sampling framework for evolving proteins in silico that supports mixing and matching a variety of unsupervised models, such as protein language models, and supervised models that predict protein function from sequence. By…

    Submitted 6 April, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 31 pages, 8 figures. To appear in the Machine Learning: Science & Technology (ML:S&T) journal. Code is available at https://github.com/pemami4911/ppde. A short version of this work appeared at the NeurIPS 2022 Machine Learning in Structural Biology Workshop

  17. arXiv:2211.06260  [pdf, other]

    cs.LG stat.ML

    Towards Improved Learning in Gaussian Processes: The Best of Two Worlds

    Authors: Rui Li, ST John, Arno Solin

    Abstract: Gaussian process training decomposes into inference of the (approximate) posterior and learning of the hyperparameters. For non-Gaussian (non-conjugate) likelihoods, two common choices for approximate inference are Expectation Propagation (EP) and Variational Inference (VI), which have complementary strengths and weaknesses. While VI's lower bound to the marginal likelihood is a suitable objective…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: In the 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  18. arXiv:2211.01053  [pdf, other]

    cs.LG stat.ML

    Fantasizing with Dual GPs in Bayesian Optimization and Active Learning

    Authors: Paul E. Chang, Prakhar Verma, ST John, Victor Picheny, Henry Moss, Arno Solin

    Abstract: Gaussian processes (GPs) are the main surrogate functions used for sequential modelling such as Bayesian Optimization and Active Learning. Their drawbacks are poor scaling with data and the need to run an optimization loop when using a non-Gaussian likelihood. In this paper, we focus on 'fantasizing' batch acquisition functions that need the ability to condition on new fantasized data computationa…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: In the 2022 NeurIPS Workshop on Gaussian Processes, Spatiotemporal Modeling, and Decision-making Systems

  19. A Low-Power 1 Gb/s Line Driver with Configurable Pre-Emphasis for Lossy Transmission Lines

    Authors: Nicholas St. John, Soumyajit Mandal, Grzegorz W. Deptuch, Eric Raguzin, Sergio Rescia

    Abstract: A line driver with configurable pre-emphasis is implemented in a 65 nm CMOS process. The driver utilizes a three-tap feed-forward equalization (FFE) architecture. The relative delays between the taps are selectable in increments of 1/16th of the unit interval (UI) via an 8-stage delay-locked loop (DLL) and digital interpolator. It is also possible to control the output amplitude and source impedan…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Submitted to JINST

  20. arXiv:2209.04142  [pdf, other]

    cs.LG stat.ME

    Causal Modeling of Policy Interventions From Sequences of Treatments and Outcomes

    Authors: Çağlar Hızlı, ST John, Anne Juuti, Tuure Saarinen, Kirsi Pietiläinen, Pekka Marttinen

    Abstract: A treatment policy defines when and what treatments are applied to affect some outcome of interest. Data-driven decision-making requires the ability to predict what happens if a policy is changed. Existing methods that predict how the outcome evolves under different scenarios assume that the tentative sequences of future treatments are fixed in advance, while in practice the treatments are determi…

    Submitted 20 June, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: Accepted at ICML 2023

  21. arXiv:2206.13703  [pdf, other]

    cs.CL cs.CY cs.HC

    Kwame for Science: An AI Teaching Assistant Based on Sentence-BERT for Science Education in West Africa

    Authors: George Boateng, Samuel John, Andrew Glago, Samuel Boateng, Victor Kumbol

    Abstract: Africa has a high student-to-teacher ratio which limits students' access to teachers. Consequently, students struggle to get answers to their questions. In this work, we extended Kwame, our previous AI teaching assistant, adapted it for science education, and deployed it as a web app. Kwame for Science answers questions of students based on the Integrated Science subject of the West African Senior…

    Submitted 10 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 5 pages, Accepted at the Fourth Workshop on Intelligent Textbooks (iTextbooks) at the 23rd International Conference on Artificial Intelligence in Education (AIED 2022)

  22. arXiv:2111.08524  [pdf, other]

    cs.LG stat.ML

    Non-separable Spatio-temporal Graph Kernels via SPDEs

    Authors: Alexander Nikitin, ST John, Arno Solin, Samuel Kaski

    Abstract: Gaussian processes (GPs) provide a principled and direct approach for inference and learning on graphs. However, the lack of justified graph kernels for spatio-temporal modelling has held back their use in graph problems. We leverage an explicit link between stochastic partial differential equations (SPDEs) and GPs on graphs, introduce a framework for deriving graph kernels via SPDEs, and derive n…

    Submitted 22 March, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  23. arXiv:2110.11466  [pdf, other]

    cs.LG cs.DC

    MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

    Authors: Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, et al. (18 additional authors not shown)

    Abstract: Scientific communities are increasingly adopting machine learning and deep learning models in their applications to accelerate scientific insights. High performance computing systems are pushing the frontiers of performance with a rich diversity of hardware resources and massive scale-out capabilities. There is a critical need to understand fair and effective benchmarking of machine learning appli…

    Submitted 26 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

  24. arXiv:2110.10828  [pdf, other]

    cs.LG

    AdamD: Improved bias-correction in Adam

    Authors: John St John

    Abstract: Here I present a small update to the bias-correction term in the Adam optimizer that has the advantage of making smaller gradient updates in the first several steps of training. With the default bias-correction, Adam may actually make larger than requested gradient updates early in training. By only including the well-justified bias-correction of the second moment gradient estimate, $v_t$, and exc…

    Submitted 22 October, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 8 pages, 1 figure

  25. arXiv:2104.05674  [pdf, ps, other]

    stat.ML cs.LG

    GPflux: A Library for Deep Gaussian Processes

    Authors: Vincent Dutordoir, Hugh Salimbeni, Eric Hambro, John McLeod, Felix Leibfried, Artem Artemev, Mark van der Wilk, James Hensman, Marc P. Deisenroth, ST John

    Abstract: We introduce GPflux, a Python library for Bayesian deep learning with a strong emphasis on deep Gaussian processes (DGPs). Implementing DGPs is a challenging endeavour due to the various mathematical subtleties that arise when dealing with multivariate Gaussian distributions and the complex bookkeeping of indices. To date, there are no actively maintained, open-sourced and extendable libraries ava…

    Submitted 12 April, 2021; originally announced April 2021.

  26. arXiv:2012.13962  [pdf, other]

    cs.LG stat.ML

    A Tutorial on Sparse Gaussian Processes and Variational Inference

    Authors: Felix Leibfried, Vincent Dutordoir, ST John, Nicolas Durrande

    Abstract: Gaussian processes (GPs) provide a framework for Bayesian inference that can offer principled uncertainty estimates for a large range of problems. For example, if we consider regression problems with Gaussian likelihoods, a GP model enjoys a posterior in closed form. However, identifying the posterior GP scales cubically with the number of training examples and requires to store all examples in me…

    Submitted 18 December, 2022; v1 submitted 27 December, 2020; originally announced December 2020.

  27. arXiv:2012.02328  [pdf, other]

    cs.LG cs.DC

    MLPerf Mobile Inference Benchmark

    Authors: Vijay Janapa Reddi, David Kanter, Peter Mattson, Jared Duke, Thai Nguyen, Ramesh Chukka, Ken Shiring, Koan-Sin Tan, Mark Charlebois, William Chou, Mostafa El-Khamy, Jungwook Hong, Tom St. John, Cindy Trinh, Michael Buch, Mark Mazumder, Relia Markovic, Thomas Atta, Fatih Cakir, Masoud Charkhabi, Xiaodong Chen, Cheng-Ming Chiang, Dave Dexter, Terry Heo, Gunther Schmuelling, et al. (2 additional authors not shown)

    Abstract: This paper presents the first industry-standard open-source machine learning (ML) benchmark to allow performance and accuracy evaluation of mobile devices with different AI chips and software stacks. The benchmark draws from the expertise of leading mobile-SoC vendors, ML-framework providers, and model producers. It comprises a suite of models that operate with standard data sets, quality metrics…

    Submitted 6 April, 2022; v1 submitted 3 December, 2020; originally announced December 2020.

  28. Maximum Covering Subtrees for Phylogenetic Networks

    Authors: Nathan Davidov, Amanda Hernandez, Justin Jian, Patrick McKenna, K. A. Medlin, Roadra Mojumder, Megan Owen, Andrew Quijano, Amanda Rodriguez, Katherine St. John, Katherine Thai, Meliza Uraga

    Abstract: Tree-based phylogenetic networks, which may be roughly defined as leaf-labeled networks built by adding arcs only between the original tree edges, have elegant properties for modeling evolutionary histories. We answer an open question of Francis, Semple, and Steel about the complexity of determining how far a phylogenetic network is from being tree-based, including non-binary phylogenetic networks…

    Submitted 24 November, 2020; v1 submitted 25 September, 2020; originally announced September 2020.

  29. arXiv:2008.12408  [pdf, other]

    cs.MM cs.IT cs.LG

    Rate distortion optimization over large scale video corpus with machine learning

    Authors: Sam John, Akshay Gadde, Balu Adsumilli

    Abstract: We present an efficient codec-agnostic method for bitrate allocation over a large scale video corpus with the goal of minimizing the average bitrate subject to constraints on average and minimum quality. Our method clusters the videos in the corpus such that videos within one cluster have similar rate-distortion (R-D) characteristics. We train a support vector machine classifier to predict the R-D…

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: Accepted in 2020 IEEE International Conference on Image Processing (ICIP)

  30. arXiv:2003.04125  [pdf, other]

    stat.ML cs.LG stat.ME

    Amortized variance reduction for doubly stochastic objectives

    Authors: Ayman Boustati, Sattar Vakili, James Hensman, ST John

    Abstract: Approximate inference in complex probabilistic models such as deep Gaussian processes requires the optimisation of doubly stochastic objective functions. These objectives incorporate randomness both from mini-batch subsampling of the data and from Monte Carlo estimation of expectations. If the gradient variance is high, the stochastic optimisation problem becomes difficult with a slow rate of conv…

    Submitted 9 March, 2020; originally announced March 2020.

  31. arXiv:2003.01115  [pdf, other]

    stat.ML cs.LG

    A Framework for Interdomain and Multioutput Gaussian Processes

    Authors: Mark van der Wilk, Vincent Dutordoir, ST John, Artem Artemev, Vincent Adam, James Hensman

    Abstract: One obstacle to the use of Gaussian processes (GPs) in large-scale problems, and as a component in deep learning system, is the need for bespoke derivations and implementations for small variations in the model or inference. In order to improve the utility of GPs we need a modular system that allows rapid implementation and testing, as seen in the neural network community. We present a mathematica…

    Submitted 2 March, 2020; originally announced March 2020.

  32. arXiv:1911.05796  [pdf, ps, other]

    astro-ph.IM cs.AI physics.soc-ph

    Response to NITRD, NCO, NSF Request for Information on "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan"

    Authors: J. Amundson, J. Annis, C. Avestruz, D. Bowring, J. Caldeira, G. Cerati, C. Chang, S. Dodelson, D. Elvira, A. Farahi, K. Genser, L. Gray, O. Gutsche, P. Harris, J. Kinney, J. B. Kowalkowski, R. Kutschke, S. Mrenna, B. Nord, A. Para, K. Pedro, G. N. Perdue, A. Scheinker, P. Spentzouris, J. St. John, et al. (5 additional authors not shown)

    Abstract: We present a response to the 2018 Request for Information (RFI) from the NITRD, NCO, NSF regarding the "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan." Through this document, we provide a response to the question of whether and how the National Artificial Intelligence Research and Development Strategic Plan (NAIRDSP) should be updated from the perspect…

    Submitted 4 November, 2019; originally announced November 2019.

    Report number: FERMILAB-FN-1092-SCD

  33. arXiv:1911.02549  [pdf, other]

    cs.LG cs.PF stat.ML

    MLPerf Inference Benchmark

    Authors: Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee, et al. (22 additional authors not shown)

    Abstract: Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devic…

    Submitted 9 May, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: ISCA 2020

  34. arXiv:1910.01500  [pdf, other]

    cs.LG cs.PF stat.ML

    MLPerf Training Benchmark

    Authors: Peter Mattson, Christine Cheng, Cody Coleman, Greg Diamos, Paulius Micikevicius, David Patterson, Hanlin Tang, Gu-Yeon Wei, Peter Bailis, Victor Bittorf, David Brooks, Dehao Chen, Debojyoti Dutta, Udit Gupta, Kim Hazelwood, Andrew Hock, Xinyuan Huang, Atsushi Ike, Bill Jia, Daniel Kang, David Kanter, Naveen Kumar, Jeffery Liao, Guokai Ma, Deepak Narayanan, et al. (12 additional authors not shown)

    Abstract: Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges absent from other domains: optimizations that improve training throughput can increase the time to solution, training is stochastic and time to solution exhibits h…

    Submitted 2 March, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: MLSys 2020

  35. arXiv:1907.05647  [pdf, other]

    cs.AI cs.SE

    Automatic Generation of Atomic Consistency Preserving Search Operators for Search-Based Model Engineering

    Authors: Alexandru Burdusel, Steffen Zschaler, Stefan John

    Abstract: Recently there has been increased interest in combining the fields of Model-Driven Engineering (MDE) and Search-Based Software Engineering (SBSE). Such approaches use meta-heuristic search guided by search operators (model mutators and sometimes breeders) implemented as model transformations. The design of these operators can substantially impact the effectiveness and efficiency of the meta-heuris…

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: Technical report version of the MODELS 2019 paper with the same title

  36. arXiv:1902.10974  [pdf, other]

    stat.ML cs.LG

    Gaussian Process Modulated Cox Processes under Linear Inequality Constraints

    Authors: Andrés F. López-Lopera, ST John, Nicolas Durrande

    Abstract: Gaussian process (GP) modulated Cox processes are widely used to model point patterns. Existing approaches require a mapping (link function) between the unconstrained GP and the positive intensity function. This commonly yields solutions that do not have a closed form or that are restricted to specific covariance functions. We introduce a novel finite approximation of GP-modulated Cox processes wh…

    Submitted 28 February, 2019; originally announced February 2019.

  37. arXiv:1812.11106  [pdf, other]

    cs.LG stat.ML

    Scalable GAM using sparse variational Gaussian processes

    Authors: Vincent Adam, Nicolas Durrande, ST John

    Abstract: Generalized additive models (GAMs) are a widely used class of models of interest to statisticians as they provide a flexible way to design interpretable models of data beyond linear models. We here propose a scalable and well-calibrated Bayesian treatment of GAMs using Gaussian processes (GPs) and leveraging recent advances in variational inference. We use sparse GPs to represent each component an…

    Submitted 28 December, 2018; originally announced December 2018.

    Journal ref: 1st Symposium on Advances in Approximate Bayesian Inference, 2018

  38. arXiv:1808.07269  [pdf, other]

    hep-ex cs.CV physics.data-an physics.ins-det

    A Deep Neural Network for Pixel-Level Electromagnetic Particle Identification in the MicroBooNE Liquid Argon Time Projection Chamber

    Authors: MicroBooNE collaboration, C. Adams, M. Alrashed, R. An, J. Anthony, J. Asaadi, A. Ashkenazi, M. Auger, S. Balasubramanian, B. Baller, C. Barnes, G. Barr, M. Bass, F. Bay, A. Bhat, K. Bhattacharya, M. Bishai, A. Blake, T. Bolton, L. Camilleri, D. Caratelli, I. Caro Terrazas, R. Carr, R. Castillo Fernandez, F. Cavanna, et al. (148 additional authors not shown)

    Abstract: We have developed a convolutional neural network (CNN) that can make a pixel-level prediction of objects in image data recorded by a liquid argon time projection chamber (LArTPC) for the first time. We describe the network design, training techniques, and software tools developed to train this network. The goal of this work is to develop a complete deep neural network based data reconstruction cha…

    Submitted 22 August, 2018; originally announced August 2018.

    Journal ref: Phys. Rev. D 99, 092001 (2019)

  39. arXiv:1808.05563  [pdf, other]

    cs.LG stat.ML

    Learning Invariances using the Marginal Likelihood

    Authors: Mark van der Wilk, Matthias Bauer, ST John, James Hensman

    Abstract: Generalising well in supervised learning tasks relies on correctly extrapolating the training data to a large region of the input space. One way to achieve this is to constrain the predictions to be invariant to transformations on the input that are known to be irrelevant (e.g. translation). Commonly, this is done through data augmentation, where the training set is enlarged by applying hand-craft…

    Submitted 16 August, 2018; originally announced August 2018.

  40. arXiv:1807.10363  [pdf, other]

    physics.comp-ph cs.LG stat.ML

    Message-passing neural networks for high-throughput polymer screening

    Authors: Peter C. St. John, Caleb Phillips, Travis W. Kemper, A. Nolan Wilson, Michael F. Crowley, Mark R. Nimlos, Ross E. Larsen

    Abstract: Machine learning methods have shown promise in predicting molecular properties, and given sufficient training data machine learning approaches can enable rapid high-throughput virtual screening of large libraries of compounds. Graph-based neural network architectures have emerged in recent years as the most successful approach for predictions based on molecular structure, and have consistently ach…

    Submitted 5 April, 2019; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: 7 pages, 3 figures

  41. arXiv:1807.09887  [pdf, other]

    cs.DB

    Compiling Database Application Programs

    Authors: Mohammad Dashti, Sachin Basil John, Thierry Coppey, Amir Shaikhha, Vojin Jovanovic, Christoph Koch

    Abstract: There is a trend towards increased specialization of data management software for performance reasons. In this paper, we study the automatic specialization and optimization of database application programs -- sequences of queries and updates, augmented with control flow constructs as they appear in database scripts, UDFs, transactional workloads and triggers in languages such as PL/SQL. We show ho…

    Submitted 25 July, 2018; originally announced July 2018.

    Comments: 16 pages

    ACM Class: H.2.4

  42. arXiv:1804.01016  [pdf, other]

    stat.ML cs.LG

    Large-Scale Cox Process Inference using Variational Fourier Features

    Authors: S. T. John, James Hensman

    Abstract: Gaussian process modulated Poisson processes provide a flexible framework for modelling spatiotemporal point patterns. So far this had been restricted to one dimension, binning to a pre-determined grid, or small data sets of up to a few thousand data points. Here we introduce Cox process inference based on Fourier features. This sparse representation induces global rather than local constraints on…

    Submitted 3 April, 2018; originally announced April 2018.

  43. arXiv:1803.11175  [pdf, other]

    cs.CL

    Universal Sentence Encoder

    Authors: Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, r…

    Submitted 12 April, 2018; v1 submitted 29 March, 2018; originally announced March 2018.

    Comments: 7 pages; fixed module URL in Listing 1

  44. arXiv:1603.00542  [pdf, other]

    cs.DB

    Repairing Conflicts among MVCC Transactions

    Authors: Mohammad Dashti, Sachin Basil John, Amir Shaikhha, Christoph Koch

    Abstract: The optimistic variants of MVCC (Multi-Version Concurrency Control) avoid blocking concurrent transactions at the cost of having a validation phase. Upon failure in the validation phase, the transaction is usually aborted and restarted from scratch. The "abort and restart" approach becomes a performance bottleneck for the use cases with high contention objects or long running transactions. In addi…

    Submitted 1 March, 2016; originally announced March 2016.

    Comments: 12 pages, 9 figures

    ACM Class: H.2.4

  45. arXiv:1602.02739  [pdf, other]

    q-bio.PE cs.DS

    On Determining if Tree-based Networks Contain Fixed Trees

    Authors: Maria Anaya, Olga Anipchenko-Ulaj, Aisha Ashfaq, Joyce Chiu, Mahedi Kaiser, Max Shoji Ohsawa, Megan Owen, Ella Pavlechko, Katherine St. John, Shivam Suleria, Keith Thompson, Corrine Yap

    Abstract: We address an open question of Francis and Steel about phylogenetic networks and trees. They give a polynomial time algorithm to decide if a phylogenetic network, N, is tree-based and pose the problem: given a fixed tree T and network N, is N based on T? We show that it is NP-hard to decide, by reduction from 3-Dimensional Matching (3DM), and further, that the problem is fixed parameter tractable.

    Submitted 8 February, 2016; originally announced February 2016.

    Comments: 7 pages, 4 figures

  46. arXiv:1411.7338  [pdf, other]

    q-bio.PE cs.DS

    Bounds on the Expected Size of the Maximum Agreement Subtree

    Authors: Daniel Irving Bernstein, Lam Si Tung Ho, Colby Long, Mike Steel, Katherine St. John, Seth Sullivant

    Abstract: We prove polynomial upper and lower bounds on the expected size of the maximum agreement subtree of two random binary phylogenetic trees under both the uniform distribution and Yule-Harding distribution. This positively answers a question posed in earlier work. Determining tight upper and lower bounds remains an open problem.

    Submitted 31 August, 2015; v1 submitted 26 November, 2014; originally announced November 2014.

    Comments: Revised version

  47. arXiv:1403.7115  [pdf, ps, other]

    cs.NI

    Active Switching: Packet Steering Flow Annotations

    Authors: Saul St. John, Aditya Akella

    Abstract: Our previous experience building systems for middlebox chain composition and scaling in software-defined networks has revealed that existing mechanisms of flow annotation commonly do not survive middlebox-traversals, or suffer from extreme identifier domain limitations resulting in excessive flow table size. In this paper, we analyze the structural artifacts resulting in these challenges, and offe…

    Submitted 27 March, 2014; originally announced March 2014.

    MSC Class: 68M10; ACM Class: C.2.1

  48. arXiv:1305.6360  [pdf]

    cs.CY

    Using Visual Aids as a Motivational Tool in Enhancing Students Interest in Reading Literary Texts

    Authors: Melor Md Yunus, Hadi Salehi, Dexter Sigan Anak John

    Abstract: This study aims to investigate teachers' perceptions of the use of visual aids (e.g., animation videos, pictures, films and projectors) as a motivational tool in enhancing students' interest in reading literary texts. To achieve the aim of the study, the mixed-method approach was used to collect the required data. Therefore, 52 English teachers from seven national secondary schools in Kapit, Sar…

    Submitted 28 May, 2013; originally announced May 2013.

    Comments: 4 pages

    Journal ref: Proceedings of the 4th International Conference on Education and Educational Technologies (EET '13), 114-117, 2013

  49. arXiv:1305.0209  [pdf, ps, other]

    cs.NI

    Stratos: A Network-Aware Orchestration Layer for Virtual Middleboxes in Clouds

    Authors: Aaron Gember, Anand Krishnamurthy, Saul St. John, Robert Grandl, Xiaoyang Gao, Ashok Anand, Theophilus Benson, Vyas Sekar, Aditya Akella

    Abstract: Enterprises want their in-cloud services to leverage the performance and security benefits that middleboxes offer in traditional deployments. Such virtualized deployments create new opportunities (e.g., flexible scaling) as well as new challenges (e.g., dynamics, multiplexing) for middlebox management tasks such as service composition and provisioning. Unfortunately, enterprises lack systematic to…

    Submitted 11 March, 2014; v1 submitted 1 May, 2013; originally announced May 2013.

  50. arXiv:1101.2170  [pdf, other]

    q-bio.PE cs.CC cs.DS

    The Complexity of Finding Multiple Solutions to Betweenness and Quartet Compatibility

    Authors: Maria Luisa Bonet, Simone Linz, Katherine St. John

    Abstract: We show that two important problems that have applications in computational biology are ASP-complete, which implies that, given a solution to a problem, it is NP-complete to decide if another solution exists. We show first that a variation of Betweenness, which is the underlying problem of questions related to radiation hybrid mapping, is ASP-complete. Subsequently, we use that result to show that…

    Submitted 28 March, 2011; v1 submitted 11 January, 2011; originally announced January 2011.

    Comments: 25 pages, 7 figures